Commit Graph

8843 Commits

Author SHA1 Message Date
Sage Weil
65cf69c6ff qa/tasks/nvme_loop: set up nvme_loop on scratch_devs
Using an nvme loop device makes the LVs look like "real" disks,
which means we can exercise all of the normal code paths for
provisioning, deprovisioning, and zapping.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-10-08 16:06:28 -05:00
Patrick Donnelly
c6c9734a91
Merge PR #43163 into master
* refs/pull/43163/head:
	qa: fsync dir for asynchronous creat on stray tests
	qa: refactor and generalize create_n_files
	qa: only set frag confs for workloads
	mds: improve debugging for fragment size check

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
2021-10-08 14:38:35 -04:00
Sage Weil
0144e3a4d7 qa/suites/orch/rook: replicate across osds for 1-node clusters
Signed-off-by: Sage Weil <sage@newdream.net>
2021-10-08 11:00:48 -05:00
Dai Zhiwei
eaa385f3da qa: support isal ec test for aarch64
modified:   qa/standalone/erasure-code/test-erasure-code-plugins.sh
	new file:   qa/suites/rados/thrash-erasure-code-isa/arch/aarch64.yaml

Signed-off-by: Dai Zhiwei <daizhiwei3@huawei.com>
2021-10-08 14:37:25 +08:00
Patrick Donnelly
d33debc643
qa: fsync dir for asynchronous creat on stray tests
Use the enhanced create_n_files to dedup code. Also split the large test
into three.

Fixes: https://tracker.ceph.com/issues/52606
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-07 15:08:34 -04:00
Patrick Donnelly
395d20a2b7
qa: refactor and generalize create_n_files
Few things:

- Allow calling fsync on directory (to support async create kernel).
- Allow immediately unlinking the created file (for stray testing).
- Close any file descriptors created.
- Write unique content (the i variable) to each file.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-07 15:08:34 -04:00
Patrick Donnelly
a5675535ba
qa: only set frag confs for workloads
Otherwise these local conf overrides prevent functional testing.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-07 15:08:34 -04:00
Matt Benjamin
103b6cc355
Merge pull request #43442 from linuxbox2/wip-rgwadmin-logfix
qa/rgw: fix ops log tests to handle non-bucket ops (which are now valid)
2021-10-07 11:04:16 -04:00
Patrick Donnelly
2363078751
Merge PR #43231 into master
* refs/pull/43231/head:
	qa: fix promotion test

Reviewed-by: Ramana Raja <rraja@redhat.com>
2021-10-07 09:16:34 -04:00
Venky Shankar
ff88d7de52 qa: skip internal metadata directory when scanning ceph debugfs directory
kclient patchset

        https://patchwork.kernel.org/project/ceph-devel/list/?series=556049

introduces `meta` directory to add debugging entries. This needs to be filtered
when scanning ceph debugfs directory.

Fixes: https://tracker.ceph.com/issues/52824
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2021-10-07 00:40:13 -04:00
Kefu Chai
958b22e3ab
Merge pull request #43335 from liewegas/debug-51815
mon,auth: fix proposal (and mon db rebuild) of rotating secrets

Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-10-07 06:45:45 +08:00
Matt Benjamin
788da98cde qa/rgw: fix ops log tests to handle non-bucket ops (which are now valid)
After 3863eb89512f1698b8e56f1f1ffc78a6ca8d5826--rgw: permit logging of
list-bucket (and any other no-bucket op-- the radosgw ops-log
contains entries for ops with no associated buckets--e.g., list_buckets.
When examining such a log object in the radosgw_admin task, don't assert
that it has any bucket name.

Fixes: https://tracker.ceph.com/issues/52647

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
2021-10-06 16:08:06 -04:00
Ernesto Puerta
df89e6a174
Merge pull request #43256 from rhcs-dashboard/fix-48845-master
qa/mgr/dashboard/test_pool: don't check HEALTH_OK

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2021-10-06 21:49:12 +02:00
Neha Ojha
363b223844
Merge pull request #42964 from trociny/wip-52448
osd: re-cache peer_bytes on every peering state activate

Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-10-06 09:26:16 -07:00
Patrick Donnelly
97f9f5362b
qa: skip c-v install on pacific start
Continuation of 4e525127fb.

Fixes: https://tracker.ceph.com/issues/52822
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 20:01:05 -04:00
Patrick Donnelly
b56623342e
qa: fix promotion test
The test is not needing to check that the new MDS becomes active, only
that a replacement occurs.

Fixes: https://tracker.ceph.com/issues/52677
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 19:57:18 -04:00
Rishabh Dave
eb25549b8c qa/cephfs: update xfstests_dev for centos stream
Fixes: https://tracker.ceph.com/issues/52821
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-10-06 00:26:32 +05:30
Rishabh Dave
485841b255 qa: import CommandFailedError from exceptions not run
Stop importing CommandFailedError from teuthology.orchestra.run, it is
actually defined in teuthology.exception.

Fixes: https://tracker.ceph.com/issues/51226
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-10-05 23:41:09 +05:30
Patrick Donnelly
b1420e5771
qa: add test for cephfs upgrade sequence
This also checks max_mds>1 and allow_standby_replay are restored to
previous values.

Future work can add tests for multiple file systems (or volumes).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 13:32:15 -04:00
Patrick Donnelly
5a7382214f
qa: add tasks to check mds upgrade state
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 13:32:15 -04:00
Patrick Donnelly
dbe5573ed4
qa: add note about where caps are generated
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 13:32:15 -04:00
Patrick Donnelly
24bb450d39
qa: use ctx's ceph_manager to run ceph commands by mount
This allows hooks for `cephadm shell` to function so that this code
works with cephadm deployments.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 13:32:15 -04:00
Patrick Donnelly
7812cfb674
qa: move CephManager cluster instantiation to subtask
This needs to be available for the cephfs_setup task so administration
mounts can run ceph commands, potentially through `cephadm shell`.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-10-05 13:32:15 -04:00
Sage Weil
eddfbbc421 qa/suites/rados/singleton/rebuild-mon-db: debug auth 30
Hunting https://tracker.ceph.com/issues/51815

Signed-off-by: Sage Weil <sage@newdream.net>
2021-10-01 14:42:23 -04:00
Aishwarya Mathuria
1b4e416f81 osd/scrub: Add scrub duration to pg dump stats
Addition of a new column, SCRUB_DURATION, to the pg stats that stores the time taken for a PG scrub.

Fixes: https://tracker.ceph.com/issues/52605
Signed-off-by: Aishwarya Mathuria <amathuri@redhat.com>
2021-10-01 13:27:27 +05:30
Ernesto Puerta
2283cb068b
qa/mgr/dashboard/test_pool: don't check HEALTH_OK
Fixes: https://tracker.ceph.com/issues/48845
Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
2021-09-30 14:16:46 +02:00
Sage Weil
e3bede0008 Merge PR #43287 into master
* refs/pull/43287/head:
	mgr/rook, qa/tasks/rook: change rgw daemon service name
	mgr/rook: fix placement_spec_to_node_selector
	mgr/rook: orch rm no longer uses rook api delete
	qa/tasks/rook: fix cluster deletion hanging due to CephObjectStore CR
	mgr/rook: use default replication size in orch apply rgw
	mgr/rook: add placement specs to apply rgw

Reviewed-by: Sage Weil <sage@redhat.com>
2021-09-29 14:38:47 -04:00
Ernesto Puerta
156defa48e
Merge pull request #43255 from rhcs-dashboard/fix-49344-master
qa/mgr/dashboard: add extra wait to test

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2021-09-29 20:23:23 +02:00
Ernesto Puerta
f947e6ec67
Merge pull request #42821 from rhcs-dashboard/force-maintenance-workflow
mgr/dashboard: Move force maintenance test to the workflow test suite 

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2021-09-29 13:36:45 +02:00
Mykola Golub
d35920da5e qa/suites/rados: add inconsistent hinfo test
Signed-off-by: Mykola Golub <mgolub@suse.com>
2021-09-28 16:43:02 +01:00
Nizamudeen A
e33d9e15b9 mgr/dashboard: Move force maintenance test to the workflow test suite
Fixes: https://tracker.ceph.com/issues/52276
Signed-off-by: Nizamudeen A <nia@redhat.com>
2021-09-28 14:17:20 +05:30
Joseph Sawaya
8990280b22 mgr/rook, qa/tasks/rook: change rgw daemon service name
This commit changes the rgw daemon service name format from
rgw.<realm name>.<zone name> to rgw.<resource_name> and changes the daemon
removal in the QA accordingly. This also gets rid of the Rook API when
describing services.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-27 14:52:59 -04:00
Joseph Sawaya
387c4f1310 qa/tasks/rook: fix cluster deletion hanging due to CephObjectStore CR
This commit fixes the issue where the cluster deletion hangs in the QA
while a CephObjectStore CR is still up by removing all rgw/nfs/mds/rbd-mirror
daemons before tearing down the rest of the cluster.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-27 14:51:13 -04:00
Joseph Sawaya
f4980c39db mgr/rook: use default replication size in orch apply rgw
This commit changes `orch apply rgw` to use the osd_pool_default_size
when setting the replication size for the data pool and metadata pool
of the rgw daemon. This commit also adds `orch apply rgw` to the Rook
QA.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-27 14:51:13 -04:00
Adam King
35f895aa45 mgr/cephadm: remove cephadm exporter code
It's functionality has been effectively replaced by the cephadm agent

Signed-off-by: Adam King <adking@redhat.com>
2021-09-27 12:11:55 -04:00
Ernesto Puerta
99f1f0f4c0
Merge pull request #43285 from rhcs-dashboard/api-endpoint-version-compatibility
mgr/dashboard: make modified API endpoints backward compatible

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2021-09-27 14:13:07 +02:00
Ilya Dryomov
75824a113b
Merge pull request #43228 from idryomov/wip-rbd-mirror-snapshot-rx-only
rbd-mirror: unbreak one-way snapshot-based mirroring

Reviewed-by: Mykola Golub <mgolub@suse.com>
2021-09-25 10:58:17 +02:00
Avan Thakkar
88a8732215 mgr/dashboard: make modified API endpoints backward compatible
Fixes: https://tracker.ceph.com/issues/52480
Signed-off-by: Avan Thakkar <athakkar@redhat.com>

Introducing APIVersion class to handle versioning for API-endpints and making
them backward compatible.
2021-09-24 18:48:35 +05:30
Ilya Dryomov
366e9c51a8 qa/suites/rbd: test case for one-way snapshot-based mirroring
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2021-09-24 12:30:14 +02:00
Ernesto Puerta
9ff778cdaa
qa/mgr/dashboard: add extra wait to test
Fixes: https://tracker.ceph.com/issues/49344
Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
2021-09-22 14:11:23 +02:00
Sage Weil
6bf32314f7 Merge PR #43156 into master
* refs/pull/43156/head:
	qa/suites/orch/rook/smoke: add flannel back in

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
2021-09-21 11:37:50 -04:00
Patrick Donnelly
541cc173c6 Merge PR #43179 into master
* refs/pull/43179/head:
	qa: lengthen grace for fs map showing dead MDS

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
2021-09-20 20:48:00 -04:00
Patrick Donnelly
c8a900c6c6 Merge PR #42763 into master
* refs/pull/42763/head:
	mon/FSCommands: add 'recover' flag in `fs new` command

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-09-20 20:46:25 -04:00
Sage Weil
34c483965f Merge PR #43211 into master
* refs/pull/43211/head:
	qa/suites/upgrade/pacific-x: skip c-v install on pacific start
	qa/packages: install ceph-volume

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-09-20 10:59:56 -04:00
Sage Weil
4e525127fb qa/suites/upgrade/pacific-x: skip c-v install on pacific start
Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-20 08:47:45 -04:00
Sage Weil
0b361fc8b9 qa/packages: install ceph-volume
Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-19 21:51:19 -04:00
Matt Benjamin
19ecb52c6d
Merge pull request #43165 from linuxbox2/wip-perl-semicolon
workunits/rgw: semicolon terminates perl statements
2021-09-17 10:16:45 -04:00
Patrick Donnelly
0d9753fa3c Merge PR #43122 into master
* refs/pull/43122/head:
	qa: add test for standby-replay marking rank damaged
	MDSMonitor: handle damaged from standby-replay
	mds: add config to mark rank damaged in standby-replay
	include: unset std::hex after printing CompatSet
	mds: refactor iterator lookup
	mds: harden rank lookup

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2021-09-16 21:47:40 -04:00
Sage Weil
1a19d69679 Merge PR #43172 into master
* refs/pull/43172/head:
	qa/tasks/kubeadm: modify (do not clobber) daemon.json

Reviewed-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-15 22:48:36 -04:00
Patrick Donnelly
91c6f3364d Merge PR #42719 into master
* refs/pull/42719/head:
	mgr/volumes: Fix permission during subvol creation with mode

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-09-15 22:34:23 -04:00
Patrick Donnelly
33331cf4aa Merge PR #42584 into master
* refs/pull/42584/head:
	doc: fix `daemon status` interface (exclude file system name)
	test: adjust mirroring tests for `daemon status` change
	mgr/mirroring: `daemon status` command does not require file system name

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-09-15 22:33:18 -04:00
Patrick Donnelly
ef5d7febeb
qa: lengthen grace for fs map showing dead MDS
Fixes: https://tracker.ceph.com/issues/52625
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-09-15 22:21:03 -04:00
Sage Weil
2a6ad93a76 qa/tasks/kubeadm: modify (do not clobber) daemon.json
Otherwise we blow away the mirror config.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-15 15:16:50 -05:00
Mykola Golub
76743e0058 qa/suites/rados: add backfill_toofull test
Signed-off-by: Mykola Golub <mgolub@suse.com>
2021-09-15 17:21:11 +03:00
Xiubo Li
0cb06740a9 qa: enable dynamic debug support to kclient
Add a 'kmount_count' counter in ctx to make sure the dynamic debug
log won't be disabled until the last kernel mounter is unmounted.

Fixes: https://tracker.ceph.com/issues/48736
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2021-09-15 09:31:04 +08:00
Matt Benjamin
221fdb858b workunits/rgw: semicolon terminates perl statements
Fixes a lexical error in one line of code added in
90e9307ab0, removing the dependency
on lsb_release, on 8/16/2021.

Fixes: https://tracker.ceph.com/issues/52613

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
2021-09-14 17:13:49 -04:00
Sage Weil
688f963da7 qa/suites/orch/rook/smoke: add flannel back in
Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-13 17:28:52 -05:00
Sage Weil
13238ade13 Merge PR #43136 into master
* refs/pull/43136/head:
	qa/tasks/kubeadm: change calico encap to IPIPCrossSubnet
	qa/suites/orch/rook/smoke: add host networking to matrix
	qa/tasks/rook: fix shadowing of config arg in rook_cluster()

Reviewed-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-13 18:28:43 -04:00
Sage Weil
528880d3bb qa/tasks/kubeadm: change calico encap to IPIPCrossSubnet
Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-13 15:26:54 -05:00
Ramana Raja
67bb13859a mon/FSCommands: add 'recover' flag in fs new command
Currently, to recover a file system after recovering monitor store, you
need to stop all the MDSs; create FSMap with defaults using `fs new`
command; execute `fs reset` command to get the file system's rank 0 into
existing but failed state; and then restart MDSs.

Add 'recover' flag to the `fs new` command that sets the file system's
rank 0 to existing but failed state, and sets the file system's
'joinable' setting to False. Using the `fs new` command with 'recover'
flag gets rid of the steps to stop all the MDSs and execute `fs reset`
command when recovering the file system after recoving monitor store.

Fixes: https://tracker.ceph.com/issues/51716
Signed-off-by: Ramana Raja <rraja@redhat.com>
2021-09-13 00:15:39 -04:00
Joseph Sawaya
7dfbf7c9f5 qa/tasks/rook: add testing for host label add/rm
This commit adds simple tests that check if the orchestrator
can add and remove labels from nodes.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-10 09:44:20 -05:00
Mykola Golub
e0a926a2c1 qa/tasks/ceph_manager: fix assertion
The osd may be 0.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2021-09-10 15:47:41 +03:00
Patrick Donnelly
f4a11a3290
qa: add test for standby-replay marking rank damaged
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-09-09 20:16:03 -04:00
Yuri Weinstein
3b779e712f
Merge pull request #42853 from sseshasa/wip-fix-vstart-mon-permissions
mon/MonCap: Update osd profile to allow cmd to set iops capacity on mon db

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-09-09 10:10:06 -07:00
Neha Ojha
e273418bbb
Merge pull request #42604 from sseshasa/wip-skip-osd-benchmark
osd: Add config option to skip running the osd benchmark during init and update documentation.

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-09-08 11:03:09 -07:00
Sebastian Wagner
fe734adddd
Merge pull request #43045 from sebastian-philipp/qa-tox-import-yaml
qa: tox.ini: verify yaml syntax

Reviewed-by: Sage Weil <sage@newdream.net>
2021-09-08 17:10:13 +02:00
Kotresh HR
7440ef842a mgr/volumes: Fix permission during subvol creation with mode
The subvolume creation with specific mode leads to
creation of parent directories ('/volumes/_no_group') with
the same mode if it's not already created. Fixed the same.

Similarly, the subvolumegroup creation with specific mode
leads to creation of parent directory ('/volumes') with
same mode if it's not already created. Fixed the same.

Fixes: https://tracker.ceph.com/issues/51870
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2021-09-07 15:51:21 +05:30
Sebastian Wagner
7777603e8b
qa: tox.ini: verify yaml syntax
Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2021-09-07 10:20:34 +02:00
Patrick Donnelly
ca906d0d7a Merge PR #42529 into master
* refs/pull/42529/head:
	qa: verify rank 0 does not fail during journal repair tests
	qa: avoid stopping/restarting mds in journal repair tests

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2021-09-06 14:00:41 -04:00
Kefu Chai
0b64a9f129
Merge pull request #42992 from ifed01/wip-ifed-fix-migrate
os/bluestore: fix bluefs migrate command

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
2021-09-06 22:50:20 +08:00
Sage Weil
9f050f094a qa/suites/orch/rook/smoke: add host networking to matrix
Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-03 11:03:38 -05:00
Sage Weil
42b4108073 qa/tasks/rook: fix shadowing of config arg in rook_cluster()
Signed-off-by: Sage Weil <sage@newdream.net>
2021-09-03 10:49:54 -05:00
Sebastian Wagner
b88bd99066
Merge pull request #43040 from sebastian-philipp/qa-deadsymlinks
qa/tox.ini: check for dead symlinks

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Sage Weil <sage@newdream.net>
2021-09-03 17:17:03 +02:00
Sebastian Wagner
4e7c9c3a4c
Merge pull request #43039 from sebastian-philipp/rm-kubic
qa/distros: Remove stale kubic distros

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
2021-09-03 11:44:44 +02:00
Sebastian Wagner
5313da14b9
Merge pull request #43021 from sebastian-philipp/rhel_8.3_container_tools_rhel8.yaml
qa/distros/podman: Add rhel_8.3_container_tools_rhel8.yaml

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-09-03 11:44:19 +02:00
Sebastian Wagner
c6779fdf13
qa/tox.ini: check for dead symlinks
IMO the amount of symlinks we have to manually maintain
is tedious and error prone. Any ideas on improving thing?

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2021-09-03 11:38:04 +02:00
Sebastian Wagner
65e2cc0840
qa/distros: Remove stale kubic distros
Cause they're broken

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2021-09-03 10:13:54 +02:00
Sebastian Wagner
0293580b50
qa/distros/podman: Add rhel_8.rhel_8.4_container_tools_3.0.yaml
mainly for the cephfs suite

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2021-09-03 10:04:45 +02:00
Sage Weil
9cb2f444fd Merge PR #42873 into master
* refs/pull/42873/head:
	qa/tasks/rook: add OSD creation to Rook QA

Reviewed-by: Sage Weil <sage@redhat.com>
2021-09-02 17:11:51 -04:00
Sridhar Seshasayee
7dcede75df qa: Use osd_op_queue=wpq for tests using filestore backend.
Force a subset of tests that explicitly employ the filestore backend to
use WPQ scheduler. This is because mclock scheduler will not be
optimized for filestore.

Fixes: https://tracker.ceph.com/issues/52025
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-09-02 18:15:54 +05:30
Joseph Sawaya
4b6de11169 qa/tasks/rook: add OSD creation to Rook QA
This commit adds OSD creation to the Rook QA tasks. The Rook task will
explicitly wait for the mgr to start and the CLI to work (instead of
implicitly doing so while waiting for 'ceph osd dump' to work).
Then it will do `ceph orch apply osd --all-available-devices` to create
OSDs on the rest of the PVs.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2021-09-01 11:27:40 -04:00
Kalpesh Pandya
9c1e5d5c52 qa/tasks: Addition of new code for session tags in STS
Signed-off-by: Kalpesh Pandya <kapandya@redhat.com>
2021-09-01 17:09:54 +05:30
Kalpesh Pandya
74b5ec876c qa/tasks: Addition of two new parameters for sts-tests
Addition of SUB and AZP parameter for some new sts-tests

Signed-off-by: Kalpesh Pandya <kapandya@redhat.com>
2021-09-01 17:09:54 +05:30
Sridhar Seshasayee
f539bedc96 qa/standalone: Add standalone test to validate osd-mclock-skip-benchmark option
Add a standalone test - test_activate_osd_skip_benchmark() in ceph-helpers.sh
that exercises the osd-mclock-skip-benchmark option.

Fixes: https://tracker.ceph.com/issues/52025
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-09-01 14:19:03 +05:30
Sridhar Seshasayee
4b0dba28b6 qa/tasks: Set default caps for 'osd' type in generate_caps()
Assign the default caps for osds to be the same as what the AuthMonitor
sets for a new osd. See AuthMonitor::validate_osd_new() which sets the
following caps for a new osd:

 mon='allow profile osd'
 mgr='allow profile osd'
 osd=''allow *'

When an actual real world cluster is deployed, the above caps are applied.
Unless the user modifies the defaults, a cluster will operate with the
above caps. Therefore, it makes sense to use the defaults when testing
Ceph so that issues if any due to the default settings may be caught and
fixed.

Therefore, the caps for the 'osd' type is reset to the default in
generate_caps(). The caps for 'mgr' already reflects the system defaults.
The caps for 'mds' type is not changed in this commit and will be
investigated and changed if necessary later.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-09-01 13:46:01 +05:30
Neha Ojha
de8a68b768
Merge pull request #41912 from trociny/wip-crushdiff
tools/crushdiff: new tool to test crushmap change

Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-08-31 14:18:37 -07:00
Igor Fedotov
0b0f8ef12f qa/osd-bluefs-volume-ops: reproduce bluefs migrate bug
Reproduces: https://tracker.ceph.com/issues/40434
Signed-off-by: Igor Fedotov <ifedotov@suse.com>
2021-08-31 16:23:22 +03:00
Deepika Upadhyay
69a5ae539f
Merge pull request #42770 from ktdreyer/no-lsb-release
Remove dependency on lsb_release

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
2021-08-30 14:13:02 +05:30
Patrick Donnelly
297fcdce24 Merge PR #42687 into master
* refs/pull/42687/head:
	qa: test the "ms_mode" options in kclient workflows

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-08-27 21:29:17 -04:00
Patrick Donnelly
ec69208deb Merge PR #38481 into master
* refs/pull/38481/head:
	qa/vstart_runner: inherit methods instead of duplicating them
	qa/ceph_manager: make it possible to reuse few methods
	qa/vstart_runner: don't use "shell=False" in run_ceph_w()
	qa/ceph_manager: minor refactor

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-08-27 21:26:41 -04:00
Mykola Golub
7311f6656f qa/suites/rados: add crushdiff test
Signed-off-by: Mykola Golub <mykola.golub@clyso.com>
2021-08-27 17:45:40 +03:00
Patrick Donnelly
1b1f108449 Merge PR #42530 into master
* refs/pull/42530/head:
	qa: ignore expected metadata cluster log error

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
2021-08-26 22:36:28 -04:00
Casey Bodley
bdee7f85f3
Merge pull request #42869 from cbodley/wip-49955
qa/rgw: move mixed-version multisite tests out of upgrade suite

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2021-08-26 10:32:23 -04:00
Patrick Donnelly
ea04087786 Merge PR #42371 into master
* refs/pull/42371/head:
	mgr/volumes: Fix a race during clone cancel
	mgr/volumes: Fail subvolume removal if it's in progress

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2021-08-23 20:02:31 -04:00
Alfonso Martínez
7291426fd0
Merge pull request #42865 from rhcs-dashboard/fix-rgw-bucket-list-api
mgr/dashboard: stats=false not working when listing buckets

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2021-08-23 14:43:56 +02:00
Avan Thakkar
95543bb150 mgr/dashboard: stats=false not working when listing buckets
Fixes: https://tracker.ceph.com/issues/51154
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
2021-08-23 15:57:54 +05:30
Casey Bodley
214f2f3b84 qa/rgw: move mixed-version multisite tests out of upgrade suite
not really fixing anything, but moves the failures out of the normal
upgrade suite

Fixes: https://tracker.ceph.com/issues/49955

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2021-08-20 10:30:42 -04:00
Sebastian Wagner
e436483c77
qa/distro: Add centos_8.2_container_tools_3.0.yaml
Let's avoid latest kubic stable

Fixes: https://tracker.ceph.com/issues/52279
Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2021-08-20 10:53:11 +02:00
Yuri Weinstein
4e6fbccea4
Merge pull request #42852 from sebastian-philipp/centos_8.3_container_tools_3.0.yaml
qa/cephadm: centos_8.3_container_tools_3.0.yaml

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-08-19 12:45:01 -07:00
Sebastian Wagner
c6bd968eff
qa/cephadm: centos_8.3_container_tools_3.0.yaml
Let's avoid latest stable

Fixes: https://tracker.ceph.com/issues/52279

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2021-08-19 16:55:58 +02:00
Ken Dreyer
65f4d6eb36 workunits/rbd: remove lsb_release
The lsb_release utility brings in a lot of other dependencies. Remove
it from the RBD workunit script.

Signed-off-by: Ken Dreyer <kdreyer@redhat.com>
2021-08-18 13:08:24 -04:00
Ken Dreyer
db66f6f357 workunits/rgw: remove lsb_release
The lsb_release utility brings in a lot of other dependencies. Remove
it from the RGW workunit Perl scripts.

Signed-off-by: Ken Dreyer <kdreyer@redhat.com>
2021-08-18 13:08:24 -04:00
Arthur Outhenin-Chalandre
e135403c73
rbd-mirror: add image_map cleanup in LoadRequest
In the LoadRequest in the ImageMap class add initial cleanup to remove
stale entries. To cleanup the LoadRequest will query the mirror image
list and remove all the image_map that are notin the list.

Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>
2021-08-18 18:52:37 +02:00
Arthur Outhenin-Chalandre
4db66da512
qa/rbd-mirror: add OMAP cleanup checks
This make sure that all images are deleted in the existing qa scripts
and checks if all rbd-mirror metadata in OMAP are correctly deleted.

Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>
2021-08-18 18:50:44 +02:00
Joseph Sawaya
ee90f5e5c0 qa/tasks/rook: add device ls command to rook task
This commit adds the device ls command to the rook qa task
since that command should be working from now on.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2021-08-18 11:11:15 -04:00
Jeff Layton
cd41b1c150 qa: test the "ms_mode" options in kclient workflows
Note that I didn't bother adding the prefer-* options, as I figure it's
better to be definite.

Fixes: https://tracker.ceph.com/issues/52068
Signed-off-by: Jeff Layton <jlayton@redhat.com>
2021-08-18 09:24:13 -04:00
Sage Weil
6f8bdfbb90 Merge PR #42252 into master
* refs/pull/42252/head:
	mgr/dashboard: set rgw credentials: fix api tests
	mgr/dashboard: run-frontend-e2e-tests.sh: remove unneeded rgw setting
	mgr/dashboard: rgw service creation form: add realm and zone to service spec.
	mgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring
	mgr/dashboard: connect-rgw: adaptation and test coverage
	mgr/cephadm: re-check dashboard <-> rgw creds when rgw daemons created/destroyed
	mgr/dashboard: add 'dashboard connect-rgw' command
	doc/mgr/dashboard: simplify dashboard+rgw config docs

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
2021-08-11 11:28:28 -04:00
Alfonso Martínez
a682b9d7a4 mgr/dashboard: set rgw credentials: fix api tests
Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
2021-08-11 08:59:13 +02:00
Sage Weil
4b9a3b2171 Merge PR #42613 into master
* refs/pull/42613/head:
	qa/suites/roch/rook/smoke: test rook 1.7.0, not 1.6.2
	qa/tasks/rook: set storage_class to scratch

Reviewed-by: merge 42318
2021-08-10 16:47:22 -04:00
Sage Weil
3331a0a7ea Merge PR #42691 into master
* refs/pull/42691/head:
	mgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info'
	qa/suites/orch/cephadm/smoke-roleless: test taking ganeshas offline
	qa/tasks/vip: exec with bash -ex
	qa/suites/orch/cephadm: separate test_nfs from test_orch_cli

Reviewed-by: Varsha Rao <varao@redhat.com>
2021-08-10 16:37:38 -04:00
Alfonso Martínez
6e20ef1dd3 mgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring
- Rename the dashboard command to better reflect its behavior.
- Rename '_radosgw_admin' method to 'send_rgwadmin_command' for consistency with
  'send_mon_command' and move it to the mgr_module.py .
- Cleanup: remove unneeded rgw settings.
- Better error handling and test coverage.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
2021-08-10 14:06:03 +02:00
Sage Weil
84479e03a7 Merge PR #42709 into master
* refs/pull/42709/head:
	qa/tasks/kubeadm: force docker cgroup engine to systemd

Reviewed-by: Travis Nielsen <tnielsen@redhat.com>
2021-08-09 15:23:11 -04:00
Neha Ojha
8e76b1448d
Merge pull request #42722 from neha-ojha/wip-remove-rgw-perf
qa/suites/rados/perf/ceph.yaml: remove rgw

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-08-09 11:42:41 -07:00
Neha Ojha
97b23d85f1 qa/suites/perf-basic/ceph.yaml: remove rgw
This is no longer required because we removed cosbench workloads in
fd350fd015.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2021-08-09 17:31:50 +00:00
Casey Bodley
95f2161ee3
Merge pull request #42688 from cbodley/wip-52069
qa/rgw: update apache-maven mirror for rgw/hadoop-s3a

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2021-08-09 11:51:36 -04:00
Casey Bodley
e514b3a374
Merge pull request #42689 from cbodley/wip-52070
qa/rgw: barbican and pykmip tasks upgrade pip before installing pytz

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2021-08-09 11:51:21 -04:00
Sage Weil
8ebe341198 mgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info'
Fixes: https://tracker.ceph.com/issues/51787
Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-09 11:41:08 -04:00
Neha Ojha
119544bb29 qa/suites/rados/perf/ceph.yaml: remove rgw
This is no longer required because we removed cosbench workloads in
fd350fd015. This is also required to prevent
failures like the following or any other changes that break the rgw task:

```
2021-08-06T20:13:25.812 INFO:teuthology.orchestra.run.smithi060.stderr:curl: (7) Failed to connect to smithi060.front.sepia.ceph.com port 80: Connection refused
2021-08-06T20:15:33.813 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_04c2febe7099917d97a71271f17abb5710030132/teuthology/contextutil.py", line 31, in nested
    vars.append(enter())
  File "/usr/lib/python3.6/contextlib.py", line 81, in __enter__
    return next(self.gen)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_3c0f8c8164075af7aac4d1f2805d3f4580709461/qa/tasks/rgw.py", line 191, in start_rgw
    wait_for_radosgw(url, remote)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_3c0f8c8164075af7aac4d1f2805d3f4580709461/qa/tasks/util/rgw.py", line 94, in wait_for_radosgw
    assert exit_status == 0
AssertionError
```

Signed-off-by: Neha Ojha <nojha@redhat.com>
2021-08-09 15:08:11 +00:00
Sage Weil
a5e8227a83 qa/suites/orch/cephadm/smoke-roleless: test taking ganeshas offline
Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-09 11:03:55 -04:00
Sage Weil
972f6468b9 qa/suites/roch/rook/smoke: test rook 1.7.0, not 1.6.2
The RBAC changes so that we can see the PVs etc are needed by mgr/rook.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-06 17:50:20 -04:00
Sage Weil
517b7759b3 qa/tasks/kubeadm: force docker cgroup engine to systemd
Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-06 14:21:08 -05:00
Kefu Chai
62944aefa0
Merge pull request #42277 from tchaikov/wip-vstart-runner-cleanups
qa/tasks/vstart_runner: do not send SIGTERM if no matched pid

Reviewed-by: Rishabh Dave <ridave@redhat.com>
2021-08-06 10:33:19 +08:00
Sage Weil
3c1e086be0 qa/tasks/vip: exec with bash -ex
Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-05 17:45:56 -04:00
Casey Bodley
e5a5b4e379 qa/rgw: barbican and pykmip tasks upgrade pip before installing pytz
Downloading 461087a514/cryptography-3.4.7.tar.gz (546kB)
  Complete output from command python setup.py egg_info:

          =============================DEBUG ASSISTANCE==========================
          If you are seeing an error here please try the following to
          successfully install cryptography:

          Upgrade to the latest pip and try again. This will fix errors for most
          users. See: https://pip.pypa.io/en/stable/installing/#upgrading-pip
          =============================DEBUG ASSISTANCE==========================

  Traceback (most recent call last):
    File "<string>", line 1, in <module>
    File "/tmp/pip-build-7fhnk5us/cryptography/setup.py", line 14, in <module>
      from setuptools_rust import RustExtension
  ModuleNotFoundError: No module named 'setuptools_rust'

Fixes: https://tracker.ceph.com/issues/52070

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2021-08-05 16:45:02 -04:00
Casey Bodley
9253733d08 qa/rgw: update apache-maven mirror for rgw/hadoop-s3a
Fixes: https://tracker.ceph.com/issues/52069

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2021-08-05 14:50:09 -04:00
Sage Weil
0ae020979d qa/suites/orch/cephadm: separate test_nfs from test_orch_cli
Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-05 14:28:17 -04:00
Kefu Chai
a17ebc0406
Merge pull request #42575 from tchaikov/wip-venv
*: s/virtualenv/python -m venv/

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
2021-08-04 18:37:45 +08:00
Sage Weil
460d7a215a qa/tasks/rook: set storage_class to scratch
Signed-off-by: Sage Weil <sage@newdream.net>
2021-08-03 16:13:13 -04:00
Ilya Dryomov
df96b85b58 qa/workunits/rbd: use xenial version of qemu-iotests for centos stream 8
It is already used for centos 8(.3) and rhel 8(.4).

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2021-08-03 09:44:18 +02:00
Venky Shankar
11b61b4fb9 test: adjust mirroring tests for daemon status change
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2021-08-02 06:39:16 -04:00
Rishabh Dave
d86bfbfe2d qa/vstart_runner: inherit methods instead of duplicating them
Inherit methods run_ceph_w(), run_cluster_cmd(), raw_cluster_cmd() and
raw_cluster_cmd_result() from ceph_manager.CephManager in
vstart_runner.LocalCephManager instead of duplicating them.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-08-02 11:37:49 +05:30
Rishabh Dave
93677576c1 qa/ceph_manager: make it possible to reuse few methods
Make minor adjustments to ceph_manager.CephManager so that methods
run_ceph_w(), run_cluster_cmd() raw_cluster_cmd() and
raw_cluster_cmd_result() can be reused, instead of duplicating, in
subclasses. The adjustments are -

* Having variables contain arguments that'll be prepended to every
  command received by the methods above.
* Grouping variables that needs to be overridden together so that it is
  easy to spot and override them for users.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-08-02 11:37:49 +05:30
Rishabh Dave
047c90f881 qa/vstart_runner: don't use "shell=False" in run_ceph_w()
Instead prepend "exec sudo" to the command arguments of
LocalCephManager.run_ceph_w(). This makes the default parameter
"shell=False" redundant in case of
ceph_manager.CephManager.run_ceph_w(), so get rid of it too and update
calls to run_ceph_w() accordingly.

The reason behind using any of these workarounds is that running "ceph
-w" with "shell" set to True leads to crash for Ceph API CI job. See
this ticket for more details: https://tracker.ceph.com/issues/49644.

The reason behind switching the workaround is that in the following
commits to reduce duplication LocalCephManager.run_ceph_w() will be
deleted and CephManager.run_ceph_w() will be used by LocalCephManager
via inheritance. However, due to the issue described above, Ceph API
test will fail since "shell" is set to "True" for the command issued by
CephManager.run_ceph_w(). Prepending "exec sudo" to the command when it
is used in LocalCephManager makes this duplication unnecessary and also
prevents Ceph API test from failing.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-08-02 11:37:44 +05:30
Rishabh Dave
4101f76ed6 qa/ceph_manager: minor refactor
Save the return value of method "teuthology.get_testdir()" instead of
calling it repeatedly in the same class.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-08-02 10:07:23 +05:30
Kefu Chai
f0ed7a188f qa/tasks: s/virtualenv/python3 -m venv/
so we don't need to use virtualenv python package for creating a
virtualenv, the "venv" module in Python3 would suffice.

see also https://docs.python.org/3/library/venv.html

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-07-31 22:34:05 +08:00
Patrick Donnelly
2cd3494771 qa: update mds_pre_upgrade to no longer stop standbys
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-07-30 16:28:54 -07:00
Patrick Donnelly
8e0b9bcad6 qa: update mds_pre_upgrade to disable standby-replay
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-07-30 16:28:54 -07:00
Patrick Donnelly
295971b9c6 qa: add tests for compat manipulation and upgrade
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-07-30 16:28:54 -07:00
Patrick Donnelly
5ae7b9202b Merge PR #42513 into master
* refs/pull/42513/head:
	qa: multifs already enabled as default

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-07-30 14:03:36 -07:00
Patrick Donnelly
c99a5e56a6 Merge PR #42201 into master
* refs/pull/42201/head:
	qa: fold frag confs into conf/mds.yaml

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2021-07-30 14:00:19 -07:00
Sridhar Seshasayee
464e9ea6c0 qa/standalone/misc: ver-health.sh: Increase wait_for_health_string() timeout
Modified test cases:

1. ver-health.sh:
  a. TEST_check_version_health_1():
    To avoid intermittent timeouts observed in wait_for_health_string(),
    increase the wait time to 20 secs.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-07-30 18:16:00 +05:30
Sridhar Seshasayee
33d2a2c93b qa/standalone/scrub: Force a subset of scrub tests to use "wpq" scheduler
The following tests in the test files mentioned below use the
"osd_scrub_sleep" option to introduce delays during scrubbing to help
determine scrubbing states, validate reservations during scrubbing etc..
This works when using the "wpq" scheduler.

But when the "mclock_scheduler" is enabled, the "osd_scrub_sleep" is
disabled and overridden to 0. This is done to delegate the scheduling of
the background scrubs to the "mclock_scheduler" based on the set QoS
parameters. Due to this, the checks to verify the scrub states,
reservations etc. fail since the window to check them is very short
due to scrubs completing very quickly. This affects a small subset of
scrub tests mentioned below,

1. osd-scrub-dump.sh -> TEST_recover_unexpected()
2. osd-scrub-repair.sh -> TEST_auto_repair_bluestore_tag()
3. osd-scrub-test.sh -> TEST_scrub_abort(), TEST_deep_scrub_abort()

Only for the above tests, until there's a reliable way to query scrub
states with "--osd-scrub-sleep" set to 0, the "osd_op_queue" config
option is set to "wpq".

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-07-30 18:16:00 +05:30
Sridhar Seshasayee
f658ff3511 qa/standalone/erasure-code: Modify erasure-code tests for mclock scheduler
Modified test cases:

1. test-erasure-eio.sh:
  a. Test_ec_backfill_unfound():
    - Set osd_mclock_profile to high_recovery_ops profile.
    - Increase the wait for backfill_unfound timeout to 240 secs.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-07-30 18:16:00 +05:30
Sridhar Seshasayee
bdf36cf045 qa/standalone/osd-backfill: Modify backfill tests for mclock scheduler
Modified test cases:

1. osd-backfill-prio.sh:
  Set osd_op_queue = wpq for all tests since the mclock doesn't
  consider recovery priority as part of its scheduling algorithm.

2. osd-backfill-space.sh:
  Set osd_mclock_profile to high_recovery_ops and increase the wait
  for backfills timeout to 1200 secs for the following tests:
  - TEST_backfill_test_simple()
  - TEST_backfill_test_multi()
  - TEST_backfill_test_sametarget()
  - TEST_backfill_multi_partial()
  - TEST_ec_backfill_simple()
  - TEST_ec_backfill_multi()
  - SKIP_TEST_ec_backfill_multi_partial()
  - SKIP_TEST_ec_backfill_multi_partial()

3. osd-backfill-stats:
  - TEST_backfill_ec_down_all_out():
   Set osd_mclock_profile to high_recovery_ops and increase the wait
   for recovery timeout to 240 secs.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-07-30 18:16:00 +05:30
Sridhar Seshasayee
2c577040cb qa/standalone/osd: Modify osd tests for mclock scheduler
Modified test cases:
1. osd-recovery-prio.sh:
   Set osd_op_queue = wpq for all tests since mclock
   doesn't consider recovery priority as part of its
   scheduling algorithm.

2. osd-recovery-stats.sh:
   a. TEST_recovery_undersized():
     - Set osd_mclock_profile to high_recovery_ops profile.
     - Increase wait for recovery timeout to 300 secs.

3. osd-rep-recov-eio.sh:
   a. TEST_rep_backfill_unfound():
     - Set osd_mclock_profile to high_recovery_ops profile.
     - Increase wait for backfill_unfound to 360 secs.

4. repeer-on-acting-back.sh:
   a. TEST_repeer_on_down_act():
     - Set osd_mclock_profile to high_recovery_ops profile.
       (To improve the test duration)

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-07-30 18:16:00 +05:30
Sridhar Seshasayee
5a85a6a035 qa/standalone: Modify ceph-helpers.sh tests for mclock scheduler.
List of changes:

1. Remove the enforcement to use osd_op_queue=wpq when an osd is brought
   up in the following functions:
   - run_osd()
   - run_osd_filestore() and
   - activate_osd()

2. New functions:
   - get_op_scheduler() - Get the current osd_op_queue for an osd.

3. Modified test cases:
   - test_run_osd() - Add check for osd_max_backfill count.
     The mclock scheduler overrides the count to 1000.

4. New test cases:
   - test_activate_osd_after_mark_down()
   - test_get_op_scheduler()

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-07-30 18:16:00 +05:30
Kotresh HR
103c7bdc70 mgr/volumes: Fail subvolume removal if it's in progress
Removing an in-progress subvolume clone with force doesn't
remove the clone index (tracker). This results in the cloner
thread to stuck in loop trying to clone the deleted one.

This patch addresses the issue by not allowing the subvolume clone
to be removed if it's not complete/cancelled/failed even with force option.
It throws the error EAGAIN, asking the user to cancel the pending clone
and retry.

Fixes: https://tracker.ceph.com/issues/51707
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2021-07-30 13:14:28 +05:30
Patrick Donnelly
0efa23572a qa: verify rank 0 does not fail during journal repair tests
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-07-29 13:53:20 -07:00
Patrick Donnelly
14324ab5c2 qa: avoid stopping/restarting mds in journal repair tests
It is enough to just fail ranks and manipulate the "joinable" flag of
the fs.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-07-29 13:53:20 -07:00
Brad Hubbard
434b325c40
Merge pull request #42442 from badone/wip-insights-reports-non-persistent-storage
Don't persist report data

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-07-29 09:19:32 +10:00