Commit Graph

2329 Commits

Author SHA1 Message Date
Ali Maredia
c4bec75221
Merge pull request #47044 from alimaredia/wip-rgw-suite-s3test-refactor
add s3tests-brach.yaml for rgw teuthology suites that run s3tests

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2022-07-18 15:21:29 -04:00
Ilya Dryomov
0a6a70760a qa/suites/rbd: disable workunit timeout for dynamic_features_no_cache
The I/O workload in this test is xfstests (qa/run_xfstests_qemu.sh)
which isn't subjected to any timeout other than global max_job_time
limit in any other subsuite (e.g. qemu/workloads/qemu_xfstests.yaml).
But here, there is a parallel "op" workload defined as a workunit.
The workunit task has a default timeout of 3 hours which is effectively
imposed on the entire job.  In the "rbd cache = false" configuration,
it's sometimes exceeded.

Fixes: https://tracker.ceph.com/issues/48038
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-07-17 19:06:18 +02:00
Ilya Dryomov
3d710cda2f
Merge pull request #44217 from CongMinYin/fix-pwl-recovery-test
qa/suites/rbd/pwl-cache: ensure recovery is actually tested

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
2022-07-16 11:34:55 +02:00
Ilya Dryomov
2de0574382 qa/tasks: rename persistent write log cache trash task
It doesn't really thrash anything, just repeatedly restarts the
workload on top of a dirty cache file.  rbd_pwl_cache_recovery is
more on point and gets covered by existing CODEOWNERS.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-07-16 09:46:58 +02:00
Yin Congmin
0eab8de3c0 qa/tasks: add thrash test for persistent write log cache
add thrash test for persistent write log cache. run rbd bench
on persistent write log cache, thrashes rbd bench, test the
recovery function of persistent write log cache.

Signed-off-by: Yin Congmin <congmin.yin@intel.com>
2022-07-13 13:31:02 +08:00
Ali Maredia
71a648dc02 qa: remove iam subsuite from rgw suite, run iam tests by default
Minor refactor for RGW user creation

Signed-off-by: Ali Maredia <amaredia@redhat.com>
2022-07-12 16:22:35 -04:00
Ali Maredia
e63ff36b7c rgw: add .yaml that configures s3tests branch for rgw suite
Signed-off-by: Ali Maredia <amaredia@redhat.com>
2022-07-12 16:21:27 -04:00
Rishabh Dave
15ccc52a81
Merge pull request #46861 from ajarr/fix-54108
qa/suites/fs: skip check-counters for iogen workload

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2022-07-08 19:36:22 +05:30
Venky Shankar
01e0a9a7e2
Merge pull request #46746 from vshankar/wip-fs-snaps-timeout
qa: increase fs/snaps workunit timeout to 6h

Reviewed-by: Xiubo Li <xiubli@redhat.com>
2022-07-04 20:41:51 +05:30
Venky Shankar
c37bd10303
Merge pull request #46575 from batrick/i51332
qa: exercise metadata replication in workloads

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-07-04 08:25:15 +05:30
Adam King
1c0813b90e
Merge pull request #44493 from zenomri/wip-omri-cephadm-jaeger
mgr/cephadm: Add jaeger-tracing services support

Reviewed-by: Adam King <adking@redhat.com>
2022-07-01 16:00:50 -04:00
Ali Maredia
79a085e11a
Merge pull request #46660 from pritha-srivastava/wip-rgw-user-policy-test-changes
Changes to integrate user-policy(iam) tests in teuthology.

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2022-06-28 12:04:02 -04:00
Ramana Raja
69b29008fb qa/suites/fs: skip check-counters for iogen workload
We currently run "iogen -n 5 -s 2g" for about 10 minutes. This workload
does not always generate export/import of subtrees that is being checked
by iogen.yaml. iogen workload is suited for running heavily fragmented I/O
on a file system, and not for growing directory trees.

Fixes: https://tracker.ceph.com/issues/54108
Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-06-27 13:34:28 -04:00
Ronen Friedman
2f71185ad6
Merge pull request #46643 from ronen-fr/wip-rf-blocked
scrub/osd: add clearer reminders that a scrub is blocked

Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2022-06-23 18:29:27 +03:00
Pritha Srivastava
e94873ad85 qa/rgw: 1. adding yaml files to integrate iam tests in teuthology.
2. Modifying s3tests.py for iam tests.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
2022-06-23 09:15:24 +05:30
Ronen Friedman
996d9064c6 scrub/osd: disable blocked-scrub warnings during some tests
As some Teuthology tests seem to block objects for long minutes,
we must not issue the "scrub is blocked for too long" warning
(that warning causes the tests to fail).

A new configuration parameter now controls the grace period before
the warning is issued. Some tests were modified to set this
configuration parameter to a large value.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2022-06-22 05:24:37 +00:00
Venky Shankar
8c50e1ee2c qa: increase fs/snaps workunit timeout to 6h
commit 4fbf4c4f58 increases the
number of tags used in snaptest-git-ceph.sh tests. This makes
the tests run longer (than default 3h) thereby timing out.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2022-06-20 07:21:22 +05:30
Omri Zeneva
a33d77b813 qa: add jaeger deployment test in teuthology
Signed-off-by: Omri Zeneva <ozeneva@redhat.com>
2022-06-16 12:29:58 +03:00
Zack Cerza
d9e7c1b797
Merge pull request #46582 from ceph/rhel86
qa: Default to RHEL8.6 instead of 8.5
2022-06-15 15:05:55 -06:00
David Galloway
b8c4488c55 qa: Default to RHEL8.6 instead of 8.5
Signed-off-by: David Galloway <dgallowa@redhat.com>
2022-06-15 14:13:35 -04:00
Venky Shankar
27f4729256
Merge pull request #45556 from mchangir/qa-add-subvolume-option-flavors
qa: add subvolume option flavors

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
2022-06-13 12:29:43 +05:30
Patrick Donnelly
7d3982201f
qa: move export yaml to separate file
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-06-11 20:17:41 -04:00
Patrick Donnelly
0208513273
qa: exercise metadata replication in workloads
Fixes: https://tracker.ceph.com/issues/51332
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-06-11 20:17:41 -04:00
Venky Shankar
45c9fd6187
Merge pull request #46078 from kotreshhr/fuse-directory-dacs-issue
client/fuse: Fix directory DACs overriding for root

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
2022-06-07 22:45:05 +05:30
Ilya Dryomov
9324b49ebb
Merge pull request #46045 from idryomov/wip-qa-rwl-on-tmpfs
qa/suites/rbd: place cache file on tmpfs for xfstests

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
2022-06-07 10:39:31 +02:00
Ilya Dryomov
23759e0034 qa/suites/rbd: place cache file on tmpfs for xfstests
The RWL mode needs DAX and is dog slow otherwise -- qemu_xfstests.yaml
job always hits the 6 hour max_job_time limit.

As our tmpfs instance is limited and qemu_xfstests.yaml opens three
images at the same time, reduce the "big cache" size to 5G.  This facet
was added to iron out 32-bit head/tail pointer issues and 5G still does
the job there.

Going through the loop device is needed because tmpfs doesn't support
O_DIRECT.

Fixes: https://tracker.ceph.com/issues/55400
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-06-03 19:20:37 +02:00
Ilya Dryomov
3475f9ef07 qa/suites/rbd: refactor persistent-writeback-cache suite
Rename to pwl-cache, introduce home subdirectory and 4-cache-path.yaml.
No functional changes.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-06-03 00:06:35 +02:00
Casey Bodley
63f84c50e0
Merge pull request #39002 from ceph/wip-rgw-multisite-reshard
rgw multisite: bucket reshard work in progress

Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
2022-06-02 16:04:30 -04:00
Casey Bodley
8e51eb8c05 test/rgw/multisite: enable zonegroup resharding feature
qa/tasks/rgw_multisite.py uses 'zonegroup set' to create zonegroups from
their json format. this doesn't enable any of the supported zonegroup
features by default, so this adds the 'enabled_features' field to the
json representations

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-06-01 14:10:26 -04:00
0xavi0
895ee7921c
rgw/dbstore: change default value of dbstore_db_dir to /var/lib/ceph/radosgw
Changes a few NULL to nullptr.

Adds std::filesystem for path building so they're platform independant.

Fixes a bug for DBStoreManager's second constructor not creating the DB.

Adds unit tests to test DB path and prefix.

Fixes: https://tracker.ceph.com/issues/55731

Signed-off-by: 0xavi0 <xavi.garcia@suse.com>
2022-05-31 09:01:56 +02:00
Casey Bodley
8521fb2be0 qa/rgw: temporarily disable multisite reshard tests
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-05-27 15:47:32 -04:00
Casey Bodley
e7f3ac92a7 qa/rgw: disable multisite tests for 'bucket sync disable'
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-05-27 15:47:31 -04:00
Patrick Donnelly
6606d91ace
Merge commit 'b76b6ea16899861d6c266797b3c11d9f68f924d7' into main
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-05-25 16:02:14 -04:00
Venky Shankar
b76b6ea168
Merge pull request #44486 from batrick/fs-workload-kclient-switches
qa: reduce fs:workload use of random selection

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-05-25 06:54:53 +05:30
David Galloway
805d2320b3
Merge pull request #46385 from ceph/inclusive-rename-yaml
Rename/re-symlink whitelist_*.yaml
2022-05-24 17:04:50 -04:00
Zack Cerza
0666dd34cf Rename/re-symlink whitelist_*.yaml
whitelist_health.yaml -> ignorelist_health.yaml
whitelist_wrongly_marked_down.yaml -> ignore_wrongly_marked_down.yaml

This was mostly addressed in
2ee9365d0b,
but the rename wasn't done there.

Signed-off-by: Zack Cerza <zack@cerza.org>
2022-05-24 14:14:04 -06:00
Adam King
7dbddc008c
Merge pull request #46324 from ceph/dgalloway-nodockerio
qa: Pull from quay.io not docker.io

Reviewed-by: Adam King <adking@redhat.com>
2022-05-24 10:27:03 -04:00
Neha Ojha
f0aeb2ec29
Merge pull request #46346 from ljflores/wip-lflores-testing-recovery
qa/suites/rados/thrash-erasure-code-big/thrashers: add `osd max backfills` setting to mapgap and pggrow

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2022-05-23 16:26:27 -07:00
Casey Bodley
51969f86a2
Merge pull request #45985 from soumyakoduri/wip-skoduri-dbstore-tests
rgw/qa: Run s3tests on dbstore backend in teuthology

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-05-23 12:55:41 -04:00
Soumya Koduri
a7397100aa rgw/qa: Run s3tests on dbstore backend in teuthology
Add a new test-suite to run s3tests on dbstore backend.

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
2022-05-21 23:49:47 +05:30
Laura Flores
40062676c2 qa/suites/rados/thrash-erasure-code-big/thrashers: add osd max backfills setting to mapgap and pggrow
All `rados/thrash-erasure-code-big` tests that die due to the “wait_for_recovery” timeout have one thing in common: They contain either `thrashers/pggrow` or `thrashers/mapgap`.

The difference between pggrow and mapgap vs. all other non-offending thrashers (default, careful, fastread, and morepggrow) is that they lack an override setting for `osd max backfills`. `osd max backfills` is the max number of backfill operations allowed to/from an OSD. The higher the number, the quicker the recovery. By default, this value is 1. On all of the non-offending thrashers (default, careful, fastread, and morepggrow), the default 1 value gets overridden in their .yaml files with a value > 1. This is not the case for pggrow and mapgap, however, as they lack an `osd max backfills` override setting.

The mclock op scheduler is known to override `osd max backfills` with a high value, but all of the thrash-erasure-code-big thrashers have their op queue set to “debug_random”, which chooses randomly between op queues (the debug_random op queue is set to override the default mclock_scheduler in qa/config/rados.yaml). So, coupled with the “debug_random” op queue, the low `osd max backfill` setting is causing some tests to time out in recovery.

WITHOUT `osd max backfills`, as they are now, “mapgap” and “pggrow” tests die due to timed-out recovery about 17/100 times, as seen here with a pggrow test: http://pulpito.front.sepia.ceph.com/lflores-2022-05-18_14:24:29-rados:thrash-erasure-code-big-master-distro-default-smithi/

WITH `osd max backfills` specified, as I have suggested in this PR, 99/100 tests passed, with one test failing for a different reason:
http://pulpito.front.sepia.ceph.com/lflores-2022-05-17_22:40:27-rados:thrash-erasure-code-big-master-distro-default-smithi/

I also scheduled 145 tests WITH `osd max backfills` that are a mix of pggrow and mapgap thrashers. 144/145 tests passed, with one test failing for a different reason. http://pulpito.front.sepia.ceph.com/lflores-2022-05-17_15:27:54-rados:thrash-erasure-code-big-master-distro-default-smithi/

Fixes: https://tracker.ceph.com/issues/51076
Signed-off-by: Laura Flores <lflores@redhat.com>
2022-05-19 18:29:00 -05:00
Adam King
791e1d20b3 qa/suites/orch/cephadm: staggered upgrade test
Signed-off-by: Adam King <adking@redhat.com>
2022-05-19 17:24:56 -04:00
Soumya Koduri
aafa3d173f
Merge pull request #46008 from soumyakoduri/wip-skoduri-cloudtier-tests
rgw/qa: enable s3-tests related to cloud-transition feature

Reviewed-by: casey Bodley <cbodley@redhat.com>
Reviewed-by: Maredia, Ali <amaredia@redhat.com>
2022-05-19 15:53:54 +05:30
Soumya Koduri
fa86d2ea42 rgw/qa: Run tests on multiple cloudtier config
Run cloudtier tests with parameter 'retain_head_object'
set to true and false.

However having multiple cloudtier storage classes in the same task
is increasing the transition time and resulting in spurious failures.
Hence until there is a consistent way of running the tests, without
having to depend on lc_debug_interval, disabled one of the config for
now.

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
2022-05-19 13:57:46 +05:30
Casey Bodley
0b7ab95c81
Merge pull request #46232 from cbodley/wip-qa-rgw-sse
qa/rgw: use 'with-sse-s3' override for s3tests

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2022-05-18 16:44:28 -04:00
David Galloway
ae2e93f5f1 qa: Pull from quay.io not docker.io
We are pushing to quay.io now due to docker.io rate limiting.

Signed-off-by: David Galloway <dgallowa@redhat.com>
2022-05-18 14:22:52 -04:00
Soumya Koduri
f846f6ff02 rgw/qa: Add separate task for cloudtier tests
Signed-off-by: Soumya Koduri <skoduri@redhat.com>
2022-05-17 20:54:58 +05:30
Soumya Koduri
d0ac12dc38 rgw/qa: enable s3-tests related to cloud-transition feature
Added a new test-suite to configure cloudtier storage class and run related s3-tests.

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
2022-05-17 20:54:58 +05:30
Kotresh HR
2e1f43c99b client/fuse: Fix directory DACs overriding for root
DACs are overridable for directories. For files,
Read/write DACs are always overridable but executable
DACs are overridable when there is at least one exec bit
set.

The files and directory DACS overriding were handled the
same way for root which is incorrect. This patch fixes
DACs overriding as described above for the root.

Fixes: https://tracker.ceph.com/issues/55313
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-05-17 12:15:41 +05:30
Venky Shankar
44a0199114
Merge pull request #45834 from ajarr/wip-53996
qa/suites/fs/upgrade: change octopus installations and references

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-05-12 15:14:53 +05:30