Commit Graph

10039 Commits

Author SHA1 Message Date
Casey Bodley
596e32f542 qa/s3tests: remove stale comment about rfc2616
this comment was about the old fails_strict_rfc2616 attr that was
removed in 6238eaaf0b

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2023-06-22 08:54:33 -04:00
Casey Bodley
1e65b50f60 qa/s3tests: clean up extra_attrs and extra_args logic
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2023-06-22 08:54:33 -04:00
Casey Bodley
1c006c92e1 qa/s3tests: make extra_attrs additive
the s3tests.py task is filtering out several attrs by default. but
when dbstore uses `extra_attrs` to add 'not fails_on_dbstore', it
overwrites those other filters

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2023-06-22 08:54:14 -04:00
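The difference between overwriting and additive `extra_attrs` described in 1c006c92e1 can be sketched as follows. This is a minimal illustration, not the actual s3tests.py task code: `DEFAULT_ATTRS`, its placeholder contents, and both function names are hypothetical; only 'not fails_on_dbstore' comes from the commit message.

```python
# Hypothetical stand-ins for the attrs the task filters out by default.
DEFAULT_ATTRS = ["not example_attr_a", "not example_attr_b"]


def build_attrs_overwrite(extra_attrs=None):
    """Problem case: a non-empty extra_attrs replaces the default filters."""
    return list(extra_attrs) if extra_attrs else list(DEFAULT_ATTRS)


def build_attrs_additive(extra_attrs=None):
    """Fixed case: extra_attrs is appended, so the default filters survive."""
    return list(DEFAULT_ATTRS) + list(extra_attrs or [])


if __name__ == "__main__":
    dbstore = ["not fails_on_dbstore"]
    print(build_attrs_overwrite(dbstore))  # defaults are lost
    print(build_attrs_additive(dbstore))   # defaults are kept
```

With the additive form, a suite such as dbstore only needs to name the filter it adds.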
Ilya Dryomov
ca20127c70
Merge pull request #52109 from idryomov/wip-61616
librbd: avoid object map corruption in snapshots taken under I/O

Reviewed-by: Mykola Golub <mgolub@suse.com>
2023-06-21 11:43:26 +02:00
Ilya Dryomov
acb270a3dd qa/workunits/rbd: make continuous export-diff test actually work
The current version is pretty useless:

- "rbd bench" writes the same byte (0xff) over and over again, so
  almost all checksumming is in vain
- snapshots are taken in a steady state (i.e. not under I/O), so no
  race conditions can get exposed
- even with these caveats, it's not wired up into the suite

Redo this workunit to be a reliable reproducer for the issue fixed
in the previous commit and wire it up for both krbd and rbd-nbd.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2023-06-20 22:14:39 +02:00
Yuri Weinstein
5ae95880bf
Merge pull request #48209 from kamoltat/wip-ksirivad-fix-tracker-57570
osd/OSDMap: Check for uneven weights & != 2 buckets post stretch mode

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2023-06-19 13:29:21 -04:00
Rishabh Dave
67b1935a18
Merge pull request #51132 from lxbsz/wip-59349
qa: wait for 100 seconds to make sure the quota to be enforced

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2023-06-16 17:52:00 +05:30
Adam King
8e5a983a73
Merge pull request #51267 from phlogistonjohn/jjm-cephadm-versioning
Support cephadm "binary" versioning

Reviewed-by: Adam King <adking@redhat.com>
2023-06-14 14:23:27 -04:00
Casey Bodley
212745267d
Merge pull request #52012 from cbodley/wip-61428
qa: suppress SyscallParam error during startup on jammy

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Ali Maredia <amaredia@redhat.com>
2023-06-13 13:42:05 -04:00
Matan
eeae19fba3
Merge pull request #51973 from Matan-B/wip-matan-lsan-supp
qa/lsan.supp: update heap_profiler suppression and ASAN_OPTIONS

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2023-06-13 09:11:14 +02:00
Xiubo Li
6183c992d7
Merge pull request #51703 from lxbsz/wip-59683
xfstests_dev: install extra packages from powertools repo for xfsprogs
2023-06-13 09:44:24 +08:00
Xiubo Li
4a60f6749a
Merge pull request #50728 from lxbsz/wip-59195
qa: switch to use the merge fragment for fscrypt
2023-06-13 07:39:00 +08:00
Casey Bodley
522251642e qa: suppress SyscallParam error during startup on jammy
Fixes: https://tracker.ceph.com/issues/61428

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2023-06-12 16:25:56 -04:00
Xiubo Li
dedf3aae65 xfstests_dev: install extra packages from powertools repo for xfsprogs
CentOS Stream 8 has removed the 'device-mapper-devel', 'libedit-devel'
and 'userspace-rcu-devel' packages from the mirrors, so we need to
install them from the powertools repo.

Fixes: https://tracker.ceph.com/issues/59683
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-06-12 15:08:28 +08:00
Casey Bodley
4167b5c797
Merge pull request #48879 from samarahu/wip-d4n
RGW: WIP D4N

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2023-06-08 11:44:55 -04:00
Matan Breizman
d074e910df qa/lsan.supp: update heap_profiler suppression and ASAN_OPTIONS
In continuation of: 8c099a5340

Register singleton leak seems to be replaced by:
```
-----------------------------------------------------
Suppressions used:
  count      bytes template
      1          8 ^InitModule
-----------------------------------------------------
```

Similar to Crimson's suppression: 6ed8d839b4

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2023-06-08 14:16:01 +00:00
Nizamudeen A
098f175d71
Merge pull request #51955 from rhcs-dashboard/teuth-e2e-fix
qa/workunits: fix test_dashboard_e2e.sh: no spec files found

Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2023-06-08 14:18:14 +05:30
Ronen Friedman
6620562d85
Merge pull request #51911 from ronen-fr/wip-rf-recovery-updates
qa/standalone: osd-recovery-scrub: fix slow updates and recovery concurrency

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
2023-06-08 11:43:59 +03:00
Nizamudeen A
bec3140b20 qa/workunits: fix test_dashboard_e2e.sh: no spec files found
Fixes: https://tracker.ceph.com/issues/61578
Signed-off-by: Nizamudeen A <nia@redhat.com>
2023-06-07 14:04:33 +05:30
Laura Flores
8c6374e8da
Merge pull request #51927 from ljflores/wip-rook-tests
qa/suites/rados: remove rook coverage from the rados suite
2023-06-06 13:35:35 -05:00
Samarah
cd1d249678 QA: Add D4N teuthology suite
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
2023-06-06 14:04:47 -04:00
Laura Flores
66a6e7fdeb qa/suites/rados: whitelist POOL_APP_NOT_ENABLED for rados cls tests
Fixes: https://tracker.ceph.com/issues/59192
Signed-off-by: Laura Flores <lflores@redhat.com>
2023-06-05 15:35:54 -05:00
Laura Flores
c26674ef4c qa/suites/rados: remove rook coverage from the rados suite
The rook team relies on a daily CI system to validate
rook changes. It doesn't seem that the teuthology tests
are maintained, so it makes sense to remove them from the
rados suite.

By removing this symlink, rook test coverage will remain
in the orch suite, and coverage will only be removed from the
rados suite.

Workaround for: https://tracker.ceph.com/issues/58585
Signed-off-by: Laura Flores <lflores@redhat.com>
2023-06-05 15:23:42 -05:00
Yuri Weinstein
b2ec2aff80
Merge pull request #50651 from rosinL/cleanup
Cleanup the LevelDB residue


Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2023-06-05 11:32:51 -04:00
Ronen Friedman
83607c0610 qa/standalone: osd-recovery-scrub: fix slow updates and recovery concurrency
1. Setting frequent scrub status updates, to compensate for the removal
of some 'send updates' in PR#50283.

2. Switching back to using the wpq scheduler, as otherwise the number of
concurrent recovery operations is below what the test expects.

Fixes: https://tracker.ceph.com/issues/61386

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2023-06-04 07:34:02 -05:00
Nizamudeen A
553a0c9ad1
Merge pull request #51844 from rhcs-dashboard/fix-qa-failure-orch
mgr/dashboard: fix test_dashboard_e2e.sh failure

Reviewed-by: Pegonzal <NOT@FOUND>
2023-06-01 11:59:23 +05:30
Nizamudeen A
7c5d92ad48 mgr/dashboard: fix test_dashboard_e2e.sh failure
The QA e2e test is failing because the script has not been adapted to Cypress 10.

Fixes: https://tracker.ceph.com/issues/61519
Signed-off-by: Nizamudeen A <nia@redhat.com>
2023-05-31 11:09:25 +05:30
John Mulligan
a1f6314fd8 qa/cephadm: teuthology test for nfs ingress-mode=haproxy-protocol
Signed-off-by: John Mulligan <jmulligan@redhat.com>
2023-05-26 10:43:11 -04:00
Yuri Weinstein
e513690ad1
Merge pull request #51570 from NitzanMordhai/wip-nitzan-test-mon-thrasher-quorum-delay-inc
test: monitor thrasher wait until quorum

Reviewed-by: Kamoltat (Junior) Sirivadhna <ksirivad@redhat.com>
2023-05-25 12:09:18 -04:00
Yuri Weinstein
925edda1cb
Merge pull request #51527 from NitzanMordhai/wip-nitzan-thrash-eio-pool-size-correct
test: correct osd pool default size


Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Matan Breizman <Matan.Brz@gmail.com>
2023-05-25 12:08:48 -04:00
Yuri Weinstein
40b9a8b2cc
Merge pull request #50876 from rishabh-d-dave/qa-ceph-man-get-keyring
qa/ceph_manager: preserve newline char at EOF in keyring

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2023-05-25 10:49:53 -04:00
Patrick Donnelly
62d1cc0568
Merge PR #50875 into main
* refs/pull/50875/head:
	mon/MDSMonitor: ignore extraneous up:boot messages
	qa: add test case for mds sending multiple boot messages
	qa: support checking for a log message that should not exist

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2023-05-25 08:25:34 -04:00
Kamoltat
2c25b29347 qa/standalone/mon-stretch/mon-stretch-uneven-crush-weights.sh: init
Initialize standalone test for stretched clusters,
testing uneven weight warnings and != 2 buckets
warnings.

Added a `wait_for_health_gone()` function in ceph-helpers.sh.
This function allows us to wait for a health condition to
disappear when doing standalone tests.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2023-05-24 18:35:27 +00:00
Yuri Weinstein
7873afce66
Merge pull request #51528 from NitzanMordhai/wip-nitzan-tests-using-override-instead-overrides
tests: change override to overrides so conf will take affect

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
2023-05-22 14:33:10 -04:00
John Mulligan
bd59b80919 qa/workunits/cephadm: align test_cephadm.sh with new cephadm version
The `cephadm version` command no longer bases the output on the
container images, rather it uses a special python file added to the
zipapp during the build to report on the version of cephadm (the
binary).

The other option was to preserve this behavior and add a new version
command or make it behave differently depending on what options were
provided. I discussed the options with AMK in person and we decided that
changing the tests was preferable.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2023-05-22 13:26:07 -04:00
Nitzan Mordechai
fbd10badbf test: monitor thrasher wait until quorum
With a 1-second delay we may sometimes fail to get the correct quorum
length, because the monitor has not updated in time.
With this fix, we wait for quorum and check every few
seconds (3) until a timeout (30).

Fixes: https://tracker.ceph.com/issues/52316
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2023-05-22 13:54:14 +00:00
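The wait-and-recheck approach described in fbd10badbf can be sketched as a simple polling loop. This is an illustration under stated assumptions, not the thrasher's actual code: `wait_for_quorum_size` and the `get_quorum` callable are hypothetical names, and only the 3-second interval and 30-second timeout come from the commit message.

```python
import time


def wait_for_quorum_size(get_quorum, expected, timeout=30, interval=3,
                         sleep=time.sleep, clock=time.monotonic):
    """Poll get_quorum() until it returns `expected` members or time runs out.

    get_quorum is a hypothetical callable returning the current quorum
    members; sleep and clock are injectable so the loop can be tested.
    Returns True on success and False on timeout.
    """
    deadline = clock() + timeout
    while True:
        if len(get_quorum()) == expected:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


if __name__ == "__main__":
    # Simulated monitor that needs three polls to report a full quorum.
    states = iter([["a"], ["a", "b"], ["a", "b", "c"]])
    print(wait_for_quorum_size(lambda: next(states), 3, sleep=lambda s: None))
```

Checking before sleeping means a quorum that is already complete returns immediately instead of paying the first delay.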
Sridhar Seshasayee
0f6404222f
Merge pull request #51480 from sseshasa/wip-fix-pr48703-followup
osd/scheduler: Reset ephemeral changes to mClock built-in profile

Reviewed-by: Samuel Just <sjust@redhat.com>
2023-05-22 16:37:53 +05:30
Yuri Weinstein
ecebe2f4b2
Merge pull request #50616 from batrick/i59120
qa: use parallel gzip for compressing logs

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
2023-05-20 10:04:13 -04:00
Casey Bodley
aaa04882d9
Merge pull request #51494 from cbodley/wip-61168
qa/rgw: add POOL_APP_NOT_ENABLED to log-ignorelist

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2023-05-19 09:17:52 -04:00
Nizamudeen A
515aa566e5
Merge pull request #51532 from rhcs-dashboard/reorder-daemon-page
mgr/dashboard: reorder rgw daemons list items

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: cloudbehl <NOT@FOUND>
2023-05-19 14:53:25 +05:30
Sridhar Seshasayee
aed71b56be qa/tasks: Allow override of recovery configs for tests
With mClock scheduler enabled, a small subset of config options related
to recovery limits are not allowed to be modified unless
osd_mclock_override_recovery_settings option is enabled. This override
option is disabled by default. The following options cannot be modified
without enabling the override option:

 - osd_max_backfills
 - osd_recovery_max_active[_(hdd|ssd)]

The above options are removed from the mon kv store which effectively
restores them to the default values.

This caused tests such as
test_cluster_configuration.ClusterConfigurationTest to fail, since they
modify the recovery options and expect to verify the modified values.

Therefore, for tests, osd_mclock_override_recovery_settings option is
enabled in vstart_runner.py so that current and future tests
are not affected.

Fixes: https://tracker.ceph.com/issues/61155
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2023-05-18 21:42:27 +05:30
Nizamudeen A
130a52ed50 mgr/dashboard: reorder rgw daemons list items
Fixes: https://tracker.ceph.com/issues/61212
Signed-off-by: Nizamudeen A <nia@redhat.com>
2023-05-18 15:07:37 +05:30
Sridhar Seshasayee
414ac7dd2c osd/scheduler: Reset ephemeral changes to mClock built-in profile
This is a follow-up to PR: https://github.com/ceph/ceph/pull/48703.
This commit also considers changes made ephemerally using either the
'daemon' or the 'tell' interfaces to override the built-in mClock
QoS parameters. In such a scenario, the ephemeral changes are removed
using the rm_val() method exposed by the config subsystem, and this
information is logged.

Other changes:

1. Add a standalone test to exercise the fix.
2. Add documentation note on the outcome of the attempt to modify
   built-in profile defaults.

Fixes: https://tracker.ceph.com/issues/61155
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2023-05-18 14:03:45 +05:30
Ilya Dryomov
95551071f2
Merge pull request #51449 from amathuria/wip-rbd-suite-change-mclock-profile
qa/tasks: Changing default mClock profile to high_recovery_ops

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2023-05-18 09:28:00 +02:00
Venky Shankar
7b2968570a Merge PR #49691 into main
* refs/pull/49691/head:
	qa: add test for opening a file via a hard link that is not in the same mds as the inode
	mds: rdlock_path_xlock_dentry supports returning auth target inode

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2023-05-18 12:49:33 +05:30
Nitzan Mordechai
c9d98ec310 test: correct osd pool default size
Using the default pool size of 2 with random eio thrashing can cause
some of the objects to be marked as lost.
Fix the typo from 'osd default pool size: 3' to 'osd pool default size: 3'
so that the pool size is correctly set to 3.

Fixes: https://tracker.ceph.com/issues/49888
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2023-05-18 04:34:50 +00:00
Aishwarya Mathuria
a7c0029ecc qa/tasks: Change default mClock profile to high_recovery_ops
With the new mClock default profile, tests were failing with "Exiting scrub checking -- not all pgs scrubbed" due to slower scrubs.
Changing the default profile to high_recovery_ops for testing purposes will fix this issue.

Fixes: https://tracker.ceph.com/issues/61228
Signed-off-by: Aishwarya Mathuria <amathuri@redhat.com>
2023-05-18 09:32:20 +05:30
Nitzan Mordechai
3a91670aa5 tests: change override to overrides so conf will take affect
A few test suites use 'override' in their yaml files,
while the ceph.py task looks for 'overrides'. In that case
the config params do not take effect.

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2023-05-17 10:39:59 +00:00
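Why a misspelled key is silently ignored, as described in 3a91670aa5, can be shown with a small lookup sketch. This is an assumption-laden illustration, not the ceph.py task's code: `effective_global_conf` is a hypothetical helper and the nested `ceph` / `conf` / `global` layout is a simplified stand-in for a job yaml.

```python
def effective_global_conf(job_yaml, defaults):
    """Return defaults updated with settings found under 'overrides'.

    Only the 'overrides' key is consulted, so anything placed under a
    misspelled 'override' key is never read and raises no error.
    """
    merged = dict(defaults)
    section = job_yaml.get("overrides", {})
    merged.update(section.get("ceph", {}).get("conf", {}).get("global", {}))
    return merged


defaults = {"osd pool default size": 2}
wanted = {"ceph": {"conf": {"global": {"osd pool default size": 3}}}}

if __name__ == "__main__":
    print(effective_global_conf({"override": wanted}, defaults))   # unchanged
    print(effective_global_conf({"overrides": wanted}, defaults))  # applied
```

Because a missing key falls back to an empty dict, the typo produces no warning, which is why the settings simply never applied.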
Casey Bodley
f0d53e56f8 qa/rgw: add POOL_APP_NOT_ENABLED to log-ignorelist
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2023-05-15 14:26:48 -04:00
Venky Shankar
2dec176827 Merge PR #51386 into main
* refs/pull/51386/head:
	qa: ignore cluster warning when fs flag refuse_client_session is set

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2023-05-15 12:26:46 +05:30