Commit Graph

2920 Commits

Author SHA1 Message Date
Shilpa Jagannath
4f18f8a607
Merge pull request #56792 from cbodley/wip-qa-rgw-notifications-valgrind
qa/rgw: enable valgrind in rgw/notifications suite
2024-07-31 10:10:26 -07:00
Yuri Weinstein
1a2a617edd
Merge pull request #54671 from baum/ceph-nvmeof-mon
mon: add NVMe-oF gateway monitor and HA

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2024-07-31 09:05:41 -07:00
Vallari Agrawal
d0c4182bf5 qa/suites/rbd/nvmeof: add multi-subsystem setup and thrash test
1. qa/tasks/nvmeof.py:
    1.1. create multiple rbd images for all subsystems
    1.2. add NvmeofThrasher and ThrashTest
2. qa/tasks/mon_thrash.py: add 'switch_thrashers' option
3. nvmeof_setup_subsystem.sh: create multiple subsystems and enable HA
4. Restructure qa/suites/rbd/nvmeof: Create two sub-suites
   - "basic" (nvmeof_initiator job)
   - "thrash" (new: nvmeof_mon_thrash and nvmeof_thrash jobs)

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
2024-07-31 08:52:05 +00:00
Venky Shankar
2c8802a3dd qa: some test set refuse_client_session, so the cluster log is expected
Fixes: http://tracker.ceph.com/issues/66639
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2024-07-30 10:51:46 +05:30
Yuri Weinstein
3956c4278a
Merge pull request #58205 from NitzanMordhai/wip-nitzan-rados-dashboard-test-update-ignorelist
suites: test should ignore osd_down warnings

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
2024-07-26 10:24:44 -07:00
Ilya Dryomov
e25b81ea93
Merge pull request #58781 from idryomov/wip-67097
qa/suites/krbd: stress test for recovering from watch errors for -o exclusive

Reviewed-by: Ramana Raja <rraja@redhat.com>
2024-07-25 14:45:18 +02:00
Adam King
6e8aab4731
Merge pull request #58153 from anoopcs9/modify-smb-dot-conf
mgr/smb: Make appropriate changes to global and share specific options

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2024-07-24 11:13:00 -04:00
Ilya Dryomov
2a9f92f234 qa/suites/krbd: override extra_system_packages directly on install task
Follow commit c61cb16223 ("qa/suites/rbd: override
extra_system_packages directly on install task").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-07-24 08:48:47 +02:00
Ilya Dryomov
8fee41da8b qa/suites/krbd: stress test for recovering from watch errors for -o exclusive
This is based on a test added in commit 237aa221eb ("qa/suites/krbd:
stress test for recovering from watch errors") for regular mappings.

Fixes: https://tracker.ceph.com/issues/67097
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-07-24 08:48:47 +02:00
Patrick Donnelly
f4083b7f9f
Merge PR #57857 into main
* refs/pull/57857/head:
	qa: set debug_ms=2 for cephfs tests
	msg: decrease the level to print the sending msgs from the protocol

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2024-07-23 15:55:49 -04:00
Yuri Weinstein
4adc795c49
Merge pull request #58215 from badone/wip-tracker-59380-admin-socket-injectfull
qa/suites/rados: Cancel injectfull to allow cleanup

Reviewed-by: Neha Ojha <nojha@redhat.com>
2024-07-23 10:57:08 -07:00
Yuri Weinstein
1fa959e982
Merge pull request #57485 from sseshasa/wip-fix-validator-osd-down-grace-tmout
qa/suites/rados/verify/validater: increase heartbeat grace timeout

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
2024-07-23 10:50:32 -07:00
Laura Flores
39a09a3590
Merge pull request #58275 from NitzanMordhai/wip-nitzn-host-thraser-fix-min-in-checks
suites: host thrasher should check min_in before thrashing host
2024-07-22 13:22:30 -05:00
Laura Flores
1161a9fb43
Merge pull request #58277 from NitzanMordhai/wip-nitzan-upgrade-suites-quincy-ignore-list-adding
suites/upgrade/quincy-x: update the ignore list
2024-07-22 13:15:50 -05:00
Adam Kupczyk
337c8bf901
Merge pull request #57002 from aclamk/wip-aclamk-bs-storetest-expand-synthetic
Improved structure for objectstore unit tests.
2024-07-22 13:48:06 +02:00
Matan Breizman
420dcdb2a6
Merge pull request #58256 from NitzanMordhai/wip-nitzan-wait-osd-admin-command
crimson/osd: adding osdmap subscribe

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2024-07-21 13:22:16 +03:00
Adam King
51fd35d7c7
Merge pull request #57817 from phlogistonjohn/jjm-smb-sqlite
mgr/smb: add sqlite internal store backend for smb mgr module

Reviewed-by: Adam King <adking@redhat.com>
2024-07-18 10:58:16 -04:00
Yuri Weinstein
bc2bb38657
Merge pull request #57906 from kamoltat/wip-ksirivad-stretch-mode-netsplit-test
mon/ElectionLogic: tie-breaker mon ignore proposal from marked down mon

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2024-07-16 15:08:20 -07:00
Nitzan Mordechai
a9de8be0f4 suites: crimson basic adding wait for test
Fixes: https://tracker.ceph.com/issues/66381
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-07-15 11:11:16 +00:00
Venky Shankar
2ab14159a6
Merge pull request #49974 from neesingh-rh/wip-58619
mds: fix session/client evict command.

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-07-15 15:55:20 +05:30
Matan Breizman
c07bae3bfd
Merge pull request #58396 from NitzanMordhai/wip-nitzan-crimson-stat-pybind-test
suite/crimson: enable stats tests

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2024-07-11 11:47:29 +03:00
Adam King
59acb549cd
Merge pull request #58382 from adk3798/cephadm-upgrade-test-start-points
qa/cephadm: change upgrade test start point to reef/squid

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2024-07-10 13:09:33 -04:00
John Mulligan
d77139e74b qa/suites/orch: enable cephadm single_host_defaults for smb tests
Set the cephadm single_host_defaults option for the smb tests that use a
single node ceph cluster. Without this option the mgr_pool and the
sqlite3 db support needed by the smb mgr module will not be available.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2024-07-09 14:28:49 -04:00
Anoop C S
6472b9bc2a mgr/smb: Use the default for minimum protocol version
'server min protocol' smb.conf option indicates the minimum SMB protocol
version that the server can support during client negotiation phase. We
can be generous in this case to accept various client implementations
with a subset of supported protocol versions. With 'SMB2' it selects the
highest available version from the list of sub protocols. Instead we
refrain from configuring the option to support the least possible SMB2
version by default(which is SMB2_02 at this point in time) from Samba.

ref: https://www.samba.org/samba/docs/current/man-html/smb.conf.5.html
Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
2024-07-05 19:50:45 +05:30
Anoop C S
5ac43ef8af mgr/smb: Add acl_xattr configuration to share definition
Samba allows to store full Windows(NT) ACLs and bypass any lossy mapping
attempts to and from POSIX ACLs. Therefore we stack an additional module
in front of ceph to store NTACLs in a special extended attribute. Due to
certain limitations within containers the default xattr from security
namespace can't be used. Instead we configure a similar xattr from more
flexible 'user' namespace with the help of an option to the module.

ref: https://www.samba.org/samba/docs/current/man-html/vfs_acl_xattr.8.html
Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
2024-07-05 19:50:20 +05:30
Ali Masarwa
a236221246
Merge pull request #58408 from AliMasarweh/wip-alimasa-http-to-kafka
RGW|BN : change endpoint for http failing tests

Reviewed-by: yuvalif<ylifshit@redhat.com>
2024-07-04 16:17:49 +03:00
Ali Masarwa
5d6db362d3 RGW|BN : change endpoint for http failing tests
Signed-off-by: Ali Masarwa <amasarwa@redhat.com>
2024-07-04 10:06:26 +03:00
Guillaume Abrioux
53b462764c qa: fix log errors for cephadm tests
This fixes a lot of errors induced by the log scrapper check.

Fixes: https://tracker.ceph.com/issues/66751

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2024-07-03 16:25:49 +02:00
Nitzan Mordechai
d66d9f35d3 suite/crimson: enable stats tests
Fixes: https://tracker.ceph.com/issues/66382
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-07-02 11:37:49 +00:00
Adam King
cfb26d81eb qa/cephadm: change upgrade test start point to reef/squid
Since main is now what will become the T release,
we only need to be testing upgrades from reef and
squid. Upgrades from quincy won't be valid once
we've branched off

Signed-off-by: Adam King <adking@redhat.com>
2024-07-01 14:22:47 -04:00
Matan Breizman
94c6055307
Merge pull request #56150 from xxhdx1985126/wip-seastore-teuthology
qa/suites/crimson-rados: add seastore

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2024-07-01 12:02:57 +03:00
Matan Breizman
69b241a462 qa/suites/crimson-rados: disable seastore with default thrash
Currently we have 2 types of thrash tests: default and simple.
Seastore passes only the simple one.

The 2 types are seperated to diffrent dirs. `thrash` dir
will be tested with bluestore only.
`thrash_simple` will be tested with both backend.

Once Seastore passes default ones, we can merge back the two dirs

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2024-06-30 12:32:09 +03:00
Xuehan Xu
1ad78107b1 qa/suites/crimson-rados: add seastore
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2024-06-27 17:49:21 +03:00
Venky Shankar
086c282494 Merge PR #56476 into main
* refs/pull/56476/head:
	qa: add README to every fs:upgrade sub-suite
	qa: upgrade sub-suite mds_upgrade_sequence from n-1 and n-2 releases

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-06-27 19:56:29 +05:30
Nitzan Mordechai
2ef093a32d suites/upgrade/quincy-x: update the ignore list
Our upgared suites failed beacuse of those warning without good
reason, we can ignore them and continue testings.

Fixes: https://tracker.ceph.com/issues/65422
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-06-27 05:09:18 +00:00
Kamoltat
7a90d9d852 qa/suites/netsplit/tests/2-dc-1-arbiter-stretch-cluster-netsplit.yaml
Test netsplit between 2 datacenters
in a stretch mode cluster.

Observe if:

- PGs are active
- Cluster is accessible
- Writes and Reads went through

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2024-06-26 15:22:07 +00:00
NitzanMordhai
64880d7c87
Merge pull request #58126 from NitzanMordhai/wip-nitzan-suites-rados-cephadm-stray-daemon
suites: adding ignore list for stray daemon
2024-06-26 17:44:13 +03:00
Nitzan Mordechai
89d695fb8b suites: check for host thrasher
The last PR modified the suites to only check for host thrasher.
This update fixes that issue by implementing different settings
with dedicated YAML files for host thrashing

Fixes: https://tracker.ceph.com/issues/66657
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-06-26 12:16:48 +00:00
Brad Hubbard
4c5d0e30d2 qa/suites/rados: Cancel injectfull to allow cleanup
IO is frozen when the injectfull command is sent as part of the test
which causes the cleanup to hang so we need to clear it.

Fixes: https://tracker.ceph.com/issues/59380
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2024-06-26 10:03:43 +10:00
Yuri Weinstein
359d20f326
Merge pull request #58141 from ljflores/wip-tracker-65852
qa/suites/rados/thrash/workloads: remove cache tiering workload

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
2024-06-25 06:47:14 -07:00
Venky Shankar
d17c681296 Merge PR #56052 into main
* refs/pull/56052/head:
	qa/suites: ignore unresponsive client when the test passes
	qa: enhance per-client labelled perf counters test

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-06-25 12:40:25 +05:30
Nitzan Mordechai
2c65f1da96 suites: test should ignore osd_down warnings
Fixes: https://tracker.ceph.com/issues/64870
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-06-23 08:49:45 +00:00
Laura Flores
7ca11f41de
Merge pull request #57974 from ljflores/wip-tracker-65189
Telemetry upgrade test fixes
2024-06-21 16:48:27 -05:00
Laura Flores
914149df5f qa/suites/upgrade/telemetry-upgrade/quincy-x: update image reference
`quay.io/ceph/daemon-base:latest-quincy` points to the latest quincy tag (v17.2.7),
not the tip of quincy, which is actually what we want to test.

Pulling the daemon-base:latest-quincy image shows it is running v17.2.7:
```
lflores:ceph$ podman run --rm -it daemon-base:latest-quincy
[root@e4eb265e7fa0 /]# ceph -v
ceph version 17.2.7 (b12291d110049b2f35e32e0de30d70e9a4c060d2) quincy (stable)
```

It is important that we test the tip of quincy in case there is a fix there that
we want to pick up.

I referenced this commit to make the change: d0381ed7e6

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-06-21 12:36:38 -05:00
Laura Flores
fcb22a0220 qa/suites/upgrade/telemetry-upgrade/reef-x: update how cephadm is pulled and change image reference
Update how cephadm is pulled:

`cephadm_git_url` and `cephadm_branch` are used in releases older than reef
to install cephadm. Both of these keys are needed to install it from the github
repo.

However, in reef and on, the compiled zipapp cephadm needs to be pulled differently
than the old single python script `cephadm` from earlier releases.

This commit is what I referenced for this change: 967716f2e6

-------

Change image reference:

`quay.io/ceph/daemon-base:latest-reef` points to the latest reef tag (v18.2.2),
not the tip of reef, which is actually what we want to test.

Pulling the daemon-base:latest-reef image shows it is running v18.2.2:
```
lflores:ceph$ podman run --rm -it daemon-base:latest-reef
[root@d4d9a0eb04be /]# ceph -v
ceph version 18.2.2 (531c0d11a1c5d39fbfe6aa8a521f023abf3bf3e2) reef (stable)
```

It is important that we test the tip of reef in case there is a fix there that we
want to pick up.

I referenced this commit to make the change: d0381ed7e6

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-06-21 12:35:03 -05:00
Laura Flores
dc65e55d6e qa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_release=squid
The warnings added to the ignorelist show up in the cluster log, but they are
expected during upgrades and should thus be ignored.

We also need to set require_osd_release=squid to avoid this warning:
```
cluster [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)
```

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-06-21 12:22:22 -05:00
Adam King
98c986f1f5
Merge pull request #57412 from adk3798/stray-laundry2
qa/cephadm: fix ignorelist of CEPHADM_STRAY_DAEMON for rados_api_tests

Reviewed-by: Laura Flores <lflores@ibm.com>
2024-06-20 12:06:18 -04:00
Patrick Donnelly
959f007d83
Merge PR #55792 into main
* refs/pull/55792/head:
	tools/cephfs: recover alternate_name of dentries from journal
	qa: add test to verify recovery of alternate_name from journal
	tools/cephfs/JournalTool: add some more debugging
	tools/cephfs/JournalTool: remove extraneous 0x in debug output
	mds: dump alternate_name to formatter
	mds: add warning about encoding new fields

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2024-06-19 15:19:56 -04:00
Laura Flores
35505a7f1f qa/suites/rados/thrash/workloads: remove cache tiering workload
Fixes: https://tracker.ceph.com/issues/65852
Signed-off-by: Laura Flores <lflores@ibm.com>
2024-06-19 12:53:44 -05:00
Nitzan Mordechai
3c7ffde71f suites: adding ignore list for stray daemon
Few tests that deploying osds that causing the test fail
beacuse of warning during the period time that the osds
restart. adding to ignore list.

Fixes: https://tracker.ceph.com/issues/64872
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-06-19 05:59:51 +00:00