Commit Graph

4428 Commits

Author SHA1 Message Date
Vallari Agrawal
02fe44ac60
Merge pull request #59434 from VallariAg/fix-nvmeof-apply-teuthology
qa/tasks/nvmeof.py: add nvmeof gw-group to deployment
2024-08-28 18:07:35 +05:30
Milind Changire
277423c15a
qa: add test for data uninlining
Signed-off-by: Milind Changire <mchangir@redhat.com>
2024-08-28 13:28:27 +05:30
Adam King
639916859f
Merge pull request #59421 from phlogistonjohn/jjm-teuth-cephadm-from-ctr
qa/tasks: add a new cephadm_from_container feature to cephadm task

Reviewed-by: Adam King <adking@redhat.com>
2024-08-27 13:32:43 -04:00
Patrick Donnelly
64e2bd347b
Merge PR #58419 into main
* refs/pull/58419/head:
	mds: generate correct path for unlinked snapped files
	qa: add test for cephx path check on unlinked snapped dir tree
	mds: add debugging for stray_prior_path

Reviewed-by: Milind Changire <mchangir@redhat.com>
2024-08-27 13:10:54 -04:00
Avan Thakkar
1abb4113ca mgr/nfs: adopt API & unit tests for nfs exports
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
2024-08-27 12:03:17 +05:30
Venky Shankar
409001969e
Merge pull request #54620 from rishabh-d-dave/mgr-vol-clone-stats
mgr/vol: show progress and stats for the subvolume snapshot clones

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-08-26 15:44:53 +05:30
Vallari Agrawal
c9a6fedbfa
qa/tasks/nvmeof.py: add nvmeof gw-group to deployment
Groups was made a required parameter to be
`ceph orch apply nvmeof <pool> <group>` in
https://github.com/ceph/ceph/pull/58860.
That broke the `nvmeof` suite so this PR fixes that.

Right now, all gateway are deployed in a single group.
Later, this would be changed to have multi groups for a better test.

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
2024-08-26 15:15:10 +05:30
John Mulligan
0baf2e4f19 qa/tasks: add a new cephadm_from_container feature to cephadm task
The cephadm_from_container allows one to do a single container build
and then point teuthology at that image as the "single source of truth".
I find this extremely convenient when running teuthology locally and
I keep carrying this patch around - I figure having it upstream will
simplify my workflow. Maybe someday it'll benefit others too.

To use it I set up a yaml overrides file with the following content:
```yaml
overrides:
  cephadm:
    image: "quay.io/phlogistonjohn/ceph:dev"
    cephadm_from_container: true
  verify_ceph_hash: false
verify_ceph_hash: false
```

This let's me test my custom builds fairly easily!

Signed-off-by: John Mulligan <phlogistonjohn@asynchrono.us>
2024-08-23 14:35:55 -04:00
Venky Shankar
db4959e44f Merge PR #58487 into main
* refs/pull/58487/head:
	qa/suites/fs/workload: drop mgrmodules stanza
	qa/tasks/ceph: fix "ceph mgr module enable" command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
2024-08-23 22:02:34 +05:30
Ilya Dryomov
9ac05d9030
Merge pull request #44470 from orozery/rbd-external-migrate
librbd/migration: add external clusters support

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
2024-08-23 10:20:47 +02:00
Venky Shankar
6b1d0de80e Merge PR #56816 into main
* refs/pull/56816/head:
	doc: mention the peer status failed when snapshot created on the remote filesystem.
	qa: add test_cephfs_mirror_remote_snap_corrupt_fails_synced_snapshot
	cephfs_mirror: update peer status for invalid metadata in remote snapshot

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
2024-08-22 14:52:33 +05:30
Venky Shankar
76226b35ce Merge PR #59166 into main
* refs/pull/59166/head:
	mon/thrasher: set stopping

Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2024-08-22 14:50:51 +05:30
Patrick Donnelly
9296c2a36f
qa: load all dirfrags before testing altname recovery
The journal reset effectively cleared the cache so the rank may not have the
dirfrag in memory when we verify alternate name recovery.

Fixes: https://tracker.ceph.com/issues/67511
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-08-21 10:10:36 -04:00
Jos Collin
db9825d314
qa: make all replayer threads busy and query 'syncing' status
* Make all replayer threads busy and then query for 'syncing' state
instead of just fetching the current status.

* Dropped 'current_syncing_snap' check, as it's not compulsory for
this test. The actual intension is to make threads in 'syncing' status
and 'current_syncing_snap' check is not necessary for that.

* Drop 'snaps_deleted' metrics check in test_cephfs_mirror_cancel_mirroring_and_readd.
test_cephfs_mirror_cancel_mirroring_and_readd primarily focusses
on the synchronization of the newly added directory paths post removal
of the previously added/syncing directory paths. So checking of 'snaps_deleted'
metrics is unnecessary here.

* Wait for more time to finish the new snapshot creations and the sync backoff.
We need to wait for more time in test_cephfs_mirror_cancel_mirroring_and_readd,
as the test makes all replayer threads busy.

Fixes: https://tracker.ceph.com/issues/64711
Signed-off-by: Jos Collin <jcollin@redhat.com>
2024-08-20 13:20:07 +05:30
Venky Shankar
9f27bde0b9 qa/cephfs: add test to verify backtrace update failure on deleted data pool
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2024-08-20 07:07:47 +00:00
Casey Bodley
630f8e04c0
Merge pull request #59241 from tobias-urdin/openstack-upperconstraints
qa: barbican: restrict python packages with upper-constraints

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2024-08-19 13:10:57 -04:00
Tobias Urdin
bbcb820c77 qa: barbican: restrict python packages with upper-constraints
We install barbican by doing a pip install directly on the
cloned git repository but we don't honor the upper-constraints
from the OpenStack Requirements project that handles what
versions is supported.

This changes the pip install command that we issue when
installing barbican to honor the requirements for the
version (derived from the branch) that we use, in
this case it's the 2023.1 release upper-constraints [1].

This prevents us from pulling in untested Python packages.

This only updates Barbican because for the Keystone job
we dont directly issue pip but install using tox using the
`venv` environment which already by default sets the
constraints as you can see in [2].

[1] https://releases.openstack.org/constraints/upper/2023.1
[2] https://github.com/openstack/keystone/blob/stable/2023.1/tox.ini#L12

Fixes: https://tracker.ceph.com/issues/67444
Signed-off-by: Tobias Urdin <tobias.urdin@binero.com>
2024-08-19 15:02:16 +02:00
Yuval Lifshitz
7d2cf2cabc
Merge pull request #59239 from yuvalif/wip-yuval-67513
Reviewed-By: Casey Bodley <cbodley@ibm.com>


test/rgw/notification: use real ip address instead of localhost

based on that comment:
https://tracker.ceph.com/issues/67206#note-6
the address used by the endpoint is taken as the real IP address of the
host where the test script is running and not localhost.

we also changed the rabbitmq-server conf to allow "guest"
user to connect over non localhost address

Fixes: https://tracker.ceph.com/issues/67206

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2024-08-19 13:37:07 +03:00
Nizamudeen A
60be38e077
Merge pull request #58995 from rhcs-dashboard/fix-66844-main
qa/mgr/dashboard: fix test race condition

Reviewed-by: Nizamudeen A <nia@redhat.com>
2024-08-19 11:19:52 +05:30
Or Ozeri
409d36c708 qa/tasks/qemu: remove hard-coding of cluster name
This commit allows running the qemu task on an arbitrary cluster name.

Signed-off-by: Or Ozeri <oro@il.ibm.com>
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-08-18 21:34:09 +02:00
Or Ozeri
4f8dc834cc qa/tasks/rbd: support non-default ceph clusters
This commit allows running the rbd task on an arbitrary cluster name.

Signed-off-by: Or Ozeri <oro@il.ibm.com>
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-08-18 21:34:09 +02:00
Yuval Lifshitz
cdd75b499f test/rgw/notification: use real ip address instead of localhost
based on that comment:
https://tracker.ceph.com/issues/67206#note-6
the address used by the endpoint is taken as the real IP address of the
host where the test script is running and not localhost.

we also changed the rabbitmq-server conf to allow "guest"
user to connect over non localhost address

Fixes: https://tracker.ceph.com/issues/67206

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2024-08-16 14:36:17 +03:00
Casey Bodley
5000b233c1
Merge pull request #59169 from cbodley/wip-67464
rgw: revert account-related changes to get_iam_policy_from_attr()

Reviewed-by: Pritha Srivastava <prsrivas@redhat.com>
2024-08-14 09:14:13 -04:00
Rishabh Dave
a6b95a5a69 qa/cephfs: update a test in test_volumes.py
Test name is test_subvolume_snapshot_info_if_clone_pending_for_no_group,
located in class TestSubvolumeSnapshotClones in test_volumes.py

5 seconds can (sometimes) be insufficient as value of the config option
"snapshot_clone_delay" in this. Increase it to avoid unnecessary race
conditions which leads to irrelevant failures.

Following is an example where 5 seconds was insufficient as waiting
period since instead it took 8 seconds -

2024-07-28T18:16:10.088 DEBUG:teuthology.orchestra.run.smithi064:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph config set mgr mgr/volumes/snapshot_clone_no_wait False
...
2024-07-28T18:16:18.694 DEBUG:teuthology.orchestra.run.smithi064:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph fs subvolume snapshot info cephfs subvol79370 subvol_snap40980

This issue was seen during testing of PR to which this commit belongs.

This commit has been separated from the commit that adds tests for clone
progress reporting so that it's easy to document need for this code
patch and also track it.

This commit is not being moved to a different PR and been kept on the
same PR since it can't be reproduced otherwise. This also ensures that
commit is backported to older release along with code that caused this
issue, causing no one to need to find this commit while backporting
effort.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-08-14 15:32:46 +05:30
Rishabh Dave
e0c85b86ae qa/cephfs: add tests for clone progress
Clone progress is shown to user through "ceph fs clone status" output
and through "ceph status" output. Test both these features.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-08-14 15:05:46 +05:30
Rishabh Dave
92aecabdb2 qa/cephfs: return size of data generated in _do_subvolume_io()
TestVolumesHelper._do_subvolume_io() is a helper method that allows
users to generate data for testing. mgr/vol code that reports progress
made by clone jobs depends on the value set for xattr rbytes. It takes
a bit of a time for rbytes to be set.

And, therefore, all tests in TestCloneProgressReporter needs to wait for
subvolume's rbytes xattr's value to be set to the actual amount of data
present on the subvolume before proceeding to actually testing.

So that this can be achieved make _do_subvolume_io() return size of the
data it has generated.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-08-14 15:05:46 +05:30
Rishabh Dave
db0e736d4a qa/cephfs: use safe_while in test_volumes.py
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-08-14 15:05:46 +05:30
Rishabh Dave
9f60848df4 qa/cephfs: add method to run and get shell cmd stdout
Add a helper method that accepts command arguments (along with rest of
paramters accepted by the method run_shell()) and return the stdout of
the command.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-08-14 15:05:46 +05:30
Rishabh Dave
10949bf68d qa/cephfs: improvements for helper methods for clone state
1. Let caller check for multiple states. It might happen that clone
   finishes while it is being cancelled, in such cases user might want
    to check for both.
2. Add a helper method to check if clone is in pending state and add a
   separate method to check if clone is in cancelled state.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-08-14 15:05:46 +05:30
Jos Collin
9a46c52387
qa: do the set/get attribute on the remote filesystem
Fixes: https://tracker.ceph.com/issues/67435
Signed-off-by: Jos Collin <jcollin@redhat.com>
2024-08-14 13:13:08 +05:30
Casey Bodley
f38429b857 qa/s3tests: configure tenant name for 's3 tenant' section
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2024-08-13 13:06:50 -04:00
Patrick Donnelly
8017ef589d
Merge PR #58532 into main
* refs/pull/58532/head:
	qa: improve and add commit ID to comment for context

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2024-08-12 12:11:16 -04:00
Nitzan Mordechai
2e0e469195 mon/thrasher: set stopping
stopping was set by PR #58282 to bool instead of set()

Fixes: https://tracker.ceph.com/issues/67496
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-08-12 11:44:52 +00:00
Jos Collin
db65867f77
qa: add test_cephfs_mirror_remote_snap_corrupt_fails_synced_snapshot
Fixes: https://tracker.ceph.com/issues/65226
Signed-off-by: Jos Collin <jcollin@redhat.com>
2024-08-12 11:48:04 +05:30
Guillaume Abrioux
1924efd8db
Merge pull request #57037 from adk3798/cephadm-limit-shell-mounts
cephadm: limit mounts for shell and ceph-volume commands
2024-08-08 21:12:38 +02:00
Venky Shankar
052cb8deb1
Merge pull request #58583 from lxbsz/wip-66946
qa/cephfs: fix TestRenameCommand and unmount the clinet before failin…

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-08-08 17:55:24 +05:30
Kamoltat (Junior) Sirivadhna
6a0d503a59
Merge pull request #56233 from kamoltat/wip-ksirivad-fix-64802
RADOS: Generalize stretch mode pg temp handling to be usable without stretch mode
Samuel Just <sjust@redhat.com>
2024-08-07 09:45:54 -04:00
Xiubo Li
ae3faac004 qa/cephfs: fix TestRenameCommand and unmount the clinet before failing fs
We should else bring and wait for MDS to be up since it is needed
for unmounting of CephFS in CephFSTestCase.tearDown() to be successful,
or just unmount the mountpoints before failing the filesystem.

Since the mountpoint won't be used in later tests so we just unmount
it.

Fixes: https://tracker.ceph.com/issues/66946
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2024-08-07 13:23:07 +08:00
Venky Shankar
c9ab079494
Merge pull request #56782 from joscollin/wip-B65171-support-replication-start-end-notifications
cephfs_mirror: provide metrics for last successful snapshot sync

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-08-07 09:51:36 +05:30
Venky Shankar
92ad0c83aa
Merge pull request #57458 from lxbsz/wip-session-evict
qa/cephfs: add test_session_evict_non_blocklisted test case

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-08-07 09:50:00 +05:30
Nitzan Mordechai
a035b5a22f thrashers: standardize stop and join method names
Thrashers that do not inherit from ThrasherGreenlet previously used a
method called do_join, which combined stop and join functionality. To
ensure consistency and clarity, we want all thrashers to use separate
stop, join, and stop_and_join methods.

This commit renames methods and implements missing stop and stop_and_join
methods in thrashers that did not inherit from ThrasherGreenlet.

Fixes: https://tracker.ceph.com/issues/66698
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-08-06 06:57:40 +00:00
Nitzan Mordechai
21b4b89e52 qa/tasks: watchdog terminate thrasher
If a thrasher exception occurs, the do_dump_ops thread will continue
looping until the Teuthology timeout is reached.
The watchdog should terminate the thrasher to free up resources.

Fixes: https://tracker.ceph.com/issues/66698
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-08-06 06:56:00 +00:00
Ernesto Puerta
6c7253be6f
qa/mgr/dashboard: fix test race condition
Fixes: https://tracker.ceph.com/issues/66844

Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
2024-08-05 18:48:14 +02:00
kyr
01a9f08544
Merge pull request #58373 from ceph/wip-tasks-ceph-mkfs-has-no-f
qa: ceph.py mkfs.ext4 has no -f
2024-08-05 17:44:14 +02:00
Adam King
8164efe5d0 qa/cephadm: use cephadm ceph-volume directly to zap devs
This was running ceph-volume through the
cephadm shell previously, but as we are trying
to remove mount points from cephadm shell, this
no longer works (specifically without the /dev mount)

Signed-off-by: Adam King <adking@redhat.com>
2024-08-02 12:32:51 -04:00
Venky Shankar
f4b5465815 Merge PR #51332 into main
* refs/pull/51332/head:
	qa: add test for ceph tell with unknown cephtype
	pybind/ceph_argparse: fixing error message for ceph tell command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2024-08-01 12:19:52 +05:30
Yuri Weinstein
1a2a617edd
Merge pull request #54671 from baum/ceph-nvmeof-mon
mon: add NVMe-oF gateway monitor and HA

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2024-07-31 09:05:41 -07:00
Vallari Agrawal
d0c4182bf5 qa/suites/rbd/nvmeof: add multi-subsystem setup and thrash test
1. qa/tasks/nvmeof.py:
    1.1. create multiple rbd images for all subsystems
    1.2. add NvmeofThrasher and ThrashTest
2. qa/tasks/mon_thrash.py: add 'switch_thrashers' option
3. nvmeof_setup_subsystem.sh: create multiple subsystems and enable HA
4. Restructure qa/suites/rbd/nvmeof: Create two sub-suites
   - "basic" (nvmeof_initiator job)
   - "thrash" (new: nvmeof_mon_thrash and nvmeof_thrash jobs)

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
2024-07-31 08:52:05 +00:00
Kotresh HR
983f893fb9 qa: Add mds caps test for testing fs read and a path rw
Fixes: https://tracker.ceph.com/issues/67212
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2024-07-30 23:18:22 +05:30
Jos Collin
5535e72364
qa: check last successful snapshot sync metrics in perf counters
Fixes: https://tracker.ceph.com/issues/65171
Signed-off-by: Jos Collin <jcollin@redhat.com>
2024-07-30 11:01:07 +05:30
Patrick Donnelly
cdef41d908
qa: generate valgrind suppressions
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-07-23 11:18:54 -04:00
Laura Flores
39a09a3590
Merge pull request #58275 from NitzanMordhai/wip-nitzn-host-thraser-fix-min-in-checks
suites: host thrasher should check min_in before thrashing host
2024-07-22 13:22:30 -05:00
Laura Flores
24ae2c9a0f
Merge pull request #58393 from NitzanMordhai/wip-nitzan-old-client-thrasher-slow-ops-ignorelist
suites: thrash old client missing SLOW_OPS in the ignore-list
2024-07-22 13:19:12 -05:00
Kamoltat
ed7f4e8829 qa: Added mon connection score tests
Basically when we deploy a 3 MONS

Check if the connection scores are clean
with a 60 seconds grace period

Fixes: https://tracker.ceph.com/issues/65695

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2024-07-17 22:26:55 +00:00
Kamoltat
7b41aff3f0 qa/suites/rados: 3-az-stretch-cluster-netsplit test
Test the case where 2 DC loses connection with each other
for a 3 AZ stretch cluster with stretch pool enabled.
Check if cluster is accessible and PGs are active+clean
after reconnected.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2024-07-17 22:16:01 +00:00
Kamoltat
4ca1320727 qa/suites/rados/singleton/all: init mon-stretch-pool.yaml
Test the following new Ceph CLI commands:

`ceph osd pool stretch set`
`ceph osd pool stretch unset`
`ceph osd pool stretch show`

`qa/workunits/mon/mon-stretch-pool.sh`

will create the stretch cluster
while performing input validation for the CLI
Commands mentioned above.

`qa/tasks/stretch_cluster.py`

is in charge of
setting a pool to stretch cluster
and checks whether it prevents PGs
from the going active when there is not
enough buckets available in the acting
set of PGs to go active.

Also, test different MON fail over scenarios
after setting pool as stretch

`qa/suites/rados/singleton/all/mon-stretch-pool.yaml`

brings the scripts together.

Fixes: https://tracker.ceph.com/issues/64802

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2024-07-17 22:12:04 +00:00
Yuri Weinstein
bc2bb38657
Merge pull request #57906 from kamoltat/wip-ksirivad-stretch-mode-netsplit-test
mon/ElectionLogic: tie-breaker mon ignore proposal from marked down mon

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2024-07-16 15:08:20 -07:00
Kamoltat (Junior) Sirivadhna
69e7b4dd4e
Merge pull request #58374 from kamoltat/wip-ksirivad-fix-65824
qa/suites/rados/thrash-old-clients: Add noscrub, nodeep-scrub to ignorelist
Reviewed by: Laura Flores <lflores@ibm.com>
2024-07-15 10:18:23 -04:00
neeraj pratap singh
decf32e823 qa: add test for ceph tell with unknown cephtype
Fixes: https://tracker.ceph.com/issues/59624
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2024-07-15 17:28:30 +05:30
Venky Shankar
2ab14159a6
Merge pull request #49974 from neesingh-rh/wip-58619
mds: fix session/client evict command.

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-07-15 15:55:20 +05:30
Venky Shankar
62eb72731a
Merge pull request #56193 from joscollin/wip-B64927-test_cephfs_mirror_blocklist-fail
cephfs_mirror, qa: fix mirror daemon doesn't restart when blocklisted or failed

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-07-15 15:25:19 +05:30
Rishabh Dave
3f4aee27ee
Merge pull request #58311 from rishabh-d-dave/fs-auth-nonalpha-test
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2024-07-12 16:23:47 +05:30
Nizamudeen A
67afcf6ce7
Merge pull request #58386 from YiteGu/dashboard-rename-api
mgr/dashboard: add cephfs rename REST API

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2024-07-12 12:54:47 +05:30
Rishabh Dave
dc4bd3ea08 qa: improve and add commit ID to comment for context
The comment was unclear without looking at previous version of code.
Therefore improve comment a bit and add link to commit ID due to which
this comment was introduced to give future readers context.

Also, place the comment before the command arguments. This too adds the
context.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-07-12 10:27:46 +05:30
Rishabh Dave
c6e2c97c6e cephfs: disallow removing root_squash via "fs authorize" cmd
Removing root_squasn from MDS auth caps through "fs authorize" command
should not be allowed as this command it not allowed to/meant for
removing caps.

Fixes: https://tracker.ceph.com/issues/65808
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-07-12 09:59:45 +05:30
Rishabh Dave
2130ec8ebc qa/cephfs: fix test_single_path_authorize_on_nonalphanumeric_fsname
This test deletes the CephFS already present on the cluster at the very
beginning and unmounts the first client beforehand. But it leaves the
second client mounted on this deleted CephFS that doesn't exist for the
rest of the test. And then at the very end of this test it attempts to
remount the second client (during tearDown()) which hangs and causes
test runner to crash.

Unmount the second client beforehand to prevent the bug and delete
mount_b object to avoid confusion for the readers in future about
whether or not 2nd mountpoint exists.

Fixes: https://tracker.ceph.com/issues/66077
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2024-07-11 22:08:06 +05:30
Ilya Dryomov
f92b4f5359
Merge pull request #58507 from idryomov/wip-66886
qa: account for rbd_trash object in krbd_data_pool.sh + related ceph{,adm} task fixes

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: N Balachandran <nibalach@redhat.com>
2024-07-11 14:03:16 +02:00
Rishabh Dave
384acdeb47
Merge pull request #57492 from rishabh-d-dave/qa-fs-mds-fail-improve
qa/cephfs: improvements for "mds fail" and "fs fail"

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2024-07-11 16:37:16 +05:30
Nizamudeen A
2038b3e326
Merge pull request #58224 from Serrini/fix-66490-main
mgr/dashboard: add restful api for creating crush rule with type of 'erasure'

Reviewed-by: Nizamudeen A <nia@redhat.com>
2024-07-11 11:49:30 +05:30
Ilya Dryomov
26cee25873 qa/tasks/cephadm: drop --cluster and sudo in create_rbd_pool()
The cluster (name) is already specified in the arguments passed to
_shell() and this command doesn't need privileges.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-07-10 12:44:12 +02:00
Ilya Dryomov
0bda782952 qa/tasks/cephadm: don't wait for OSDs in create_rbd_pool()
This fails because teuthology.wait_until_osds_up() wants to use
adjust-ulimits wrapper which isn't available in "cephadm shell"
environment.  The whole thing is also redundant because cephadm task
is supposed to wait for OSDs to come up earlier, in ceph_osds().

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-07-10 12:33:45 +02:00
Ilya Dryomov
fcc440da9a qa/tasks: initialize rbd pool instead of just tagging it
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-07-10 12:33:45 +02:00
sunlan
3a7f8e5e37 mgr/dashboard: add restful api for creating crush rule with type of 'erasure'
Fixes: https://tracker.ceph.com/issues/66490
Signed-off-by: sunlan <sunlan@asiainfo.com>
2024-07-10 14:44:07 +08:00
Ilya Dryomov
8b5e7cc5b5 qa/tasks/ceph: fix "ceph mgr module enable" command
This was typoed in commit 9d485ae1f4 ("qa/tasks/ceph: provide
configuration for setting configs via mon") and went unnoticed likely
because 3-snaps/yes.yaml in fs:workload is the only user of the new
mgr-modules stanza so far and fs:workload suite runs exclusively on
cephadm.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2024-07-09 23:16:37 +02:00
Venky Shankar
69704e91bf Merge PR #53301 into main
* refs/pull/53301/head:
	qa: adding test for preventing scrub when mds is inactive
	mds: prevent scrub start for standby-replay MDS

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
2024-07-09 11:08:26 +05:30
neeraj pratap singh
b9a2d0571f qa: adding test for preventing scrub when mds is inactive
Fixes: https://tracker.ceph.com/issues/62537
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2024-07-08 15:33:54 +05:30
Jos Collin
a9a56919ff
qa: Wait for mirror daemon restart before getting new rados_inst
After blocklisted/failed, wait for the mirror daemon restart
which is after 30 seconds timeout and then check for the new rados_inst.

Fixes: https://tracker.ceph.com/issues/64927
Signed-off-by: Jos Collin <jcollin@redhat.com>
2024-07-05 10:14:20 +05:30
yite.gu
bac2689fdd mgr/dashboard: add cephfs rename REST API
Introduce rename for the cephfs REST API controller, we can
rename the existing file or directory by it.

Fixes: https://tracker.ceph.com/issues/66797

Signed-off-by: Yite Gu <yitegu0@gmail.com>
2024-07-04 23:50:16 +08:00
Ali Masarwa
5d6db362d3 RGW|BN : change endpoint for http failing tests
Signed-off-by: Ali Masarwa <amasarwa@redhat.com>
2024-07-04 10:06:26 +03:00
Patrick Donnelly
61381768ba
qa: add test for cephx path check on unlinked snapped dir tree
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-07-03 22:18:09 -04:00
Nitzan Mordechai
3ac8b61aaa suites: thrash old client missing SLOW_OPS in the ignore-list
The current ignore-list have \(SLOW_OPS\)  but missing SLOW_OPS

Fixes: https://tracker.ceph.com/issues/66604
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-07-03 06:56:05 +00:00
Sridhar Seshasayee
9265bdb1da qa/tasks: Initialize 'monitoring_profiles' spec to an empty dict
Initialize 'monitoring_profiles' to an empty python dictionary instead of
'None' to prevent the cbt task from failing due to the TypeError exception
when attempting to iterate a 'NoneType'.

The bug was introduced as part of https://github.com/ceph/ceph/pull/51438
and commit e174c6e2cf.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
Fixes: https://tracker.ceph.com/issues/66799
2024-07-02 13:17:27 +05:30
Kamoltat
3a401d7a55 qa/suites/rados/thrash-old-clients: Add noscrub, nodeep-scrub to ignorelist
Fixes: https://tracker.ceph.com/issues/65824

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2024-07-01 15:24:42 +00:00
Kyr Shatskyy
4e54a9034c qa: ceph.py mkfs.ext4 has no -f
Addresses probable issue:

  mkfs.ext4: invalid option -- 'f'

Fixes: https://tracker.ceph.com/issues/10797

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@gmail.com>
2024-07-01 14:28:03 +02:00
Guillaume Abrioux
cbc6d66d94
Merge pull request #58290 from guits/fix_nvme_loop
tests: make sure nvmetcli and nvme-cli are up to date
2024-06-28 12:25:32 +02:00
Venky Shankar
dbc9816d2e Merge PR #57619 into main
* refs/pull/57619/head:
	qa/cephfs: use wait_for_daemon() instead of sleep()-ing
	qa/cephfs: mark file system joinable for fs rename tests before unmounting clients

Reviewed-by: Rishabh Dave <ridave@redhat.com>
2024-06-27 22:04:37 +05:30
Venky Shankar
cac7dcd634 Merge PR #53755 into main
* refs/pull/53755/head:
	PendingReleaseNotes: add note about CephFS set_vxattrs
	doc/cephfs: Update docs to match remove functionality and respective vxattrs
	qa: Add test coverage for vxattr behavior
	qa: Add removexattr to support setfattr removal.
	mds: Implement remove for ceph vxattrs

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
2024-06-27 19:58:50 +05:30
Guillaume Abrioux
d707c41eac tests: make sure nvmetcli and nvme-cli are up to date
Otherwise jobs end up with the following failure:

```
2024-06-25T14:22:18.659 INFO:teuthology.orchestra.run.smithi098.stderr:Failed to write to /dev/nvme-fabrics: Invalid argument
```

Also, the output of nvme list has changed so we have to update
qa/tasks/nvme_loop.py accordingly.

Fixes: https://tracker.ceph.com/issues/66707

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2024-06-27 15:43:26 +02:00
Kamoltat
7a90d9d852 qa/suites/netsplit/tests/2-dc-1-arbiter-stretch-cluster-netsplit.yaml
Test netsplit between 2 datacenters
in a stretch mode cluster.

Observe if:

- PGs are active
- Cluster is accessible
- Writes and Reads went through

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2024-06-26 15:22:07 +00:00
Nitzan Mordechai
66b42c3f5c suites: host thrasher should check min_in before thrashing host
We need to check if taking host out will cause the total in osds
to be less then min_in

Fixes: https://tracker.ceph.com/issues/66657
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2024-06-26 12:16:48 +00:00
Patrick Donnelly
c8053b11e7
Merge PR #53503 into main
* refs/pull/53503/head:
	qa: add tests for `mds last-seen` command
	doc/cephfs: add documentation for `mds last-seen`
	PendingReleaseNotes: add note on last-seen command
	mon/MDSMonitor: add command to lookup when mds was last seen
	mon/MDSMonitor: set birth time on FSMap during encode
	pybind/mgr/dashboard: show context diff for openapi check

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-06-25 12:27:28 -04:00
Yuri Weinstein
eebdee97e6
Merge pull request #57401 from kamoltat/wip-ksirivad-fix-65826
qa/tasks/mgr/test_progress.py: deal with pre-exisiting pool

Reviewed-by: Laura Flores <lflores@redhat.com>
2024-06-25 06:44:44 -07:00
Nizamudeen A
2856be9bd9
Merge pull request #57930 from rhcs-dashboard/get-replication-info
mgr/dashboard: get and update bucket replication policies

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
2024-06-25 13:13:05 +05:30
Venky Shankar
d17c681296 Merge PR #56052 into main
* refs/pull/56052/head:
	qa/suites: ignore unresponsive client when the test passes
	qa: enhance per-client labelled perf counters test

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-06-25 12:40:25 +05:30
Venky Shankar
0627148110 Merge PR #57034 into main
* refs/pull/57034/head:
	qa: cleanup snapshots before subvolume delete

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-06-25 12:37:56 +05:30
Nizamudeen A
013d0826c1 mgr/dashboard: select default daemon based on the default zonegroup
if multisite is configured, the default daemon needs to be selected
based on the default zonegroup. Otherwise dashboard gives you incorrect
details when doing the period commit

The issue occurs when you do a period update --commit and you reload one
of the block page, the api assigns the zonegroup of the second gateway
because for a moment, the first gateway reflects the period changes...
This is not true because the default zonegroup is of the previous active
gateway but even though the back-end correctly says the active
zonegroup, the dashboard api says it wrongly.

Fixes: https://tracker.ceph.com/issues/66394
Signed-off-by: Nizamudeen A <nia@redhat.com>
2024-06-21 16:45:50 +05:30
Patrick Donnelly
edc584a533
qa: add tests for mds last-seen command
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-06-20 21:32:56 -04:00
Patrick Donnelly
7637b199eb
mon/MDSMonitor: set birth time on FSMap during encode
So we can begin to answer questions like: when did we last see an MDS?

Fixes: https://tracker.ceph.com/issues/62849
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-06-20 21:32:56 -04:00
Ilya Dryomov
daddc9cf16
Merge pull request #58055 from zmc/qemu-task-version
qa/tasks/qemu: Fix OS version comparison

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2024-06-20 17:05:11 +02:00
Patrick Donnelly
959f007d83
Merge PR #55792 into main
* refs/pull/55792/head:
	tools/cephfs: recover alternate_name of dentries from journal
	qa: add test to verify recovery of alternate_name from journal
	tools/cephfs/JournalTool: add some more debugging
	tools/cephfs/JournalTool: remove extraneous 0x in debug output
	mds: dump alternate_name to formatter
	mds: add warning about encoding new fields

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2024-06-19 15:19:56 -04:00