9314 Commits

Author SHA1 Message Date
Rishabh Dave
1423f959e8 qa/cephfs: set omit_sudo to False in test_acls.py
When running "sudo ./check generic/099" in test_acls.py's test method
named test_acls(), set omit_sudo to False because without it
vstart_runner.py will remove "sudo" from command arguments and so the
command will fail unnecessarily.

Fixes: https://tracker.ceph.com/issues/55374
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-27 14:43:41 +05:30
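The effect of omit_sudo described in 1423f959e8 can be sketched roughly as below. This is a minimal illustration, not vstart_runner.py's actual code: `prepare_args` is a hypothetical helper, and the default of `omit_sudo=True` is an assumption inferred from the commit message.

```python
from typing import List


def prepare_args(args: List[str], omit_sudo: bool = True) -> List[str]:
    """Hypothetical stand-in for how a local test runner might preprocess a command.

    When omit_sudo is true, every "sudo" token is dropped from the argument
    list; otherwise the arguments are returned unchanged.
    """
    if omit_sudo:
        return [arg for arg in args if arg != "sudo"]
    return list(args)


# With the (assumed) default, "sudo" is stripped and a command that needs
# root would fail; passing omit_sudo=False keeps it in place.
stripped = prepare_args(["sudo", "./check", "generic/099"])
kept = prepare_args(["sudo", "./check", "generic/099"], omit_sudo=False)
```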
Rishabh Dave
b60b73c0fe qa/cephfs: just import getLogger instead of whole module
Import just the necessary object instead of importing entire module.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-27 14:43:41 +05:30
Adam King
9268e32047
Merge pull request #45899 from rkachach/fix_issue_53527
mgr/cephadm: fixing upgrade candidate versions listing

Reviewed-by: Adam King <adking@redhat.com>
2022-04-26 15:09:37 -04:00
Neha Ojha
cf78cf4980
Merge pull request #46010 from gregsfortytwo/wip-55419-blocklist-test
test: Fix a blocklist API test

Reviewed-by: Laura Flores <lflores@redhat.com>
2022-04-26 10:59:07 -07:00
Venky Shankar
76b33670b5
Merge pull request #44958 from rishabh-d-dave/qa-cephfs-timeout
qa/cephfs: change default timeout from 900 secs to 300

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-26 22:30:43 +05:30
Rishabh Dave
ac5de474cf qa/cephfs: change default timeout from 900 secs to 300
15 minutes is unnecessarily large as a default value for timeout for a
command. Not having to wait unnecessarily on a crash of a command will
reduce teuthology's testing queue and will save individual developer's
time while running tests locally.

Whatever lines are modified for this purpose are also modified to follow
the style guideline, specifically wrapping at 80 characters.

Fixes: https://tracker.ceph.com/issues/54236
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-26 18:34:42 +05:30
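The reasoning in ac5de474cf (a hung or crashed command should not hold up a run for 15 minutes) can be illustrated with a small wrapper. This is only a sketch: `run_with_timeout` and `DEFAULT_TIMEOUT` are hypothetical names, and the QA framework's own command runner is not shown here.

```python
import subprocess
import sys

# Seconds; mirrors the new default named in the commit (previously 900).
DEFAULT_TIMEOUT = 300


def run_with_timeout(args, timeout=DEFAULT_TIMEOUT):
    """Run a command and return its exit status, or None if it timed out.

    subprocess.run() kills the child and raises TimeoutExpired once the
    timeout elapses, so a stuck command costs at most `timeout` seconds.
    """
    try:
        return subprocess.run(args, timeout=timeout).returncode
    except subprocess.TimeoutExpired:
        return None
```

A caller that knows a command is slow can still pass a larger `timeout` explicitly; only the default changes.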
Samuel Just
f6a9ee72d7 bluestore: Revert "os/bluestore: Add CoDel to BlueStore for Bufferbloat mitigation"
Merge caused: https://tracker.ceph.com/issues/55433

Signed-off-by: Samuel Just <sjust@redhat.com>
2022-04-25 12:33:45 -07:00
Greg Farnum
63db714890 test: use the same address input format as output will generate
Otherwise, our grep fails!

Fixes: https://tracker.ceph.com/issues/55419

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2022-04-23 00:00:59 +00:00
Samuel Just
78ffeb71a0
Merge pull request #43413 from esmaeil-mirvakili/bluestore-bufferbloat-mitigation
os/bluestore: Add CoDel to BlueStore for Bufferbloat mitigation

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
2022-04-22 14:37:04 -07:00
Redouane Kachach
671442c4b4
mgr/cephadm: fixing upgrade candidate versions listing
Fixes: https://tracker.ceph.com/issues/53527

Signed-off-by: Redouane Kachach <rkachach@redhat.com>
2022-04-22 15:54:57 +02:00
Adam King
5720cca2f8
Merge pull request #45920 from adk3798/stop-mgr-nfs-upgrade-fail
qa/suites/orch/cephadm: stop upgrade tests if failures are seen

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2022-04-22 09:28:35 -04:00
Rishabh Dave
8ca2700976 cephfs-shell: exit with non-zero on getting unrecognized cmd
Right now cephfs-shell, on receiving an unrecognized command, prints an
appropriate message on stderr but returns zero. This is a serious
problem for users as well as for tests. It must exit with a non-zero
return value.

The return value chosen for this case is 127, same as bash.

Changes in test_cephfs_shell.py, besides the addition of TestGeneric,
fix tests that were buggy; those bugs change the tests' behaviour now
that the cephfs-shell bug has been fixed.

Fixes: https://tracker.ceph.com/issues/55399
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-22 11:34:41 +05:30
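The behaviour described in 8ca2700976 (report the unknown command on stderr and return 127, as bash does) can be sketched as follows. This is not cephfs-shell's implementation; `COMMANDS` and `dispatch` are hypothetical names used only for illustration.

```python
import sys

# Hypothetical command table: each handler takes the remaining arguments
# and returns an exit status.
COMMANDS = {
    "ls": lambda argv: 0,
}

# Exit status bash uses for "command not found".
COMMAND_NOT_FOUND = 127


def dispatch(argv):
    """Run the named command, or return 127 if it is not recognized."""
    name = argv[0] if argv else ""
    handler = COMMANDS.get(name)
    if handler is None:
        print(f"{name}: command not found", file=sys.stderr)
        return COMMAND_NOT_FOUND
    return handler(argv[1:])
```

A script entry point would then end with `sys.exit(dispatch(sys.argv[1:]))`, so that callers and tests can rely on the return value rather than parsing stderr.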
Gregory Farnum
6536d0c1af
Merge pull request #44151 from gregsfortytwo/wip-cidr-blocklist
Implement CIDR blocklisting

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2022-04-21 08:31:16 -07:00
Venky Shankar
91dcd8bda1
Merge pull request #45971 from rishabh-d-dave/cephfs-shell-exclam-marks
qa/cephfs: bug fix and some cleanup for test_cephfs_shell.py

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-21 18:52:20 +05:30
Rishabh Dave
a4bbb62bd6 qa/cephfs: cleanup some code in test_cephfs_shell.py
Also delete a comment since it's incorrect.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-21 16:47:39 +05:30
Rishabh Dave
faa506798d qa/cephfs: don't use exclam mark in tests in test_cephfs_shell.py
Exclamation mark is a special character for bash as well as
cephfs-shell. For bash, it substitutes current command with matching
command from command history and for cephfs-shell it runs the command
as OS-level command and not inside the cephfs-shell.

And every command executed in tests (say "ls") is run by passing it as a
parameter to cephfs-shell command (that is "cephfs-shell -c <conf> --
ls"). So, exclamation mark, when used in tests, is consumed by bash
instead of cephfs-shell.

To avoid these complications it's best (and even simpler!) to issue the
command meant for bash on bash without going through cephfs-shell.

Fixes: https://tracker.ceph.com/issues/55394
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-21 16:40:29 +05:30
Xiubo Li
cd3e903b0c qa: add file/filesystem sync crash test case
This is one test case for the possible kernel crash bug when doing
the file sync or filesystem sync.

Fixes: https://tracker.ceph.com/issues/55329
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-04-21 15:08:11 +08:00
Xiubo Li
3db3b4e2a4 qa: add file sync stuck test support
This will test the file sync of a directory, which may be stuck for
at most 5 seconds. This was because the related code will wait for
all the unsafe requests to get safe reply from MDSes, but the MDSes
just think that it's unnecessary to flush the mdlog immediately
after early reply, and the mdlog will be flushed every 5 seconds
in the tick thread.

This should have been fixed in kclient and libcephfs by triggering
mdlog flush before waiting requests' safe reply.

Fixes: https://tracker.ceph.com/issues/55283
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-04-21 15:07:43 +08:00
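The kind of check described in 3db3b4e2a4 can be approximated by timing an fsync on a directory. The sketch below is an assumption about the shape of such a test, not the test added by the commit; `timed_dir_fsync` is a hypothetical helper, and it relies on the platform allowing fsync on a directory file descriptor (as Linux does).

```python
import os
import tempfile
import time


def timed_dir_fsync(path):
    """Return how long, in seconds, an fsync of the directory at `path` takes."""
    fd = os.open(path, os.O_RDONLY)
    try:
        start = time.monotonic()
        os.fsync(fd)
        return time.monotonic() - start
    finally:
        os.close(fd)


def create_and_sync(directory, name="probe"):
    """Create a file in `directory`, then time the fsync of the directory."""
    with open(os.path.join(directory, name), "w") as f:
        f.write("data")
    return timed_dir_fsync(directory)
```

On a mount affected by the issue, the returned duration would approach the 5 second mdlog flush interval mentioned in the commit message; a test could fail if it exceeds a chosen threshold.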
Xiubo Li
b6fc5480f6 qa: add filesystem sync stuck test support
This will test the sync of the filesystem, which may be stuck for
at most 5 seconds. This was because the related code will wait
for all the unsafe requests to get safe reply from MDSes, but the
MDSes just think that it's unnecessary to flush the mdlog immediately
after early reply, and the mdlog will be flushed every 5 seconds
in the tick thread.

This should have been fixed in kclient and libcephfs by triggering
mdlog flush before waiting requests' safe reply.

Fixes: https://tracker.ceph.com/issues/55283
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-04-21 15:07:43 +08:00
Ionut Balutoiu
d8113881a2 qa/crontab/teuthology-cronjobs: add windows suite
Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
2022-04-20 19:56:23 +03:00
Marcus Watts
20cb9805c9 qa/rgw - run sse-s3 test cases by default
Signed-off-by: Marcus Watts <mwatts@redhat.com>
2022-04-19 17:35:40 -04:00
Casey Bodley
065b9aad86 qa/rgw: run sse-s3 test cases conditionally
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 4e644832b0)
2022-04-19 17:35:39 -04:00
Casey Bodley
e346a75399 qa/rgw: configure sse-s3 in vault_transit.yaml
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 1b679e0828)
2022-04-19 17:35:39 -04:00
Ionut Balutoiu
8054199787 qa: add basic Ceph on Windows integration test
Due to the lack of Windows support in Teuthology, the test case adopts
the following workaround:

* Deploy baremetal machine with `ubuntu_latest.yaml` and
  configure it with libvirt KVM.
* Create a libvirt VM and provision it with Windows Server 2019, using
  the official ISO from Microsoft.
* Configure SSH in the Windows VM, and run the tests remotely via SSH.

The implementation of the test case consists of workunit scripts.

`qa/workunits/windows/test_rbd_wnbd.py` is the main Python script
to test Ceph on Windows basic functionality. This is executed in the
libvirt VM configured with Windows Server 2019.

Co-authored-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Co-authored-by: Daniel Vincze <dvincze@cloudbasesolutions.com>

Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
2022-04-19 17:21:52 +03:00
Venky Shankar
3f13df8388
Merge pull request #45603 from nmshelke/feature-54472
mgr/volumes: set, get, list and remove metadata of subvolume

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-19 16:52:13 +05:30
Neeraj Pratap Singh
5aa304e128 qa: test chmod is allowed above 0777.
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2022-04-19 16:51:23 +05:30
Venky Shankar
73e806b7da
Merge pull request #43879 from rishabh-d-dave/vr-clearer-msg
qa/vstart_runner: print "success" explicitly

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-18 15:27:52 +05:30
Venky Shankar
4070f734ee
Merge pull request #45924 from rishabh-d-dave/qa-cephfs-test_cephfs_shell
qa/cephfs: expand note for test_cephfs_shell.py

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-18 15:02:55 +05:30
Rishabh Dave
bc02c3c7e9 qa/cephfs: expand note for test_cephfs_shell.py
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-18 13:15:10 +05:30
Rishabh Dave
902420cbd4 qa/vstart_runner: print "success" explicitly
This makes it effortless for new vstart_runner.py users to spot that the
test passed.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-04-18 10:39:08 +05:30
Venky Shankar
1ac01c60df
Merge pull request #45816 from kotreshhr/clone-failure-status
mgr/volumes: Show clone failure reason in clone status command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-18 09:43:40 +05:30
Adam King
017aa9cfe8 qa/suites/orch/cephadm: stop upgrade tests if failures are seen
Otherwise the tests may run forever. This was already done for
mds upgrade sequence, just adding it in the other two places here

Related to: https://tracker.ceph.com/issues/53939

Signed-off-by: Adam King <adking@redhat.com>
2022-04-17 12:21:44 -04:00
Adam King
5360de431d
Merge pull request #45765 from m-ildefons/1196785-cephadm-status-trace
cephadm: avoid crashing on expected non-zero exit

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
2022-04-15 11:05:05 -04:00
Venky Shankar
b4ea90ad82
Merge pull request #45549 from nmshelke/fuse-linux-only
ceph-fuse: restrict already_fuse_mounted function only for linux

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-14 17:38:20 +05:30
Venky Shankar
8316aa95fc
Merge pull request #45636 from joscollin/wip-B54971-rank0-stale-perf-stats-assertion-error
qa: make test_perf_stats_stale_metrics check only the clients created for the tests

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-14 17:36:42 +05:30
Venky Shankar
1380b565b8
Merge pull request #45683 from kotreshhr/subvolume-retainsnap-rm-fix
mgr/volumes: Fix idempotent subvolume rm

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-04-14 17:31:46 +05:30
Moritz Röhrich
a02be6f22f
cephadm: avoid crashing on expected non-zero exit
- Avoid crashing when a call out to an external program expectedly does
  not return exit status zero.

There are programs that communicate other information than error/no
error through exit status. E.g. `systemctl status` will return different
exit codes depending on the actual status of the units in question.
In cases where this is expected crashing with a RuntimeError exception
is inappropriate and should be avoided.

Fixes: https://tracker.ceph.com/issues/55117
Signed-off-by: Moritz Röhrich <moritz.rohrich@suse.com>
2022-04-13 17:22:22 +02:00
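The idea in a02be6f22f (treat certain non-zero exit statuses as expected rather than raising) can be sketched like this. It is not cephadm's code; `call` and its `allowed_codes` parameter are hypothetical names chosen for the example.

```python
import subprocess
import sys


def call(args, allowed_codes=(0,)):
    """Run a command and return (stdout, returncode).

    Raises RuntimeError only when the exit status is not one the caller
    declared as expected, e.g. a status query that encodes state in its
    exit code.
    """
    proc = subprocess.run(args, capture_output=True, text=True)
    if proc.returncode not in allowed_codes:
        raise RuntimeError(
            f"{args!r} failed with exit status {proc.returncode}"
        )
    return proc.stdout, proc.returncode
```

A caller querying unit state could pass a wider `allowed_codes` tuple and branch on the returned code, while every other call site keeps the strict default.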
Kotresh HR
916a5981cf qa: Add test for clone failure status
Fixes: https://tracker.ceph.com/issues/55190
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-04-13 14:26:20 +05:30
Greg Farnum
1ca89ca76d test: check range blocklist in cephtool/test.sh
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2022-04-13 00:14:16 +00:00
Greg Farnum
8246892172 test: fix negative blocklist tests
These tests are supposed to be validating we don't accept invalid IPs,
but they left out the "add" subcommand so they're all failing on that!

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2022-04-13 00:14:16 +00:00
Ilya Dryomov
88db8db1b4
Merge pull request #45798 from adk3798/iscsi-only-pid-limit
cephadm: only apply unlimited pids-limit to iscsi and rgw

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2022-04-11 16:02:50 +02:00
Ilya Dryomov
a23b9c9938 qa: test_iscsi_pids_limit.sh: increase sleep time
It could take longer than 30 seconds to fork off 40000 processes on
a busy system.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-04-11 13:15:22 +02:00
Milind Changire
8d9bf97a58 qa: replace subvolume.sh with test_subvolume.py
qa/workunits/fs/misc/subvolume.sh is getting in the way of fs:workload
testing with subvolumes. Hence moved this script to a python test.

Signed-off-by: Milind Changire <mchangir@redhat.com>
2022-04-10 21:35:23 +05:30
Laura Flores
bd82d21943
Merge pull request #45749 from josephsawaya/fix-rook-tests
Remove orchestrator from rook task and suite
2022-04-08 16:36:38 -05:00
Casey Bodley
1ab04ddc38
Merge pull request #45304 from dvanders/cleanup_46456
rgw_reshard: drop olh entries with empty name

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-04-08 09:05:33 -04:00
Dan van der Ster
16511fd15f workunits/rgw: reshard cleans empty name entries
Check that reshard prunes olh entries with empty name.

Fixes: https://tracker.ceph.com/issues/54500
Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>
2022-04-07 18:12:06 +02:00
Milind Changire
f6c68c32aa qa: add test fragments to test subvols
Signed-off-by: Milind Changire <mchangir@redhat.com>
2022-04-07 14:18:06 +05:30
Milind Changire
bf83eaa4e7 qa: enhancement for subvol creation and mounting
Fixes: https://tracker.ceph.com/issues/54317
Signed-off-by: Milind Changire <mchangir@redhat.com>
2022-04-07 14:15:56 +05:30
Nikhilkumar Shelke
d1f4ecd557 qa: set, get, list and remove custom metadata for subvolume
The following tests are added:
1. Set custom metadata for subvolume.
2. Set custom metadata for subvolume (idempotency).
3. Get custom metadata for specified key.
4. Get custom metadata if the specified key does not exist (expecting error ENOENT).
5. Get custom metadata if no key-value pair has been added, meaning the section does not exist (expecting error ENOENT).
6. Update value for existing key in custom metadata.
7. List custom metadata of subvolume.
8. List custom metadata of subvolume if no key-value pair has been added (expecting empty JSON/dictionary).
9. Remove custom metadata for specified key.
10. Remove custom metadata if the specified key does not exist (expecting error ENOENT).
11. Remove custom metadata if no key-value pair has been added, meaning the section does not exist (expecting error ENOENT).
12. Remove custom metadata with --force option.
13. Remove custom metadata with --force option if the specified key does not exist (expecting the command to succeed because of the '--force' option).
14. Set and get custom metadata for legacy subvolume.
15. List and remove custom metadata from legacy subvolume.

Fixes: https://tracker.ceph.com/issues/54472
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-04-07 12:30:42 +05:30
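The semantics exercised by the tests in d1f4ecd557 can be summarized with a small in-memory model. This is only an illustration of the expected behaviour (ENOENT on a missing key, success with force); `SubvolumeMetadata` is a hypothetical class and has nothing to do with how mgr/volumes stores metadata.

```python
import errno


class SubvolumeMetadata:
    """Toy model of per-subvolume custom metadata."""

    def __init__(self):
        self._data = {}

    def set(self, key, value):
        # Setting the same key again simply overwrites it, so the call is
        # idempotent and also serves as "update".
        self._data[key] = value

    def get(self, key):
        if key not in self._data:
            raise OSError(errno.ENOENT, f"key '{key}' does not exist")
        return self._data[key]

    def ls(self):
        # An empty dictionary when nothing has been set.
        return dict(self._data)

    def rm(self, key, force=False):
        if key not in self._data:
            if force:
                return  # a forced removal of a missing key succeeds
            raise OSError(errno.ENOENT, f"key '{key}' does not exist")
        del self._data[key]
```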
David Galloway
fcd512b50b qa: Latest RHEL8 -> RHEL8.5
Signed-off-by: David Galloway <dgallowa@redhat.com>
2022-04-01 15:00:01 -04:00
Joseph Sawaya
043a330ed0 Remove orchestrator from rook task and suite
This commit removes orchestrator commands from the
Rook task and the Rook test suite because the Rook
orchestrator is not being maintained, and the Rook
orchestrator CLI is obsolete. This should also
clarify the issue:

https://tracker.ceph.com/issues/53680

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
2022-04-01 11:37:55 -04:00
Adam King
6e4dd0e082
Merge pull request #45361 from adk3798/mds-upgrade-test
qa/suites/fs: stop looping in mds upgrade test if upgrade failed

Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-03-30 18:56:45 -04:00
Neeraj Pratap Singh
9855e15de2 qa: test for verification of client metrics and metadata.
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2022-03-29 12:43:44 +05:30
Kotresh HR
15a2ab4e26 mgr/volumes: Fix idempotent subvolume rm
Deleting a subvolume that has already been deleted with the retain
snapshots option fails with an 'EAGAIN: clone in progress' error. After
subvolume deletion with retained snapshots, the subvolume exists until
the trash directory (which resides inside the subvolume) is cleaned up.
A subvolume deletion issued while the trash directory is not empty
should pass. This patch fixes that.

Credit: Issue discovery and fix suggestion to John Mulligan <jmulligan@redhat.com>
Fixes: https://tracker.ceph.com/issues/54625
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-03-29 10:57:47 +05:30
Nikhilkumar Shelke
23b3a27eaa ceph-fuse: restrict already_fuse_mounted function only for linux
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-03-28 13:25:09 +05:30
Sridhar Seshasayee
3aa2df2e0f qa/standalone: Fix test_activate_osd() test in ceph-helpers.sh
Modify test_activate_osd() to get the type of scheduler in use and then
verify the value of osd_max_backfills. This is because mclock scheduler
overrides this option to 1000 upon OSD initialization.

The test earlier used to pass because the OSD daemon was killed but not
marked down and upon being brought up, the wait for OSD up check was
passing quickly. But the OSD still didn't have the latest config values.

But now upon killing the OSD, the osd_fast_shutdown sequence notifies the
mon (see PR: https://github.com/ceph/ceph/pull/44807) and is marked down
and dead. Upon bringing it up, the wait for OSD up check takes a longer
time and this is sufficient for the config values to be updated. This
results in the correct values being read from the config 'Values' map.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2022-03-25 22:10:31 +05:30
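The check described in 3aa2df2e0f (look up the scheduler first, then decide which osd_max_backfills value to expect) can be sketched as below. The real test is a shell function in ceph-helpers.sh; this Python version is an illustration only, `expected_max_backfills` is a hypothetical helper, and the scheduler name "mclock_scheduler" is assumed to be the value reported for the mclock case.

```python
# Value the commit message says mclock sets on OSD initialization.
MCLOCK_MAX_BACKFILLS = 1000


def expected_max_backfills(scheduler: str, configured: int) -> int:
    """Return the osd_max_backfills value a test should expect to read back.

    With the mclock scheduler the configured value is overridden; with any
    other scheduler the configured value is what the OSD should report.
    """
    if scheduler == "mclock_scheduler":  # assumed scheduler name
        return MCLOCK_MAX_BACKFILLS
    return configured
```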
Jos Collin
1621308214
qa: make test_perf_stats_stale_metrics check only the clients created for the tests
Uses the client's global id to get the metrics, instead of using the index.
This ensures that test_perf_stats_stale_metrics checks only the clients mounted for
the tests.

Fixes: https://tracker.ceph.com/issues/54971
Signed-off-by: Jos Collin <jcollin@redhat.com>
2022-03-25 18:19:48 +05:30
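The change in 1621308214 (select metrics by client global id rather than by position) can be illustrated with plain dictionaries. The data layout here is an assumption made for the example and is not the output format of `ceph fs perf stats`; `metrics_for_clients` is a hypothetical helper.

```python
def metrics_for_clients(all_metrics, client_ids):
    """Return only the metrics belonging to the given client ids.

    `all_metrics` maps a client id to its metrics. Looking up by id keeps
    unrelated clients (ones not mounted by the test) out of the result,
    which a positional index cannot guarantee.
    """
    return {cid: all_metrics[cid] for cid in client_ids if cid in all_metrics}


# Hypothetical sample: client 4100 belongs to something other than the test.
sample = {4100: {"ops": 7}, 4205: {"ops": 1}, 4206: {"ops": 2}}
mine = metrics_for_clients(sample, [4205, 4206])
```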
Milind Changire
ac6c7240d3 qa: test snap_schedule with mgr restart
Scheduled snaps should follow the created schedule even across mgr
restart.

Signed-off-by: Milind Changire <mchangir@redhat.com>
2022-03-24 18:01:44 +05:30
Ilya Dryomov
f0ade57458 qa/suites/orch/cephadm: restrict test_iscsi_pids_limit to CentOS
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-03-22 21:42:41 +01:00
Teoman ONAY
de8b3c2676 cephadm: remove containers pids-limit
The default pids-limit (docker 4096/podman 2048) prevents some
customizations from working (http threads on RGW) or limits the number
of luns per iscsi target.

Fixes: https://tracker.ceph.com/issues/52898
Signed-off-by: Teoman ONAY <tonay@redhat.com>
2022-03-22 21:41:26 +01:00
Esmaeil Mirvakili
b4268f5e1a os/bluestore: SlowFastCoDel added to the BlueStore
Signed-off-by: Esmaeil Mirvakili <smirvaki@ucsc.edu>
2022-03-22 07:22:12 -07:00
Adam King
37019aad80 qa/suites/fs: stop looping in mds upgrade test if upgrade failed
Signed-off-by: Adam King <adking@redhat.com>
2022-03-22 08:05:31 -04:00
Venky Shankar
88ddf3350e
Merge pull request #44491 from nmshelke/fix-46075
ceph-fuse: ignore fuse mount failure if path is already mounted

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-03-21 18:39:35 +05:30
Ilya Dryomov
90a90ad47d qa/workunits/rbd/cli_generic.sh: relax trash purge schedule status assert
Commit 08df6e0fd0 ("qa/workunits/rbd: expand LevelSpec parsing
coverage") didn't account for images with a separate data pool.  This
was missed because of small-cache-pool.yaml breakage.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-03-19 14:54:00 +01:00
Yuri Weinstein
53df8bafed
Merge pull request #45392 from sseshasa/wip-snaptrim-qos-with-mclock
mon, osd: Add snaptrim stats to the existing PG stats.

Reviewed-by: Samuel Just <sjust@redhat.com>
2022-03-17 09:48:12 -07:00
Venky Shankar
69da8a88c0
Merge pull request #44795 from ajarr/dynamic-update-cephfs-nfs-export
mgr/nfs: allow dynamic update of cephfs nfs export

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-03-17 18:45:08 +05:30
Samuel Just
987465eea0
Merge pull request #45176 from liu-chunmei/crimson-teuthology
crimson: teuthology suite for seastore

Reviewed-by: Samuel Just <sjust@redhat.com>
2022-03-16 12:06:04 -07:00
Yuri Weinstein
531e0bb816
Merge pull request #45402 from ceph/wip-yuri-master-crontab
qa/tests: fixed typo

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2022-03-16 08:25:51 -07:00
Venky Shankar
8938a12542
Merge pull request #45290 from nmshelke/fix-54375
mgr/volumes: the 'mode' should honor idempotent subvolume creation

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-03-16 09:23:25 +05:30
Yuri Weinstein
600c1e18c9 qa/tests: fixed typo
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2022-03-15 16:00:20 -07:00
Ramana Raja
d7904e78d1 cephfs/test_nfs: Add test for dynamic update export
Add test to verify that the NFS servers don't restart when the
access type of a CephFS NFS export is updated.

And check the NFS servers are restarted when the pseudo path of
a CephFS NFS export is updated.

Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-03-15 18:27:10 -04:00
Josh Durgin
3c928d9ec3
Merge pull request #45380 from ceph/wip-yuri-master-crontab
qa/tests: added client upgrade suites to the crontab

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2022-03-15 14:22:54 -07:00
Sridhar Seshasayee
a86ead953d osd: Add snaptrim duration to pg dump stats.
Add the snaptrim duration to the json formatted output of the pg dump
stats. Define methods for a PG to set the snaptrim begin time and then to
calculate the total time spent to trim all the objects for the snaps in
the snap_trimq for the PG.

Tests:
  - Librados C and C++ API tests to verify the time spent for a snaptrim
    operation on a PG. These tests use the self-managed snaps APIs.
  - Standalone tests to verify snaptrim duration using rados pool snaps.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2022-03-16 00:33:24 +05:30
Sridhar Seshasayee
00249dc0cc mon, osd: Add objects trimmed to pg dump stats.
Add a new column, OBJECTS_TRIMMED, to the pg dump stats that shows the
number of objects trimmed when a snap is removed.

When a pg splits, the stats from the parent pg is copied to the child
pg. In such a case, reset objects_trimmed to 0 for the child pg
(see PeeringState::split_into()). Otherwise, this will result in incorrect
stats to be shown for a child pg after the split operation.

Tests:
 - Librados C and C++ API tests to verify the number of objects trimmed
   during snaptrim operation. These tests use the self-managed snaps APIs.
 - Standalone tests to verify objects trimmed using rados pool snaps.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2022-03-16 00:30:56 +05:30
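The split handling described in 00249dc0cc can be sketched with a simplified stats record. The actual logic lives in C++ (PeeringState::split_into()); the Python below is a toy model, `PGStats` and `child_stats_after_split` are hypothetical names, and copying every other field unchanged is a simplification.

```python
from dataclasses import dataclass, replace


@dataclass
class PGStats:
    num_objects: int = 0
    objects_trimmed: int = 0


def child_stats_after_split(parent: PGStats) -> PGStats:
    """Copy the parent's stats to a child PG, resetting objects_trimmed.

    Without the reset, the child would report objects trimmed by the
    parent before the split, which it never trimmed itself.
    """
    return replace(parent, objects_trimmed=0)
```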
Yuri Weinstein
7bd2450f11
Merge pull request #45078 from kamoltat/wip-ksirivad-fix-target-size-ratio-bound
mon/MonCommands.h: fix target_size_ratio range

Reviewed-by: Neha Ojha <nojha@redhat.com>
2022-03-15 09:22:58 -07:00
Yuri Weinstein
1572be9b8f qa/tests: added client upgrade suites to the crontab
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2022-03-14 08:28:37 -07:00
Ramana Raja
3c22f6eb71 cephfs/test_nfs: fix _write_to_read_only_export()
... to fail the test if write to read only export succeeded.

Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-03-13 20:14:50 -04:00
Yuri Weinstein
ad7a527c34
Merge pull request #44827 from ceph/wip-yuriw-crontab-master2
qa/tests: added quincy and removed master runs

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
2022-03-13 09:10:27 -07:00
Zack Cerza
e39304dcd6
Merge pull request #45034 from ceph/cephadm-conditional-hostname
qa/tasks/cephadm: Make hostname change idempotent
2022-03-10 19:07:17 -07:00
Kamoltat Sirivadhna
5bfc71e85d
Merge pull request #45200 from kamoltat/wip-ksirivad-pg-num-max-reorder
osd/osd_types: pg_num_max reordering
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2022-03-10 12:01:27 -05:00
Nikhilkumar Shelke
cc64c57c5c qa: verify idempotent subvolume creation for 'mode' argument
Fixes: https://tracker.ceph.com/issues/54375
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-03-09 11:55:14 +05:30
chunmei-liu
3cca3ed552 qa/suites/crimson-rados: add fixed-1 for seastore
Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
2022-03-08 21:29:04 -08:00
Adam King
03a0c3b384
Merge pull request #45039 from Matan-B/wip-matanb-nfs-timeout
qa/tasks/cephfs: increase timeout in test_nfs.py

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-03-08 13:31:40 -05:00
Ilya Dryomov
4ba375ea2b
Merge pull request #45149 from sunnyku/wip-fix-schedule-status
mgr/rbd_support: cast pool_id from int to str when collecting LevelSpec

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2022-03-07 12:14:36 +01:00
chunmei-liu
815300a5bd qa/suites/crimson-rados: add seastore test case
Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
2022-03-07 01:35:08 -08:00
Nikhilkumar Shelke
34cf3ec723 qa: test ceph-fuse mount on already mounted path
Fixes: https://tracker.ceph.com/issues/46075
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-03-07 14:13:42 +05:30
Venky Shankar
c0c05efaf3
Merge pull request #42098 from joscollin/wip-B50033-rank0-stale-perf-stats
mgr/stats: be resilient to offline rank0 MDS

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-03-07 10:37:23 +05:30
Ilya Dryomov
08df6e0fd0 qa/workunits/rbd: expand LevelSpec parsing coverage
Invoke "rbd mirror snapshot schedule ls -R" and "rbd mirror snapshot
schedule status" commands on all levels, consistently.  In particular,
make sure that an image level schedule is listed for a recursive query
at the pool level both before and after the schedule kicks in:

  $ rbd create --size 1G --mirror-image-mode snapshot -p foo bar
  $ rbd mirror snapshot schedule add -p foo --image bar 1m
  $ rbd mirror snapshot schedule ls -p foo -R
  POOL  NAMESPACE  IMAGE  SCHEDULE
  foo              bar    every 1m
  <wait for schedule to become visible in status>
  $ rbd mirror snapshot schedule ls -p foo -R
  POOL  NAMESPACE  IMAGE  SCHEDULE
  foo              bar    every 1m

Also, make sure that pool and image level status queries work:

  $ rbd mirror snapshot schedule status -p foo
  SCHEDULE TIME        IMAGE
  2022-03-04 07:14:00  foo/bar
  $ rbd mirror snapshot schedule status -p foo --image bar
  SCHEDULE TIME        IMAGE
  2022-03-04 07:14:00  foo/bar

Both of these issues are fixed by the previous commit.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-03-05 11:20:33 +01:00
Kamoltat
4add5feff6 qa/workunits/mon/pg_autoscaler.sh: clean up white space
remove white space and weird indentations

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2022-03-04 16:40:07 +00:00
Kamoltat
09785475f1 qa/workunits/cephtool/test.sh: added test cases for target_size_ratio
Test the commands:

`osd pool create` <pool> --target_size_ratio <float>

`osd pool set` <pool> target_size_ratio <float>

`osd pool get` <pool> target_size_ratio

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2022-03-04 16:38:44 +00:00
Ronen Friedman
d654839222 test: osd-scrub-snaps.sh: fix expected 'missing snaps' log string
Fix the expected log message to match the scrub code, by removing
the redundant part.

Fixes: https://tracker.ceph.com/issues/54458

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2022-03-03 08:03:00 +00:00
Kamoltat
1bc51f057f upgrade/pacific-x/parallel: Added mds.a and mds.b
Added mds daemons so that it can create
CephFS pools and set options using
`do_set_pool()` in FSCommand.cc, so that
we can cover corner cases like the one in

https://tracker.ceph.com/issues/54263

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2022-03-02 22:34:21 +00:00
Yuri Weinstein
1a684be89d
Merge pull request #44656 from NitzanMordhai/fix_44092_config_get
mon/ConfigMonitor: fix config get key with whitespace

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2022-03-02 08:02:05 -08:00
Venky Shankar
fe3d543a0d
Merge pull request #44873 from kotreshhr/clone-inherit-quota-files
mgr/volumes: Inherit file quota attr to clone

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-02-21 14:37:44 +05:30
Venky Shankar
b8afe74b49
Merge pull request #44854 from kotreshhr/subvolumegroup_ls
mgr/volumes: Fix subvolumegroup ls

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-02-21 14:37:03 +05:30
Christopher Hoffman
19d46b9181 qa/suites/krbd: add rbd_default_map_options override coverage
Add coverage to test precedence, override, and option merge on rbd map.

Signed-off-by: Christopher Hoffman <choffman@redhat.com>
2022-02-18 17:19:45 +01:00
Ilya Dryomov
7f391c5688 qa/suites/krbd: rename rxbounce subsuite
A new job that doesn't want ms_mode to be set underneath it is about to
be added.  Rename rxbounce to ms_modeless to make this purpose obvious.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-02-18 17:19:45 +01:00
Yuri Weinstein
7f4292a914 qa/tests: replaced master with quincy runs
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2022-02-16 11:36:32 -08:00
Zack Cerza
0f1118e228 cephadm: Make hostname change idempotent
Inside a container, we may already have a short hostname - and also lack
the privileges to change it.

Signed-off-by: Zack Cerza <zack@redhat.com>
2022-02-16 12:13:05 -07:00
Ilya Dryomov
8f0fd0af3d qa/suites/rbd: make sure block-rbd.so is installed
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-02-16 12:20:44 +01:00
Kotresh HR
95a83efcbe qa: Add test for subvolumegroup ls filter
Fixes: https://tracker.ceph.com/issues/54099
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-02-16 11:49:47 +05:30
Jos Collin
116e89a2f2
qa: test ceph fs perf stats doesn't output stale metrics
Test that `ceph fs perf stats` doesn't output stale metrics
after the rank0 MDS failover.

Fixes: https://tracker.ceph.com/issues/50033
Signed-off-by: Jos Collin <jcollin@redhat.com>
2022-02-16 11:18:45 +05:30
Ilya Dryomov
525ff61cfc qa/tasks/qemu: make sure block-rbd.so is installed
Fixes: https://tracker.ceph.com/issues/54286
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-02-15 14:57:51 +01:00
Matan Breizman
44ad552093 qa/tasks/cephfs: increase timeout in test_nfs.py
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2022-02-15 10:47:43 +00:00
Kotresh HR
96c7963404 qa: Validate file quota attrs on clone subvolume
Fixes: https://tracker.ceph.com/issues/54121
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-02-15 13:40:17 +05:30
Patrick Donnelly
c2f132140d
qa: reduce fs:workload use of random selection
It's more appropriate to use --subset to reduce the scheduling size. It
was previously laid out this way because we wanted to link to the common
`qa/cephfs/mount` directory so that ceph-fuse mounts are not needlessly
multiplied. We should just organize it correctly so that is not an
issue.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-14 13:52:14 -05:00
Adam King
c1b48ce400
Merge pull request #44917 from guits/fix_rook_smoke_nvme_loop
qa/orch: use hwe kernel in rook/smoke tests (ubuntu 20.04)

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2022-02-14 10:55:36 -05:00
Adam King
884dc76683
Merge pull request #44965 from adk3798/test_cli_timeout
qa/tasks/cephadm_cases: increase timeouts in test_cli.py

Reviewed-by: Michael Fritch <mfritch@suse.com>
2022-02-14 08:24:32 -05:00
Venky Shankar
43b3a7c845
Merge pull request #39315 from shenhang/fetchkey
mds: fetch single dentry instead of complete dirfrag for lookup

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-02-14 09:53:10 +05:30
Yuri Weinstein
2624f51a72
Merge pull request #44588 from kamoltat/wip-ksirivad-disable-progress-by-default
pybind/mgr/progress: disable pg recovery event by default

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2022-02-11 14:49:17 -08:00
Adam King
46f939f057 qa/tasks/cephadm_cases: increase timeouts in test_cli.py
These seem to be failing sometimes but in my testing
sometimes these events are happening a few seconds after
we hit the timeout. Trying to see if this makes the tests
more consistent. No need to mark the test as failed
if we report something up in 34 seconds vs 25 especially
when cephadm works on a cyclic daemon refresh.
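
The reasoning above (an event reported after 34 seconds should not fail a test whose deadline was 25) can be sketched as a generic poll-with-deadline helper. This is an illustration only: `wait_until`, `FakeClock` and `event_seen_within` are hypothetical names and are not the code in test_cli.py.

```python
import time

def wait_until(predicate, timeout, interval=1.0,
               clock=time.monotonic, sleep=time.sleep):
    """Poll predicate() until it is true or `timeout` seconds have elapsed."""
    deadline = clock() + timeout
    while True:
        if predicate():
            return True
        if clock() >= deadline:
            return False
        sleep(interval)

class FakeClock:
    """Deterministic clock so the example runs instantly (illustration only)."""
    def __init__(self):
        self.now = 0.0
    def __call__(self):
        return self.now
    def sleep(self, seconds):
        self.now += seconds

def event_seen_within(timeout, ready_at=34.0):
    """Simulate an event that only becomes visible `ready_at` seconds in."""
    clock = FakeClock()
    return wait_until(lambda: clock.now >= ready_at, timeout,
                      clock=clock, sleep=clock.sleep)
```

With these assumptions, `event_seen_within(25)` gives up before the event shows up, while a longer deadline such as `event_seen_within(60)` observes it.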

Signed-off-by: Adam King <adking@redhat.com>
2022-02-09 20:42:42 -05:00
Patrick Donnelly
e883dc3b82
Merge PR #42000 into master
* refs/pull/42000/head:
	qa: update rhel kclient to setup container tools
	qa: stop overriding distro for k-testing
	qa: only use RHEL for workload testing
	qa: convert fs:workload to use cephadm
	qa: split fs begin task
	qa/tasks/cephadm: setup CephManager when OSDs are provisioned
	qa/tasks/cephadm: setup file system if MDS are provisioned

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-02-09 09:34:49 -05:00
shenhang
d7feb89823 mds: add an option to decide whether prefetching entire dir or not.
Looking up a single dentry can be made faster by setting this option to
false when the dir is not in memory.
Signed-off-by: "Shen, Hang" <shenhang@kuaishou.com>
2022-02-09 17:31:46 +08:00
Venky Shankar
b7af2a94a4
Merge pull request #42549 from ajarr/wip-add-volume-rename
mgr/volumes: Add `fs volume rename` command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-02-09 11:34:32 +05:30
Guillaume Abrioux
b252c6495c qa/orch: use hwe kernel in rook/smoke tests (ubuntu 20.04)
Use the override in ./src/qa/distros/container-hosts/ubuntu_20.04.yaml
in order to use hwe kernel for Ubuntu 20.04

This is because the Ubuntu 20.04 kernel (5.4) has a bug that prevents
nvme-loop from being used.

see https://lkml.org/lkml/2020/9/21/1456

Fixes: https://tracker.ceph.com/issues/54094

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
2022-02-08 10:58:02 +01:00
Ali Maredia
8de2884e06
Merge pull request #44856 from cbodley/wip-54103
qa/rgw: install s3cmd with extra_system_packages

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2022-02-07 16:02:40 -05:00
Ilya Dryomov
bad21fa497
Merge pull request #44842 from idryomov/wip-krbd-rxbounce-option
rbd: recognize rxbounce map option

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
2022-02-06 20:37:31 +01:00
Ilya Dryomov
512f8da720
Merge pull request #43910 from sunnyku/wip-rbd-snap-sched
mgr/rbd_support: fix schedule remove

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2022-02-06 20:32:09 +01:00
Venky Shankar
0e37e6a7be
Merge pull request #44851 from batrick/i54081
mds: add inline flag to MDS bootstrap compat in MDSMap

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-02-05 18:04:29 +05:30
Neha Ojha
8a8945e640
Merge pull request #44868 from neha-ojha/wip-move-to-stream
qa/distros: remove centos8

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-04 11:56:08 -08:00
Casey Bodley
ca8e0a64b1
Merge pull request #44386 from ivancich/wip-reshard-tentative-goal
rgw: in bucket reshard list, clarify new num shards is tentative

Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
2022-02-04 14:54:36 -05:00
Ilya Dryomov
fbf8c1d68b qa/suites/krbd: add legacy+rxbounce and crc+rxbounce coverage
For basic, rbd and rbd-nomount subsuites, replace legacy and crc
facets with "legacy or legacy+rxbounce" and "crc or crc+rxbounce"
facets (chosen at random).

For fsx, singleton and thrash subsuites, add legacy+rxbounce and
crc+rxbounce facets and drop prefer-crc facet.  The expected behaviour
of the latter depends on cluster configuration and should be tested
separately.

The total number of jobs remains the same.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-02-04 19:04:38 +01:00
Ilya Dryomov
95d30b534e qa: krbd rxbounce test
Lives in its own directory since ms_mode doesn't need to be permuted
here.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-02-04 19:04:37 +01:00
Sunny Kumar
a9312d4777 qa/workunits/rbd: improve schedule add/remove cli test
This patch adds a few tests to cover schedule add/remove with invalid
inputs.

Signed-off-by: Sunny Kumar <sunkumar@redhat.com>
2022-02-04 16:16:03 +00:00
Casey Bodley
a2d2ed2d8e
Merge pull request #44348 from soumyakoduri/wip-skoduri-lc-teuthology
rgw/qa: Add test suite for lifecycle cases

Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-02-04 11:04:13 -05:00
Kamoltat
f06da20dff pybind/mgr/progress: disable pg recovery event by default
The progress module now disables the pg recovery event by default,
since the event is expensive and has interrupted other services
when OSDs are being marked in/out of the cluster.

To turn the event on manually:

ceph config set mgr mgr/progress/allow_pg_recovery_event true

Updated qa/tasks/mgr/test_progress.py to enable
the pg recovery event when testing the progress module.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2022-02-03 17:51:42 +00:00
Patrick Donnelly
445cdd4120
qa: test inline compat set on older MDSMap
Reproduced here:

    /ceph/teuthology-archive/pdonnell-2022-01-31_19:13:02-fs:upgrade-master-distro-default-smithi/6651572/teuthology.log

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-03 11:01:59 -05:00
Patrick Donnelly
1f714da814
qa: fix or add missing .qa links
Using this command:

    find qa/suites/ -type d -execdir ln -sfT ../.qa/ {}/.qa \;
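
    A rough Python equivalent of the find command above, as a sketch only.
    The function name `relink_qa` is hypothetical, and unlike `ln -sfT` it
    silently skips a `.qa` entry that is a real file or directory.

```python
import os

def relink_qa(root):
    """Create or refresh a '.qa' symlink to '../.qa/' in every directory under root."""
    created = []
    # os.walk does not descend into symlinked directories by default,
    # so existing .qa links are never followed.
    for dirpath, _dirnames, _filenames in os.walk(root):
        link = os.path.join(dirpath, ".qa")
        if os.path.islink(link):
            os.unlink(link)      # like ln -f: replace an existing link
        elif os.path.lexists(link):
            continue             # real file or directory: leave it alone
        os.symlink("../.qa/", link)
        created.append(link)
    return created
```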

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-03 10:08:30 -05:00
Patrick Donnelly
ec6cd23b20
qa: move .qa terminator
So links can be elsewhere in the qa suite (not used yet) and to simplify
a find command in a follow-up commit.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-03 10:08:30 -05:00
Neha Ojha
8ca5729d21 qa/suites/rados/thrash-old-clients: remove centos_8.3_container_tools_3.0
Signed-off-by: Neha Ojha <nojha@redhat.com>
2022-02-02 23:26:54 +00:00
Neha Ojha
77b0791760 qa/distros/container-hosts: remove centos_8.2 and centos_8.3
Related to https://tracker.ceph.com/issues/54087

Signed-off-by: Neha Ojha <nojha@redhat.com>
2022-02-02 20:03:33 +00:00
Neha Ojha
8f1d1b7c55 qa/distros/podman: remove centos_8.2 and centos_8.3
Related to https://tracker.ceph.com/issues/54087

Signed-off-by: Neha Ojha <nojha@redhat.com>
2022-02-02 20:02:48 +00:00
Soumya Koduri
9dfe5ac714 rgw/qa: Add test suite for lifecycle cases
Execute lifecycle s3-tests in the teuthology test-suite by configuring
required storage classes and 'rgw lc debug interval' option.

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
2022-02-03 00:16:03 +05:30
Patrick Donnelly
0fcf8922dc
qa: update rhel kclient to setup container tools
To fix [1,2].

[1] https://github.com/ceph/ceph/pull/42000#issuecomment-905628920
[2] https://github.com/ceph/ceph/pull/42000#issuecomment-906276775

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:35 -05:00
Patrick Donnelly
fb75ed6d39
qa: stop overriding distro for k-testing
This is a continuation of previous commit

    qa: only use RHEL for workload testing

We don't want to test fs:workload with centos/ubuntu to avoid packaging
issues and to reduce the matrix of distros we're running workloads on.
Also, the testing kernel should install fine on the distros we test with
"supported" random distros.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:35 -05:00
Patrick Donnelly
017ccd21e6
qa: only use RHEL for workload testing
It's not useful testing workloads with different distributions; it just
adds to the maintenance burden of this qa suite as distro upgrades often
break compilation of various tests.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:35 -05:00
Patrick Donnelly
405bb2e48e
qa: convert fs:workload to use cephadm
Note: it's important to keep the install task which supplies packages
needed for some workloads.

Fixes: https://tracker.ceph.com/issues/51333
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:35 -05:00
Patrick Donnelly
50c39dc007
qa: split fs begin task
To allow switching to cephadm task.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:35 -05:00
Patrick Donnelly
27c1110129
qa/tasks/cephadm: setup CephManager when OSDs are provisioned
The Filesystem object may use this when configuring EC data pools at
file system creation (via a FuseMount).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:34 -05:00
Patrick Donnelly
2436405c5d
qa/tasks/cephadm: setup file system if MDS are provisioned
This is the same behavior/code as what the ceph task does.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-02-02 10:44:34 -05:00
Casey Bodley
12d23d3f63 qa/rgw: install s3cmd with extra_system_packages
this was preventing installation on ubuntu because of
https://tracker.ceph.com/issues/25026

Fixes: https://tracker.ceph.com/issues/54103

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-02-01 10:39:18 -05:00
Casey Bodley
40a7174491
Merge pull request #44850 from cbodley/wip-qa-rgw-centos8-eol
qa/rgw: rgw/verify no longer pins centos 8.0

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2022-02-01 10:18:13 -05:00
Ilya Dryomov
9dbcf6ab28
Merge pull request #44834 from idryomov/wip-rbd-image-id-spec
mgr/rbd_support: fix mon command API regressions

Reviewed-by: Mykola Golub <mgolub@suse.com>
2022-02-01 12:08:02 +01:00
Casey Bodley
0fad609d4d qa/rgw: rgw/verify no longer pins centos 8.0
the symlink rgw/verify/centos_latest.yaml already selects centos

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-01-31 17:23:27 -05:00
Casey Bodley
3b71b41190 qa/distros: remove duplicate centos_8.stream.yaml from supported
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-01-31 14:52:09 -05:00
Casey Bodley
0f4e51f05f qa/distros: centos_8.yaml is now a symlink to centos_8.stream.yaml
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-01-31 14:51:00 -05:00
Ilya Dryomov
4ed1e74d83 qa/suites/rbd: add cram-based mon command API test
With mon (rbd_support mgr module in this case) command definitions
generated automatically by @CLI{Read,Write}Command decorator, it's
very easy to accidentally break the external facing API.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-30 14:22:35 +01:00
Ilya Dryomov
a5eef01e92 mgr/rbd_support: level_spec is optional for schedule list/status
Commit fea6fdff4c ("mgr/rbd_support: level_spec passed to some
commands is not optional") is wrong.  While it is true that a valid
level_spec is needed to create a LevelSpec instance, an empty string
is very much a valid level spec -- it signifies "all levels".

This wasn't caught because within Ceph these commands are wrapped by
rbd CLI which injects an empty string in get_level_spec_args().

Fixes: https://tracker.ceph.com/issues/54058
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-30 14:22:35 +01:00
J. Eric Ivancich
aa0071ce8b rgw: in bucket reshard list, clarify new num shards is tentative
With dynamic bucket index resharding, when the average number of
objects per shard exceeds the configured value, that bucket is
scheduled for reshard. That bucket may receive more new objects before
the resharding takes place. As a result, the existing code
re-calculates the number of new shards just prior to resharding,
rather than waste a resharding opportunity with too low a value.

The same holds true for a user-scheduled resharding.

A user reported confusion that the number reported in `radosgw-admin
reshard list` wasn't the number that the reshard operation ultimately
used. This commit makes it clear that the new number of shards is
"tentative". And test_rgw_reshard.py is updated to reflect this
altered output.

Additionally this commit adds some modernization and efficiency to the
"reshard list" subcommand.
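
Why the listed number is only tentative can be shown with a toy calculation. This is not RGW's actual formula: `tentative_num_shards`, the object counts and the per-shard limit of 100,000 are all assumptions made for illustration.

```python
import math

def tentative_num_shards(num_objects, max_objs_per_shard):
    """Smallest shard count that keeps the average objects per shard at or below the limit."""
    return max(1, math.ceil(num_objects / max_objs_per_shard))

# Shard count computed when the bucket is scheduled for reshard...
at_schedule = tentative_num_shards(1_250_000, 100_000)
# ...and recomputed just before resharding, after more objects arrived.
at_reshard = tentative_num_shards(2_300_000, 100_000)
```

In this sketch the value shown at scheduling time (`at_schedule`) is smaller than the one used when the reshard runs (`at_reshard`), which is the discrepancy the "tentative" label is meant to explain.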

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
2022-01-28 12:05:54 -05:00
Yuri Weinstein
e96a3fc847
Merge pull request #44735 from neha-ojha/wip-reduce-cephadm
qa/suites/rados: reduce the number of cephadm tests

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2022-01-25 09:22:57 -08:00
Ilya Dryomov
82219b3bea
Merge pull request #44282 from orozery/qa-qemu-nbd-ide-interface
qa/tasks/qemu: switch nbd devices from virtio to ide

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-25 15:06:14 +01:00
Nitzan Mordechai
674489490f qa/workunit: Adding whitespace unit test for config set,get,rm
Update the workunit/mon/config.sh to include set/get/rm commands with and without whitespaces

Fixes: https://tracker.ceph.com/issues/44092
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
2022-01-25 11:45:50 +02:00
Patrick Donnelly
f7c931108a
Merge PR #44574 into master
* refs/pull/44574/head:
	qa: Add syslog ignore list

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2022-01-24 09:32:48 -05:00
Ilya Dryomov
b274bca1b2 qa/run_xfstests_qemu.sh: disable 251, 260 and 288
All three are skipped with virtio disks:

251	 [not run] FITRIM not supported on /dev/vdc
260	 [not run] FITRIM not supported on /dev/vdc
288	 [not run] FITRIM not supported on /dev/vdc

But 260 and 288 fail with ide disks, where discard defaults to on.  The
ancient kernel in our ubuntu-12.04.qcow2 doesn't support virtio discard
anyway so let's just disable them for consistency.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-23 20:16:29 +01:00
Ilya Dryomov
1f07820579 qa/run_xfstests_qemu.sh: fall back to ide disks if needed
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-22 12:32:02 +01:00
Neha Ojha
f849f1554c qa/suites/rados: reduce the number of cephadm tests
Currently, every rados run of ~400 jobs is running ~150 cephadm tests,
which is unnecessary and redundant. With this change, we will run some
basic cephadm tests within the rados suite. The following seems to be
a good start.

qa/suites/rados/cephadm/osds
qa/suites/rados/cephadm/smoke
qa/suites/rados/cephadm/smoke-singlehost
qa/suites/rados/cephadm/workunits

Signed-off-by: Neha Ojha <nojha@redhat.com>
2022-01-21 23:38:53 +00:00
Ali Maredia
99f0e82a95 qa: move certificates for kmip task into /etc/ceph
On rhel/centos the ceph user does not have permission
to access these certs which leads to s3-test failures
in teuthology.

Signed-off-by: Ali Maredia <amaredia@redhat.com>
2022-01-20 17:43:55 -05:00
Venky Shankar
ac28356234
Merge pull request #44557 from kotreshhr/clone-quota-failure
mgr/volumes: Fix subvolume snapshot clone failure

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-01-20 16:42:35 +05:30
Yuri Weinstein
0ccd1f69be
Merge pull request #44615 from ceph/wip-yuriw-crontab-master
qa/tests: run upgrades on quincy

Reviewed-by: Neha Ojha <nojha@redhat.com>
2022-01-19 08:15:20 -08:00
Aashish Sharma
f771cd492c mgr/dashboard: Improve notifications for osd nearfull, full
This PR adds some visual hints for osds that are near full or full

Fixes: https://tracker.ceph.com/issues/53334
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
2022-01-19 16:35:27 +05:30
Ernesto Puerta
86590ea131
Merge pull request #44578 from rhcs-dashboard/fix-53843-master
qa/dashboard: ensure node 16 is installed

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: David Galloway <dgallowa@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: sebastian-philipp <NOT@FOUND>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2022-01-18 20:23:07 +01:00
Ernesto Puerta
197987a5a8
Merge pull request #42603 from cypherean/feedback_frontend
mgr/dashboard: report ceph tracker bug/feature through GUI

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2022-01-18 19:47:13 +01:00
Casey Bodley
12301f63df
Merge pull request #40011 from cbodley/wip-49723
rgw: allow rgw_data_notify_interval_msec=0 to disable notifications

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2022-01-18 13:21:06 -05:00
Guillaume Abrioux
5c0f0698a5 qa/cephadm: install hwe kernel only for focal
Let's install the hwe kernel only on Ubuntu focal, otherwise we only shift the
problem to Ubuntu bionic, given that the hwe kernel for bionic is 5.4.

Fixes: https://tracker.ceph.com/issues/53863

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
2022-01-18 10:36:24 +01:00
Casey Bodley
52bfa9a866 qa/rgw: run multisite tests with some async notifications disabled
disable the sending of async datalog notifications on one zone per
cluster. this helps to verify that tests don't rely on notifications to
succeed

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-01-17 13:54:39 -05:00
Yuri Weinstein
095f970710 qa/tests: run upgrades on quincy
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2022-01-17 10:18:04 -08:00
Guillaume Abrioux
f8e22fb3da qa/nvme_loop: fix an issue on ubuntu 18.04
The following command:

```
echo /dev/sda | tee /sys/kernel/config/nvmet/subsystems/sda/namespaces/1/device_path
```

makes nvme_loop fail because fascinatingly, it adds an unexpected newline.

See:
```
/dev/sda
/dev/sda

1
tee: /sys/kernel/config/nvmet/subsystems/sda/namespaces/1/enable: No such file or directory
/dev/sda
1
```

Other distros don't have the same behavior:

```
CentOS 8
/dev/sda
/dev/sda
1

Ubuntu 20.04
/dev/sda
/dev/sda
1
```

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
2022-01-17 17:10:08 +01:00
Avan Thakkar
ed2b4e7a56 mgr/dashboard: report ceph tracker bug/feature through GUI
Fixes: https://tracker.ceph.com/issues/44851
Signed-off-by: Shreya Sharma <shreyasharma.ss305@gmail.com>
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
2022-01-17 19:45:31 +05:30
Sebastian Wagner
5ff602e5cf
Merge pull request #44506 from sebastian-philipp/orch-suite-add-scsi
qa/suites/orch/cephadm: Also run the rbd/iscsi suite

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Melissa Li <mingkli@redhat.com>
2022-01-17 09:39:50 +01:00
Kotresh HR
7c0d31e52c qa: Add tests for snapshot clone failure with quota
Fixes: https://tracker.ceph.com/issues/53848
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-01-17 12:14:33 +05:30
Yuri Weinstein
e419a29be5
Merge pull request #42735 from amathuria/wip-amathuria-scrub-stats
osd/scrub: Add stats to PG dump for number of objects scrubbed

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
2022-01-14 10:46:28 -08:00
Ernesto Puerta
7225b68e46
qa/dashboard: ensure node 16 is installed
For Ubuntu: https://github.com/nodesource/distributions#manual-installation

Fixes: https://tracker.ceph.com/issues/53843
Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
2022-01-14 18:08:25 +01:00
Yuri Weinstein
755dc55b8e
Merge pull request #44370 from benhanokh/NCB_expand_device_fix
NCB code doesn't update allocation file when we expand-device

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
2022-01-14 09:06:41 -08:00
Ilya Dryomov
3c2b05a252
Merge pull request #44571 from idryomov/wip-xfstests-qemu-cert
qa/run_xfstests_qemu.sh: stop reporting success without actually running any tests

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
2022-01-14 10:28:06 +01:00
Venky Shankar
e65d88ca58
Merge pull request #44570 from vshankar/wip-53857
qa: adjust for MDSs to get deployed before verifying their availability

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-01-14 08:42:20 +05:30
Ramana Raja
0ccb9595f6 qa/suite/fs/cephadm: switch to using recommended distro
See https://github.com/ceph/ceph/pull/43934

Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-01-13 11:17:17 -05:00
Ramana Raja
0043a9e58c qa/suites: fs volume rename cephadm tests
Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-01-13 11:12:34 -05:00
Ramana Raja
70697629bf mgr/volumes: Add fs volume rename command
The `fs volume rename` command renames the volume, i.e.,
orchestrator MDS service, file system, and the data and
metadata pool of the file system.

Fixes: https://tracker.ceph.com/issues/51162
Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-01-13 10:36:46 -05:00
Casey Bodley
bc81cd1226
Merge pull request #44561 from cbodley/wip-51727
qa/rgw: add PG_DEGRADED cluster warnings to log-ignorelist

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2022-01-13 09:38:49 -05:00
Venky Shankar
8939d8c14b qa: adjust for MDSs to get deployed before verifying their availability
The check happens when some MDSs are *just* deployed by cephadm causing
jobs to fail with:

     Command failed on smithi016 with status 1: 'sudo /home/ubuntu/cephtest/cephadm \
     --image docker.io/ceph/ceph:v16.2.4 shell -c /etc/ceph/ceph.conf -k \
     /etc/ceph/ceph.client.admin.keyring --fsid 403bfcae-706b-11ec-8c32-001a4aab830c \
     -- bash -c \'ceph --format=json mds versions | jq -e ". | add == 4"\''
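
The `jq -e ". | add == 4"` check in the failing command sums the per-version daemon counts and compares the total with the expected number of MDSs. A minimal Python sketch of the same check follows. The function name and the sample JSON are hypothetical, and it assumes the command prints a JSON object that maps version strings to counts.

```python
import json

def all_mds_reported(versions_json, expected):
    """Return True when the per-version daemon counts add up to `expected`."""
    versions = json.loads(versions_json)
    return sum(versions.values()) == expected

# Hypothetical output: every MDS already reports the same version.
sample = '{"ceph version 16.2.4 (hypothetical build) pacific (stable)": 4}'
```

If the check runs while some MDSs are still being deployed, the sum is below the expected count and the comparison fails, which matches the exit status 1 shown above.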

Fixes: http://tracker.ceph.com/issues/53857
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2022-01-13 18:25:58 +05:30
Venky Shankar
6b59fe1bec
Merge pull request #44397 from lxbsz/wip-53726
mds: dump tree '/' when the path is empty

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-01-13 18:15:24 +05:30
Venky Shankar
b52f86c8a5
Merge pull request #44422 from lxbsz/wip-51705
qa: do not use any time related suffix for *_op_timeouts

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-01-13 18:14:14 +05:30
Kotresh HR
e2ad5f9006 qa: Add syslog ignore list
Fixes: https://tracker.ceph.com/issues/50150
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2022-01-13 16:40:35 +05:30
Yuval Lifshitz
b709091d81
Merge pull request #43995 from TRYTOBE8TME/wip-rgw-kafka-teuth-cleanup
qa/tasks: Checking for kafka cleanup
2022-01-13 11:57:03 +02:00
Casey Bodley
95544e802b qa/rgw: add PG_DEGRADED cluster warnings to log-ignorelist
and cover rgw/singleton suite

Fixes: https://tracker.ceph.com/issues/51727

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2022-01-12 15:56:38 -05:00
Ilya Dryomov
b47965b577 qa/tasks/qemu: get the new Let's Encrypt root certificate
Fixes: https://tracker.ceph.com/issues/53841
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-12 20:53:45 +01:00
Ilya Dryomov
387be94794 qa/run_xfstests_qemu.sh: harden against wget failures
If wget fails (e.g. due to a certificate issue), it still creates
an empty file.  Then this file is marked executable, ./"${SCRIPT}"
immediately returns 0 and run_xfstests_qemu.sh exits successfully
without running a single xfstest.

This started on Sep 30, 2021 with the expiration of Let's Encrypt
root certificate -- all qemu jobs with "test: qa/run_xfstests_qemu.sh"
just booted the VM for a couple of seconds and reported success.
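
The failure mode described above (an empty download that is marked executable and then "succeeds") can be guarded against by validating the file before using it. The sketch below shows the idea in Python rather than in the shell script itself. `mark_executable_if_valid` is a hypothetical helper, not the change made in run_xfstests_qemu.sh.

```python
import os
import stat

def mark_executable_if_valid(path):
    """Refuse to proceed if the download is missing or empty, then add u+x."""
    if not os.path.isfile(path) or os.path.getsize(path) == 0:
        raise RuntimeError(f"download of {path} failed or produced an empty file")
    mode = os.stat(path).st_mode
    os.chmod(path, mode | stat.S_IXUSR)
```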

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-01-12 20:53:45 +01:00
Yuri Weinstein
a8bb49d4d9
Merge pull request #39440 from pdvian/wip-warn-filestore-osds
mon/OSDMonitor, osd: Add warning on filestore deprecation and force use of wpq scheduler for filestore OSDs

Reviewed-by: Neha Ojha <nojha@redhat.com>
2022-01-12 08:49:02 -08:00
Gabriel BenHanokh
a39b1f3cf7 tools/ceph-bluestore-tool: Fix bluefs-bdev-expand command
Update allocation file when we expand-device
Add the expanded space to the allocator and then force an update to the allocation file

There is also a new standalone test case for expand

Fixes: https://tracker.ceph.com/issues/53699
Signed-off-by: Gabriel Benhanokh <gbenhano@redhat.com>
2022-01-12 18:07:59 +02:00
Aishwarya Mathuria
91885f1a87 qa/standalone: add test to check if objects_scrubbed is equal to number of objects in a PG once a scrub finishes
Signed-off-by: Aishwarya Mathuria <amathuri@redhat.com>
2022-01-12 14:57:40 +05:30
Casey Bodley
81d3517bde
Merge pull request #42891 from ofriedma/wip-ofriedma-rgw-qos-finale
rgw: Add rgw rate limiting per user and per bucket

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-01-11 11:35:05 -05:00
Kalpesh Pandya
6135747a06 qa/tasks: Checking for kafka cleanup
Add a sleep after running the ./kafka-server-stop.sh and ./zookeeper-server-stop.sh
scripts so that nothing gets logged into the kafka logs after the sleep time,
and finally kill the process.

This resolves: https://tracker.ceph.com/issues/53220

Signed-off-by: Kalpesh Pandya <kapandya@redhat.com>
2022-01-11 21:14:15 +05:30
Ernesto Puerta
f5237e8b4a
Merge pull request #44088 from ceph/feature-48388-cache
mgr: TTL cache implementation

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: sebastian-philipp <NOT@FOUND>
2022-01-11 12:52:41 +01:00
Matan
27c6784cac
Merge pull request #44495 from Matan-B/wip-matanb-crimson-flavor
qa/suites: Crimson flavor configuration fix 

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: chunmei-liu <chunmei.liu@intel.com>
2022-01-10 20:30:18 +02:00
Or Friedmann
fd084fd7fc rgw: Add admin ops API for rate limiting
Add admin ops API for rate limiting and some bug fixes

Signed-off-by: Or Friedmann <ofriedma@redhat.com>
2022-01-10 16:48:56 +00:00
Sebastian Wagner
651192aacc
qa/suites/orch/cephadm: Also run the rbd/iscsi suite
Adding a new workload test to our suite.

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
2022-01-10 10:45:36 +01:00
Matan Breizman
3dad67c9d2 qa/suites: Fix crimson flavor configuration
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2022-01-06 14:24:20 +00:00
Sage Weil
d2265e81fb Merge PR #44054 into master
* refs/pull/44054/head:
	doc/rados/operations: document pg_num_max
	mgr: set max of 32 pgs for .mgr pool
	mgr/dashboard: expect pg_num_max property for pools
	mon/OSDMonitor: add option --pg-num_max arg for create pool
	mon/OSDMonitor: disallow setting pg_num < min or > max
	mgr/pg_autoscaler: apply pg_num_max
	mon: add pg_num_max pool property

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2022-01-06 08:54:45 -05:00
Venky Shankar
452653844d
Merge pull request #44000 from nmshelke/exception-handling-48473
mgr/stats: exception handling for ceph fs perf stats command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-01-05 17:55:39 +05:30
Alfonso Martínez
bb56e84c71
Merge pull request #44388 from rhcs-dashboard/02-host-inventory-fix
mgr/dashboard: fix orchestrator/02-hosts-inventory.e2e failure

Reviewed-by: Waad Alkhoury <walkhour@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2022-01-05 11:32:22 +01:00