Commit Graph

9343 Commits

Author SHA1 Message Date
Pere Diaz Bou
80964fb520 mgr/dashboard: fix rbdconfiguration init type
Signed-off-by: Pere Diaz Bou <pdiazbou@redhat.com>
2022-07-13 17:45:37 +02:00
Pere Diaz Bou
08c980ede3 mgr/dashboard: fix api tests + lint
Signed-off-by: Pere Diaz Bou <pdiazbou@redhat.com>
2022-07-12 19:09:03 +02:00
Pere Diaz Bou
3c308804d5 mgr/dashboard: add rbd list sorting support
Support sorting with name, pool name and namespace

Signed-off-by: Pere Diaz Bou <pdiazbou@redhat.com>
2022-07-12 19:09:02 +02:00
Rishabh Dave
15ccc52a81
Merge pull request #46861 from ajarr/fix-54108
qa/suites/fs: skip check-counters for iogen workload

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2022-07-08 19:36:22 +05:30
Rishabh Dave
17b6d4374d
Merge pull request #46191 from rishabh-d-dave/qa-cephfs-mount-rm-sudo
qa/cephfs: remove sudo parameter from run_shell()

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Nikhilkumar Shelke <nshelke@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
2022-07-08 19:30:16 +05:30
Rishabh Dave
9f95f96dc0
Merge pull request #45998 from dparmar18/implement_ln_cephfs_shell
cephfs-shell: ln command implementation

Reviewed-by: Nikhilkumar Shelke <nshelke@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2022-07-08 19:26:34 +05:30
Yuri Weinstein
8b4877474e
Merge pull request #46700 from sseshasa/wip-fix-mclock-config-set
osd: Set initial mClock QoS params at CONF_DEFAULT level

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2022-07-07 08:44:15 -07:00
David Galloway
a9161b6be9
Merge pull request #46932 from ceph/fix-crontab
qa/tests: Typos and spacing
2022-07-06 12:12:15 -04:00
dparmar18
24d6726b85 qa: tests for ln command
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2022-07-06 16:31:28 +05:30
Sridhar Seshasayee
e0b5316171 osd: Set initial mClock QoS params at CONF_DEFAULT level
Create the initial mClock QoS params at CONF_DEFAULT level using
set_val_default(). This allows switching to a custom profile on a
running OSD and to make necessary changes to the desired QoS params.
Note that Switching to ‘custom’ profile and then subsequently changing
the QoS params using “config set osd.n …” will be at a higher level i.e.
at CONF_MON.

But When switching back to a built-in profile, the new values won’t take
effect since CONF_DEFAULT < CONF_MON. For the values to take effect, the
config keys created as part of the ‘custom’ profile must be removed from
the ConfigMonitor store after switching back to a built-in profile.

- Added a couple of standalone tests to exercise the scenario.
- Updated the mClock configuration document and the mClock internal
  documentation with a couple of typos relating to the best effort weights.
- Added new sections to the mClock configuration document outlining the
  steps to switch between the built-in and custom profile and vice-versa.

Fixes: https://tracker.ceph.com/issues/55153
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2022-07-06 16:15:58 +05:30
Venky Shankar
05976ab9fb
Merge pull request #46157 from nmshelke/fix-55041
mgr/volumes: display in-progress clones for a snapshot

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
2022-07-05 14:39:57 +05:30
Venky Shankar
01e0a9a7e2
Merge pull request #46746 from vshankar/wip-fs-snaps-timeout
qa: increase fs/snaps workunit timeout to 6h

Reviewed-by: Xiubo Li <xiubli@redhat.com>
2022-07-04 20:41:51 +05:30
Nikhilkumar Shelke
f4c6bdb3c6 qa: display in-progress clones for a snapshot
If any clone is in pending or in-progress state then
show these clones in 'fs subvolume snapshot info'
command output.

Fixes: https://tracker.ceph.com/issues/55041
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-07-04 13:31:40 +05:30
Venky Shankar
c37bd10303
Merge pull request #46575 from batrick/i51332
qa: exercise metadata replication in workloads

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-07-04 08:25:15 +05:30
Venky Shankar
3dbf673084
Merge pull request #46602 from batrick/i55980
mds,qa: some balancer debug messages (<=5) not printed when debug_mds is >=5

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2022-07-04 08:22:50 +05:30
David Galloway
c714ee5449 qa/tests: Typos and spacing
Signed-off-by: David Galloway <dgallowa@redhat.com>
2022-07-01 16:40:20 -04:00
Adam King
1c0813b90e
Merge pull request #44493 from zenomri/wip-omri-cephadm-jaeger
mgr/cephadm: Add jaeger-tracing services support

Reviewed-by: Adam King <adking@redhat.com>
2022-07-01 16:00:50 -04:00
Venky Shankar
06c35ed791
Merge pull request #46869 from nmshelke/fix-56384
qa/workunits/cephtool: check_response didn't find erasure-code string

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by:  Laura Flores <lflores@redhat.com>
2022-06-30 22:27:14 +05:30
Nikhilkumar Shelke
217c1d2bb2 qa/workunits/cephtool: check_response didn't find erasure-code string
1. If data or metadata pool is already in-use by filesystem
then it is not allowed to reuse the same pool for another
filesystems.

2. Test is failing because above(1) restrictions/checks comes
before checking erasure-code pools. Hence test is failing
and not finding expected error string in output.

3. Proposed fix checks newly added error string instead of
'erasure-code'.

4. Also adding new tests to verify string 'erasure-code'
by passing --force option so that check for pools reuse(1)
will be skipped and check for 'erasure-code' will be hit.

Fixes: https://tracker.ceph.com/issues/56384
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-06-29 12:28:55 +05:30
Ali Maredia
79a085e11a
Merge pull request #46660 from pritha-srivastava/wip-rgw-user-policy-test-changes
Changes to integrate user-policy(iam) tests in teuthology.

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2022-06-28 12:04:02 -04:00
Ramana Raja
69b29008fb qa/suites/fs: skip check-counters for iogen workload
We currently run "iogen -n 5 -s 2g" for about 10 minutes. This workload
does not always generate export/import of subtrees that is being checked
by iogen.yaml. iogen workload is suited for running heavily fragmented I/O
on a file system, and not for growing directory trees.

Fixes: https://tracker.ceph.com/issues/54108
Signed-off-by: Ramana Raja <rraja@redhat.com>
2022-06-27 13:34:28 -04:00
Rishabh Dave
5967592792 qa/cephfs: remove sudo parameter from run_shell()
Right now, run_shell() in mount.py accepts both "sudo" and "omit_sudo"
as parameters. It's better to accept only one of these two parameters.
A call to run_shell() where both are set to opposing values will be
buggy. Therefore, methods calling run_shell() must add "sudo" to command
arguments before call and set omit_sudo to False in call.

As a result of this change, methods like stat() and run_python() in
mount.py are now modified to add "sudo" to command arguments
and set omit_sudo to False within their own definitions.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-06-27 19:56:04 +05:30
Rishabh Dave
d28f2a29a7 qa/cephfs/mount: set omit_sudo when sudo is present in args
Methods run_as_user() and run_python() don't set omit_sudo to False even
when command arguments contain sudo. This will cause vstart_runner.py to
delete "sudo" from command arguments which will/might lead to a bug.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2022-06-27 19:43:54 +05:30
Ronen Friedman
2f71185ad6
Merge pull request #46643 from ronen-fr/wip-rf-blocked
scrub/osd: add clearer reminders that a scrub is blocked

Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2022-06-23 18:29:27 +03:00
Pritha Srivastava
e94873ad85 qa/rgw: 1. adding yaml files to integrate iam tests in teuthology.
2. Modifying s3tests.py for iam tests.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
2022-06-23 09:15:24 +05:30
Ronen Friedman
996d9064c6 scrub/osd: disable blocked-scrub warnings during some tests
As some Teuthology tests seem to block objects for long minutes,
we must not issue the "scrub is blocked for too long" warning
(that warning causes the tests to fail).

A new configuration parameter now controls the grace period before
the warning is issued. Some tests were modified to set this
configuration parameter to a large value.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2022-06-22 05:24:37 +00:00
Ilya Dryomov
7fb4fdbed0 mgr/rbd_support: always rescan image mirror snapshots on refresh
Establishing a watch on rbd_mirroring object and skipping rescanning
image mirror snapshots on periodic refresh unless rbd_mirroring object
gets notified in the interim is flawed.  rbd_mirroring object is
notified when mirroring is enabled or disabled on some image (including
when the image is removed), but it is not notified when images are
promoted or demoted.  However, load_pool_images() discards images that
are not primary at the time of the scan.  If the image is promoted
later, no snapshots are created even if the schedule is in place.  This
happens regardless of whether the schedule is added before or after the
promotion.

This effectively reverts commit 69259c8d37 ("mgr/rbd_support: make
mirror_snapshot_schedule rescan only updated pools").  An alternative
fix could be to stop discarding non-primary images (i.e. drop

    if not info['primary']:
        continue

check added in commit d39eb283c5 ("mgr/rbd_support: mirror snapshot
schedule should skip non-primary images")), but that would clutter the
queue and therefore "rbd mirror snapshot schedule status" output with
bogus entries.  Performing a rescan roughly every 60 seconds should be
manageable: currently it amounts to a single mirror_image_status_list
request, followed by mirror_image_get, get_snapcontext and snapshot_get
requests for each snapshot-based mirroring enabled image and concluded
by a single dir_list request.  Among these, per-image get_snapcontext
and snapshot_get requests are necessary for determining primaryness.

Fixes: https://tracker.ceph.com/issues/53914
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-06-21 12:46:49 +02:00
Venky Shankar
df19b8dbc7
Merge pull request #44900 from nmshelke/fix-54111
mon: verify data pool is already not in use by any file system

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by:  Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2022-06-20 10:02:41 +05:30
Venky Shankar
61673a77d4
Merge pull request #46363 from dparmar18/tracker_55548
qa/cephfs: fallback to older way of get_op_read_count

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
2022-06-20 09:57:26 +05:30
Venky Shankar
615f553cf4
Merge pull request #46560 from lxbsz/wip-55824
qa: wait rank 0 to become up:active state before mounting fuse client

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
2022-06-20 09:52:32 +05:30
Venky Shankar
c88a5c832e
Merge pull request #46635 from nmshelke/fix-55822
mgr/volumes: remove incorrect 'size' from output of 'snapshot info'

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by:  Anthony D'Atri <anthony.datri@gmail.com>
2022-06-20 09:50:20 +05:30
Venky Shankar
0fb05df9ec
Merge pull request #46636 from nmshelke/fix-55759
mgr/volumes: subvolume ls command crashes if groupname as '_nogroup'

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
2022-06-20 09:48:35 +05:30
Venky Shankar
771f6f7d27
Merge pull request #46699 from neesingh-rh/wip-56065
qa: TestMDSMetrics.test_delayed_metrics failure

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-06-20 09:47:09 +05:30
Venky Shankar
8c50e1ee2c qa: increase fs/snaps workunit timeout to 6h
commit 4fbf4c4f58 increases the
number of tags used in snaptest-git-ceph.sh tests. This makes
the tests run longer (than default 3h) thereby timing out.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2022-06-20 07:21:22 +05:30
Omri Zeneva
a33d77b813 qa: add jaeger deployment test in teuthology
Signed-off-by: Omri Zeneva <ozeneva@redhat.com>
2022-06-16 12:29:58 +03:00
Zack Cerza
d9e7c1b797
Merge pull request #46582 from ceph/rhel86
qa: Default to RHEL8.6 instead of 8.5
2022-06-15 15:05:55 -06:00
David Galloway
b8c4488c55 qa: Default to RHEL8.6 instead of 8.5
Signed-off-by: David Galloway <dgallowa@redhat.com>
2022-06-15 14:13:35 -04:00
Neeraj Pratap Singh
4b26559858 qa: TestMDSMetrics.test_delayed_metrics failure
TestMDSMetrics.test_delayed_metrics is failing due to
the absence of omit_sudo parameter in the remote.run()
of set_inter_mds_block() in qa/tasks/cephfs/filesystem.py

Fixes: https://tracker.ceph.com/issues/56065
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2022-06-15 19:21:53 +05:30
Adam King
06f265d057
Merge pull request #46506 from asm0deuz/mod_iscsi_systemd
Modify how Iscsi tcmu-runner container is started within systemd

Reviewed-by: Adam King <adking@redhat.com>
2022-06-14 09:56:49 -04:00
Ernesto Puerta
8bba174615
Merge pull request #46407 from melissa-kun-li/disable-create-image
mgr/dashboard: add rbd status endpoint and error page

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: sunilangadi2 <NOT@FOUND>
2022-06-14 11:00:21 +02:00
Nikhilkumar Shelke
dc4b0ee405 qa: subvolume ls command crashes if groupname as '_nogroup'
If --group_name=_nogroup is provided in the command then
throw error permission denied as it is internal group of ceph fs.

Fixes: https://tracker.ceph.com/issues/55759
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-06-14 12:34:51 +05:30
Xiubo Li
72194627c1 qa: wait rank 0 to become up:active state before mounting fuse client
When setting the ec pool to the layout the filesystem may not be
ready, so when mounting a fuse client it will fail. To fix this we
need to wait at least the rank 0 to be in up:active state.

Fixes: https://tracker.ceph.com/issues/55824
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-06-14 09:20:45 +08:00
Casey Bodley
6f765e25ab
Merge pull request #43597 from pritha-srivastava/wip-rgw-sts-role-multisite
rgw multisite: replicate metadata for iam roles

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-06-13 12:04:12 -04:00
Patrick Donnelly
2c57f20161
qa: increase mds balancer debug
Fixes: https://tracker.ceph.com/issues/55980
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2022-06-13 08:03:42 -04:00
Venky Shankar
27f4729256
Merge pull request #45556 from mchangir/qa-add-subvolume-option-flavors
qa: add subvolume option flavors

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
2022-06-13 12:29:43 +05:30
Venky Shankar
f7bc95c2f7
Merge pull request #44347 from kotreshhr/subvolumegroup-quotas
mgr/volumes: subvolumegroup quotas

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
2022-06-13 12:26:59 +05:30
Venky Shankar
67371c1ab4
Merge pull request #46332 from lxbsz/qa-snap
qa: enlarge the tag number and test more for the snapshot

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2022-06-13 12:25:44 +05:30
dparmar18
269567d005 qa/cephfs: fix read_debug_file() return value and a pep8 violation
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2022-06-13 11:33:43 +05:30
dparmar18
0aca27d2fc qa/cephfs: fallback to older way of get_op_read_count
Fixes: https://tracker.ceph.com/issues/55538

Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2022-06-13 11:33:43 +05:30
Nikhilkumar Shelke
9957a036df qa: remove incorrect 'size' from output of 'snapshot info'
The 'size' shown in the output of snapshot info command relies on
rstats which is incorrect snapshot size. It tracks size of the
subvolume from the snapshot has been taken instead of the snapshot
itself. Hence having the 'size' field in the output of 'snapshot info'
doesn't make sense until the rstats is fixed.

Fixes: https://tracker.ceph.com/issues/55822
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-06-12 17:17:03 +05:30