Commit Graph

4036 Commits

Author SHA1 Message Date
Venky Shankar
23443d3bc8 Merge PR #54031 into main
* refs/pull/54031/head:
	qa: add test to mangle lost+found directory object and ensure safety
	qa: run scrub before mounting client and validations

Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>
2024-01-30 09:59:57 +05:30
Venky Shankar
f155b8fb61 Merge PR #54446 into main
* refs/pull/54446/head:
	qa: bump up scrub status command timeout

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2024-01-29 18:54:44 +05:30
Venky Shankar
527b465872 Merge PR #54683 into main
* refs/pull/54683/head:
	qa/tasks/cephfs/test_misc: switch duration to timeout
	qa/tasks/cephfs/test_misc: fix the incorrect comments

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2024-01-29 18:52:16 +05:30
Venky Shankar
320442d791 Merge PR #53177 into main
* refs/pull/53177/head:
	qa: test nfs export apply -i failure returns pseudo path in output JSON
	mgr/nfs: include pseudo in JSON output when nfs export apply -i fails

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
2024-01-29 18:44:28 +05:30
Samuel Just
a5ce9c3863 Revert "crush: add multistep retry rules"
This PR was merged by accident before it was ready.
Let's revert for now and open a new PR.

Signed-off-by: Samuel Just <sjust@redhat.com>
2024-01-26 20:32:05 +00:00
Yuri Weinstein
37d5d931b0
Merge pull request #55096 from athanatos/sjust/for-review/wip-crush-msr
crush: add multistep retry rules

Reviewed-by: Laura Flores <lflores@redhat.com>
2024-01-26 11:57:53 -08:00
Yuri Weinstein
5ddd51e819
Merge pull request #54987 from batrick/i63822
pybind/mgr/devicehealth: skip legacy objects that cannot be loaded

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
2024-01-24 13:31:31 -08:00
Gregory Farnum
e5b0db4b2f
Merge pull request #55192 from dparmar18/fix_docstrings_ceph_test_case
qa: typo fixes in ceph_test_case docstrings
2024-01-24 09:37:34 -08:00
Venky Shankar
433fe34047 Merge PR #54682 into main
* refs/pull/54682/head:
	qa: drop mon_host check from test_cephfs_mirror_peer_bootstrap
	pybind/mgr/mirroring: drop mon_host from peer_list

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-01-18 10:25:47 +05:30
Venky Shankar
b5d0337092 Merge PR #54041 into main
* refs/pull/54041/head:
	qa: fix error reporting string in assert_cluster_log

Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-01-17 19:59:22 +05:30
Laura Flores
d38b2ba335
Merge pull request #55115 from ljflores/wip-tracker-63967
qa/tasks: fix syntax for deep-scrub command
2024-01-16 13:07:10 -06:00
Dhairya Parmar
d8d18011a8 qa: typo fixes in ceph_test_case docstrings
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2024-01-16 18:09:54 +05:30
Rishabh Dave
158f916f25
Merge pull request #54208 from neesingh-rh/json_asok_fix
qa: fix rank_asok() to handle errors from asok commands

Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2024-01-16 17:45:09 +05:30
Rishabh Dave
bdd0c89185
Merge pull request #54590 from vshankar/wip-57087
qa: set mds config with `config set` for a particular test

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2024-01-16 17:39:21 +05:30
Milind Changire
f14f60afd5
Merge PR #53999 into main
* refs/pull/53999/head:
	PendingReleaseNotes: support for subvolumes and subvolume groups in snap_schedule
	snap_schedule/tests: fix db upgrade issue
	qa: add yaml for on demand subvol version testing
	qa: add test cases for testing --subvol and --group arguments
	mgr/volumes: conditionalize subvolume upgrade
	mgr/volumes: ensure correct init of v1 subvol
	mgr/snap_schedule: add subvol and subvol group arguments to cli
	mds/snap_schedule: add subvolume group column management
	mgr/volumes: add remote helper methods to fetch subvolume info

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2024-01-11 10:31:31 +05:30
Patrick Donnelly
aa30adbaa7
qa: test devicehealth legacy load of deleted snap obj
Failure without fix looks like:

    2023-12-21T16:05:55.737+0000 7fbe585b0700  0 [devicehealth DEBUG root] loading object ABC_DEADB33F_FA
    2023-12-21T16:05:55.737+0000 7fbe585b0700 -1 log_channel(cluster) log [ERR] : Unhandled exception from module 'devicehealth' while running on mgr.x: [errno 2] RADOS object not found (Failed to operate read op for oid ABC_DEADB33F_FA)
    2023-12-21T16:05:55.737+0000 7fbe585b0700 -1 devicehealth.serve:
    2023-12-21T16:05:55.737+0000 7fbe585b0700 -1 Traceback (most recent call last):
    File "/home/pdonnell/ceph/src/pybind/mgr/devicehealth/module.py", line 394, in serve
        self._do_serve()
    File "/home/pdonnell/ceph/src/pybind/mgr/mgr_module.py", line 524, in check
        return func(self, *args, **kwargs)
    File "/home/pdonnell/ceph/src/pybind/mgr/devicehealth/module.py", line 354, in _do_serve
        finished_loading_legacy = self.check_legacy_pool()
    File "/home/pdonnell/ceph/src/pybind/mgr/devicehealth/module.py", line 326, in check_legacy_pool
        if self._load_legacy_object(ioctx, obj.key):
    File "/home/pdonnell/ceph/src/pybind/mgr/devicehealth/module.py", line 300, in _load_legacy_object
        ioctx.operate_read_op(op, oid)
    File "rados.pyx", line 3723, in rados.Ioctx.operate_read_op
    rados.ObjectNotFound: [errno 2] RADOS object not found (Failed to operate read op for oid ABC_DEADB33F_FA)

Credit to Greg Farnum for postulating the cause.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-01-10 14:52:37 -05:00
Patrick Donnelly
a066467c9d
qa: allow failing whatever the active mgr is
Vanilla `mgr fail` will just fail the active.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-01-10 14:52:37 -05:00
Patrick Donnelly
9ebcbdbed0
qa: add unit tests for MgrMap down flag
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-01-10 14:52:37 -05:00
Samuel Just
ef461914ee tasks/.../test_erasure_code_profile: assertSubset in test_create_plugin
Newly added profile options may break this test otherwise.

Signed-off-by: Samuel Just <sjust@redhat.com>
2024-01-09 11:35:45 -08:00
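
A minimal sketch of the subset-style check the commit above describes, assuming the profile is compared as plain dictionaries. The helper name `assert_subset` and the sample option names are hypothetical and are not taken from the test suite.

```python
def assert_subset(expected: dict, actual: dict) -> None:
    """Fail only if an expected key is missing or has a different value.

    Extra keys in `actual` (for example newly added profile options)
    are ignored, which is the point of a subset comparison.
    """
    for key, value in expected.items():
        if key not in actual:
            raise AssertionError(f"missing key: {key!r}")
        if actual[key] != value:
            raise AssertionError(
                f"{key!r}: expected {value!r}, got {actual[key]!r}"
            )


# Hypothetical profile contents, for illustration only.
requested = {"k": "2", "m": "1"}
reported = {"k": "2", "m": "1", "some_new_option": "default"}
assert_subset(requested, reported)  # passes despite the extra key
```

An exact equality check on the same two dictionaries would fail as soon as a new option appears in the reported profile.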
Laura Flores
25c16d7883 qa/tasks: fix syntax for deep-scrub command
Fixes: https://tracker.ceph.com/issues/63967
Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-09 10:54:01 -06:00
Adam King
67c5669157 qa/cephadm: support to pull stable branch compiled cephadm
This is to allow us to pull the latest build of
cephadm off of a stable branch (currently the only
valid option for that is reef, although this hopefully
will work with squid, T release, etc. in the future).

This should allow us to bootstrap clusters based on
those stable branches for use in upgrade testing.

Signed-off-by: Adam King <adking@redhat.com>
2024-01-08 09:51:52 -05:00
Milind Changire
006c68f492
qa: add test cases for testing --subvol and --group arguments
Signed-off-by: Milind Changire <mchangir@redhat.com>
2024-01-06 08:58:36 +05:30
Samuel Just
066953050c
Merge pull request #54214 from athanatos/sjust/wip-crimson-scrub
crimson: add initial scrub support

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2024-01-03 15:26:45 -08:00
Venky Shankar
79ba10e807 qa: add test to mangle lost+found directory object and ensure safety
Earlier Ceph versions didn't allow the lost+found directory, or the
entries inside it, to be removed. Users are recommended to fail the filesystem
and remove the directory object using rados cli commands. Therefore, include
this step as part of our testing.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2024-01-02 15:59:37 +05:30
Venky Shankar
46eba3333a qa: run scrub before mounting client and validations
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2024-01-02 15:59:36 +05:30
Matan Breizman
e78ed2e7ab
Merge pull request #53227 from Matan-B/wip-matanb-cleanup-stale-osdmap
osd/OSD: introduce trim_stale_maps

Reviewed-by: Samuel Just <sjust@redhat.com>
2023-12-18 16:36:49 +02:00
Matan Breizman
7ea40f0ee6 qa/tasks/ceph_manager: thrash - add trim stale osdmaps
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2023-12-18 12:56:24 +00:00
Patrick Donnelly
ca7230d1d8
Merge PR #52196 into main
* refs/pull/52196/head:
	qa: configure balancer for multi-mds workloads
	qa: create qa subvolumes in named subvolumegroup
	qa: do not rely on default max_mds value
	qa: add automate_balance to dashboard qa schema
	doc/cephfs: add docs for balance_automate
	doc/cephfs: use bash prompt for shell code
	mds: add balance_automate fs setting

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2023-12-15 08:18:28 -05:00
Venky Shankar
5365d997ce Merge PR #48895 into main
* refs/pull/48895/head:
	qa: test cases for checking the health status after scrub repair
	mds: scrub repair does not clear earlier damage health status

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2023-12-13 20:53:13 +05:30
Venky Shankar
bbf35d1f94 Merge PR #54704 into main
* refs/pull/54704/head:
	qa: don't fail on gather_mount_info() failure

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>
2023-12-13 20:38:51 +05:30
Patrick Donnelly
e555ab774b
qa: create qa subvolumes in named subvolumegroup
So we can twiddle the subvolumegroup settings (like pins).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-12-12 11:45:28 -05:00
Patrick Donnelly
7a2f59562c
qa: add automate_balance to dashboard qa schema
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-12-12 11:45:27 -05:00
Samuel Just
77fe84c095 qa/tasks/ceph: use tell <pgid> deep_scrub in osd_scrub_pgs
This is the more modern variant.  Crimson doesn't currently
support the pg <pgid> deep_scrub variant, so let's just use
this one generally.

Signed-off-by: Samuel Just <sjust@redhat.com>
2023-12-11 04:10:17 +00:00
Jos Collin
1c241e49e5
qa: drop mon_host check from test_cephfs_mirror_peer_bootstrap
Fixes: https://tracker.ceph.com/issues/63614
Signed-off-by: Jos Collin <jcollin@redhat.com>
2023-12-08 15:18:24 +05:30
Ilya Dryomov
d9147a14c4
Merge pull request #54205 from VallariAg/wip-nvmeof-test
qa: add rbd/nvmeof integration test

Reviewed-by: Zack Cerza <zack@redhat.com>
Reviewed-by: Aviv Caro <Aviv.Caro@ibm.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2023-12-04 18:14:38 +01:00
Vallari Agrawal
42e121a42a
qa: add rbd/nvmeof test
A basic test for ceph-nvmeof[1] where
nvmeof initiator is created.
It requires use of a new task "nvmeof_gateway_cfg"
under cephadm which shares config information
between two remote hosts.

[1] https://github.com/ceph/ceph-nvmeof/

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
2023-12-04 19:27:54 +05:30
Dhairya Parmar
e1ebfc023a qa: test nfs export apply -i failure returns pseudo path in output JSON
Fixes: https://tracker.ceph.com/issues/62626
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2023-12-04 18:56:19 +05:30
Venky Shankar
6b1f123866 Merge PR #53431 into main
* refs/pull/53431/head:
	qa: add test cases to verify error reporting works as expected
	mgr: fix some doc strings in object_format.py
	mgr/tests: test returning error status works as expected
	mgr: make object_format's Responder class capable of responding err status
	mgr/nfs: report proper errno with err status

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2023-11-30 11:28:10 +05:30
Rishabh Dave
533ed43609
Merge pull request #54631 from rishabh-d-dave/mgr-vol-tests
qa/cephfs: improvements for name generators in test_volumes.py

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2023-11-30 10:47:42 +05:30
Rishabh Dave
5d471c2712
Merge pull request #54308 from rishabh-d-dave/mgr-vol-flake-warnings
mgr/vol: fix flake8 warnings

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
2023-11-30 00:49:28 +05:30
Rishabh Dave
ad96e91ee4 qa/cephfs: improvements for name generators in test_volumes.py
Generate a name that is shorter and easier to remember.

Also, write a simpler, faster & better helper method for generating
unique names. This method will also have a shorter and more concise name,
so it will be easier to type and easier to read.

Fixes: https://tracker.ceph.com/issues/63680
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2023-11-29 21:15:29 +05:30
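
One way a short unique-name helper could look, as a sketch only. The function name `gen_name` and the counter-based scheme are assumptions for illustration. The commit above does not show the actual helper in test_volumes.py.

```python
import itertools

# Hypothetical process-wide counter; each call consumes the next integer.
_name_counter = itertools.count(1)


def gen_name(prefix: str) -> str:
    """Return a short name that is unique within this process."""
    return f"{prefix}{next(_name_counter)}"


first = gen_name("subvol")
second = gen_name("subvol")
assert first != second
```

A counter keeps names short and predictable. A random suffix would also work, but it is harder to read in test logs.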
Casey Bodley
67ffb05d71
Merge pull request #54618 from alimaredia/wip-remove-vstart-runner-radosgw-admin-task
qa: remove vstart runner from radosgw_admin task

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2023-11-29 15:18:57 +00:00
Neeraj Pratap Singh
7f0cf0b7a2 qa: test cases for checking the health status after scrub repair
Fixes: https://tracker.ceph.com/issues/54557
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2023-11-29 16:09:01 +05:30
Vallari Agrawal
b896bebf38
Merge pull request #54209 from VallariAg/wip-xml-scanner
qa: use Remote.run_unit_test and ValgrindScanner
2023-11-29 12:21:02 +05:30
Venky Shankar
e5e1d6254a qa: don't fail on gather_mount_info() failure
kernel 5.4 (Ubuntu 20.04) has the following missing commits:

- 5a9e2f5d5590 ceph: add ceph.{cluster_fsid/client_id} vxattrs
- 247b1f19dbeb ceph: add status debugfs file

fs suite relies on these debugfs entries to gather mount information
(client-id, addr/inst) which are required by some tests. In fs suite,
the distro kernel gets overridden by the testing kernel and therefore
even if Ubuntu 20.04 is chosen as the distro, the testing kernel is
installed. However, with smoke suite, the distro kernel is used and
the missing patches cause certain essential information gathering to
fail early on (client-id, etc.), causing the test to not even start
execution. PR #54515 fixes a bug in the client-id fetching path but
isn't complete due to the missing patches - details here:

        https://tracker.ceph.com/issues/63488#note-8

But it's essential to have the smoke tests running since those tests
have lately uncovered bugs in the MDS (w/ distro kernels). In order
to benefit from those tests, this change ignores failures when
gathering mount information (which aren't used by the fs relevant
smoke tests). The tests (in fs suite) that rely on this piece of
information would fail when run with 20.04 distro kernel (but the
fs suite overrides it with the testing kernel).

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2023-11-29 11:47:14 +05:30
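
The approach described in the commit above, ignoring a failure while gathering mount information, can be sketched roughly as follows. `MountInfoError` and `gather_mount_info_safely` are hypothetical names used for illustration. The real code paths in the qa suite are not shown in this log.

```python
import logging

log = logging.getLogger(__name__)


class MountInfoError(Exception):
    """Hypothetical error for missing debugfs entries on older kernels."""


def gather_mount_info_safely(gather):
    """Call `gather()` and return its result, or None if it fails.

    Tests that do not need the mount information can proceed; tests that
    do need it are expected to fail later when they find None.
    """
    try:
        return gather()
    except MountInfoError as e:
        log.warning("could not gather mount info, continuing: %s", e)
        return None


def _missing_debugfs():
    # Stand-in for a kernel that lacks the status debugfs file.
    raise MountInfoError("status debugfs file not present")


assert gather_mount_info_safely(_missing_debugfs) is None
```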
Ali Maredia
a44fd94b96 qa: remove vstart runner from radosgw_admin task
Fixes: https://tracker.ceph.com/issues/63583

Signed-off-by: Ali Maredia <amaredia@redhat.com>
2023-11-28 02:58:49 -05:00
Xiubo Li
f0ebac137b qa/tasks/cephfs/test_misc: switch duration to timeout
The duration is imprecise and will sometimes give a false alarm
when the shell command itself is issued late.

https://tracker.ceph.com/issues/63587
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-11-28 11:05:36 +08:00
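
A rough illustration of the general idea behind the commit above, not the teuthology code it changes: instead of measuring elapsed time around a command from the outside, the time limit is attached to the command itself, so a late start does not count against it. The helper name `finishes_within` is an assumption.

```python
import subprocess
import sys


def finishes_within(cmd, timeout):
    """Run `cmd` and report whether it completed before `timeout` seconds.

    The clock starts when the child process is launched, so delays before
    the command is issued are not included.
    """
    try:
        subprocess.run(cmd, check=True, timeout=timeout)
        return True
    except subprocess.TimeoutExpired:
        return False


# A trivial command finishes well inside a generous limit.
assert finishes_within([sys.executable, "-c", "pass"], 30)
```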
Xiubo Li
4e085c5d2f qa/tasks/cephfs/test_misc: fix the incorrect comments
https://tracker.ceph.com/issues/63587
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-11-28 11:05:36 +08:00
Josh Soref
ed70d843df rgw: fix spelling errors
* access
* acquire
* across
* algorithm
* alternatively
* ambiguous
* attributes
* authenticated
* automatically
* based
* between
* beware
* bootstrap
* bucket
* cache
* character
* check
* cleanup
* combinations
* committed
* comparison
* compatibility
* concurrent
* conditions
* conjunction
* connections
* considered
* contains
* correct
* creates
* crypto
* dependencies
* dequeued
* dequeuing
* derivative
* described
* determine
* deterministic
* detritus
* discrepancy
* display
* either way
* elements
* entirely
* entries
* error
* everything
* expected
* explicitly
* extend
* false
* fclose
* feature
* following
* foreign
* format
* forward
* functionality
* functions
* generated
* getting
* gracefully
* happened
* hashicorp
* healthcheck
* hierarchy
* however
* http
* identify
* idleness
* ignored
* incremental
* information
* initialization
* length
* locally
* macros
* malformed
* means
* mechanism
* multipart
* multiple
* must
* namespace
* necessary
* nonexistent
* notifications
* occurred
* official
* outputs
* outputted
* overridden
* ownership
* parallelism
* partition
* passthrough
* pause
* penguin
* perform
* permission
* populated
* position
* preferably
* prefixes
* presumably
* previous
* print
* quadruple
* reference
* regardless
* registry
* renewed
* request
* requires
* reshard
* resource
* result to
* satisfy
* secret
* separate
* separator
* setting
* shouldn't
* skipping
* some
* somehow
* something
* specific
* struct
* succeeded
* success
* successful
* successfully
* syntax
* testing
* truncated
* unambiguously
* unittest
* unmodified
* unnecessary
* valid_action
* verifying
* versioning
* visible
* whatever
* whenever

Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>
2023-11-27 10:27:32 -05:00
Yuval Lifshitz
213923165c
Merge pull request #52931 from yuvalif/wip-yuval-lua-teuthology
rgw/test/lua: add lua integration tests suite

reviewed-by: TRYTOBE8TME, cbodley
2023-11-24 11:38:51 +02:00