Commit Graph

7708 Commits

Author SHA1 Message Date
Kamoltat
993bb02b30 mgr/progress: introduce turn off/on feature
progress module can be turned off/on by using
the commands: 'progress off' and 'progress on'

Also refactor the teuthology test suite
to prevent possible future bugs.

fixes: https://tracker.ceph.com/issues/47238

Signed-off-by: kamoltat <ksirivad@redhat.com>
2020-11-16 03:46:42 +00:00
Yan, Zheng
c42570a329 qa/cephfs: log-ignorelist scrub errors
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-11-16 09:10:03 +08:00
Sidharth Anupkrishnan
ba9b85c481 qa/cephfs: Add more tests for multimds scrub
Signed-off-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
2020-11-16 09:02:18 +08:00
Yan, Zheng
904c959df4 qa/cephfs: add tests for multimds scrub
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-11-16 09:02:18 +08:00
Yan, Zheng
31deca1fec qa/cephfs: update existing scrub test cases
Now scrub is always async.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-11-16 09:02:18 +08:00
Patrick Donnelly
07cba31a03
Merge PR #34842 into master
* refs/pull/34842/head:
	qa/tasks/vstart_runner.py: disable the ns unsharing if using localhost

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-11-15 12:39:11 -08:00
Patrick Donnelly
3541c8de12
Merge PR #37982 into master
* refs/pull/37982/head:
	qa/cephfs: add code for when config is None in __init__

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2020-11-15 12:37:02 -08:00
Patrick Donnelly
359c8b0ddc
Merge PR #38016 into master
* refs/pull/38016/head:
	qa: switch to 'osdop_read' instead of 'op_r'

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-15 12:35:58 -08:00
Kefu Chai
f47f13f446
Merge pull request #38038 from bk201/wip-48209
mgr/dashboard: fix cephadm e2e test failure on deleting OSDs

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
2020-11-16 01:29:46 +08:00
Kefu Chai
0463a774c9
Merge pull request #37908 from dzafman/wip-47930
test: Fix race in TEST_recovery_scrub test

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-16 01:00:56 +08:00
Kefu Chai
f7962b1767
Merge pull request #37992 from rishabh-d-dave/dashboard-qa-skipTest
qa/mgr: mgr_test_case: raise SkipTest instead of calling skipTest()

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
2020-11-16 00:30:42 +08:00
Kefu Chai
9536625558 qa/tasks/ceph: use Cluster.sh() and Remote.sh() when appropriate
for better readability

Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-11-14 15:22:10 +08:00
Kefu Chai
de71f6b0a3 qa/tasks/ceph: update_archive_setting() only if ctx.archive is valid
When running teuthology interactively, ctx.archive might not be set.
If it's not set, there is no point trying to access files there.

Fixes: https://tracker.ceph.com/issues/48058

Signed-off-by: Marcus Watts <mwatts@redhat.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-11-14 15:22:00 +08:00
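
A minimal Python sketch of the guard described in de71f6b0a3, under stated assumptions: `archive_file_to_update`, the `info.yaml` default, and the two example contexts are hypothetical names chosen for illustration, not the actual qa/tasks/ceph code. It only shows the idea that nothing under the archive directory should be touched when `ctx.archive` is unset.

```python
import os
from types import SimpleNamespace


def archive_file_to_update(ctx, filename="info.yaml"):
    """Return the archive file path to update, or None when there is no archive.

    Hypothetical helper for illustration; the filename default is an assumption.
    """
    archive = getattr(ctx, "archive", None)
    if archive is None:
        # Interactive run: nothing was archived, so there is nothing to touch.
        return None
    return os.path.join(archive, filename)


# Hypothetical contexts: one interactive (no archive), one with an archive dir.
interactive_ctx = SimpleNamespace(archive=None)
scheduled_ctx = SimpleNamespace(archive="/tmp/archive")
```

A caller would skip the update entirely whenever the helper returns None.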
Neha Ojha
4a6f53afe6
Merge pull request #37902 from pdvian/wip-clog-health-detail
mon: Log "ceph health detail" periodically in cluster log

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-12 15:02:52 -08:00
kyr
6f75704a2e
Merge pull request #38029 from kshtsk/wip-cephadm-bootstrap-remote
qa/task/cephadm: run cephadm only on bootstrap_remote
2020-11-12 14:19:47 +01:00
Kiefer Chang
7df12da184
mgr/dashboard: fix cephadm e2e test failure on deleting OSDs
- Adapt the OSD ID column change in this PR:
  https://github.com/ceph/ceph/pull/37506
- Turn off Cypress retries because it makes the failure log misleading.

Fixes: https://tracker.ceph.com/issues/48209
Signed-off-by: Kiefer Chang <kiefer.chang@suse.com>
2020-11-12 17:20:48 +08:00
Kefu Chai
4dc60e9ece
Merge pull request #37922 from mgfritch/project-kubic
cephadm: install podman from the Kubic project

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-11-12 10:40:24 +08:00
Kefu Chai
dfe9fce746
Merge pull request #35865 from tchaikov/wip-crimson-recovery-qa
qa/suites: add crimson-rados suite

Reviewed-by: Samuel Just <sjust@redhat.com>
2020-11-12 09:52:33 +08:00
Xiubo Li
871ac6b376 qa/tasks/vstart_runner.py: disable the ns unsharing if using localhost
If the -l option is specified, as in:

$ MDS=3 MON=3 MGR=1 OSD=3 ../src/vstart.sh -n -l

The ceph cluster will use the localhost IP address as the mon host,
and the ceph-fuse daemon inside the network namespace container
then cannot connect to it.

Fixes: https://tracker.ceph.com/issues/45342
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-11-12 08:54:57 +08:00
Prashant D
f45712c190 mon: Log "ceph health detail" periodically in cluster log
change mon_health_to_clog_interval from 1_hr -> 10_min to
log health summary or detail more frequently.

Fixes: https://tracker.ceph.com/issues/48042

Signed-off-by: Prashant Dhange <pdhange@redhat.com>
2020-11-11 19:08:51 -05:00
David Zafman
870bde04a5 test: Changes based on code review comments
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-11 15:31:26 -08:00
David Zafman
93373746f5 osd test: Delay reporting until mon_warn_older_version_delay has passed
Move release notes description to 16.0.0 and update
Update documentation

Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-11 15:10:11 -08:00
David Zafman
9d988c3dbc test: Simple test case for version health warning
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-11 15:10:11 -08:00
Kyr Shatskyy
f75b00984e qa/task/cephadm: run cephadm only on bootstrap_remote
Because not every node in the teuthology config necessarily
has a cephadm-related role, we should not execute
cephadm on an arbitrary node.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
2020-11-11 23:17:59 +01:00
Patrick Donnelly
d2769ff180
Merge PR #37978 into master
* refs/pull/37978/head:
	qa: fix broken links in multimds suite

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2020-11-11 11:07:27 -08:00
Xiubo Li
4f1817aa8f qa: switch to 'osdop_read' instead of 'op_r'
'op_r' counts every op that carries the CEPH_OSD_FLAG_READ flag, which
includes opcodes that do not actually read data, such as CEPH_OSD_OP_STAT.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-11-11 10:13:51 +08:00
David Zafman
9929588ce3
Merge pull request #37775 from dzafman/wip-upgrade-mimic-fix
test: Remove mimic-x* upgrade testing in master

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2020-11-10 16:27:43 -08:00
Mykola Golub
9c9b14fab6 qa/workunits/rbd: improve regex for parsing ps output
On some platforms the pid may be indented with spaces.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2020-11-10 17:16:00 +00:00
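
To illustrate the point of 9c9b14fab6: a small Python sketch of a pattern that tolerates leading spaces before the PID. The real workunit is a shell script, so this is not its code; the `PS_LINE` pattern, the `parse_ps_line` helper, and the "PID then command" line layout are assumptions made for the example.

```python
import re

# Optional leading whitespace, then the PID, then the rest of the line.
PS_LINE = re.compile(r"^\s*(\d+)\s+(.*)$")


def parse_ps_line(line):
    """Return (pid, command) for a ps-style line, or None if it has no PID."""
    match = PS_LINE.match(line)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)
```

Without the leading `\s*`, a right-aligned PID such as `"  1234 rbd-nbd ..."` would not match a pattern anchored at the start of the line.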
Mykola Golub
4cda97a822 rbd-nbd: when unmapping or detaching by device try to find process
For `detach`, failing to find the process is fatal, while `unmap`
will still try to send a disconnect to the device.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2020-11-10 17:15:59 +00:00
Sridhar Seshasayee
ce5d0b37d2 qa/suite/rados: Introduce "scheduler" directory within rados perf suite.
Introduce a "scheduler" directory under the rados:perf tree to allow the perf
suite to specify tests with the default scheduler (WPQ) and also with
the dmClock scheduler. One specification also overrides the number of
shards (1) and the number of threads per shard (16), in addition to testing
the default settings. This allows testing and performance benchmarking
with the new proposal to use one shard and multiple threads per shard with
the dmClock scheduler.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2020-11-10 19:22:53 +05:30
Rishabh Dave
ce32cf4f81 qa/cephfs: add code for when config is None in __init__
When tests are launched with kernel client using vstart_runner.py,
config is None and, therefore, the call "config.get()" leads to a crash.
Assigning None to self.rbytes is important since leaving it undefined will
lead to a crash, because the code executed later assumes that self.rbytes
is defined.

Fixes: https://tracker.ceph.com/issues/48147
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2020-11-10 15:47:24 +05:30
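
A hedged sketch of the pattern described in ce32cf4f81. `MountSketch`, its `mount_options` method, and the `"rbytes"` config key are hypothetical stand-ins rather than the real qa/tasks/cephfs classes; the sketch only shows guarding against `config is None` while still defining `self.rbytes` on every path.

```python
class MountSketch:
    """Hypothetical stand-in for a mount class; not the real qa code."""

    def __init__(self, config=None):
        # config can be None, so guard before calling .get() on it.
        if config is None:
            # Still define the attribute: later code reads it unconditionally.
            self.rbytes = None
        else:
            self.rbytes = config.get("rbytes", None)

    def mount_options(self):
        # Would raise AttributeError if __init__ had left rbytes undefined.
        if self.rbytes is None:
            return []
        return ["rbytes"] if self.rbytes else ["norbytes"]
```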
Kefu Chai
33da4a5a05
Merge pull request #37997 from rhcs-dashboard/48155-fix-api-test-full-health
mgr/dashboard: fix API test: test_full_health

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2020-11-10 17:53:50 +08:00
Alfonso Martínez
6add853fe4 mgr/dashboard: fix API test: test_full_health
Fixes: https://tracker.ceph.com/issues/48155
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
2020-11-10 09:29:45 +01:00
David Zafman
410e230d09 test: Fix race in TEST_recovery_scrub test
Fixes: https://tracker.ceph.com/issues/47930

Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-10 00:45:13 +00:00
David Zafman
ada78607e6
Merge pull request #37941 from dzafman/wip-48077
Allowing scrub configs begin_day/end_day to include 7 and begin_hour/end_hour to include 24 is confusing

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-09 15:12:34 -08:00
David Zafman
d3cc647583 osd: Eliminate day of week 7 and hour 24
Add test case for permitted hours to make sure scrub doesn't start
Remove permitted hours in extended sleep test

Fixes: https://tracker.ceph.com/issues/48077

Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-09 22:47:00 +00:00
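
A rough Python sketch of what a begin/end window check looks like once day 7 and hour 24 are rejected, as d3cc647583 describes. This is not the OSD's C++ code: the function names are hypothetical, and the semantics (exclusive end, wrap-around when begin is greater than end, equal bounds meaning "always permitted") are assumptions made for the example.

```python
def validate_bounds(begin, end, upper):
    """Accept only values in [0, upper): upper=24 for hours, 7 for weekdays."""
    for value in (begin, end):
        if not 0 <= value < upper:
            raise ValueError("value {} is outside 0..{}".format(value, upper - 1))


def in_window(now, begin, end, upper):
    """Return True if `now` falls inside the [begin, end) window."""
    validate_bounds(begin, end, upper)
    if begin == end:
        # Assumed convention: equal bounds place no restriction.
        return True
    if begin < end:
        return begin <= now < end
    # The window wraps around the end of the day (or week).
    return now >= begin or now < end
```

For instance, `in_window(23, 22, 2, 24)` covers a window from 22:00 to 02:00, while passing 24 as a bound raises `ValueError` instead of silently acting as an alias for 0.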
Kefu Chai
154830be84
Merge pull request #37928 from tchaikov/wip-qa-ceph-ctx-archive
qa/tasks/ceph: extract update_archive_setting()

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2020-11-09 23:48:30 +08:00
Rishabh Dave
eb70274fa2 qa/mgr: mgr_test_case: raise SkipTest instead of calling skipTest()
skipTest() is being used as a class method, which is not correct since
it's an instance method. Replace the call to skipTest() with raising
SkipTest instead.

Fixes: https://tracker.ceph.com/issues/48152
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2020-11-09 20:49:42 +05:30
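
To illustrate eb70274fa2 with the standard library only: `unittest.TestCase.skipTest()` is an instance method, so a classmethod such as `setUpClass` has no instance to call it on, whereas raising `unittest.SkipTest` works from either context. The class name, the `REQUIRED_MGRS` and `available_mgrs` attributes, and `run_suite` below are hypothetical and not taken from mgr_test_case.py.

```python
import unittest


class MgrLikeTestCase(unittest.TestCase):
    # Hypothetical requirement and cluster state, for illustration only.
    REQUIRED_MGRS = 2
    available_mgrs = 1

    @classmethod
    def setUpClass(cls):
        # cls.skipTest("...") would fail here: skipTest() needs an instance.
        if cls.available_mgrs < cls.REQUIRED_MGRS:
            raise unittest.SkipTest(
                "need {} mgrs, have {}".format(cls.REQUIRED_MGRS,
                                               cls.available_mgrs))

    def test_something(self):
        self.assertTrue(True)


def run_suite():
    """Run the test class and return the collected TestResult."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(MgrLikeTestCase)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

Running the suite records the class as skipped rather than as an error or failure.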
Patrick Donnelly
4129ed92a2
qa: fix broken links in multimds suite
Fixes: #37899
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-07 11:16:29 -08:00
Patrick Donnelly
4052b11f18
Merge PR #36998 into master
* refs/pull/36998/head:
	doc/mgr/volumes: Document authorize/deauthorize cli commands
	qa/tasks/cephfs: test `fs subvolume authorize/deauthorize`
	mgr/volumes: Allow/deny auth IDs access to FS subvolumes
	mon/MonCap: allow 'profile mgr' to create/update/del auth IDs

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-06 20:51:54 -08:00
Patrick Donnelly
0e6f238ce1
Merge PR #37618 into master
* refs/pull/37618/head:
	mds: throttle cap acquisition via readdir

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-06 20:51:06 -08:00
Ramana Raja
7c98dc1ad3 qa/tasks/cephfs: test fs subvolume authorize/deauthorize
Fixes: https://tracker.ceph.com/issues/40401
Signed-off-by: Ramana Raja <rraja@redhat.com>
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2020-11-05 15:29:48 +05:30
Patrick Donnelly
254253780d
Merge PR #37869 into master
* refs/pull/37869/head:
	qa: for the latest kclient it will also return EIO

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-04 21:03:02 -08:00
Patrick Donnelly
0939625530
qa: avoid using ceph.dir.entries for empty check
This avoids a bug [1] in the kernel client.

[1] https://tracker.ceph.com/issues/48104

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 15:43:04 -08:00
David Zafman
ef47a3e708 test: set mon_allow_pool_size_one for consistency with original test intention
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-03 21:49:00 +00:00
Patrick Donnelly
89a1817134
qa: do not test dir size equals rbytes
Avoiding this failure:

	2020-11-03T03:11:59.980 INFO:tasks.cephfs_test_runner:test_subvolume_resize_fail_quota_lt_used_size_no_shrink (tasks.cephfs.test_volumes.TestSubvolumes) ... FAIL
	2020-11-03T03:11:59.981 INFO:tasks.cephfs_test_runner:
	2020-11-03T03:11:59.981 INFO:tasks.cephfs_test_runner:======================================================================
	2020-11-03T03:11:59.981 INFO:tasks.cephfs_test_runner:FAIL: test_subvolume_resize_fail_quota_lt_used_size_no_shrink (tasks.cephfs.test_volumes.TestSubvolumes)
	2020-11-03T03:11:59.982 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-11-03T03:11:59.982 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
	2020-11-03T03:11:59.982 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-pdonnell-testing-20201102.231317/qa/tasks/cephfs/test_volumes.py", line 1310, in test_subvolume_resize_fail_quota_lt_used_size_no_shrink
	2020-11-03T03:11:59.982 INFO:tasks.cephfs_test_runner:    self.assertEqual(usedsize, susedsize)
	2020-11-03T03:11:59.982 INFO:tasks.cephfs_test_runner:AssertionError: 8912896 != 1

From: /ceph/teuthology-archive/pdonnell-2020-11-03_02:22:55-fs:volumes-wip-pdonnell-testing-20201102.231317-distro-basic-smithi/5585383/teuthology.log

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:31 -08:00
Patrick Donnelly
319dfe9119
qa: disable check for value kclient fails to get
See-also: https://tracker.ceph.com/issues/48075
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:28 -08:00
Patrick Donnelly
59451923d3
qa: leave one standby available to avoid warning
Four file systems will use all MDS and generate this warning:

	2020-11-02T03:48:33.407 INFO:teuthology.orchestra.run.smithi003.stdout:2020-11-02T03:24:21.817337+0000 mon.a (mon.0) 481 : cluster [WRN] Health check failed: insufficient standby MDS daemons available (MDS_INSUFFICIENT_STANDBY)

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:25 -08:00
Patrick Donnelly
492aa452e0
qa: do not check access timestamps
kclient does not update these like ceph-fuse.

This avoids failures like:

	2020-11-01T08:05:53.115 INFO:tasks.cephfs_test_runner:test_subvolume_clone_in_progress_getpath (tasks.cephfs.test_volumes.TestSubvolumeSnapshotClones) ... FAIL
	2020-11-01T08:05:53.115 INFO:tasks.cephfs_test_runner:
	2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:======================================================================
	2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:FAIL: test_subvolume_clone_in_progress_getpath (tasks.cephfs.test_volumes.TestSubvolumeSnapshotClones)
	2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
	2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 2295, in test_subvolume_clone_in_progress_getpath
	2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner:    self._verify_clone(subvolume, snapshot, clone)
	2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 154, in _verify_clone
	2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner:    self._verify_clone_attrs(path1, path2)
	2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 107, in _verify_clone_attrs
	2020-11-01T08:05:53.118 INFO:tasks.cephfs_test_runner:    self.assertEqual(sval, cval)
	2020-11-01T08:05:53.118 INFO:tasks.cephfs_test_runner:AssertionError: 1604217951 != 1604217941

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:22 -08:00
Patrick Donnelly
a4941c1d5b
qa: ignore skip errors for kclient
To avoid this failure:

	2020-11-01T07:21:35.117 INFO:tasks.cephfs_test_runner:test_volume_without_namespace_isolation (tasks.cephfs.test_volume_client.TestVolumeClient) ... ok
	2020-11-01T07:21:35.118 INFO:tasks.cephfs_test_runner:
	2020-11-01T07:21:35.118 INFO:tasks.cephfs_test_runner:======================================================================
	2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:FAIL: test_evict_client (tasks.cephfs.test_volume_client.TestVolumeClient)
	2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:Requires FUSE client to inject client metadata
	2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:Ran 18 tests in 732.749s
	2020-11-01T07:21:35.120 INFO:tasks.cephfs_test_runner:
	2020-11-01T07:21:35.120 INFO:tasks.cephfs_test_runner:FAILED (failures=1)

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:19 -08:00
Patrick Donnelly
f9ca58a3f3
qa: add more clients for test_volume_client
It requires 4.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:16 -08:00
Patrick Donnelly
f033cd3f20
qa: accept client returning pool id for layout
Resolving this failure:

	2020-10-30T22:52:25.898 INFO:tasks.cephfs_test_runner:test_subvolume_group_create_with_desired_data_pool_layout (tasks.cephfs.test_volumes.TestSubvolumeGroups) ... FAIL
	2020-10-30T22:52:25.898 INFO:tasks.cephfs_test_runner:
	2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:FAIL: test_subvolume_group_create_with_desired_data_pool_layout (tasks.cephfs.test_volumes.TestSubvolumeGroups)
	2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
	2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 568, in test_subvolume_group_create_with_desired_data_pool_layout
	2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:    self.assertEqual(desired_pool, new_pool)
	2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:AssertionError: '34' != 'new_pool'
	2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:- 34
	2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:+ new_pool

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:13 -08:00
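
The failure quoted in f033cd3f20 (`'34' != 'new_pool'`) suggests that a client may report the layout pool by numeric id rather than by name. A minimal sketch of a comparison that accepts either form follows; `layout_pool_matches` and its parameters are hypothetical, not the helper used in test_volumes.py.

```python
def layout_pool_matches(reported, pool_name, pool_id):
    """Accept the pool either by name or by its numeric id rendered as text."""
    return reported == pool_name or reported == str(pool_id)
```

With this helper, both `layout_pool_matches("new_pool", "new_pool", 34)` and `layout_pool_matches("34", "new_pool", 34)` hold, while an unrelated id or name does not.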
Patrick Donnelly
6afc98bd20
qa: ignore expected warning about client missing features
2020-10-30T22:54:14.842 INFO:teuthology.orchestra.run.smithi168.stdout:2020-10-30T22:43:34.067194+0000 mds.a (mds.0) 1 : cluster [WRN] evicting session smithi168:0 (8757), missing required features '0x0000000000000400'

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:10 -08:00
Patrick Donnelly
eb3d38ccf7
qa: do not fail on skip
Avoiding these failures:

	2020-10-30T22:49:00.248 INFO:tasks.cephfs_test_runner:test_client_release_bug (tasks.cephfs.test_client_limits.TestClientLimits) ...
	2020-10-30T22:49:00.248 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:49:00.248 INFO:tasks.cephfs_test_runner:FAIL: test_client_cache_size (tasks.cephfs.test_client_limits.TestClientLimits)
	2020-10-30T22:49:00.249 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.249 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
	2020-10-30T22:49:00.249 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:49:00.250 INFO:tasks.cephfs_test_runner:FAIL: test_client_oldest_tid (tasks.cephfs.test_client_limits.TestClientLimits)
	2020-10-30T22:49:00.250 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.251 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
	2020-10-30T22:49:00.251 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:FAIL: test_client_release_bug (tasks.cephfs.test_client_limits.TestClientLimits)
	2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
	2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:Ran 8 tests in 368.825s
	2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:
	2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:FAILED (failures=3)
	2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:
	2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:FAIL: test_client_cache_size (tasks.cephfs.test_client_limits.TestClientLimits)
	2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
	2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:FAIL: test_client_oldest_tid (tasks.cephfs.test_client_limits.TestClientLimits)
	2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
	2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:FAIL: test_client_release_bug (tasks.cephfs.test_client_limits.TestClientLimits)
	2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T22:49:00.256 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:06 -08:00
Patrick Donnelly
3e25a080bc
qa: fix mount directory
Otherwise we get this error:

	2020-10-30T03:57:44.256 INFO:teuthology.run_tasks:Running task exec...
	2020-10-30T03:57:44.265 INFO:teuthology.task.exec:Executing custom commands...
	2020-10-30T03:57:44.266 INFO:teuthology.task.exec:Running commands on role client.0 host ubuntu@smithi095.front.sepia.ceph.com
	2020-10-30T03:57:44.266 INFO:teuthology.orchestra.run.smithi095:> sudo TESTDIR=/home/ubuntu/cephtest bash -c 'cd $TESTDIR/mnt.* && dd if=/dev/zero of=./foo count=100'
	2020-10-30T03:57:44.380 INFO:teuthology.orchestra.run.smithi095.stderr:bash: line 0: cd: too many arguments

From: /ceph/teuthology-archive/pdonnell-2020-10-30_02:26:51-fs-master-distro-basic-smithi/5573179/teuthology.log

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:03 -08:00
Patrick Donnelly
c495b23090
qa: rename distro link to distros
This needs to become any overrides present in "mount". Otherwise kclient
distro overrides have no effect.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:01:00 -08:00
Patrick Donnelly
c099ff9ebf
qa: accept eio for mount failure
Example:

	2020-10-30T03:37:33.752 INFO:tasks.cephfs_test_runner:======================================================================
	2020-10-30T03:37:33.752 INFO:tasks.cephfs_test_runner:FAIL: test_mount_mon_and_osd_caps_present_mds_caps_absent (tasks.cephfs.test_multifs_auth.TestClientsWithoutAuth)
	2020-10-30T03:37:33.752 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
	2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
	2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_multifs_auth.py", line 311, in test_mount_mon_and_osd_caps_present_mds_caps_absent
	2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner:    self.check_that_mount_failed_for_right_reason(retval[2])
	2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_multifs_auth.py", line 269, in check_that_mount_failed_for_right_reason
	2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner:    raise AssertionError('can\'t find expected set of words in the '
	2020-10-30T03:37:33.754 INFO:tasks.cephfs_test_runner:AssertionError: can't find expected set of words in the stderr
	2020-10-30T03:37:33.754 INFO:tasks.cephfs_test_runner:self.errmsgs - ('permission denied', 'no mds server is up or the cluster is laggy', 'no such file or directory')
	2020-10-30T03:37:33.754 INFO:tasks.cephfs_test_runner:stderr - mount error 5 = input/output error

From: /ceph/teuthology-archive/pdonnell-2020-10-30_02:26:51-fs-master-distro-basic-smithi/5573109/teuthology.log

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:57 -08:00
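
A sketch of the check that c099ff9ebf relaxes, under stated assumptions: the first three messages are copied from the `self.errmsgs` tuple in the log above, while the fourth entry and the function name `mount_failed_for_right_reason` are assumptions based on the quoted stderr line (`mount error 5 = input/output error`), not the actual test_multifs_auth.py code.

```python
EXPECTED_ERRMSGS = (
    "permission denied",
    "no mds server is up or the cluster is laggy",
    "no such file or directory",
    # Assumed addition: the kernel client may report EIO instead.
    "input/output error",
)


def mount_failed_for_right_reason(stderr):
    """Return True if stderr contains one of the expected failure messages."""
    lowered = stderr.lower()
    return any(msg in lowered for msg in EXPECTED_ERRMSGS)
```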
Patrick Donnelly
4ca8aaafa3
qa: unmount volumes before removal
To avoid potential failures/hangs in umount.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:53 -08:00
Patrick Donnelly
83c361ff04
qa: some sessionmap tests require fuse
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:50 -08:00
Patrick Donnelly
2df7caae4b
qa: remove obsolete test
Since the work in [1], the kclient can be killed with network namespaces
without powercycling.

[1] https://tracker.ceph.com/issues/44044

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:47 -08:00
Patrick Donnelly
6d880eac3f
qa: do not fail on test skip
test_client_config only works with ceph-fuse.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:45 -08:00
Patrick Donnelly
a6ac673fc0
qa: use more OSDs for workload tests
Running out of space / slow requests are less likely this way.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:42 -08:00
Patrick Donnelly
726c00f069
qa: use 4 clients for volumes testing
The test_volume_client.py tests require 4 clients.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:37 -08:00
Patrick Donnelly
9861420771
qa: relocate tasks
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:37 -08:00
Patrick Donnelly
2e493a1d74
qa: rename workload suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:37 -08:00
Patrick Donnelly
9cacc92756
qa: relocate new tasks from fs:basic_functional
These were not tested with kcephfs before; let's see if there are any
bugs!

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:37 -08:00
Patrick Donnelly
326f472950
qa: remove kcephfs suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
dbdb6537e2
qa: move kcephfs:mixed-clients to fs
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
853ce2a872
qa: remove dead kcephfs:cephfs suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
276de433d6
qa: merge snap tests for kcephfs/fs
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
218daa9a38
qa: add new fs:shell sub-suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
5458e66246
qa: remove task-less kcephfs:recovery suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
024d5b37f4
qa: add new fs:volumes sub-suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:36 -08:00
Patrick Donnelly
73c8c900bc
qa: merge fs:basic_functional and kcephfs:recovery
These functional tests can be run with both clients.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
5bbd97fac1
qa: move kcephfs:thrash to fs:thrash
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
742d75a7fe
qa: reorganize thrash tasks
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
a9b7df2ebc
qa: move multifs/failover tests
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
e8642c742a
qa: run norstats workunit for kclient
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
99bfa7cfe8
qa: merge workload tests from kcephfs with fs
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
ca0d99b135
qa: use consistent syntax
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
43875d7bee
qa: remove inline testing
inline feature is now deprecated and TBR.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:35 -08:00
Patrick Donnelly
49abce2290
qa: move log-rotate config to common yaml
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:34 -08:00
Patrick Donnelly
f719ebb305
qa: remove fuse default permissions config
It is the default.

Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:34 -08:00
Patrick Donnelly
c4eaceaead
qa: fix whitespace
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-03 13:00:34 -08:00
Kefu Chai
43f2738a0e qa/tasks/ceph: extract update_archive_setting()
for better readability

also update the comment in `ceph_crash()` to reflect the changed
settings

Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-11-03 12:01:14 +08:00
Jason Dillaman
9ca496a7d1 qa/suite/rbd: simple migration import test case
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2020-11-02 19:58:55 -05:00
Jason Dillaman
81f9b91d62 qa/tasks: support explicit disk configuration for qemu task
The 'disks' key will now be treated as a dictionary where all previous
global settings can be individually applied. Additionally, a disk can be
pre-created and provided for use by qemu.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2020-11-02 19:58:55 -05:00
Michael Fritch
22605b7e98
qa/suites/rados/cephadm: install podman from the Kubic project
ppa:projectatomic is no longer maintained; updates are now provided via
the Kubic project.

Fixes: https://tracker.ceph.com/issues/48072
Signed-off-by: Michael Fritch <mfritch@suse.com>
2020-11-02 08:54:35 -07:00
Kefu Chai
038750c78a
Merge pull request #37327 from kamoltat/wip-mgr-progress-global-event
mgr/progress: Global Recovery Event in ceph -s

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2020-10-31 11:43:27 +08:00
Lenz Grimmer
587a84049b
Merge pull request #35769 from Codom/master
mgr/dashboard: Added Versioning to the REST API

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2020-10-30 09:50:05 +01:00
Xiubo Li
61a4b8c670 qa: for the latest kclient it will also return EIO
Fixes: https://tracker.ceph.com/issues/45100
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-29 21:30:36 -04:00
Casey Bodley
897f96eef1
Merge pull request #34866 from nickjanus/issue-45193
rgw-admin: fixes BucketInfo for missing buckets

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2020-10-29 12:00:52 -04:00
Mykola Golub
5e10672e4e
Merge pull request #37884 from dillaman/wip-48032
qa/workunits/rbd: fix permission issue when removing mirror peer

Reviewed-by: Mykola Golub <mgolub@suse.com>
2020-10-29 15:48:35 +02:00
Jason Dillaman
caaba8eabe qa/workunits/rbd: fix permission issue when removing mirror peer
Fixes: https://tracker.ceph.com/issues/48032
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2020-10-28 14:06:57 -04:00
Christopher Odom
dcac50e322 mgr/dashboard: Added Versioning to the REST API
Versioning is handled by the RESTController decorators.
It works by adding a version attribute to the endpoint object,
which will be checked by the _request_wrapper against the requested
version before the controller method is dispatched.

This commit also updates all of the testing to support
version vendor mime types, as well as adding an http
interceptor to add versioned mime types to all frontend
requests.

Fixes: https://tracker.ceph.com/issues/40909
Signed-off-by: Avan Thakkar  <athakkar@redhat.com>
2020-10-28 19:33:39 +05:30
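The version check described in the commit above can be sketched roughly as follows. This is a minimal illustration, not the dashboard's actual code: the `versioned` decorator, the `request_wrapper` function, the `VersionMismatch` exception, and the exact vendor MIME pattern are all assumptions made for the example.

```python
import re

# Assumed shape of a versioned vendor MIME type; the real pattern may differ.
MIME_RE = re.compile(r"application/vnd\.ceph\.api\.v(\d+)\.(\d+)\+json")


class VersionMismatch(Exception):
    """Raised when the requested API version does not match the endpoint."""


def versioned(version):
    """Hypothetical decorator: attach a (major, minor) version to an endpoint."""
    def decorator(func):
        func.version = version
        return func
    return decorator


def request_wrapper(func, accept_header, *args, **kwargs):
    """Compare the requested version with the endpoint's before dispatching."""
    match = MIME_RE.fullmatch(accept_header.strip())
    if match is None:
        raise VersionMismatch("no versioned MIME type in Accept header")
    requested = (int(match.group(1)), int(match.group(2)))
    if requested != func.version:
        raise VersionMismatch(f"requested {requested}, endpoint is {func.version}")
    return func(*args, **kwargs)


@versioned((1, 0))
def list_pools():
    # Stand-in for a controller method.
    return ["rbd"]
```

Calling `request_wrapper(list_pools, "application/vnd.ceph.api.v1.0+json")` dispatches to the endpoint, while a `v2.0` header raises `VersionMismatch` before the method runs.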
Neha Ojha
df7adbf387 qa/tasks/ceph_manager.py: remove redundant quorum status logging
2020-10-21T03:42:45.985 INFO:teuthology.orchestra.run.smithi114:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph quorum_status
2020-10-21T03:42:58.574 INFO:teuthology.orchestra.run.smithi114.stdout:{"election_epoch":1650,"quorum":[0,2],"quorum_names":["a","c"],"quorum_leader_name":"a","quorum_age":0,"features":{"quorum_con":"4540138297136906239","quorum_mon":["kraken","luminous","mimic","osdmap-prune","nautilus","octopus","pacific","elector-pinging"]},"monmap":{"epoch":1,"fsid":"807c36f1-9e85-4fa3-81fc-95915ab50584","modified":"2020-10-21T00:34:48.421341Z","created":"2020-10-21T00:34:48.421341Z","min_mon_release":16,"min_mon_release_name":"pacific","election_strategy":3,"disallowed_leaders":"","features":{"persistent":["kraken","luminous","mimic","osdmap-prune","nautilus","octopus","pacific","elector-pinging"],"optional":[]},"mons":[{"rank":0,"name":"a","public_addrs":{"addrvec":[{"type":"v1","addr":"172.21.15.114:6789","nonce":0}]},"addr":"172.21.15.114:6789/0","public_addr":"172.21.15.114:6789/0","priority":0,"weight":0},{"rank":1,"name":"b","public_addrs":{"addrvec":[{"type":"v1","addr":"172.21.15.133:6789","nonce":0}]},"addr":"172.21.15.133:6789/0","public_addr":"172.21.15.133:6789/0","priority":0,"weight":0},{"rank":2,"name":"c","public_addrs":{"addrvec":[{"type":"v1","addr":"172.21.15.114:6790","nonce":0}]},"addr":"172.21.15.114:6790/0","public_addr":"172.21.15.114:6790/0","priority":0,"weight":0}]}}
2020-10-21T03:42:58.589 INFO:tasks.mon_thrash.ceph_manager:quorum_status is {"election_epoch":1650,"quorum":[0,2],"quorum_names":["a","c"],"quorum_leader_name":"a","quorum_age":0,"features":{"quorum_con":"4540138297136906239","quorum_mon":["kraken","luminous","mimic","osdmap-prune","nautilus","octopus","pacific","elector-pinging"]},"monmap":{"epoch":1,"fsid":"807c36f1-9e85-4fa3-81fc-95915ab50584","modified":"2020-10-21T00:34:48.421341Z","created":"2020-10-21T00:34:48.421341Z","min_mon_release":16,"min_mon_release_name":"pacific","election_strategy":3,"disallowed_leaders":"","features":{"persistent":["kraken","luminous","mimic","osdmap-prune","nautilus","octopus","pacific","elector-pinging"],"optional":[]},"mons":[{"rank":0,"name":"a","public_addrs":{"addrvec":[{"type":"v1","addr":"172.21.15.114:6789","nonce":0}]},"addr":"172.21.15.114:6789/0","public_addr":"172.21.15.114:6789/0","priority":0,"weight":0},{"rank":1,"name":"b","public_addrs":{"addrvec":[{"type":"v1","addr":"172.21.15.133:6789","nonce":0}]},"addr":"172.21.15.133:6789/0","public_addr":"172.21.15.133:6789/0","priority":0,"weight":0},{"rank":2,"name":"c","public_addrs":{"addrvec":[{"type":"v1","addr":"172.21.15.114:6790","nonce":0}]},"addr":"172.21.15.114:6790/0","public_addr":"172.21.15.114:6790/0","priority":0,"weight":0}]}}

Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-10-27 21:14:54 +00:00
Neha Ojha
1fa25e0780 qa/suites/rados/dashboard: ignore MON_DOWN
This warning is expected during:

2020-10-27T16:29:40.549+0000 7fa1f0927700 10 mon.c@2(peon) e1 handle_auth_request haven't formed initial quorum, EBUSY

and is cleared after:

2020-10-27T16:29:42.597+0000 7fe7b3ada700  7 mon.a@1(peon).log v7 update_from_paxos applying incremental log 7 2020-10-27T16:29:42.319274+0000 mon.b (mon.0) 76 : cluster [INF] mon.b is new leader, mons b,a,c in quorum (ranks 0,1,2)
2020-10-27T16:29:42.597+0000 7fe7b3ada700  7 mon.a@1(peon).log v7 update_from_paxos applying incremental log 7 2020-10-27T16:29:42.326711+0000 mon.b (mon.0) 82 : cluster [INF] Health check cleared: MON_DOWN (was: 1/3 mons down, quorum b,a)

Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-10-27 21:14:54 +00:00
Jason Dillaman
0db501b2e3
Merge pull request #37273 from trociny/wip-rbd-nbd-persist
rbd-nbd: allow to restart daemon without destroying device

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2020-10-26 07:57:47 -04:00
Lenz Grimmer
10cb89241f
Merge pull request #37568 from s0nea/wip-dashboard-qa-linting
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2020-10-26 11:27:23 +01:00
Patrick Donnelly
c569036c5d
Merge PR #37629 into master
* refs/pull/37629/head:
	qa/cephfs: add session_timeout option support
	qa/cephfs: move the cephfs's operations setting to create()
	qa/cephfs: add 'cephfs:' section support

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-25 16:26:36 -07:00
Patrick Donnelly
e059e9666c
Merge PR #37652 into master
* refs/pull/37652/head:
	qa/tasks: tear down the background process before unmounting
	qa/tasks: switch to _kill_background() helper to terminate the daemons

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2020-10-25 16:25:24 -07:00
David Zafman
954b21fb13 test: Remove mimic-x* upgrade testing in master
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-10-24 00:01:24 +00:00
Neha Ojha
27871caa7f
Merge pull request #37678 from ideepika/wip-drop-hammer-from-qa
qa: drop hammer branch qa tests

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-10-23 08:35:02 -07:00
Lenz Grimmer
1e131d5a99
Merge pull request #36449 from s0nea/wip-dashboard-osd-flags-indiv
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
2020-10-23 16:45:28 +02:00
Xiubo Li
0422673b61 qa/cephfs: add session_timeout option support
When the MDS revokes the Fwbl caps, the clients need to flush
their dirty data back to the OSDs, but the flush may overload and
slow down the OSDs, so it may take more than 60 seconds to
finish. The MDS daemons will then report WRN messages.

For the teuthology test cases, let's just increase the timeout
value to make it work.

Fixes: https://tracker.ceph.com/issues/47565
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-23 14:27:37 +08:00
Xiubo Li
cb8081ce7f qa/cephfs: move the cephfs's operations setting to create()
Fixes: https://tracker.ceph.com/issues/47565
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-23 14:27:37 +08:00
Xiubo Li
3b5303482f qa/cephfs: add 'cephfs:' section support
Fixes: https://tracker.ceph.com/issues/47565
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-23 14:27:30 +08:00
Patrick Donnelly
24137e1ffa
Merge PR #36544 into master
* refs/pull/36544/head:
	vstart_runner: omit result line only when testcases are run one by one
	vstart_runner: don't quit running testsuite on a test failure

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-22 12:52:01 -07:00
Kamoltat
2af2afa5e9 mgr/progress: Global Recovery Event in ceph -s
Modified the progress module and BaseMgrModule to
support the Global Recovery Event, adding more arguments
to update_progress_event and ceph_update_progress_event.
`ceph -s` now shows only the global recovery event progress.
All sub-events have been moved to `ceph progress`.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2020-10-22 16:44:50 +00:00
Kotresh HR
c0de657d3f mds: throttle cap acquisition via readdir
A trivial "find" command on a large directory hierarchy will cause the
client to receive caps significantly faster than it will release. The
MDS will try to have the client reduce its caps below the
mds_max_caps_per_client limit but the recall throttles prevent it from
catching up to the pace of acquisition. The solution is to throttle
readdir from client. This patch does the same.

The readdir is throttled on the condition that the number of caps
acquired is greater than certain percentage of mds_max_caps_per_client
(default is 10%) and cap acquisition via readdir is certain percentage
of mds_max_caps_per_client (the default is 50%). When the above
condition is met, the readdir request is retried after
'mds_cap_acquisition_throttle_retry_request_timeout' (default is 0.5)
seconds.

Fixes: https://tracker.ceph.com/issues/47307
Signed-off-by: Kotresh HR <khiremat@redhat.com>
2020-10-22 18:56:43 +05:30
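The throttle condition described in the commit above can be sketched as below. This is an illustration of the wording only, not the MDS implementation: the field names `held_caps_ratio` and `readdir_caps_ratio`, the sample `mds_max_caps_per_client` value, and reading "is certain percentage" as "at least that percentage" are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class ThrottleConfig:
    # Illustrative limit; the commit message does not state a value.
    mds_max_caps_per_client: int = 1_000_000
    # Hypothetical names for the two percentages (defaults 10% and 50%).
    held_caps_ratio: float = 0.10
    readdir_caps_ratio: float = 0.50
    # Default of mds_cap_acquisition_throttle_retry_request_timeout, in seconds.
    retry_timeout_s: float = 0.5


def should_throttle_readdir(held_caps, readdir_caps, cfg):
    """Return True when both conditions from the commit message hold."""
    over_held = held_caps > cfg.held_caps_ratio * cfg.mds_max_caps_per_client
    over_readdir = readdir_caps >= cfg.readdir_caps_ratio * cfg.mds_max_caps_per_client
    return over_held and over_readdir


def retry_delay(held_caps, readdir_caps, cfg):
    """Seconds to wait before retrying the readdir, or 0.0 to serve it now."""
    if should_throttle_readdir(held_caps, readdir_caps, cfg):
        return cfg.retry_timeout_s
    return 0.0
```

With a limit of 1000 caps, a client holding 200 caps of which 600 were counted as readdir acquisitions would be asked to retry after 0.5 seconds, while one with only 100 readdir acquisitions would be served immediately.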
Tatjana Dehler
1b67691492 mgr/dashboard/qa: add pylint check
Fixes: https://tracker.ceph.com/issues/47757
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2020-10-22 14:12:14 +02:00
Tatjana Dehler
1474259b65 mgr/dashboard/qa: add isort check
Add isort check to the QA test files and
fix issues.

Fixes: https://tracker.ceph.com/issues/47757
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2020-10-22 14:12:14 +02:00
Tatjana Dehler
bc3d1997bd mgr/dashboard/qa: add flake8 linting
Add flake8 linting to the QA test files and
fix issues.

Fixes: https://tracker.ceph.com/issues/47757
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2020-10-22 14:12:14 +02:00
Xuehan Xu
951f47fc1c qa/suites: add crimson-rados/thrash tests
Signed-off-by: Xuehan Xu <xxhdx1985126@163.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-10-22 11:27:35 +08:00
Tatjana Dehler
3639332f34 mgr/dashboard: assign flags to single OSDs
Add the possibility to assign the flags ['noup',
'nodown', 'noin', 'noout'] to single OSDs.

Fixes: https://tracker.ceph.com/issues/40739
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2020-10-21 15:11:23 +02:00
Lenz Grimmer
7d9d3a60d5
Merge pull request #37416 from tspmelo/wip-rbd-clone
mgr/dashboard: Disable RBD clone action when conditions are not met

Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2020-10-21 15:06:27 +02:00
Xiubo Li
7acd445fcc qa/tasks: tear down the background process before unmounting
If a background process keeps running with the mountpoint
directory open, the unmount will fail with BUSY.

Fixes: https://tracker.ceph.com/issues/46883
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-21 04:02:38 -04:00
Xiubo Li
def177ff3b qa/tasks: switch to _kill_background() helper to terminate the daemons
Fixes: https://tracker.ceph.com/issues/46883
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-21 04:01:35 -04:00
Lenz Grimmer
2437ef8648
Merge pull request #36948 from bk201/wip-46492
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Varsha Rao <varao@redhat.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2020-10-20 12:58:51 +02:00
Patrick Donnelly
008eaca6c1
Merge PR #37529 into master
* refs/pull/37529/head:
	qa: set rados op timeouts for mds/ceph-fuse
	qa: print debug info on mount cleanup
	qa: remove redundant rmr
	qa: use null mode to prevent undesired changes to mountpoint
	qa: unmount all clients before deleting the file system
	osdc: add timeout configs for mons/osds
	common: accept timespan for SaferCond.wait_for

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2020-10-19 19:26:52 -07:00
Kiefer Chang
a9accaeccf mgr/dashboard: support Orchestrator and user-defined Ganesha clusters
This change makes the Dashboard support two types of Ganesha clusters:

- Orchestrator clusters (Since Octopus)
  - Deployed by the Orchestrator.
  - The Dashboard gets the pool/namespace that stores Ganesha
    configuration objects from the Orchestrator.
  - The Dashboard gets the daemons in a cluster from the Orchestrator.

- User-defined clusters (Since Nautilus)
  - Clusters defined by using the `ceph dashboard
    set-ganesha-clusters-rados-pool-namespace` command are treated as
    user-defined clusters.
  - Each daemon has its own RADOS configuration objects. The
    Dashboard uses these objects to deduce daemons.

Fixes: https://tracker.ceph.com/issues/46492
Signed-off-by: Kiefer Chang <kiefer.chang@suse.com>
2020-10-19 19:11:14 +08:00
Kefu Chai
3ec03efc70
Merge pull request #37175 from changchengx/log_start
common: remove log_early configuration option

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-10-19 18:44:29 +08:00
Lenz Grimmer
86f6a89c1b
Merge pull request #37564 from s0nea/wip-dashboard-nfs-daemons-param
mgr/dashboard/qa: consider `daemons` parameter

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
2020-10-19 10:42:22 +02:00
Kefu Chai
2e9db1c936
Merge pull request #37658 from neha-ojha/wip-crimson-suite
qa/crontab/teuthology-cronjobs: add crimson-rados

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-10-19 14:56:00 +08:00
Changcheng Liu
dbdcb2535d common: remove log_early configuration option
Now that log tracking is always enabled in the early phase, there is no
need to keep the "log_early" option, so remove it.

Suggested-by: Kefu Chai <kefu@redhat.com>
Signed-off-by: Changcheng Liu <changcheng.liu@aliyun.com>
2020-10-19 14:30:28 +08:00
Kefu Chai
a40337c2d7
Merge pull request #37635 from jecluis/wip-46816
mon: have 'mon stat' output json as well

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-10-19 14:08:09 +08:00
J. Eric Ivancich
ff342f2f02
Merge pull request #37684 from ivancich/rgw-yummy-orphan-testing
rgw: use yum rather than dnf for teuthology testing of rgw-orphan-list

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2020-10-15 16:14:07 -04:00
J. Eric Ivancich
11a08a5bb8 rgw: use yum rather than dnf for testing rgw-orphan-list
The teuthology testing for rgw-orphan-list needs to install
`s3cmd`. Switch from using dnf to yum to work on a wider variety of
platforms.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
2020-10-15 15:08:45 -04:00
Joshua Schmid
eb94d2b325
Merge pull request #37575 from bk201/wip-47742
qa/suites/rados/cephadm/dashboard: fix creating OSD failure
2020-10-15 15:02:50 +02:00
Deepika Upadhyay
7ef18559cb qa: drop hammer branch qa tests
fixes: https://tracker.ceph.com/issues/47731
Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
2020-10-15 17:32:06 +05:30
Kefu Chai
1d15e8c2d1
Merge pull request #37624 from lxbsz/cram
qa/tasks: add a 'parallel' option support for the cram task

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2020-10-15 18:53:38 +08:00
Kefu Chai
2724a73743
Merge pull request #37601 from tspmelo/wip-rm-wait-for-expected-get-result
qa/mgr: Remove _wait_for_expected_get_result

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2020-10-15 18:12:54 +08:00
Kefu Chai
7c1544c04c cmake: do not always add py3 to TOX_ENVS
Before this change, add_tox_test() always added "py3" to testenv, even if
the caller specified TOX_ENVS explicitly.

After this change, py3 is added only if the caller does not specify any
TOX_ENVS.

This change helps with readability.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-10-14 15:30:37 +08:00
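The defaulting rule from the commit above, restated as a small Python sketch rather than CMake. The function names are hypothetical, and the ordering of environments in the "before" case is an assumption; only the rule itself comes from the commit message.

```python
def tox_envs_before(tox_envs=None):
    """Old behavior: py3 is always present, even with explicit TOX_ENVS."""
    # Assumption: py3 comes first; the commit message does not say.
    return ["py3"] + list(tox_envs or [])


def tox_envs_after(tox_envs=None):
    """New behavior: py3 is only the fallback when nothing is specified."""
    return list(tox_envs) if tox_envs else ["py3"]
```

For a caller passing `["mypy", "flake8"]`, the old rule yields three environments and the new rule yields exactly the two that were requested.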
Mykola Golub
3043d80168 rbd-nbd: make unmap/detach wait for rbd-nbd process to terminate
Signed-off-by: Mykola Golub <mgolub@suse.com>
2020-10-13 18:32:50 +01:00
Patrick Donnelly
d060c9a768
qa: set rados op timeouts for mds/ceph-fuse
Now that the osdc Objecter obeys updates to these configs, let's use
them to avoid having them block forever on operations that may never
complete (or should complete in a timely manner).

Fixes: https://tracker.ceph.com/issues/47734
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-13 10:16:40 -07:00
Patrick Donnelly
f8f607dbd1
qa: print debug info on mount cleanup
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-13 10:16:40 -07:00
Patrick Donnelly
8e7a89009c
qa: remove redundant rmr
The mount.cleanup method will remove the mount point. This `rm -rf` will
always fail (with exit status 0).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-13 10:16:40 -07:00
Patrick Donnelly
2432871a1b
qa: use null mode to prevent undesired changes to mountpoint
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-13 10:16:40 -07:00
Patrick Donnelly
bc25bd70f2
qa: unmount all clients before deleting the file system
Otherwise we have unnecessary timeout waits.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-13 10:16:40 -07:00
Neha Ojha
f0addf0b16 qa/crontab/teuthology-cronjobs: add crimson-rados
Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-10-13 17:00:07 +00:00
Neha Ojha
580710b87a qa/suites/crimson-rados: add .qa helper
Fixes:
OSError: /home/nojha/src/github.com_ceph_ceph_master/qa/suites/crimson-rados/basic/centos_latest.yaml
does not exist (abs /home/nojha/src/github.com_ceph_ceph_master/qa/suites/crimson-rados/basic/centos_latest.yaml)

Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-10-13 15:56:05 +00:00
Yan, Zheng
e76abf517b qa/cephfs: update ephemeral pin tests
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-10-13 20:52:40 +08:00
Joao Eduardo Luis
122388429d
qa/cephtool: test 'mon stat' commands
Signed-off-by: Joao Eduardo Luis <joao@suse.com>
2020-10-13 08:31:24 +00:00
Rishabh Dave
f9f4c78101 vstart_runner: omit result line only when testcases are run one by one
vstart_runner unconditionally omits the result line ("Ran X tests in Y")
generated by unittest. Don't do so when vstart_runner runs an entire
test module at once.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2020-10-13 09:12:45 +05:30
Rishabh Dave
80ecc017e8 vstart_runner: don't quit running testsuite on a test failure
Add an option that makes it possible not to quit running the testsuite on
a test failure. This way users can get a better idea of the current state
of the testsuite and test their own code patches more effectively and
easily.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2020-10-13 09:12:45 +05:30
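The effect of such an option can be illustrated with the standard `unittest` runner's `failfast` flag. This is only a sketch of the general idea; the commit message does not say how vstart_runner implements it, and the `keep_going` parameter and the demo test case are made up for the example.

```python
import io
import unittest


def make_demo_case():
    """Build a two-test case whose first test fails on purpose."""
    class Demo(unittest.TestCase):
        def test_a_fails(self):
            self.fail("intentional failure")

        def test_b_passes(self):
            self.assertTrue(True)
    return Demo


def run_suite(keep_going):
    """Run the demo suite; stop at the first failure unless keep_going is set."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(make_demo_case())
    runner = unittest.TextTestRunner(stream=io.StringIO(), failfast=not keep_going)
    return runner.run(suite)
```

With `keep_going=False` the run stops after the failing test, so `testsRun` is 1. With `keep_going=True` both tests run and the single failure is reported at the end.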
Patrick Donnelly
daa2f85156
Merge PR #29951 into master
* refs/pull/29951/head:
	test: add tests for validating MDS metrics via `perf stats` module
	test: Filesystem class helpers to grow and shrink MDS cluster
	mgr/stats: mds performance stats module
	mds: support sending empty perf metrics to ceph-manager

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-12 18:50:33 -07:00
Patrick Donnelly
473d8e9a60
Merge PR #37593 into master
* refs/pull/37593/head:
	qa/tasks/cephfs: Refactor test_volumes

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-10-12 07:42:13 -07:00
Xiubo Li
833eca15f2 qa/tasks: add a 'parallel' option support for the cram task
For the ceph-iscsi test case we need to run the tests sequentially,
because the client test will depend on the gateway ones.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-12 21:46:34 +08:00
Tatjana Dehler
1edaa17181
Merge pull request #37154 from bk201/wip-47397
mgr/dashboard: fix the error when exporting CephFS path "/" in NFS exports

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Varsha Rao <varao@redhat.com>
2020-10-12 14:32:39 +02:00
Venky Shankar
a95b364e3e test: add tests for validating MDS metrics via perf stats module
Fixes: http://tracker.ceph.com/issues/24285
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2020-10-12 07:34:51 -04:00