Commit Graph

60 Commits

Author SHA1 Message Date
Patrick Donnelly
98f161b210
qa: remove unnecessary background job cleanup
This is done in CephFSTestCase.tearDown.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2024-04-03 09:08:53 -04:00
Xiubo Li
f0ebac137b qa/tasks/cephfs/test_misc: switch duration to timeout
The duration is impresise and sometimes will give a false alarm
just in case the shell command itself is issued late.

https://tracker.ceph.com/issues/63587
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-11-28 11:05:36 +08:00
Xiubo Li
4e085c5d2f qa/tasks/cephfs/test_misc: fix the incorrect comments
https://tracker.ceph.com/issues/63587
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-11-28 11:05:36 +08:00
Rishabh Dave
f0588bd3b3
Merge pull request #50569 from rishabh-d-dave/CephManager-in-CephFSTestCase
qa/cephfs: add helper methods in CephFSTestCase

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2023-07-13 12:44:22 +05:30
Rishabh Dave
c7c38ba558 qa/cephfs: when cmd output is not needed call run_ceph_cmd()
instead of get_ceph_cmd_stdout().

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2023-06-28 17:38:19 +05:30
Rishabh Dave
13168834e3 qa/cephfs: add and use get_ceph_cmd_stdout()
Add method get_ceph_cmd_stdout() to class CephFSTestCase so that one
doesn't have to type something as long as
"self.mds_cluster.mon_manager.raw_cluster_cmd()" to execute a
command and get its output. And delete and replace
CephFSTestCase.run_cluster_cmd() too.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2023-06-28 17:38:19 +05:30
neeraj pratap singh
36bf907f9e qa: test for session ls with filters
Fixes: https://tracker.ceph.com/issues/61444
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
2023-06-26 17:55:15 +05:30
Xiubo Li
d851a9475c qa: fix journal flush failure issue due to the MDS daemon crashes
After the MDS daemon crashing, the journal flush request will fail.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-05-05 18:46:01 +08:00
Xiubo Li
71797091a2 qa: add test support for the alloc ino failing
Fixes: https://tracker.ceph.com/issues/52280
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-05-05 18:46:01 +08:00
Zhansong Gao
d4438d32ca qa: add test for 'dump dir'
Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
2023-02-01 18:52:38 +08:00
Jos Collin
a1a916547e
qa: test dump recent in-memory logs on extra-ordinary events
Fixes: https://tracker.ceph.com/issues/40633
Signed-off-by: Jos Collin <jcollin@redhat.com>
2022-06-22 15:56:49 +05:30
Nikhilkumar Shelke
0f4add67eb qa: verify command status if data or metadata pool already in use
Fixes: https://tracker.ceph.com/issues/54111
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-06-12 16:25:34 +05:30
Xiubo Li
cd3e903b0c qa: add file/filesystem sync crash test case
This is one test case for the possible kernel crash bug when doing
the file sync or filesystem sync.

Fixes: https://tracker.ceph.com/issues/55329
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-04-21 15:08:11 +08:00
Xiubo Li
3db3b4e2a4 qa: add file sync stuck test support
This will test the file sync of a directory, which maybe stuck for
at most 5 seconds. This was because the related code will wait for
all the unsafe requests to get safe reply from MDSes, but the MDSes
just think that it's unnecessary to flush the mdlog immediately
after early reply, and the mdlog will be flushed every 5 seconds
in the tick thread.

This should have been fixed in kclient and libcephfs by triggering
mdlog flush before waiting requests' safe reply.

Fixes: https://tracker.ceph.com/issues/55283
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-04-21 15:07:43 +08:00
Xiubo Li
b6fc5480f6 qa: add filesystem sync stuck test support
This will test the sync of the filesystem, which maybe stuck for
at most 5 seconds. This was because the related code will wait
for all the unsafe requests to get safe reply from MDSes, but the
MDSes just think that it's unnecessary to flush the mdlog immediately
after early reply, and the mdlog will be flushed every 5 seconds
in the tick thread.

This should have been fixed in kclient and libcephfs by triggering
mdlog flush before waiting requests' safe reply.

Fixes: https://tracker.ceph.com/issues/55283
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2022-04-21 15:07:43 +08:00
Nikhilkumar Shelke
23b3a27eaa ceph-fuse: restrict already_fuse_mounted function only for linux
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-03-28 13:25:09 +05:30
Nikhilkumar Shelke
34cf3ec723 qa: test ceph-fuse mount on already mounted path
Fixes: https://tracker.ceph.com/issues/46075
Signed-off-by: Nikhilkumar Shelke <nshelke@redhat.com>
2022-03-07 14:13:42 +05:30
Rishabh Dave
485841b255 qa: import CommandFailedError from exceptions not run
Stop importing CommandFailedError from teuthology.orchestra.run, it is
actually defined in teuthology.exception.

Fixes: https://tracker.ceph.com/issues/51226
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-10-05 23:41:09 +05:30
Xiubo Li
c1cea71299 qa: use the pg autoscale mode to calcuate the pg_num
Setting the pg_num to 8 is too small that some osds maybe not covered by the
pools, some osds maybe overloaded. Remove the hardcodeing pg_num here and let
the pg autoscale mode to calculate it as needed, and at the same time set the
pg_num_min to 64 to avoid the pg_num to small.

If ec pool is used, for the test cases most datas will go to the ec pool and
the primary replicated pool will store a small amount of metadata for all the
files only, so set the target size ratio to 0.05 should be enough.

Fixes: https://tracker.ceph.com/issues/45434
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2021-07-27 09:48:56 +08:00
Jos Collin
58b62f10d9
qa: test fs lsflags command
Fixes: https://tracker.ceph.com/issues/48682
Signed-off-by: Jos Collin <jcollin@redhat.com>
2021-04-08 11:09:18 +05:30
Patrick Donnelly
5292e88201
qa: reduce dependence on teuthology role list for mds
It's not yet possible to completely remove the dependency on
mds_ids/mds_daemons in the CephFS tests but this commit reduces it
enough for most code paths to work with cephadm.

The main change here is use of CephManager.do_rados, with some
improvements.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-03-21 10:35:07 -07:00
Patrick Donnelly
8e45af05c9
qa: remove get_lone_mds_id
This helps remove a dependence on the list of mds_daemons from
teuthology/cephadm.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-03-21 10:35:07 -07:00
Patrick Donnelly
0825d6aa9e
qa: simplify tests which stop MDS ranks
Instead of stopping MDS daemons and individually failing MDS daemons,
just fail the ranks or the entire file system, where possible.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-03-21 10:35:06 -07:00
Rishabh Dave
4d0f56fcc5 qa/cephfs: move is_blocklisted() to filesystem.CephCluster
Using self.fs.mon_manager in mount.py can lead to a crash since self.fs
can be None. Move is_blocklisted() to tasks.filesystem.CephCluster where
it can get access to mon_manager without depending on objects
representing Ceph FSs.

Fixes: https://tracker.ceph.com/issues/49511
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2021-03-06 09:11:36 +05:30
Kefu Chai
5ca820fb9e Revert "msg,mon,common: log when DispatchQueue throttle limit is reached"
Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-02-08 18:03:14 +08:00
Jos Collin
8367448765
qa: test DispatchQueue throttling
Fixes: https://tracker.ceph.com/issues/46226
Signed-off-by: Jos Collin <jcollin@redhat.com>
2021-01-25 11:18:04 +05:30
Xiubo Li
def177ff3b qa/tasks: switch to _kill_background() helper to terminate the daemons
Fixes: https://tracker.ceph.com/issues/46883
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-10-21 04:01:35 -04:00
Patrick Donnelly
d33c6157a3
qa: test for statfs on deleted file system
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-08-29 10:22:12 -07:00
Sage Weil
dfd01d7653 blacklist -> blocklist
Signed-off-by: Sage Weil <sage@newdream.net>
Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-08-24 19:53:08 +00:00
Xiubo Li
5c24d91327 qa/tasks/cephfs: add mount_wait() support to simplify the code
Mostly we should wait the mountpoint to get ready, especially for
the fuse mountpoint, sometimes it may take a few seconds to get
ready.

Fixes: https://tracker.ceph.com/issues/44044
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-04-14 07:47:04 -04:00
Xiubo Li
ed5b737d00 qa/tasks/cephfs/test_misc: switch to netns suspend/resume
https://tracker.ceph.com/issues/44044
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-04-14 07:47:04 -04:00
Kefu Chai
83e41daec4 qa/tasks/cephfs: compare number with number
to address following failure:
```
2020-04-05T12:25:30.997 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2020-04-05T12:25:30.997 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_tchaikov_ceph_wip-qa-py3/qa/tasks/cephfs/test_misc.py", line 50, in test_root_rctime
2020-04-05T12:25:30.998 INFO:tasks.cephfs_test_runner:    self.assertGreaterEqual(rctime, t-10)
2020-04-05T12:25:30.998 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.5/unittest/case.py", line 1215, in assertGreaterEqual
2020-04-05T12:25:30.998 INFO:tasks.cephfs_test_runner:    if not a >= b:
2020-04-05T12:25:30.999 INFO:tasks.cephfs_test_runner:TypeError: unorderable types: str() >= float()
```

Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-04-07 21:51:22 +08:00
Patrick Donnelly
590368e956
qa: ignore trimmed cache items for dead cache drop
Fixes: https://tracker.ceph.com/issues/42986
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-01-20 17:52:12 -08:00
Patrick Donnelly
a8a19501b7
qa: use unit test comparisons
So they show values on failure.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-01-20 17:08:44 -08:00
Thomas Bechtold
0127cd1e88 qa: Enable flake8 tox and fix failures
There were a couple of problems found by flake8 in the qa/
directory (most of them fixed now). Enabling flake8 during the usual
check runs hopefully avoids adding new issues in the future.

Signed-off-by: Thomas Bechtold <tbechtold@suse.com>
2019-12-12 10:21:01 +01:00
Patrick Donnelly
fc88e6c6c5
qa: use small default pg count for CephFS pools
The pg count needs to be a power-of-two since
dff5697464.

Also, mon_pg_warn_min_per_osd is disabled by default now (or set to a
low value in vstart/testing) so there's no need to base the pg count on
this value.

Ideally someday we can remove this so that the default cluster value is
used but we need to keep this for deployments of older versions of Ceph.

Fixes: https://tracker.ceph.com/issues/42228
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-10-14 10:45:05 -07:00
Yan, Zheng
cd29206974 mds: remove the code that skip evicting the only client
There is already logic that defer marking unresponsive client stale.
No reason to defer evicting the only stale client.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2019-06-19 14:28:31 +08:00
Yan, Zheng
4fd0202fdf qa/cephfs: fix test_evict_client
explictly kill stale session, otherwise session count check is racy.

Fixes: https://tracker.ceph.com/issues/40173
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2019-06-05 22:04:22 +08:00
Patrick Donnelly
1071f73c76
qa: use skipTest method instead of exception
This is the recommended method to skip a test according to [1]. It also lets us
avoid an unnecessary import.

[1] https://docs.python.org/2/library/unittest.html#unittest.TestCase.skipTest

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-04-24 09:38:52 -07:00
Patrick Donnelly
a20a61f476
qa: unmount clients before deleting fs
Client unmount during test cleanup will hang if the file system was deleted.

Fixes: http://tracker.ceph.com/issues/38518
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-28 09:02:33 -08:00
Patrick Donnelly
b28dcd9d75
qa: config recall settings to test cache drop
If we use the defaults, the MDS/client will recall/release everything quickly.
We want it to take time to see things like the timeout get hit.

Fixes: http://tracker.ceph.com/issues/38348
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-18 16:28:37 -08:00
Patrick Donnelly
b150b22d2c
qa: check cache dump works without timeout
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-18 16:28:37 -08:00
Patrick Donnelly
7fa1e3c37f
mds: remove cache drop asok command
`cache drop` is a long running command that will block the asok interface
(while the tell version does not). Attempting to abort the command with ^C or
equivalents will simply cause the `ceph` command to exit but won't stop the
asok command handler from waiting for the cache drop operation to complete.

Instead, just allow the tell version.

Fixes: http://tracker.ceph.com/issues/38020
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-01-23 06:48:58 -08:00
Patrick Donnelly
b62daf3b25
qa: test hex dump_inode
Fixes: http://tracker.ceph.com/issues/24721

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-01-17 07:38:12 -08:00
Patrick Donnelly
6a4cc58a9d
Merge PR #24292 into master
* refs/pull/24292/head:
	qa: add test for rctime on root inode
	mds: set rctime on new system inode
	mds: small refactor

Reviewed-by: Zheng Yan <zyan@redhat.com>
2018-10-15 21:31:04 -07:00
Patrick Donnelly
ddbc3016a5
qa: add test for rctime on root inode
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-03 10:10:47 -07:00
Venky Shankar
43d1b8e174 test: add test for mds drop cache command
Fixes: http://tracker.ceph.com/issues/23362
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2018-09-28 00:07:59 -04:00
Kefu Chai
4b0e2c8ed4 qa: fix typos
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-09-21 12:41:42 +08:00
Venky Shankar
c0b1dacc9f test: validate client eviction for cap revoke non-responders
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2018-08-21 01:58:55 -04:00
taodd
0bd385f30b qa/cephfs: add test for "dump inode"
Signed-off-by: dongdong tao <tdd21151186@gmail.com>
2018-06-16 17:46:04 +08:00