RepoMirrors/ceph

mirror of https://github.com/ceph/ceph synced 2025-02-24 03:27:10 +00:00

Author	SHA1	Message	Date
Lianne	2b50cefa89	qa/tasks/mds_thrash: fix thrash iteration never skip Signed-off-by: Lianne <liyan.wang@xtaotech.com>	2021-05-24 17:17:44 +08:00
Patrick Donnelly	abe7c86337	qa: remove ceph file systems on completion So that we can avoid MDS replacement warnings. Fixes: https://tracker.ceph.com/issues/48757 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2021-01-08 10:44:53 -08:00
Ramana Raja	7016a2001d	qa/tasks: allow per file system config setting Signed-off-by: Ramana Raja <rraja@redhat.com>	2020-11-20 13:23:21 +05:30
Kefu Chai	bb2c587435	qa/tasks/mds_thrash: s/random.sample/random.choice/ * use list comprehension instead of concatenating two ranges for better readablity -- we want to skip current max_mds for changing it. this helps reader to understand the goal of thrashing * random.sample() is replaced with random.choice(). the latter is a better alternative, if the number of samples is 1. Signed-off-by: Kefu Chai <kchai@redhat.com>	2020-05-23 11:29:53 +08:00
Patrick Donnelly	8d51b33e5d	qa: use py3 compat list from range Fixes: https://tracker.ceph.com/issues/45590 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2020-05-18 12:57:43 -07:00
Kefu Chai	726c59be58	qa/tasks: use list comprehension for checking the length instead of using filter(), use `sum()` for counting its length, as in Python3, `filter()` actually returns a `filter` object instead of a list. in this change, `filter()` calls are replaced with `sum()` for Python3 compatibility. Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>	2020-04-30 23:18:03 +08:00
Kefu Chai	947a74349d	qa: import with full path to be py3 compatible Signed-off-by: Kefu Chai <kchai@redhat.com>	2020-03-24 18:27:55 +08:00
Thomas Bechtold	f5e77561e9	qa: Fix problems detected by mypy This is a first step to enable mypy on the qa/ directory. Signed-off-by: Thomas Bechtold <tbechtold@suse.com>	2020-03-05 06:53:31 +01:00
Jos Collin	4c67888ad8	qa/tasks: Fixed AttributeError: can't set attribute Fixes: https://tracker.ceph.com/issues/42636 Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-11-27 08:30:58 +05:30
Jos Collin	003550c493	qa/tasks: drop/update name from Thrasher * Dropped name setter and property from Thrasher base class * Updated each Thrasher class with a name attribute Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-11-27 08:27:04 +05:30
Patrick Donnelly	154f1ccc86	Merge PR #31207 into master * refs/pull/31207/head: qa/tasks: Better handling of thrasher names and __init__ calls Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>	2019-11-01 15:37:12 -07:00
Jos Collin	8138cd5198	qa/tasks: Better handling of thrasher names and __init__ calls Fixes: https://tracker.ceph.com/issues/42062 Fixes: https://tracker.ceph.com/issues/42478 Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-10-30 10:21:25 +05:30
Kyr Shatskyy	5f95b532aa	qa: get rid of iterkeys for py3 compatibility Fixes: https://tracker.ceph.com/issues/42287 Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>	2019-10-11 18:54:29 +02:00
Jos Collin	f13f9f9fc1	qa/tasks: drop object inherit Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-08-23 15:29:27 +05:30
Patrick Donnelly	dad94db7ae	Merge PR #28378 into master * refs/pull/28378/head: qa/tasks: introduce Thrasher base class qa/tasks: Fix typo qa/tasks: manage thrashers qa/tasks: start DaemonWatchdog when ceph starts qa/tasks: make watch and bark handle more daemons qa/tasks: move DaemonWatchdog to new file Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>	2019-08-21 10:57:15 -07:00
Jos Collin	f31791e35d	qa/tasks: introduce Thrasher base class * Introduced a Thrasher base class. * Updated thrashers to inherit from Thrasher. * Replaced the magic variable e with Thrasher.exception as per the discussion. Now the exception variable sets by default as the thrashers are inheriting from the Thrasher class. Fixes: https://github.com/ceph/ceph/pull/28378#discussion_r309337928 Fixes: https://tracker.ceph.com/issues/41133 Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-08-21 10:49:46 +05:30
Jos Collin	51d851815e	qa/tasks: fixed typo in the comment Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-08-20 15:31:07 +05:30
Jos Collin	3f13a355c7	qa/tasks: manage thrashers * Added daemons to thrashers * Join the mds thrasher, as the other thrashers did Fixes: http://tracker.ceph.com/issues/10369 Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-08-06 06:36:39 +05:30
Jos Collin	08b99eef27	qa/tasks: start DaemonWatchdog when ceph starts * Start DaemonWatchdog when ceph starts * Drop the DaemonWatchdog starting in mds_thrash.py * Bring the thrashers in mds_thrash.py into the context Fixes: http://tracker.ceph.com/issues/10369 Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-08-06 06:36:33 +05:30
Jos Collin	b7a1f5ca6c	qa/tasks: move DaemonWatchdog to new file * Moved DaemonWatchdog class to a new file daemonwatchdog.py * Dropped the client watch Signed-off-by: Jos Collin <jcollin@redhat.com>	2019-08-06 06:36:11 +05:30
Patrick Donnelly	8cbdad9f9b	qa: update testing for standby-replay Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2019-02-27 21:39:12 -08:00
Patrick Donnelly	1dc5b62557	qa: mds_thrash updates for new max_mds behavior Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2018-04-17 11:26:56 -07:00
Shengjing Zhu	2cbba835aa	misc: fix various spelling errors Signed-off-by: Shengjing Zhu <i@zhsj.me>	2018-03-10 23:39:20 +08:00
Patrick Donnelly	a84e3c89bf	qa: thrash max_mds and deactivate ranks Fixes: http://tracker.ceph.com/issues/10792 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-07-06 22:29:41 -07:00
Yan, Zheng	8d1828dc60	qa: update thrash max mds testing Current monitor only allows deactivating one mds at a time. Besides, the mds to deactivate should have max rank id. Signed-off-by: "Yan, Zheng" <zyan@redhat.com>	2017-06-27 22:08:26 +08:00
Patrick Donnelly	d748226f00	qa: add DaemonWatchdog to stop tests on failure Thrashing MDS will often result in failures which often do not stop the test. The failure may also cause the test to stall which will force the machines to needlessly be locked until a timeout is reached. This watchdog will unmount mounts and kill daemons when a failure is detected. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:14 -05:00
Patrick Donnelly	f005e8af6b	qa: disable max_mds changes during thrashing While the trasher supports the behavior desired by issue 10792 [1], the bugs uncovered due to deactivating MDS (and sometimes killing deactivating MDS) are presently a distraction from addressing issues during normal failures. So now thrashing max_mds is turned off by default. I have added a TODO to deactivate ranks in order (configurably) as random deactivation causes a lot of other problems. This also fixes a bug: random.randrange(0.0, 1.0) always returns 0. Oops. [1] http://tracker.ceph.com/issues/10792 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:14 -05:00
Patrick Donnelly	a0052fc2d6	qa: use gevent.sleep so greenlet yields Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:14 -05:00
Patrick Donnelly	fd4b61890d	qa: allow revived MDS to be up:active Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:13 -05:00
Patrick Donnelly	884215d933	qa: timeout waiting for thrashed MDS to revive Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:13 -05:00
Patrick Donnelly	8e9ea7b6ac	qa: configure thrashing while MDS are stopping Currently multimds is prone to many failures when killing an active or stopping MDS when there are MDS in the cluster which have been deactivated (stopping). Have this turned off by default for now. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:13 -05:00
Patrick Donnelly	6304b6ed5d	qa: add deactivation log message Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:13 -05:00
Patrick Donnelly	1185326c45	qa: avoid infinite wait if no repl. can be made The thrasher can enter an infinite loop waiting for an MDS to take a certain rank when a replacement may not be possible. For example, max_mds actives are already running. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:12 -05:00
Patrick Donnelly	638bccb2bb	qa: timeout thrasher if fs does not stabilize After 5 minutes of waiting, it's reasonable to stop as the cluster is probably stuck. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:12 -05:00
Patrick Donnelly	8f3e745344	qa: check replacement MDS is active in thrasher Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:12 -05:00
Patrick Donnelly	19289725c8	qa: handle thrashing ranks with holes During the course of thrashing max_mds, the ranks assigned to MDSs may develop holes. This causes the thrasher to try to wrongly deactivate ranks that are not assigned. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2017-02-06 14:07:12 -05:00
Sage Weil	c01f2ee0e2	move ceph-qa-suite dirs into qa/	2016-12-14 11:29:55 -06:00

37 Commits