Commit Graph

9 Commits

Author SHA1 Message Date
Sage Weil
71d74aa8c6 qa: more tries for mon tell when injecting msgr failures
With failure injection the default 2 tries isn't quite enough

Signed-off-by: Sage Weil <sage@redhat.com>
2019-10-11 14:16:42 -05:00
David Zafman
ded58ef91d test: Ignore OSD_SLOW_PING_TIME* if injecting socket failures
Fixes: https://tracker.ceph.com/issues/41743

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-10-03 09:09:10 -07:00
Patrick Donnelly
716db6e2fd
qa: add .qa helper link
This utilizes the recent feature in teuthology [1] to skip hidden files in
suites when building the job matrix.

Idea of this change is to enable referring to the top-level qa directory in a
position-independent way such that copies of a suite to another location do not
break any symlinks.

[1] https://github.com/ceph/teuthology/pull/1185

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:33:48 -07:00
Kefu Chai
966c76330b qa: reduce "mon client hunt interval max multiple" to 2 for all clients
because with high failure rate, we need to connect to mon more
frequently if the connection fails.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-26 12:04:49 +08:00
Kefu Chai
6b3d0f61f9 qa: decrease the msg_inject_socket_failures from 1/500 to 1/1000
Fixes: http://tracker.ceph.com/issues/22093
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-12-15 14:21:43 +08:00
Sage Weil
431d1482ff qa/suites/rados/thrash: extend mgr beacon grace when many msgr failures injected
Fixes: http://tracker.ceph.com/issues/21147
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-29 10:29:52 -06:00
Kefu Chai
358f53cbc4 qa: reduce mon-client-hunt-interval-max-multiple to 2
monclient could experience frequent connection issues if "ms inject
socket failures" is enabled with high frequency. so reduce this number
to relieve the symptom.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-13 12:06:18 +08:00
Joao Eduardo Luis
6f6fbe7870 qa: flush out monc's dropped msgs on msgr failure injection
We have a few open tickets regarding the mgr being down during suites
involving messenger failure injection. There are a few suspicions that
this may be related with the monclient, but we'll need more logs to
validate those suspicions and, more, to validate we're actually fixing
the issue.

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2017-07-21 15:29:21 +01:00
Sage Weil
c01f2ee0e2 move ceph-qa-suite dirs into qa/ 2016-12-14 11:29:55 -06:00