ceph/qa/suites
Ilya Dryomov 5011cc926c qa/suites/krbd: run unmap subsuite with msgr1 only
pre-single-major.yaml kernel doesn't have any of the monitor client
fixes that came in 4.6.  If the connection is closed, it closes the
session and retries only after 10 seconds.  On top of that, there is
nothing to prevent it from picking the same monitor when reconnecting.
This means that when given both v1 and v2 ports (which look like two
different monitors), it is susceptible to mount_timeout (60 seconds):

  $ sudo rbd map img
  rbd: sysfs write failed
  In some cases useful info is found in syslog - try "dmesg | tail".
  rbd: map failed: (5) Input/output error

  [  822.242313] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)
  [  832.265494] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)
  [  842.296175] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)
  [  852.326924] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)
  [  862.357611] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)
  [  872.388373] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)
  [  882.676136] libceph: mon0 172.21.15.132:3300 socket closed (con state CONNECTING)

Unlike newer kernels that return ETIMEDOUT, it returns EIO.

Newer kernels are much more aggressive about retries and will pick
a different monitor when reconnecting, hence they are always able to
establish the session in time.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2019-10-30 19:51:55 +01:00
..
big
buildpackages
ceph-ansible qa/ceph-ansible: Disable dashboard 2019-08-27 13:51:37 +10:00
ceph-deploy
cephmetrics qa/ceph-ansible: Replace pgs with pg_num 2019-07-01 16:10:54 +10:00
dummy
experimental qa: update testing for standby-replay 2019-02-27 21:39:12 -08:00
fs qa: disable too few PG warning during Mimic deploy 2019-10-24 15:12:43 -07:00
hadoop
kcephfs qa: whitelist "Error recovering journal" for cephfs-data-scan 2019-10-17 21:19:54 +08:00
krbd qa/suites/krbd: run unmap subsuite with msgr1 only 2019-10-30 19:51:55 +01:00
marginal
mixed-clients
multimds qa: ignore RECENT_CRASH for multimds snapshot testing 2019-08-26 15:23:51 -07:00
perf-basic os: remove KineticStore 2019-10-01 12:56:01 +08:00
powercycle qa: ignore slow metadata io wrn during osd thrash 2019-03-08 10:49:10 -08:00
rados Merge PR #31168 into master 2019-10-28 14:59:43 -05:00
rbd qa/suites/rbd: fix errant tab in yaml which is causing parsing failures 2019-10-15 13:51:58 -04:00
rgw Merge pull request #30997 from cbodley/wip-qa-rgw-objectstores 2019-10-23 11:37:32 -04:00
samba
smoke qa: more tries for mon tell when injecting msgr failures 2019-10-11 14:16:42 -05:00
stress
teuthology
tgt qa: more tries for mon tell when injecting msgr failures 2019-10-11 14:16:42 -05:00
upgrade qa/suites/upgrade/*-x-singleton: suppress TOO_FEW_PGS warning 2019-10-22 15:53:05 -05:00
.qa