RepoMirrors/ceph

Fork 0

mirror of https://github.com/ceph/ceph synced 2025-01-29 14:34:40 +00:00

Commit Graph

Author	SHA1	Message	Date
Patrick Donnelly	9aee68a25f	qa: correct daemon for warning conf The mgr emits this health check instead of the mons. Fixes: https://tracker.ceph.com/issues/67737 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2024-09-16 15:39:12 -04:00
Milind Changire	daf4798086	qa: failfast mount for better performance During teuthology tests, the tearing down of the cluster between two tests causes the config to be reset and a config_notify generated. This leads to a race to create a new mount using the old fscid. But by the time the mount is attempted the new fs gets created with a new fscid. This situation leads to the client mount waiting for a connection completion notification from the mds for 5 minutes (default timeout) and eventually giving up. However, the default teuthology command timeout is 2 minutes. So, teuthology fails the command and declares the job as failed way before the mount can timeout. The resolution to this case is to lower the client mount timeout to 30 seconds so that the config_notify fails fast paving the way for successive commands to get executed with the new fs. An unhandled cluster warning about an unresponsive client also gets emitted later during qa job termination which leads to teuthology declaring the job as failed. As of now this warning seems harmless since it is emitted during cluster cleanup phase. So, this warning is added to the log-ignorelist section in the snap-schedule YAML. Fixes: https://tracker.ceph.com/issues/66009 Signed-off-by: Milind Changire <mchangir@redhat.com>	2024-08-23 15:06:13 +05:30
Patrick Donnelly	d2d844b60b	qa/cephfs: add mgr debugging Fixes: https://tracker.ceph.com/issues/64985 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2024-03-19 11:01:48 -04:00

Author

SHA1

Message

Date

Patrick Donnelly

9aee68a25f

qa: correct daemon for warning conf

The mgr emits this health check instead of the mons.

Fixes: https://tracker.ceph.com/issues/67737
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

2024-09-16 15:39:12 -04:00

Milind Changire

daf4798086

qa: failfast mount for better performance

During teuthology tests, the tearing down of the cluster between two
tests causes the config to be reset and a config_notify generated. This
leads to a race to create a new mount using the old fscid. But by the
time the mount is attempted the new fs gets created with a new fscid.
This situation leads to the client mount waiting for a connection
completion notification from the mds for 5 minutes (default timeout)
and eventually giving up.
However, the default teuthology command timeout is 2 minutes. So,
teuthology fails the command and declares the job as failed way before
the mount can timeout.

The resolution to this case is to lower the client mount timeout to 30
seconds so that the config_notify fails fast paving the way for
successive commands to get executed with the new fs.

An unhandled cluster warning about an unresponsive client also gets
emitted later during qa job termination which leads to teuthology
declaring the job as failed. As of now this warning seems harmless since
it is emitted during cluster cleanup phase.
So, this warning is added to the log-ignorelist section in the
snap-schedule YAML.

Fixes: https://tracker.ceph.com/issues/66009
Signed-off-by: Milind Changire <mchangir@redhat.com>

2024-08-23 15:06:13 +05:30

Patrick Donnelly

d2d844b60b

qa/cephfs: add mgr debugging

Fixes: https://tracker.ceph.com/issues/64985
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

2024-03-19 11:01:48 -04:00

3 Commits