ceph/verify at 767161a347ceb53edf4b71a0c0876d2aa2111ae1 - ceph

mirror of https://github.com/ceph/ceph synced 2025-01-18 09:02:08 +00:00

History

Sage Weil cf352c3ac0 osd: add osd_fast_shutdown option (default true) If we get a SIGINT or SIGTERM or are deleted from the OSDMap, do a fast shutdown by exiting immediately. This has a few important benefits: - We immediately stop responding (binding) to any sockets, which means other OSDs will immediately decide we are down (and dead!). This minimizes IO interruption. - We avoid the complex "clean" shutdown process, which is historically a source of bugs. In reality, the only purpose of the "clean" shutdown is to try to tear down everything in memory so we can do memory leak checking with valgrind. Set this option to false for valgrind QA runs so we can still do that. Not that with the new read leases in octopus, we rely on the default behavior that a ECONNREFUSED is taken to mean that the OSD is fully dead, so that we don't have to wait for any leases to time out. This works in sane environments with normal IP networks, but that behavior could conceivably be a bad idea if there are some weird network shenanigans going on. If osd_fast_fail_on_connection_refused were disabled, then this fast shutdown procedure might be worse than the clean shutdown because we would have to wait for the heartbeat timeout. Signed-off-by: Sage Weil <sage@redhat.com>		2019-11-15 09:31:50 -06:00
..
clusters
msgr-failures	qa: more tries for mon tell when injecting msgr failures	2019-10-11 14:16:42 -05:00
proto
tasks	test/rgw: run ceph_test_rgw_gc_log in rgw verify suite	2019-10-19 13:28:18 +05:30
validater	osd: add osd_fast_shutdown option (default true)	2019-11-15 09:31:50 -06:00
.qa
%
frontend
objectstore	qa/rgw: drop some objectstore types	2019-10-18 13:20:20 -04:00
overrides.yaml	qa/rgw: use config overrides for 'testing' kms backend	2019-10-15 11:30:15 -04:00
rgw_pool_type