ceph/systemd
Loic Dachary b3887379d6 build/ops: restart ceph-osd@.service after 20s instead of 100ms
Instead of the default 100ms pause before trying to restart an OSD, wait
20 seconds instead and retry 30 times instead of 3. There is no scenario
in which restarting an OSD almost immediately after it failed would get
a better result.

It is possible that a failure to start is due to a race with another
systemd unit at boot time. For instance if ceph-disk@.service is
delayed, it may start after the OSD that needs it. A long pause may give
the racing service enough time to complete and the next attempt to start
the OSD may succeed.

This is not a sound alternative to resolve a race, it only makes the OSD
boot process less sensitive. In the example above, the proper fix is to
enable --runtime ceph-osd@.service so that it cannot race at boot time.

The wait delay should not be minutes to preserve the current runtime
behavior. For instance, if an OSD is killed or fails and restarts after
10 minutes, it will be marked down by the ceph cluster.  This is not a
change that could break things but it is significant and should be
avoided.

Refs: http://tracker.ceph.com/issues/17889

Signed-off-by: Loic Dachary <loic@dachary.org>
2016-12-01 08:28:20 +01:00
..
50-ceph.preset systemd: add ceph-mgr service and target files 2016-09-29 17:27:08 +01:00
ceph doc: fix message typos in systemd 2015-11-23 16:50:07 +01:00
ceph-disk@.service systemd/ceph-disk: reduce ceph-disk flock contention 2016-11-28 17:55:39 +01:00
ceph-mds.target fine-grained control systemd to start/stop/restart ceph services at once 2015-10-26 15:13:19 +08:00
ceph-mds@.service systemd: make Ceph daemon units "want" time-sync.target 2016-04-23 17:48:08 +02:00
ceph-mgr.target systemd: add ceph-mgr service and target files 2016-09-29 17:27:08 +01:00
ceph-mgr@.service systemd: autogenerate ceph-mgr key during daemon startup 2016-09-29 17:27:08 +01:00
ceph-mon.target fine-grained control systemd to start/stop/restart ceph services at once 2015-10-26 15:13:19 +08:00
ceph-mon@.service systemd/ceph-mon@.service:Remove ceph-create-keys for mon in systemd 2016-11-04 23:05:26 +01:00
ceph-osd.target fine-grained control systemd to start/stop/restart ceph services at once 2015-10-26 15:13:19 +08:00
ceph-osd@.service build/ops: restart ceph-osd@.service after 20s instead of 100ms 2016-12-01 08:28:20 +01:00
ceph-radosgw.target fine-grained control systemd to start/stop/restart ceph services at once 2015-10-26 15:13:19 +08:00
ceph-radosgw@.service systemd: make Ceph daemon units "want" time-sync.target 2016-04-23 17:48:08 +02:00
ceph-rbd-mirror.target systemd: new ceph-rbd-mirror scripts 2016-03-18 17:51:23 -04:00
ceph-rbd-mirror@.service Drop any systemd imposed process/thread limits 2016-04-05 17:33:57 +01:00
ceph.target better systemd support 2014-12-01 14:45:31 +01:00
ceph.tmpfiles.d systemd: tmpfiles.d in /run, not /var/run 2015-09-11 11:38:47 -04:00
CMakeLists.txt systemd/CMakeLists.txt:Remove ceph-create-keys cmake 2016-11-04 23:05:44 +01:00
rbdmap.service Add Install section to systemd rbdmap.service file 2016-09-01 00:42:34 +02:00