alertmanager

mirror of https://github.com/prometheus/alertmanager synced 2025-01-10 15:59:32 +00:00

Max Inden 3735df3ac7 cluster: Do not exit when failing to join cluster (#1465)	2018-07-11 17:19:33 +02:00
alertmanager	cluster: Do not exit when failing to join cluster (#1465)	2018-07-11 17:19:33 +02:00
amtool	*: add missing license headers	2018-05-14 17:37:13 +02:00

cluster: Do not exit when failing to join cluster (#1465)

Alertmanager exits with a non-zero exit code if the initial cluster
join fails. This behavior may be undesirable because:

- As Alertmanager is a critical component with an at-least-once
guarantee, failing on joining the cluster is unnecessary as
Alertmanager still functions by itself.

- In an environment like Kubernetes discovering peers via DNS, peers
might roll out one-by-one, leaving the DNS entries unpopulated for the
first peer of a set. Failing on initial join prevents a roll-out.

Instead of failing on the initial join, this patch only logs the failure.
The cluster can be joined later via `handleReconnect`.

This is a regression introduced in PR #1456 [1].

[1] https://github.com/prometheus/alertmanager/pull/1456

Signed-off-by: Max Leonard Inden <IndenML@gmail.com>

2018-07-11 17:19:33 +02:00
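
The behavior described in the commit message can be illustrated with a minimal sketch. This is not the Alertmanager code: `peer`, `joinFunc`, `start`, `reconnect`, and `flakyJoin` are hypothetical names invented for this example, and the join call is simulated rather than backed by a real gossip library. It only shows the idea of logging a failed initial join and leaving a later reconnect step to retry.

```go
package main

import (
	"errors"
	"log"
)

// joinFunc is a hypothetical stand-in for a cluster join call. It returns
// the number of peers contacted, or an error if none could be reached.
type joinFunc func(peers []string) (int, error)

// peer is a hypothetical, simplified cluster member used for illustration.
type peer struct {
	join   joinFunc
	peers  []string
	joined bool
}

// start attempts the initial join. On failure it only logs a warning and
// returns, so the process keeps running standalone instead of exiting.
func (p *peer) start() {
	if _, err := p.join(p.peers); err != nil {
		log.Printf("warn: initial cluster join failed, continuing standalone: %v", err)
		return
	}
	p.joined = true
}

// reconnect sketches what a periodic reconnect handler might do: if the
// peer has not joined yet, try again and record success.
func (p *peer) reconnect() {
	if p.joined {
		return
	}
	if _, err := p.join(p.peers); err != nil {
		log.Printf("debug: reconnect attempt failed: %v", err)
		return
	}
	p.joined = true
}

// flakyJoin returns a joinFunc that fails for the first `failures` calls,
// simulating DNS entries that are not yet populated during a roll-out.
func flakyJoin(failures int) joinFunc {
	calls := 0
	return func(peers []string) (int, error) {
		calls++
		if calls <= failures {
			return 0, errors.New("no peers resolved")
		}
		return len(peers), nil
	}
}
```

In this sketch, a peer built with `flakyJoin(1)` stays unjoined after `start()` but keeps running, and the next call to `reconnect()` joins it.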