ceph/monitoring at 83f84f601aada043fdece5c733a85230786545cb - ceph

mirror of https://github.com/ceph/ceph synced 2024-12-19 09:57:05 +00:00

History

Paul Cuzner 2010432b50 mgr/prometheus: Add healthcheck metric for SLOW_OPS SLOW_OPS is triggered by op tracker, and generates a health alert but healthchecks do not create metrics for prometheus to use as alert triggers. This change adds SLOW_OPS metric, and provides a simple means to extend to other relevant health checks in the future If the extract of the value from the health check message fails we log an error and remove the metric from the metric set. In addition the metric description has changed to better reflect the scenarios where SLOW_OPS can be triggered. Signed-off-by: Paul Cuzner <pcuzner@redhat.com>	2020-11-02 15:30:49 +13:00
..
grafana	monitoring: Use null yaxes min for OSD read latency	2020-10-12 19:56:18 +03:30
prometheus	mgr/prometheus: Add healthcheck metric for SLOW_OPS	2020-11-02 15:30:49 +13:00

Paul Cuzner 2010432b50 mgr/prometheus: Add healthcheck metric for SLOW_OPS

SLOW_OPS is triggered by op tracker, and generates a health
alert but healthchecks do not create metrics for prometheus to
use as alert triggers. This change adds SLOW_OPS metric, and
provides a simple means to extend to other relevant health
checks in the future

If the extract of the value from the health check message fails
we log an error and remove the metric from the metric set. In
addition the metric description has changed to better reflect
the scenarios where SLOW_OPS can be triggered.

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>

2020-11-02 15:30:49 +13:00

grafana

monitoring: Use null yaxes min for OSD read latency

2020-10-12 19:56:18 +03:30

prometheus

mgr/prometheus: Add healthcheck metric for SLOW_OPS

2020-11-02 15:30:49 +13:00