mgr/dashboard: Compare values of MTU alert by device

Fixes: https://tracker.ceph.com/issues/55004

Signed-off-by: Patrick Seidensal <pseidensal@suse.com>
Patrick Seidensal 2022-03-23 14:53:58 +01:00
parent 9aa5c79363
commit 3821548a37
GPG Key ID: E8DE675E1D128B94

@@ -704,7 +704,18 @@ groups:
 rate of the past 48 hours.
 - alert: CephNodeInconsistentMTU
-expr: node_network_mtu_bytes{device!="lo"} * (node_network_up{device!="lo"} > 0) != on() group_left() (quantile(0.5, node_network_mtu_bytes{device!="lo"}))
+expr: |
+node_network_mtu_bytes * (node_network_up{device!="lo"} > 0) ==
+scalar(
+max by (device) (node_network_mtu_bytes * (node_network_up{device!="lo"} > 0)) !=
+quantile by (device) (.5, node_network_mtu_bytes * (node_network_up{device!="lo"} > 0))
+)
+or
+node_network_mtu_bytes * (node_network_up{device!="lo"} > 0) ==
+scalar(
+min by (device) (node_network_mtu_bytes * (node_network_up{device!="lo"} > 0)) !=
+quantile by (device) (.5, node_network_mtu_bytes * (node_network_up{device!="lo"} > 0))
+)
 labels:
 severity: warning
 type: ceph_default
@@ -712,7 +723,7 @@ groups:
 summary: MTU settings across Ceph hosts are inconsistent
 description: >
 Node {{ $labels.instance }} has a different MTU size ({{ $value }})
-than the median value on device {{ $labels.device }}.
+than the median of devices named {{ $labels.device }}.
 - name: pools
 rules:
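
One way to exercise the new expression is a promtool rule unit test. The fragment below is a minimal sketch, not part of this commit. The rule file name `ceph_default_alerts.yml`, the test file name, the instance names, and the MTU values are all assumptions made for illustration. It also assumes the rule has no `for:` clause and carries only the labels and annotations visible in the diff above. Three hosts expose an `eth0` device that is up; two use an MTU of 1500 and one uses 9000, so only the host that deviates from the per-device median should be reported.

```yaml
# Hypothetical test file; run with: promtool test rules test_mtu_alert.yml
rule_files:
  - ceph_default_alerts.yml  # assumed name of the file containing the rule

evaluation_interval: 1m

tests:
  - interval: 1m
    input_series:
      # MTU per host for the same device name (values are illustrative)
      - series: 'node_network_mtu_bytes{device="eth0",instance="node-a"}'
        values: '1500 1500 1500'
      - series: 'node_network_mtu_bytes{device="eth0",instance="node-b"}'
        values: '1500 1500 1500'
      - series: 'node_network_mtu_bytes{device="eth0",instance="node-c"}'
        values: '9000 9000 9000'
      # All three interfaces are up, so none is filtered out by "> 0"
      - series: 'node_network_up{device="eth0",instance="node-a"}'
        values: '1 1 1'
      - series: 'node_network_up{device="eth0",instance="node-b"}'
        values: '1 1 1'
      - series: 'node_network_up{device="eth0",instance="node-c"}'
        values: '1 1 1'
    alert_rule_test:
      - eval_time: 1m
        alertname: CephNodeInconsistentMTU
        exp_alerts:
          # max by (device) is 9000 and the median is 1500, so the first
          # branch selects the series whose value equals 9000 (node-c).
          # min by (device) equals the median, so the second branch is empty.
          - exp_labels:
              device: eth0
              instance: node-c
              severity: warning
              type: ceph_default
            exp_annotations:
              summary: MTU settings across Ceph hosts are inconsistent
              description: >
                Node node-c has a different MTU size (9000)
                than the median of devices named eth0.
```

In this scenario exactly one device name shows a mismatch, so each `scalar()` call receives at most one element. `scalar()` returns NaN for any input vector that does not contain exactly one element, and a comparison with NaN matches nothing, which is why the `min` branch contributes no alerts here.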