Commit Graph

27 Commits

Author SHA1 Message Date
Volker Theile
8e6838c740 monitoring: SNMP OID per every Prometheus alert rule
Use the Ceph enterprise OID 50495 (https://www.iana.org/assignments/enterprise-numbers/enterprise-numbers) and create OIDs for every Prometheus alert rule according to the schema at https://github.com/SUSE/prometheus-webhook-snmp/blob/master/README.md.

Example OID:
1.3.6.1.4.1.50495.15.1.2.2.1

All alert rule OIDs are located below the object identifier 15 (15 for p which is the first character of prometheus). Check out the MIB at https://github.com/SUSE/prometheus-webhook-snmp/blob/master/PROMETHEUS-ALERT-CEPH-MIB.txt for more details.

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-05-28 09:59:50 +02:00
Jan Fajerski
e7a4437fdc monitoring: update Grafana dashboards
Fix various panels that used outdated metric names, cluncky or
unnecessary label_replace calls. Also unify the style of many panels.

Fixes: http://tracker.ceph.com/issues/39652

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
2019-05-14 13:47:55 +02:00
Jan Fajerski
c0e58bd8ae monitoring: add a few prometheus alerts
Alerts are from
https://github.com/SUSE/DeepSea/blob/SES5/srv/salt/ceph/monitoring/prometheus/files/ses_default_alerts.yml
but updated for the mgr module and node_exporter >= 0.15.

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
2019-04-26 11:21:39 +02:00
Jan Fajerski
287e209351 monitoring/grafana: fix typo in README
Signed-off-by: Jan Fajerski <jfajerski@suse.com>
2019-04-16 14:19:51 +02:00
Neha Gupta
739fdbad37 mgr/dashboard: Fixed performance details context for host list row selection
Fixes: http://tracker.ceph.com/issues/37854

Signed-off-by: Neha Gupta <gnehapk@gmail.com>
2019-01-18 13:36:49 +09:00
Jason Dillaman
f4ac899950 monitoring/grafana: new RBD overview dashboard page
This page pulls RBD stats from the Natuatilus prometheus exporter.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2019-01-11 16:41:46 -05:00
Boris Ranto
1ade714910 cmake: Support grafana dashboard installation
We are currently hosting the grafana dashboards in our repo but we do
not install them. This patch adds the cmake support.

Signed-off-by: Boris Ranto <branto@redhat.com>
2018-10-25 17:09:02 +02:00
Lenz Grimmer
94aefee3b0
Merge pull request #24314 from rhcs-dashboard/dashboards
mgr/dashboard: Grafana dashboard updates and additions

Reviewed-by: Boris Ranto <branto@redhat.com>
2018-10-19 12:42:23 +02:00
Paul Cuzner
a848411bd8 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
a99618ce41 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
b64289ca3d MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
5432470914 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
bc5eea09c8 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
ba1a3b3a09 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
f97fee3a83 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
02b5414d19 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
7c04098e68 MGR/dashboard: make grafana datasource selectable
Grafana dashboard updated to use a templating
variable for the datasource

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
2c346efd12 Fix linewidth issue in pools overview dashboard
Linewidth was set to two, but the idea is that
a linewidth of >1 is reserved for eye-catcher
plot lines like maximums

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
b84f0ce45f Refresh of the dashboards
Fixes some minor anomalies and tested against
node_exporter 0.15 and 0.16

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
7d97bb28a8 Updated requirements information
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
0e655f8400 Added new Overview dashboards
These new dashboard definitions provide the high
level views for the hosts in the cluster and the
OSDs.

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
4292a7a357 Screenshots added for all dashboards
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
3c7c32f2ed Add Host level details dashboard
The host-details.json file provides a view of host
level metrics. The panels are arranged in two
rows;
Overview : Cpu/RAM/Network related stats
OSD Performance: OSD physical drive stats

The overview row is shown by default. Click on
the OSD Performance row to show the remaining
graphs

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Paul Cuzner
a0d9325c4d Document the current state of the dashboards
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:26:08 +13:00
Paul Cuzner
8ebf2ede7f Initial grafana dashboard definitions
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
2018-10-09 08:23:39 +13:00
Maxime
68b044a75e [grafana] Fix OSD Capacity Utlization graph
Signed-off-by: Maxime <maxime@root314.com>
2018-10-04 13:44:12 +02:00
Jan Fajerski
7e7ae7a0fe add monitoring subdir and Grafana cluster dashboard
Signed-off-by: Jan Fajerski <jfajerski@suse.com>
2018-05-07 14:25:29 +02:00