ceph/monitoring/grafana/dashboards
Patrick Seidensal 7d7488018e monitoring: Add unit tests for OSD panels in ceph-cluster dashboard
Signed-off-by: Patrick Seidensal <pseidensal@suse.com>
2022-01-13 13:27:55 +01:00
jsonnet mgr/prometheus: Fix regression with OSD/host details/overview dashboards 2022-01-13 13:27:55 +01:00
tests monitoring: Add unit tests for OSD panels in ceph-cluster dashboard 2022-01-13 13:27:55 +01:00
.pylintrc monitoring/grafana: Grafana query tester 2021-11-16 10:30:49 +01:00
CMakeLists.txt Merge pull request #44190 from rhcs-dashboard/grafana-regex 2021-12-21 17:58:17 +01:00
README
ceph-cluster.json monitoring: fix display ceph_osd_in in Grafana panel 2022-01-13 13:27:55 +01:00
cephfs-overview.json mgr/dashboard: monitoring: grafonnet refactoring for cephfs dashboards 2021-10-19 12:36:31 +05:30
host-details.json mgr/prometheus: Fix regression with OSD/host details/overview dashboards 2022-01-13 13:27:55 +01:00
hosts-overview.json mgr/prometheus: Fix regression with OSD/host details/overview dashboards 2022-01-13 13:27:55 +01:00
osd-device-details.json mgr/prometheus: Fix regression with OSD/host details/overview dashboards 2022-01-13 13:27:55 +01:00
osds-overview.json mgr/dashboard: monitoring:Implement BlueStore onode hit/miss counters into the dashboard 2022-01-05 14:22:53 +05:30
pool-detail.json mgr/dashboard: monitoring: grafonnet refactoring for pools dashboards 2021-10-19 12:10:56 +05:30
pool-overview.json mgr/dashboard: monitoring: grafonnet refactoring for pools dashboards 2021-10-19 12:10:56 +05:30
radosgw-detail.json mgr/dashboard: monitoring: grafonnet refactoring for radosgw dashboards 2021-10-19 11:57:28 +05:30
radosgw-overview.json mgr/dashboard: introduce HAProxy metrics for RGW 2021-12-09 20:03:03 +05:30
radosgw-sync-overview.json mgr/dashboard: monitoring: grafonnet refactoring for radosgw dashboards 2021-10-19 11:57:28 +05:30
rbd-details.json mgr/dashboard: monitoring: grafonnet refactoring for cephfs dashboards 2021-10-19 12:36:31 +05:30
rbd-overview.json mgr/dashboard: monitoring: grafonnet refactoring for rbd dashboards 2021-10-19 12:09:04 +05:30
requirements-grafonnet.txt
requirements-lint.txt monitoring/grafana: Grafana query tester 2021-11-16 10:30:49 +01:00
test-jsonnet.sh
tox.ini monitoring/grafana: doctest util regex 2021-12-15 09:36:08 +01:00

README

Context
These dashboards should be enough to get started on the integration. It's not a complete set, so more will be added in the next week.

Bear in mind that the osd device details dashboard needs node_exporter to be active - all the other dashboards pick data out of ceph-mgr-based metrics.


The cephfs dashboard only has 2 panels currently. The counters available are
a little light at the moment. Patrick/Venky have been addressing this with
https://bugzilla.redhat.com/show_bug.cgi?id=1618523
cephfs-overview.json

Host Information
host-details.json combines generic server metrics that show cpu/memory/network stats (including network errors/drops),
with disk-level stats for OSD hosts. OSD charts show the physical device name together with its corresponding osd id for correlation.

Ceph Pools
Two dashboards. The overview gives the high-level combined view; pool-detail needs a pool_name variable passed to it (it currently uses a templating var, which is visible).
pool-overview.json
pool-detail.json
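
As an illustration of passing pool_name to the detail dashboard, the sketch below builds a link that presets the variable through Grafana's "var-<name>" URL query parameter. The host name and dashboard path are hypothetical placeholders and are not taken from these dashboards; substitute the values from your own Grafana instance.

```python
from urllib.parse import urlencode


def pool_detail_url(base_url: str, dashboard_path: str, pool_name: str) -> str:
    """Build a Grafana dashboard URL that presets the pool_name variable."""
    # Grafana reads template variables from "var-<name>" query parameters.
    query = urlencode({"var-pool_name": pool_name})
    return f"{base_url.rstrip('/')}/{dashboard_path.lstrip('/')}?{query}"


# Hypothetical host and dashboard path, used only for illustration.
print(pool_detail_url("http://grafana.example:3000",
                      "d/pool-detail-uid/ceph-pool-detail",
                      "rbd"))
```

Encoding the value with urlencode keeps pool names that contain spaces or other reserved characters safe to place in a link.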

OSD Device Details. This dashboard needs some further work. It currently shows
OSD level stats with physical device stats but leaves out some of the counters
that cephmetrics provides for troubleshooting.
osd-device-details.json

Object gateway dashboards, again split into overview and detail. The detail dashboard needs the relevant ceph-daemon name for the rgw instance.
radosgw-overview.json
radosgw-detail.json
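
To see which variables a detail dashboard expects before linking to it, one option is to read the variable names from the dashboard JSON. This is a minimal sketch, assuming the usual Grafana layout where variables live under templating.list; the sample document and the variable name "example_daemon" are made up for illustration and do not reflect the contents of radosgw-detail.json.

```python
import json


def template_variables(dashboard: dict) -> list:
    """Return the names of the template variables a dashboard defines."""
    # Grafana dashboard JSON keeps its variables under templating.list.
    entries = dashboard.get("templating", {}).get("list", [])
    return [entry["name"] for entry in entries if "name" in entry]


# Illustrative stand-in for a dashboard file. A real file would be loaded
# with json.load() on an open file handle instead.
sample = json.loads(
    '{"templating": {"list": [{"name": "example_daemon", "type": "query"}]}}'
)
print(template_variables(sample))
```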