faeea8d165
Fix most of the issues reported by dashboards-linter: - Add matcher/template for job (and also cluster) - use $__rate_interval everywhere Also this change all the irate functions to rate as most of irate where not actually used correctly. While using irate on graph for instance you can easily miss some of the metrics values as irate only take the two last values and the query steps can be quite large if you want a graph for a few hours/a day or more. Fixes: https://tracker.ceph.com/issues/55003 Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch> ceph-mixin: add config with matchers and tags Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch> |
||
---|---|---|
.. | ||
dashboards | ||
dashboards_out | ||
tests_alerts | ||
tests_dashboards | ||
.gitignore | ||
.pylintrc | ||
alerts.libsonnet | ||
CMakeLists.txt | ||
config.libsonnet | ||
dashboards.jsonnet | ||
jsonnet-build.sh | ||
jsonnet-bundler-build.sh | ||
jsonnetfile.json | ||
jsonnetfile.lock.json | ||
lint-jsonnet.sh | ||
Makefile | ||
mixin.libsonnet | ||
prometheus_alerts.yml | ||
README.md | ||
requirements-alerts.txt | ||
requirements-grafonnet.txt | ||
requirements-lint.txt | ||
test-jsonnet.sh | ||
tox.ini |
Prometheus Monitoring Mixin for Ceph
A set of Grafana dashboards and Prometheus alerts for Ceph.
All the Grafana dashboards are already generated in the dashboards_out
directory and alerts in the prometheus_alerts.yml
file.
You can use the Grafana dashboards and alerts with Jsonnet like any other prometheus mixin. You can find more ressources about mixins in general on monitoring.mixins.dev.
Grafana dashboards for Ceph
In dashboards_out
you can find a collection of
Grafana dashboards for Ceph Monitoring.
These dashboards are based on metrics collected from prometheus scraping the prometheus mgr plugin and the node_exporter.
Requirements
- Status Panel installed on your Grafana instance
- Pie Chart Panel installed on your Grafana instance
Prometheus alerts
In prometheus_alerts.yml
you'll find a set of Prometheus
alert rules that should provide a decent set of default alerts for a
Ceph cluster. Just put this file in a place according to your Prometheus
configuration (wherever the rules
configuration stanza points).
SNMP
Ceph provides a MIB (CEPH-PROMETHEUS-ALERT-MIB.txt) to support sending Prometheus alerts through to an SNMP management platform. The translation from Prometheus alert to SNMP trap requires the Prometheus alert to contain an OID that maps to a definition within the MIB. When making changes to the Prometheus alert rules file, developers should include any necessary changes to the MIB.
Building from Jsonnet
- Install jsonnet (at least v0.18.0)
- By installing the package
jsonnet
in most of the distro andgolang-github-google-jsonnet
in fedora
- By installing the package
- Install jsonnet-bundler
To rebuild all the generated files, you can run tox -egrafonnet-fix
.
The jsonnet code located in this directory depends on some Jsonnet third party
libraries. To update those libraries you can run jb update
and then update
the generated files using tox -egrafonnet-fix
.