RepoMirrors/ceph

mirror of https://github.com/ceph/ceph synced 2024-12-28 06:23:08 +00:00

Author	SHA1	Message	Date
Pere Diaz Bou	8e07fbd2ea	Merge pull request #48843 from rhcs-dashboard/expose_slow_ops mgr/prometheus: expose daemon health metrics Reviewed-by: Anthony D Atri <anthony.datri@gmail.com> Reviewed-by: Avan Thakkar <athakkar@redhat.com> Reviewed-by: Ernesto Puerta <epuertat@redhat.com> Reviewed-by: Nizamudeen A <nia@redhat.com>	2022-12-20 12:25:32 +01:00
Pere Diaz Bou	5a2b7c25b6	mgr/prometheus: expose daemon health metrics Until now daemon health metrics were stored without being used. One of the most helpful metrics there is SLOW_OPS with respect to OSDs and MONs which this commit tries to expose to bring fine grained metrics to find troublesome OSDs instead of having a lone healthcheck of slow ops in the whole cluster. Signed-off-by: Pere Diaz Bou <pdiazbou@redhat.com>	2022-12-20 09:44:49 +01:00
Tatjana Dehler	08352b6540	ceph-mixin: fix ceph_hosts variable Only use `instance` to query for hostnames in single-cluster mode. Consider the cluster matcher only in multi-cluster mode. In this case the query will look like: `"label_values({cluster=~\"$cluster\"}, instance)"`. Fixes: https://tracker.ceph.com/issues/57987 Signed-off-by: Tatjana Dehler <tdehler@suse.com>	2022-11-11 16:35:05 +01:00
Tatjana Dehler	15fa97d49d	monitoring/ceph-mixin: add RGW host to label info Add the missing information about the RGW instance to the labels of the "Average GET/PUT Latencies" panel on the "RGW Overview" dashboard. Fixes: https://tracker.ceph.com/issues/57166 Signed-off-by: Tatjana Dehler <tdehler@suse.com>	2022-09-06 16:19:19 +02:00
Tatjana Dehler	8faaca2082	monitoring/ceph-mixin: OSD overview typo fix Correct a wrongly set bracket on ceph-dashboard -> OSD Overview -> OSD Objectstore Types resulting in a parser error. Fixes: https://tracker.ceph.com/issues/56948 Signed-off-by: Tatjana Dehler <tdehler@suse.com>	2022-07-28 15:15:32 +02:00
Arthur Outhenin-Chalandre	37add644d1	ceph-mixin: remove timepicker override in every dashboard Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-05-24 11:54:26 +02:00
Arthur Outhenin-Chalandre	5db37300fd	ceph-mixin: rationalize local helper functions to utils Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-05-24 11:50:49 +02:00
Arthur Outhenin-Chalandre	0b7cc6bc99	ceph-mixin: fix typos Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-05-18 10:02:54 +02:00
Arthur Outhenin-Chalandre	3b6356c872	ceph-mixin: don't add cluster matcher if showcluster is disabled Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-05-17 09:41:21 +02:00
Arthur Outhenin-Chalandre	fd4f484d22	ceph-mixin: refactor the structure of _config and utils Before this refactor we couldn't override the config externally. Now the _config is correctly propagated and not only taken from the config.libsonnet file. Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-05-16 15:26:56 +02:00
Arthur Outhenin-Chalandre	faeea8d165	ceph-mixin: fix linting issue and add cluster template support Fix most of the issues reported by dashboards-linter: - Add matcher/template for job (and also cluster) - use $__rate_interval everywhere Also, this changes all the irate functions to rate, as most uses of irate were not actually correct. When using irate on a graph, for instance, you can easily miss some of the metric values, because irate only takes the last two values and the query step can be quite large if you want a graph spanning a few hours, a day, or more. Fixes: https://tracker.ceph.com/issues/55003 Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch> ceph-mixin: add config with matchers and tags Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-05-16 15:26:53 +02:00
Arthur Outhenin-Chalandre	1452311a9b	ceph-mixin: rewrite promql queries to multiline Fixes: https://tracker.ceph.com/issues/55005 Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-04-27 17:55:52 +02:00
Aashish Sharma	9719cc795e	mgr/dashboard: Pool overall performance shows multiple entries of same pool in pool overview This PR intends to fix this issue. Fixes: https://tracker.ceph.com/issues/54513 Signed-off-by: Aashish Sharma <aasharma@redhat.com>	2022-03-28 18:25:25 +05:30
Arthur Outhenin-Chalandre	98236e3a1d	mgr/dashboard: monitoring: refactor into ceph-mixin A mixin is a way to bundle dashboards, Prometheus rules and alerts into a jsonnet package. Shifting to a mixin will allow easier integration with the monitoring automation that some users may use. This commit moves `/monitoring/grafana/dashboards` and `/monitoring/prometheus` to `/monitoring/ceph-mixin`. The Prometheus alerts were also converted to Jsonnet in an automated way (from yaml to json to jsonnet). This commit minimises any change made to the generated files and should change neither the dashboards nor the Prometheus alerts. In the future some configuration will also be added to jsonnet to add more functionality to the dashboards or alerts (e.g. multi-cluster). Fixes: https://tracker.ceph.com/issues/53374 Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@cern.ch>	2022-02-03 13:08:20 +01:00

14 Commits