---
title: Recording rules
sort_rank: 2
---

# Defining recording rules

## Configuring rules

Prometheus supports two types of rules which may be configured and then
evaluated at regular intervals: recording rules and [alerting
rules](alerting_rules.md). To include rules in Prometheus, create a file
containing the necessary rule statements and have Prometheus load the file via
the `rule_files` field in the [Prometheus configuration](configuration.md).
Rule files use YAML.
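
As a minimal sketch, the relevant part of a Prometheus configuration file might look like the following. The file paths are hypothetical and only serve as an illustration; `rule_files` takes a list of paths, which may contain globs.

```yaml
global:
  # Default interval at which rule groups are evaluated.
  evaluation_interval: 1m

rule_files:
  # Hypothetical paths; adjust them to where your rule files live.
  - "rules/example.rules.yml"
  - "rules/*.alerts.yml"
```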

The rule files can be reloaded at runtime by sending `SIGHUP` to the Prometheus
process. The changes are only applied if all rule files are well-formatted.

_Note about native histograms (experimental feature): Native histograms are always
recorded as gauge histograms (for now). Most cases will create gauge histograms
naturally, e.g. after `rate()`._

## Syntax-checking rules

To quickly check whether a rule file is syntactically correct without starting
a Prometheus server, you can use Prometheus's `promtool` command-line
utility:

```bash
promtool check rules /path/to/example.rules.yml
```

The `promtool` binary is part of the `prometheus` archive offered on the
project's [download page](https://prometheus.io/download/).

When the file is syntactically valid, the checker prints a textual
representation of the parsed rules to standard output and then exits with
a `0` return status.

If there are any syntax errors or invalid input arguments, it prints an error 
message to standard error and exits with a `1` return status.

## Recording rules

Recording rules allow you to precompute frequently needed or computationally
expensive expressions and save their result as a new set of time series.
Querying the precomputed result will then often be much faster than executing
the original expression every time it is needed. This is especially useful for
dashboards, which need to query the same expression repeatedly every time they
refresh.

Recording and alerting rules exist in a rule group. Rules within a group are
run sequentially at a regular interval, with the same evaluation time.
The names of recording rules must be
[valid metric names](https://prometheus.io/docs/concepts/data_model/#metric-names-and-labels).
The names of alerting rules must be
[valid label values](https://prometheus.io/docs/concepts/data_model/#metric-names-and-labels).

The syntax of a rule file is:

```yaml
groups:
  [ - <rule_group> ]
```

A simple example rules file would be:

```yaml
groups:
  - name: example
    rules:
    - record: code:prometheus_http_requests_total:sum
      expr: sum by (code) (prometheus_http_requests_total)
```

### `<rule_group>`
```
# The name of the group. Must be unique within a file.
name: <string>

# How often rules in the group are evaluated.
[ interval: <duration> | default = global.evaluation_interval ]

# Limit the number of alerts an alerting rule and series a recording
# rule can produce. 0 is no limit.
[ limit: <int> | default = 0 ]

# Offset the rule evaluation timestamp of this particular group by the specified duration into the past.
[ query_offset: <duration> | default = global.rule_query_offset ]

rules:
  [ - <rule> ... ]
```

### `<rule>`

The syntax for recording rules is:

```
# The name of the time series to output to. Must be a valid metric name.
record: <string>

# The PromQL expression to evaluate. Every evaluation cycle this is
# evaluated at the current time, and the result recorded as a new set of
# time series with the metric name as given by 'record'.
expr: <string>

# Labels to add or overwrite before storing the result.
labels:
  [ <labelname>: <labelvalue> ]
```
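
To illustrate how these fields fit together, here is a sketch of a group containing one recording rule. The group name, the recorded metric name, and the `team` label are made up for this example; the expression assumes a Prometheus server that scrapes itself, as in the simple example above.

```yaml
groups:
  - name: example_http
    # Evaluate this group every 30 seconds instead of the global default.
    interval: 30s
    rules:
      - record: job:prometheus_http_requests:rate5m
        expr: sum by (job) (rate(prometheus_http_requests_total[5m]))
        labels:
          # Hypothetical static label attached to every recorded series.
          team: platform
```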

The syntax for alerting rules is:

```
# The name of the alert. Must be a valid label value.
alert: <string>

# The PromQL expression to evaluate. Every evaluation cycle this is
# evaluated at the current time, and all resultant time series become
# pending/firing alerts.
expr: <string>

# Alerts are considered firing once they have been returned for this long.
# Alerts which have not yet fired for long enough are considered pending.
[ for: <duration> | default = 0s ]

# How long an alert will continue firing after the condition that triggered it
# has cleared.
[ keep_firing_for: <duration> | default = 0s ]

# Labels to add or overwrite for each alert.
labels:
  [ <labelname>: <tmpl_string> ]

# Annotations to add to each alert.
annotations:
  [ <labelname>: <tmpl_string> ]
```
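
As an illustration, an alerting rule using these fields could look like the sketch below. The alert name, threshold, durations, and label and annotation values are all hypothetical; the expression again assumes a self-scraping Prometheus server.

```yaml
groups:
  - name: example_alerts
    rules:
      - alert: HighRequestRate
        expr: sum by (job) (rate(prometheus_http_requests_total[5m])) > 100
        # The condition must hold for 10 minutes before the alert fires.
        for: 10m
        # Keep firing for 5 minutes after the condition has cleared.
        keep_firing_for: 5m
        labels:
          severity: page
        annotations:
          summary: "High request rate on {{ $labels.job }}"
          description: "Request rate is {{ $value }} requests per second."
```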

See also the
[best practices for naming metrics created by recording rules](https://prometheus.io/docs/practices/rules/#recording-rules).

# Limiting alerts and series

A limit for alerts produced by alerting rules and series produced by recording
rules can be configured per group. When the limit is exceeded, _all_ series
produced by the rule are discarded, and if it's an alerting rule, _all_ alerts
for the rule, active, pending, or inactive, are cleared as well. The event will
be recorded as an error in the evaluation, and as such no stale markers are
written.
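
A sketch of a group with a limit configured might look like this. The group name and the value `100` are arbitrary choices for the example.

```yaml
groups:
  - name: example_limited
    # If a rule in this group produces more than 100 series (or alerts),
    # its evaluation fails and its output is discarded.
    limit: 100
    rules:
      - record: code:prometheus_http_requests_total:sum
        expr: sum by (code) (prometheus_http_requests_total)
```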

# Rule query offset

Offsetting rule queries into the past, via the per-group `query_offset` setting or the global `rule_query_offset` setting, is useful to ensure the underlying metrics have been received and stored in Prometheus. Metric availability delays are more likely to occur when Prometheus is running as a remote write target due to the nature of distributed systems, but can also occur when there are anomalies with scraping and/or short evaluation intervals.
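
For example, a group that evaluates its rules against data from one minute in the past could be sketched as follows. The group name and the offset value are hypothetical; pick an offset that matches the delay you actually observe.

```yaml
groups:
  - name: example_offset
    # Overrides global.rule_query_offset for this group only.
    query_offset: 1m
    rules:
      - record: code:prometheus_http_requests_total:sum
        expr: sum by (code) (prometheus_http_requests_total)
```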

# Failed rule evaluations due to slow evaluation

If a rule group hasn't finished evaluating before its next evaluation is supposed to start (as defined by the `evaluation_interval`), the next evaluation will be skipped. Subsequent evaluations of the rule group will continue to be skipped until the initial evaluation either completes or times out. When this happens, there will be a gap in the metric produced by the recording rule. The `rule_group_iterations_missed_total` metric will be incremented for each missed iteration of the rule group.
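
One way to notice missed iterations is to alert on that counter. The sketch below rests on two assumptions: that a self-scraping server exposes the metric with the `prometheus_` prefix, and that it carries a `rule_group` label. Verify both against your own server's `/metrics` output. The alert name and time windows are hypothetical.

```yaml
groups:
  - name: example_meta
    rules:
      - alert: RuleGroupIterationsMissed
        # Assumed metric name and label; check your server's /metrics output.
        expr: increase(prometheus_rule_group_iterations_missed_total[5m]) > 0
        for: 5m
        annotations:
          summary: "Rule group {{ $labels.rule_group }} is missing evaluations"
```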