prometheus/docs/querying/examples.md

---
title: Querying examples
nav_title: Examples
sort_rank: 4
---

# Query examples

## Simple time series selection

Return all time series with the metric `http_requests_total`:

    http_requests_total

Return all time series with the metric `http_requests_total` and the given
`job` and `handler` labels:

    http_requests_total{job="apiserver", handler="/api/comments"}

Return a whole range of time (in this case 5 minutes) for the same vector,
making it a range vector:

    http_requests_total{job="apiserver", handler="/api/comments"}[5m]

Note that an expression resulting in a range vector cannot be graphed directly,
but viewed in the tabular ("Console") view of the expression browser.

Using regular expressions, you could select time series only for jobs whose
name match a certain pattern, in this case, all jobs that end with `server`:

    http_requests_total{job=~".*server"}

All regular expressions in Prometheus use [RE2
syntax](https://github.com/google/re2/wiki/Syntax).

To select all HTTP status codes except 4xx ones, you could run:

    http_requests_total{status!~"4.."}

## Subquery

Return the 5-minute rate of the `http_requests_total` metric for the past 30 minutes, with a resolution of 1 minute.

    rate(http_requests_total[5m])[30m:1m]

This is an example of a nested subquery. The subquery for the `deriv` function uses the default resolution. Note that using subqueries unnecessarily is unwise.

    max_over_time(deriv(rate(distance_covered_total[5s])[30s:5s])[10m:])

## Using functions, operators, etc.

Return the per-second rate for all time series with the `http_requests_total`
metric name, as measured over the last 5 minutes:

    rate(http_requests_total[5m])

Assuming that the `http_requests_total` time series all have the labels `job`
(fanout by job name) and `instance` (fanout by instance of the job), we might
want to sum over the rate of all instances, so we get fewer output time series,
but still preserve the `job` dimension:

    sum(rate(http_requests_total[5m])) by (job)

If we have two different metrics with the same dimensional labels, we can apply
binary operators to them and elements on both sides with the same label set
will get matched and propagated to the output. For example, this expression
returns the unused memory in MiB for every instance (on a fictional cluster
scheduler exposing these metrics about the instances it runs):

    (instance_memory_limit_bytes - instance_memory_usage_bytes) / 1024 / 1024

The same expression, but summed by application, could be written like this:

    sum(
      instance_memory_limit_bytes - instance_memory_usage_bytes
    ) by (app, proc) / 1024 / 1024

If the same fictional cluster scheduler exposed CPU usage metrics like the
following for every instance:

    instance_cpu_time_ns{app="lion", proc="web", rev="34d0f99", env="prod", job="cluster-manager"}
    instance_cpu_time_ns{app="elephant", proc="worker", rev="34d0f99", env="prod", job="cluster-manager"}
    instance_cpu_time_ns{app="turtle", proc="api", rev="4d3a513", env="prod", job="cluster-manager"}
    instance_cpu_time_ns{app="fox", proc="widget", rev="4d3a513", env="prod", job="cluster-manager"}
    ...

...we could get the top 3 CPU users grouped by application (`app`) and process
type (`proc`) like this:

    topk(3, sum(rate(instance_cpu_time_ns[5m])) by (app, proc))

Assuming this metric contains one time series per running instance, you could
count the number of running instances per application like this:

    count(instance_cpu_time_ns) by (app)
Import querying documentation from prometheus/docs 2017-10-26 13:53:27 +00:00			`---`
			`title: Querying examples`
			`nav_title: Examples`
			`sort_rank: 4`
			`---`

			`# Query examples`

			`## Simple time series selection`

			Return all time series with the metric `http_requests_total`:

			`http_requests_total`

			Return all time series with the metric `http_requests_total` and the given
			`job` and `handler` labels:

			`http_requests_total{job="apiserver", handler="/api/comments"}`

			`Return a whole range of time (in this case 5 minutes) for the same vector,`
			`making it a range vector:`

			`http_requests_total{job="apiserver", handler="/api/comments"}[5m]`

			`Note that an expression resulting in a range vector cannot be graphed directly,`
			`but viewed in the tabular ("Console") view of the expression browser.`

			`Using regular expressions, you could select time series only for jobs whose`
Remove bits about substring matching from the docs (#5549) Substring matching for labels doesn't work anymore. These parts are misleading, and contradict with the rest of the documentation. Signed-off-by: Semyon Slepov <slepovss@gmail.com> 2019-05-09 10:12:32 +00:00			name match a certain pattern, in this case, all jobs that end with `server`:
Import querying documentation from prometheus/docs 2017-10-26 13:53:27 +00:00
Fix docs that use regexp anchors (#3504) Remove/fix docs that use anchors in label regexp matches. 2017-11-22 12:11:21 +00:00			`http_requests_total{job=~".*server"}`
Import querying documentation from prometheus/docs 2017-10-26 13:53:27 +00:00
Docs: state that all regular expressions are RE2. (#3518) We already mentioned that regular expressions are RE2 for [relabeling][0], but left open what the regular expression syntax anywhere else is. In the querying examples and reference, make it explicit that _all_ regular expressions are RE2. [0]: https://prometheus.io/docs/prometheus/latest/configuration/configuration/#relabel_config 2017-12-01 17:26:06 +00:00			`All regular expressions in Prometheus use [RE2`
			`syntax](https://github.com/google/re2/wiki/Syntax).`

Import querying documentation from prometheus/docs 2017-10-26 13:53:27 +00:00			`To select all HTTP status codes except 4xx ones, you could run:`

Fix docs that use regexp anchors (#3504) Remove/fix docs that use anchors in label regexp matches. 2017-11-22 12:11:21 +00:00			`http_requests_total{status!~"4.."}`
Import querying documentation from prometheus/docs 2017-10-26 13:53:27 +00:00
Subquery (#4831) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in> 2018-12-22 13:47:13 +00:00			`## Subquery`

docs/querying: improve wording (#5442) This commit improves the wording of the subquery examples and makes them more consistent with other examples. Signed-off-by: Lucas Servén Marín <lserven@gmail.com> 2019-04-09 16:48:41 +00:00			Return the 5-minute rate of the `http_requests_total` metric for the past 30 minutes, with a resolution of 1 minute.
Subquery (#4831) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in> 2018-12-22 13:47:13 +00:00
			`rate(http_requests_total[5m])[30m:1m]`

docs/querying: improve wording (#5442) This commit improves the wording of the subquery examples and makes them more consistent with other examples. Signed-off-by: Lucas Servén Marín <lserven@gmail.com> 2019-04-09 16:48:41 +00:00			This is an example of a nested subquery. The subquery for the `deriv` function uses the default resolution. Note that using subqueries unnecessarily is unwise.
Subquery (#4831) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in> 2018-12-22 13:47:13 +00:00
			`max_over_time(deriv(rate(distance_covered_total[5s])[30s:5s])[10m:])`

Import querying documentation from prometheus/docs 2017-10-26 13:53:27 +00:00			`## Using functions, operators, etc.`

			Return the per-second rate for all time series with the `http_requests_total`
			`metric name, as measured over the last 5 minutes:`

			`rate(http_requests_total[5m])`

			Assuming that the `http_requests_total` time series all have the labels `job`
			(fanout by job name) and `instance` (fanout by instance of the job), we might
			`want to sum over the rate of all instances, so we get fewer output time series,`
			but still preserve the `job` dimension:

			`sum(rate(http_requests_total[5m])) by (job)`

			`If we have two different metrics with the same dimensional labels, we can apply`
			`binary operators to them and elements on both sides with the same label set`
			`will get matched and propagated to the output. For example, this expression`
			`returns the unused memory in MiB for every instance (on a fictional cluster`
			`scheduler exposing these metrics about the instances it runs):`

			`(instance_memory_limit_bytes - instance_memory_usage_bytes) / 1024 / 1024`

			`The same expression, but summed by application, could be written like this:`

			`sum(`
			`instance_memory_limit_bytes - instance_memory_usage_bytes`
			`) by (app, proc) / 1024 / 1024`

			`If the same fictional cluster scheduler exposed CPU usage metrics like the`
			`following for every instance:`

			`instance_cpu_time_ns{app="lion", proc="web", rev="34d0f99", env="prod", job="cluster-manager"}`
			`instance_cpu_time_ns{app="elephant", proc="worker", rev="34d0f99", env="prod", job="cluster-manager"}`
			`instance_cpu_time_ns{app="turtle", proc="api", rev="4d3a513", env="prod", job="cluster-manager"}`
			`instance_cpu_time_ns{app="fox", proc="widget", rev="4d3a513", env="prod", job="cluster-manager"}`
			`...`

			...we could get the top 3 CPU users grouped by application (`app`) and process
			type (`proc`) like this:

			`topk(3, sum(rate(instance_cpu_time_ns[5m])) by (app, proc))`

			`Assuming this metric contains one time series per running instance, you could`
			`count the number of running instances per application like this:`

			`count(instance_cpu_time_ns) by (app)`