prometheus/docs/storage.md

---
title: Storage
sort_rank: 5
---

# Storage

Prometheus includes a local on-disk time series database, but also optionally integrates with remote storage systems.

## Local storage

Prometheus's local time series database stores time series data in a custom format on disk.

### On-disk layout

Ingested samples are grouped into blocks of two hours. Each two-hour block consists of a directory containing one or more chunk files that contain all time series samples for that window of time, as well as a metadata file and index file (which indexes metric names and labels to time series in the chunk files).  When series are deleted via the API, deletion records are stored in separate tombstone files (instead of deleting the data immediately from the chunk files).

The block for currently incoming samples is kept in memory and not fully persisted yet. It is secured against crashes by a write-ahead-log (WAL) that can be replayed when the Prometheus server restarts after a crash. Write-ahead log files are stored in the `wal` directory in 128MB segments. These files contain raw data that has not been compacted yet, so they are significantly larger than regular block files. Prometheus will keep a minimum of 3 write-ahead log files, however high-traffic servers may see more than three WAL files since it needs to keep at least two hours worth of raw data.

The directory structure of a Prometheus server's data directory will look something like this:

```
./data/01BKGV7JBM69T2G1BGBGM6KB12
./data/01BKGV7JBM69T2G1BGBGM6KB12/meta.json
./data/01BKGTZQ1SYQJTR4PB43C8PD98
./data/01BKGTZQ1SYQJTR4PB43C8PD98/meta.json
./data/01BKGTZQ1SYQJTR4PB43C8PD98/index
./data/01BKGTZQ1SYQJTR4PB43C8PD98/chunks
./data/01BKGTZQ1SYQJTR4PB43C8PD98/chunks/000001
./data/01BKGTZQ1SYQJTR4PB43C8PD98/tombstones
./data/01BKGTZQ1HHWHV8FBJXW1Y3W0K
./data/01BKGTZQ1HHWHV8FBJXW1Y3W0K/meta.json
./data/01BKGV7JC0RY8A6MACW02A2PJD
./data/01BKGV7JC0RY8A6MACW02A2PJD/meta.json
./data/01BKGV7JC0RY8A6MACW02A2PJD/index
./data/01BKGV7JC0RY8A6MACW02A2PJD/chunks
./data/01BKGV7JC0RY8A6MACW02A2PJD/chunks/000001
./data/01BKGV7JC0RY8A6MACW02A2PJD/tombstones
./data/wal/00000000
./data/wal/00000001
./data/wal/00000002
```

The initial two-hour blocks are eventually compacted into longer blocks in the background.

Note that a limitation of the local storage is that it is not clustered or replicated. Thus, it is not arbitrarily scalable or durable in the face of disk or node outages and should thus be treated as more of an ephemeral sliding window of recent data. However, if your durability requirements are not strict, you may still succeed in storing up to years of data in the local storage.

For further details on file format, see [TSDB format](https://github.com/prometheus/tsdb/blob/master/docs/format/README.md).

## Operational aspects

Prometheus has several flags that allow configuring the local storage. The most important ones are:

* `--storage.tsdb.path`: This determines where Prometheus writes its database. Defaults to `data/`.
* `--storage.tsdb.retention.time`: This determines when to remove old data. Defaults to `15d`. Overrides `storage.tsdb.retention` if this flag is set to anything other than default.
* `--storage.tsdb.retention.size`: [EXPERIMENTAL] This determines the maximum number of bytes that storage blocks can use (note that this does not include the WAL size, which can be substantial). The oldest data will be removed first. Defaults to `0` or disabled. This flag is experimental and can be changed in future releases. Units supported: KB, MB, GB, PB. Ex: "512MB"
* `--storage.tsdb.retention`: This flag has been deprecated in favour of `storage.tsdb.retention.time`.

On average, Prometheus uses only around 1-2 bytes per sample. Thus, to plan the capacity of a Prometheus server, you can use the rough formula:

```
needed_disk_space = retention_time_seconds * ingested_samples_per_second * bytes_per_sample
```

To tune the rate of ingested samples per second, you can either reduce the number of time series you scrape (fewer targets or fewer series per target), or you can increase the scrape interval. However, reducing the number of series is likely more effective, due to compression of samples within a series.

If your local storage becomes corrupted for whatever reason, your best bet is to shut down Prometheus and remove the entire storage directory. Non POSIX compliant filesystems are not supported by Prometheus's local storage, corruptions may happen, without possibility to recover. However, you can also try removing individual block directories to resolve the problem. This means losing a time window of around two hours worth of data per block directory. Again, Prometheus's local storage is not meant as durable long-term storage.

If both time and size retention policies are specified, whichever policy triggers first will be used at that instant.

## Remote storage integrations

Prometheus's local storage is limited by single nodes in its scalability and durability. Instead of trying to solve clustered storage in Prometheus itself, Prometheus has a set of interfaces that allow integrating with remote storage systems.

### Overview

Prometheus integrates with remote storage systems in two ways:

* Prometheus can write samples that it ingests to a remote URL in a standardized format.
* Prometheus can read (back) sample data from a remote URL in a standardized format.

![Remote read and write architecture](images/remote_integrations.png)

The read and write protocols both use a snappy-compressed protocol buffer encoding over HTTP. The protocols are not considered as stable APIs yet and may change to use gRPC over HTTP/2 in the future, when all hops between Prometheus and the remote storage can safely be assumed to support HTTP/2.

For details on configuring remote storage integrations in Prometheus, see the [remote write](configuration/configuration.md#remote_write) and [remote read](configuration/configuration.md#remote_read) sections of the Prometheus configuration documentation.

For details on the request and response messages, see the [remote storage protocol buffer definitions](https://github.com/prometheus/prometheus/blob/master/prompb/remote.proto).

Note that on the read path, Prometheus only fetches raw series data for a set of label selectors and time ranges from the remote end. All PromQL evaluation on the raw data still happens in Prometheus itself. This means that remote read queries have some scalability limit, since all necessary data needs to be loaded into the querying Prometheus server first and then processed there. However, supporting fully distributed evaluation of PromQL was deemed infeasible for the time being.

### Existing integrations

To learn more about existing integrations with remote storage systems, see the [Integrations documentation](https://prometheus.io/docs/operating/integrations/#remote-endpoints-and-storage).
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00			`---`
			`title: Storage`
			`sort_rank: 5`
			`---`

			`# Storage`

			`Prometheus includes a local on-disk time series database, but also optionally integrates with remote storage systems.`

			`## Local storage`

			`Prometheus's local time series database stores time series data in a custom format on disk.`

			`### On-disk layout`

Update WAL directory structure, and mention that the last three are always kept (#5096) * Small tweaks to storage doc page Signed-off-by: Daniel Lo Nigro <d@d.sb> 2019-01-15 10:32:29 +00:00			`Ingested samples are grouped into blocks of two hours. Each two-hour block consists of a directory containing one or more chunk files that contain all time series samples for that window of time, as well as a metadata file and index file (which indexes metric names and labels to time series in the chunk files). When series are deleted via the API, deletion records are stored in separate tombstone files (instead of deleting the data immediately from the chunk files).`

			The block for currently incoming samples is kept in memory and not fully persisted yet. It is secured against crashes by a write-ahead-log (WAL) that can be replayed when the Prometheus server restarts after a crash. Write-ahead log files are stored in the `wal` directory in 128MB segments. These files contain raw data that has not been compacted yet, so they are significantly larger than regular block files. Prometheus will keep a minimum of 3 write-ahead log files, however high-traffic servers may see more than three WAL files since it needs to keep at least two hours worth of raw data.
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00
			`The directory structure of a Prometheus server's data directory will look something like this:`

			```
			`./data/01BKGV7JBM69T2G1BGBGM6KB12`
			`./data/01BKGV7JBM69T2G1BGBGM6KB12/meta.json`
			`./data/01BKGTZQ1SYQJTR4PB43C8PD98`
			`./data/01BKGTZQ1SYQJTR4PB43C8PD98/meta.json`
			`./data/01BKGTZQ1SYQJTR4PB43C8PD98/index`
			`./data/01BKGTZQ1SYQJTR4PB43C8PD98/chunks`
			`./data/01BKGTZQ1SYQJTR4PB43C8PD98/chunks/000001`
			`./data/01BKGTZQ1SYQJTR4PB43C8PD98/tombstones`
			`./data/01BKGTZQ1HHWHV8FBJXW1Y3W0K`
			`./data/01BKGTZQ1HHWHV8FBJXW1Y3W0K/meta.json`
			`./data/01BKGV7JC0RY8A6MACW02A2PJD`
			`./data/01BKGV7JC0RY8A6MACW02A2PJD/meta.json`
			`./data/01BKGV7JC0RY8A6MACW02A2PJD/index`
			`./data/01BKGV7JC0RY8A6MACW02A2PJD/chunks`
			`./data/01BKGV7JC0RY8A6MACW02A2PJD/chunks/000001`
			`./data/01BKGV7JC0RY8A6MACW02A2PJD/tombstones`
Update WAL directory structure, and mention that the last three are always kept (#5096) * Small tweaks to storage doc page Signed-off-by: Daniel Lo Nigro <d@d.sb> 2019-01-15 10:32:29 +00:00			`./data/wal/00000000`
			`./data/wal/00000001`
			`./data/wal/00000002`
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00			```

			`The initial two-hour blocks are eventually compacted into longer blocks in the background.`

			`Note that a limitation of the local storage is that it is not clustered or replicated. Thus, it is not arbitrarily scalable or durable in the face of disk or node outages and should thus be treated as more of an ephemeral sliding window of recent data. However, if your durability requirements are not strict, you may still succeed in storing up to years of data in the local storage.`

Add link to TSDB format page. (#4402) Signed-off-by: Catalin Patulea <catalinp@google.com> 2018-07-28 07:02:03 +00:00			`For further details on file format, see [TSDB format](https://github.com/prometheus/tsdb/blob/master/docs/format/README.md).`

Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00			`## Operational aspects`

			`Prometheus has several flags that allow configuring the local storage. The most important ones are:`

			* `--storage.tsdb.path`: This determines where Prometheus writes its database. Defaults to `data/`.
Add flag for size based retention (#5109) * Add flag for size based retention Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Deprecate the old retention flag for a new one. Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Add ability to take a suffix for size flag Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Address feedback Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> 2019-01-18 13:48:36 +00:00			* `--storage.tsdb.retention.time`: This determines when to remove old data. Defaults to `15d`. Overrides `storage.tsdb.retention` if this flag is set to anything other than default.
			* `--storage.tsdb.retention.size`: [EXPERIMENTAL] This determines the maximum number of bytes that storage blocks can use (note that this does not include the WAL size, which can be substantial). The oldest data will be removed first. Defaults to `0` or disabled. This flag is experimental and can be changed in future releases. Units supported: KB, MB, GB, PB. Ex: "512MB"
			* `--storage.tsdb.retention`: This flag has been deprecated in favour of `storage.tsdb.retention.time`.
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00
			`On average, Prometheus uses only around 1-2 bytes per sample. Thus, to plan the capacity of a Prometheus server, you can use the rough formula:`

			```
			`needed_disk_space = retention_time_seconds * ingested_samples_per_second * bytes_per_sample`
			```

			`To tune the rate of ingested samples per second, you can either reduce the number of time series you scrape (fewer targets or fewer series per target), or you can increase the scrape interval. However, reducing the number of series is likely more effective, due to compression of samples within a series.`

docs: Add filesystem POSIX requirement Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com> 2019-01-29 12:51:16 +00:00			If your local storage becomes corrupted for whatever reason, your best bet is to shut down Prometheus and remove the entire storage directory. Non POSIX compliant filesystems are not supported by Prometheus's local storage, corruptions may happen, without possibility to recover. However, you can also try removing individual block directories to resolve the problem. This means losing a time window of around two hours worth of data per block directory. Again, Prometheus's local storage is not meant as durable long-term storage.
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00
Add flag for size based retention (#5109) * Add flag for size based retention Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Deprecate the old retention flag for a new one. Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Add ability to take a suffix for size flag Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Address feedback Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> 2019-01-18 13:48:36 +00:00			`If both time and size retention policies are specified, whichever policy triggers first will be used at that instant.`

Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00			`## Remote storage integrations`

Improve wording of remote write documentation. (#3817) Reduce the use of the term `long-term`, when what we're really talking about is remote clustered storage for increased capacity and durability. Signed-off-by: Ben Kochie <superq@gmail.com> 2018-05-05 15:38:45 +00:00			`Prometheus's local storage is limited by single nodes in its scalability and durability. Instead of trying to solve clustered storage in Prometheus itself, Prometheus has a set of interfaces that allow integrating with remote storage systems.`
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00
			`### Overview`

			`Prometheus integrates with remote storage systems in two ways:`

			`* Prometheus can write samples that it ingests to a remote URL in a standardized format.`
			`* Prometheus can read (back) sample data from a remote URL in a standardized format.`

			`![Remote read and write architecture](images/remote_integrations.png)`

Improve wording of remote write documentation. (#3817) Reduce the use of the term `long-term`, when what we're really talking about is remote clustered storage for increased capacity and durability. Signed-off-by: Ben Kochie <superq@gmail.com> 2018-05-05 15:38:45 +00:00			`The read and write protocols both use a snappy-compressed protocol buffer encoding over HTTP. The protocols are not considered as stable APIs yet and may change to use gRPC over HTTP/2 in the future, when all hops between Prometheus and the remote storage can safely be assumed to support HTTP/2.`
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00
			`For details on configuring remote storage integrations in Prometheus, see the [remote write](configuration/configuration.md#remote_write) and [remote read](configuration/configuration.md#remote_read) sections of the Prometheus configuration documentation.`

Wrong URL for remote.proto (#3431) Change wrong URL for remote.proto 2017-11-08 10:38:39 +00:00			`For details on the request and response messages, see the [remote storage protocol buffer definitions](https://github.com/prometheus/prometheus/blob/master/prompb/remote.proto).`
Add 2.0 storage docs (#3423) * Add 2.0 storage docs * Review fixups * More review fixups 2017-11-07 21:00:38 +00:00
			`Note that on the read path, Prometheus only fetches raw series data for a set of label selectors and time ranges from the remote end. All PromQL evaluation on the raw data still happens in Prometheus itself. This means that remote read queries have some scalability limit, since all necessary data needs to be loaded into the querying Prometheus server first and then processed there. However, supporting fully distributed evaluation of PromQL was deemed infeasible for the time being.`

			`### Existing integrations`

			`To learn more about existing integrations with remote storage systems, see the [Integrations documentation](https://prometheus.io/docs/operating/integrations/#remote-endpoints-and-storage).`