prometheus

mirror of https://github.com/prometheus/prometheus synced 2024-12-28 17:52:22 +00:00

Author	SHA1	Message	Date
Julius Volz	6dc36d0c3e	Don't keep extra labels in aggregations by default. MIN/MAX/SUM/AVG/COUNT aggregations will now by default drop all labels that are not specifically part of a BY-clause, even if a label value is the same within all timeseries of an aggregation group. The old behavior of keeping extra labels may still be switched on by adding KEEPING_EXTRA to the end of an aggregation statement: sum(http_requests) by (job, method) keeping_extra I'm open to better syntax/naming suggestions. Change-Id: I21d3fe7af9e98552ce3dffa3ce7c0a4ba4c0b4a4	2013-12-16 12:53:10 +01:00
Julius Volz	740d448983	Use custom timestamp type for sample timestamps and related code. So far we've been using Go's native time.Time for anything related to sample timestamps. Since the range of time.Time is much bigger than what we need, this has created two problems: - there could be time.Time values which were out of the range/precision of the time type that we persist to disk, therefore causing incorrectly ordered keys. One bug caused by this was: https://github.com/prometheus/prometheus/issues/367 It would be good to use a timestamp type that's more closely aligned with what the underlying storage supports. - sizeof(time.Time) is 192, while Prometheus should be ok with a single 64-bit Unix timestamp (possibly even a 32-bit one). Since we store samples in large numbers, this seriously affects memory usage. Furthermore, copying/working with the data will be faster if it's smaller. MEMORY USAGE RESULTS Initial memory usage comparisons for a running Prometheus with 1 timeseries and 100,000 samples show roughly a 13% decrease in total (VIRT) memory usage. In my tests, this advantage for some reason decreased a bit the more samples the timeseries had (to 5-7% for millions of samples). This I can't fully explain, but perhaps garbage collection issues were involved. WHEN TO USE THE NEW TIMESTAMP TYPE The new clientmodel.Timestamp type should be used whenever time calculations are either directly or indirectly related to sample timestamps. For example: - the timestamp of a sample itself - all kinds of watermarks - anything that may become or is compared to a sample timestamp (like the timestamp passed into Target.Scrape()). When to still use time.Time: - for measuring durations/times not related to sample timestamps, like duration telemetry exporting, timers that indicate how frequently to execute some action, etc. NOTE ON OPERATOR OPTIMIZATION TESTS We don't use operator optimization code anymore, but it still lives in the code as dead code. It still has tests, but I couldn't get all of them to pass with the new timestamp format. I commented out the failing cases for now, but we should probably remove the dead code soon. I just didn't want to do that in the same change as this. Change-Id: I821787414b0debe85c9fffaeb57abd453727af0f	2013-12-03 09:11:28 +01:00
Julius Volz	be8024e18c	Add scalar() function. Change-Id: I1d1183e926a18fc98c9e94bbb9a808a3fb313102	2013-09-17 15:01:16 +02:00
Matt T. Proud	7db518d3a0	Abstract high watermark cache into standard LRU. Conflicts: storage/metric/memory.go storage/metric/tiered.go storage/metric/watermark.go Change-Id: Iab2aedbd8f83dc4ce633421bd4a55990fa026b85	2013-08-19 12:26:55 +02:00
Julius Volz	0003027dce	Add needed trailing spaces in logs.	2013-08-12 18:22:48 +02:00
Julius Volz	aa5d251f8d	Use github.com/golang/glog for all logging.	2013-08-12 17:54:36 +02:00
Julius Volz	81f0b85013	Return [] instead of null for empty result vectors.	2013-07-25 12:16:32 +02:00
Julius Volz	d2da21121c	Implement getValueRangeAtIntervalOp for faster range queries. This also short-circuits optimize() for now, since it is complex to implement for the new operator, and ops generated by the query layer already fulfill the needed invariants. We should still investigate later whether to completely delete operator optimization code or extend it to support getValueRangeAtIntervalOp operators.	2013-06-26 18:10:36 +02:00
Matt T. Proud	30b1cf80b5	WIP - Snapshot of Moving to Client Model.	2013-06-25 15:52:42 +02:00
Julius Volz	0226d1ac7a	Implement alerts dashboard and expression console links.	2013-06-13 22:35:40 +02:00
Julius Volz	fc97e688c6	Improve printing of rules and expressions.	2013-06-11 11:39:31 +02:00
Julius Volz	74cb676537	Implement Stringer interface for rules and all their children.	2013-06-07 15:54:32 +02:00
Julius Volz	51689d965d	Add debug timers to instant and range queries. This adds timers around several query-relevant code blocks. For now, the query timer stats are only logged for queries initiated through the UI. In other cases (rule evaluations), the stats are simply thrown away. My hope is that this helps us understand where queries spend time, especially in cases where they sometimes hang for unusual amounts of time.	2013-06-05 18:32:54 +02:00
Julius Volz	138334fb31	Fix handling of negative deltas for non-counter values.	2013-05-28 17:36:53 +02:00
Julius Volz	66d4620061	Don't assume delta has at least one sample per vector element.	2013-05-28 14:02:36 +02:00
Julius Volz	21c3be0814	Skip any empty range/boundary elements, not only nil ones.	2013-05-28 14:02:08 +02:00
Matt T. Proud	c10780c966	Introduce telemetry for rule evaluator durations. This commit adds telemetry for the Prometheus expression rule evaluator, which will enable meta-Prometheus monitoring of customers to ensure that no instance is falling behind in answering routine queries. A few other sundry simplifications are introduced, too.	2013-05-23 21:29:27 +02:00
Julius Volz	750f862d9a	Use GetBoundaryValues() for non-counter deltas.	2013-05-22 19:13:47 +02:00
Julius Volz	5b105c77fc	Repointerize fingerprints.	2013-05-21 14:28:14 +02:00
Matt T. Proud	8f4c7ece92	Destroy naked returns in half of corpus. The use of naked return values is frowned upon. This is the first of two bulk updates to remove them.	2013-05-16 10:53:25 +03:00
Julius Volz	83c60ad43a	Fix GetMetricForFingerprint() metric mutability. Some users of GetMetricForFingerprint() end up modifying the returned metric labelset. Since the memory storage's implementation of GetMetricForFingerprint() returned a pointer to the metric (and maps are reference types anyways), the external mutation propagated back into the memory storage. The fix is to make a copy of the metric before returning it.	2013-05-14 16:46:30 +02:00
Julius Volz	0877680761	Implement a COUNT ... BY aggregation operator. This also removes the now obsolete scalar count() function and corrects the expressions test naming (broken in `2202cd71c9 (L6R59)`) so that the expression tests will actually run.	2013-05-08 16:35:16 +02:00
Julius Volz	56324d8ce2	Make AST query storage non-global.	2013-05-07 13:15:10 +02:00
Matt T. Proud	ce45787dbf	Storage interface to TieredStorage. This commit drops the Storage interface and just replaces it with a publicized TieredStorage type. Storage had been anticipated to be used as a wrapper for testability but just was not used due to practicality. Merely overengineered. My bad. Anyway, we will eventually instantiate the TieredStorage dependencies in main.go and pass them in for more intelligent lifecycle management. These changes will pave the way for managing the curators without Law of Demeter violations.	2013-05-03 15:54:14 +02:00
Julius Volz	dcf2e82752	Cleanup and idiomaticize rule/expression dot graph output.	2013-04-29 12:57:34 +02:00
Julius Volz	99dcbe0f94	Integrate memory and disk layers in view rendering.	2013-04-19 16:01:27 +02:00
Julius Volz	63625bd244	Make view use memory persistence, remove obsolete code. This makes the memory persistence the backing store for views and adjusts the MetricPersistence interface accordingly. It also removes unused Get* method implementations from the LevelDB persistence so they don't need to be adapted to the new interface. In the future, we should rethink these interfaces. All staleness and interpolation handling is now removed from the storage layer and will be handled only by the query layer in the future.	2013-04-18 22:26:29 +02:00
Julius Volz	5f5ea03105	Run "make format".	2013-04-16 17:23:59 +02:00
Julius Volz	1cff4f3d91	Fix rate() per-second adjustment. This got broken during the depointerization of the Vector type.	2013-04-15 14:41:34 +02:00
juliusv	62f33f1fc2	Merge pull request #138 from prometheus/julius-fix-aliasing Correct delta()/rate() intervals and temporal aliasing.	2013-04-15 05:38:48 -07:00
Julius Volz	d53b8cf956	Correct delta()/rate() intervals and temporal aliasing.	2013-04-15 12:30:46 +02:00
Julius Volz	a0d311c9e6	Constantize job name label.	2013-04-15 11:47:54 +02:00
juliusv	f9c291120f	Merge pull request #123 from prometheus/julius-propagate-rule-errors Propagate more errors during rule evaluation.	2013-04-11 06:38:33 -07:00
Julius Volz	6cb3c51d24	Add sort() and sort_desc() expression language functions.	2013-04-10 18:05:45 +02:00
Julius Volz	c4d0969c00	Propagate more errors during rule evaluation.	2013-04-09 13:47:20 +02:00
Julius Volz	ec413459fa	Depointerize Matrix/Vector types as well as time.Time arguments.	2013-03-28 18:07:12 +01:00
Julius Volz	676845afaf	Implement sample interpolation in query layer.	2013-03-28 16:41:51 +01:00
Matt T. Proud	c53a72a894	Test data for the curator.	2013-03-27 18:13:43 +01:00
Julius Volz	b836066c71	Eliminate need to get fingerprints during query execution time.	2013-03-27 14:42:03 +01:00
Julius Volz	2b8f0b2cc7	Constantize metric name label name.	2013-03-26 16:20:23 +01:00
Julius Volz	3880a86c9c	In case of empty query results, return an empty matrix.	2013-03-25 12:14:48 +01:00
Julius Volz	8e4c5b0cea	Use AST query analyzer and views with tiered storage.	2013-03-21 18:16:52 +01:00
Julius Volz	2f814d0e6d	AST persistence adapter simplifications after storage changes.	2013-03-21 18:11:03 +01:00
Julius Volz	6001d22f87	Change Get* methods to receive fingerprints instead of metrics.	2013-03-21 18:11:03 +01:00
Matt T. Proud	5959cd9e53	Include Julius' feedback.	2013-03-21 18:08:48 +01:00
Matt T. Proud	a70ee43ad3	Niladic ``ToString()` `to idiomatic` `String()``.	2013-03-21 18:08:47 +01:00
Matt T. Proud	13ae29b304	Initial in-memory arena implementation. It is unbounded, and nothing uses it except for a gating flag in main.	2013-02-18 09:38:14 -06:00
Julius Volz	c3d31febd6	Move durationToString to common place and cleanup error handling.	2013-02-14 19:02:23 +01:00
Matt T. Proud	efbe0e8a12	Interface simplification. GetMetricForFingerprint(model.Fingerprint) (*Metric, error) -> GetMetricForFingerprint(model.Fingerprint) (Metric, error)	2013-02-14 08:43:02 -08:00
Matt T. Proud	e8a733b525	Interface simplifications. GetFingerprintsForLabelSet ([]*Fingerprint, error) -> GetFingerprintsForLabelSet ([]Fingerprint, error)	2013-02-14 08:07:59 -08:00

1 2

69 Commits