prometheus

Commit Graph

Author	SHA1	Message	Date
Julius Volz	35ee2cd3cb	Add alertmanager notification support to Prometheus. Alert definitions now also have mandatory SUMMARY and DESCRIPTION fields that get sent along a firing alert to the alert manager.	2013-07-30 17:23:41 +02:00
Julius Volz	81f0b85013	Return [] instead of null for empty result vectors.	2013-07-25 12:16:32 +02:00
Julius Volz	64b0ade171	Swap rules lexer for much faster one. This swaps github.com/kivikakk/golex for github.com/cznic/golex. The old lexer would have taken 3.5 years to load a set of 5000 test rules (quadratic time complexity for input length), whereas this one takes only 32ms. Furthermore, since the new lexer is embedded differently, this gets rid of the global parser variables and makes the rule loader fully reentrant without a lock.	2013-07-11 19:35:29 +02:00
Julius Volz	d2da21121c	Implement getValueRangeAtIntervalOp for faster range queries. This also short-circuits optimize() for now, since it is complex to implement for the new operator, and ops generated by the query layer already fulfill the needed invariants. We should still investigate later whether to completely delete operator optimization code or extend it to support getValueRangeAtIntervalOp operators.	2013-06-26 18:10:36 +02:00
Matt T. Proud	30b1cf80b5	WIP - Snapshot of Moving to Client Model.	2013-06-25 15:52:42 +02:00
Julius Volz	8ee7947b1e	Ensure metric name is dropped correctly from alert labels in UI.	2013-06-14 13:03:19 +02:00
Julius Volz	0226d1ac7a	Implement alerts dashboard and expression console links.	2013-06-13 22:35:40 +02:00
Julius Volz	ba29d07901	Show loaded rules in Status dashboard.	2013-06-11 11:39:31 +02:00
Julius Volz	fc97e688c6	Improve printing of rules and expressions.	2013-06-11 11:39:31 +02:00
Julius Volz	74cb676537	Implement Stringer interface for rules and all their children.	2013-06-07 15:54:32 +02:00
Matt T. Proud	2c3df44af6	Ensure database access waits until it is started. This commit introduces a channel message to ensure serving state has been reached with the storage stack before anything attempts to use it.	2013-06-06 10:42:21 +02:00
Julius Volz	51689d965d	Add debug timers to instant and range queries. This adds timers around several query-relevant code blocks. For now, the query timer stats are only logged for queries initiated through the UI. In other cases (rule evaluations), the stats are simply thrown away. My hope is that this helps us understand where queries spend time, especially in cases where they sometimes hang for unusual amounts of time.	2013-06-05 18:32:54 +02:00
Julius Volz	adb87816f4	Put RuleManager concurrency in hands of caller, fix races.	2013-06-05 13:56:56 +02:00
Julius Volz	138334fb31	Fix handling of negative deltas for non-counter values.	2013-05-28 17:36:53 +02:00
Julius Volz	66d4620061	Don't assume delta has at least one sample per vector element.	2013-05-28 14:02:36 +02:00
Julius Volz	21c3be0814	Skip any empty range/boundary elements, not only nil ones.	2013-05-28 14:02:08 +02:00
Matt T. Proud	c10780c966	Introduce telemetry for rule evaluator durations. This commit adds telemetry for the Prometheus expression rule evaluator, which will enable meta-Prometheus monitoring of customers to ensure that no instance is falling behind in answering routine queries. A few other sundry simplifications are introduced, too.	2013-05-23 21:29:27 +02:00
Julius Volz	750f862d9a	Use GetBoundaryValues() for non-counter deltas.	2013-05-22 19:13:47 +02:00
Julius Volz	5b105c77fc	Repointerize fingerprints.	2013-05-21 14:28:14 +02:00
Matt T. Proud	8f4c7ece92	Destroy naked returns in half of corpus. The use of naked return values is frowned upon. This is the first of two bulk updates to remove them.	2013-05-16 10:53:25 +03:00
juliusv	516101f015	Merge pull request #250 from prometheus/refactor/drop-unused-storage-setting Drop unused writeMemoryInterval	2013-05-14 08:45:59 -07:00
juliusv	9ff00b651d	Merge pull request #251 from prometheus/fix/memory-metric-mutability Fix GetMetricForFingerprint() metric mutability.	2013-05-14 08:12:45 -07:00
Bernerd Schaefer	63d9988b9c	Drop unused writeMemoryInterval	2013-05-14 17:03:03 +02:00
Bernerd Schaefer	aa96c7d141	Fix rules_test.go. This is smelly, but for now we copy a helper method from the metric tests into rules.	2013-05-14 16:55:18 +02:00
Julius Volz	83c60ad43a	Fix GetMetricForFingerprint() metric mutability. Some users of GetMetricForFingerprint() end up modifying the returned metric labelset. Since the memory storage's implementation of GetMetricForFingerprint() returned a pointer to the metric (and maps are reference types anyways), the external mutation propagated back into the memory storage. The fix is to make a copy of the metric before returning it.	2013-05-14 16:46:30 +02:00
Bernerd Schaefer	428d91c86f	Rename test helper files to helpers_test.go. This ensures that these files are properly included only in testing.	2013-05-14 16:30:47 +02:00
Matt T. Proud	244a4a9cdb	Update to go1.1. This commit updates the documentation, Makefiles, formatting, and code semantics to support the 1.1 runtime, which includes ... 1. ``make advice``, 2. ``make format``, and 3. ``go fix`` on various targets.	2013-05-14 12:39:08 +02:00
Matt T. Proud	161c8fbf9b	Include deletion processor for long-tail values. This commit extracts the model.Values truncation behavior into the actual tiered storage, which uses it and behaves in a peculiar way—notably the retention of previous elements if the chunk were to ever go empty. This is done to enable interpolation between sparse sample values in the evaluation cycle. Nothing necessarily new here—just an extraction. Now, the model.Values TruncateBefore functionality would do what a user would expect without any surprises, which is required for the DeletionProcessor, which may decide to split a large chunk in two if it determines that the chunk contains the cut-off time.	2013-05-10 12:19:12 +02:00
Julius Volz	0877680761	Implement a COUNT ... BY aggregation operator. This also removes the now obsolete scalar count() function and corrects the expressions test naming (broken in `2202cd71c9 (L6R59)`) so that the expression tests will actually run.	2013-05-08 16:35:16 +02:00
Julius Volz	56324d8ce2	Make AST query storage non-global.	2013-05-07 13:15:10 +02:00
Matt T. Proud	ce45787dbf	Storage interface to TieredStorage. This commit drops the Storage interface and just replaces it with a publicized TieredStorage type. Storage had been anticipated to be used as a wrapper for testability but just was not used due to practicality. Merely overengineered. My bad. Anyway, we will eventually instantiate the TieredStorage dependencies in main.go and pass them in for more intelligent lifecycle management. These changes will pave the way for managing the curators without Law of Demeter violations.	2013-05-03 15:54:14 +02:00
Julius Volz	9cea5d9df8	Convert the Prometheus configuration to protocol buffers.	2013-04-30 22:26:00 +02:00
Julius Volz	d8110fcd9c	Send sample arrays instead of single samples over channels.	2013-04-29 17:24:17 +02:00
Julius Volz	dcf2e82752	Cleanup and idiomaticize rule/expression dot graph output.	2013-04-29 12:57:34 +02:00
Matt T. Proud	b3e34c6658	Implement batch database sample curator. This commit introduces to Prometheus a batch database sample curator, which corroborates the high watermarks for sample series against the curation watermark table to see whether a curator of a given type needs to be run. The curator is an abstract executor, which runs various curation strategies across the database. It remarks the progress for each type of curation processor that runs for a given sample series. A curation processor is responsible for effectuating the underlying batch changes that are requested. In this commit, we introduce the CompactionProcessor, which takes several bits of runtime metadata and combines sparse sample entries in the database together to form larger groups. For instance, for a given series it would be possible to have the curator effectuate the following grouping: - Samples Older than Two Weeks: Grouped into Bunches of 10000 - Samples Older than One Week: Grouped into Bunches of 1000 - Samples Older than One Day: Grouped into Bunches of 100 - Samples Older than One Hour: Grouped into Bunches of 10 The benefits of such a compaction are 1. a smaller search space in the database keyspace, 2. better employment of compression for repetitious values, and 3. reduced seek times.	2013-04-27 17:38:18 +02:00
Julius Volz	2202cd71c9	Track alerts over time and write out alert timeseries.	2013-04-26 14:35:21 +02:00
Julius Volz	c0601abf46	Implement initial no-op alert parsing and rule parsing tests.	2013-04-23 13:48:24 +02:00
Matt T. Proud	f9e99bd08a	Refresh SampleValue to 64-bit floating point. We always knew that this needed to be fixed.	2013-04-21 20:31:50 +02:00
Julius Volz	99dcbe0f94	Integrate memory and disk layers in view rendering.	2013-04-19 16:01:27 +02:00
Julius Volz	63625bd244	Make view use memory persistence, remove obsolete code. This makes the memory persistence the backing store for views and adjusts the MetricPersistence interface accordingly. It also removes unused Get* method implementations from the LevelDB persistence so they don't need to be adapted to the new interface. In the future, we should rethink these interfaces. All staleness and interpolation handling is now removed from the storage layer and will be handled only by the query layer in the future.	2013-04-18 22:26:29 +02:00
Julius Volz	1eb586db7d	Fix rule evaluation closure.	2013-04-17 15:11:21 +02:00
Julius Volz	5f5ea03105	Run "make format".	2013-04-16 17:23:59 +02:00
Julius Volz	1cff4f3d91	Fix rate() per-second adjustment. This got broken during the depointerization of the Vector type.	2013-04-15 14:41:34 +02:00
juliusv	62f33f1fc2	Merge pull request #138 from prometheus/julius-fix-aliasing Correct delta()/rate() intervals and temporal aliasing.	2013-04-15 05:38:48 -07:00
Matt T. Proud	167504efd6	Merge pull request #142 from prometheus/julius-lowercase-by Allow lower-case BY operator.	2013-04-15 05:13:35 -07:00
Julius Volz	d53b8cf956	Correct delta()/rate() intervals and temporal aliasing.	2013-04-15 12:30:46 +02:00
Julius Volz	000f6a2e23	Allow lower-case BY operator.	2013-04-15 11:56:23 +02:00
Julius Volz	a0d311c9e6	Constantize job name label.	2013-04-15 11:47:54 +02:00
Julius Volz	1bc83e1b65	Also allow lower-cased aggregation ops.	2013-04-11 18:25:22 +02:00
juliusv	f9c291120f	Merge pull request #123 from prometheus/julius-propagate-rule-errors Propagate more errors during rule evaluation.	2013-04-11 06:38:33 -07:00

102 Commits