node_exporter

Commit Graph

Author	SHA1	Message	Date
Paul Gier	cb9e23c536	Systemd refactor (#1254 ) This reduces the system metric collection time by using a wait group and go routines to allow the systemd metric calls happen concurrently. Also, makes the start time, restarts, tasks_max, and tasks_current metrics disabled by default because these can be time consuming to gather. Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-11 23:27:21 +01:00
mpursley	1ba436e194	add md_info_detail.sh (#1204 ) Signed-off-by: Matt Pursley <mpursley@gmail.com>	2019-02-10 15:20:42 +01:00
Sachi King	18fc512fc4	Bond: Monitor bond mii_status not link operstate (#1124 ) With a bond interface the state of the slave interface from the bond's point of view is reflected in `mii_status` and is independent of the link's `operstate`. When a bond is monitored with `miimon`, `mii_status` will reflect the state of the physical link as configured via the operator. When a bond is monitored via `arp_interval` the `mii_status` will reflect the results of the bond ARP checking. This means the link can be down from the bond's point of view, but up from a physical connection point of view. If a bond is not monitored via miimon or arp, the `mii_status` should likely be always `up`, however I have observed a case where this is not true and the `operstate` is `up` while `mii_status` is `down`. Kernel bond documentation stresses that a bond should not be configured without one of `mii_mon` or `arp_interval` configured however. This change results in the metric 'node_bonding_active' matching the up/down state of the bond's point of view rather than operstate. Signed-off-by: Sachi King <nakato@nakato.io>	2019-02-10 11:00:04 +01:00
Paul Gier	e0d6d11859	netclass_linux: remove varying labels from the 'up' metric (#1243 ) * netclass_linux: remove varying labels from the 'up' metric This moves the variable label values such as 'operstate' out of the 'network_up' metric and into a separate metric called '_info'. This allows the 'up' metric to remain continous over state changes. Fixes #1236 Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-07 15:59:32 +01:00
Johannes 'fish' Ziemke	6ea0aa73e4	Rename interface to device in netclass collector (#1224 ) * Rename interface to device in netclass collector This makes it consistent with other networking metrics like node_network_receive_bytes_total This closes #1223 Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2019-02-06 20:02:48 +01:00
Ralf Horstmann	3867ad5ab0	Add diskstats collector for OpenBSD (#1250 ) * Add diskstats collector for OpenBSD Tested on i386 and amd64, OpenBSD 6.4 and -current. * Refactor diskstats collectors This moves common descriptors from Linux, Darwin, OpenBSD diskstats collectors into diskstats_common.go Signed-off-by: Ralf Horstmann <ralf+github@ackstorm.de>	2019-02-06 11:36:22 +01:00
David O'Rourke	d442108d7a	collector: Implement uname collector for FreeBSD (#1239 ) * collector: Implement uname collector for FreeBSD Signed-off-by: David O'Rourke <david.orourke@gmail.com>	2019-02-05 17:39:24 +01:00
Paul Gier	2b81bff518	collector: use path/filepath for handling file paths (#1245 ) Similar to #1228. Update the remaining collectors to use 'path/filepath' intead of 'path' for manipulating file paths. Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-05 16:37:27 +01:00
Ralf Horstmann	dda51ad06a	Fix staticcheck ST1003 warnings (#1249 ) This fixes a few staticcheck ST1003 warnings in OpenBSD CPU collector. No functional change. Signed-off-by: Ralf Horstmann <ralf+github@ackstorm.de>	2019-02-05 07:46:50 +01:00
James Hartig	62e87ca00c	Fixed capitalization of linux in Makefile (#1252 ) Signed-off-by: James Hartig <james@getadmiral.com>	2019-02-04 20:10:26 +01:00
mknapphrt	7fbdd0ae93	Update procfs vendor (#1248 ) Signed-off-by: Mark Knapp <mknapp@hudson-trading.com>	2019-02-04 16:54:41 +01:00
mpursley	7d150d5782	add physical disk "state" to megaraid_pd_info metric (#1226 ) Signed-off-by: Matt Pursley <mpursley@gmail.com>	2019-01-31 12:40:37 +01:00
Paul Gier	40dce45d8d	collector/systemd: add new label "type" for systemd_unit_state (#1229 ) Adds a new label called "type" systemd_unit_state which contains the Type field from the unit file. This applies only to the .service and .mount unit types. The other unit types do not include the optional type field. Fixes #1210 Signed-off-by: Paul Gier <pgier@redhat.com>	2019-01-29 23:54:47 +01:00
Paul Gier	6a3b92ce57	cleanup makefile (#1232 ) The recent updates to Makefile.common make some of the stuff in Makefile unnecessary. Signed-off-by: Paul Gier <pgier@redhat.com>	2019-01-23 21:44:12 +01:00
Matt Layher	3b5c2f6463	collector: use path/filepath for handling file paths (#1228 ) Signed-off-by: Matt Layher <mdlayher@gmail.com>	2019-01-21 17:44:55 +01:00
Jon Davies	e766485286	Add kstat-based Solaris metrics (#1197 ) * collector/loadavg_solaris.go: Use libkstat to gather load averages. * go.mod: Added go-kstat. * boot_time_solaris.go: Added. * cpu_solaris.go: Added. * README.md: Updated entries for Solaris. * collector/zfs_solaris.go: Added. * CHANGELOG.md: Added note about kstat-based Solaris metrics. Signed-off-by: Jonathan Davies <jpds@protonmail.com>	2019-01-12 13:33:56 +01:00
Mateusz Piotrowski	a616953b9a	Do not use .PHONY for $(PROMTOOL) (#1216 ) Adding $(PROMTOOL) to .PHONY makes it impossible to provide an alternative path to promtool. Signed-off-by: Mateusz Piotrowski <0mp@FreeBSD.org>	2019-01-10 17:44:10 +01:00
Johannes 'fish' Ziemke	8a6a464b7e	Add staticcheck.conf to enable ST1003 (#1214 ) > ST1003 – Poorly chosen identifier (non-default) > Identifiers, such as variable and package names, follow certain rules. Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2019-01-04 16:36:49 +00:00
Ben Kochie	070e4b2e17	Update Makefile.common (#1220 ) * Update Makefile.common Update to new staticcheck method[0]. [0]: https://github.com/prometheus/prometheus/pull/5057 Signed-off-by: Ben Kochie <superq@gmail.com> * Fix staticcheck errors. Signed-off-by: Ben Kochie <superq@gmail.com>	2019-01-04 15:58:53 +00:00
Dai Dang Van	085d872aaf	Add S.M.A.R.T metrics (#1209 ) Update metrics following SMART attributes in [1][2] - Seek_Error_Rate - ID: 7 - Reallocated_Event_Count - ID: 196 [1] https://en.wikipedia.org/wiki/S.M.A.R.T.#Known_ATA_S.M.A.R.T._attributes [2] https://en.wikibooks.org/wiki/Minimizing_Hard_Disk_Drive_Failure_and_Data_Loss/Self-Monitoring,_Analysis,_and_Reporting_Technology Signed-off-by: Dai, Dang Van <daikk115@gmail.com>	2019-01-03 18:12:28 +01:00
Anton Tolchanov	cf8b29d1fb	Add a sample btrfs stats collector script (#1200 ) Signed-off-by: Anton Tolchanov <commits@knyar.net>	2018-12-21 14:10:03 +01:00
Simon Pasquier	97dab59e18	Fix go.sum after Go1.11.4 bump (#1202 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-12-19 11:41:27 +00:00
dhewg	7c960fd683	smartmon.sh: add metric for active/low-power mode (#1192 ) Add this new metric (where sda is active and sdb is in standby mode): smartmon_device_active{disk="/dev/sda",type="sat"} 1 smartmon_device_active{disk="/dev/sdb",type="sat"} 0 Also skip further metrics if the drive is in a low-power mode. This prevents spinning up disks just to get the metrics (which matches e.g. debian's default behavior for smartd). Signed-off-by: Andre Heider <a.heider@gmail.com>	2018-12-13 16:11:23 +01:00
Paul Gier	03bb276deb	Makefile.common: fix promu download path for arm32 (#1196 ) Signed-off-by: Paul Gier <pgier@redhat.com>	2018-12-13 16:07:22 +01:00
Paul Gier	614b815e00	Makefile.common: fix format rule (#1195 ) Signed-off-by: Paul Gier <pgier@redhat.com>	2018-12-11 17:47:09 +01:00
Ben Kochie	73ddf5f1f7	netstat: Add TCP In/Out Segs (#1185 ) * netstat: Add TCP In/Out Segs In order to get a better idea of TCP packet loss, we need to know how many `node_netstat_Tcp_OutSegs` there are so we can compare this to `node_netstat_Tcp_RetransSegs`. Signed-off-by: Ben Kochie <superq@gmail.com> * Update fixtures Signed-off-by: Ben Kochie <superq@gmail.com>	2018-12-08 12:16:02 +01:00
Tariq Ibrahim	6bd51269b7	update to host_statistics64 for Darwin meminfo (#1183 ) Signed-off-by: tariqibrahim <tariq181290@gmail.com>	2018-12-06 16:47:20 +01:00
Ben Kochie	f9dd8e9b8c	Release v0.17.0 (#1168 ) * Update CHANGELOG * Update VERSION Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-30 15:18:48 +01:00
Ben Kochie	4abc6fba7d	Add fallback for missing /proc/1/mounts (#1172 ) * Add fallback for missing /proc/1/mounts On some systems, `/proc/1/mounts` is hidden from non-root users due to the `hidepid` procfs feature. Attempt to fallback to `/proc/mounts` if `/proc/1/mounts` is not found. Signed-off-by: Ben Kochie <superq@gmail.com> * Add tests. Signed-off-by: Ben Kochie <superq@gmail.com> * Add CHANGELOG entry. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-30 14:01:55 +01:00
Jerome Froelich	0cb0c4d911	Remove unused variable readOnly from filesystem_linux.go. (#1173 ) The pull request #1002 changed the logic used on Linux servers to determine if a filesystem is read-only. As a result of this change, the variable `readOnly` is now unused and can be removed. Signed-off-by: Jerome Froelich <jeromefroelich@hotmail.com>	2018-11-30 14:01:39 +01:00
Ben Kochie	becca1275c	Convert to Go modules (#1178 ) * Convert to Go modules * Update promu config. * Convert to Go modules. * Update vendoring. * Update Makefile.common. * Update circleci config. * Use Prometheus release tar for promtool. * Fixup unpack * Use temp dir for unpacking tools. * Use BSD compatible tar command. * OpenBSD mkdir doesn't support `-v`. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-30 14:01:20 +01:00
Ben Kochie	1732478361	circleci: switch to 2.1 config Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-29 12:06:34 +01:00
Andreas Wirooks	9c9e17aba7	Handle 'Unknown' as measurement value. (#1113 ) We use the output-compatible perccli and storcli.py does not handle 'Unknown' as a result: ``` sg="Error parsing \"/var/lib/node_exporter/perccli.prom\": text format parsing error in line 222: expected float as value, got \"Unknown\"" source="textfile.go:212" ``` I know, the perccli should not return 'Unknown' but this error breaks all other useful measurements because the prom file is not parsable. My if condition fixes this. Signed-off-by: Andreas Wirooks <andreas.wirooks@1und1.de>	2018-11-23 16:29:56 +01:00
ioriveur	ea8e1373f7	Change Dfly's CPU counting frequency (#1140 ) * Change Dfly's CPU counting frequency, see: https://github.com/prometheus/node_exporter/issues/1129 * Convert Dfly's CPU unit into second Signed-off-by: iori-yja <fivo.11235813@gmail.com>	2018-11-21 13:45:22 +01:00
Ben Kochie	ffefc8e74d	Add a limit to the number of in-flight requests (#1166 ) In order to avoid stuck collectors using up all system resources, add a limit to the number of parallel in-flight scrape requests. This will return a 503 error. Default to 40 requests, this seems like a reasonable number based on: * Two Prometheus servers scraping every 15 seconds. * Failing scrapes after 5 minutes of stuckness. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-20 18:11:40 +01:00
Johannes 'fish' Ziemke	bcec99e0aa	Add link to prometheus-dcgm (#1164 ) Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2018-11-19 19:35:01 +01:00
Nemikolh	62f99f95f0	Add receive/transmit bytes total metric (wifi collector). (#1150 ) Signed-off-by: Nemikolh <Nemikolh@users.noreply.github.com>	2018-11-19 19:15:54 +01:00
ioriveur	17fee8081f	Check BSD's mib which accounts for swap size (#1149 ) * Change Dfly's CPU counting frequency, see: https://github.com/prometheus/node_exporter/issues/1129 Signed-off-by: iori-yja <fivio.11235813@gmail.com> * Convert Dfly's CPU unit into second Signed-off-by: iori-yja <fivio.11235813@gmail.com> * Check BSD's mib which accounts for swap size; see #1127 Signed-off-by: iori-yja <fivo.11235813@gmail.com> * fix swap check code Signed-off-by: iori-yja <fivo.11235813@gmail.com>	2018-11-17 11:02:54 +01:00
Paul Gier	3cf5b006fb	examples/init.d: fix web.listen-address flag (#1157 ) CLI flags use two dashes instead of one since v0.15.0 Also, use default port number Fixes #1156 Signed-off-by: Paul Gier <pgier@redhat.com>	2018-11-16 00:50:09 +01:00
Ben Kochie	ab19e0c831	Add changelog entry for #1148 (#1154 ) Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-15 04:22:02 +01:00
Arno Uhlig	6edd9d217e	[systemd] collect taskCurrent, tasksMax per systemd unit (#1098 ) * [systemd] collect taskCurrent, tasksMax per systemd unit Signed-off-by: Arno Uhlig <arno.uhlig@sap.com>	2018-11-14 10:50:39 +01:00
Björn Rabenstein	174b854080	Merge pull request #1148 from prometheus/beorn7/metrics Add --web.disable-exporter-metrics flag	2018-11-13 15:24:38 +01:00
beorn7	cd2331a185	Add --web.disable-exporter-metrics flag If this flag is set, the metrics about the exporter itself (go_, process_, promhttp_*) will be excluded from /metrics. The Kingpin way of handling boolean flags makes the negative flag wording (_dis_able) the most reasonably one. This also refactors the flow in node_exporter.go quite a bit to avoid mixing up the global and a local registry and to avoid re-creating a registry even if no filtering is requested. Signed-off-by: beorn7 <beorn@soundcloud.com>	2018-11-13 14:22:25 +01:00
Ben Kochie	b1eec66640	Add TCPSynRetrans to netstat default filter (#1143 ) Tcp SYN packet retransmits are a very useful signal as they affect network performance disproportionately to regular TCP retransmits. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-11-07 17:21:18 +01:00
Christopher Blum	1b98db9fa7	textfile example storcli enhancements (#1145 ) * storcli.py: Remove IntEnum This removes an external dependency. Moved VD state to VD info labels * storcli.py: Fix BBU health detection BBU Status is 0 for a healthy cache vault and 32 for a healthy BBU. * storcli.py: Strip all strings from PD Strip all strings that we get from PDs. They often contain whitespaces.... * storcli.py: Add formatting options Add help text explaining how this documented was formatted * storcli.py: Add DG to pd_info label Add disk group to pd_info. That way we can relate to PDs in the same DG. For example to check if all disks in one RAID use the same interface... * storcli.py: Fix promtool issues Fix linting issues reported by promtool check-metrics * storcli.py: Exit if storcli reports issues storcli reports if the command was a success. We should not continue if there are issues. * storcli.py: Try to parse metrics to float This will sanitize the values we hand over to node_exporter - eliminating any unforeseen values we read out... * storcli.py: Refactor code to implement handle_sas_controller() Move code into methods so that we can now also support HBA queries. * storcli.py: Sort inputs "...like a good python developer" - Daniel Swarbrick * storcli.py: Replace external dateutil library with internal datetime Removes external dependency... * storcli.py: Also collect temperature on megaraid cards We have already collected them on mpt3sas cards... * storcli.py: Clean up old code Removed dead code that is not used any more. * storcli.py: strip() all information for labels They often contain whitespaces... * storcli.py: Try to catch KeyErrors generally If some key we expect is not there, we will want to still print whatever we have collected so far... * storcli.py: Increment version number We have made some changes here and there. The general look of the data has not been changed. * storcli.py: Fix CodeSpell issue Split string to avoid issues with Codespell due to Celcius in JSON Key Signed-off-by: Christopher Blum <zeichenanonym@web.de>	2018-11-07 17:12:23 +01:00
Sven Haardiek	29d4629f55	Introduce example to get pending updates from pacman (#1114 ) * Introduce example to get pending updates from pacman Signed-off-by: Sven Haardiek <sven@haardiek.de>	2018-11-05 22:27:57 +01:00
Cougar	764da30556	Add compat rules for node_time, node_memory_ShmemHugePages and node_memory_ShmemPmdMapped (#1138 ) Signed-off-by: Cougar <cougar@random.ee>	2018-11-05 16:40:19 +01:00
Benjamin Drung	2d5fcdeef4	Add mellanox_hca_temp text collector example (#1128 ) * deleted_libraries: Upgrade to Python 3 Python 2.7 will not be maintained past 2020. Therefore upgrade text_collector_examples/deleted_libraries.py to Python 3. * Add mellanox_hca_temp text collector example mellanox_hca_temp is a script that reads Mellanox HCA temperature using the Mellanox mget_temp_ext tool. Signed-off-by: Benjamin Drung <benjamin.drung@cloud.ionos.com>	2018-11-01 12:23:06 +01:00
Matt Layher	073e056121	Merge pull request #1131 from prometheus/mdl-collector-export collector: export NodeCollector for documentation purposes	2018-10-31 12:38:48 -04:00
Matt Layher	c0a55e3f80	collector: add bounds check and test for filesystem collector (#1133 ) Signed-off-by: Matt Layher <mdlayher@gmail.com>	2018-10-30 22:12:42 +01:00

1 2 3 4 5 ...

1169 Commits All Branches Search

1169 Commits

All Branches