node_exporter

Commit Graph

Author	SHA1	Message	Date
Paul Gier	988f049040	collector/hwmon_linux: handle temperature sensor file which doesn't have item suffix (#1123 ) In some cases the file might be called "temp" instead of the usual format "temp<index>_<item>" as described in the kernel docs: https://www.kernel.org/doc/Documentation/hwmon/sysfs-interface In this case, treat this as an _input file containing the current temperature reading. Fixes #1122 Signed-off-by: Paul Gier <pgier@redhat.com>	2018-10-30 18:49:22 +01:00
Paul Gier	38163f234f	collector/diskstats: don't fail if there are extra stats, just ignore… (#1125 ) * collector/diskstats: don't fail if there are extra stats, just ignore them Signed-off-by: Paul Gier <pgier@redhat.com>	2018-10-30 18:45:00 +01:00
Kazumasa Kohtaka	9bd4416822	Makefile: add target for checking Prometheus rules (#1126 ) Signed-off-by: Kazumasa Kohtaka <kkohtaka@gmail.com>	2018-10-30 18:44:17 +01:00
Matt Layher	778124a56c	collector: add bounds check and test for tcpstat collector (#1134 ) Signed-off-by: Matt Layher <mdlayher@gmail.com>	2018-10-27 09:21:36 +02:00
Matt Layher	3d798aa4a1	collector: fix golint problems in ZFS collector (#1132 ) Signed-off-by: Matt Layher <mdlayher@gmail.com>	2018-10-27 09:18:33 +02:00
Ben Kochie	7519967619	Fix promu config (#1119 ) Rename promu no-cgo config to default promu name to avoid crossbuild problems. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-20 08:21:51 +02:00
Ben Kochie	0da9d248e7	Update for 0.17.0-rc.0 release (#1118 ) * Update VERSION. * Update CHANGELOG. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-19 17:29:19 +02:00
Ben Kochie	a0a164defb	Update cpufreq metrics collector (#1117 ) * Update Linux cpufreq collector to use new procfs library functions. * Split thermal throttle collection to a separate function. * Add new required fixtures and repack ttar file. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-18 17:28:19 +02:00
Ben Kochie	ef7a02dfa8	Update vendor github.com/prometheus/client_golang/...@v0.9.0 (#1111 ) * Update vendor github.com/prometheus/client_golang/...@v0.9.0 * Update vendor github.com/prometheus/common/... Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-15 20:40:34 +02:00
Paul Gier	7057c64f45	fix a few minor golint warnings (#1110 ) Signed-off-by: Paul Gier <pgier@redhat.com>	2018-10-15 18:44:06 +02:00
Paul Gier	e8d8199072	Update diskstats for linux kernel 4.19 (#1109 ) The format of /proc/diskstats is changing in linux-4.19 to include some additional fields. See: https://www.kernel.org/doc/Documentation/iostats.txt * collector/diskstats: use constants for some hard coded strings * collector/diskstats: update diskstats for linux-4.19 * collector/diskstats: remove kernel doc url from individual metrics Signed-off-by: Paul Gier <pgier@redhat.com>	2018-10-15 17:24:28 +02:00
Ben Kochie	0880d460d7	Ignore additional virtual filesystems (#1104 ) Add more virtual filesystems to the default ignore list * bpf * cgroup2 * selinuxfs * squashfs Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-12 11:24:32 +02:00
Ben Kochie	9cf508e673	Update vendoring (#1105 ) * Update vendor github.com/sirupsen/logrus@v1.1.1 * Update vendor github.com/coreos/go-systemd/dbus@v17 * Update vendor github.com/golang/protobuf/proto@v1.2.0 * Update vendor github.com/konsorten/go-windows-terminal-sequences@v1.0.1 * Update vendor github.com/mdlayher/... * Update vendor github.com/prometheus/procfs/... * Update vendor golang.org/x/... Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-11 18:41:41 +02:00
Bryan Boreham	f0d2a06b11	Update readme (#1107 ) * State that wifi collector is disabled by default Signed-off-by: Bryan Boreham <bjboreham@gmail.com> * Add the 'processes' collector to the Readme Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2018-10-11 18:27:41 +02:00
dbalakirev	5273b00df9	launchctl example based on LaunchDaemons (#1102 ) LaunchDaemons are the correct way to create services that are restart proof. There is now only a single destination place mentioned in the readme for the plist file. Signed-off-by: Dávid Balakirev <dave00ster@gmail.com>	2018-10-10 12:44:05 +02:00
Björn Rabenstein	bddf41d327	Update prometheus/client_golang vendoring (#1099 ) This is mostly required to fix a bug with histograms on 32bit platforms. (Which might or might not be used in node_exporter. Just in case...) Signed-off-by: beorn7 <beorn@soundcloud.com>	2018-10-05 16:05:02 +02:00
Dario Maiocchi	01ec8c5c5c	Remove continue with label (#1084 ) Instead of continue with label use helper function Signed-off-by: dmaiocchi <dmaiocchi@suse.com>	2018-10-05 13:20:30 +02:00
Ben Kochie	a1ce712e22	Cleanup unused /proc/mounts fixture. (#1097 ) * Cleanup unused /proc/mounts fixture. * Ignore Uint -> Unit in codespell. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-10-04 18:07:12 +02:00
Mario Trangoni	3659260b66	infiniband: Handle iWARP* RDMA modules N/A (#974 ) * infiniband: Add not connected i40iw0/ports/1 fixtures * infiniband: Handle issue when iWARP* RDMA modules are not available This is related to #966, and handle this error, Jun 07 13:33:24 hostname node_exporter[81888]: time="2018-06-07T13:33:24+02:00" level=error msg="ERROR: infiniband collector failed after 0.000929s: strconv.ParseUint: parsing \"N/A (no PMA)\": invalid syntax" source="collector.go:132" Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com>	2018-10-04 15:05:59 +02:00
Yecheng Fu	0f9842f20a	[continue 912] strip rootfs prefix for run in docker (#1058 ) * strip rootfs prefix for run in docker * Use `/` as default value of path.rootfs, and parse mounts from `/proc/1/mounts`. * No need to mount `/proc` and `/sys` because we share host's PID namespace, which allows processes within the container to see all of the processes on the system. Closes: #66 Signed-off-by: Ivan Mikheykin <ivan.mikheykin@flant.com> Signed-off-by: Yecheng Fu <cofyc.jackson@gmail.com>	2018-10-04 14:11:21 +02:00
gentlejo	2269df255c	Add node_exporter script for init.d (#1059 ) * Add node_exporter script for init.d Signed-off-by: gentlejo <josungil@gmail.com>	2018-10-04 13:57:49 +02:00
Andrew Banchich	5da107b02c	Add missing words and update markdown syntax (#1095 ) Signed-off-by: Andrew Banchich <andrewbanchich@gmail.com>	2018-10-03 09:03:25 +02:00
Ralf Horstmann	9f820bd3ee	Update cpu collector for OpenBSD 6.4 (#1094 ) Starting with (not yet released) OpenBSD 6.4, sysctl KERN_CPTIME2 will return ENODEV for offline CPUs. SMT siblings are reported as offline when hw.smt is disabled, which is the default since one of the later Spectre variants. So this might affect a few systems. For more details see: https://cvsweb.openbsd.org/src/sys/kern/kern_sysctl.c#rev1.348 Signed-off-by: Ralf Horstmann <ralf+github@ackstorm.de>	2018-10-02 10:21:30 +02:00
Ben Kochie	5a461d261c	Add linux/s390x build (#1092 ) Signed-off-by: Ben Kochie <superq@gmail.com>	2018-09-30 16:45:32 +02:00
Ben Kochie	526eac15c5	Add ppc64 build. (#1089 ) Add ppc64 build.	2018-09-30 13:45:47 +02:00
Fabian Heymann	2f381f0c44	Update dependency mattn/go-xmlrpc (#1091 ) Signed-off-by: Fabian Heymann <fabian.heymann@finanzcheck.de>	2018-09-30 09:27:14 +02:00
Daniele Sluijters	d999dacdc6	filesystem: Ignore netns/nsfs mounts (#1047 ) When starting Docker containers a whole bunch of netns (network namespace) mounts are created that the node exporter can't make any sense of (and can't read either). This ignores all nsfs filesystems. Fixes #875 Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>	2018-09-26 10:45:51 +02:00
Ben Kochie	c7dfb82dac	Update build (#1081 ) * Update build * Only use CGO when building non-Linux. * Update build to Go 1.11 * Use tab indenting consistently. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-09-25 16:02:42 +02:00
Ben Kochie	0fdc089187	Change systemd unit filtering (#1083 ) * Change systemd unit filtering Get all units from systemd and filter in Go. * Improves compatibility with older versions of systemd. * Improve debugging by printing when units pass the filter. * Remove extraneous newlines from log messages. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-09-24 15:04:55 +02:00
Luca Bruno	4672ea1671	collector/timex: remove cgo dependency (#1079 ) This removes the cgo import from timex collector, as it was only used to define two constants. Those are part of the Linux kernel<->userspace interface, thus there is no need to depend on libc to source them: https://github.com/torvalds/linux/blob/v4.18/include/uapi/linux/timex.h Signed-off-by: Luca Bruno <luca.bruno@coreos.com>	2018-09-20 11:51:34 +02:00
Christopher Blum	6aa5cfba6c	textfile example script rework (#1074 ) * textfile smartmon.sh Added functions to also parse megaraid disks. Added parsing to also detect the grown_defects counters. * textfile storcli.py Reworked the example file to export lots more information about megaraid attached controllers, VDs and PDs. Signed-off-by: Christopher Blum <christopher.blum@profitbricks.com>	2018-09-18 22:43:20 +02:00
Björn Rabenstein	1c9ea46cca	Update vendoring for client_golang and friends (#1076 ) Signed-off-by: beorn7 <beorn@soundcloud.com>	2018-09-17 17:09:52 +02:00
Mateusz Piotrowski	b46cd80200	Note how to get moreutils on FreeBSD (#1073 ) Signed-off-by: Mateusz Piotrowski <0mp@FreeBSD.org>	2018-09-14 14:14:45 +02:00
Ben Kochie	ebdd524123	Correctly cast Darwin memory info (#1060 ) * Correctly cast Darwin memory info * Cast stats to float64 before doing math on them to avoid integer wrapping. * Remove invalid `_total` suffix from gauge values. * Handle counters in `meminfo.go`. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-09-07 22:27:52 +02:00
Marco Tulio R Braga	05e55bddad	Fix typo on description of read_time_seconds_total (#1057 ) Fix typo on unit description of metric `*read_time_seconds_total` from milliseconds to seconds. Signed-off-by: Marco Tulio R Braga <marco.tulio@mtulio.eng.br>	2018-09-02 09:46:45 +02:00
Tariq Ibrahim	834e35112c	Using the recommended syntax for maintainer label (#1053 ) Signed-off-by: Tariq Ibrahim <tariq.ibrahim@microsoft.com>	2018-08-28 19:28:58 +02:00
Dan Fredell	c52e0d3353	Fix SmartOS build #1017 (#1018 ) Signed-off-by: Dan Fredell <Dan.Fredell@gmail.com>	2018-08-23 10:57:15 +00:00
Matt Bostock	9e0aee8ae7	Add metrics exposing extended md RAID info (#958 ) Add metrics that expose more information about MD RAID devices and disks: - the RAID level in use - the RAID set that a disk belongs to This allows for things like alert on unusually high I/O utilisation for a disk compared to other disks in the same RAID set, which usually means the disk is failing, and for comparing write/read latency across RAID sets. Output looks like: node_md_disk_info{disk_device="/dev/dm-0", md_device="md1", md_set="A"} 1 node_md_disk_info{disk_device="/dev/dm-3", md_device="md1", md_set="B"} 1 node_md_disk_info{disk_device="/dev/dm-2", md_device="md1", md_set="A"} 1 node_md_disk_info{disk_device="/dev/dm-1", md_device="md1", md_set="B"} 1 node_md_disk_info{disk_device="/dev/dm-4", md_device="md1", md_set="A"} 1 node_md_disk_info{disk_device="/dev/dm-5", md_device="md1", md_set="B"} 1 node_md_info{md_device="md1", md_name="foo", raid_level="10", md_metadata_version="1.2"} 1 The `node_md_info` metric, which gives additional information about the RAID array, is intentionally separate to avoid adding all of those labels to each disk. If you need to query using the labels contained in `node_md_info`, you can do that using PromQL: https://www.robustperception.io/how-to-have-labels-for-machine-roles/ I looked at adding the array UUID, but there's no sysfs entry for it and I'm not sure there's a strong use case for it. This patch to add a sysfs entry for the UUID was apparently not accepted: https://www.spinics.net/lists/raid/msg40667.html Add these metrics as a textfile script rather than adding them to the Go 'md' module as they're perhaps less commonly useful. If lots of people find them useful, we can later rewrite this in Go. Signed-off-by: Matt Bostock <mbostock@cloudflare.com>	2018-08-18 08:57:51 +00:00
Matt Layher	d84873727f	vendor: bump github.com/mdlayher/wifi and dependencies (#1045 ) Signed-off-by: Matt Layher <mdlayher@gmail.com>	2018-08-14 21:15:07 +02:00
James Hartig	60c827231a	NRestarts or NRefused aren't available on older systemd versions (#1039 ) * If NRestarts or NRefused are not available, don't ignore the unit itself * Don't report systemd metrics (NRestarts/NRefused) that are not available Signed-off-by: James Hartig <james@getadmiral.com>	2018-08-14 14:28:26 +02:00
Ben Kochie	fe5a117831	Handle vanishing PIDs (#1043 ) PIDs can vanish (exit) from /proc/ between gathering the list of PIDs and getting all of their stats. * Ignore file not found errors. * Explicitly count the PIDs we find. * Cleanup some error style issues. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-13 17:27:23 +02:00
Ben Kochie	099c1527f1	Update build (#1041 ) Update build * Update to Go 1.10. * Enable `ppc64le` build. * Enable MIPS builds. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-13 17:26:55 +02:00
Ben Kochie	0662673ad6	Disable wifi collector by default (#1037 ) * Disable wifi collector by default Disable the wifi collector by default due to suspected cashing issues and goroutine leaks. * https://github.com/prometheus/node_exporter/issues/870 * https://github.com/prometheus/node_exporter/issues/1008 Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-07 10:27:20 +02:00
Ben Kochie	5d23ad0ca7	Fix supervisord collector (#978 ) * Replace supervisord xmlrpc library * Use `github.com/mattn/go-xmlrpc` that doesn't leak goroutines. * Fix uptime metric * Use Prometheus best practices for uptime metric. * Use "start time" rather than "uptime". * Don't emit a start time if the process is down. * Add changelog entry. * Add example compatibility rules. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-06 16:54:46 +02:00
Julius Volz	2c52b8c761	systemd: Remove unneeded/unhandled error returns (#1035 ) Signed-off-by: Julius Volz <julius.volz@gmail.com>	2018-08-05 16:55:25 +02:00
Christian Hoffmann	6bdc5558ec	build: make staticcheck happy by using real regexp patterns #1025 (#1026 ) Signed-off-by: Christian Hoffmann <mail@hoffmann-christian.info>	2018-07-30 07:57:18 +02:00
Rene Treffer	80a5712b97	Fix sample rules for migration (#1022 ) - add conversion from _ms to _seconds on disk metrics - add missing node_textfile_mtime section - add groups: header to pass promtool check rules Signed-off-by: Rene Treffer <rene.treffer@soundcloud.com>	2018-07-27 14:27:44 +02:00
Hannes Körber	14a4f0028e	Enable nfs protocol (#998 ) * vendor: Update prometheus/procfs Signed-off-by: Hannes Körber <hannes.koerber@haktec.de> * mountstats: Use new NFS protocol field In https://github.com/prometheus/procfs/pull/100, the NFSTransportStats struct was expanded by a field called protocol that specifies the NFS protocol in use, either "tcp" or "udp". This commit adds the protocol as a label to all NFS metrics exported via the mountstats collector. Signed-off-by: Hannes Körber <hannes.koerber@haktec.de> * Update fixtures for UDP mount Signed-off-by: Hannes Körber <hannes.koerber@haktec.de>	2018-07-24 00:47:12 +02:00
Johannes Wienke	5c780d132c	Exclude only subdirectories of /var/lib/docker (#1003 ) It is quite common to put /var/lib/docker itself on a separate partition and that should be monitored as well. Signed-off-by: Johannes Wienke <languitar@semipol.de>	2018-07-23 15:43:42 +02:00
Ben Kochie	ca2fa4684b	Fix docker build (#1016 ) Fix override of make docker target to include new `DOCKER_REPO` variable pattern. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-07-23 10:56:20 +02:00

1 2 3 4 5 ...

1117 Commits All Branches Search

1117 Commits

All Branches