haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-16 16:34:42 +00:00

Author	SHA1	Message	Date
Willy Tarreau	a28df3e19a	MEDIUM: stats: report the last check and last agent's output on the CSV status Now that we can quote unsafe string, it becomes possible to dump the health check responses on the CSV page as well. The two new fields are "last_chk" and "last_agt".	2014-06-16 18:20:26 +02:00
Willy Tarreau	a3310dc66c	DOC: clarify the CSV format Indicate that the text cells in the CSV format may contain quotes to escape ambiguous texts. We don't have this case right now since we limit the output, but it may happen in the future.	2014-06-16 18:20:14 +02:00
Willy Tarreau	588297f2f9	MINOR: tools: add new functions to quote-encode strings qstr() and cstr() will be used to quote-encode strings. The first one does it unconditionally. The second one is aimed at CSV files where the quote-encoding is only needed when the field contains a quote or a comma.	2014-06-16 18:20:14 +02:00
Thierry FOURNIER	148f40866b	MINOR: regex: fix a little configuration memory leak. The function regfree free the memory allocated to the pattern buffer by the compiling process. It is not freeing the buffer itself.	2014-06-16 16:47:20 +02:00
Simon Horman	75ab8bdb83	MEDIUM: Add port_to_str helper This helper is similar to addr_to_str but tries to convert the port rather than the address of a struct sockaddr_storage. This is in preparation for supporting an external agent check. Signed-off-by: Simon Horman <horms@verge.net.au>	2014-06-16 10:10:33 +02:00
Willy Tarreau	7799267f43	MEDIUM: connection: add support for proxy protocol v2 in accept-proxy The "accept-proxy" statement of bind lines was still limited to version 1 of the protocol, while send-proxy-v2 is now available on the server lines. This patch adds support for parsing v2 of the protocol on incoming connections. The v2 header is automatically recognized so there is no need for a new option.	2014-06-14 11:46:03 +02:00
Willy Tarreau	8fccfa256e	CLEANUP: connection: merge proxy proto v2 header and address block This is in order to simplify the PPv2 header parsing code to look more like the one provided as an example in the spec. No code change was performed beyond just merging the proxy_addr union into the proxy_hdr_v2 struct.	2014-06-14 11:46:02 +02:00
Willy Tarreau	7a6f134121	DOC: minor updates to the proxy protocol doc Update the release data, revision history and the link to the Forwarded HTTP extension.	2014-06-14 11:46:02 +02:00
Willy Tarreau	01320c9a34	DOC: proxy protocol example parser was still wrong Now that version and cmd are in the same byte, it is not possible anymore to compare the version as a 13th byte.	2014-06-14 11:46:02 +02:00
Willy Tarreau	4c20d29c29	BUG/MINOR: connection: make proxy protocol v1 support the UNKNOWN protocol If haproxy receives a connection over a unix socket and forwards it to another haproxy instance using proxy protocol v1, it sends an UNKNOWN protocol, which is rejected by the other side. Make the receiver accept the UNKNOWN protocol as per the spec, and only use the local connection's address for this.	2014-06-14 11:46:02 +02:00
Simon Horman	b00d17a034	MEDIUM: Break out check establishment into connect_chk() This is in preparation for adding a new type of check that uses a process rather than a socket. Signed-off-by: Simon Horman <horms@verge.net.au>	2014-06-13 18:31:11 +02:00
Willy Tarreau	215663dbf3	MINOR: config: warn when tcp-check rules are used without option tcp-check Since this case means that the rules will be ignored, better emit a warning.	2014-06-13 18:30:23 +02:00
Willy Tarreau	33a14e515b	MEDIUM: session: redispatch earlier when possible As discussed with Dmitry Sivachenko, is a server farm has more than one active server, uses a guaranteed non-determinist algorithm (round robin), and a connection was initiated from a non-persistent connection, there's no point insisting to reconnect to the same server after a connect failure, better redispatch upon the very first retry instead of insisting on the same server multiple times.	2014-06-13 17:53:55 +02:00
Willy Tarreau	db6d012270	MEDIUM: session: don't apply the retry delay when redispatching The retry delay is only useful when sticking to a same server. During a redispatch, it's useless and counter-productive if we're sure to switch to another server, which is almost guaranteed when there's more than one server and the balancing algorithm is round robin, so better not pass via the turn-around state in this case. It could be done as well for leastconn, but there's a risk of always killing the delay after the recovery of a server in a farm where it's almost guaranteed to take most incoming traffic. So better only kill the delay when using round robin.	2014-06-13 17:48:45 +02:00
Willy Tarreau	b02906659b	MEDIUM: session: allow shorter retry delay if timeout connect is small As discussed with Dmitry Sivachenko, the default 1-second connect retry delay can be large for situations where the connect timeout is much smaller, because it means that an active connection reject will take more time to be retried than a silent drop, and that does not make sense. This patch changes this so that the retry delay is the minimum of 1 second and the connect timeout. That way people running with sub-second connect timeout will benefit from the shorter reconnect.	2014-06-13 17:04:44 +02:00
Willy Tarreau	18bf01e900	MEDIUM: tcp: add a new tcp-request capture directive This new directive captures the specified fetch expression, converts it to text and puts it into the next capture slot. The capture slots are shared with header captures so that it is possible to dump all captures at once or selectively in logs and header processing. The purpose is to permit logs to contain whatever payload is found in a request, for example bytes at a fixed location or the SNI of forwarded SSL traffic.	2014-06-13 16:45:53 +02:00
Willy Tarreau	3a4ac422ce	MINOR: tcp: prepare support for the "capture" action A few minor entries will be needed to capture sample fetches in requests or responses. This patch just prepares the code for this.	2014-06-13 16:32:48 +02:00
Willy Tarreau	54da8db40b	MINOR: capture: extend the captures to support non-header keys This patch adds support for captures with no header name. The purpose is to allow extra captures to be defined and logged along with the header captures.	2014-06-13 16:32:48 +02:00
Willy Tarreau	5b4bf70a95	MINOR: sample: improve sample_fetch_string() to report partial contents Currently, all callers to sample_fetch_string() call it with SMP_OPT_FINAL. Now we improve it to support the case where this option is not set, and to make it return the original sample as-is. The purpose is to let the caller check the SMP_F_MAY_CHANGE flag in the result and know that it should wait to get complete contents. Currently this has no effect on existing code.	2014-06-13 16:32:48 +02:00
Willy Tarreau	d9ed3d2848	MINOR: logs: don't limit HTTP header captures to HTTP frontends Similar to previous patches, HTTP header captures are performed when a TCP frontend switches to an HTTP backend, but are not possible to report. So let's relax the check to explicitly allow them to be present in TCP frontends.	2014-06-13 16:32:48 +02:00
Willy Tarreau	4bf9963a78	MINOR: log: allow the HTTP status code to be logged even in TCP frontends Log format is defined in the frontend, and some frontends may be chained to an HTTP backend. Sometimes it's very convenient to be able to log the HTTP status code of these HTTP backends. This status is definitely present in the internal structures, it's just that we used to limit it to be used in HTTP frontends. So let's simply relax the check to allow it to be used in TCP frontends as well.	2014-06-13 16:32:48 +02:00
Willy Tarreau	be722a2d64	DOC: fix remaining occurrences of "pattern extraction"	2014-06-13 16:32:48 +02:00
Remi Gacogne	c1eab8c96f	MEDIUM: ssl: fix detection of ephemeral diffie-hellman key exchange by using the cipher description. In OpenSSL, the name of a cipher using ephemeral diffie-hellman for key exchange can start with EDH, but also DHE, EXP-EDH or EXP1024-DHE. We work around this issue by using the cipher's description instead of the cipher's name. Hopefully the description is less likely to change in the future.	2014-06-12 20:52:41 +02:00
Remi Gacogne	f46cd6e4ec	MEDIUM: ssl: Add the option to use standardized DH parameters >= 1024 bits When no static DH parameters are specified, this patch makes haproxy use standardized (rfc 2409 / rfc 3526) DH parameters with prime lenghts of 1024, 2048, 4096 or 8192 bits for DHE key exchange. The size of the temporary/ephemeral DH key is computed as the minimum of the RSA/DSA server key size and the value of a new option named tune.ssl.default-dh-param.	2014-06-12 16:12:23 +02:00
Simone Gotti	b7f1cfc846	BUG/MEDIUM: Fix unhandled connections problem with systemd daemon mode and SO_REUSEPORT. Using the systemd daemon mode the parent doesn't exits but waits for his childs without closing its listening sockets. As linux 3.9 introduced a SO_REUSEPORT option (always enabled in haproxy if available) this will give unhandled connections problems after an haproxy reload with open connections. The problem is that when on reload a new parent is started (-Ds $oldchildspids), in haproxy.c main there's a call to start_proxies that, without SO_REUSEPORT, should fail (as the old processes are already listening) and so a SIGTOU is sent to old processes. On this signal the old childs will call (in pause_listener) a shutdown() on the listening fd. From my tests (if I understand it correctly) this affects the in kernel file (so the listen is really disabled for all the processes, also the parent). Instead, with SO_REUSEPORT, the call to start_proxies doesn't fail and so SIGTOU is never sent. Only SIGUSR1 is sent and the listen isn't disabled for the parent but only the childs will stop listening (with a call to close()) So, with SO_REUSEPORT, the old childs will close their listening sockets but will wait for the current connections to finish or timeout, and, as their parent has its listening socket open, the kernel will schedule some connections on it. These connections will never be accepted by the parent as it's in the waitpid loop. This fix will close all the listeners on the parent before entering the waitpid loop. Signed-off-by: Simone Gotti <simone.gotti@gmail.com>	2014-06-11 21:27:34 +02:00
Willy Tarreau	0f6093a9b2	DOC: fix proxy protocol v2 decoder example Richard Russo reported that the example code in the PP spec is wrong now that we slightly changed the format to merge <ver> and <cmd>. Also rename the field <ver_cmd> to avoid any ambiguity on the usage.	2014-06-11 21:21:26 +02:00
Simone Gotti	1b48cc9c6f	BUG/MEDIUM: fix ignored values for half-closed timeouts (client-fin and server-fin) in defaults section. Signed-off-by: Simone Gotti <simone.gotti@gmail.com> WT: bug introduced with the new feature in 1.5-dev25, no backport is needed.	2014-06-11 21:07:16 +02:00
Nenad Merdanovic	6639a7cf0d	MINOR: checks: mysql-check: Add support for v4.1+ authentication MySQL will in stop supporting pre-4.1 authentication packets in the future and is already giving us a hard time regarding non-silencable warnings which are logged on each health check. Warnings look like the following: "[Warning] Client failed to provide its character set. 'latin1' will be used as client character set." This patch adds basic support for post-4.1 authentication by sending the proper authentication packet with the character set, along with the QUIT command.	2014-06-11 18:13:46 +02:00
Willy Tarreau	1592d1e72a	CLEANUP: http: don't clear CF_READ_NOEXP twice Last patch cleared the flag twice in the response, which is useless. Thanks Lukas for spotting it :-)	2014-06-11 16:49:14 +02:00
Willy Tarreau	77d29029af	BUG/MEDIUM: http: clear CF_READ_NOEXP when preparing a new transaction Commit `b1982e2` ("BUG/MEDIUM: http/session: disable client-side expiration only after body") was tricky and caused an issue which was fixed by commit `0943757` ("BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called"). But that's not enough, another issue was introduced and further emphasized by last fix. The issue is that the CF_READ_NOEXP flag needs to be cleared when waiting for a new request over that connection, otherwise we cannot expire anymore an idle connection waiting for a new request. This explains the neverending keepalives reported by at least 3 different persons since dev24. No backport is needed.	2014-06-11 14:11:44 +02:00
Todd Lyons	d1dcea064c	DOC: Add Exim as Proxy Protocol implementer.	2014-06-03 22:36:46 +02:00
Willy Tarreau	ac49707158	BUILD: stats: workaround stupid and bogus -Werror=format-security behaviour As reported by Vincent Bernat and Ryan O'Hara, building haproxy with the option above causes this : src/dumpstats.c: In function 'stats_dump_sv_stats': src/dumpstats.c:3059:4: error: format not a string literal and no format arguments [-Werror=format-security] cc1: some warnings being treated as errors make: *** [src/dumpstats.o] Error 1 With that option, gcc wants an argument after a string format even when that string format is a const but not a litteral. It can be anything invalid, for example an integer when a string is expected, it just wants something. So feed it with something :-(	2014-05-29 01:07:31 +02:00
Willy Tarreau	c874653bb4	BUILD: don't use type "uint" which is not portable Dmitry Sivachenko reported that "uint" doesn't build on FreeBSD 10. On Linux it's defined in sys/types.h and indicated as "old". Just get rid of the very few occurrences.	2014-05-28 23:05:07 +02:00
Willy Tarreau	2e85840266	[RELEASE] Released version 1.5-dev26 Released version 1.5-dev26 with the following main changes : - BUG/MEDIUM: polling: fix possible CPU hogging of worker processes after receiving SIGUSR1. - BUG/MINOR: stats: fix a typo on a closing tag for a server tracking another one - OPTIM: stats: avoid the calculation of a useless link on tracking servers in maintenance - MINOR: fix a few memory usage errors - CONTRIB: halog: Filter input lines by date and time through timestamp - MINOR: ssl: SSL_CTX_set_options() and SSL_CTX_set_mode() take a long, not an int - BUG/MEDIUM: regex: fix risk of buffer overrun in exp_replace() - MINOR: acl: set "str" as default match for strings - DOC: Add some precisions about acl default matching method - MEDIUM: acl: strenghten the option parser to report invalid options - BUG/MEDIUM: config: a stats-less config crashes in 1.5-dev25 - BUG/MINOR: checks: tcp-check must not stop on '\0' for binary checks - MINOR: stats: improve alignment of color codes to save one line of header - MINOR: checks: simplify and improve reporting of state changes when using log-health-checks - MINOR: server: remove the SRV_DRAIN flag which can always be deduced - MINOR: server: use functions to detect state changes and to update them - MINOR: server: create srv_was_usable() from srv_is_usable() and use a pointer - BUG/MINOR: stats: do not report "100%" in the thottle column when server is draining - BUG/MAJOR: config: don't free valid regex memory - BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called - BUG/MINOR: stats: tracking servers may incorrectly report an inherited DRAIN status - MEDIUM: proxy: make timeout parser a bit stricter - REORG/MEDIUM: server: split server state and flags in two different variables - REORG/MEDIUM: server: move the maintenance bits out of the server state - MAJOR: server: use states instead of flags to store the server state - REORG: checks: put the functions in the appropriate files ! - MEDIUM: server: properly support and propagate the maintenance status - MEDIUM: server: allow multi-level server tracking - CLEANUP: checks: rename the server_status_printf function - MEDIUM: checks: simplify server up/down/nolb transitions - MAJOR: checks: move health checks changes to set_server_check_status() - MINOR: server: make the status reporting function support a reason - MINOR: checks: simplify health check reporting functions - MINOR: server: implement srv_set_stopped() - MINOR: server: implement srv_set_running() - MINOR: server: implement srv_set_stopping() - MEDIUM: checks: simplify failure notification using srv_set_stopped() - MEDIUM: checks: simplify success notification using srv_set_running() - MEDIUM: checks: simplify stopping mode notification using srv_set_stopping() - MEDIUM: stats: report a server's own state instead of the tracked one's - MINOR: server: make use of srv_is_usable() instead of checking eweight - MAJOR: checks: add support for a new "drain" administrative mode - MINOR: stats: use the admin flags for soft enable/disable/stop/start on the web page - MEDIUM: stats: introduce new actions to simplify admin status management - MINOR: cli: introduce a new "set server" command - MINOR: stats: report a distinct output for DOWN caused by agent - MINOR: checks: support specific check reporting for the agent - MINOR: checks: support a neutral check result - BUG/MINOR: cli: "agent" was missing from the "enable"/"disable" help message - MEDIUM: cli: add support for enabling/disabling health checks. - MEDIUM: stats: report down caused by agent prior to reporting up - MAJOR: agent: rework the response processing and support additional actions - MINOR: stats: improve the stats web page to support more actions - CONTRIB: halog: avoid calling time/localtime/mktime for each line - DOC: document the workarouds for Google Chrome's bogus pre-connect - MINOR: stats: report SSL key computations per second - MINOR: stats: add counters for SSL cache lookups and misses	2014-05-28 17:50:53 +02:00
Willy Tarreau	ce3f913e48	MINOR: stats: add counters for SSL cache lookups and misses One important aspect of SSL performance tuning is the cache size, but there's no metric to know whether it's large enough or not. This commit introduces two counters, one for the cache lookups and another one for cache misses. These counters are reported on "show info" on the stats socket. This way, it suffices to see the cache misses counter constantly grow to know that a larger cache could possibly help.	2014-05-28 16:53:04 +02:00
Willy Tarreau	0c9c2720dc	MINOR: stats: report SSL key computations per second It's commonly needed to know how many SSL asymmetric keys are computed per second on either side (frontend or backend), and to know the SSL session reuse ratio. Now we compute these values and report them in "show info".	2014-05-28 12:28:58 +02:00
Sasha Pachev	c600204ddf	BUG/MEDIUM: regex: fix risk of buffer overrun in exp_replace() Currently exp_replace() (which is used in reqrep/reqirep) is vulnerable to a buffer overrun. I have been able to reproduce it using the attached configuration file and issuing the following command: wget -O - -S -q http://localhost:8000/`perl -e 'print "a"x4000'`/cookie.php Str was being checked only in in while (str) and it was possible to read past that when more than one character was being accessed in the loop. WT: Note that this bug is only marked MEDIUM because configurations capable of triggering this bug are very unlikely to exist at all due to the fact that most rewrites consist in static string additions that largely fit into the reserved area (8kB by default). This fix should also be backported to 1.4 and possibly even 1.3 since it seems to have been present since 1.1 or so. Config: ------- global maxconn 500 stats socket /tmp/haproxy.sock mode 600 defaults timeout client 1000 timeout connect 5000 timeout server 5000 retries 1 option redispatch listen stats bind :8080 mode http stats enable stats uri /stats stats show-legends listen tcp_1 bind :8000 mode http maxconn 400 balance roundrobin reqrep ^([^\ :])\ /(.)/(.)\.php(.) \1\ /\3.php?arg=\2\2\2\2\2\2\2\2\2\2\2\2\2\4 server srv1 127.0.0.1:9000 check port 9000 inter 1000 fall 1 server srv2 127.0.0.1:9001 check port 9001 inter 1000 fall 1	2014-05-27 14:36:06 +02:00
Willy Tarreau	2705a61d8c	DOC: document the workarouds for Google Chrome's bogus pre-connect More and more people are complaining about the bugs experienced by Chrome users due to the pre-connect feature and the fact that Chrome does not monitor its connections and happily displays the error page instead of re-opening a new connection. Since we can work around this bug, let's document how to do it.	2014-05-23 17:38:34 +02:00
Willy Tarreau	9f66aa9cc4	CONTRIB: halog: avoid calling time/localtime/mktime for each line The last commit provides time-based filtering. Unfortunately, it wastes 90% of the time calling the expensive time()/localtime()/mktime() functions. This patch does 3 things : - call time()/localtime() only once to initialize the correct struct timeinfo ; - call mktime() only when the time has changed regardless of the current second. - manually add the current second to the cached result. Doing just this is enough to multiply the parsing speed by 8.	2014-05-23 16:40:25 +02:00
Olivier Burgard	e97b904801	CONTRIB: halog: Filter input lines by date and time through timestamp I wanted to make a graph with average answer time in nagios that takes only the last 5 mn of the log. Filtering the log before using halog was too slow, so I added that filter to halog. The patch attached to this mail is a proposal to add a new option : -time [min][:max] The values are min timestamp and/or max timestamp of the lines to be used for stats. The date and time of the log lines between '[' and ']' are converted to timestamp and compared to these values. Here is an exemple of usage : cat /var/log/haproxy.log \| ./halog -srv -H -q -time $(date --date '-5 min' +%s)	2014-05-23 16:18:48 +02:00
Willy Tarreau	248a60e9bf	MINOR: stats: improve the stats web page to support more actions It is now possible to enable/disable agent and health checks, as well as to force their status.	2014-05-23 15:42:49 +02:00
Willy Tarreau	81f5d94a0b	MAJOR: agent: rework the response processing and support additional actions We now retrieve a lot of information from a single line of response, which can be made up of various words delimited by spaces/tabs/commas. We try to arrange all this and report whatever unusual we detect. The agent now supports : - "up", "down", "stopped", "fail" for the operational states - "ready", "drain", "maint" for the administrative states - any "%" number for the weight - an optional reason after a "#" that can be reported on the stats page The line parser and processor should move to its own function so that we can reuse the exact same one for http-based agent checks later.	2014-05-23 15:42:49 +02:00
Willy Tarreau	cf2924bc25	MEDIUM: stats: report down caused by agent prior to reporting up When an agent is enabled and forces a down state, it's important to have this exact information and to report the agent's status, so let's check the agent before checking the health check.	2014-05-23 15:42:49 +02:00
Willy Tarreau	9b5aecd5be	MEDIUM: cli: add support for enabling/disabling health checks. "enable health" and "disable health" are introduced to manipulate the health check subsystem.	2014-05-23 15:42:49 +02:00
Willy Tarreau	29e50f7507	BUG/MINOR: cli: "agent" was missing from the "enable"/"disable" help message Commit `671b6f0` ("MEDIUM: Add enable and disable agent unix socket commands") forgot to update the relevant help messages. This was done in 1.5-dev20, no backport is needed.	2014-05-23 15:42:49 +02:00
Willy Tarreau	23964187ae	MINOR: checks: support a neutral check result Agent will have the ability to return a weight without indicating an up/down status. Currently this is not possible, so let's add a 5th result CHK_RES_NEUTRAL for this purpose. It has been mapped to the unused HCHK_STATUS_CHECKED which already serves as a neutral delimitor between initiated checks and those returning a result.	2014-05-23 15:42:49 +02:00
Willy Tarreau	12634e1428	MINOR: checks: support specific check reporting for the agent Indicate "Agent" instead of "Health" in health check reports sent when "option log-health-checks" is set. Also, ensure that any agent check status change is correctly reported. Till now we used not to emit logs when the agent could not be reached.	2014-05-23 15:42:49 +02:00
Willy Tarreau	9638efa2a0	MINOR: stats: report a distinct output for DOWN caused by agent Till now we only had "DOWN" on the stats page, whether it's the agent or regular checks which caused this status. Let's differentiate the two with "DOWN (agent)" so that admins know that the agent is causing this status.	2014-05-23 15:42:49 +02:00
Willy Tarreau	2a4b70fffd	MINOR: cli: introduce a new "set server" command This command supports "agent", "health", "state" and "weight" to adjust various server attributes as well as changing server health check statuses on the fly or setting the drain mode.	2014-05-23 15:42:42 +02:00
Willy Tarreau	ed7df90068	MEDIUM: stats: introduce new actions to simplify admin status management Instead of enabling/disabling maintenance mode and drain mode separately using 4 actions, we now offer 3 simplified actions : - set state to READY - set state to DRAIN - set state to MAINT They have the benefit of reporting the same state as displayed on the page, and of doing the double-switch atomically eg when switching from drain to maint. Note that the old actions are still supported for users running scripts.	2014-05-23 14:29:11 +02:00

1 2 3 4 5 ...

4109 Commits