haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-05-06 01:37:59 +00:00

Author	SHA1	Message	Date
Remi Gacogne	8de5415b85	BUG/MEDIUM: ssl: Fix a memory leak in DHE key exchange OpenSSL does not free the DH * value returned by the callback specified with SSL_CTX_set_tmp_dh_callback(), leading to a memory leak for SSL/TLS connections using Diffie Hellman Ephemeral key exchange. This patch fixes the leak by allocating the DH * structs holding the DH parameters once, at configuration time. Note: this fix must be backported to 1.5.	2014-07-15 16:07:05 +02:00
Willy Tarreau	bb2e669f9e	BUG/MAJOR: http: correctly rewind the request body after start of forwarding Daniel Dubovik reported an interesting bug showing that the request body processing was still not 100% fixed. If a POST request contained short enough data to be forwarded at once before trying to establish the connection to the server, we had no way to correctly rewind the body. The first visible case is that balancing on a header does not always work on such POST requests since the header cannot be found. But there are even nastier implications which are that http-send-name-header would apply to the wrong location and possibly even affect part of the request's body due to an incorrect rewinding. There are two options to fix the problem : - first one is to force the HTTP_MSG_F_WAIT_CONN flag on all hash-based balancing algorithms and http-send-name-header, but there's always a risk that any new algorithm forgets to set it ; - the second option is to account for the amount of skipped data before the connection establishes so that we always know the position of the request's body relative to the buffer's origin. The second option is much more reliable and fits very well in the spirit of the past changes to fix forwarding. Indeed, at the moment we have msg->sov which points to the start of the body before headers are forwarded and which equals zero afterwards (so it still points to the start of the body before forwarding data). A minor change consists in always making it point to the start of the body even after data have been forwarded. It means that it can get a negative value (so we need to change its type to signed).. In order to avoid wrapping, we only do this as long as the other side of the buffer is not connected yet. Doing this definitely fixes the issues above for the requests. Since the response cannot be rewound we don't need to perform any change there. This bug was introduced/remained unfixed in 1.5-dev23 so the fix must be backported to 1.5.	2014-07-10 19:29:45 +02:00
Willy Tarreau	0dbfdbaef1	MINOR: samples: add two converters for the date format This patch adds two converters : ltime(<format>[,<offset>]) utime(<format>[,<offset>]) Both use strftime() to emit the output string from an input date. ltime() provides local time, while utime() provides the UTC time.	2014-07-10 16:43:44 +02:00
Willy Tarreau	d9f316ab83	MEDIUM: stick-table: add new converters to fetch table data These new converters make it possible to look up any sample expression in a table, and check whether an equivalent key exists or not, and if it exists, to retrieve the associated data (eg: gpc0, request rate, etc...). Till now it was only possible using tracking, but sometimes tracking is not suited to only retrieving such counters, either because it's done too early or because too many items need to be checked without necessarily being tracked. These converters all take a string on input, and then convert it again to the table's type. This means that if an input sample is of type IPv4 and the table is of type IP, it will first be converted to a string, then back to an IP address. This is a limitation of the current design which does not allow converters to declare that "any" type is supported on input. Since strings are the only types which can be cast to any other one, this method always works. The following converters were added : in_table, table_bytes_in_rate, table_bytes_out_rate, table_conn_cnt, table_conn_cur, table_conn_rate, table_gpc0, table_gpc0_rate, table_http_err_cnt, table_http_err_rate, table_http_req_cnt, table_http_req_rate, table_kbytes_in, table_kbytes_out, table_server_id, table_sess_cnt, table_sess_rate, table_trackers.	2014-07-10 16:43:44 +02:00
Willy Tarreau	8fed9037cd	MEDIUM: stick-table: implement lookup from a sample fetch Currently we have stktable_fetch_key() which fetches a sample according to an expression and returns a stick table key, but we also need a function which does only the second half of it from a known sample. So let's cut the function in two and introduce smp_to_stkey() to perform this lookup. The first function was adapted to make use of it in order to avoid code duplication.	2014-07-10 16:43:44 +02:00
Dan Dubovik	bd57a9f977	BUG/MEDIUM: backend: Update hash to use unsigned int throughout When we were generating a hash, it was done using an unsigned long. When the hash was used to select a backend, it was sent as an unsigned int. This made it difficult to predict which backend would be selected. This patch updates get_hash, and the hash methods to use an unsigned int, to remain consistent throughout the codebase. This fix should be backported to 1.5 and probably in part to 1.4.	2014-07-08 22:00:21 +02:00
Willy Tarreau	fd0e008d9d	BUG/MEDIUM: unix: completely unbind abstract sockets during a pause() Abstract namespace sockets ignore the shutdown() call and do not make it possible to temporarily stop listening. The issue it causes is that during a soft reload, the new process cannot bind, complaining that the address is already in use. This change registers a new pause() function for unix sockets and completely unbinds the abstract ones since it's possible to rebind them later. It requires the two previous patches as well as preceeding fixes. This fix should be backported into 1.5 since the issue apperas there.	2014-07-08 01:13:35 +02:00
Willy Tarreau	1c4b814087	MEDIUM: listener: support rebinding during resume() When a listener resumes operations, supporting a full rebind makes it possible to perform a full stop as a pause(). This will be used for pausing abstract namespace unix sockets.	2014-07-08 01:13:35 +02:00
Willy Tarreau	092d865c53	MEDIUM: listener: implement a per-protocol pause() function In order to fix the abstact socket pause mechanism during soft restarts, we'll need to proceed differently depending on the socket protocol. The pause_listener() function already supports some protocol-specific handling for the TCP case. This commit makes this cleaner by adding a new ->pause() function to the protocol struct, which, if defined, may be used to pause a listener of a given protocol. For now, only TCP has been adapted, with the specific code moved from pause_listener() to tcp_pause_listener().	2014-07-08 01:13:34 +02:00
Willy Tarreau	3c5efa2b32	BUG/MEDIUM: unix: failed abstract socket binding is retryable Jan Seda noticed that abstract sockets are incompatible with soft reload, because the new process cannot bind and immediately fails. This patch marks the binding as retryable and not fatal so that the new process can try to bind again after sending a signal to the old process. Note that this fix is not enough to completely solve the problem, but it is necessary. This patch should be backported to 1.5.	2014-07-08 01:13:34 +02:00
Willy Tarreau	39447b6a57	BUG/MINOR: listener: set the listener's fd to -1 after deletion This is currently harmless, but when stopping a listener, its fd is closed but not set to -1, so it is not possible to re-open it again. Currently this has no impact but can have after the abstract sockets are modified to perform a complete close on soft-reload. The fix can be backported to 1.5 and may even apply to 1.4 (protocols.c).	2014-07-08 01:13:34 +02:00
Willy Tarreau	506c69a50e	BUILD: http: fix isdigit & isspace warnings on Solaris As usual, when touching any is* function, Solaris complains about the type of the element being checked. Better backport this to 1.5 since nobody knows what the emitted code looks like since macros are used instead of functions.	2014-07-08 01:13:34 +02:00
Willy Tarreau	dc3d190b2c	BUILD: checks: kill a minor warning on Solaris in external checks Gcc on Solaris complains that elem->pid is pid_t and that we display it as int. A simple cast fixes this. No backport needed, this is 1.6 only.	2014-07-08 01:13:33 +02:00
Willy Tarreau	9b39dc5e49	BUILD: checks: external checker needs signal.h check.c doesn't build on solaris since `98637e5` ("MEDIUM: Add external check") because sigset_t is unknown. Simply include signal.h to fix the issue. No need to backport, this is 1.6-only.	2014-07-08 01:13:33 +02:00
Jan Seda	7319b64fc4	BUG/MEDIUM: unix: do not unlink() abstract namespace sockets upon failure. When bind() fails (function uxst_bind_listener()), the fail path doesn't consider the abstract namespace and tries to unlink paths held in uninitiliazed memory (tempname and backname). See the strace excerpt; the strings still hold the path from test1. =============================================================================================== 23722 bind(5, {sa_family=AF_FILE, path=@"test2"}, 110) = -1 EADDRINUSE (Address already in use) 23722 unlink("/tmp/test1.sock.23722.tmp") = -1 ENOENT (No such file or directory) 23722 close(5) = 0 23722 unlink("/tmp/test1.sock.23722.bak") = -1 ENOENT (No such file or directory) =============================================================================================== This patch should be backported to 1.5.	2014-07-02 17:57:28 +02:00
Marco Corte	8c27bcaea0	MINOR: stats: fix minor typo in HTML page There is a very small typo in the statistics interface: a "set" in lowercase where allothers are uppercase "Set".	2014-07-02 17:49:34 +02:00
Willy Tarreau	18324f574f	MEDIUM: log: support a user-configurable max log line length With all the goodies supported by logformat, people find that the limit of 1024 chars for log lines is too short. Some servers do not support larger lines and can simply drop them, so changing the default value is not always the best choice. This patch takes a different approach. Log line length is specified per log server on the "log" line, with a value between 80 and 65535. That way it's possibly to satisfy all needs, even with some fat local servers and small remote ones.	2014-06-27 18:13:53 +02:00
Willy Tarreau	1b71eb581e	BUG/MEDIUM: counters: fix track-sc* to wait on unstable contents I've been facing multiple configurations which involved track-sc* rules in tcp-request content without the "if ..." to force it to wait for the contents, resulting in random behaviour with contents sometimes retrieved and sometimes not. Reading the doc doesn't make it clear either that the tracking will be performed only if data are already there and that waiting on an ACL is the only way to avoid this. Since this behaviour is not natural and we now have the ability to fix it, this patch ensures that if input data are still moving, instead of silently dropping them, we naturally wait for them to stabilize up to the inspect-delay. This way it's not needed anymore to implement an ACL-based condition to force to wait for data, eventhough the behaviour is not changed for when an ACL is present. The most obvious usage will be when track-sc is followed by any HTTP sample expression, there's no need anymore for adding "if HTTP". It's probably worth backporting this to 1.5 to avoid further configuration issues. Note that it requires previous patch.	2014-06-25 17:26:54 +02:00
Willy Tarreau	b5975defba	MINOR: stick-table: make stktable_fetch_key() indicate why it failed stktable_fetch_key() does not indicate whether it returns NULL because the input sample was not found or because it's unstable. It causes trouble with track-sc* rules. Just like with sample_fetch_string(), we want it to be able to give more information to the caller about what it found. Thus, now we use the pointer to a sample passed by the caller, and fill it with the information we have about the sample. That way, even if we return NULL, the caller has the ability to check whether a sample was found and if it is still changing or not.	2014-06-25 17:17:53 +02:00
Willy Tarreau	6c616e0b96	BUG/MAJOR: sample: correctly reinitialize sample fetch context before calling sample_process() We used to only clear flags when reusing the static sample before calling sample_process(), but that's not enough because there's a context in samples that can be used by some fetch functions such as auth, headers and cookies, and not reinitializing it risks that a pointer of a different type is used in the wrong context. An example configuration which triggers the case consists in mixing hdr() and http_auth_group() which both make use of contexts : http-request add-header foo2 %[hdr(host)],%[http_auth_group(foo)] The solution is simple, initialize all the sample and not just the flags. This fix must be backported into 1.5 since it was introduced in 1.5-dev19.	2014-06-25 17:12:08 +02:00
Willy Tarreau	d713bcc326	BUG/MINOR: counters: do not untrack counters before logging Baptiste Assmann reported a corner case in the releasing of stick-counters: we release content-aware counters before logging. In the past it was not a problem, but since now we can log them it, it prevents one from logging their value. Simply switching the log production and the release of the counter fixes the issue. This should be backported into 1.5.	2014-06-25 15:36:04 +02:00
Emeric Brun	0abf836ecb	BUG/MINOR: ssl: Fix external function in order not to return a pointer on an internal trash buffer. 'ssl_sock_get_common_name' applied to a connection was also renamed 'ssl_sock_get_remote_common_name'. Currently, this function is only used with protocol PROXYv2 to retrieve the client certificate's common name. A further usage could be to retrieve the server certificate's common name on an outgoing connection.	2014-06-24 22:39:16 +02:00
Willy Tarreau	3caf2afabe	BUG/MEDIUM: http: fetch "base" is not compatible with set-header The sample fetch function "base" makes use of the trash which is also used by set-header/add-header etc... everything which builds a formated line. So we end up with some junk in the header if base is in use. Let's fix this as all other fetches by using a trash chunk instead. This bug was reported by Baptiste Assmann, and also affects 1.5.	2014-06-24 17:27:02 +02:00
Baptiste Assmann	92df370621	BUG/MINOR: config: http-request replace-header arg typo http-request replace-header was introduced with a typo which prevents it to be conditionned by an ACL. This patch fixes this issue.	2014-06-24 11:13:33 +02:00
Willy Tarreau	c7c7be21bf	BUG/MINOR: logs: properly initialize and count log sockets Commit `81ae195` ("[MEDIUM] add support for logging via a UNIX socket") merged in 1.3.14 introduced a few minor issues with log sockets. All of them happen only when a failure is encountered when trying to set up the logging socket (eg: socket family is not available or is temporarily short in resources). The first socket which experiences an error causes the socket setup loop to abort, possibly preventing any log from being sent if it was the first logger. The second issue is that if this socket finally succeeds after a second attempt, errors are reported for the wrong logger (eg: logger #1 failed instead of #2). The last point is that we now have multiple loggers, and it's a waste of time to walk over their list for every log while they're almost always properly set up. So in order to fix all this, let's merge the two lists. If a logger experiences an error, it simply sends an alert and skips to the next one. That way they don't prevent messages from being sent and are all properly accounted for.	2014-06-23 18:15:12 +02:00
Willy Tarreau	6f0a7bac28	BUG/MAJOR: session: revert all the crappy client-side timeout changes This is the 3rd regression caused by the changes below. The latest to date was reported by Finn Arne Gangstad. If a server responds with no content-length and the client's FIN is never received, either we leak the client-side FD or we spin at 100% CPU if timeout client-fin is set. Enough is enough. The amount of tricks needed to cover these side-effects starts to look like used toilet paper stacked over a chocolate cake. I don't want to eat that cake anymore! All this to avoid reporting a server-side timeout when a client stops uploading data and haproxy expires faster than the server... A lot of "ifs" resulting in a technically valid log that doesn't always please users, and whose alternative causes that many issues for all others users. So let's revert this crap merged since 1.5-dev25 : Revert "CLEANUP: http: don't clear CF_READ_NOEXP twice" This reverts commit `1592d1e72a`. Revert "BUG/MEDIUM: http: clear CF_READ_NOEXP when preparing a new transaction" This reverts commit `77d29029af`. Revert "BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called" This reverts commit `0943757a21`. Revert "BUG/MEDIUM: http: disable server-side expiration until client has sent the body" This reverts commit `3bed5e9337`. Revert "BUG/MEDIUM: http: correctly report request body timeouts" This reverts commit `b9edf8fbec`. Revert "BUG/MEDIUM: http/session: disable client-side expiration only after body" This reverts commit `b1982e27aa`. If a cleaner AND SAFER way to do something equivalent in 1.6-dev, we might consider backporting it to 1.5, but given the vicious bugs that have surfaced since, I doubt it will happen any time soon. Fortunately, that crap never made it into 1.4 so no backport is needed.	2014-06-23 15:47:00 +02:00
Emeric Brun	1d3865b096	BUG/MINOR: ssl: Fix OCSP resp update fails with the same certificate configured twice.	2014-06-23 12:14:47 +02:00
Emeric Brun	4f3c87a5d9	BUG/MEDIUM: ssl: Fix to not serve expired OCSP responses. For some browsers (firefox), an expired OCSP Response causes unwanted behavior. Haproxy stops serving OCSP response if nextupdate date minus the supported time skew (#define OCSP_MAX_RESPONSE_TIME_SKEW) is in the past.	2014-06-23 12:14:47 +02:00
Emeric Brun	13a6b48e24	BUG/MINOR: ssl: rejects OCSP response without nextupdate. To cache an OCSP Response without expiration time is not safe.	2014-06-23 12:14:47 +02:00
Simon Horman	98637e5bff	MEDIUM: Add external check Add an external check which makes use of an external process to check the status of a server.	2014-06-20 07:10:07 +02:00
Simon Horman	ccaabcdfca	BUG/MEDIUM: Consistently use 'check' in process_chk I am not entirely sure that this is a bug, but it seems to me that it may cause a problem if there agent-check is configured and there is some kind of error making a connection for it. Signed-off-by: Simon Horman <horms@verge.net.au>	2014-06-20 07:04:03 +02:00
Emeric Brun	c8b27b6c68	MEDIUM: ssl: add 300s supported time skew on OCSP response update. OCSP_MAX_RESPONSE_TIME_SKEW can be set to a different value at compilation (default is 300 seconds).	2014-06-19 14:37:30 +02:00
Emeric Brun	af4ef741e9	MINOR: ssl/cli: Fix unapropriate comment in code on 'set ssl ocsp-response'	2014-06-19 14:37:19 +02:00
Emeric Brun	4147b2ef10	MEDIUM: ssl: basic OCSP stapling support. The support is all based on static responses. This doesn't add any request / response logic to HAProxy, but allows a way to update information through the socket interface. Currently certificates specified using "crt" or "crt-list" on "bind" lines are loaded as PEM files. For each PEM file, haproxy checks for the presence of file at the same path suffixed by ".ocsp". If such file is found, support for the TLS Certificate Status Request extension (also known as "OCSP stapling") is automatically enabled. The content of this file is optional. If not empty, it must contain a valid OCSP Response in DER format. In order to be valid an OCSP Response must comply with the following rules: it has to indicate a good status, it has to be a single response for the certificate of the PEM file, and it has to be valid at the moment of addition. If these rules are not respected the OCSP Response is ignored and a warning is emitted. In order to identify which certificate an OCSP Response applies to, the issuer's certificate is necessary. If the issuer's certificate is not found in the PEM file, it will be loaded from a file at the same path as the PEM file suffixed by ".issuer" if it exists otherwise it will fail with an error. It is possible to update an OCSP Response from the unix socket using: set ssl ocsp-response <response> This command is used to update an OCSP Response for a certificate (see "crt" on "bind" lines). Same controls are performed as during the initial loading of the response. The <response> must be passed as a base64 encoded string of the DER encoded response from the OCSP server. Example: openssl ocsp -issuer issuer.pem -cert server.pem \ -host ocsp.issuer.com:80 -respout resp.der echo "set ssl ocsp-response $(base64 -w 10000 resp.der)" \| \ socat stdio /var/run/haproxy.stat This feature is automatically enabled on openssl 0.9.8h and above. This work was performed jointly by Dirkjan Bussink of GitHub and Emeric Brun of HAProxy Technologies.	2014-06-18 18:28:56 +02:00
Emeric Brun	2aab722dc1	MEDIUM: ssl: ignored file names ending as '.issuer' or '.ocsp'. We don't want to load these files found in directories specified in "crt" or "crt-list". These suffixes are reserved for OCSP stapling.	2014-06-18 18:24:55 +02:00
Thierry FOURNIER	26202760a4	MINOR: regex: Use native PCRE API. The pcreposix layer (in the pcre projetc) execute strlen to find thlength of the string. When we are using the function "regex_exex*2", the length is used to add a final \0, when pcreposix is executed a strlen is executed to compute the length. If we are using a native PCRE api, the length is provided as an argument, and these operations disappear. This is useful because PCRE regex are more used than POSIC regex.	2014-06-18 15:14:00 +02:00
Thierry FOURNIER	c9c2daf283	MEDIUM: regex: Remove null terminated strings. The new regex function can use string and length. The HAproxy buffer are not null-terminated, and the use of the regex_exec* functions implies the add of this null character. This patch replace these function by the functions which takes a string and length as input. Just the file "proto_http.c" is change because this one is more executed than other. The file "checks.c" have a very low usage, and it is not interesting to change it. Furthermore, the buffer used by "checks.c" are null-terminated.	2014-06-18 15:12:51 +02:00
Thierry FOURNIER	09af0d6d43	MEDIUM: regex: replace all standard regex function by own functions This patch remove all references of standard regex in haproxy. The last remaining references are only in the regex.[ch] files. In the file src/checks.c, the original function uses a "pmatch" array. In fact this array is unused. This patch remove it.	2014-06-18 15:07:57 +02:00
Thierry FOURNIER	b8f980cc19	MINOR: regex: Create JIT compatible function that return match strings This patchs rename the "regex_exec" to "regex_exec2". It add a new "regex_exec", "regex_exec_match" and "regex_exec_match2" function. This function can match regex and return array containing matching parts. Otherwise, this function use the compiled method (JIT or PCRE or POSIX). JIT require a subject with length. PCREPOSIX and native POSIX regex require a null terminted subject. The regex_exec* function are splited in two version. The first version take a null terminated string, but it execute strlen() on the subject if it is compiled with JIT. The second version (terminated by "2") take the subject and the length. This version adds a null character in the subject if it is compiled with PCREPOSIX or native POSIX functions. The documentation of posix regex and pcreposix says that the function returns 0 if the string matche otherwise it returns REG_NOMATCH. The REG_NOMATCH macro take the value 1 with posix regex and the value 17 with the pcreposix. The documentaion of the native pcre API (used with JIT) returns a negative number if no match, otherwise, it returns 0 or a positive number. This patch fix also the return codes of the regex_exec* functions. Now, these function returns true if the string match, otherwise it returns false.	2014-06-18 15:07:50 +02:00
Willy Tarreau	b854392824	BUG/MINOR: http: fix typos in previous patch When I renamed the modify-header action to replace-value, one of them was mistakenly set to "replace-val" instead. Additionally, differentiation of the two actions must be done on args[0][8] and not *args[8]. Thanks Thierry for spotting...	2014-06-17 19:03:56 +02:00
Sasha Pachev	218f064f55	MEDIUM: http: add actions "replace-header" and "replace-values" in http-req/resp This patch adds two new actions to http-request and http-response rulesets : - replace-header : replace a whole header line, suited for headers which might contain commas - replace-value : replace a single header value, suited for headers defined as lists. The match consists in a regex, and the replacement string takes a log-format and supports back-references.	2014-06-17 18:34:32 +02:00
Willy Tarreau	f5b1cc38b8	MEDIUM: stats: report per-backend and per-server time stats in HTML and CSV outputs The time statistics computed by previous patches are now reported in the HTML stats in the tips related to the total sessions for backend and servers, and as separate columns for the CSV stats.	2014-06-17 17:15:56 +02:00
Willy Tarreau	4bfc580dd3	MEDIUM: session: maintain per-backend and per-server time statistics Using the last rate counters, we now compute the queue, connect, response and total times per server and per backend with a 95% accuracy over the last 1024 samples. The operation is cheap so we don't need to condition it.	2014-06-17 17:15:56 +02:00
Willy Tarreau	a28df3e19a	MEDIUM: stats: report the last check and last agent's output on the CSV status Now that we can quote unsafe string, it becomes possible to dump the health check responses on the CSV page as well. The two new fields are "last_chk" and "last_agt".	2014-06-16 18:20:26 +02:00
Willy Tarreau	588297f2f9	MINOR: tools: add new functions to quote-encode strings qstr() and cstr() will be used to quote-encode strings. The first one does it unconditionally. The second one is aimed at CSV files where the quote-encoding is only needed when the field contains a quote or a comma.	2014-06-16 18:20:14 +02:00
Thierry FOURNIER	148f40866b	MINOR: regex: fix a little configuration memory leak. The function regfree free the memory allocated to the pattern buffer by the compiling process. It is not freeing the buffer itself.	2014-06-16 16:47:20 +02:00
Simon Horman	75ab8bdb83	MEDIUM: Add port_to_str helper This helper is similar to addr_to_str but tries to convert the port rather than the address of a struct sockaddr_storage. This is in preparation for supporting an external agent check. Signed-off-by: Simon Horman <horms@verge.net.au>	2014-06-16 10:10:33 +02:00
Willy Tarreau	7799267f43	MEDIUM: connection: add support for proxy protocol v2 in accept-proxy The "accept-proxy" statement of bind lines was still limited to version 1 of the protocol, while send-proxy-v2 is now available on the server lines. This patch adds support for parsing v2 of the protocol on incoming connections. The v2 header is automatically recognized so there is no need for a new option.	2014-06-14 11:46:03 +02:00
Willy Tarreau	8fccfa256e	CLEANUP: connection: merge proxy proto v2 header and address block This is in order to simplify the PPv2 header parsing code to look more like the one provided as an example in the spec. No code change was performed beyond just merging the proxy_addr union into the proxy_hdr_v2 struct.	2014-06-14 11:46:02 +02:00
Willy Tarreau	4c20d29c29	BUG/MINOR: connection: make proxy protocol v1 support the UNKNOWN protocol If haproxy receives a connection over a unix socket and forwards it to another haproxy instance using proxy protocol v1, it sends an UNKNOWN protocol, which is rejected by the other side. Make the receiver accept the UNKNOWN protocol as per the spec, and only use the local connection's address for this.	2014-06-14 11:46:02 +02:00
Simon Horman	b00d17a034	MEDIUM: Break out check establishment into connect_chk() This is in preparation for adding a new type of check that uses a process rather than a socket. Signed-off-by: Simon Horman <horms@verge.net.au>	2014-06-13 18:31:11 +02:00
Willy Tarreau	215663dbf3	MINOR: config: warn when tcp-check rules are used without option tcp-check Since this case means that the rules will be ignored, better emit a warning.	2014-06-13 18:30:23 +02:00
Willy Tarreau	33a14e515b	MEDIUM: session: redispatch earlier when possible As discussed with Dmitry Sivachenko, is a server farm has more than one active server, uses a guaranteed non-determinist algorithm (round robin), and a connection was initiated from a non-persistent connection, there's no point insisting to reconnect to the same server after a connect failure, better redispatch upon the very first retry instead of insisting on the same server multiple times.	2014-06-13 17:53:55 +02:00
Willy Tarreau	db6d012270	MEDIUM: session: don't apply the retry delay when redispatching The retry delay is only useful when sticking to a same server. During a redispatch, it's useless and counter-productive if we're sure to switch to another server, which is almost guaranteed when there's more than one server and the balancing algorithm is round robin, so better not pass via the turn-around state in this case. It could be done as well for leastconn, but there's a risk of always killing the delay after the recovery of a server in a farm where it's almost guaranteed to take most incoming traffic. So better only kill the delay when using round robin.	2014-06-13 17:48:45 +02:00
Willy Tarreau	b02906659b	MEDIUM: session: allow shorter retry delay if timeout connect is small As discussed with Dmitry Sivachenko, the default 1-second connect retry delay can be large for situations where the connect timeout is much smaller, because it means that an active connection reject will take more time to be retried than a silent drop, and that does not make sense. This patch changes this so that the retry delay is the minimum of 1 second and the connect timeout. That way people running with sub-second connect timeout will benefit from the shorter reconnect.	2014-06-13 17:04:44 +02:00
Willy Tarreau	18bf01e900	MEDIUM: tcp: add a new tcp-request capture directive This new directive captures the specified fetch expression, converts it to text and puts it into the next capture slot. The capture slots are shared with header captures so that it is possible to dump all captures at once or selectively in logs and header processing. The purpose is to permit logs to contain whatever payload is found in a request, for example bytes at a fixed location or the SNI of forwarded SSL traffic.	2014-06-13 16:45:53 +02:00
Willy Tarreau	3a4ac422ce	MINOR: tcp: prepare support for the "capture" action A few minor entries will be needed to capture sample fetches in requests or responses. This patch just prepares the code for this.	2014-06-13 16:32:48 +02:00
Willy Tarreau	54da8db40b	MINOR: capture: extend the captures to support non-header keys This patch adds support for captures with no header name. The purpose is to allow extra captures to be defined and logged along with the header captures.	2014-06-13 16:32:48 +02:00
Willy Tarreau	5b4bf70a95	MINOR: sample: improve sample_fetch_string() to report partial contents Currently, all callers to sample_fetch_string() call it with SMP_OPT_FINAL. Now we improve it to support the case where this option is not set, and to make it return the original sample as-is. The purpose is to let the caller check the SMP_F_MAY_CHANGE flag in the result and know that it should wait to get complete contents. Currently this has no effect on existing code.	2014-06-13 16:32:48 +02:00
Willy Tarreau	d9ed3d2848	MINOR: logs: don't limit HTTP header captures to HTTP frontends Similar to previous patches, HTTP header captures are performed when a TCP frontend switches to an HTTP backend, but are not possible to report. So let's relax the check to explicitly allow them to be present in TCP frontends.	2014-06-13 16:32:48 +02:00
Willy Tarreau	4bf9963a78	MINOR: log: allow the HTTP status code to be logged even in TCP frontends Log format is defined in the frontend, and some frontends may be chained to an HTTP backend. Sometimes it's very convenient to be able to log the HTTP status code of these HTTP backends. This status is definitely present in the internal structures, it's just that we used to limit it to be used in HTTP frontends. So let's simply relax the check to allow it to be used in TCP frontends as well.	2014-06-13 16:32:48 +02:00
Remi Gacogne	c1eab8c96f	MEDIUM: ssl: fix detection of ephemeral diffie-hellman key exchange by using the cipher description. In OpenSSL, the name of a cipher using ephemeral diffie-hellman for key exchange can start with EDH, but also DHE, EXP-EDH or EXP1024-DHE. We work around this issue by using the cipher's description instead of the cipher's name. Hopefully the description is less likely to change in the future.	2014-06-12 20:52:41 +02:00
Remi Gacogne	f46cd6e4ec	MEDIUM: ssl: Add the option to use standardized DH parameters >= 1024 bits When no static DH parameters are specified, this patch makes haproxy use standardized (rfc 2409 / rfc 3526) DH parameters with prime lenghts of 1024, 2048, 4096 or 8192 bits for DHE key exchange. The size of the temporary/ephemeral DH key is computed as the minimum of the RSA/DSA server key size and the value of a new option named tune.ssl.default-dh-param.	2014-06-12 16:12:23 +02:00
Simone Gotti	b7f1cfc846	BUG/MEDIUM: Fix unhandled connections problem with systemd daemon mode and SO_REUSEPORT. Using the systemd daemon mode the parent doesn't exits but waits for his childs without closing its listening sockets. As linux 3.9 introduced a SO_REUSEPORT option (always enabled in haproxy if available) this will give unhandled connections problems after an haproxy reload with open connections. The problem is that when on reload a new parent is started (-Ds $oldchildspids), in haproxy.c main there's a call to start_proxies that, without SO_REUSEPORT, should fail (as the old processes are already listening) and so a SIGTOU is sent to old processes. On this signal the old childs will call (in pause_listener) a shutdown() on the listening fd. From my tests (if I understand it correctly) this affects the in kernel file (so the listen is really disabled for all the processes, also the parent). Instead, with SO_REUSEPORT, the call to start_proxies doesn't fail and so SIGTOU is never sent. Only SIGUSR1 is sent and the listen isn't disabled for the parent but only the childs will stop listening (with a call to close()) So, with SO_REUSEPORT, the old childs will close their listening sockets but will wait for the current connections to finish or timeout, and, as their parent has its listening socket open, the kernel will schedule some connections on it. These connections will never be accepted by the parent as it's in the waitpid loop. This fix will close all the listeners on the parent before entering the waitpid loop. Signed-off-by: Simone Gotti <simone.gotti@gmail.com>	2014-06-11 21:27:34 +02:00
Simone Gotti	1b48cc9c6f	BUG/MEDIUM: fix ignored values for half-closed timeouts (client-fin and server-fin) in defaults section. Signed-off-by: Simone Gotti <simone.gotti@gmail.com> WT: bug introduced with the new feature in 1.5-dev25, no backport is needed.	2014-06-11 21:07:16 +02:00
Nenad Merdanovic	6639a7cf0d	MINOR: checks: mysql-check: Add support for v4.1+ authentication MySQL will in stop supporting pre-4.1 authentication packets in the future and is already giving us a hard time regarding non-silencable warnings which are logged on each health check. Warnings look like the following: "[Warning] Client failed to provide its character set. 'latin1' will be used as client character set." This patch adds basic support for post-4.1 authentication by sending the proper authentication packet with the character set, along with the QUIT command.	2014-06-11 18:13:46 +02:00
Willy Tarreau	1592d1e72a	CLEANUP: http: don't clear CF_READ_NOEXP twice Last patch cleared the flag twice in the response, which is useless. Thanks Lukas for spotting it :-)	2014-06-11 16:49:14 +02:00
Willy Tarreau	77d29029af	BUG/MEDIUM: http: clear CF_READ_NOEXP when preparing a new transaction Commit `b1982e2` ("BUG/MEDIUM: http/session: disable client-side expiration only after body") was tricky and caused an issue which was fixed by commit `0943757` ("BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called"). But that's not enough, another issue was introduced and further emphasized by last fix. The issue is that the CF_READ_NOEXP flag needs to be cleared when waiting for a new request over that connection, otherwise we cannot expire anymore an idle connection waiting for a new request. This explains the neverending keepalives reported by at least 3 different persons since dev24. No backport is needed.	2014-06-11 14:11:44 +02:00
Willy Tarreau	ac49707158	BUILD: stats: workaround stupid and bogus -Werror=format-security behaviour As reported by Vincent Bernat and Ryan O'Hara, building haproxy with the option above causes this : src/dumpstats.c: In function 'stats_dump_sv_stats': src/dumpstats.c:3059:4: error: format not a string literal and no format arguments [-Werror=format-security] cc1: some warnings being treated as errors make: *** [src/dumpstats.o] Error 1 With that option, gcc wants an argument after a string format even when that string format is a const but not a litteral. It can be anything invalid, for example an integer when a string is expected, it just wants something. So feed it with something :-(	2014-05-29 01:07:31 +02:00
Willy Tarreau	c874653bb4	BUILD: don't use type "uint" which is not portable Dmitry Sivachenko reported that "uint" doesn't build on FreeBSD 10. On Linux it's defined in sys/types.h and indicated as "old". Just get rid of the very few occurrences.	2014-05-28 23:05:07 +02:00
Willy Tarreau	ce3f913e48	MINOR: stats: add counters for SSL cache lookups and misses One important aspect of SSL performance tuning is the cache size, but there's no metric to know whether it's large enough or not. This commit introduces two counters, one for the cache lookups and another one for cache misses. These counters are reported on "show info" on the stats socket. This way, it suffices to see the cache misses counter constantly grow to know that a larger cache could possibly help.	2014-05-28 16:53:04 +02:00
Willy Tarreau	0c9c2720dc	MINOR: stats: report SSL key computations per second It's commonly needed to know how many SSL asymmetric keys are computed per second on either side (frontend or backend), and to know the SSL session reuse ratio. Now we compute these values and report them in "show info".	2014-05-28 12:28:58 +02:00
Sasha Pachev	c600204ddf	BUG/MEDIUM: regex: fix risk of buffer overrun in exp_replace() Currently exp_replace() (which is used in reqrep/reqirep) is vulnerable to a buffer overrun. I have been able to reproduce it using the attached configuration file and issuing the following command: wget -O - -S -q http://localhost:8000/`perl -e 'print "a"x4000'`/cookie.php Str was being checked only in in while (str) and it was possible to read past that when more than one character was being accessed in the loop. WT: Note that this bug is only marked MEDIUM because configurations capable of triggering this bug are very unlikely to exist at all due to the fact that most rewrites consist in static string additions that largely fit into the reserved area (8kB by default). This fix should also be backported to 1.4 and possibly even 1.3 since it seems to have been present since 1.1 or so. Config: ------- global maxconn 500 stats socket /tmp/haproxy.sock mode 600 defaults timeout client 1000 timeout connect 5000 timeout server 5000 retries 1 option redispatch listen stats bind :8080 mode http stats enable stats uri /stats stats show-legends listen tcp_1 bind :8000 mode http maxconn 400 balance roundrobin reqrep ^([^\ :])\ /(.)/(.)\.php(.) \1\ /\3.php?arg=\2\2\2\2\2\2\2\2\2\2\2\2\2\4 server srv1 127.0.0.1:9000 check port 9000 inter 1000 fall 1 server srv2 127.0.0.1:9001 check port 9001 inter 1000 fall 1	2014-05-27 14:36:06 +02:00
Willy Tarreau	248a60e9bf	MINOR: stats: improve the stats web page to support more actions It is now possible to enable/disable agent and health checks, as well as to force their status.	2014-05-23 15:42:49 +02:00
Willy Tarreau	81f5d94a0b	MAJOR: agent: rework the response processing and support additional actions We now retrieve a lot of information from a single line of response, which can be made up of various words delimited by spaces/tabs/commas. We try to arrange all this and report whatever unusual we detect. The agent now supports : - "up", "down", "stopped", "fail" for the operational states - "ready", "drain", "maint" for the administrative states - any "%" number for the weight - an optional reason after a "#" that can be reported on the stats page The line parser and processor should move to its own function so that we can reuse the exact same one for http-based agent checks later.	2014-05-23 15:42:49 +02:00
Willy Tarreau	cf2924bc25	MEDIUM: stats: report down caused by agent prior to reporting up When an agent is enabled and forces a down state, it's important to have this exact information and to report the agent's status, so let's check the agent before checking the health check.	2014-05-23 15:42:49 +02:00
Willy Tarreau	9b5aecd5be	MEDIUM: cli: add support for enabling/disabling health checks. "enable health" and "disable health" are introduced to manipulate the health check subsystem.	2014-05-23 15:42:49 +02:00
Willy Tarreau	29e50f7507	BUG/MINOR: cli: "agent" was missing from the "enable"/"disable" help message Commit `671b6f0` ("MEDIUM: Add enable and disable agent unix socket commands") forgot to update the relevant help messages. This was done in 1.5-dev20, no backport is needed.	2014-05-23 15:42:49 +02:00
Willy Tarreau	23964187ae	MINOR: checks: support a neutral check result Agent will have the ability to return a weight without indicating an up/down status. Currently this is not possible, so let's add a 5th result CHK_RES_NEUTRAL for this purpose. It has been mapped to the unused HCHK_STATUS_CHECKED which already serves as a neutral delimitor between initiated checks and those returning a result.	2014-05-23 15:42:49 +02:00
Willy Tarreau	12634e1428	MINOR: checks: support specific check reporting for the agent Indicate "Agent" instead of "Health" in health check reports sent when "option log-health-checks" is set. Also, ensure that any agent check status change is correctly reported. Till now we used not to emit logs when the agent could not be reached.	2014-05-23 15:42:49 +02:00
Willy Tarreau	9638efa2a0	MINOR: stats: report a distinct output for DOWN caused by agent Till now we only had "DOWN" on the stats page, whether it's the agent or regular checks which caused this status. Let's differentiate the two with "DOWN (agent)" so that admins know that the agent is causing this status.	2014-05-23 15:42:49 +02:00
Willy Tarreau	2a4b70fffd	MINOR: cli: introduce a new "set server" command This command supports "agent", "health", "state" and "weight" to adjust various server attributes as well as changing server health check statuses on the fly or setting the drain mode.	2014-05-23 15:42:42 +02:00
Willy Tarreau	ed7df90068	MEDIUM: stats: introduce new actions to simplify admin status management Instead of enabling/disabling maintenance mode and drain mode separately using 4 actions, we now offer 3 simplified actions : - set state to READY - set state to DRAIN - set state to MAINT They have the benefit of reporting the same state as displayed on the page, and of doing the double-switch atomically eg when switching from drain to maint. Note that the old actions are still supported for users running scripts.	2014-05-23 14:29:11 +02:00
Willy Tarreau	fae3a7eacd	MINOR: stats: use the admin flags for soft enable/disable/stop/start on the web page Instead of changing the weight to zero or enforcing maintenance mode, we now make use of the new MAINT/DRAIN flags which are correctly propagated.	2014-05-23 14:29:11 +02:00
Willy Tarreau	bfc7b7acd8	MAJOR: checks: add support for a new "drain" administrative mode This patch adds support for a new "drain" mode. So now we have 3 admin modes for a server : - READY - DRAIN - MAINT The drain mode disables load balancing but leaves the server up. It can coexist with maint, except that maint has precedence. It is also inherited from tracked servers, so just like maint, it's represented with 2 bits. New functions were designed to set/clear each flag and to propagate the changes to tracking servers when relevant, and to log the changes. Existing functions srv_set_adm_maint() and srv_set_adm_ready() were replaced to make use of the new functions. Currently the drain mode is not yet used, however the whole logic was tested with all combinations of set/clear of both flags in various orders to catch all corner cases.	2014-05-23 14:29:11 +02:00
Willy Tarreau	9943d3117e	MINOR: server: make use of srv_is_usable() instead of checking eweight srv_is_usable() is broader than srv_is_usable() as it not only considers the weight but the server's state as well. Future changes will allow a server to be in drain mode with a non-zero weight, so we should migrate to use that function instead.	2014-05-23 14:29:11 +02:00
Willy Tarreau	f4e38b36b8	MEDIUM: stats: report a server's own state instead of the tracked one's Now that servers have their own states, let's report this one instead of following the tracked server chain and reporting the tracked server's. However the tracked server is still used to report x/y when a server is going up or down. When the agent reports a down state, this one is still enforced.	2014-05-23 14:29:11 +02:00
Willy Tarreau	db58b79ccd	MEDIUM: checks: simplify stopping mode notification using srv_set_stopping() Function check_set_server_drain() used to set a server into stopping state. Now it first checks if all configured checks are UP, and if the possibly tracked servers is not stopped, and only calls set_srv_stopping() after that. That also simplified the conditions to call the function, and its logic. The function was also renamed check_notify_stopping() to better report this change.	2014-05-23 14:29:11 +02:00
Willy Tarreau	3e04838b8a	MEDIUM: checks: simplify success notification using srv_set_running() Function check_set_server_up() used to set a server up. Now it first checks if all configured checks are UP, and if all tracked servers are UP, and only calls set_srv_running() after that. That also simplified the conditions to call the function, and its logic. The function was also renamed check_notify_success() to better report this change.	2014-05-23 14:29:11 +02:00
Willy Tarreau	4eec547f32	MEDIUM: checks: simplify failure notification using srv_set_stopped() Function check_set_server_down() used to set a server down. Now it first checks if the health check's result differs from the server's state, and only calls srv_set_stopped() if the check reports a failure while the server is not down. Thanks to this, the conditions that were present around its call could be removed. The function was also renamed check_notify_failure() to better report this change.	2014-05-23 14:29:11 +02:00
Willy Tarreau	8eb7784634	MINOR: server: implement srv_set_stopping() This function was taken from check_set_server_drain(). It does not consider health checks at all and only sets a server to stopping provided it's not in maintenance and is not currently stopped. The resulting state will be STOPPING. The state change is propagated to tracked servers. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	dbd5e78f5b	MINOR: server: implement srv_set_running() This function was taken from check_set_server_up(). It does not consider health checks at all and only sets a server up provided it's not in maintenance. The resulting state may be either RUNNING or STARTING depending on the presence of a slowstart or not. The state change is propagated to tracked servers. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	e7d1ef16bf	MINOR: server: implement srv_set_stopped() This function was extracted from check_set_server_down(). In only manipulates the server state and does not consider the health checks at all, nor does it modify their status. It takes a reason message to report in logs, however it passes NULL when recursing through the trackers chain. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	a150cf1a44	MINOR: checks: simplify health check reporting functions check_report_srv_status() was removed in favor of check_reason_string() combined with srv_report_status(). This way we have one function which is dedicated to check decoding, and another one dedicated to server status.	2014-05-23 14:29:11 +02:00
Willy Tarreau	bda92271e6	MINOR: server: make the status reporting function support a reason srv_adm_append_status() was renamed srv_append_status() since it's no more dedicated to maintenance mode. It now supports a reason which if not null is appended to the output string.	2014-05-23 14:29:11 +02:00
Willy Tarreau	7b1d47ce1b	MAJOR: checks: move health checks changes to set_server_check_status() We don't want to manipulate check's statuses anymore in functions which modify the server's state. So since any check is forced to call set_server_check_status() exactly once to report the result of the check, it's the best place to update the check's health.	2014-05-23 14:29:11 +02:00
Willy Tarreau	af54958d72	MEDIUM: checks: simplify server up/down/nolb transitions We don't have to handle the maintenance transition here anymore so we can simplify the functions and conditions. This also means that we don't need the disable/enable functions but only a function to switch to each new state. It's worth mentionning that at this stage there are still confusions between the server state and the checks states. For example, the health check's state is adjusted from tracked servers changing state, while it should not be.	2014-05-23 14:29:11 +02:00
Willy Tarreau	ddd329c059	CLEANUP: checks: rename the server_status_printf function This function is poorly named since it's now used exclusively with checks and cannot be moved to server.c. Call it check_report_srv_status() instead.	2014-05-23 14:29:11 +02:00
Willy Tarreau	3209123fe7	MEDIUM: server: allow multi-level server tracking Now that it is possible to know whether a server is in forced maintenance or inherits its maintenance status from another one, it is possible to allow server tracking at more than one level. We still provide a loop detection however. Note that for the stats it's a bit trickier since we have to report the check state which corresponds to the state of the server at the end of the chain.	2014-05-23 14:29:11 +02:00
Willy Tarreau	a0066ddbda	MEDIUM: server: properly support and propagate the maintenance status This change now involves a new flag SRV_ADMF_IMAINT to note that the maintenance status of a server is inherited from another server. Thus, we know at each server level in the chain if it's running, in forced maintenance or in a maintenance status because it tracks another server, or even in both states. Disabling a server propagates this flag down to other servers. Enabling a server flushes the flag down. A server becomes up again once both of its flags are cleared. Two new functions "srv_adm_set_maint()" and "srv_adm_set_ready()" are used to manipulate this maintenance status. They're used by the CLI and the stats page. Now the stats page always says "MAINT" instead of "MAINT(via)" and it's only the chk/down field which reports "via x/y" when the status is inherited from another server, but it doesn't say it when a server was forced into maintenance. The CSV output indicates "MAINT (via x/y)" instead of only "MAINT(via)". This is the most accurate representation. One important thing is that now entering/leaving maintenance for a tracking server correctly follows the state of the tracked server.	2014-05-22 11:27:00 +02:00
Willy Tarreau	4aac7db940	REORG: checks: put the functions in the appropriate files ! Checks.c has become a total mess. A number of proxy or server maintenance and queue management functions were put there probably because they were used there, but that makes the code untouchable. And that's without saying that their names does not always relate to what they really do! So let's do a first pass by moving these ones : - set_backend_down() => backend.c - redistribute_pending() => queue.c:pendconn_redistribute() - check_for_pending() => queue.c:pendconn_grab_from_px() - shutdown_sessions => server.c:srv_shutdown_sessions() - shutdown_backup_sessions => server.c:srv_shutdown_backup_sessions() All of them were moved at once.	2014-05-22 11:27:00 +02:00
Willy Tarreau	892337c8e1	MAJOR: server: use states instead of flags to store the server state Servers used to have 3 flags to store a state, now they have 4 states instead. This avoids lots of confusion for the 4 remaining undefined states. The encoding from the previous to the new states can be represented this way : SRV_STF_RUNNING \| SRV_STF_GOINGDOWN \| \| SRV_STF_WARMINGUP \| \| \| 0 x x SRV_ST_STOPPED 1 0 0 SRV_ST_RUNNING 1 0 1 SRV_ST_STARTING 1 1 x SRV_ST_STOPPING Note that the case where all bits were set used to exist and was randomly dealt with. For example, the task was not stopped, the throttle value was still updated and reported in the stats and in the http_server_state header. It was the same if the server was stopped by the agent or for maintenance. It's worth noting that the internal function names are still quite confusing.	2014-05-22 11:27:00 +02:00
Willy Tarreau	2012521d7b	REORG/MEDIUM: server: move the maintenance bits out of the server state Now we introduce srv->admin and srv->prev_admin which are bitfields containing one bit per source of administrative status (maintenance only for now). For the sake of backwards compatibility we implement a single source (ADMF_FMAINT) but the code already checks any source (ADMF_MAINT) where the STF_MAINTAIN bit was previously checked. This will later allow us to add ADMF_IMAINT for maintenance mode inherited from tracked servers. Along doing these changes, it appeared that some places will need to be revisited when implementing the inherited bit, this concerns all those modifying the ADMF_FMAINT bit (enable/disable actions on the CLI or stats page), and the checks to report "via" on the stats page. But currently the code is harmless.	2014-05-22 11:27:00 +02:00
Willy Tarreau	c93cd16b6c	REORG/MEDIUM: server: split server state and flags in two different variables Till now, the server's state and flags were all saved as a single bit field. It causes some difficulties because we'd like to have an enum for the state and separate flags. This commit starts by splitting them in two distinct fields. The first one is srv->state (with its counter-part srv->prev_state) which are now enums, but which still contain bits (SRV_STF_*). The flags now lie in their own field (srv->flags). The function srv_is_usable() was updated to use the enum as input, since it already used to deal only with the state. Note that currently, the maintenance mode is still in the state for simplicity, but it must move as well.	2014-05-22 11:27:00 +02:00
Willy Tarreau	fac5b5956b	MEDIUM: proxy: make timeout parser a bit stricter Twice in a week I found people were surprized by a "conditional timeout" not being respected, because they add "if <cond>" after a timeout, and since they don't see any error nor read the doc, the expect it to work. Let's make the timeout parser reject extra arguments to avoid these situations.	2014-05-22 08:26:41 +02:00
Willy Tarreau	efe282260e	BUG/MINOR: stats: tracking servers may incorrectly report an inherited DRAIN status The DRAIN status is not inherited between tracked servers, so the stats page must only use the reported server's status and not the tracked server's status, otherwise it misleadingly indicates a DRAIN state when a server tracks a draining server, while this is wrong.	2014-05-21 17:13:13 +02:00
Willy Tarreau	0943757a21	BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called As more or less suspected, commit `b1982e2` ("BUG/MEDIUM: http/session: disable client-side expiration only after body") was hazardous. It introduced a regression causing client side timeout to expire during connection retries if it's lower than the time needed to cover the amount of retries, so clients get a 408 when the connection to the server fails to establish fast enough. The reason is that the CF_READ_NOEXP flag is set after the MSG_DONE state is reached, which protects the timeout from being re-armed, then during the retries, process_session() clears the flag without calling the analyser (since there's no activity for it), so the timeouts are rearmed. Ideally, these one-shot flags should be per-analyser, and the analyser which sets them would be responsible for clearing them, or they would automatically be cleared when switching to another analyser. Unfortunately this is not really possible currently. What can be done however is to only clear them in the following situations : - we're going to call analysers - analysers have all been unsubscribed This method seems reliable enough and approaches the ideal case well enough. No backport is needed, this bug was introduced in 1.5-dev25.	2014-05-21 16:58:17 +02:00
Conrad Hoffmann	041751c13a	BUG/MEDIUM: polling: fix possible CPU hogging of worker processes after receiving SIGUSR1. When run in daemon mode (i.e. with at least one forked process) and using the epoll poller, sending USR1 (graceful shutdown) to the worker processes can cause some workers to start running at 100% CPU. Precondition is having an established HTTP keep-alive connection when the signal is received. The cloned (during fork) listening sockets do not get closed in the parent process, thus they do not get removed from the epoll set automatically (see man 7 epoll). This can lead to the process receiving epoll events that it doesn't feel responsible for, resulting in an endless loop around epoll_wait() delivering these events. The solution is to explicitly remove these file descriptors from the epoll set. To not degrade performance, care was taken to only do this when neccessary, i.e. when the file descriptor was cloned during fork. Signed-off-by: Conrad Hoffmann <conrad@soundcloud.com> [wt: a backport to 1.4 could be studied though chances to catch the bug are low]	2014-05-20 14:57:36 +02:00
Remi Gacogne	af5c3da89e	MINOR: ssl: SSL_CTX_set_options() and SSL_CTX_set_mode() take a long, not an int This is a minor fix, but the SSL_CTX_set_options() and SSL_CTX_set_mode() functions take a long, not an int parameter. As SSL_OP_ALL is now (since OpenSSL 1.0.0) defined as 0x80000BFFL, I think it is worth fixing.	2014-05-19 11:20:23 +02:00
Willy Tarreau	63af98d0dd	BUG/MAJOR: config: don't free valid regex memory Thomas Heil reported that previous commit `07fcaaa` ("MINOR: fix a few memory usage errors") make haproxy crash when req* rules are used. As diagnosed by Cyril Bont�, this commit introduced a regression which makes haproxy free the memory areas allocated for regex even when they're going to be used, resulting in the crashes. This patch does three things : - undo the free() on the valid path - add regfree() on the error path but only when regcomp() succeeds - rename err_code to ret_code to avoid confusing the valid return path with an error path.	2014-05-18 08:11:41 +02:00
Dirkjan Bussink	07fcaaa4cd	MINOR: fix a few memory usage errors These are either use after free errors or small leaks where memory is not free'd after some error state is detected.	2014-05-15 08:06:57 +02:00
Willy Tarreau	e21f84903e	BUG/MINOR: stats: do not report "100%" in the thottle column when server is draining A condition was missing and we used to have "throttle 100%" even when the server was draining connections, which is misleading but harmless.	2014-05-14 00:09:59 +02:00
Willy Tarreau	87eb1d6994	MINOR: server: create srv_was_usable() from srv_is_usable() and use a pointer We used to call srv_is_usable() with either the current state and weights or the previous ones. This causes trouble for future changes, so let's first split it in two variants : - srv_is_usable(srv) considers the current status - srv_was_usable(srv) considers the previous status	2014-05-13 22:34:55 +02:00
Willy Tarreau	c5150dafd8	MINOR: server: use functions to detect state changes and to update them Detecting that a server's status has changed is a bit messy, as well as it is to commit the status changes. We'll have to add new conditions soon and we'd better avoid to multiply the number of touched locations with the high risk of forgetting them. This commit introduces : - srv_lb_status_changed() to report if the status changed from the previously committed one ; - svr_lb_commit_status() to commit the current status The function is now used by all load-balancing algorithms.	2014-05-13 22:18:22 +02:00
Willy Tarreau	02615f9b16	MINOR: server: remove the SRV_DRAIN flag which can always be deduced This flag is only a copy of (srv->uweight == 0), so better get rid of it to reduce some of the confusion that remains in the code, and use a simple function to return this state based on this weight instead.	2014-05-13 22:18:13 +02:00
Willy Tarreau	bef1b32c4e	MINOR: checks: simplify and improve reporting of state changes when using log-health-checks Function set_server_check_status() is very weird. It is called at the end of a check to update the server's state before the new state is even calculated, and possibly to log status changes, only if the proxy has "option log-health-checks" set. In order to do so, it employs an exhaustive list of the combinations which can lead to a state change, while in practice almost all of them may simply be deduced from the change of check status. Better, some changes of check status are currently not detected while they can be very valuable (eg: changes between L4/L6/TOUT/HTTP 500 for example). The doc was updated to reflect this. Also, a minor change was made to consider s->uweight and not s->eweight as meaning "DRAIN" since eweight can be null without the DRAIN mode (eg: throttle, NOLB, ...).	2014-05-13 22:01:28 +02:00
Willy Tarreau	d03fdf41ec	MINOR: stats: improve alignment of color codes to save one line of header Having both "active or backup DOWN" and "not checked" on the left side of the color caption inflates the whole header block for no reason. Simply move them both on the same line and reduce the header height.	2014-05-13 22:01:21 +02:00
Willy Tarreau	ec6b012bf4	BUG/MINOR: checks: tcp-check must not stop on '\0' for binary checks Abuse of copy-paste has made "tcp-check expect binary" to consider a buffer starting with \0 as empty! Thanks to Lukas Benes for reporting this problem and confirming the fix. This is 1.5-only, no backport is needed.	2014-05-13 18:02:04 +02:00
Willy Tarreau	4e5ed29668	BUG/MEDIUM: config: a stats-less config crashes in 1.5-dev25 John-Paul Bader reported a stupid regression in 1.5-dev25, we forget to check that global.stats_fe is initialized before visiting its sockets, resulting in a crash. No backport is needed.	2014-05-13 13:53:27 +02:00
Thierry FOURNIER	9fefbd5926	MINOR: acl: set "str" as default match for strings It appears than many people considers that the default match for a fetch returning string is "exact match string" aka "str". This patch set this match as default for strings.	2014-05-12 15:19:15 +02:00
Cyril Bont�	c7fa7db7ce	OPTIM: stats: avoid the calculation of a useless link on tracking servers in maintenance Commit `f465994198` removed the "via" link when a tracking server is in maintenance, but still calculated an empty link that no one can use. We can safely remove it.	2014-05-12 01:02:46 +02:00
Cyril Bont�	d5de76a25b	BUG/MINOR: stats: fix a typo on a closing tag for a server tracking another one The "via" column includes a link to the tracked server but instead of closing the link with a </a> tag, a new tag is opened. This typo should also be backported to 1.4	2014-05-12 01:02:45 +02:00
Willy Tarreau	2039bba41b	MEDIUM: acl: strenghten the option parser to report invalid options Whatever ACL option beginning with a '-' is considered as a pattern if it does not match a known option. This is a big problem because typos are silently ignored, such as "-" or "-mi". Better clearly check the complete option name and report a parsing error if the option is unknown.	2014-05-11 09:43:46 +02:00
Willy Tarreau	05cdd9655d	MEDIUM: session: implement half-closed timeouts (client-fin and server-fin) Long-lived sessions are often subject to half-closed sessions resulting in a lot of sessions appearing in FIN_WAIT state in the system tables, and no way for haproxy to get rid of them. This typically happens because clients suddenly disconnect without sending any packet (eg: FIN or RST was lost in the path), and while the server detects this using an applicative heart beat, haproxy does not close the connection. This patch adds two new timeouts : "timeout client-fin" and "timeout server-fin". The former allows one to override the client-facing timeout when a FIN has been received or sent. The latter does the same for server-facing connections, which is less useful.	2014-05-10 15:14:05 +02:00
Willy Tarreau	7bb21532f4	MEDIUM: unix: avoid a double connect probe when no data are sent Plain "tcp" health checks sent to a unix socket cause two connect() calls to be made, one to connect, and a second one to verify that the connection properly established. But with unix sockets, we get immediate notification of success, so we can avoid this second attempt. However we need to ensure that we'll visit the connection handler even if there's no remaining handshake pending, so for this we claim we have some data to send in order to enable polling for writes if there's no more handshake.	2014-05-10 09:48:28 +02:00
Willy Tarreau	b1dd9bf308	MEDIUM: pattern: use ebtree's longest match to index/lookup string beginning Being able to map prefixes to values is already used for IPv4/IPv6 but was not yet used with strings. It can be very convenient to map directories to server farms but large lists may be slow. By using ebmb_insert_prefix() and ebmb_lookup_longest(), we can insert strings with their own length as a prefix, and lookup candidate strings and ensure that the longest matching one will be returned, which is the longest string matching the entry.	2014-05-10 08:53:48 +02:00
Willy Tarreau	ccfccefb80	MEDIUM: unix: implement support for Linux abstract namespace sockets These sockets are the same as Unix sockets except that there's no need for any filesystem access. The address may be whatever string both sides agree upon. This can be really convenient for inter-process communications as well as for chaining backends to frontends. These addresses are forced by prepending their address with "abns@" for "abstract namespace".	2014-05-10 01:53:58 +02:00
Willy Tarreau	5cf0b52d29	MEDIUM: checks: only complain about the missing port when the check uses TCP For UNIX socket addresses, we don't need any port, so let's disable the check under this condition.	2014-05-10 01:26:38 +02:00
Willy Tarreau	47f48c4247	MEDIUM: unix: add preliminary support for connecting to servers over UNIX sockets We've had everything in place for this for a while now, we just missed the connect function for UNIX sockets. Note that in order to connect to a UNIX socket inside a chroot, the path will have to be relative to the chroot. UNIX sockets connect about twice as fast as TCP sockets (or consume about half of the CPU at the same rate). This is interesting for internal communications between SSL processes and HTTP processes for example, or simply to avoid allocating source ports on the loopback. The tcp_connect_probe() function is still used to probe a dataless connection, but it is compatible so that's not an issue for now. Health checks are not yet fully supported since they require a port.	2014-05-10 01:26:38 +02:00
Willy Tarreau	9cf8d3f46b	MINOR: protocols: use is_inet_addr() when only INET addresses are desired We used to have is_addr() in place to validate sometimes the existence of an address, sometimes a valid IPv4 or IPv6 address. Replace them carefully so that is_inet_addr() is used wherever we can only use an IPv4/IPv6 address.	2014-05-10 01:26:37 +02:00
Willy Tarreau	640556c692	BUG/MINOR: checks: correctly configure the address family and protocol Currently, mixing an IPv4 and an IPv6 address in checks happens to work by pure luck because the two protocols use the same functions at the socket level and both use IPPROTO_TCP. However, they're definitely wrong as the protocol for the check address is retrieved from the server's address. Now the protocol assigned to the connection is the same as the one the address in use belongs to (eg: the server's address or the explicit check address).	2014-05-10 01:26:37 +02:00
Willy Tarreau	28e9d06201	BUG/MINOR: backend: only match IPv4 addresses with RDP cookies The RDP cookie extractor compares the 32-bit address from the request to the address of each server in the farm without first checking that the server's address is IPv4. This is a leftover from the IPv4 to IPv6 conversion. It's harmless as it's unlikely that IPv4 and IPv6 servers will be mixed in an RDP farm, but better fix it. This patch does not need to be backported.	2014-05-10 01:26:37 +02:00
Willy Tarreau	acf3bf94d0	CLEANUP: config: set the maxaccept value for peers listeners earlier Since we introduced bind_conf in peers, we can set maxaccept in a cleaner way at the proper time, let's do this to make the code more readable.	2014-05-09 22:12:24 +02:00
Willy Tarreau	67c2abc2f3	MINOR: config: only report a warning when stats sockets are bound to more than 1 process Till now a warning was emitted if the "stats bind-process" was not specified when nbproc was greater than 1. Now we can be much finer and only emit a warning when at least of the stats socket is bound to more than one process at a time.	2014-05-09 19:16:26 +02:00
Willy Tarreau	ae30253c27	MAJOR: listener: only start listeners bound to the same processes Now that we know what processes a "bind" statement is attached to, we have the ability to avoid starting some of them when they're not on the proper process. This feature is disabled when running in foreground however, so that debug mode continues to work with everything bound to the first and only process. The main purpose of this change is to finally allow the global stats sockets to be each bound to a different process. It can also be used to force haproxy to use different sockets in different processes for the same IP:port. The purpose is that under Linux 3.9 and above (and possibly other OSes), when multiple processes are bound to the same IP:port via different sockets, the system is capable of performing a perfect round-robin between the socket queues instead of letting any process pick all the connections from a queue. This results in a smoother load balancing and may achieve a higher performance with a large enough maxaccept setting.	2014-05-09 19:16:26 +02:00
Willy Tarreau	3d20958190	MEDIUM: listener: inherit the process mask from the proxy When a process list is specified on either the proxy or the bind lines, the latter is refined to the intersection of the two. A warning is emitted if no intersection is found, and the situation is fixed by either falling back to the first process of the proxy or to all processes.	2014-05-09 19:16:26 +02:00
Willy Tarreau	6ae1ba6f29	MEDIUM: listener: parse the new "process" bind keyword This sets the bind_proc entry in the bind_conf config block. For now it's still unused, but the doc was updated.	2014-05-09 19:16:26 +02:00
Willy Tarreau	102df613a9	MEDIUM: config: check the bind-process settings according to nbproc When a bind-process setting is present in a frontend or backend, we now verify that the specified process range at least shares one common process with those defined globally by nbproc. Then if the value is set, it is reduced to the one enforced by nbproc. A warning is emitted if process count does not match, and the fix is done the following way : - if a single process was specified in the range, it's remapped to process #1 - if more than one process was specified, the binding is removed and all processes are usable. Note that since backends may inherit their settings from frontends, depending on the declaration order, they may or may not be reported as warnings.	2014-05-09 19:16:26 +02:00
Willy Tarreau	a9db57ec5c	MEDIUM: config: limit nbproc to the machine's word size Some consistency checks cannot be performed between frontends, backends and peers at the moment because there is no way to check for intersection between processes bound to some processes when the number of processes is higher than the number of bits in a word. So first, let's limit the number of processes to the machine's word size. This means nbproc will be limited to 32 on 32-bit machines and 64 on 64-bit machines. This is far more than enough considering that configs rarely go above 16 processes due to scalability and management issues, so 32 or 64 should be fine. This way we'll ensure we can always build a mask of all the processes a section is bound to.	2014-05-09 19:16:26 +02:00
Willy Tarreau	3507d5d096	MEDIUM: proxy: only adjust the backend's bind-process when already set By default, a proxy's bind_proc is zero, meaning "bind to all processes". It's only when not zero that its process list is restricted. So we don't want the frontends to enforce the value on the backends when the backends are still set to zero.	2014-05-09 19:16:26 +02:00
Emeric Brun	93ee249fd1	MINOR: ssl: remove fallback to SSL session private cache if lock init fails. Now, haproxy exit an error saying: Unable to initialize the lock for the shared SSL session cache. You can retry using the global statement 'tune.ssl.force-private-cache' but it could increase the CPU usage due to renegotiation if nbproc > 1.	2014-05-09 19:16:13 +02:00
Emeric Brun	8dc6039807	MINOR: ssl: add global statement tune.ssl.force-private-cache. Boolean: used to force a private ssl session cache for each process in case of nbproc > 1.	2014-05-09 19:16:13 +02:00
Emeric Brun	78bd4038d7	BUG/MINOR: chunk: Fix function chunk_strcmp and chunk_strcasecmp match a substring. They could match different strings as equal if the chunk was shorter than the string. Those functions are currently only used for SSL's certificate DN entry extract.	2014-05-09 19:16:13 +02:00
David S	afb768340c	MEDIUM: connection: Implement and extented PROXY Protocol V2 This commit modifies the PROXY protocol V2 specification to support headers longer than 255 bytes allowing for optional extensions. It implements the PROXY protocol V2 which is a binary representation of V1. This will make parsing more efficient for clients who will know in advance exactly how many bytes to read. Also, it defines and implements some optional PROXY protocol V2 extensions to send information about downstream SSL/TLS connections. Support for PROXY protocol V1 remains unchanged.	2014-05-09 08:25:38 +02:00
Willy Tarreau	b4f98098aa	BUG/MAJOR: session: recover the correct connection pointer in half-initialized sessions John-Paul Bader reported a nasty segv which happens after a few hours when SSL is enabled under a high load. Fortunately he could catch a stack trace, systematically looking like this one : (gdb) bt full level = 6 conn = (struct connection ) 0x0 err_msg = <value optimized out> s = (struct session ) 0x80337f800 conn = <value optimized out> flags = 41997063 new_updt = <value optimized out> old_updt = 1 e = <value optimized out> status = 0 fd = 53999616 nbfd = 279 wait_time = <value optimized out> updt_idx = <value optimized out> en = <value optimized out> eo = <value optimized out> count = 78 sr = <value optimized out> sw = <value optimized out> rn = <value optimized out> wn = <value optimized out> The variable "flags" in conn_fd_handler() holds a copy of connection->flags when entering the function. These flags indicate 41997063 = 0x0280d307 : - {SOCK,DATA,CURR}_RD_ENA=1 => it's a handshake, waiting for reading - {SOCK,DATA,CURR}_WR_ENA=0 => no need for writing - CTRL_READY=1 => FD is still allocated - XPRT_READY=1 => transport layer is initialized - ADDR_FROM_SET=1, ADDR_TO_SET=0 => clearly it's a frontend connection - INIT_DATA=1, WAKE_DATA=1 => processing a handshake (ssl I guess) - {DATA,SOCK}_{RD,WR}_SH=0 => no shutdown - ERROR=0, CONNECTED=0 => handshake not completed yet - WAIT_L4_CONN=0 => normal - WAIT_L6_CONN=1 => waiting for an L6 handshake to complete - SSL_WAIT_HS=1 => the pending handshake is an SSL handshake So this is a handshake is in progress. And the only way to reach line 88 is for the handshake to complete without error. So we know for sure that ssl_sock_handshake() was called and completed the handshake then removed the CO_FL_SSL_WAIT_HS flag from the connection. With these flags, ssl_sock_handshake() does only call SSL_do_handshake() and retruns. So that means that the problem is necessarily in data->init(). The fd is wrong as reported but is simply mis-decoded as it's the lower half of the last function pointer. What happens in practice is that there's an issue with the way we deal with embryonic sessions during their conversion to regular sessions. Since they have no stream interface at the beginning, the pointer to the connection is temporarily stored into s->target. Then during their conversion, the first stream interface is properly initialized and the connection is attached to it, then s->target is set to NULL. The problem is that if anything fails in session_complete(), the session is left in this intermediate state where s->target is NULL, and kill_mini_session() is called afterwards to perform the cleanup. It needs the connection, that it finds in s->target which is NULL, dereferences it and dies. The only reasons for dying here are a problem on the TCP connection when doing the setsockopt(TCP_NODELAY) or a memory allocation issue. This patch implements a solution consisting in restoring s->target in session_complete() on the error path. That way embryonic sessions that were valid before calling it are still valid after. The bug was introduced in 1.5-dev20 by commit `f8a49ea` ("MEDIUM: session: attach incoming connection to target on embryonic sessions"). No backport is needed. Special thanks to John for his numerous tests and traces.	2014-05-08 22:46:32 +02:00
Emeric Brun	cd1a526a90	MAJOR: ssl: Change default locks on ssl session cache. Prevously pthread process shared lock were used by default, if USE_SYSCALL_FUTEX is not specified. This patch implements an OS independant kind of lock: An active spinlock is usedf if USE_SYSCALL_FUTEX is not specified. The old behavior is still available if USE_PTHREAD_PSHARED=1.	2014-05-08 22:46:32 +02:00
Emeric Brun	caa19cc867	BUG/MAJOR: ssl: Fallback to private session cache if current lock mode is not supported. Process shared mutex seems not supported on some OSs (FreeBSD). This patch checks errors on mutex lock init to fallback on a private session cache (per process cache) in error cases.	2014-05-08 22:46:32 +02:00
Willy Tarreau	5cbe4ef265	BUILD: ssl: SSL_CTX_set_msg_callback() needs openssl >= 0.9.7 1.5-dev24 introduced SSL_CTX_set_msg_callback(), which came with OpenSSL 0.9.7. A build attempt with an older one failed and we're still compatible with 0.9.6 in 1.5.	2014-05-08 22:46:31 +02:00
Willy Tarreau	bb66030a30	MEDIUM: listener: make the accept function more robust against pauses During some tests in multi-process mode under Linux, it appeared that issuing "disable frontend foo" on the CLI to pause a listener would make the shutdown(read) of certain processes disturb another process listening on the same socket, resulting in a 100% CPU loop. What happens is that accept() returns EAGAIN without accepting anything. Fortunately, we see that epoll_wait() reports EPOLLIN+EPOLLRDHUP (likely because the FD points to the same file in the kernel), so we can use that to stop the other process from trying to accept connections for a short time and try again later, hoping for the situation to change. We must not disable the FD otherwise there's no way to re-enable it. Additionally, during these tests, a loop was encountered on EINVAL which was not caught. Now if we catch an EINVAL, we proceed the same way, in case the socket is re-enabled later.	2014-05-07 23:13:08 +02:00
Willy Tarreau	3bed5e9337	BUG/MEDIUM: http: disable server-side expiration until client has sent the body It's the final part of the 2 previous patches. We prevent the server from timing out if we still have some data to pass to it. That way, even if the server runs with a short timeout and the client with a large one, the server side timeout will only start to count once the client sends everything. This ensures we don't report a 504 before the server gets the whole request. It is not certain whether the 1.4 state machine is fully compatible with this change. Since the purpose is only to ensure that we never report a server error before a client error if some data are missing from the client and when the server-side timeout is smaller than or equal to the client's, it's probably not worth attempting the backport.	2014-05-07 15:23:52 +02:00
Willy Tarreau	b9edf8fbec	BUG/MEDIUM: http: correctly report request body timeouts This is the continuation of previous patch "BUG/MEDIUM: http/session: disable client-side expiration only after body". This one takes care of properly reporting the client-side read timeout when waiting for a body from the client. Since the timeout may happen before or after the server starts to respond, we have to take care of the situation in three different ways : - if the server does not read our data fast enough, we emit a 504 if we're waiting for headers, or we simply break the connection if headers were already received. We report either sH or sD depending on whether we've seen headers or not. - if the server has not yet started to respond, but has read all of the client's data and we're still waiting for more data from the client, we can safely emit a 408 and abort the request ; - if the server has already started to respond (thus it's a transfer timeout during a bidirectional exchange), then we silently break the connection, and only the session flags will indicate in the logs that something went wrong with client or server side. This bug is tagged MEDIUM because it touches very sensible areas, however its impact is very low. It might be worth performing a careful backport to 1.4 once it has been confirmed that everything is correct and that it does not introduce any regression.	2014-05-07 15:22:27 +02:00
Willy Tarreau	b1982e27aa	BUG/MEDIUM: http/session: disable client-side expiration only after body For a very long time, back in the v1.3 days, we used to rely on a trick to avoid expiring the client side while transferring a payload to the server. The problem was that if a client was able to quickly fill the buffers, and these buffers took some time to reach the server, the client should not expire while not sending anything. In order to cover this situation, the client-side timeout was disabled once the connection to the server was OK, since it implied that we would at least expire on the server if required. But there is a drawback to this : if a client stops uploading data before the end, its timeout is not enforced and we only expire on the server's timeout, so the logs report a 504. Since 1.4, we have message body analysers which ensure that we know whether all the expected data was received or not (HTTP_MSG_DATA or HTTP_MSG_DONE). So we can fix this problem by disabling the client-side or server-side timeout at the end of the transfer for the respective side instead of having it unconditionally in session.c during all the transfer. With this, the logs now report the correct side for the timeout. Note that this patch is not enough, because another issue remains : the HTTP body forwarders do not abort upon timeout, they simply rely on the generic handling from session.c. So for now, the session is still aborted when reaching the server timeout, but the culprit is properly reported. A subsequent patch will address this specific point. This bug was tagged MEDIUM because of the changes performed. The issue it fixes is minor however. After some cooling down, it may be backported to 1.4. It was reported by and discussed with Rachel Chavez and Patrick Hemmer on the mailing list.	2014-05-07 14:21:47 +02:00
William Lallemand	07c8b24edb	MINOR: http: export the smp_fetch_cookie function Remove the static attribute of smp_fetch_cookie, and declare the function in proto/proto_http.h for future use.	2014-05-02 18:05:15 +02:00
Emeric Brun	b73a9b039c	MINOR: ssl: convert to binary ssl_fc_unique_id and ssl_bc_unique_id. Previously ssl_fc_unique_id and ssl_bc_unique_id return a string encoded in base64 of the RFC 5929 TLS unique identifier. This patch modify those fetches to return directly the ID in the original binary format. The user can make the choice to encode in base64 using the converter. i.e. : ssl_fc_unique_id,base64	2014-04-30 22:31:11 +02:00
Emeric Brun	53d1a98270	MINOR: ssl: adds sample converter base64 for binary type. The new converter encode binary type sample to base64 string. i.e. : ssl_c_serial,base64	2014-04-30 22:31:11 +02:00
Emeric Brun	55f4fa8825	MINOR: ssl: adds ssl_f_sha1 fetch to return frontend's certificate fingerprint ssl_f_sha1 is a binary binary fetch used to returns the SHA-1 fingerprint of the certificate presented by the frontend when the incoming connection was made over an SSL/TLS transport layer. This can be used to know which certificate was chosen using SNI.	2014-04-30 22:31:11 +02:00
Emeric Brun	ba841a1da1	MINOR: ssl: merge client's and frontend's certificate functions.	2014-04-30 22:31:11 +02:00
Emeric Brun	645ae79b40	MINOR: ssl: adds fetchs and ACLs for ssl back connection. Adds ssl fetchs and ACLs for outgoinf SSL/Transport layer connection with their docs: ssl_bc, ssl_bc_alg_keysize, ssl_bc_cipher, ssl_bc_protocol, ssl_bc_unique_id, ssl_bc_session_id and ssl_bc_use_keysize.	2014-04-30 22:31:11 +02:00
Emeric Brun	5bd99b4bd6	MINOR: ssl: clean unused ACLs declarations Now those ACLs are automatically created from pattern fetch declare.	2014-04-30 22:16:39 +02:00
Willy Tarreau	644c101e2d	BUG/MAJOR: http: connection setup may stall on balance url_param On the mailing list, seri0528@naver.com reported an issue when using balance url_param or balance uri. The request would sometimes stall forever. Cyril Bont� managed to reproduce it with the configuration below : listen test :80 mode http balance url_param q hash-type consistent server s demo.1wt.eu:80 and found it appeared with this commit : `80a92c0` ("BUG/MEDIUM: http: don't start to forward request data before the connect"). The bug is subtle but real. The problem is that the HTTP request forwarding analyzer refrains from starting to parse the request body when some LB algorithms might need the body contents, in order to preserve the data pointer and avoid moving things around during analysis in case a redispatch is later needed. And in order to detect that the connection establishes, it watches the response channel's CF_READ_ATTACHED flag. The problem is that a request analyzer is not subscribed to a response channel, so it will only see changes when woken for other (generally correlated) reasons, such as the fact that part of the request could be sent. And since the CF_READ_ATTACHED flag is cleared once leaving process_session(), it is important not to miss it. It simply happens that sometimes the server starts to respond in a sequence that validates the connection in the middle of process_session(), that it is detected after the analysers, and that the newly assigned CF_READ_ATTACHED is not used to detect that the request analysers need to be called again, then the flag is lost. The CF_WAKE_WRITE flag doesn't work either because it's cleared upon entry into process_session(), ie if we spend more than one call not connecting. Thus we need a new flag to tell the connection initiator that we are specifically interested in being notified about connection establishment. This new flag is CF_WAKE_CONNECT. It is set by the requester, and is cleared once the connection succeeds, where CF_WAKE_ONCE is set instead, causing the request analysers to be scanned again. For future versions, some better options will have to be considered : - let all analysers subscribe to both request and response events ; - let analysers subscribe to stream interface events (reduces number of useless calls) - change CF_WAKE_WRITE's semantics to persist across calls to process_session(), but that is different from validating a connection establishment (eg: no data sent, or no data to send) The bug was introduced in 1.5-dev23, no backport is needed.	2014-04-30 20:02:02 +02:00
Willy Tarreau	7c29f1edca	BUILD: config: remove a warning with clang Commit `fc6c032` ("MEDIUM: global: add support for CPU binding on Linux ("cpu-map")") merged into 1.5-dev13 involves a useless test that clang reports as a warning. The "low" variable cannot be negative here. Issue reported by Charles Carter.	2014-04-29 19:55:25 +02:00
Willy Tarreau	86e0fc1739	BUG/MINOR: auth: fix wrong return type in pat_match_auth() Commit `5338eea` ("MEDIUM: pattern: The match function browse itself the list or the tree") changed the return type of pattern matching functions. One enum was left over in pat_match_auth(). Fortunately, this one equals zero where a null pointer is expected, so it's cast correctly. This detected and reported by Charles Carter was introduced in 1.5-dev23, no backport is needed.	2014-04-29 19:52:16 +02:00
Willy Tarreau	ed44649eb7	MEDIUM: config: warn that '{cli,con,srv}timeout' are deprecated It's been like this since version 1.3 in 2007. It's time to clean up configurations. The warning explains what to use depending on the timeout name.	2014-04-29 01:09:56 +02:00
Willy Tarreau	a3c504c032	MEDIUM: config: inform the user only once that "redispatch" is deprecated It may go away in 1.6, but there's no point reporting it for each and every occurrence.	2014-04-29 01:09:40 +02:00
Willy Tarreau	40bac83734	MEDIUM: config: inform the user that "reqsetbe" is deprecated It will go away in 1.6.	2014-04-29 00:46:01 +02:00
Willy Tarreau	de9d2d7b86	MEDIUM: config: inform the user about the deprecatedness of "block" rules It's just a warning emitted once.	2014-04-29 00:46:01 +02:00
Willy Tarreau	ff05550b5d	MINOR: config: add minimum support for emitting warnings only once This is useful to explain to users what to do during a migration.	2014-04-29 00:46:01 +02:00
Willy Tarreau	0b7483385e	MEDIUM: http: make http-request rules processing return a verdict instead of a rule Till now we used to return a pointer to a rule, but that makes it complicated to later add support for registering new actions which may fail. For example, the redirect may fail if the response is too large to fit into the buffer. So instead let's return a verdict. But we needed the pointer to the last rule to get the address of a redirect and to get the realm used by the auth page. So these pieces of code have moved into the function and they produce a verdict.	2014-04-29 00:46:01 +02:00
Willy Tarreau	ae3c010226	MEDIUM: http: factorize the "auth" action of http-request and stats Both use exactly the same mechanism, except for the choice of the default realm to be emitted when none is selected. It can be achieved by simply comparing the ruleset with the stats' for now. This achieves a significant code reduction and further, removes the dependence on the pointer to the final rule in the caller.	2014-04-29 00:46:01 +02:00
Willy Tarreau	f75e5c3d84	MINOR: http: remove the now unused loop over "block" rules This ruleset is now always empty, simply remove it.	2014-04-28 22:15:00 +02:00
Willy Tarreau	b3dc39dfe1	MEDIUM: http: emulate "block" rules using "http-request" rules The "block" rules are redundant with http-request rules because they are performed immediately before and do exactly the same thing as "http-request deny". Moreover, this duplication has led to a few minor stats accounting issues fixed lately. Instead of keeping the two rule sets, we now build a list of "block" rules that we compile as "http-request block" and that we later insert at the beginning of the "http-request" rules. The only user-visible change is that in case of a parsing error, the config parser will now report "http-request block rule" instead of "blocking condition".	2014-04-28 22:06:57 +02:00
Willy Tarreau	353bc9f43f	CLEANUP: proxy: rename "block_cond" to "block_rules" Next patch will make them real rules, not only conditions. This separate patch makes the next one more readable.	2014-04-28 22:05:31 +02:00
Willy Tarreau	5bd6759a19	MINOR: http: silently support the "block" action for http-request This one will be used to convert "block" rules into "http-request block".	2014-04-28 22:00:46 +02:00
Willy Tarreau	5254259609	MEDIUM: http: remove even more of the spaghetti in the request path Some of the remaining interleaving of request processing after the http-request rules can now safely be removed, because all remaining actions are mutually exclusive. So we can move together all those related to an intercepting rule, then proceed with stats, then with req*. We still keep an issue with stats vs reqrep which forces us to keep the stats split in two (detection and action). Indeed, from the beginning, stats are detected before rewriting and not after. But a reqdeny rule would stop stats, so in practice we have to first detect, then perform the action. Maybe we'll be able to kill this in version 1.6.	2014-04-28 21:35:30 +02:00
Willy Tarreau	179085ccac	MEDIUM: http: move Connection header processing earlier Till now the Connection header was processed in the middle of the http-request rules and some reqadd rules. It used to force some http-request actions to be cut in two parts. Now with keep-alive, not only that doesn't make any sense anymore, but it's becoming a total mess, especially since we need to know the headers contents before proceeding with most actions. The real reason it was not moved earlier is that the "block" or "http-request" rules can see a different version if some fields are changed there. But that is already not reliable anymore since the values observed by the frontend differ from those in the backend. This patch is the equivalent of commit `f118d9f` ("REORG: http: move HTTP Connection response header parsing earlier") but for the request side. It has been tagged MEDIUM as it could theorically slightly affect some setups relying on corner cases or invalid setups, though this does not make real sense and is highly unlikely.	2014-04-28 21:35:29 +02:00
Willy Tarreau	65410831a1	BUG/MINOR: http: block rules forgot to increment the session's request counter The session's backend request counters were incremented after the block rules while these rules could increment the session's error counters, meaning that we could have more errors than requests reported in a stick table! Commit `5d5b5d8` ("MEDIUM: proto_tcp: add support for tracking L7 information") is the most responsible for this. This bug is 1.5-specific and does not need any backport.	2014-04-28 21:34:43 +02:00
Willy Tarreau	5fa7082911	BUG/MINOR: http: block rules forgot to increment the denied_req counter "block" rules used to build the whole response and forgot to increment the denied_req counters. By jumping to the general "deny" label created in previous patch, it's easier to fix this. The issue was already present in 1.3 and remained unnoticed, in part because few people use "block" nowadays.	2014-04-28 18:46:40 +02:00
Willy Tarreau	bbba2a8ecc	MEDIUM: http: jump to dedicated labels after http-request processing Continue the cleanup of http-request post-processing to remove some of the interleaved tests. Here we set up a few labels to deal with the deny and tarpit actions and avoid interleaved ifs.	2014-04-28 18:46:20 +02:00
Willy Tarreau	5e9edce0f0	MEDIUM: http: move reqadd after execution of http_request redirect We still have a plate of spaghetti in the request processing rules. All http-request rules are executed at once, then some responses are built interlaced with other rules that used to be there in the past. Here, reqadd is executed after an http-req redirect rule is decided, but before it is executed. So let's match the doc and config checks, to put the redirect actually before the reqadd completely.	2014-04-28 17:25:40 +02:00
Willy Tarreau	cfe7fdd02d	MINOR: http: rely on the message body parser to send 100-continue There's no point in open-coding the sending of 100-continue in the stats initialization code, better simply rely on the function designed to process the message body which already does it.	2014-04-28 17:25:40 +02:00
Willy Tarreau	e6d24163e5	BUG/MINOR: http: log 407 in case of proxy auth Commit `844a7e7` ("[MEDIUM] http: add support for proxy authentication") merged in v1.4-rc1 added the ability to emit a status code 407 in auth responses, but forgot to set the same status in the logs, which still contain 401. The bug is harmless, no backport is needed.	2014-04-28 17:24:42 +02:00
Willy Tarreau	f767ac55a2	BUG/MINOR: proxy: unsafe initialization of HTTP transaction when switching from TCP frontend A switch from a TCP frontend to an HTTP backend initializes the HTTP transaction. txn->hdr_idx.size is used by hdr_idx_init() but not necessarily initialized yet here, because the first call to hdr_idx_init() is in fact placed in http_init_txn(). Moving it before the call is enough to fix it. We also remove the useless extra confusing call to hdr_idx_init(). The bug was introduced in 1.5-dev8 with commit `ac1932d` ("MEDIUM: tune.http.maxhdr makes it possible to configure the maximum number of HTTP headers"). No backport to stable is needed.	2014-04-28 17:24:39 +02:00
Thierry FOURNIER	e47e4e2385	BUG/MEDIUM: patterns: last fix was still not enough Last fix did address the issue for inlined patterns, but it was not enough because the flags are lost as well when updating patterns dynamically over the CLI. Also if the same file was used once with -i and another time without -i, their references would have been merged and both would have used the same matching method. It's appear that the patterns have two types of flags. The first ones are relative to the pattern matching, and the second are relative to the pattern storage. The pattern matching flags are the same for all the patterns of one expression. Now they are stored in the expression. The storage flags are information returned by the pattern mathing function. This information is relative to each entry and is stored in the "struct pattern". Now, the expression matching flags are forwarded to the parse and index functions. These flags are stored during the configuration parsing, and they are used during the parse and index actions. This issue was introduced in dev23 with the major pattern rework, and is a continuation of commit `a631fc8` ("BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns"). No backport is needed.	2014-04-28 14:19:17 +02:00
Willy Tarreau	a631fc8de8	BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns These flags are only passed to pattern_read_from_file() which loads the patterns from a file. The functions used to parse the patterns from the current line do not provide the means to pass the pattern flags so they're lost. This issue was introduced in dev23 with the major pattern rework, and was reported by Graham Morley. No backport is needed.	2014-04-27 09:21:08 +02:00
Willy Tarreau	3b78696858	BUG/MEDIUM: pattern: a typo breaks automatic acl/map numbering Dmitry Sivachenko reported that nice warning : src/pattern.c:2243:43: warning: if statement has empty body [-Wempty-body] if (&ref2->list == &pattern_reference); ^ src/pattern.c:2243:43: note: put the semicolon on a separate line to silence this warning It was merged as is with the code from commit `af5a29d` ("MINOR: pattern: Each pattern is identified by unique id"). So it looks like we can reassign an ID which is still in use because of this.	2014-04-26 12:41:32 +02:00
Willy Tarreau	aeed672a6d	MINOR: ssl: finally catch the heartbeats missing the padding Previous patch only focused on parsing the packet right and blocking it, so it relaxed one test on the packet length. The difference is not usable for attacking but the logs will not report an attack for such cases, which is probably bad. Better report all known invalid packets cases.	2014-04-26 00:03:48 +02:00
Willy Tarreau	3b2fdb6f55	BUG/MINOR: ssl: really block OpenSSL's response to heartbleed attack Recent commit `f51c698` ("MEDIUM: ssl: implement a workaround for the OpenSSL heartbleed attack") did not always work well, because OpenSSL is fun enough for not testing errors before sending data... So the output sometimes contained some data. The OpenSSL code relies on the max_send_segment value to limit the packet length. The code ensures that a value of zero will result in no single byte leaking. So we're forcing this instead and that definitely fixes the issue. Note that we need to set it the hard way since the regular API checks for valid values.	2014-04-25 23:48:21 +02:00
Willy Tarreau	84815006a0	BUILD: ssl: avoid a warning about conn not used with OpenSSL < 1.0.1 Building with a version of openssl without heartbeat gives this since latest `29f037d` ("MEDIUM: ssl: explicitly log failed handshakes after a heartbeat") : src/ssl_sock.c: In function 'ssl_sock_msgcbk': src/ssl_sock.c:188: warning: unused variable 'conn' Simply declare conn inside the ifdef. No backport is needed.	2014-04-25 21:40:27 +02:00
Willy Tarreau	6c09c2ceae	BUILD: http: remove a warning on strndup The latest commit about set-map/add-acl/... causes this warning for me : src/proto_http.c: In function 'parse_http_req_cond': src/proto_http.c:8863: warning: implicit declaration of function 'strndup' src/proto_http.c:8863: warning: incompatible implicit declaration of built-in function 'strndup' src/proto_http.c:8890: warning: incompatible implicit declaration of built-in function 'strndup' src/proto_http.c:8917: warning: incompatible implicit declaration of built-in function 'strndup' src/proto_http.c:8944: warning: incompatible implicit declaration of built-in function 'strndup' Use my_strndup() instead of strndup() which is not portable. No backport needed.	2014-04-25 21:39:17 +02:00
Willy Tarreau	6e774b455f	BUG/MEDIUM: Revert "MEDIUM: ssl: Add standardized DH parameters >= 1024 bits" This reverts commit `9ece05f590`. Sander Klein reported an important performance regression with this patch applied. It is not yet certain what is exactly the cause but let's not break other setups now and sort this out after dev24. The commit was merged into dev23, no need to backport.	2014-04-25 21:35:23 +02:00
Willy Tarreau	f51c6989b0	MEDIUM: ssl: implement a workaround for the OpenSSL heartbleed attack Using the previous callback, it's trivial to block the heartbeat attack, first we control the message length, then we emit an SSL error if it is out of bounds. A special log is emitted, indicating that a heartbleed attack was stopped so that they are not confused with other failures. That way, haproxy can protect itself even when running on an unpatched SSL stack. Tests performed with openssl-1.0.1c indicate a total success.	2014-04-25 20:06:33 +02:00
Emeric Brun	29f037d872	MEDIUM: ssl: explicitly log failed handshakes after a heartbeat Add a callback to receive the heartbeat notification. There, we add SSL_SOCK_RECV_HEARTBEAT flag on the ssl session if a heartbeat is seen. If a handshake fails, we log a different message to mention the fact that a heartbeat was seen. The test is only performed on the frontend side.	2014-04-25 19:25:33 +02:00
William Lallemand	73025dd7e2	MEDIUM: http: register http-request and http-response keywords The http_(res\|req)_keywords_register() functions allow to register new keywords. You need to declare a keyword list: struct http_req_action_kw_list test_kws = { .scope = "testscope", .kw = { { "test", parse_test }, { NULL, NULL }, } }; and a parsing function: int parse_test(const char *args, int cur_arg, struct proxy px, struct http_req_rule rule, char **err) { rule->action = HTTP_REQ_ACT_CUSTOM_STOP; rule->action_ptr = action_function; return 0; } http_req_keywords_register(&test_kws); The HTTP_REQ_ACT_CUSTOM_STOP action stops evaluation of rules after your rule, HTTP_REQ_ACT_CUSTOM_CONT permits the evaluation of rules after your rule.	2014-04-25 18:48:35 +02:00
Baptiste Assmann	fabcbe0de6	MEDIUM: http: ACL and MAP updates through http-(request\|response) rules This patch allows manipulation of ACL and MAP content thanks to any information available in a session: source IP address, HTTP request or response header, etc... It's an update "on the fly" of the content of the map/acls. This means it does not resist to reload or restart of HAProxy.	2014-04-25 18:48:35 +02:00
Baptiste Assmann	953f74d1b3	MINOR: pattern: find element in a reference This function can be used to look for an entry in either an ACL or a MAP.	2014-04-25 17:31:13 +02:00
Willy Tarreau	c35362a94a	MINOR: http: implement the max-keep-alive-queue setting Finn Arne Gangstad suggested that we should have the ability to break keep-alive when the target server has reached its maxconn and that a number of connections are present in the queue. After some discussion around his proposed patch, the following solution was suggested : have a per-proxy setting to fix a limit to the number of queued connections on a server after which we break keep-alive. This ensures that even in high latency networks where keep-alive is beneficial, we try to find a different server. This patch is partially based on his original proposal and implements this configurable threshold.	2014-04-25 14:14:41 +02:00
Willy Tarreau	6d8bac7ddc	BUG/MAJOR: http: fix the 'next' pointer when performing a redirect Commit `bed410e` ("MAJOR: http: centralize data forwarding in the request path") has woken up an issue in redirects, where msg->next is not reset when flushing the input buffer. The result is an attempt to forward a negative amount of data, making haproxy crash. This bug does not seem to affect versions prior to dev23, so no backport is needed.	2014-04-25 12:21:09 +02:00
Willy Tarreau	1746eecc52	MINOR: checks: add a new global max-spread-checks directive This directive ensures that checks with a huge interval do not start too far apart at the beginning.	2014-04-25 10:52:25 +02:00
Willy Tarreau	3c1b5ec29c	MINOR: http: add capture.req.ver and capture.res.ver These ones report a string as "HTTP/1.0" or "HTTP/1.1" depending on the version of the request message or the response message, respectively. The purpose is to be able to emit custom log lines reporting this version in a persistent way.	2014-04-24 23:41:57 +02:00
Willy Tarreau	8b8995f0f4	MINOR: stats: always emit HTTP/1.1 in responses We used to emit either 1.0 or 1.1 depending on whether we were sending chunks or not. This condition is useless, better always send 1.1. Also that way at least clients and intermediary proxies know we speak 1.1. The "Connection: close" header is still set anyway.	2014-04-24 22:53:43 +02:00

... 2 3 4 5 6 ...

3360 Commits