haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-17 08:54:41 +00:00

Author	SHA1	Message	Date
Willy Tarreau	f69d4ff006	BUG/MAJOR: http: prevent risk of reading past end with balance url_param The get_server_ph_post() function assumes that the buffer is contiguous. While this is true for all the header part, it is not necessarily true for the end of data the fit in the reserve. In this case there's a risk to read past the end of the buffer for a few hundred bytes, and possibly to crash the process if what follows is not mapped. The fix consists in truncating the analyzed length to the length of the contiguous block that follows the headers. A config workaround for this bug would be to disable balance url_param. This fix must be backported to 1.5. It seems 1.4 did have the check.	2015-05-02 00:10:43 +02:00
Willy Tarreau	e115b49c39	BUG/MEDIUM: http: wait for the exact amount of body bytes in wait_for_request_body Due to the fact that we were still considering only msg->sov for the first byte of data after calling http_parse_chunk_size(), we used to miscompute the input data size and to count the CRLF and the chunk size as part of the input data. The effect is that it was possible to release the processing with 3 or 4 missing bytes, especially if they're typed by hand during debugging sessions. This can cause the stats page to return some errors in admin mode, and the url_param balance algorithm to fail to properly hash a body input. This fix must be backported to 1.5.	2015-05-01 23:24:32 +02:00
Willy Tarreau	30fe818979	DOC: fix the comments about the meaning of msg->sol in HTTP It has a meaning while parsing a body when using chunked encoding. This must be backported to 1.5 since it caused a bug there as well.	2015-05-01 23:24:31 +02:00
Willy Tarreau	82649f9ef3	DOC: document option http-ignore-probes This one was forgotten.	2015-05-01 22:43:17 +02:00
Willy Tarreau	1abc6731ed	DOC: relax the peers restriction to single-process	2015-05-01 20:16:31 +02:00
Willy Tarreau	bf59807a13	MAJOR: peers: allow peers section to be used with nbproc > 1 This only works when the peers are bound to exactly one process.	2015-05-01 20:16:31 +02:00
Willy Tarreau	1e27301866	MEDIUM: config: validate that peers sections are bound to exactly one process If a peers section is bound to no process, it's silently discarded. If its bound to multiple processes, an error is emitted and the process will not start.	2015-05-01 20:16:31 +02:00
Willy Tarreau	f83d3fe00a	MEDIUM: init: stop any peers section not bound to the correct process This will prevent the peers section from remaining in listen state on the incorrect process. The peers_fe pointer is set to NULL, which will tell the peers task to commit suicide if it was already scheduled.	2015-05-01 20:16:31 +02:00
Willy Tarreau	0fca4835b2	MEDIUM: config: propagate the table's process list to the peers sections Now a peers section has its bind_proc set to the union of all those of its users.	2015-05-01 20:16:31 +02:00
Willy Tarreau	46dc1ca761	MEDIUM: peers: unregister peers that were never started The peers initialization sequence is a bit complex, they're attached to stick-tables and initialized very early in the boot process. When we fork, if some must not start, it's too late to find them. Instead, simply add a guard in their respective tasks to stop them once they want to start.	2015-05-01 20:16:31 +02:00
Willy Tarreau	aa729784e1	MINOR: peers: store the pointer to the signal handler We'll need it to unregister stopped peers sections.	2015-05-01 20:16:31 +02:00
Willy Tarreau	77e4bd1497	MEDIUM: peers: add the ability to disable a peers section Sometimes it's very hard to disable the use of peers because an empty section is not valid, so it is necessary to comment out all references to the section, and not to forget to restore them in the same state after the operation. Let's add a "disabled" keyword just like for proxies. A ->state member in the peers struct is even present for this purpose but was never used at all. Maybe it would make sense to backport this to 1.5 as it's really cumbersome there.	2015-05-01 20:16:31 +02:00
Willy Tarreau	6866f3f33f	MEDIUM: config: initialize stick-tables after peers, not before It's dangerous to initialize stick-tables before peers because they start a task that cannot be stopped before we know if the peers need to be disabled and destroyed. Move this after.	2015-05-01 20:16:31 +02:00
Willy Tarreau	c8b679180d	MINOR: stick-table: don't attach to peers in stopped state This will be used to disable peers sections.	2015-05-01 20:16:28 +02:00
Willy Tarreau	edaff0a8f5	MEDIUM: init: don't stop proxies in parent process when exiting That's pointless, and that's confusing when debugging.	2015-05-01 20:15:06 +02:00
Willy Tarreau	02df7740fb	BUG/MINOR: config: clear proxy->table.peers.p for disabled proxies If a table in a disabled proxy references a peers section, the peers name is not resolved to a pointer to a table, but since it belongs to a union, it can later be dereferenced. Right now it seems it cannot happen, but it definitely will after the pending changes. It doesn't cost anything to backport this into 1.5, it will make gdb sessions less head-scratching.	2015-05-01 20:05:25 +02:00
Willy Tarreau	0f228a037a	MEDIUM: http: add option-ignore-probes to get rid of the floods of 408 Recently some browsers started to implement a "pre-connect" feature consisting in speculatively connecting to some recently visited web sites just in case the user would like to visit them. This results in many connections being established to web sites, which end up in 408 Request Timeout if the timeout strikes first, or 400 Bad Request when the browser decides to close them first. These ones pollute the log and feed the error counters. There was already "option dontlognull" but it's insufficient in this case. Instead, this option does the following things : - prevent any 400/408 message from being sent to the client if nothing was received over a connection before it was closed ; - prevent any log from being emitted in this situation ; - prevent any error counter from being incremented That way the empty connection is silently ignored. Note that it is better not to use this unless it is clear that it is needed, because it will hide real problems. The most common reason for not receiving a request and seeing a 408 is due to an MTU inconsistency between the client and an intermediary element such as a VPN, which blocks too large packets. These issues are generally seen with POST requests as well as GET with large cookies. The logs are often the only way to detect them. This patch should be backported to 1.5 since it avoids false alerts and makes it easier to monitor haproxy's status.	2015-05-01 15:39:23 +02:00
Willy Tarreau	13317669d5	MEDIUM: http: disable support for HTTP/0.9 by default There's not much reason for continuing to accept HTTP/0.9 requests nowadays except for manual testing. Now we disable support for these by default, unless option accept-invalid-http-request is specified, in which case they continue to be upgraded to 1.0.	2015-05-01 14:57:54 +02:00
Willy Tarreau	91852eb428	MEDIUM: http: restrict the HTTP version token to 1 digit as per RFC7230 While RFC2616 used to allow an undeterminate amount of digits for the major and minor components of the HTTP version, RFC7230 has reduced that to a single digit for each. If a server can't properly parse the version string and falls back to 0.9, it could then send a head-less response whose payload would be taken for headers, which could confuse downstream agents. Since there's no more reason for supporting a version scheme that was never used, let's upgrade to the updated version of the standard. It is still possible to enforce support for the old behaviour using options accept-invalid-http-request and accept-invalid-http-response. It would be wise to backport this to 1.5 as well just in case.	2015-05-01 14:57:01 +02:00
Willy Tarreau	b4d0c03aee	BUG/MEDIUM: http: remove content-length form responses with bad transfer-encoding The spec mandates that content-length must be removed from messages if Transfer-Encoding is present, not just for valid ones. This must be backported to 1.5 and 1.4.	2015-05-01 13:56:11 +02:00
Willy Tarreau	34dfc60571	BUG/MEDIUM: http: incorrect transfer-coding in the request is a bad request The rules related to how to handle a bad transfer-encoding header (one where "chunked" is not at the final place) have evolved to mandate an abort when this happens in the request. Previously it was only a close (which is still valid for the server side). This must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Willy Tarreau	4979d5c5d1	BUG/MEDIUM: http: do not restrict parsing of transfer-encoding to HTTP/1.1 While Transfer-Encoding is HTTP/1.1, we must still parse it in HTTP/1.0 in case an agent sends it, because it's likely that the other side might use it as well, causing confusion. This will also result in getting rid of the Content-Length header in such abnormal situations and in having a clean connection. This must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Willy Tarreau	557f199fb7	DOC: http: update the comments about the rules for determining transfer-length Let's now use the text from RFC7230 which is stricter and more precise. This must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Willy Tarreau	1c91391df4	BUG/MEDIUM: http: remove content-length from chunked messages RFC7230 clarified the behaviour to adopt when facing both a content-length and a transfer-encoding: chunked in a message. While haproxy already complied with the method for getting the message length right, and used to detect improper content-length duplicates, it still did not remove the content-length header when facing a transfer-encoding: chunked. Usually it is not a problem since other agents (clients and servers) are required to parse the message according to the rules that have been in place since RFC2616 in 1999. However R�gis Leroy reported the existence of at least one such non-compliant agent so haproxy could be abused to get out of sync with it on pipelined requests (HTTP request smuggling attack), it consider part of a payload as a subsequent request. The best thing to do is then to remove the content-length according to RFC7230. It used to be in the todo list with a fixme in the code while waiting for the standard to stabilize, let's apply it now that it's published. Thanks to R�gis for bringing that subject to our attention. This fix must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Simon Horman	1421e21fe4	MEDIUM: Document when email-alerts are sent Document the influence of email-alert level and other configuration parameters on when email-alerts are sent. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-04-30 07:30:51 +02:00
Simon Horman	4cd477f372	MEDIUM: Send email alerts when servers are marked as UP or enter the drain state This is similar to the way email alerts are sent when servers are marked as DOWN. Like the log messages corresponding to these state changes the messages have log level notice. Thus they are suppressed by the default email-alert level of 'alert'. To allow these messages the email-alert level should be set to 'notice', 'info' or 'debug'. e.g: email-alert level notice "email-alert mailers" and "email-alert to" settings are also required in order for any email alerts to be sent. A follow-up patch will document the above. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-04-30 07:30:50 +02:00
Simon Horman	7ea9be012d	MEDIUM: Lower priority of email alerts for log-health-checks messages Lower the priority of email alerts for log-health-checks messages from LOG_NOTICE to LOG_INFO. This is to allow set-ups with log-health-checks enabled to disable email for health check state changes while leaving other email alerts enabled. In order for email alerts to be sent for health check state changes "log-health-checks" needs to be set and "email-alert level" needs to be 'info' or lower. "email-alert mailers" and "email-alert to" settings are also required in order for any email alerts to be sent. A follow-up patch will document the above. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-04-30 07:30:50 +02:00
Willy Tarreau	f3045d2a06	MAJOR: pattern: add LRU-based cache on pattern matching The principle of this cache is to have a global cache for all pattern matching operations which rely on lists (reg, sub, dir, dom, ...). The input data, the expression and a random seed are used as a hashing key. The cached entries contains a pointer to the expression and a revision number for that expression so that we don't accidently used obsolete data after a pattern update or a very unlikely hash collision. Regarding the risk of collisions, 10k entries at 10k req/s mean 1% risk of a collision after 60 years, that's already much less than the memory's reliability in most machines and more durable than most admin's life expectancy. A collision will result in a valid result to be returned for a different entry from the same list. If this is not acceptable, the cache can be disabled using tune.pattern.cache-size. A test on a file containing 10k small regex showed that the regex matching was limited to 6k/s instead of 70k with regular strings. When enabling the LRU cache, the performance was back to 70k/s.	2015-04-29 19:15:24 +02:00
Willy Tarreau	72f073b6c7	MEDIUM: pattern: add a revision to all pattern expressions This will be used to detect any change on the pattern list between two operations, ultimately making it possible to implement a cache which immediately invalidates obsolete keys after an update. The revision is simply taken from the timestamp counter to ensure that even upon a pointer reuse we cannot accidently come back to the same (expr,revision) tuple.	2015-04-29 19:15:24 +02:00
Willy Tarreau	b5684e0081	IMPORT: hash: import xxhash-r39 The xxhash library provides a very fast and excellent hash algorithm suitable for many purposes. It excels at hashing large blocks but is also extremely fast on small ones. It's distributed under a 2-clause BSD license (GPL-compatible) so it can be included here. Updates are distributed here : https://github.com/Cyan4973/xxHash	2015-04-29 19:15:21 +02:00
Willy Tarreau	69c696c138	IMPORT: lru: import simple ebtree-based LRU functions This will be usable to implement some maps/acl caches for heavy datasets loaded from files (mostly regex-based but in general anything that cannot be indexed in a tree).	2015-04-29 19:14:43 +02:00
Willy Tarreau	e6e49cfa93	MINOR: tools: provide an rdtsc() function for time comparisons This one returns a timestamp, either the one from the CPU or from gettimeofday() in 64-bit format. The purpose is to be able to compare timestamps on various entities to make it easier to detect updates. It can also be used for benchmarking in certain situations during development.	2015-04-29 19:14:03 +02:00
Baptiste Assmann	f95bc8e3e0	BUG/MEDIUM: check: tcpcheck regression introduced by `e16c1b3f` The commit `e16c1b3f` changed the way the function tcpcheck_get_step_id is now called (check instead of server). This change introduced a regression since now this function would return 0 all the time because of: if (check->current_step) return 0; This patch fixes this issue by inversing the test: you want to return 0 only if current_step is not yet set :) No backport is needed.	2015-04-29 13:39:22 +02:00
Andrew Hayworth	0ebc55f6b4	MEDIUM: logs: Add HTTP request-line log format directives This commit adds 4 new log format variables that parse the HTTP Request-Line for more specific logging than "%r" provides. For example, we can parse the following HTTP Request-Line with these new variables: "GET /foo?bar=baz HTTP/1.1" - %HM: HTTP Method ("GET") - %HV: HTTP Version ("HTTP/1.1") - %HU: HTTP Request-URI ("/foo?bar=baz") - %HP: HTTP Request-URI without query string ("/foo")	2015-04-28 21:03:05 +02:00
Willy Tarreau	e5843b383d	BUG/MEDIUM: peers: recent applet changes broke peers updates scheduling Since appctx are scheduled out of streams, it's pointless to wake up the task managing the stream to push updates, they won't be seen. In fact unit tests work because silent sessions are restarted after 5s of idle and the exchange is correctly scheduled during startup! So we need to notify the appctx instead. For this we add a pointer to the appctx in the peer session. No backport is needed of course.	2015-04-27 18:42:17 +02:00
Willy Tarreau	6e2979ca31	BUG/MEDIUM: peers: fix applet scheduling Consecutive to the recent changes brought to applets, peers properly connect but do not exchange data anymore because the stream interface is not marked as waiting for data. No backport is needed.	2015-04-27 13:21:15 +02:00
Thierry FOURNIER	7f6192c0d3	BUG/MEDIUM: http: functions set-{path,query,method,uri} breaks the HTTP parser When one of these functions replaces a part of the query string by a shorter or longer new one, the header parsing is broken. This is because the start of the first header is not updated. In the same way, the total length of the request line is not updated. I dont see any bug caused by this miss, but I guess than it is better to store the good length. This bug is only in the development version.	2015-04-27 11:56:52 +02:00
Willy Tarreau	e91ffd093e	BUG/MAJOR: tcp: only call registered actions when they're registered Commit `cc87a11` ("MEDIUM: tcp: add register keyword system.") introduced the registration of new keywords for TCP rulesets. Unfortunately it replaced the "accept" action with an unconditionnal call to the rule's action function, resulting in an immediate segfault when using the "accept" action in a TCP ruleset. This bug reported by Baptiste Assmann was introduced in 1.6-dev1, no backport is needed.	2015-04-24 10:13:18 +02:00
Willy Tarreau	0b1a4541dc	MEDIUM: stream-int: pause the appctx if the task is woken up If we're going to call the task we don't need to call the appctx anymore since the task may decide differently in the end and will do the proper thing using ->update(). This reduces one wake up call per session and may go down to half in case of high concurrency (scheduling races).	2015-04-23 17:56:17 +02:00
Willy Tarreau	fe127937a8	MEDIUM: applet: make the applets only use si_applet_{cant\|want\|stop}_{get\|put} The applets don't fiddle with SI_FL_WAIT_ROOM anymore, instead they indicate what they want, possibly that they failed (eg: WAIT_ROOM), and it's done() / update() which finally updates the WAIT_* flags according to the channels' and stream interface's states. This solves the issue of the pauses during a "show sess" without creating busy loops.	2015-04-23 17:56:17 +02:00
Willy Tarreau	eb406dc73c	MINOR: stream-int: add two flags to indicate an applet's wishes regarding I/O Currently we have a problem. There are some cases where a sleeping applet is not woken up (eg: show sess during an injection). The reason is that the applet is marked WAIT_DATA and is not woken up when WAIT_ROOM leaves, because we wait for both flags to be cleared in order to call it. And if we wait for either flag, then we have the opposite situation, which is that we're not waiting for room in the output buffer so we're spinning calling the applet to do nothing. What is missing is an indication of what the applet needs. Since it only manipulates the WAIT_ROOM/WAIT_DATA which are overwritten later, that cannot work. In the case of connections, the problem doesn't happen because the connection maintains these extra states. Ideally we'd need to have similar states for each appctx and to store those information there. But it would be overcomplicated given that an applet doesn't exist alone without a stream-int, so we can safely put these information into the stream int and make the code simpler. With this patch we introduce two new flags in the stream interface : - SI_FL_WANT_PUT : the applet wants to put something into the buffer - SI_FL_WANT_GET : the applet wants to get something from the buffer We also have the new functions si_applet_{stop\|want\|cant}_{get\|put} to make the code look similar to the connection code. For now these flags are not used yet.	2015-04-23 17:56:17 +02:00
Willy Tarreau	bc39a5d8c8	MAJOR: stream: do not allocate request buffers anymore when the left side is an applet We used to allocate a request buffer so that we could process applets from process_stream(), and this was causing some trouble because it was not possible for an analyzer to return an error to an applet, which we'll need for HTTP/2. Now that we don't call applets anymore from process_stream() we can simplify this and ensure that a response is always allocated to process a stream.	2015-04-23 17:56:17 +02:00
Willy Tarreau	d4da196546	MEDIUM: applet: centralize the call to si_applet_done() in the I/O handler It's much easier to centralize this call into the I/O handler than to do it everywhere with the risk to miss it. Applets are not allowed to unregister themselves anyway so their SI is still present and it is possible to update all the context.	2015-04-23 17:56:17 +02:00
Willy Tarreau	b9c89111ab	MEDIUM: dumpstats: don't unregister the applet anymore Let the session do the job, the applet I/O handler doesn't have to unregister itself.	2015-04-23 17:56:16 +02:00
Willy Tarreau	563cc37609	MAJOR: stream: use a regular ->update for all stream interfaces Now si->update() is used to update any type of stream interface, whether it's an applet, a connection or even nothing. We don't call si_applet_call() anymore at the end of the resync and we don't have the risk that the stream's task is reinserted into the run queue, which makes the code a bit simpler. The stream_int_update_applet() function was simplified to ensure that it remained compatible with this standardized calling convention. It was almost copy-pasted from the update code dedicated to connections. Just like for si_applet_done(), it seems that it should be possible to merge the two functions except that it would require some slow operations, except maybe if the type of end point is tested inside the update function itself.	2015-04-23 17:56:16 +02:00
Willy Tarreau	828824af05	MAJOR: applet: now call si_applet_done() instead of si_update() in I/O handlers The applet I/O handlers now rely on si_applet_done() which itself decides to wake up or sleep the appctx. Now it becomes critical that applte handlers properly call this on every exit path so that the appctx is removed from the active list after I/O have been handled. One such call was added to the Lua socket handler. It used to work without it probably because the main task is woken up by the parent task but now it's needed.	2015-04-23 17:56:16 +02:00
Willy Tarreau	e5f8649102	MEDIUM: stream-int: add a new function si_applet_done() This is the equivalent of si_conn_wake() but for applets. It will be called after changes to the stream interface are brought by the applet I/O handler. Ultimately it will release buffers and may be even wake the stream's task up if some important changes are detected. It would be nice to be able to merge it with the connection's wake function since it mostly manipulates the stream interface, but there are minor differences (such as how to enable/disable polling on a fd vs applet) and some specificities to applets (eg: don't wake the applet up until the output is empty) which would require abstract functions which would slow down everything.	2015-04-23 17:56:16 +02:00
Willy Tarreau	3c595ac3ad	MEDIUM: applet: implement a run queue for active appctx The new function is called for each round of polling in order to call any active appctx. For now we pick the stream interface from the appctx's owner. At the moment there's no appctx queued yet, but we have everything needed to queue them and remove them.	2015-04-23 17:56:16 +02:00
Willy Tarreau	81f38d6f57	MEDIUM: applet: add basic support for an applet run queue This will be needed so that we can schedule applets out of the streams. For now nothing calls the queue yet.	2015-04-23 17:56:16 +02:00
Willy Tarreau	d45b9f8991	REORG: stream-int: create si_applet_ops dedicated to applets These functions are dedicated to applets so that we don't use the default ones anymore in this case.	2015-04-23 17:56:16 +02:00

1 2 3 4 5 ...

4729 Commits