haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-17 08:54:41 +00:00

Author	SHA1	Message	Date
Willy Tarreau	5ed1bbfc75	CLEANUP: session: move the stick counters declarations to stick_table.h They're really not appropriate in session.h as they always require a stick table, and I'm having a hard time finding them each time I need to.	2014-07-16 17:26:40 +02:00
Willy Tarreau	edee1d60b7	MEDIUM: stick-table: make it easier to register extra data types Some users want to add their own data types to stick tables. We don't want to use a linked list here for performance reasons, so we need to continue to use an indexed array. This patch allows one to reserve a compile-time-defined number of extra data types by setting the new macro STKTABLE_EXTRA_DATA_TYPES to anything greater than zero, keeping in mind that anything larger will slightly inflate the memory consumed by stick tables (not per entry though). Then calling stktable_register_data_store() with the new keyword will either register a new keyword or fail if the desired entry was already taken or the keyword already registered. Note that this patch does not dictate how the data will be used, it only offers the possibility to create new keywords and have an index to reference them in the config and in the tables. The caller will not be able to use stktable_data_cast() and will have to explicitly cast the stable pointers to the expected types. It can be used for experimentation as well.	2014-07-15 19:14:52 +02:00
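The registration scheme described in edee1d60b7 can be sketched roughly as below. This is only an illustration under stated assumptions: `register_data_store()`, `STD_DATA_TYPES` and the two built-in keywords are hypothetical stand-ins, and the real `stktable_register_data_store()` may take different arguments.

```c
#include <string.h>

#define STD_DATA_TYPES 2                 /* hypothetical number of built-in types */
#ifndef STKTABLE_EXTRA_DATA_TYPES
#define STKTABLE_EXTRA_DATA_TYPES 2      /* extra slots reserved at build time */
#endif
#define DATA_TYPES (STD_DATA_TYPES + STKTABLE_EXTRA_DATA_TYPES)

struct data_type {
    const char *name;                    /* config keyword, NULL while the slot is free */
};

/* Indexed array rather than a linked list: lookups by index stay O(1). */
static struct data_type data_types[DATA_TYPES] = {
    { "conn_cnt" },                      /* illustrative built-in keywords */
    { "bytes_in_cnt" },
};

/* Register keyword <name> at index <idx>. Returns <idx> on success, or -1
 * if the index is out of range, the slot is already taken, or the keyword
 * is already registered elsewhere. */
int register_data_store(int idx, const char *name)
{
    int i;

    if (idx < 0 || idx >= DATA_TYPES || data_types[idx].name)
        return -1;
    for (i = 0; i < DATA_TYPES; i++)
        if (data_types[i].name && strcmp(data_types[i].name, name) == 0)
            return -1;
    data_types[idx].name = name;
    return idx;
}
```

The array size is fixed at compile time, which is why a larger reservation costs a little memory per table but nothing per entry.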
Willy Tarreau	e12704bfc7	MINOR: session: export the function 'smp_fetch_sc_stkctr' This one is sometimes useful outside of this file.	2014-07-15 19:09:56 +02:00
Thierry FOURNIER	055b9d5c63	MINOR: http: export the function 'smp_fetch_base32' It's sometimes useful outside of proto_http.c.	2014-07-15 19:09:36 +02:00
Willy Tarreau	65d805fdfc	BUILD: fix dependencies between config and compat.h compat.h only depends on the system, and config needs compat, not the opposite. global.h was fixed to explicitly include standard.h for LONGBITS.	2014-07-15 19:09:36 +02:00
Willy Tarreau	bb2e669f9e	BUG/MAJOR: http: correctly rewind the request body after start of forwarding Daniel Dubovik reported an interesting bug showing that the request body processing was still not 100% fixed. If a POST request contained short enough data to be forwarded at once before trying to establish the connection to the server, we had no way to correctly rewind the body. The first visible case is that balancing on a header does not always work on such POST requests since the header cannot be found. But there are even nastier implications which are that http-send-name-header would apply to the wrong location and possibly even affect part of the request's body due to an incorrect rewinding. There are two options to fix the problem : - first one is to force the HTTP_MSG_F_WAIT_CONN flag on all hash-based balancing algorithms and http-send-name-header, but there's always a risk that any new algorithm forgets to set it ; - the second option is to account for the amount of skipped data before the connection establishes so that we always know the position of the request's body relative to the buffer's origin. The second option is much more reliable and fits very well in the spirit of the past changes to fix forwarding. Indeed, at the moment we have msg->sov which points to the start of the body before headers are forwarded and which equals zero afterwards (so it still points to the start of the body before forwarding data). A minor change consists in always making it point to the start of the body even after data have been forwarded. It means that it can get a negative value (so we need to change its type to signed). In order to avoid wrapping, we only do this as long as the other side of the buffer is not connected yet. Doing this definitely fixes the issues above for the requests. Since the response cannot be rewound we don't need to perform any change there. This bug was introduced/remained unfixed in 1.5-dev23 so the fix must be backported to 1.5.	2014-07-10 19:29:45 +02:00
Willy Tarreau	8fed9037cd	MEDIUM: stick-table: implement lookup from a sample fetch Currently we have stktable_fetch_key() which fetches a sample according to an expression and returns a stick table key, but we also need a function which does only the second half of it from a known sample. So let's cut the function in two and introduce smp_to_stkey() to perform this lookup. The first function was adapted to make use of it in order to avoid code duplication.	2014-07-10 16:43:44 +02:00
Dan Dubovik	bd57a9f977	BUG/MEDIUM: backend: Update hash to use unsigned int throughout When we were generating a hash, it was done using an unsigned long. When the hash was used to select a backend, it was sent as an unsigned int. This made it difficult to predict which backend would be selected. This patch updates get_hash, and the hash methods to use an unsigned int, to remain consistent throughout the codebase. This fix should be backported to 1.5 and probably in part to 1.4.	2014-07-08 22:00:21 +02:00
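To make the width mismatch from bd57a9f977 concrete, here is a small sketch. It is not the haproxy code path: `pick_wide()` and `pick_narrow()` are hypothetical helpers, and `uint64_t` stands in for an `unsigned long` on a 64-bit system.

```c
#include <stdint.h>

/* Server index computed from the full 64-bit hash. */
unsigned int pick_wide(uint64_t hash, unsigned int nbsrv)
{
    return (unsigned int)(hash % nbsrv);
}

/* Server index computed after the hash was narrowed to 32 bits, as
 * happens when a wide hash is passed through an unsigned int. */
unsigned int pick_narrow(uint64_t hash, unsigned int nbsrv)
{
    return (uint32_t)hash % nbsrv;
}
```

For a hash such as `0x100000001` and three servers, the two helpers return different indexes, which is the kind of unpredictability the commit removes by using a single type throughout.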
Willy Tarreau	fd0e008d9d	BUG/MEDIUM: unix: completely unbind abstract sockets during a pause() Abstract namespace sockets ignore the shutdown() call and do not make it possible to temporarily stop listening. The issue it causes is that during a soft reload, the new process cannot bind, complaining that the address is already in use. This change registers a new pause() function for unix sockets and completely unbinds the abstract ones since it's possible to rebind them later. It requires the two previous patches as well as preceding fixes. This fix should be backported into 1.5 since the issue appears there.	2014-07-08 01:13:35 +02:00
Willy Tarreau	092d865c53	MEDIUM: listener: implement a per-protocol pause() function In order to fix the abstract socket pause mechanism during soft restarts, we'll need to proceed differently depending on the socket protocol. The pause_listener() function already supports some protocol-specific handling for the TCP case. This commit makes this cleaner by adding a new ->pause() function to the protocol struct, which, if defined, may be used to pause a listener of a given protocol. For now, only TCP has been adapted, with the specific code moved from pause_listener() to tcp_pause_listener().	2014-07-08 01:13:34 +02:00
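The optional per-protocol callback from 092d865c53 might look roughly like this. The struct layouts, the `_sketch` names and the generic fallback are assumptions made for illustration; only the idea of an optional `->pause()` pointer comes from the commit message.

```c
#include <stddef.h>

struct listener_sketch;

struct protocol_sketch {
    const char *name;
    int (*pause)(struct listener_sketch *l);   /* optional, may be NULL */
};

struct listener_sketch {
    const struct protocol_sketch *proto;
    int paused;           /* set once the listener no longer accepts */
    int proto_specific;   /* set when a protocol handler did the work */
};

/* Protocol-specific handler, standing in for tcp_pause_listener(). */
static int tcp_pause_sketch(struct listener_sketch *l)
{
    l->proto_specific = 1;
    l->paused = 1;
    return 1;
}

const struct protocol_sketch proto_tcp_sketch   = { "tcp",   tcp_pause_sketch };
const struct protocol_sketch proto_other_sketch = { "other", NULL };

/* Use the protocol's own pause() when it is defined, otherwise fall
 * back to a generic path (assumed here to simply mark the listener). */
int pause_listener_sketch(struct listener_sketch *l)
{
    if (l->proto->pause)
        return l->proto->pause(l);
    l->paused = 1;
    return 1;
}
```

Keeping the pointer optional means protocols without special needs require no change.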
Willy Tarreau	18324f574f	MEDIUM: log: support a user-configurable max log line length With all the goodies supported by logformat, people find that the limit of 1024 chars for log lines is too short. Some servers do not support larger lines and can simply drop them, so changing the default value is not always the best choice. This patch takes a different approach. Log line length is specified per log server on the "log" line, with a value between 80 and 65535. That way it's possible to satisfy all needs, even with some fat local servers and small remote ones.	2014-06-27 18:13:53 +02:00
Willy Tarreau	4e957907aa	MINOR: log: make MAX_SYSLOG_LEN overridable at build time This value was set in log.h without any #ifndef around, so when one wanted to change it, a patch was needed. Let's move it to defaults.h with the usual #ifndef so that it's easier to change it.	2014-06-27 18:13:53 +02:00
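The "usual #ifndef" pattern mentioned in 4e957907aa looks like the following sketch. The default of 1024 matches the limit quoted in the log above; the `logline` buffer and `logline_size()` helper are hypothetical and only show the macro in use.

```c
/* defaults.h-style guard: keep a value supplied on the compiler command
 * line (for example -DMAX_SYSLOG_LEN=2048) and fall back to a default
 * only when nothing was provided. */
#ifndef MAX_SYSLOG_LEN
#define MAX_SYSLOG_LEN 1024
#endif

/* Hypothetical buffer sized from the possibly overridden limit. */
static char logline[MAX_SYSLOG_LEN + 1];

unsigned long logline_size(void)
{
    return (unsigned long)sizeof(logline);
}
```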
Willy Tarreau	b5975defba	MINOR: stick-table: make stktable_fetch_key() indicate why it failed stktable_fetch_key() does not indicate whether it returns NULL because the input sample was not found or because it's unstable. It causes trouble with track-sc* rules. Just like with sample_fetch_string(), we want it to be able to give more information to the caller about what it found. Thus, now we use the pointer to a sample passed by the caller, and fill it with the information we have about the sample. That way, even if we return NULL, the caller has the ability to check whether a sample was found and if it is still changing or not.	2014-06-25 17:17:53 +02:00
Emeric Brun	0abf836ecb	BUG/MINOR: ssl: Fix external function in order not to return a pointer on an internal trash buffer. 'ssl_sock_get_common_name' applied to a connection was also renamed 'ssl_sock_get_remote_common_name'. Currently, this function is only used with protocol PROXYv2 to retrieve the client certificate's common name. A further usage could be to retrieve the server certificate's common name on an outgoing connection.	2014-06-24 22:39:16 +02:00
Simon Horman	98637e5bff	MEDIUM: Add external check Add an external check which makes use of an external process to check the status of a server.	2014-06-20 07:10:07 +02:00
Emeric Brun	c8b27b6c68	MEDIUM: ssl: add 300s supported time skew on OCSP response update. OCSP_MAX_RESPONSE_TIME_SKEW can be set to a different value at compilation (default is 300 seconds).	2014-06-19 14:37:30 +02:00
Thierry FOURNIER	f4e6129e30	MINOR: missing regex.h include	2014-06-19 14:29:32 +02:00
Emeric Brun	4147b2ef10	MEDIUM: ssl: basic OCSP stapling support. The support is all based on static responses. This doesn't add any request / response logic to HAProxy, but allows a way to update information through the socket interface. Currently certificates specified using "crt" or "crt-list" on "bind" lines are loaded as PEM files. For each PEM file, haproxy checks for the presence of file at the same path suffixed by ".ocsp". If such file is found, support for the TLS Certificate Status Request extension (also known as "OCSP stapling") is automatically enabled. The content of this file is optional. If not empty, it must contain a valid OCSP Response in DER format. In order to be valid an OCSP Response must comply with the following rules: it has to indicate a good status, it has to be a single response for the certificate of the PEM file, and it has to be valid at the moment of addition. If these rules are not respected the OCSP Response is ignored and a warning is emitted. In order to identify which certificate an OCSP Response applies to, the issuer's certificate is necessary. If the issuer's certificate is not found in the PEM file, it will be loaded from a file at the same path as the PEM file suffixed by ".issuer" if it exists otherwise it will fail with an error. It is possible to update an OCSP Response from the unix socket using: set ssl ocsp-response <response> This command is used to update an OCSP Response for a certificate (see "crt" on "bind" lines). Same controls are performed as during the initial loading of the response. The <response> must be passed as a base64 encoded string of the DER encoded response from the OCSP server. Example: openssl ocsp -issuer issuer.pem -cert server.pem \ -host ocsp.issuer.com:80 -respout resp.der echo "set ssl ocsp-response $(base64 -w 10000 resp.der)" | \ socat stdio /var/run/haproxy.stat This feature is automatically enabled on openssl 0.9.8h and above. 
This work was performed jointly by Dirkjan Bussink of GitHub and Emeric Brun of HAProxy Technologies.	2014-06-18 18:28:56 +02:00
Thierry FOURNIER	26202760a4	MINOR: regex: Use native PCRE API. The pcreposix layer (in the pcre project) executes strlen to find the length of the string. When we are using the function "regex_exec*2", the length is used to add a final \0, and when pcreposix is executed a strlen is executed to compute the length. If we are using the native PCRE API, the length is provided as an argument, and these operations disappear. This is useful because PCRE regexes are more widely used than POSIX regexes.	2014-06-18 15:14:00 +02:00
Thierry FOURNIER	09af0d6d43	MEDIUM: regex: replace all standard regex function by own functions This patch removes all references to the standard regex functions in haproxy. The last remaining references are only in the regex.[ch] files. In the file src/checks.c, the original function uses a "pmatch" array. In fact this array is unused. This patch removes it.	2014-06-18 15:07:57 +02:00
Thierry FOURNIER	b8f980cc19	MINOR: regex: Create JIT compatible function that return match strings This patch renames "regex_exec" to "regex_exec2". It adds new "regex_exec", "regex_exec_match" and "regex_exec_match2" functions. These functions can match a regex and return an array containing the matching parts. Otherwise, these functions use the compiled method (JIT or PCRE or POSIX). JIT requires a subject with a length. PCREPOSIX and native POSIX regex require a null-terminated subject. The regex_exec* functions are split in two versions. The first version takes a null-terminated string, but it executes strlen() on the subject if it is compiled with JIT. The second version (terminated by "2") takes the subject and the length. This version adds a null character to the subject if it is compiled with PCREPOSIX or native POSIX functions. The documentation of posix regex and pcreposix says that the function returns 0 if the string matches, otherwise it returns REG_NOMATCH. The REG_NOMATCH macro takes the value 1 with posix regex and the value 17 with pcreposix. According to its documentation, the native pcre API (used with JIT) returns a negative number if there is no match, otherwise it returns 0 or a positive number. This patch also fixes the return codes of the regex_exec* functions. Now, these functions return true if the string matches, otherwise they return false.	2014-06-18 15:07:50 +02:00
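The return-code normalisation described in b8f980cc19 can be illustrated with the POSIX variant alone. This is a minimal sketch, not the haproxy wrapper: `regex_match_sketch()` is a hypothetical helper built on the standard `regcomp()`/`regexec()` calls, and it compiles the pattern on every call for brevity.

```c
#include <stddef.h>
#include <regex.h>

/* Returns 1 if <subject> matches <pattern>, 0 otherwise (including when
 * the pattern does not compile). regexec() itself returns 0 on a match
 * and REG_NOMATCH otherwise, so the result is mapped to a boolean. */
int regex_match_sketch(const char *pattern, const char *subject)
{
    regex_t re;
    int ret;

    if (regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB) != 0)
        return 0;
    ret = regexec(&re, subject, 0, NULL, 0);
    regfree(&re);
    return ret == 0;
}
```

Callers can then test the result directly as a truth value, whatever "no match" code the underlying library uses.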
Sasha Pachev	218f064f55	MEDIUM: http: add actions "replace-header" and "replace-values" in http-req/resp This patch adds two new actions to http-request and http-response rulesets : - replace-header : replace a whole header line, suited for headers which might contain commas - replace-value : replace a single header value, suited for headers defined as lists. The match consists in a regex, and the replacement string takes a log-format and supports back-references.	2014-06-17 18:34:32 +02:00
Willy Tarreau	4bfc580dd3	MEDIUM: session: maintain per-backend and per-server time statistics Using the last rate counters, we now compute the queue, connect, response and total times per server and per backend with a 95% accuracy over the last 1024 samples. The operation is cheap so we don't need to condition it.	2014-06-17 17:15:56 +02:00
Willy Tarreau	2438f2b984	MINOR: freq_ctr: introduce a new averaging method While the current functions report average event counts per period, we are also interested in average values per event. For this we use a different method. The principle is to rely on a long tail which sums the new value with a fraction of the previous value, resulting in a sliding window of infinite length depending on the precision we're interested in. The idea is that we always keep (N-1)/N of the sum and add the new sampled value. The sum over N values can be computed with a simple program for a constant value 1 at each iteration : $\sum_{x=1}^{N} \left(\frac{N-1}{N}\right)^x \approx N \cdot \frac{e-1}{e}$ Note: I'm not sure how to demonstrate this but at least this is easily verified with a simple program, the sum equals N * 0.632120 for any N moderately large (tens to hundreds). Inserting a constant sample value V here simply results in : $\mathrm{sum} = V \cdot N \cdot \frac{e-1}{e}$ But we don't want to integrate over a small period, but infinitely. Let's cut the infinity in P periods of N values. Each period M is exactly the same as period M-1 with a factor of ((N-1)/N)^N applied. A test shows that given a large N : $\left(\frac{N-1}{N}\right)^N \approx \frac{1}{e}$ Our sum is now a sum of each factor times : $\sum_{x=1}^{NP} V \left(\frac{N-1}{N}\right)^x \approx V N \cdot \frac{e-1}{e} \cdot \sum_{x=0}^{P} \frac{1}{e^x}$ For P "large enough", in tests we get this : $\sum_{x=0}^{P} \frac{1}{e^x} \approx \frac{e}{e-1}$ This simplifies the sum above : $\sum_{x=1}^{NP} V \left(\frac{N-1}{N}\right)^x = V N$ So basically by summing values and applying the last result an (N-1)/N factor we just get N times the values over the long term, so we can recover the constant value V by dividing by N. A value added at the entry of the sliding window of N values will thus be reduced to 1/e or 36.7% after N terms have been added. After a second batch, it will only be 1/e^2, or 13.5%, and so on. 
So practically speaking, each old period of N values represents only a quickly fading ratio of the global sum : period ratio 1 36.7% 2 13.5% 3 4.98% 4 1.83% 5 0.67% 6 0.25% 7 0.09% 8 0.033% 9 0.012% 10 0.0045% So after 10N samples, the initial value has already faded out by a factor of 22026, which is quite fast. If the sliding window is 1024 samples wide, it means that a sample will only count for 1/22k of its initial value after 10k samples went after it, which results in half of the value it would represent using an arithmetic mean. The benefit of this method is that it's very cheap in terms of computations when N is a power of two. This is very well suited to record response times as large values will fade out faster than with an arithmetic mean and will depend on sample count and not time. Demonstrating all the above assumptions with maths instead of a program is left as an exercise for the reader.	2014-06-17 17:15:51 +02:00
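The averaging method from 2438f2b984 can be tried out with a few lines of integer arithmetic. The sketch below assumes a window of N = 1024 samples (the width quoted in the message) and uses hypothetical names (`avg_add()`, `avg_get()`, `avg_feed()`); haproxy's own helpers may round differently.

```c
#define WIN_SHIFT 10                     /* N = 2^10 = 1024 samples */
#define WIN_SIZE  (1U << WIN_SHIFT)

/* Keep (N-1)/N of the running sum and add the new sample <v>. With N a
 * power of two the division is a simple shift. */
unsigned int avg_add(unsigned int *sum, unsigned int v)
{
    *sum = *sum - (*sum >> WIN_SHIFT) + v;
    return *sum;
}

/* The sum converges to N times the sampled value, so divide by N. */
unsigned int avg_get(unsigned int sum)
{
    return sum >> WIN_SHIFT;
}

/* Feed <count> identical samples and return the resulting average. */
unsigned int avg_feed(unsigned int *sum, unsigned int v, unsigned int count)
{
    while (count--)
        avg_add(sum, v);
    return avg_get(*sum);
}
```

Feeding a constant value long enough makes `avg_get()` return that value, and feeding N zeroes afterwards leaves roughly 1/e of it, in line with the fading ratios listed in the message.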
Willy Tarreau	588297f2f9	MINOR: tools: add new functions to quote-encode strings qstr() and cstr() will be used to quote-encode strings. The first one does it unconditionally. The second one is aimed at CSV files where the quote-encoding is only needed when the field contains a quote or a comma.	2014-06-16 18:20:14 +02:00
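The conditional CSV quoting described for cstr() in 588297f2f9 might be sketched as follows. `csv_field_sketch()` is a hypothetical helper, not the haproxy function, and doubling embedded quotes is an assumption taken from the common CSV convention rather than from the commit message.

```c
#include <stddef.h>
#include <string.h>

/* Copy <src> into <dst> (capacity <size>, including the trailing NUL),
 * wrapping it in double quotes only when it contains a quote or a comma.
 * Embedded quotes are doubled (assumed convention). Returns <dst>, or
 * NULL when the buffer is too small. */
char *csv_field_sketch(char *dst, size_t size, const char *src)
{
    size_t o = 0;
    int quote = strpbrk(src, "\",") != NULL;

    if (size == 0)
        return NULL;
    if (quote) {
        if (o + 1 >= size)
            return NULL;
        dst[o++] = '"';
    }
    for (; *src; src++) {
        if (*src == '"') {
            if (o + 1 >= size)
                return NULL;
            dst[o++] = '"';              /* double the embedded quote */
        }
        if (o + 1 >= size)
            return NULL;
        dst[o++] = *src;
    }
    if (quote) {
        if (o + 1 >= size)
            return NULL;
        dst[o++] = '"';
    }
    dst[o] = '\0';
    return dst;
}
```

An unconditional variant in the spirit of qstr() would simply skip the `strpbrk()` test and always quote.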
Simon Horman	75ab8bdb83	MEDIUM: Add port_to_str helper This helper is similar to addr_to_str but tries to convert the port rather than the address of a struct sockaddr_storage. This is in preparation for supporting an external agent check. Signed-off-by: Simon Horman <horms@verge.net.au>	2014-06-16 10:10:33 +02:00
Willy Tarreau	8fccfa256e	CLEANUP: connection: merge proxy proto v2 header and address block This is in order to simplify the PPv2 header parsing code to look more like the one provided as an example in the spec. No code change was performed beyond just merging the proxy_addr union into the proxy_hdr_v2 struct.	2014-06-14 11:46:02 +02:00
Willy Tarreau	3a4ac422ce	MINOR: tcp: prepare support for the "capture" action A few minor entries will be needed to capture sample fetches in requests or responses. This patch just prepares the code for this.	2014-06-13 16:32:48 +02:00
Willy Tarreau	54da8db40b	MINOR: capture: extend the captures to support non-header keys This patch adds support for captures with no header name. The purpose is to allow extra captures to be defined and logged along with the header captures.	2014-06-13 16:32:48 +02:00
Remi Gacogne	f46cd6e4ec	MEDIUM: ssl: Add the option to use standardized DH parameters >= 1024 bits When no static DH parameters are specified, this patch makes haproxy use standardized (rfc 2409 / rfc 3526) DH parameters with prime lengths of 1024, 2048, 4096 or 8192 bits for DHE key exchange. The size of the temporary/ephemeral DH key is computed as the minimum of the RSA/DSA server key size and the value of a new option named tune.ssl.default-dh-param.	2014-06-12 16:12:23 +02:00
Willy Tarreau	c874653bb4	BUILD: don't use type "uint" which is not portable Dmitry Sivachenko reported that "uint" doesn't build on FreeBSD 10. On Linux it's defined in sys/types.h and indicated as "old". Just get rid of the very few occurrences.	2014-05-28 23:05:07 +02:00
Willy Tarreau	ce3f913e48	MINOR: stats: add counters for SSL cache lookups and misses One important aspect of SSL performance tuning is the cache size, but there's no metric to know whether it's large enough or not. This commit introduces two counters, one for the cache lookups and another one for cache misses. These counters are reported on "show info" on the stats socket. This way, it suffices to see the cache misses counter constantly grow to know that a larger cache could possibly help.	2014-05-28 16:53:04 +02:00
Willy Tarreau	0c9c2720dc	MINOR: stats: report SSL key computations per second It's commonly needed to know how many SSL asymmetric keys are computed per second on either side (frontend or backend), and to know the SSL session reuse ratio. Now we compute these values and report them in "show info".	2014-05-28 12:28:58 +02:00
Sasha Pachev	c600204ddf	BUG/MEDIUM: regex: fix risk of buffer overrun in exp_replace() Currently exp_replace() (which is used in reqrep/reqirep) is vulnerable to a buffer overrun. I have been able to reproduce it using the attached configuration file and issuing the following command: wget -O - -S -q http://localhost:8000/`perl -e 'print "a"x4000'`/cookie.php Str was being checked only in while (str) and it was possible to read past that when more than one character was being accessed in the loop. WT: Note that this bug is only marked MEDIUM because configurations capable of triggering this bug are very unlikely to exist at all due to the fact that most rewrites consist in static string additions that largely fit into the reserved area (8kB by default). This fix should also be backported to 1.4 and possibly even 1.3 since it seems to have been present since 1.1 or so. Config: ------- global maxconn 500 stats socket /tmp/haproxy.sock mode 600 defaults timeout client 1000 timeout connect 5000 timeout server 5000 retries 1 option redispatch listen stats bind :8080 mode http stats enable stats uri /stats stats show-legends listen tcp_1 bind :8000 mode http maxconn 400 balance roundrobin reqrep ^([^\ :])\ /(.)/(.)\.php(.) \1\ /\3.php?arg=\2\2\2\2\2\2\2\2\2\2\2\2\2\4 server srv1 127.0.0.1:9000 check port 9000 inter 1000 fall 1 server srv2 127.0.0.1:9001 check port 9001 inter 1000 fall 1	2014-05-27 14:36:06 +02:00
Willy Tarreau	23964187ae	MINOR: checks: support a neutral check result Agent will have the ability to return a weight without indicating an up/down status. Currently this is not possible, so let's add a 5th result CHK_RES_NEUTRAL for this purpose. It has been mapped to the unused HCHK_STATUS_CHECKED which already serves as a neutral delimiter between initiated checks and those returning a result.	2014-05-23 15:42:49 +02:00
Willy Tarreau	ed7df90068	MEDIUM: stats: introduce new actions to simplify admin status management Instead of enabling/disabling maintenance mode and drain mode separately using 4 actions, we now offer 3 simplified actions : - set state to READY - set state to DRAIN - set state to MAINT They have the benefit of reporting the same state as displayed on the page, and of doing the double-switch atomically eg when switching from drain to maint. Note that the old actions are still supported for users running scripts.	2014-05-23 14:29:11 +02:00
Willy Tarreau	bfc7b7acd8	MAJOR: checks: add support for a new "drain" administrative mode This patch adds support for a new "drain" mode. So now we have 3 admin modes for a server : - READY - DRAIN - MAINT The drain mode disables load balancing but leaves the server up. It can coexist with maint, except that maint has precedence. It is also inherited from tracked servers, so just like maint, it's represented with 2 bits. New functions were designed to set/clear each flag and to propagate the changes to tracking servers when relevant, and to log the changes. Existing functions srv_set_adm_maint() and srv_set_adm_ready() were replaced to make use of the new functions. Currently the drain mode is not yet used, however the whole logic was tested with all combinations of set/clear of both flags in various orders to catch all corner cases.	2014-05-23 14:29:11 +02:00
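The "2 bits per mode, maint has precedence" rule from bfc7b7acd8 can be sketched as below. The flag values, the drain flag names and `adm_effective_mode()` are assumptions for illustration; only the forced/inherited split and the precedence come from the log.

```c
/* One forced bit and one inherited bit per administrative mode.
 * Names follow the ADMF_* style seen in the log; values are assumed. */
enum adm_flag {
    ADMF_FMAINT = 0x1,   /* maintenance forced on this server */
    ADMF_IMAINT = 0x2,   /* maintenance inherited from a tracked server */
    ADMF_MAINT  = 0x3,   /* any source of maintenance */
    ADMF_FDRAIN = 0x4,   /* drain forced on this server (assumed name) */
    ADMF_IDRAIN = 0x8,   /* drain inherited (assumed name) */
    ADMF_DRAIN  = 0xC    /* any source of drain */
};

enum adm_mode { ADM_READY, ADM_DRAIN, ADM_MAINT };

/* Drain and maint may coexist in the bitfield; maint wins. */
enum adm_mode adm_effective_mode(unsigned int admin)
{
    if (admin & ADMF_MAINT)
        return ADM_MAINT;
    if (admin & ADMF_DRAIN)
        return ADM_DRAIN;
    return ADM_READY;
}
```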
Willy Tarreau	8eb7784634	MINOR: server: implement srv_set_stopping() This function was taken from check_set_server_drain(). It does not consider health checks at all and only sets a server to stopping provided it's not in maintenance and is not currently stopped. The resulting state will be STOPPING. The state change is propagated to tracked servers. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	dbd5e78f5b	MINOR: server: implement srv_set_running() This function was taken from check_set_server_up(). It does not consider health checks at all and only sets a server up provided it's not in maintenance. The resulting state may be either RUNNING or STARTING depending on the presence of a slowstart or not. The state change is propagated to tracked servers. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	e7d1ef16bf	MINOR: server: implement srv_set_stopped() This function was extracted from check_set_server_down(). It only manipulates the server state and does not consider the health checks at all, nor does it modify their status. It takes a reason message to report in logs, however it passes NULL when recursing through the trackers chain. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	bda92271e6	MINOR: server: make the status reporting function support a reason srv_adm_append_status() was renamed srv_append_status() since it's no more dedicated to maintenance mode. It now supports a reason which if not null is appended to the output string.	2014-05-23 14:29:11 +02:00
Willy Tarreau	af54958d72	MEDIUM: checks: simplify server up/down/nolb transitions We don't have to handle the maintenance transition here anymore so we can simplify the functions and conditions. This also means that we don't need the disable/enable functions but only a function to switch to each new state. It's worth mentioning that at this stage there are still confusions between the server state and the checks states. For example, the health check's state is adjusted from tracked servers changing state, while it should not be.	2014-05-23 14:29:11 +02:00
Willy Tarreau	a0066ddbda	MEDIUM: server: properly support and propagate the maintenance status This change now involves a new flag SRV_ADMF_IMAINT to note that the maintenance status of a server is inherited from another server. Thus, we know at each server level in the chain if it's running, in forced maintenance or in a maintenance status because it tracks another server, or even in both states. Disabling a server propagates this flag down to other servers. Enabling a server flushes the flag down. A server becomes up again once both of its flags are cleared. Two new functions "srv_adm_set_maint()" and "srv_adm_set_ready()" are used to manipulate this maintenance status. They're used by the CLI and the stats page. Now the stats page always says "MAINT" instead of "MAINT(via)" and it's only the chk/down field which reports "via x/y" when the status is inherited from another server, but it doesn't say it when a server was forced into maintenance. The CSV output indicates "MAINT (via x/y)" instead of only "MAINT(via)". This is the most accurate representation. One important thing is that now entering/leaving maintenance for a tracking server correctly follows the state of the tracked server.	2014-05-22 11:27:00 +02:00
Willy Tarreau	4aac7db940	REORG: checks: put the functions in the appropriate files ! Checks.c has become a total mess. A number of proxy or server maintenance and queue management functions were put there probably because they were used there, but that makes the code untouchable. And that's without saying that their names do not always relate to what they really do! So let's do a first pass by moving these ones : - set_backend_down() => backend.c - redistribute_pending() => queue.c:pendconn_redistribute() - check_for_pending() => queue.c:pendconn_grab_from_px() - shutdown_sessions => server.c:srv_shutdown_sessions() - shutdown_backup_sessions => server.c:srv_shutdown_backup_sessions() All of them were moved at once.	2014-05-22 11:27:00 +02:00
Willy Tarreau	892337c8e1	MAJOR: server: use states instead of flags to store the server state Servers used to have 3 flags to store a state, now they have 4 states instead. This avoids lots of confusion for the 4 remaining undefined states. The encoding from the previous to the new states can be represented this way, with the bits listed in the order SRV_STF_RUNNING, SRV_STF_GOINGDOWN, SRV_STF_WARMINGUP and "x" meaning either value : 0 x x => SRV_ST_STOPPED ; 1 0 0 => SRV_ST_RUNNING ; 1 0 1 => SRV_ST_STARTING ; 1 1 x => SRV_ST_STOPPING. Note that the case where all bits were set used to exist and was randomly dealt with. For example, the task was not stopped, the throttle value was still updated and reported in the stats and in the http_server_state header. It was the same if the server was stopped by the agent or for maintenance. It's worth noting that the internal function names are still quite confusing.	2014-05-22 11:27:00 +02:00
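The flag-to-state encoding listed in 892337c8e1 can be written as a small conversion function. The flag and state names come from the commit message; the numeric values and the helper `srv_state_from_flags()` are assumptions made for this sketch.

```c
/* Old-style flags (bit values assumed for illustration). */
enum {
    SRV_STF_RUNNING   = 0x1,
    SRV_STF_GOINGDOWN = 0x2,
    SRV_STF_WARMINGUP = 0x4
};

/* New-style states (ordering assumed). */
enum srv_state {
    SRV_ST_STOPPED,
    SRV_ST_STARTING,
    SRV_ST_RUNNING,
    SRV_ST_STOPPING
};

/* Map the 8 possible flag combinations onto the 4 states:
 *   0 x x => STOPPED, 1 1 x => STOPPING, 1 0 1 => STARTING, 1 0 0 => RUNNING */
enum srv_state srv_state_from_flags(unsigned int f)
{
    if (!(f & SRV_STF_RUNNING))
        return SRV_ST_STOPPED;
    if (f & SRV_STF_GOINGDOWN)
        return SRV_ST_STOPPING;
    if (f & SRV_STF_WARMINGUP)
        return SRV_ST_STARTING;
    return SRV_ST_RUNNING;
}
```

With an enum, the previously ambiguous combinations (such as all three bits set) can no longer be represented at all.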
Willy Tarreau	2012521d7b	REORG/MEDIUM: server: move the maintenance bits out of the server state Now we introduce srv->admin and srv->prev_admin which are bitfields containing one bit per source of administrative status (maintenance only for now). For the sake of backwards compatibility we implement a single source (ADMF_FMAINT) but the code already checks any source (ADMF_MAINT) where the STF_MAINTAIN bit was previously checked. This will later allow us to add ADMF_IMAINT for maintenance mode inherited from tracked servers. Along doing these changes, it appeared that some places will need to be revisited when implementing the inherited bit, this concerns all those modifying the ADMF_FMAINT bit (enable/disable actions on the CLI or stats page), and the checks to report "via" on the stats page. But currently the code is harmless.	2014-05-22 11:27:00 +02:00
Willy Tarreau	c93cd16b6c	REORG/MEDIUM: server: split server state and flags in two different variables Till now, the server's state and flags were all saved as a single bit field. It causes some difficulties because we'd like to have an enum for the state and separate flags. This commit starts by splitting them in two distinct fields. The first one is srv->state (with its counter-part srv->prev_state) which are now enums, but which still contain bits (SRV_STF_*). The flags now lie in their own field (srv->flags). The function srv_is_usable() was updated to use the enum as input, since it already used to deal only with the state. Note that currently, the maintenance mode is still in the state for simplicity, but it must move as well.	2014-05-22 11:27:00 +02:00
Conrad Hoffmann	041751c13a	BUG/MEDIUM: polling: fix possible CPU hogging of worker processes after receiving SIGUSR1. When run in daemon mode (i.e. with at least one forked process) and using the epoll poller, sending USR1 (graceful shutdown) to the worker processes can cause some workers to start running at 100% CPU. Precondition is having an established HTTP keep-alive connection when the signal is received. The cloned (during fork) listening sockets do not get closed in the parent process, thus they do not get removed from the epoll set automatically (see man 7 epoll). This can lead to the process receiving epoll events that it doesn't feel responsible for, resulting in an endless loop around epoll_wait() delivering these events. The solution is to explicitly remove these file descriptors from the epoll set. To not degrade performance, care was taken to only do this when necessary, i.e. when the file descriptor was cloned during fork. Signed-off-by: Conrad Hoffmann <conrad@soundcloud.com> [wt: a backport to 1.4 could be studied though chances to catch the bug are low]	2014-05-20 14:57:36 +02:00
Willy Tarreau	87eb1d6994	MINOR: server: create srv_was_usable() from srv_is_usable() and use a pointer We used to call srv_is_usable() with either the current state and weights or the previous ones. This causes trouble for future changes, so let's first split it in two variants : - srv_is_usable(srv) considers the current status - srv_was_usable(srv) considers the previous status	2014-05-13 22:34:55 +02:00
Willy Tarreau	c5150dafd8	MINOR: server: use functions to detect state changes and to update them Detecting that a server's status has changed is a bit messy, as is committing the status changes. We'll have to add new conditions soon and we'd better avoid multiplying the number of touched locations with the high risk of forgetting them. This commit introduces : - srv_lb_status_changed() to report if the status changed from the previously committed one ; - srv_lb_commit_status() to commit the current status. These functions are now used by all load-balancing algorithms.	2014-05-13 22:18:22 +02:00
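The detect-then-commit pair introduced in c5150dafd8 could look roughly like this. The struct and the choice of compared fields (state and effective weight) are assumptions based on the neighbouring log entries, and the `_sketch`-style names are hypothetical.

```c
/* Minimal stand-in for the parts of a server that LB algorithms watch. */
struct srv_sketch {
    int state, prev_state;               /* current / last committed state */
    unsigned int eweight, prev_eweight;  /* current / last committed weight */
};

/* Non-zero if anything relevant differs from the committed snapshot. */
int lb_status_changed(const struct srv_sketch *s)
{
    return s->state != s->prev_state || s->eweight != s->prev_eweight;
}

/* Record the current values as the new committed snapshot. */
void lb_commit_status(struct srv_sketch *s)
{
    s->prev_state = s->state;
    s->prev_eweight = s->eweight;
}
```

Centralising both operations means a new condition only has to be added in one place instead of in every load-balancing algorithm.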
Willy Tarreau	02615f9b16	MINOR: server: remove the SRV_DRAIN flag which can always be deduced This flag is only a copy of (srv->uweight == 0), so better get rid of it to reduce some of the confusion that remains in the code, and use a simple function to return this state based on this weight instead.	2014-05-13 22:18:13 +02:00
Willy Tarreau	05cdd9655d	MEDIUM: session: implement half-closed timeouts (client-fin and server-fin) Long-lived sessions are often subject to half-closed sessions resulting in a lot of sessions appearing in FIN_WAIT state in the system tables, and no way for haproxy to get rid of them. This typically happens because clients suddenly disconnect without sending any packet (eg: FIN or RST was lost in the path), and while the server detects this using an applicative heart beat, haproxy does not close the connection. This patch adds two new timeouts : "timeout client-fin" and "timeout server-fin". The former allows one to override the client-facing timeout when a FIN has been received or sent. The latter does the same for server-facing connections, which is less useful.	2014-05-10 15:14:05 +02:00
Willy Tarreau	6346f0a534	DOC: stop referencing the slow git repository in the README git.1wt.eu is painfully slow and some people experience issues with it. Better hide it and only advertise git.haproxy.org which is mirrored on a faster server. Also replace haproxy.1wt.eu with www.haproxy.org in the download URL which appears in the stats page.	2014-05-10 11:04:39 +02:00
Willy Tarreau	b1dd9bf308	MEDIUM: pattern: use ebtree's longest match to index/lookup string beginning Being able to map prefixes to values is already used for IPv4/IPv6 but was not yet used with strings. It can be very convenient to map directories to server farms but large lists may be slow. By using ebmb_insert_prefix() and ebmb_lookup_longest(), we can insert strings with their own length as a prefix, and lookup candidate strings and ensure that the longest matching one will be returned, which is the longest string matching the entry.	2014-05-10 08:53:48 +02:00
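The lookup semantics described in the b1dd9bf308 entry (among all stored strings that are a prefix of the key, return the longest one) can be illustrated with a plain linear scan. This sketch deliberately does not use ebtree; the type `prefix_map` and the function `lookup_longest` are hypothetical names, and a tree-based index would return the same answer without visiting every entry.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical mapping of a string prefix to a value (e.g. a server farm) */
struct prefix_map {
    const char *prefix;
    const char *value;
};

/* Returns the value attached to the longest stored prefix of <key>,
 * or NULL when no stored prefix matches. Linear scan, O(n) per lookup.
 */
const char *lookup_longest(const struct prefix_map *map, size_t n, const char *key)
{
    const char *best = NULL;
    size_t best_len = 0;
    size_t i;

    assert(map != NULL && key != NULL);

    for (i = 0; i < n; i++) {
        size_t len = strlen(map[i].prefix);

        /* the entry matches if its whole string begins the key */
        if (strncmp(key, map[i].prefix, len) == 0 &&
            (best == NULL || len > best_len)) {
            best = map[i].value;
            best_len = len;
        }
    }
    return best;
}
```

With entries for "/", "/img" and "/img/thumbs", a key such as "/img/logo.png" resolves to the "/img" entry because it is the longest stored string that begins the key.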
Willy Tarreau	18ca2d48bf	MINOR: tools: split is_addr() and is_inet_addr() The is_addr() function indicates if an address is set and is an IPv4 or IPv6 address. Let's rename it is_inet_addr() and make is_addr() also accept AF_UNIX addresses.	2014-05-10 01:26:37 +02:00
Willy Tarreau	6ae1ba6f29	MEDIUM: listener: parse the new "process" bind keyword This sets the bind_proc entry in the bind_conf config block. For now it's still unused, but the doc was updated.	2014-05-09 19:16:26 +02:00
Willy Tarreau	a9db57ec5c	MEDIUM: config: limit nbproc to the machine's word size Some consistency checks cannot be performed between frontends, backends and peers at the moment because there is no way to check for intersection between processes bound to some processes when the number of processes is higher than the number of bits in a word. So first, let's limit the number of processes to the machine's word size. This means nbproc will be limited to 32 on 32-bit machines and 64 on 64-bit machines. This is far more than enough considering that configs rarely go above 16 processes due to scalability and management issues, so 32 or 64 should be fine. This way we'll ensure we can always build a mask of all the processes a section is bound to.	2014-05-09 19:16:26 +02:00
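The reasoning in the a9db57ec5c entry (one bit per process, so the process count cannot exceed the word size) can be sketched as below. All names here (`MAX_PROCS`, `proc_bit`, `proc_range_mask`, `procs_intersect`) are hypothetical and only illustrate the idea; they are not taken from the haproxy sources.

```c
#include <assert.h>
#include <limits.h>

/* One bit per process: 32 on 32-bit machines, 64 on 64-bit machines */
#define MAX_PROCS ((unsigned int)(sizeof(unsigned long) * CHAR_BIT))

/* Bit for process number <proc>, numbered from 1 */
unsigned long proc_bit(unsigned int proc)
{
    assert(proc >= 1 && proc <= MAX_PROCS);
    return 1UL << (proc - 1);
}

/* Mask of processes <first>..<last> (inclusive), or 0 if the range is invalid */
unsigned long proc_range_mask(unsigned int first, unsigned int last)
{
    unsigned long mask = 0;
    unsigned int p;

    if (first < 1 || last < first || last > MAX_PROCS)
        return 0;

    for (p = first; p <= last; p++)
        mask |= proc_bit(p);
    return mask;
}

/* Non-zero if two sections are bound to at least one common process */
int procs_intersect(unsigned long a, unsigned long b)
{
    return (a & b) != 0;
}
```

Once every section carries such a mask, checking that a frontend and a backend share a process is a single AND, which is the kind of consistency check the entry says was previously impossible.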
Emeric Brun	8dc6039807	MINOR: ssl: add global statement tune.ssl.force-private-cache. Boolean: used to force a private ssl session cache for each process in case of nbproc > 1.	2014-05-09 19:16:13 +02:00
David S	afb768340c	MEDIUM: connection: Implement and extend PROXY Protocol V2 This commit modifies the PROXY protocol V2 specification to support headers longer than 255 bytes allowing for optional extensions. It implements the PROXY protocol V2 which is a binary representation of V1. This will make parsing more efficient for clients who will know in advance exactly how many bytes to read. Also, it defines and implements some optional PROXY protocol V2 extensions to send information about downstream SSL/TLS connections. Support for PROXY protocol V1 remains unchanged.	2014-05-09 08:25:38 +02:00
Emeric Brun	caa19cc867	BUG/MAJOR: ssl: Fallback to private session cache if current lock mode is not supported. Process shared mutex seems not supported on some OSs (FreeBSD). This patch checks errors on mutex lock init to fallback on a private session cache (per process cache) in error cases.	2014-05-08 22:46:32 +02:00
Willy Tarreau	cefad67689	BUILD: syscalls: remove improper inline statement in front of syscalls Trying to build with an old gcc and glibc revealed that we must not state "inline" in our _syscall* definitions since it's already present in the declaration making use of the _syscall* macros.	2014-05-08 22:38:02 +02:00
William Lallemand	2a83111cee	MINOR: http: generic pointer to rule argument Add a void *data which can be used as a generic storage for rule arguments.	2014-05-07 16:50:33 +02:00
William Lallemand	07c8b24edb	MINOR: http: export the smp_fetch_cookie function Remove the static attribute of smp_fetch_cookie, and declare the function in proto/proto_http.h for future use.	2014-05-02 18:05:15 +02:00
Willy Tarreau	644c101e2d	BUG/MAJOR: http: connection setup may stall on balance url_param On the mailing list, seri0528@naver.com reported an issue when using balance url_param or balance uri. The request would sometimes stall forever. Cyril Bonté managed to reproduce it with the configuration below : listen test :80 mode http balance url_param q hash-type consistent server s demo.1wt.eu:80 and found it appeared with this commit : `80a92c0` ("BUG/MEDIUM: http: don't start to forward request data before the connect"). The bug is subtle but real. The problem is that the HTTP request forwarding analyzer refrains from starting to parse the request body when some LB algorithms might need the body contents, in order to preserve the data pointer and avoid moving things around during analysis in case a redispatch is later needed. And in order to detect that the connection establishes, it watches the response channel's CF_READ_ATTACHED flag. The problem is that a request analyzer is not subscribed to a response channel, so it will only see changes when woken for other (generally correlated) reasons, such as the fact that part of the request could be sent. And since the CF_READ_ATTACHED flag is cleared once leaving process_session(), it is important not to miss it. It simply happens that sometimes the server starts to respond in a sequence that validates the connection in the middle of process_session(), that it is detected after the analysers, and that the newly assigned CF_READ_ATTACHED is not used to detect that the request analysers need to be called again, then the flag is lost. The CF_WAKE_WRITE flag doesn't work either because it's cleared upon entry into process_session(), ie if we spend more than one call not connecting. Thus we need a new flag to tell the connection initiator that we are specifically interested in being notified about connection establishment. This new flag is CF_WAKE_CONNECT. 
It is set by the requester, and is cleared once the connection succeeds, where CF_WAKE_ONCE is set instead, causing the request analysers to be scanned again. For future versions, some better options will have to be considered : - let all analysers subscribe to both request and response events ; - let analysers subscribe to stream interface events (reduces number of useless calls) - change CF_WAKE_WRITE's semantics to persist across calls to process_session(), but that is different from validating a connection establishment (eg: no data sent, or no data to send) The bug was introduced in 1.5-dev23, no backport is needed.	2014-04-30 20:02:02 +02:00
Willy Tarreau	ed44649eb7	MEDIUM: config: warn that '{cli,con,srv}timeout' are deprecated It's been like this since version 1.3 in 2007. It's time to clean up configurations. The warning explains what to use depending on the timeout name.	2014-04-29 01:09:56 +02:00
Willy Tarreau	a3c504c032	MEDIUM: config: inform the user only once that "redispatch" is deprecated It may go away in 1.6, but there's no point reporting it for each and every occurrence.	2014-04-29 01:09:40 +02:00
Willy Tarreau	40bac83734	MEDIUM: config: inform the user that "reqsetbe" is deprecated It will go away in 1.6.	2014-04-29 00:46:01 +02:00
Willy Tarreau	de9d2d7b86	MEDIUM: config: inform the user about the deprecatedness of "block" rules It's just a warning emitted once.	2014-04-29 00:46:01 +02:00
Willy Tarreau	ff05550b5d	MINOR: config: add minimum support for emitting warnings only once This is useful to explain to users what to do during a migration.	2014-04-29 00:46:01 +02:00
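One common way to emit a warning only once, as the ff05550b5d entry describes, is to keep a bit field of warnings already reported. The sketch below assumes that approach; the flag names and the `warn_once` function are hypothetical and are not the identifiers used by the commit.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical warning identifiers, one bit each */
#define WARN_BLOCK_DEPRECATED     0x01u
#define WARN_REQSETBE_DEPRECATED  0x02u
#define WARN_REDISPATCH_DEPRECATED 0x04u

/* Bits of the warnings that were already emitted */
static unsigned int warned_mask;

/* Prints <msg> the first time <flag> is seen and remembers it.
 * Returns 1 if the warning was emitted, 0 if it was suppressed.
 */
int warn_once(unsigned int flag, const char *msg)
{
    assert(msg != NULL);

    if (warned_mask & flag)
        return 0;

    warned_mask |= flag;
    fprintf(stderr, "[WARNING] %s\n", msg);
    return 1;
}
```

A configuration with hundreds of deprecated directives then produces a single line per kind of deprecation rather than one per occurrence.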
Willy Tarreau	0b7483385e	MEDIUM: http: make http-request rules processing return a verdict instead of a rule Till now we used to return a pointer to a rule, but that makes it complicated to later add support for registering new actions which may fail. For example, the redirect may fail if the response is too large to fit into the buffer. So instead let's return a verdict. But we needed the pointer to the last rule to get the address of a redirect and to get the realm used by the auth page. So these pieces of code have moved into the function and they produce a verdict.	2014-04-29 00:46:01 +02:00
Willy Tarreau	353bc9f43f	CLEANUP: proxy: rename "block_cond" to "block_rules" Next patch will make them real rules, not only conditions. This separate patch makes the next one more readable.	2014-04-28 22:05:31 +02:00
Thierry FOURNIER	e47e4e2385	BUG/MEDIUM: patterns: last fix was still not enough Last fix did address the issue for inlined patterns, but it was not enough because the flags are lost as well when updating patterns dynamically over the CLI. Also if the same file was used once with -i and another time without -i, their references would have been merged and both would have used the same matching method. It appears that the patterns have two types of flags. The first ones are relative to the pattern matching, and the second are relative to the pattern storage. The pattern matching flags are the same for all the patterns of one expression. Now they are stored in the expression. The storage flags are information returned by the pattern matching function. This information is relative to each entry and is stored in the "struct pattern". Now, the expression matching flags are forwarded to the parse and index functions. These flags are stored during the configuration parsing, and they are used during the parse and index actions. This issue was introduced in dev23 with the major pattern rework, and is a continuation of commit `a631fc8` ("BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns"). No backport is needed.	2014-04-28 14:19:17 +02:00
Willy Tarreau	a631fc8de8	BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns These flags are only passed to pattern_read_from_file() which loads the patterns from a file. The functions used to parse the patterns from the current line do not provide the means to pass the pattern flags so they're lost. This issue was introduced in dev23 with the major pattern rework, and was reported by Graham Morley. No backport is needed.	2014-04-27 09:21:08 +02:00
Willy Tarreau	f51c6989b0	MEDIUM: ssl: implement a workaround for the OpenSSL heartbleed attack Using the previous callback, it's trivial to block the heartbeat attack, first we control the message length, then we emit an SSL error if it is out of bounds. A special log is emitted, indicating that a heartbleed attack was stopped so that they are not confused with other failures. That way, haproxy can protect itself even when running on an unpatched SSL stack. Tests performed with openssl-1.0.1c indicate a total success.	2014-04-25 20:06:33 +02:00
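The length control mentioned in the f51c6989b0 entry can be illustrated with the bounds check that RFC 6520 implies for a heartbeat message: 1 type byte, a 2-byte payload length, the payload, then at least 16 bytes of padding. This is a sketch of that generic check, not the callback code from the commit; `heartbeat_len_ok` and the two constants are hypothetical names.

```c
#include <assert.h>
#include <stddef.h>

#define HB_HDR_LEN     3u  /* 1 byte of type + 2 bytes of payload length */
#define HB_MIN_PADDING 16u /* minimum padding required by RFC 6520 */

/* Returns 1 if the payload length announced in the heartbeat message <rec>
 * fits within the <rec_len> bytes actually received, 0 otherwise.
 * A message failing this check is what a Heartbleed probe looks like.
 */
int heartbeat_len_ok(const unsigned char *rec, size_t rec_len)
{
    size_t payload_len;

    assert(rec != NULL);

    /* too short to even hold the header and the mandatory padding */
    if (rec_len < HB_HDR_LEN + HB_MIN_PADDING)
        return 0;

    /* payload length is a big-endian 16-bit value after the type byte */
    payload_len = ((size_t)rec[1] << 8) | rec[2];

    return HB_HDR_LEN + payload_len + HB_MIN_PADDING <= rec_len;
}
```

A record of 19 bytes announcing a 16384-byte payload is rejected, while the same record announcing an empty payload is accepted.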
Willy Tarreau	b3966377d8	MINOR: connection: add a new error code for SSL with heartbeat Users have seen a huge increase in the rate of SSL handshake failures starting from 2014/04/08 with the release of the Heartbleed OpenSSL vulnerability (CVE-2014-0160). Haproxy can detect that a heartbeat was received in the incoming handshake, and such heartbeats are not supposed to be common, so let's log a different message when a handshake error happens after a heartbeat is detected. This patch only adds the new message and the new code.	2014-04-25 18:56:57 +02:00
William Lallemand	73025dd7e2	MEDIUM: http: register http-request and http-response keywords The http_(res|req)_keywords_register() functions allow registering new keywords. You need to declare a keyword list: struct http_req_action_kw_list test_kws = { .scope = "testscope", .kw = { { "test", parse_test }, { NULL, NULL }, } }; and a parsing function: int parse_test(const char **args, int *cur_arg, struct proxy *px, struct http_req_rule *rule, char **err) { rule->action = HTTP_REQ_ACT_CUSTOM_STOP; rule->action_ptr = action_function; return 0; } http_req_keywords_register(&test_kws); The HTTP_REQ_ACT_CUSTOM_STOP action stops evaluation of rules after your rule, HTTP_REQ_ACT_CUSTOM_CONT permits the evaluation of rules after your rule.	2014-04-25 18:48:35 +02:00
Baptiste Assmann	fabcbe0de6	MEDIUM: http: ACL and MAP updates through http-(request|response) rules This patch allows manipulation of ACL and MAP content thanks to any information available in a session: source IP address, HTTP request or response header, etc... It's an update "on the fly" of the content of the map/acls. This means it does not survive a reload or restart of HAProxy.	2014-04-25 18:48:35 +02:00
Baptiste Assmann	953f74d1b3	MINOR: pattern: find element in a reference This function can be used to look for an entry in either an ACL or a MAP.	2014-04-25 17:31:13 +02:00
Willy Tarreau	c35362a94a	MINOR: http: implement the max-keep-alive-queue setting Finn Arne Gangstad suggested that we should have the ability to break keep-alive when the target server has reached its maxconn and that a number of connections are present in the queue. After some discussion around his proposed patch, the following solution was suggested : have a per-proxy setting to fix a limit to the number of queued connections on a server after which we break keep-alive. This ensures that even in high latency networks where keep-alive is beneficial, we try to find a different server. This patch is partially based on his original proposal and implements this configurable threshold.	2014-04-25 14:14:41 +02:00
Willy Tarreau	1746eecc52	MINOR: checks: add a new global max-spread-checks directive This directive ensures that checks with a huge interval do not start too far apart at the beginning.	2014-04-25 10:52:25 +02:00
Willy Tarreau	f1fd9dc8fb	CLEANUP: general: get rid of all old occurrences of "session *t" All the code inherited from version 1.1 still holds a lot of sessions called "t" because in 1.1 they were tasks. This naming is very annoying and sometimes even confusing, for example in code involving tables. Let's get rid of this once and for all and before 1.5-final. Nothing changed beyond just carefully renaming these variables.	2014-04-24 21:25:50 +02:00
Willy Tarreau	af3cf70d7c	MEDIUM: stats: reimplement HTTP keep-alive on the stats page This basically reimplements commit `f3221f9` ("MEDIUM: stats: add support for HTTP keep-alive on the stats page") which was reverted by commit `51437d2` after Igor Chan reported a broken stats page caused by the bug fix by previous commit.	2014-04-24 17:24:56 +02:00
Willy Tarreau	5a8ba60fe1	CLEANUP: buffers: remove unused function buffer_contig_space_with_res() This function is now unused and was dangerous. Its cousin buffer_contig_space_res() was removed as well since it was the only one to use it.	2014-04-24 17:19:22 +02:00
Willy Tarreau	1234f4a210	MAJOR: http: reset msg->sov after headers are forwarded In order to avoid abusively relying on buf->o to guess how many bytes to rewind during a redispatch, we now clear msg->sov. Thus the meaning of this field is exactly "how many bytes of headers are left to be forwarded". It is still possible to rewind because msg->eoh + msg->eol equal that value before scheduling the forwarding, so we can always subtract them.	2014-04-22 23:15:28 +02:00
Willy Tarreau	0d09050aa5	MEDIUM: http: small helpers to compute how far to rewind to find BODY and DATA http_body_rewind() returns the number of bytes to rewind before buf->p to find the message's body. It relies on http_hdr_rewind() to find the beginning and adds msg->eoh + msg->eol which are always safe. http_data_rewind() does the same to get the beginning of the data, which differs from above when a chunk is present. It uses the function above and adds msg->sol. The purpose is to centralize further ->sov changes aiming at avoiding to rely on buf->o.	2014-04-22 23:15:28 +02:00
Willy Tarreau	da6eed621f	MINOR: http: add a small helper to compute how far to rewind to find URI http_uri_rewind() returns the number of bytes to rewind before buf->p to find the URI. It relies on http_hdr_rewind() to find the beginning and is just here to simplify operations. The purpose is to centralize further ->sov changes aiming at avoiding to rely on buf->o.	2014-04-22 23:15:28 +02:00
Willy Tarreau	211cdece79	MEDIUM: http: add a small helper to compute how far to rewind to find headers http_hdr_rewind() returns the number of bytes to rewind before buf->p to find the beginning of headers. At the moment it's not exact as it still relies on buf->o, assuming that no other data from a past message were pending there, but it's what was done till there. The purpose is to centralize further ->sov changes aiming at avoiding to rely on buf->o.	2014-04-22 23:15:28 +02:00
Willy Tarreau	2d8e485a7c	MINOR: http: add a small helper to compute the amount of body bytes present http_body_bytes() returns the number of bytes of the current message body present in the buffer. It is compatible with being called before and after the headers are forwarded. This is done to centralize further ->sov changes.	2014-04-22 23:15:28 +02:00
Willy Tarreau	877e78dbef	MAJOR: http: do not use msg->sol while processing messages or forwarding data There are still some pending issues in the gzip compressor, and fixing them requires a better handling of intermediate parsing states. Another issue to deal with is the rewinding of a buffer during a redispatch when a load balancing algorithm involves L7 data because the exact amount of data to rewind is not clear. At the moment, this is handled by unwinding all pending data, which cannot work in responses due to pipelining. Last, having a first analysis which parses the body and another one which restarts from where the parsing was left is wrong. Right now it only works because we never both parse and transform in the same direction. But that is wrong anyway. In order to address the first issue, we'll have to use msg->eoh + msg->eol to find the end of headers, and we still need to store the information about the forwarded header length somewhere (msg->sol might be reused for this). msg->sov may only be used for the start of data and not for subsequent chunks if possible. This first implies that we stop sharing it with header length, and stop using msg->sol there. In fact we don't need it already as it is always zero when reaching the HTTP_MSG_BODY state. It was only updated to reflect a copy of msg->sov. So now as a first step into that direction, this patch ensure that msg->sol is never re-assigned after being set to zero and is not used anymore when we're dealing with HTTP processing and forwarding. We'll later reuse it differently but for now it's secured. The patch does nothing magic, it only removes msg->sol everywhere it was already zero and avoids setting it. In order to keep the sov-sol difference, it now resets sov after forwarding data. In theory there's no problem here, but the patch is still tagged major because that code is complex.	2014-04-22 23:15:28 +02:00
Willy Tarreau	0558a02eb1	MINOR: http: make msg->eol carry the last CRLF length One of the issues we face when we need to either forward headers only before compressing, or rewind the stream during a redispatch is to know the proper length of the request headers. msg->eoh always has the total length up to the last CRLF, and we never know whether the request ended with a single LF or a standard CRLF. This makes it hard to rewind the headers without explicitly checking the bytes in the buffer. Instead of doing so, we now use msg->eol to carry the length of the last CRLF (either 1 or 2). Since it is not modified at all after HTTP_MSG_BODY, and was only left in an undefined state, it is safe to use at any moment. Thus, the complete header length to forward or to rewind now is always msg->eoh + msg->eol.	2014-04-22 23:15:28 +02:00
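The rule stated in the 0558a02eb1 entry (complete header length = eoh + eol, where eol is 1 for a bare LF and 2 for CRLF) can be shown with a small scanner. The struct `hdr_pos` and the function `find_end_of_headers` are hypothetical; they mimic the roles of msg->eoh and msg->eol but are not haproxy's parser.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical counterpart of msg->eoh and msg->eol */
struct hdr_pos {
    size_t eoh; /* offset of the empty line that ends the headers */
    size_t eol; /* length of that empty line's terminator: 1 (LF) or 2 (CRLF) */
};

/* Looks for the empty line terminating a header block in <buf>.
 * Returns 1 and fills <pos> when found, 0 if the headers are incomplete.
 */
int find_end_of_headers(const char *buf, size_t len, struct hdr_pos *pos)
{
    size_t line_start = 0;
    size_t i;

    assert(buf != NULL && pos != NULL);

    for (i = 0; i < len; i++) {
        size_t content;

        if (buf[i] != '\n')
            continue;

        content = i - line_start; /* bytes on this line before the LF */
        if (content == 0 || (content == 1 && buf[line_start] == '\r')) {
            pos->eoh = line_start;
            pos->eol = content + 1;
            return 1;
        }
        line_start = i + 1;
    }
    return 0;
}
```

Whatever terminator the sender used, the body starts at offset eoh + eol, so rewinding or forwarding the headers never requires inspecting the bytes again.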
Willy Tarreau	5a8f947f4f	CLEANUP: http: rename http_process_request_body() This function does not process anything, it just waits for the beginning of the request body. Let's rename it http_wait_for_request_body().	2014-04-22 23:15:27 +02:00
Willy Tarreau	226071e0a7	MEDIUM: http: wait for the first chunk or message body length in http_process_body This is the continuation of previous patch. Now that full buffers are not rejected anymore, let's wait for at least the advertised chunk or body length to be present or the buffer to be full. When either condition is met, the message processing can go forward. Thus we don't need to use url_param_post_limit anymore, which was passed in the configuration as an optional <max_wait> parameter after the "check_post" value. This setting was necessary when the feature was implemented because there was no support for parsing message bodies. The argument is now silently ignored if set in the configuration.	2014-04-22 23:15:27 +02:00
Thierry FOURNIER	dad3d1d402	MINOR: http: add the function "del-header" to the directives http-request and http-response This patch permits to remove all HTTP request and response header fields whose name is specified in <name>.	2014-04-22 19:13:50 +02:00
Thierry FOURNIER	74c219dc04	BUG/MEDIUM: stick-table: fix IPv4-to-IPv6 conversion in src_* fetches The function addr_to_stktable_key doesn't consider the expected type of key. If the stick table key is based on IPv6 addresses and the input is IPv4, the returned key is an IPv4 address and its length is 4 bytes, while a 16-byte key is expected. This patch considers the expected key type and tries to convert IPv4 to IPv6 or IPv6 to IPv4 accordingly. This fixes the bug reported by Apollon Oikonomopoulos. This bug was introduced somewhere in the 1.5-dev process.	2014-04-14 18:22:57 +02:00
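The conversion discussed in the 74c219dc04 entry can be sketched with the standard IPv4-mapped IPv6 form (::ffff:a.b.c.d). Whether the patch uses exactly this mapping is an assumption here, and the function names `v4_to_v6` and `v6_to_v4` are hypothetical; the point is that the key ends up with the 4 or 16 bytes the table expects.

```c
#include <assert.h>
#include <string.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>

/* Builds the 16-byte IPv4-mapped IPv6 address ::ffff:a.b.c.d from <v4> */
void v4_to_v6(const struct in_addr *v4, struct in6_addr *v6)
{
    assert(v4 != NULL && v6 != NULL);

    memset(v6->s6_addr, 0, 10);               /* 80 zero bits */
    memset(v6->s6_addr + 10, 0xff, 2);        /* 16 one bits */
    memcpy(v6->s6_addr + 12, &v4->s_addr, 4); /* the IPv4 address itself */
}

/* Extracts the 4-byte IPv4 address from an IPv4-mapped IPv6 address.
 * Returns 1 on success, 0 if <v6> is not IPv4-mapped (no conversion exists).
 */
int v6_to_v4(const struct in6_addr *v6, struct in_addr *v4)
{
    static const unsigned char mapped_prefix[12] = {
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0xff, 0xff
    };

    assert(v6 != NULL && v4 != NULL);

    if (memcmp(v6->s6_addr, mapped_prefix, sizeof(mapped_prefix)) != 0)
        return 0;

    memcpy(&v4->s_addr, v6->s6_addr + 12, 4);
    return 1;
}
```

The reverse direction can fail, since a native IPv6 address has no IPv4 equivalent, which is why the second function reports success separately from its output.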
Nenad Merdanovic	88afe03778	BUG/MINOR: Fix name lookup ordering when compiled with USE_GETADDRINFO When compiled with USE_GETADDRINFO, make sure we use getaddrinfo(3) to perform name lookups. On default dual-stack setups this will change the behavior of using IPv6 first. Global configuration option 'nogetaddrinfo' can be used to revert to deprecated gethostbyname(3).	2014-04-14 15:56:58 +02:00
Willy Tarreau	b4a163a135	CLEANUP: pattern: move array definitions to proto/ and not types/ All symbol definitions should be in proto/ and not types/.	2014-04-02 20:55:23 +02:00
Willy Tarreau	f6c22c8944	BUG/MEDIUM: pattern: fix wrong definition of the pat_prune_fcts array Commit `6f7203d` ("MEDIUM: pattern: add prune function") introduced an array of functions pat_prune_fcts[] but unfortunately declared it in pattern.h without marking it "extern", resulting in each file including it having its own copy.	2014-04-02 20:51:04 +02:00
Willy Tarreau	272adea423	REORG: cfgparse: move server keyword parsing to server.c The cfgparse.c file becomes huge, and a large part of it comes from the server keyword parser. Since the configuration is a bit more modular now, move this parser to server.c. This patch also moves the check of the "server" keyword earlier in the supported keywords list, resulting in a slightly faster config parsing for configs with large numbers of servers (about 10%). No functional change was made, only the code was moved.	2014-03-31 10:42:03 +02:00
Bertrand Jacquin	702d44f2ff	MEDIUM: proxy: support use_backend with dynamic names We have a use case where we look up a customer ID in an HTTP header and direct it to the corresponding server. This can easily be done using ACLs and use_backend rules, but the configuration becomes painful to maintain when the number of customers grows to a few tens or even a several hundreds. We realized it would be nice if we could make the use_backend resolve its name at run time instead of config parsing time, and use a similar expression as http-request add-header to decide on the proper backend to use. This permits the use of prefixes or even complex names in backend expressions. If no name matches, then the default backend is used. Doing so allowed us to get rid of all the use_backend rules. Since there are some config checks on the use_backend rules to see if the referenced backend exists, we want to keep them to detect config errors in normal config. So this patch does not modify the default behaviour and proceeds this way : - if the backend name in the use_backend directive parses as a log format rule, it's used as-is and is resolved at run time ; - otherwise it's a static name which must be valid at config time. There was the possibility of doing this with the use-server directive instead of use_backend, but it seems like use_backend is more suited to this task, as it can be used for other purposes. For example, it becomes easy to serve a customer-specific proxy.pac file based on the customer ID by abusing the errorfile primitive : use_backend bk_cust_%[hdr(X-Cust-Id)] if { hdr(X-Cust-Id) -m found } default_backend bk_err_404 backend bk_cust_1 errorfile 200 /etc/haproxy/static/proxy.pac.cust1 Signed-off-by: Bertrand Jacquin <bjacquin@exosec.fr>	2014-03-31 10:18:30 +02:00
Thierry FOURNIER	fa45f1d06c	MEDIUM: config: Dynamic sections. This patch permits registering new sections in haproxy's configuration file. It works like all the other "keyword" registrations: it is used during haproxy initialization, typically from "__attribute__((constructor))" functions.	2014-03-31 09:56:40 +02:00
Thierry FOURNIER	9f95e4084c	MINOR: standard: Add ipv6 support in the function url2sa(). The function url2sa() quickly converts a URL like http://<ip>:<port> into a struct sockaddr_storage. This patch adds: - https support - the ability to return the parsed length - IPv6 support - synchronous DNS resolution, only during haproxy startup. The faster IPv4 conversion path is kept. IPv6 is slower because it uses the standard IPv6 parser function.	2014-03-31 09:54:44 +02:00
Thierry FOURNIER	46006bde3c	MINOR: pattern: Add function to prune and reload pattern list. This function is used to dynamically update all the patterns attached to one file. This function is atomic. All parsing or indexing failures are reported in the haproxy logs.	2014-03-28 13:23:07 +01:00
Thierry FOURNIER	c5a4e98639	MEDIUM: acl: Change the acl register struct This patch replaces a lot of pointers with pattern matching identifiers. If the declared ACL uses only the predefined pattern matching functions, the register function gets the functions provided by "pattern.c" and identified by PAT_MATCH_*. If the ACL uses its own functions, they can be declared, and the ACL registration doesn't change them.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	b7729c96a4	MINOR: pattern: forbid dns resolutions This patch adds the flag "-n" to the acl parser. The flag "-n" forbids DNS resolutions. Maps always have DNS resolutions disabled.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	66eb9bf691	MINOR: pattern: Remove the flag "PAT_F_FROM_FILE". This flag is no longer used. The last place using it was the display of the result of pattern matching in the CLI commands "get map" and "get acl". The first parameter of these commands is the reference of the file used to perform the lookup.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	fc7ac7b89c	MINOR: standard: Disable ip resolution during the runtime The function str2net runs a DNS resolution if a valid IP cannot be parsed. The DNS function used is the standard libc function, which performs a synchronous (blocking) request. A blocking request is not compatible with the haproxy architecture. str2net() is used at runtime through the "socket". This patch removes the DNS resolution at runtime.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	94580c9f52	MINOR: dumpstat/conf: display all the configuration lines that use a pattern reference	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	eeaa951726	MINOR: configuration: File and line propagation This patch allows the file and line of the configuration file to be passed to the configuration parser.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	0b6d15fdc8	MINOR: regex: The pointer regstr in the struct regex is no longer used. The pointer <regstr> is only used to compare and identify the original regex string with the patterns. Now the patterns have a reference map containing this original string. It is useless to store this value twice.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	364cfdff7a	MEDIUM: dumpstats: Display error message during add of values. This patch adds a new display type. This display returns an allocated string; once the string is flushed into the buffers, it is freed. This permits returning the content of "memprintf(err, ...)" messages. The pat_ref_add functions have changed to return an error.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	c0bd9100eb	MINOR: pattern: Check if the file reference is not used with acl and map The format of acl files is not the same as the format of map files. In some cases the same file can be used, but this is ambiguous for the user because the patterns are not the expected ones.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	39bef456fb	MINOR: pattern/map/acl: Centralization of the file parsers The acl and map functions do the same work when parsing files. This patch merges this code into a single place. Note that the function map_read_entries_from_file() in the file "map.c" is moved to the function pat_ref_read_from_file_smp() in the file "pattern.c". The code of this function is not modified, only the name and the order of the arguments have changed.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	e369ca2e66	MEDIUM: pattern_find_smp: functions find_smp uses the pat_ref_elt to find the element to be removed The find_smp functions search for the smp using the value of the pat_ref_elt pointer. The pat_find_smp_* functions are no longer used. The function pattern_find_smp() knows all pattern indexation, and can be found	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	7acca4b269	MEDIUM: pattern: delete() function uses the pat_ref_elt to find the element to be removed All the pattern delete functions can use their reference to the original "struct pat_ref_elt" to find the element to be removed. The functions pat_del_list_str() and pat_del_meth() were deleted because, after applying this modification, they have the same code as pat_del_list_ptr().	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	6bb53ff164	MINOR: pattern: Each pattern expression element store the reference struct. Now, each pattern entry knows the original "struct pat_ref_elt" from which it was built. This patch permits deleting each pattern entry without confusion. After this patch, each reference can use its pointer to be targeted.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	0d6ba513a5	MINOR: pattern: store configuration reference for each acl or map pattern. This patch permits adding a configuration reference for each pattern reference. This is useful to identify the listed ACLs.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	aa222aadb6	MINOR: pattern: The function "pattern_register()" is no longer used. Remove the function "pattern_register()" and its prototype because it is no longer used.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	af5a29d5f8	MINOR: pattern: Each pattern is identified by unique id. The pattern reference are stored with two identifiers: the unique_id and the reference. The reference identify a file. Each file with the same name point to the same reference. We can register many times one file. If the file is modified, all his dependencies are also modified. The reference can be used with map or acl. The unique_id identify inline acl. The unique id is unique for each acl. You cannot force the same id in the configuration file, because this repport an error. The format of the acl and map listing through the "socket" has changed for displaying these new ids.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	5d34408785	MEDIUM: pattern: The expected type is stored in the pattern head, and conversion is executed once. This patch moves the expect_type variable from "struct pattern" to "struct pattern_head". This variable is set during the declaration of ACL and MAP. With this change, the function "pat_parse_len()" becomes useless and can be replaced by "pat_parse_int()". Implicit ACLs by default rely on the fetch's output type, so let's simply do the same for all other ones. It has been verified that they all match.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	c5959fd5d4	MEDIUM: pattern: merge same pattern Sometimes the same pattern file is used with the same index, parse and parse_smp functions. If these two conditions are true, the two patterns are identical and the same struct can be used.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	1e00d3853b	MAJOR: pattern/map: Extends the map edition system in the patterns This patch adds the following socket command line options: "show acl [<id>]", "clear acl <id>", "get acl <id> <pattern>", "del acl <id> <pattern>", "add acl <id> <pattern>". The system used for maps is backported in the pattern functions.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	888863534c	MINOR: map/pattern: The sample parser is stored in the pattern We cannot separate the pattern and the value. Now, the pattern knows the value and is able to parse its associated sample storage.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	3667e514d9	MEDIUM: pattern/dumpstats: The function pattern_lookup() is no longer used This function was used in dumpstats. It is now replaced by the delete and find_smp function pointers.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	55d0b10f06	MEDIUM: pattern: add sample lookup function. Some functions need to change the sample associated with a pattern. This new pointer makes it possible to return a pointer to the sample pointer. The caller can use or change the value.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	6f7203d673	MEDIUM: pattern: add prune function This patch adds a specific pointer to each expression to point to the prune function. Now, each pattern expression embeds its own prune function.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	b113650e54	MEDIUM: pattern: add delete functions This commit adds a delete function for patterns. It looks up all instances of the pattern to delete and deletes them all. The fetch keyword declarations have been extended to point to the appropriate delete function.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	33a7433ac9	MEDIUM: pattern: Index IPv6 addresses in a tree. This commit adds a second tree node in the pattern struct and uses it to index IPv6 addresses. This commit carries over the feature used in the list. If an IPv4 sample does not match the tree, the IPv4 address is converted to IPv6 by prefixing it with "::ffff", and the match function then tries a lookup in the IPv6 tree. If an IPv6 sample does not match the IPv6 tree, addresses prefixed by "2002:IPv4", "::ffff:IPv4" and "::0000:IPv4" are converted to IPv4 addresses, and the match function then tries a lookup in the IPv4 tree.	2014-03-17 18:06:07 +01:00
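The two fallback conversions described in commit 33a7433ac9 can be sketched as follows. This is only an illustration under stated assumptions: the helper names `sketch_v4_to_v6()` and `sketch_v6_to_v4()` are hypothetical and are not the functions used in HAProxy, and the tree lookups themselves are left out.

```c
#include <arpa/inet.h>
#include <assert.h>
#include <netinet/in.h>
#include <string.h>

/* First 12 bytes of an IPv4-mapped IPv6 address (::ffff:a.b.c.d). */
static const unsigned char sketch_mapped[12] = {
    0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0xff, 0xff
};

/* Hypothetical helper: build ::ffff:a.b.c.d from an IPv4 address. */
static void sketch_v4_to_v6(const struct in_addr *v4, struct in6_addr *v6)
{
    assert(v4 && v6);
    memcpy(v6->s6_addr, sketch_mapped, sizeof(sketch_mapped));
    memcpy(v6->s6_addr + 12, &v4->s_addr, 4);
}

/* Hypothetical helper: extract the IPv4 address embedded in an IPv6
 * address using one of the three prefixes named in the commit message.
 * Returns 1 and fills <v4> on success, 0 if no known prefix matches.
 */
static int sketch_v6_to_v4(const struct in6_addr *v6, struct in_addr *v4)
{
    static const unsigned char zero[12]; /* "::0000:IPv4" form */

    assert(v6 && v4);
    if (memcmp(v6->s6_addr, sketch_mapped, 12) == 0 ||
        memcmp(v6->s6_addr, zero, 12) == 0) {
        /* the IPv4 address sits in the last 4 bytes */
        memcpy(&v4->s_addr, v6->s6_addr + 12, 4);
        return 1;
    }
    if (v6->s6_addr[0] == 0x20 && v6->s6_addr[1] == 0x02) {
        /* "2002:IPv4" form: the IPv4 address follows the 16-bit prefix */
        memcpy(&v4->s_addr, v6->s6_addr + 2, 4);
        return 1;
    }
    return 0;
}
```

A caller would try the native tree first and only fall back to one of these conversions when that lookup fails, which is the order the commit message describes.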
Thierry FOURNIER	5338eea8eb	MEDIUM: pattern: The match function itself browses the list or the tree. The match function knows the format of the pattern. The pattern can be stored in a list or in a tree. The pattern matching function itself uses the right entry point and indexation type. Each pattern matching function returns the struct pattern that matches. If the flag "fill" is set, the struct pattern is filled, otherwise the content of this struct must not be used. With this feature, the general pattern matching function cannot have exceptions for building the "struct pattern".	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	1794fdf37e	MEDIUM: pattern: The function pattern_exec_match() returns "struct pattern" if the pattern matches. Before this commit, the pattern_exec_match() function returned the associated sample, the associated struct pattern or the associated struct pattern_tree. This was complex to use, because the caller had to check the type of information returned. Now the function always returns a "struct pattern". If <fill> is not set, only the value of the pointer can be used as a boolean (NULL or other). If <fill> is set, you can use the <smp> pointer and the pattern information. If information must be duplicated, it is stored in the trash buffer. Otherwise, the pattern can point to existing strings.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	d437314979	MEDIUM: sample/http_proto: Add new type called method The methods are actually stored using two types: integer if the method is known and string if the method is not known. The fetch is declared as UINT, but in some cases it can provide STR. This patch creates a new type called METH. This type contains an integer for known methods and a string for the other methods. It can be used with automatic converters. The pattern matching can expect a method. During the free or prune function, the http_meth pattern is freed. This patch initialises the freed pointer to NULL.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	7654c9ff44	MEDIUM: sample: Remove types SMP_T_CSTR and SMP_T_CBIN, replace it by SMP_F_CONST flags The operations applied on types SMP_T_CSTR and SMP_T_STR are the same, but the check code and the declarations are duplicated, because actions must be declared for both SMP_T_C* and SMP_T_*. The declared actions and checks are the same. This complicates the code. Only the "conv" functions can change from "C" to "*". Now, if a function needs to modify the input string, it can call the new function smp_dup(). This one duplicates data in a trash buffer.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	b050463375	MINOR: standard: Add function for converting cidr to network mask.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	0e9af55700	MINOR: sample: don't call the sample cast function "c_none" If the cast function to execute is c_none, don't execute it and return true. The function c_none does nothing. This saves a call.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	e1bcac5b8f	MINOR: pattern: Rename "pat_idx_elt" to "pattern_tree" This is just for having coherent struct names.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	edc15c3a35	MEDIUM: pattern: The parse functions just return "struct pattern" without memory allocation The pattern parse functions put the parsed result in a "struct pattern" without memory allocation. If the pattern must reference the input data without changes, the pattern points to the parsed string. If buffers are needed to store translated data, it uses the trash buffer. The indexation function allocates the memory later if it is needed.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	b9b08460a2	MEDIUM: pattern: add indexation function. Before this patch, the indexation function checked the declared pattern matching function and indexed the data according to this function. This was not practical for adding new indexation modes. This commit adds a dedicated indexation function. Each struct pattern is associated with one indexation function. This function makes it possible to index data according to the type of pattern and the type of match.	2014-03-17 18:06:06 +01:00
Willy Tarreau	1cf8f08c17	MINOR: sample: move smp_to_type to sample.c This way it can be exported and reused anywhere else to report type names.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	3ead5b93c6	MINOR: pattern: separate list element from the data part. This commit separates the "struct list" used for chaining from the "struct pattern" which contains the pattern data. Later, this change will make it possible to manipulate lists and trees with the same "struct pattern".	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	972028fa67	MEDIUM: pattern: Change the prototype of the function pattern_register(). Each pattern parser takes only one string. This change is carried over to the prototype of the function "pattern_register()". Now, it is called with just one string and there is no need to browse the array of args.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	580c32cb3a	MEDIUM: pattern: The pattern parser no longer uses <opaque> and just takes one string. After the previous patches, the "pat_parse_strcat()" function disappears, and the "pat_parse_int()" and "pat_parse_dotted_ver()" functions no longer use the "opaque" argument and take only one string as input. So, after this patch, each pattern parser no longer uses the opaque variable and takes only one string as input. This patch changes the prototype of the pattern parsing functions. Now, the "char **args" is replaced by a "char *arg", the "int *opaque" is removed and these functions return 1 on success and 0 on failure.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	511e9475f2	MEDIUM: acl/pattern: standardisation of "pat_parse_int()" and "pat_parse_dotted_ver()" The goal of this patch is to simplify the prototype of the "pat_pattern_*()" functions. I want to replace the argument "char **args" by a simple "char *arg" and remove the "opaque" argument. "pat_parse_int()" and "pat_parse_dotted_ver()" are the only pattern parsers using the "opaque" argument and using more than one string argument of the char **args. These specificities are only used with ACL. Other systems using these pattern parsers (MAP and CLI) just use one string for describing a range. These two functions can read a range, but the min and the max must be specified. This patch extends the syntax to describe a range with implicit min and max. This is used for operators like "lt", "le", "gt", and "ge". The syntax is the following: ":x" -> no min to "x"; "x:" -> "x" to no max. This patch moves the parsing of the comparison operator from the functions "pat_parse_int()" and "pat_parse_dotted_ver()" to the acl parser. The acl parser reads the operator and the values and builds a volatile string readable by the functions "pat_parse_int()" and "pat_parse_dotted_ver()". The transformation is done with these rules. If the parser is "pat_parse_int()": "eq x" -> "x"; "le x" -> ":x"; "lt x" -> ":y" (with y = x - 1); "ge x" -> "x:"; "gt x" -> "y:" (with y = x + 1). If the parser is "pat_parse_dotted_ver()": "eq x.y" -> "x.y"; "le x.y" -> ":x.y"; "lt x.y" -> ":w.z" (with w.z = x.y - 1); "ge x.y" -> "x.y:"; "gt x.y" -> "w.z:" (with w.z = x.y + 1). Note that, if "y" is not present, it is assumed to be "0". Now "pat_parse_int()" and "pat_parse_dotted_ver()" accept only one pattern and the variable "opaque" is no longer used. The prototype of the pattern parsers can be changed.	2014-03-17 18:06:06 +01:00
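The integer rewriting rules listed in commit 511e9475f2 can be illustrated with a small sketch. The function name `sketch_op_to_range()` is hypothetical and this is not the code of the acl parser; it only applies the five rules given for "pat_parse_int()" and ignores overflow at the limits of the integer type.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: rewrite "<op> <x>" into the single-string range
 * syntax described in the commit message (":x", "x:", or plain "x").
 * Returns 1 on success, 0 if <op> is unknown or <out> is too small.
 * Overflow of x - 1 and x + 1 is deliberately not handled here.
 */
static int sketch_op_to_range(const char *op, long long x,
                              char *out, size_t size)
{
    int len;

    assert(op && out);
    if (strcmp(op, "eq") == 0)
        len = snprintf(out, size, "%lld", x);       /* "eq x" -> "x"   */
    else if (strcmp(op, "le") == 0)
        len = snprintf(out, size, ":%lld", x);      /* "le x" -> ":x"  */
    else if (strcmp(op, "lt") == 0)
        len = snprintf(out, size, ":%lld", x - 1);  /* "lt x" -> ":y"  */
    else if (strcmp(op, "ge") == 0)
        len = snprintf(out, size, "%lld:", x);      /* "ge x" -> "x:"  */
    else if (strcmp(op, "gt") == 0)
        len = snprintf(out, size, "%lld:", x + 1);  /* "gt x" -> "y:"  */
    else
        return 0;

    /* snprintf() reports the length it needed; reject truncation */
    return len >= 0 && (size_t)len < size;
}
```

With this shape, the parser that receives the string only ever sees one argument, which is the point of the prototype change the commit prepares.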
Thierry FOURNIER	9eec0a646b	MAJOR: auth: Change the internal authentication system. This patch removes the limit of 32 groups. It also makes it possible to use the standard "pat_parse_str()" function in place of "pat_parse_strcat()". The "pat_parse_strcat()" function is no longer used and is removed. Before this patch, the groups were stored in a bitfield; now they are stored in a list of strings. The matching is slower, but the number of groups is low and generally the list of allowed groups is short. The fetch function "smp_fetch_http_auth_grp()" used with the name "http_auth_group" returns a valid username. It can be used as a string for displaying the username or with the acl "http_auth_group" for checking the group of the user. Maybe the names of the ACL and fetch methods are no longer suitable, but I keep the current names to preserve compatibility with existing configurations. The function "userlist_postinit()" is created from verification code stored in the big function "check_config_validity()". The code is adapted to the new authentication storage system and is moved to the "src/auth.c" file. This function is used to check the validity of the users declared in groups and to check the validity of groups declared on the "user" entries. This resolve function is executed before the check of all proxies because many acls need resolved users and groups.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	d048d8b891	BUG/MINOR: http: fix encoding of samples used in http headers The binary samples are sometimes copied as is into http headers. A sample can contain bytes unallowed by the http rfc concerning header content, for example if it was extracted from binary data. The resulting http request can thus be invalid. This issue does not yet happen because haproxy currently (mistakenly) hex-encodes binary data, so it is not really possible to retrieve invalid HTTP chars. The solution consists in hex-encoding all non-printable chars prefixed by a '%' sign. No backport is needed since existing code is not affected yet.	2014-03-17 16:39:03 +01:00
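The encoding described in commit d048d8b891 (hex-encode every non-printable character, prefixed by '%') can be sketched as below. The function `sketch_hex_encode()` is hypothetical and is not HAProxy's `encode_string()` or `encode_chunk()`. Escaping the '%' sign itself is an assumption added here so that the output stays unambiguous; the commit message does not state it.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: copy <len> bytes from <in> to <out>, replacing
 * each non-printable byte (and '%', by assumption) with "%XX".
 * The output is always NUL-terminated when <size> is non-zero and is
 * cut at a character boundary if it does not fit.
 * Returns the number of bytes written, excluding the trailing NUL.
 */
static size_t sketch_hex_encode(const unsigned char *in, size_t len,
                                char *out, size_t size)
{
    static const char hex[] = "0123456789ABCDEF";
    size_t i, o = 0;

    assert(in && out);
    if (size == 0)
        return 0;

    for (i = 0; i < len; i++) {
        unsigned char c = in[i];

        if (c >= 0x20 && c <= 0x7e && c != '%') {
            if (o + 1 >= size)      /* keep room for the NUL */
                break;
            out[o++] = (char)c;
        } else {
            if (o + 3 >= size)      /* "%XX" plus the NUL */
                break;
            out[o++] = '%';
            out[o++] = hex[c >> 4];
            out[o++] = hex[c & 0x0f];
        }
    }
    out[o] = '\0';
    return o;
}
```

For instance, a sample containing a carriage return and a line feed can no longer split the header line, because both bytes are emitted as three printable characters each.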
Thierry FOURNIER	e059ec9393	MINOR: standard: add function "encode_chunk" This function has the same behavior as encode_string(), except it takes a "struct chunk" instead of a "char *" on input.	2014-03-17 16:38:56 +01:00
Willy Tarreau	f79d950163	MEDIUM: proxy: create a tree to store proxies by name Large configurations can take time to parse when thousands of backends are in use. Let's store all the proxies in trees. findproxy_mode() has been modified to use the tree for lookups, which has divided the parsing time by about 2.5. But many lookups are still present at many places and need to be dealt with.	2014-03-15 07:48:35 +01:00
Willy Tarreau	80a92c02f4	BUG/MEDIUM: http: don't start to forward request data before the connect Currently, "balance url_param check_post" randomly works. If the client sends chunked data and there's another chunk after the one containing the data, http_request_forward_body() will advance msg->sov and move the start of data to the beginning of the last chunk, and get_server_ph_post() will not find the data. In order to avoid this, we add an HTTP_MSGF_WAIT_CONN flag whose goal is to prevent the forwarding code from parsing until the connection is confirmed, so that we're certain not to fail on a redispatch. Note that we need to force channel_auto_connect() since the output buffer is empty and a previous analyser might have stopped auto-connect. The flag is currently set whenever some L7 POST analysis is needed for a connect() so that it correctly addresses all corner cases involving a possible rewind of the buffer, waiting for a better fix. Note that this has been broken for a very long time. Even all 1.4 versions seem broken but differently, with ->sov pointing to the end of the arguments. So the fix should be considered for backporting to all stable releases, possibly including 1.3 which works differently.	2014-03-14 12:22:56 +01:00
Willy Tarreau	36346247ac	BUG/MEDIUM: http: continue to emit 503 on keep-alive to different server Finn Arne Gangstad reported that commit `6b726adb35` ("MEDIUM: http: do not report connection errors for second and further requests") breaks support for serving static files by abusing the errorfile 503 statement. Indeed, a second request over a connection sent to any server or backend returning 503 would silently be dropped. The proper solution consists in adding a flag on the session indicating that the server connection was reused, and to only avoid the error code in this case.	2014-02-24 18:26:30 +01:00
Willy Tarreau	7e3127391f	MINOR: config: make the stream interface idle timer user-configurable The new tune.idletimer value allows one to set a different value for idle stream detection. The default value remains set to one second. It is possible to disable it using zero, and to change the default value at build time using DEFAULT_IDLE_TIMER.	2014-02-12 16:36:12 +01:00
Willy Tarreau	b145c78623	MINOR: channel: add the date of last read in the channel We store the time stamp of last read in the channel in order to be able to measure some bit rate and pause lengths. We only use 16 bits which were unused for this. We don't need more, as it allows us to measure with a millisecond precision for up to 65s.	2014-02-12 11:45:59 +01:00
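The 16-bit limit mentioned in commit b145c78623 follows from 2^16 ms = 65536 ms, a little over 65 s. A minimal sketch of how such a truncated timestamp can still yield correct intervals is shown below; the function names and the 32-bit millisecond clock are assumptions made for illustration and do not reflect the actual field or helpers in the channel code.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: keep only the low 16 bits of a millisecond clock. */
static uint16_t sketch_stamp(uint32_t now_ms)
{
    return (uint16_t)now_ms;
}

/* Hypothetical helper: milliseconds elapsed since <last_read>.
 * The subtraction is done modulo 2^16, so it stays correct across a
 * wrap of the stored value as long as the real interval is below
 * 65536 ms. Longer intervals alias to a shorter one.
 */
static uint16_t sketch_elapsed(uint32_t now_ms, uint16_t last_read)
{
    return (uint16_t)((uint16_t)now_ms - last_read);
}

/* Hypothetical helper: report whether the pause since <last_read> has
 * reached <limit_ms>. Limits above 65535 ms cannot be observed with a
 * 16-bit stamp, hence the assertion.
 */
static int sketch_pause_reached(uint32_t now_ms, uint16_t last_read,
                                uint32_t limit_ms)
{
    assert(limit_ms <= UINT16_MAX);
    return sketch_elapsed(now_ms, last_read) >= limit_ms;
}
```

The aliasing of longer intervals is the trade-off for using only the 16 spare bits the commit message mentions.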
Willy Tarreau	8f39dcdc8d	BUG/MINOR: channel: initialize xfer_small/xfer_large on new buffers These ones are only reset during transfers. There is a low but non-null risk that a first full read causes the previous value to be reused and the CF_STREAMER flag to be set immediately. The impact is only to increase earlier than expected the SSL record size and to use splice(). This bug was already present in 1.4, so a backport is possible.	2014-02-12 11:45:45 +01:00


1699 Commits