haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-02-19 20:27:01 +00:00

Author	SHA1	Message	Date
Thierry FOURNIER	74c219dc04	BUG/MEDIUM: stick-table: fix IPv4-to-IPv6 conversion in src_* fetches The function addr_to_stktable_key doesn't consider the expected type of key. If the stick table key is based on IPv6 addresses and the input is IPv4, the returned key is IPv4 adddress and his length is 4 bytes, while is expected 16 bytes key. This patch considers the expected key and try to convert IPv4 to IPv6 and IPv6 to IPv4 according with the expected key. This fixes the bug reported by Apollon Oikonomopoulos. This bug was introduced somewhere in the 1.5-dev process.	2014-04-14 18:22:57 +02:00
Nenad Merdanovic	88afe03778	BUG/MINOR: Fix name lookup ordering when compiled with USE_GETADDRINFO When compiled with USE_GETADDRINFO, make sure we use getaddrinfo(3) to perform name lookups. On default dual-stack setups this will change the behavior of using IPv6 first. Global configuration option 'nogetaddrinfo' can be used to revert to deprecated gethostbyname(3).	2014-04-14 15:56:58 +02:00
Willy Tarreau	b4a163a135	CLEANUP: pattern: move array definitions to proto/ and not types/ All symbol definitions should be in proto/ and not types/.	2014-04-02 20:55:23 +02:00
Willy Tarreau	f6c22c8944	BUG/MEDIUM: pattern: fix wrong definition of the pat_prune_fcts array Commit `6f7203d` ("MEDIUM: pattern: add prune function") introduced an array of functions pat_prune_fcts[] but unfortunately declared it in pattern.h without marking it "extern", resulting in each file including it having its own copy.	2014-04-02 20:51:04 +02:00
Willy Tarreau	272adea423	REORG: cfgparse: move server keyword parsing to server.c The cfgparse.c file becomes huge, and a large part of it comes from the server keyword parser. Since the configuration is a bit more modular now, move this parser to server.c. This patch also moves the check of the "server" keyword earlier in the supported keywords list, resulting in a slightly faster config parsing for configs with large numbers of servers (about 10%). No functional change was made, only the code was moved.	2014-03-31 10:42:03 +02:00
Bertrand Jacquin	702d44f2ff	MEDIUM: proxy: support use_backend with dynamic names We have a use case where we look up a customer ID in an HTTP header and direct it to the corresponding server. This can easily be done using ACLs and use_backend rules, but the configuration becomes painful to maintain when the number of customers grows to a few tens or even a several hundreds. We realized it would be nice if we could make the use_backend resolve its name at run time instead of config parsing time, and use a similar expression as http-request add-header to decide on the proper backend to use. This permits the use of prefixes or even complex names in backend expressions. If no name matches, then the default backend is used. Doing so allowed us to get rid of all the use_backend rules. Since there are some config checks on the use_backend rules to see if the referenced backend exists, we want to keep them to detect config errors in normal config. So this patch does not modify the default behaviour and proceeds this way : - if the backend name in the use_backend directive parses as a log format rule, it's used as-is and is resolved at run time ; - otherwise it's a static name which must be valid at config time. There was the possibility of doing this with the use-server directive instead of use_backend, but it seems like use_backend is more suited to this task, as it can be used for other purposes. For example, it becomes easy to serve a customer-specific proxy.pac file based on the customer ID by abusing the errorfile primitive : use_backend bk_cust_%[hdr(X-Cust-Id)] if { hdr(X-Cust-Id) -m found } default_backend bk_err_404 backend bk_cust_1 errorfile 200 /etc/haproxy/static/proxy.pac.cust1 Signed-off-by: Bertrand Jacquin <bjacquin@exosec.fr>	2014-03-31 10:18:30 +02:00
Thierry FOURNIER	fa45f1d06c	MEDIUM: config: Dynamic sections. This patch permit to register new sections in the haproxy's configuration file. This run like all the "keyword" registration, it is used during the haproxy initialization, typically with the "__attribute__((constructor))" functions.	2014-03-31 09:56:40 +02:00
Thierry FOURNIER	9f95e4084c	MINOR: standard: Add ipv6 support in the function url2sa(). The function url2sa() converts faster url like http://<ip>:<port> in a struct sockaddr_storage. This patch add: - the https support - permit to return the length parsed - support IPv6 - support DNS synchronous resolution only during start of haproxy. The faster IPv4 convertion way is keeped. IPv6 is slower, because I use the standard IPv6 parser function.	2014-03-31 09:54:44 +02:00
Thierry FOURNIER	46006bde3c	MINOR: pattern: Add function to prune and reload pattern list. This function it is used for dynamically update all the patterns attached to one file. This function is atomic. All parsing or indexation failures are reported in the haproxy logs.	2014-03-28 13:23:07 +01:00
Thierry FOURNIER	c5a4e98639	MEDIUM: acl: Change the acl register struct This patch replace a lot of pointeur by pattern matching identifier. If the declared ACL use all the predefined pattern matching functions, the register function gets the functions provided by "pattern.c" and identified by the PAT_LATCH_*. In the case of the acl uses his own functions, they can be declared, and the acl registration doesn't change it.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	b7729c96a4	MINOR: pattern: forbid dns resolutions This patch adds the flags "-n" on the acl parser. the flag "-n" forbif the DNS resolutions. The maps have always the dns resolutions disabled.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	66eb9bf691	MINOR: pattern: Remove the flag "PAT_F_FROM_FILE". This flag is no longer used. The last place using this, are the display of the result of pattern matching in the cli command "get map" or "get acl". The first parameter of this command is the reference of the file used to perform the lookup.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	fc7ac7b89c	MINOR: standard: Disable ip resolution during the runtime The function str2net runs DNS resolution if valid ip cannot be parsed. The DNS function used is the standard function of the libc and it performs asynchronous request. The asynchronous request is not compatible with the haproxy archictecture. str2net() is used during the runtime throught the "socket". This patch remove the DNS resolution during the runtime.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	94580c9f52	MINOR: dumpstat/conf: display all the configuration lines that using pattern reference	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	eeaa951726	MINOR: configuration: File and line propagation This patch permits to communicate file and line of the configuration file at the configuration parser.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	0b6d15fdc8	MINOR: regex: The pointer regstr in the struc regex is no longer used. The pointer <regstr> is only used to compare and identify the original regex string with the patterns. Now the patterns have a reference map containing this original string. It is useless to store this value two times.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	364cfdff7a	MEDIUM: dumpstats: Display error message during add of values. This patch adds new display type. This display returns allocated string, when the string is flush into buffers, it is freed. This permit to return the content of "memprintf(err, ...)" messages. The pat_ref_add functions has changed to return error.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	c0bd9100eb	MINOR: pattern: Check if the file reference is not used with acl and map The format of the acl file are not the same than the format of the map files. In some case, the same file can be used, but this is ambiguous for the user because the patterns are not the expected.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	39bef456fb	MINOR: pattern/map/acl: Centralization of the file parsers The acl and map function do the same work with the file parsing. This patch merge these code in only one. Note that the function map_read_entries_from_file() in the file "map.c" is moved to the the function pat_ref_read_from_file_smp() in the file "pattern.c". The code of this function is not modified, only the the name and the arguments order has changed.	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	e369ca2e66	MEDIUM: pattern_find_smp: functions find_smp uses the pat_ref_elt to find the element to be removed The find_smp search the smp using the value of the pat_ref_elt pointer. The pat_find_smp_* are no longer used. The function pattern_find_smp() known all pattern indexation, and can be found	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	7acca4b269	MEDIUM: pattern: delete() function uses the pat_ref_elt to find the element to be removed All the pattern delete function can use her reference to the original "struct pat_ref_elt" to find the element to be remove. The functions pat_del_list_str() and pat_del_meth() were deleted because after applying this modification, they have the same code than pat_del_list_ptr().	2014-03-17 18:06:08 +01:00
Thierry FOURNIER	6bb53ff164	MINOR: pattern: Each pattern expression element store the reference struct. Now, each pattern entry known the original "struct pat_ref_elt" from that was built. This patch permit to delete each pattern entry without confusion. After this patch, each reference can use his pointer to be targeted.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	0d6ba513a5	MINOR: pattern: store configuration reference for each acl or map pattern. This patch permit to add reference for each pattern reference. This is useful to identify the acl listed.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	aa222aadb6	MINOR: pattern: The function "pattern_register()" is no longer used. Remove the function "pattern_register()" and its prototype because it is no longer used.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	af5a29d5f8	MINOR: pattern: Each pattern is identified by unique id. The pattern reference are stored with two identifiers: the unique_id and the reference. The reference identify a file. Each file with the same name point to the same reference. We can register many times one file. If the file is modified, all his dependencies are also modified. The reference can be used with map or acl. The unique_id identify inline acl. The unique id is unique for each acl. You cannot force the same id in the configuration file, because this repport an error. The format of the acl and map listing through the "socket" has changed for displaying these new ids.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	5d34408785	MEDIUM: pattern: The expected type is stored in the pattern head, and conversion is executed once. This patch extract the expect_type variable from the "struct pattern" to "struct pattern_head". This variable is set during the declaration of ACL and MAP. With this change, the function "pat_parse_len()" become useless and can be replaced by "pat_parse_int()". Implicit ACLs by default rely on the fetch's output type, so let's simply do the same for all other ones. It has been verified that they all match.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	c5959fd5d4	MEDIUM: pattern: merge same pattern Sometimes the same pattern file is used with the same index, parse and parse_smp functions. If this two condition are true, these two pattern are identical and the same struct can be used.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	1e00d3853b	MAJOR: pattern/map: Extends the map edition system in the patterns This patch add the following socket command line options: show acl [<id>] clear acl <id> get acl <id> <pattern> del acl <id> <pattern> add acl <id> <pattern> The system used for maps is backported in the pattern functions.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	888863534c	MINOR: map/pattern: The sample parser is stored in the pattern We cannot separe the pattern and the value. Now, the patern known the value and the pattern is able to parsehis associated sample staroage.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	3667e514d9	MEDIUM: pattern/dumpstats: The function pattern_lookup() is no longer used This function are used in dumpstats. Now this function is replaced by delete and find_smp function pointer.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	55d0b10f06	MEDIUM: pattern: add sample lookup function. Some functions needs to change the sample associated to pattern. This new pointer permit to return the a pointer to the sample pointer. The caller can use or change the value.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	6f7203d673	MEDIUM: pattern: add prune function This path add specific pointer to each expression to point on prune function. Now, each pattern expression embed his own prune function.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	b113650e54	MEDIUM: pattern: add delete functions This commit adds a delete function for patterns. It looks up all instances of the pattern to delete and deletes them all. The fetch keyword declarations have been extended to point to the appropriate delete function.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	33a7433ac9	MEDIUM: pattern: Index IPv6 addresses in a tree. This commit adds second tree node in the pattern struct and use it to index IPv6 addresses. This commit report feature used in the list. If IPv4 not match the tree, try to convert the IPv4 address in IPv6 with prefixing the IPv4 address by "::ffff", after this operation, the match function try lookup in the IPv6 tree. If the IPv6 sample dont match the IPv6 tree, try to convert the IPv6 addresses prefixed by "2002:IPv4", "::ffff:IPv4" and "::0000:IPv4" in IPv4 address. after this operation, the match function try lookup in the IPv4 tree.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	5338eea8eb	MEDIUM: pattern: The match function browse itself the list or the tree. The match function known the format of the pattern. The pattern can be stored in a list or in a tree. The pattern matching function use itself the good entry point and indexation type. Each pattern matching function return the struct pattern that match. If the flag "fill" is set, the struct pattern is filled, otherwise the content of this struct must not be used. With this feature, the general pattern matching function cannot have exceptions for building the "struct pattern".	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	1794fdf37e	MEDIUM: pattern: The function pattern_exec_match() returns "struct pattern" if the patten match. Before this commit, the pattern_exec_match() function returns the associate sample, the associate struct pattern or the associate struct pattern_tree. This is complex to use, because we can check the type of information returned. Now the function return always a "struct pattern". If <fill> is not set, only the value of the pointer can be used as boolean (NULL or other). If <fill> is set, you can use the <smp> pointer and the pattern information. If information must be duplicated, it is stored in trash buffer. Otherwise, the pattern can point on existing strings.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	d437314979	MEDIUM: sample/http_proto: Add new type called method The method are actuelly stored using two types. Integer if the method is known and string if the method is not known. The fetch is declared as UINT, but in some case it can provides STR. This patch create new type called METH. This type contain interge for known method and string for the other methods. It can be used with automatic converters. The pattern matching can expect method. During the free or prune function, http_meth pettern is freed. This patch initialise the freed pointer to NULL.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	7654c9ff44	MEDIUM: sample: Remove types SMP_T_CSTR and SMP_T_CBIN, replace it by SMP_F_CONST flags The operations applied on types SMP_T_CSTR and SMP_T_STR are the same, but the check code and the declarations are double, because it must declare action for SMP_T_C* and SMP_T_. The declared actions and checks are the same. this complexify the code. Only the "conv" functions can change from "C" to "*" Now, if a function needs to modify input string, it can call the new function smp_dup(). This one duplicate data in a trash buffer.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	b050463375	MINOR: standard: Add function for converting cidr to network mask.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	0e9af55700	MINOR: sample: dont call the sample cast function "c_none" If the cast function to execute is c_none, dont execute it and return true. The function c_none, do nothing. This save a call.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	e1bcac5b8f	MINOR: pattern: Rename "pat_idx_elt" to "pattern_tree" This is just for having coherent struct names.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	edc15c3a35	MEDIUM: pattern: The parse functions just return "struct pattern" without memory allocation The pattern parse functions put the parsed result in a "struct pattern" without memory allocation. If the pattern must reference the input data without changes, the pattern point to the parsed string. If buffers are needed to store translated data, it use th trash buffer. The indexation function that allocate the memory later if it is needed.	2014-03-17 18:06:07 +01:00
Thierry FOURNIER	b9b08460a2	MEDIUM: pattern: add indexation function. Before this patch, the indexation function check the declared patttern matching function and index the data according with this function. This is not useful to add some indexation mode. This commit adds dedicated indexation function. Each struct pattern is associated with one indexation function. This function permit to index data according with the type of pattern and with the type of match.	2014-03-17 18:06:06 +01:00
Willy Tarreau	1cf8f08c17	MINOR: sample: move smp_to_type to sample.c This way it can be exported and reused anywhere else to report type names.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	3ead5b93c6	MINOR: pattern: separe list element from the data part. This commit separes the "struct list" used for the chain the "struct pattern" which contain the pattern data. Later, this change will permit to manipulate lists ans trees with the same "struct pattern".	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	972028fa67	MEDIUM: pattern: Change the prototype of the function pattern_register(). Each pattern parser take only one string. This change is reported to the function prototype of the function "pattern_register()". Now, it is called with just one string and no need to browse the array of args.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	580c32cb3a	MEDIUM: pattern: The pattern parser no more uses <opaque> and just takes one string. After the previous patches, the "pat_parse_strcat()" function disappear, and the "pat_parse_int()" and "pat_parse_dotted_ver()" functions dont use anymore the "opaque" argument, and take only one string on his input. So, after this patch, each pattern parser no longer use the opaque variable and take only one string as input. This patch change the prototype of the pattern parsing functions. Now, the "char *args" is replaced by a "char arg", the "int *opaque" is removed and these functions return 1 in succes case, and 0 if fail.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	511e9475f2	MEDIUM: acl/pattern: standardisation "of pat_parse_int()" and "pat_parse_dotted_ver()" The goal of these patch is to simplify the prototype of "pat_pattern_()" functions. I want to replace the argument "char args" by a simple "char arg" and remove the "opaque" argument. "pat_parse_int()" and "pat_parse_dotted_ver()" are the unique pattern parser using the "opaque" argument and using more than one string argument of the char **args. These specificities are only used with ACL. Other systems using this pattern parser (MAP and CLI) just use one string for describing a range. This two functions can read a range, but the min and the max must y specified. This patch extends the syntax to describe a range with implicit min and max. This is used for operators like "lt", "le", "gt", and "ge". the syntax is the following: ":x" -> no min to "x" "x:" -> "x" to no max This patch moves the parsing of the comparison operator from the functions "pat_parse_int()" and "pat_parse_dotted_ver()" to the acl parser. The acl parser read the operator and the values and build a volatile string readable by the functions "pat_parse_int()" and "pat_parse_dotted_ver()". The transformation is done with these rules: If the parser is "pat_parse_int()": "eq x" -> "x" "le x" -> ":x" "lt x" -> ":y" (with y = x - 1) "ge x" -> "x:" "gt x" -> "y:" (with y = x + 1) If the parser is "pat_parse_dotted_ver()": "eq x.y" -> "x.y" "le x.y" -> ":x.y" "lt x.y" -> ":w.z" (with w.z = x.y - 1) "ge x.y" -> "x.y:" "gt x.y" -> "w.z:" (with w.z = x.y + 1) Note that, if "y" is not present, assume that is "0". Now "pat_parse_int()" and "pat_parse_dotted_ver()" accept only one pattern and the variable "opaque" is no longer used. The prototype of the pattern parsers can be changed.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	9eec0a646b	MAJOR: auth: Change the internal authentication system. This patch remove the limit of 32 groups. It also permit to use standard "pat_parse_str()" function in place of "pat_parse_strcat()". The "pat_parse_strcat()" is no longer used and its removed. Before this patch, the groups are stored in a bitfield, now they are stored in a list of strings. The matching is slower, but the number of groups is low and generally the list of allowed groups is short. The fetch function "smp_fetch_http_auth_grp()" used with the name "http_auth_group" return valid username. It can be used as string for displaying the username or with the acl "http_auth_group" for checking the group of the user. Maybe the names of the ACL and fetch methods are no longer suitable, but I keep the current names for conserving the compatibility with existing configurations. The function "userlist_postinit()" is created from verification code stored in the big function "check_config_validity()". The code is adapted to the new authentication storage system and it is moved in the "src/auth.c" file. This function is used to check the validity of the users declared in groups and to check the validity of groups declared on the "user" entries. This resolve function is executed before the check of all proxy because many acl needs solved users and groups.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	d048d8b891	BUG/MINOR: http: fix encoding of samples used in http headers The binary samples are sometimes copied as is into http headers. A sample can contain bytes unallowed by the http rfc concerning header content, for example if it was extracted from binary data. The resulting http request can thus be invalid. This issue does not yet happen because haproxy currently (mistakenly) hex-encodes binary data, so it is not really possible to retrieve invalid HTTP chars. The solution consists in hex-encoding all non-printable chars prefixed by a '%' sign. No backport is needed since existing code is not affected yet.	2014-03-17 16:39:03 +01:00
Thierry FOURNIER	e059ec9393	MINOR: standard: add function "encode_chunk" This function has the same behavior as encode_string(), except it takes a "struct chunk" instead of a "char *" on input.	2014-03-17 16:38:56 +01:00
Willy Tarreau	f79d950163	MEDIUM: proxy: create a tree to store proxies by name Large configurations can take time to parse when thousands of backends are in use. Let's store all the proxies in trees. findproxy_mode() has been modified to use the tree for lookups, which has divided the parsing time by about 2.5. But many lookups are still present at many places and need to be dealt with.	2014-03-15 07:48:35 +01:00
Willy Tarreau	80a92c02f4	BUG/MEDIUM: http: don't start to forward request data before the connect Currently, "balance url_param check_post" randomly works. If the client sends chunked data and there's another chunk after the one containing the data, http_request_forward_body() will advance msg->sov and move the start of data to the beginning of the last chunk, and get_server_ph_post() will not find the data. In order to avoid this, we add an HTTP_MSGF_WAIT_CONN flag whose goal is to prevent the forwarding code from parsing until the connection is confirmed, so that we're certain not to fail on a redispatch. Note that we need to force channel_auto_connect() since the output buffer is empty and a previous analyser might have stopped auto-connect. The flag is currently set whenever some L7 POST analysis is needed for a connect() so that it correctly addresses all corner cases involving a possible rewind of the buffer, waiting for a better fix. Note that this has been broken for a very long time. Even all 1.4 versions seem broken but differently, with ->sov pointing to the end of the arguments. So the fix should be considered for backporting to all stable releases, possibly including 1.3 which works differently.	2014-03-14 12:22:56 +01:00
Willy Tarreau	36346247ac	BUG/MEDIUM: http: continue to emit 503 on keep-alive to different server Finn Arne Gangstad reported that commit `6b726adb35` ("MEDIUM: http: do not report connection errors for second and further requests") breaks support for serving static files by abusing the errorfile 503 statement. Indeed, a second request over a connection sent to any server or backend returning 503 would silently be dropped. The proper solution consists in adding a flag on the session indicating that the server connection was reused, and to only avoid the error code in this case.	2014-02-24 18:26:30 +01:00
Willy Tarreau	7e3127391f	MINOR: config: make the stream interface idle timer user-configurable The new tune.idletimer value allows one to set a different value for idle stream detection. The default value remains set to one second. It is possible to disable it using zero, and to change the default value at build time using DEFAULT_IDLE_TIMER.	2014-02-12 16:36:12 +01:00
Willy Tarreau	b145c78623	MINOR: channel: add the date of last read in the channel We store the time stamp of last read in the channel in order to be able to measure some bit rate and pause lengths. We only use 16 bits which were unused for this. We don't need more, as it allows us to measure with a millisecond precision for up to 65s.	2014-02-12 11:45:59 +01:00
Willy Tarreau	8f39dcdc8d	BUG/MINOR: channel: initialize xfer_small/xfer_large on new buffers These ones are only reset during transfers. There is a low but non-null risk that a first full read causes the previous value to be reused and immediately to immediately set the CF_STREAMER flag. The impact is only to increase earlier than expected the SSL record size and to use splice(). This bug was already present in 1.4, so a backport is possible.	2014-02-12 11:45:45 +01:00
Bhaskar Maddala	a20cb85eba	MINOR: stats: Enhancement to stats page to provide information of last session time. Summary: Track and report last session time on the stats page for each server in every backend, as well as the backend. This attempts to address the requirement in the ROADMAP - add a last activity date for each server (req/resp) that will be displayed in the stats. It will be useful with soft stop. The stats page reports this as time elapsed since last session. This change does not adequately address the requirement for long running session (websocket, RDP... etc).	2014-02-08 01:19:58 +01:00
Willy Tarreau	7bed945be0	OPTIM: ssl: implement dynamic record size adjustment By having the stream interface pass the CF_STREAMER flag to the snd_buf() primitive, we're able to tell the send layer whether we're sending large chunks or small ones. We use this information in SSL to adjust the max record dynamically. This results in small chunks respecting tune.ssl.maxrecord at the beginning of a transfer or for small transfers, with an automatic switch to full records if the exchanges last long. This allows the receiver to parse HTML contents on the fly without having to retrieve 16kB of data, which is even more important with small initcwnd since the receiver does not need to wait for round trips to start fetching new objects. However, sending large files still produces large chunks. For example, with tune.ssl.maxrecord = 2859, we see 5 write(2885) sent in two segments each and 6 write(16421). This idea was first proposed on the haproxy mailing list by Ilya Grigorik.	2014-02-06 11:37:29 +01:00
Willy Tarreau	1049b1f551	MEDIUM: connection: don't use real send() flags in snd_buf() This prevents us from passing other useful info and requires the upper levels to know these flags. Let's use a new flags category instead : CO_SFL_*. For now, only MSG_MORE has been remapped.	2014-02-06 11:37:29 +01:00
Baptiste Assmann	69e273f3fc	MEDIUM: tcp-check new feature: connect A new tcp-check rule type: connect. It allows HAProxy to test applications which stand on multiple ports or multiple applications load-balanced through the same backend.	2014-02-03 00:24:11 +01:00
Willy Tarreau	70dffdaa10	MAJOR: http: switch to keep-alive mode by default Since we support HTTP keep-alive, there is no more reason for staying in tunnel mode by default. It is confusing for new users and creates more issues than it solves. Option "http-tunnel" is available to force to use it if really desired. Switching to KA by default has implied to change the value of some option flags and some transaction flags so that value zero (default) matches keep-alive. That explains why more code has been changed than expected. Tests have been run on the 25 combinations of frontend and backend options, plus a few with option http-pretend-keepalive, and no anomaly was found. The relation between frontend and backends remains the same. Options have been updated to take precedence over http-keep-alive which is now implicit. All references in the doc to haproxy not supporting keep-alive have been fixed, and the doc for config options has been updated.	2014-01-30 03:14:29 +01:00
Willy Tarreau	02bce8be01	MAJOR: http: update connection mode configuration At the very beginning of haproxy, there was "option httpclose" to make haproxy add a "Connection: close" header in both directions to invite both sides to agree on closing the connection. It did not work with some rare products, so "option forceclose" was added to do the same and actively close the connection. Then client-side keep-alive was supported, so option http-server-close was introduced. Now we have keep-alive with a fourth option, not to mention the implicit tunnel mode. The connection configuration has become a total mess because all the options above may be combined together, despite almost everyone thinking they cancel each other, as judging from the common problem reports on the mailing list. Unfortunately, re-reading the doc shows that it's not clear at all that options may be combined, and the opposite seems more obvious since they're compared. The most common issue is options being set in the defaults section that are not negated in other sections, but are just combined when the user expects them to be overloaded. The migration to keep-alive by default will only make things worse. So let's start to address the first problem. A transaction can only work in 5 modes today : - tunnel : haproxy doesn't bother with what follows the first req/resp - passive close : option http-close - forced close : option forceclose - server close : option http-server-close with keep-alive on the client side - keep-alive : option http-keep-alive, end to end All 16 combination for each section fall into one of these cases. Same for the 256 combinations resulting from frontend+backend different modes. With this patch, we're doing something slightly different, which will not change anything for users with valid configs, and will only change the behaviour for users with unsafe configs. The principle is that these options may not combined anymore, and that the latest one always overrides all the other ones, including those inherited from the defaults section. The "no option xxx" statement is still supported to cancel one option and fall back to the default one. It is mainly needed to ignore defaults sections (eg: force the tunnel mode). The frontend+backend combinations have not changed. So for examplen the following configuration used to put the connection into forceclose : defaults http mode http option httpclose frontend foo. option http-server-close => http-server-close+httpclose = forceclose before this patch! Now the frontend's config replaces the defaults config and results in the more expected http-server-close. All 25 combinations of the 5 modes in (frontend,backend) have been successfully tested. In order to prepare for upcoming changes, a new "option http-tunnel" was added. It currently only voids all other options, and has the lowest precedence when mixed with another option in another frontend/backend.	2014-01-30 03:14:29 +01:00
Emeric Brun	850efd5149	MEDIUM: ssl: Set verify 'required' as global default for servers side. If no CA file specified on a server line, the config parser will show an error. Adds an cmdline option '-dV' to re-set verify 'none' as global default on servers side (previous behavior). Also adds 'ssl-server-verify' global statement to set global default to 'none' or 'required'. WARNING: this changes the default verify mode from "none" to "required" on the server side, and it will break insecure setups.	2014-01-29 17:08:15 +01:00
Willy Tarreau	cc08d2c9ff	MEDIUM: counters: stop relying on session flags at all Till now, we had one flag per stick counter to indicate if it was tracked in a backend or in a frontend. We just had to add another flag per stick-counter to indicate if it relies on contents or just connection. These flags are quite painful to maintain and tend to easily conflict with other flags if their number is changed. The correct solution consists in moving the flags to the stkctr struct itself, but currently this struct is made of 2 pointers, so adding a new entry there to store only two bits will cause at least 16 more bytes to be eaten per counter due to alignment issues, and we definitely don't want to waste tens to hundreds of bytes per session just for things that most users don't use. Since we only need to store two bits per counter, an intermediate solution consists in replacing the entry pointer with a composite value made of the original entry pointer and the two flags in the 2 unused lower bits. If later a need for other flags arises, we'll have to store them in the struct. A few inline functions have been added to abstract the retrieval and assignment of the pointers and flags, resulting in very few changes. That way there is no more dependence on the number of stick-counters and their position in the session flags.	2014-01-28 23:34:45 +01:00
Willy Tarreau	bb519c7cd1	MINOR: tools: add very basic support for composite pointers Very often we want to associate one or two flags to a pointer, to put a type on it or whatever. This patch provides this in standard.h in the form of a few inline functions which combine a void * pointer with an int and return an unsigned long called a composite address. The functions allow to individuall set, retrieve both the pointer and the flags. This is very similar to what is used in ebtree in fact.	2014-01-28 23:34:45 +01:00
Willy Tarreau	f3338349ec	BUG/MEDIUM: counters: flush content counters after each request One year ago, commit `5d5b5d8` ("MEDIUM: proto_tcp: add support for tracking L7 information") brought support for tracking L7 information in tcp-request content rules. Two years earlier, commit `0a4838c` ("[MEDIUM] session-counters: correctly unbind the counters tracked by the backend") used to flush the backend counters after processing a request. While that earliest patch was correct at the time, it became wrong after the second patch was merged. The code does what it says, but the concept is flawed. "TCP request content" rules are evaluated for each HTTP request over a single connection. So if such a rule in the frontend decides to track any L7 information or to track L4 information when an L7 condition matches, then it is applied to all requests over the same connection even if they don't match. This means that a rule such as : tcp-request content track-sc0 src if { path /index.html } will count one request for index.html, and another one for each of the objects present on this page that are fetched over the same connection which sent the initial matching request. Worse, it is possible to make the code do stupid things by using multiple counters: tcp-request content track-sc0 src if { path /foo } tcp-request content track-sc1 src if { path /bar } Just sending two requests first, one with /foo, one with /bar, shows twice the number of requests for all subsequent requests. Just because both of them persist after the end of the request. So the decision to flush backend-tracked counters was not the correct one. In practice, what is important is to flush countent-based rules since they are the ones evaluated for each request. Doing so requires new flags in the session however, to keep track of which stick-counter was tracked by what ruleset. A later change might make this easier to maintain over time. This bug is 1.5-specific, no backport to stable is needed.	2014-01-28 21:40:28 +01:00
Willy Tarreau	12833bbca5	MINOR: cli: add the new "show pools" command show pools Dump the status of internal memory pools. This is useful to track memory usage when suspecting a memory leak for example. It does exactly the same as the SIGQUIT when running in foreground except that it does not flush the pools.	2014-01-28 16:50:35 +01:00
Willy Tarreau	91b843d0d2	REORG: stats: move the stats socket states to dumpstats.c There is no more usage of these values outside of dumpstats.c, and they're easier to maintain there. Also replace the #defines with an enum.	2014-01-28 16:28:21 +01:00
Willy Tarreau	e43d5323c6	MEDIUM: listener: apply a limit on the session rate submitted to SSL Just like the previous commit, we sometimes want to limit the rate of incoming SSL connections. While it can be done for a frontend, it was not possible for a whole process, which makes sense when multiple processes are running on a system to server multiple customers. The new global "maxsslrate" setting is usable to fix a limit on the session rate going to the SSL frontends. The limits applies before the SSL handshake and not after, so that it saves the SSL stack from expensive key computations that would finally be aborted before being accounted for. The same setting may be changed at run time on the CLI using "set rate-limit ssl-session global".	2014-01-28 15:50:10 +01:00
Willy Tarreau	93e7c006c1	MEDIUM: listener: add support for limiting the session rate in addition to the connection rate It's sometimes useful to be able to limit the connection rate on a machine running many haproxy instances (eg: per customer) but it removes the ability for that machine to defend itself against a DoS. Thus, better also provide a limit on the session rate, which does not include the connections rejected by "tcp-request connection" rules. This permits to have much higher limits on the connection rate without having to raise the session rate limit to insane values. The limit can be changed on the CLI using "set rate-limit sessions global", or in the global section using "maxsessrate".	2014-01-28 15:49:27 +01:00
Willy Tarreau	71b734c307	MINOR: cli: add more information to the "show info" output In addition to previous outputs, we also emit the cumulated number of connections, the cumulated number of requests, the maximum allowed SSL connection concurrency, the current number of SSL connections and the cumulated number of SSL connections. This will help troubleshoot systems which experience memory shortage due to SSL.	2014-01-28 15:19:44 +01:00
Willy Tarreau	25002d206b	MINOR: polling: create function fd_compute_new_polled_status() This function is used to compute the new polling state based on the previous state. All pollers have to do this in their update loop, so better centralize the logic for it.	2014-01-26 00:42:32 +01:00
Willy Tarreau	e852545594	MEDIUM: polling: centralize polled events processing Currently, each poll loop handles the polled events the same way, resulting in a lot of duplicated, complex code. Additionally, epoll was the only one to handle newly created FDs immediately. So instead, let's move that code to fd.c in a new function dedicated to this task : fd_process_polled_events(). All pollers now use this function.	2014-01-26 00:42:32 +01:00
Willy Tarreau	6c11bd2f89	OPTIM: raw-sock: don't speculate after a short read if polling is enabled This is the reimplementation of the "done" action : when we experience a short read, we're almost certain that we've exhausted the system's buffers and that we'll meet an EAGAIN if we attempt to read again. If the FD is not yet polled, the stream interface already takes care of stopping the speculative read. When the FD is already being polled, we have two options : - either we're running from a level-triggered poller, in which case we'd rather report that we've reached the end so that we don't speculate over the poller and let it report next time data are available ; - or we're running from an edge-triggered poller in which case we have no choice and have to see the EAGAIN to re-enable events. At the moment we don't have any edge-triggered poller, so it's desirable to avoid speculative I/O that we know will fail. Note that this must not be ported to SSL since SSL hides the real readiness of the file descriptor. Thanks to this change, we observe no EAGAIN anymore during keep-alive transfers, and failed recvfrom() are reduced by half in http-server-close mode (the client-facing side is always being polled and the second recv can be avoided). Doing so results in about 5% performance increase in keep-alive mode. Similarly, we used to have up to about 1.6% of EAGAIN on accept() (1/maxaccept), and these have completely disappeared under high loads.	2014-01-26 00:42:32 +01:00
Willy Tarreau	baf5b9b445	CLEANUP: connection: fix comments in connection.h to reflect new behaviour. The polling has substantially changed, better fix the comments.	2014-01-26 00:42:31 +01:00
Willy Tarreau	aad69387ac	CLEANUP: connection: use conn_xprt_ready() instead of checking the flag It's easier and safer to rely on conn_xprt_ready() everywhere than to check the flag itself. It will also simplify adding extra checks later if needed. Some useless controls for !xprt have been removed, as the XPRT_READY flag itself guarantees xprt is set.	2014-01-26 00:42:31 +01:00
Willy Tarreau	3c72872da1	CLEANUP: connection: use conn_ctrl_ready() instead of checking the flag It's easier and safer to rely on conn_ctrl_ready() everywhere than to check the flag itself. It will also simplify adding extra checks later if needed. Some useless controls for !ctrl have been removed, as the CTRL_READY flag itself guarantees ctrl is set.	2014-01-26 00:42:31 +01:00
Willy Tarreau	e1f50c4b02	MEDIUM: connection: remove conn_{data,sock}_poll_{recv,send} We simply remove these functions and replace their calls with the appropriate ones : - if we're in the data phase, we can simply report wait on the FD - if we're in the socket phase, we may also have to signal the desire to read/write on the socket because it might not be active yet.	2014-01-26 00:42:30 +01:00
Willy Tarreau	310987a038	MAJOR: connection: remove the CO_FL_WAIT_{RD,WR} flags These flags were used to report the readiness of the file descriptor. Now this readiness is directly checked at the file descriptor itself. This removes the need for constantly synchronizing updates between the file descriptor and the connection and ensures that all layers share the same level of information. For now, the readiness is updated in conn_{sock,data}_poll_* by directly touching the file descriptor. This must move to the lower layers instead so that these functions can disappear as well. In this state, the change works but is incomplete. It's sensible enough to avoid making it more complex. Now the sock/data updates become much simpler because they just have to enable/disable access to a file descriptor and not to care anymore about its readiness.	2014-01-26 00:42:30 +01:00
Willy Tarreau	f817e9f473	MAJOR: polling: rework the whole polling system This commit heavily changes the polling system in order to definitely fix the frequent breakage of SSL which needs to remember the last EAGAIN before deciding whether to poll or not. Now we have a state per direction for each FD, as opposed to a previous and current state previously. An FD can have up to 8 different states for each direction, each of which being the result of a 3-bit combination. These 3 bits indicate a wish to access the FD, the readiness of the FD and the subscription of the FD to the polling system. This means that it will now be possible to remember the state of a file descriptor across disable/enable sequences that generally happen during forwarding, where enabling reading on a previously disabled FD would result in forgetting the EAGAIN flag it met last time. Several new state manipulation functions have been introduced or adapted : - fd_want_{recv,send} : enable receiving/sending on the FD regardless of its state (sets the ACTIVE flag) ; - fd_stop_{recv,send} : stop receiving/sending on the FD regardless of its state (clears the ACTIVE flag) ; - fd_cant_{recv,send} : report a failure to receive/send on the FD corresponding to EAGAIN (clears the READY flag) ; - fd_may_{recv,send} : report the ability to receive/send on the FD as reported by poll() (sets the READY flag) ; Some functions are used to report the current FD status : - fd_{recv,send}_active - fd_{recv,send}_ready - fd_{recv,send}_polled Some functions were removed : - fd_ev_clr(), fd_ev_set(), fd_ev_rem(), fd_ev_wai() The POLLHUP/POLLERR flags are now reported as ready so that the I/O layers knows it can try to access the file descriptor to get this information. In order to simplify the conditions to add/remove cache entries, a new function fd_alloc_or_release_cache_entry() was created to be used from pollers while scanning for updates. The following pollers have been updated : ev_select() : done, built, tested on Linux 3.10 ev_poll() : done, built, tested on Linux 3.10 ev_epoll() : done, built, tested on Linux 3.10 & 3.13 ev_kqueue() : done, built, tested on OpenBSD 5.2	2014-01-26 00:42:30 +01:00
Willy Tarreau	033cd9d78c	REORG: polling: rename "fd_process_spec_events()" to "fd_process_cached_events()" This is in order to be coherent with the rest.	2014-01-26 00:42:29 +01:00
Willy Tarreau	899d95757e	REORG: polling: rename the cache allocation functions - alloc_spec_entry() becomes fd_alloc_cache_entry() - release_spec_entry() becomes fd_release_cache_entry()	2014-01-26 00:42:29 +01:00
Willy Tarreau	16f649c82c	REORG: polling: rename "fd_spec" to "fd_cache" So fd_spec was renamed "fd_cache" as it's becoming an event cache, and fd_nbspec becomes fd_cache_num.	2014-01-26 00:42:29 +01:00
Willy Tarreau	15a4dec87e	REORG: polling: rename "spec_e" to "state" and "spec_p" to "cache" We're completely changing the way FDs will be polled. There will be no more speculative I/O since we'll know the exact FD state, so these will only be cached events. First, let's fix a few field names which become confusing. "spec_e" was used to store a speculative I/O event state. Now we'll store the whole R/W states for the FD there. "spec_p" was used to store a speculative I/O cache position. Now let's clearly call it "cache".	2014-01-26 00:42:29 +01:00
Willy Tarreau	69a41fa8a3	CLEANUP: polling: rename "spec_e" to "state" We're completely changing the way FDs will be polled. First, let's fix a few field names which become confusing. "spec_e" was used to store a speculative I/O event state. Now we'll store the whole R/W states for the FD there.	2014-01-26 00:42:28 +01:00
Willy Tarreau	1f0da2485e	BUG/MEDIUM: unique_id: HTTP request counter is not stable Patrick Hemmer reported that using unique_id_format and logs did not report the same unique ID counter since commit `9f09521` ("BUG/MEDIUM: unique_id: HTTP request counter must be unique!"). This is because the increment was done while producing the log message, so it was performed twice. A better solution consists in fetching a new value once per request and saving it in the request or session context for all of this request's life. It happens that sessions already have a unique ID field which is used for debugging and reporting errors, and which differs from the one sent in logs and unique_id header. So let's change this to reuse this field to have coherent IDs everywhere. As of now, a session gets a new unique ID once it is instanciated. This means that TCP sessions will also benefit from a unique ID that can be logged. And this ID is renewed for each extra HTTP request received on an existing session. Thus, all TCP sessions and HTTP requests will have distinct IDs that will be stable along all their life, and coherent between all places where they're used (logs, unique_id header, "show sess", "show errors"). This feature is 1.5-specific, no backport to 1.4 is needed.	2014-01-25 11:07:06 +01:00
Willy Tarreau	45b34e8abc	MINOR: connection: add more error codes to report connection errors It is quite often that an connection error only reports "socket error" with no more information. This is especially problematic with health checks where many causes are possible, including resource exhaustion which do not lead to a valid errno code. So let's add explicit codes to cover these cases.	2014-01-24 16:15:04 +01:00
Thierry FOURNIER	e7ba23633b	MINOR: pattern: move functions for grouping pat_match_* and pat_parse_* and add documentation.	2014-01-21 22:14:21 +01:00
Willy Tarreau	2aefad5df7	MINOR: connection: add a new conn_drain() function Till now there was no way to know from a connection if a previous call to drain() had done any change. This function is used to drain incoming data and to update the connection's flags at the same time. It also correctly sets the polling flags on the connection if the drain function indicates inability to receive. This function will be used preferably over ctrl->drain() when a connection is used.	2014-01-20 22:27:16 +01:00
Willy Tarreau	2317976daa	BUILD: listener: fix recent accept4() again Recent commit `4448925` ("BUILD/MINOR: listener: remove a glibc warning on accept4()") broke accept4() on some systems because the glibc's version may now conflict with the local one.	2014-01-15 16:45:17 +01:00
Willy Tarreau	8663105095	BUG: Revert "OPTIM: poll: restore polling after a poll/stop/want sequence" This reverts commit `1208266356`. It randomly breaks SSL. What happens is that if the SSL response is read at once by the SSL stack and is partially delivered to the buffer, then there's no way to read the next parts because we wait for some polling first. So we'll fix this after the polling rework.	2014-01-13 11:34:42 +01:00
Willy Tarreau	9fe7aae6eb	MINOR: checks: use an inline function for health_adjust() This function is called twice per request, and does almost always nothing. Better use an inline version to avoid entering it when we can. About 0.5% additional performance was gained this way.	2013-12-31 23:47:37 +01:00
Willy Tarreau	9e5a3aacf4	MEDIUM: stream-int: make si_connect() return an established state when possible si_connect() used to only return SI_ST_CON. But it already detect the connection reuse and is the function which avoids calling connect(). So it already knows the connection is valid and reuse. Thus we make it return SI_ST_EST when a connection is reused. This means that connect_server() can return this state and sess_update_stream_int() as well. Thanks to this change, we don't need to leave process_session() in SI_ST_CON state to immediately enter it again to switch to SI_ST_EST. Implementing this removes one call to process_session() per request in keep-alive mode. We're now at 2 calls per request, which is the minimum (one for the request and another one for the response). The number of calls to http_wait_for_response() has also dropped from 2 to one. Tests indicate a performance gain of about 2.6% in request rate in keep-alive mode. There should be no gain in http-server-close() since we don't use this faster path.	2013-12-31 23:32:12 +01:00
Willy Tarreau	d7ad9f5b0d	MAJOR: channel: add a new flag CF_WAKE_WRITE to notify the task of writes Since commit `6b66f3e` ([MAJOR] implement autonomous inter-socket forwarding) introduced in 1.3.16-rc1, we've been relying on a stupid mechanism to wake up the task after a write, which was an exact copy-paste of the reader side. The principle was that if we empty a buffer and there's no forwarding scheduled or if the producer is not in a connected state, then we wake the task up. That does not make any sense. It happens to wake up too late sometimes (eg, when the request analyser waits for some room in the buffer to start to work), and leads to unneeded wakeups in client-side keep-alive, because the task is woken up when the response is sent, while the analysers are simply waiting for a new request. In order to fix this, we introduce a new channel flag : CF_WAKE_WRITE. It is designed so that an analyser can explicitly request being notified when some data were written. It is used only when the HTTP request or response analysers need to wait for more room in the buffers. It is automatically cleared upon wake up. The flag is also automatically set by the functions which try to write into a buffer from an applet when they fail (bi_putblk() etc...). That allows us to remove the stupid condition above and avoid some wakeups. In http-server-close and in http-keep-alive modes, this reduces from 4 to 3 the average number of wakeups per request, and increases the overall performance by about 1.5%.	2013-12-31 18:37:36 +01:00
Willy Tarreau	51437d2c59	Revert "MEDIUM: stats: add support for HTTP keep-alive on the stats page" This reverts commit `f3221f99ac`. Igor reported some very strange breakage of his stats page which is clearly caused by the chunking, though I don't see at first glance what could be wrong. Better revert it for now.	2013-12-29 00:43:40 +01:00
Willy Tarreau	f3221f99ac	MEDIUM: stats: add support for HTTP keep-alive on the stats page In theory the principle is simple as we just need to send HTTP chunks if the client is 1.1 compatible. In practice it's harder because we have to append a CR LF after each block of data and we're never sure to have the room for this. In order not to have to deal with this, we instead send the CR LF prior to each chunk size. The only issue is for the first chunk and for this reason we avoid to send the empty header line when using chunked encoding.	2013-12-28 21:40:16 +01:00
Willy Tarreau	983eb31fd1	BUG/MINOR: channel: CHN_INFINITE_FORWARD must be unsigned This value is stored as unsigned in chn->to_forward. Having it defined as signed makes it impossible to pass channel_forward() a previously saved value because the argument will be zero-extended during the conversion to long long, while the test will be performed using sign extension. There is no impact on existing code right now.	2013-12-28 21:33:37 +01:00
Willy Tarreau	1208266356	OPTIM: poll: restore polling after a poll/stop/want sequence If a file descriptor is being polled, and it stopped (eg: buffer full or end of response), then re-enabled, currently what happens is that the polling is disabled, then the fd is enabled in speculative mode, an I/O attempt is made, it loses (otherwise the FD would surely not have been polled), and the polled is enabled again. This is too bad, especially with HTTP keep-alive on the server side where all operations are performed at once before going back to the poll loop. Now we improve the behaviour by ensuring that if an fd is still being polled, when it's enabled after having been disabled, we re-enable the polling. Doing so saves a number of syscalls and useless wakeups, and results in a significant performance gain on HTTP keep-alive. A 11% increase has been observed on the HTTP request rate in keep-alive thanks to this. It could be considered as a bug fix, but there was no harm with the current behaviour, except extra syscalls.	2013-12-27 20:18:52 +01:00
Willy Tarreau	068621e4ad	MINOR: http: try to stick to same server after status 401/407 In HTTP keep-alive mode, if we receive a 401, we still have a chance of being able to send the visitor again to the same server over the same connection. This is required by some broken protocols such as NTLM, and anyway whenever there is an opportunity for sending the challenge to the proper place, it's better to do it (at least it helps with debugging).	2013-12-23 15:12:44 +01:00
Willy Tarreau	2737562e43	MEDIUM: stream-int: implement a very simplistic idle connection manager Idle connections are not monitored right now. So if a server closes after a response without advertising it, it won't be detected until a next request wants to use the connection. This is a bit problematic because it unnecessarily maintains file descriptors and sockets in an idle state. This patch implements a very simple idle connection manager for the stream interface. It presents itself as an I/O callback. The HTTP engine enables it when it recycles a connection. If a close or an error is detected on the underlying socket, it tries to drain as much data as possible from the socket, detect the close and responds with a close as well, then detaches from the stream interface.	2013-12-17 00:00:28 +01:00
Willy Tarreau	e38feed966	BUG/MINOR: stats: correctly report throttle rate of low weight servers The throttling of low weight servers (<16) could mistakenly be reported as > 100% due to a rounding that was performed before a multiply by 100 instead of after. This was introduced in 1.5-dev20 when fixing a previous reporting issue by commit `d32c399` (MINOR: stats: report correct throttling percentage for servers in slowstart). It should be backported if the patch above is backported.	2013-12-16 18:04:57 +01:00
Willy Tarreau	9420b1271d	MINOR: http: add option prefer-last-server When the load balancing algorithm in use is not deterministic, and a previous request was sent to a server to which haproxy still holds a connection, it is sometimes desirable that subsequent requests on a same session go to the same server as much as possible. Note that this is different from persistence, as we only indicate a preference which haproxy tries to apply without any form of warranty. The real use is for keep-alive connections sent to servers. When this option is used, haproxy will try to reuse the same connection that is attached to the server instead of rebalancing to another server, causing a close of the connection. This can make sense for static file servers. It does not make much sense to use this in combination with hashing algorithms.	2013-12-16 02:23:54 +01:00
Willy Tarreau	b490b4e5ad	MAJOR: stream-int: handle the connection reuse in si_connect() This is the best place to reuse a connection. We centralize all connection requests and we're at the best place to know exactly what the current state of the underlying connection is. If the connection is reused, we just enable polling for send() in order to be able to emit the request.	2013-12-16 02:23:53 +01:00
Willy Tarreau	9471b8ced9	MEDIUM: connection: inform si_alloc_conn() whether existing conn is OK or not When allocating a new connection, only the caller knows whether it's acceptable to reuse the previous one or not. Let's pass this information to si_alloc_conn() which will do the cleanup if the connection is not acceptable.	2013-12-16 02:23:53 +01:00
Willy Tarreau	ad38acedaa	MEDIUM: connection: centralize handling of nolinger in fd management Right now we see many places doing their own setsockopt(SO_LINGER). Better only do it just before the close() in fd_delete(). For this we add a new flag on the file descriptor, indicating if it's safe or not to linger. If not (eg: after a connect()), then the setsockopt() call is automatically performed before a close(). The flag automatically turns to safe when receiving a read0.	2013-12-16 02:23:52 +01:00
Willy Tarreau	d02cdd23be	MINOR: connection: add simple functions to report connection readiness conn_xprt_ready() reports if the transport layer is ready. conn_ctrl_ready() reports if the control layer is ready. The stream interface uses si_conn_ready() to report that the underlying connection is ready. This will be used for connection reuse in keep-alive mode.	2013-12-16 02:23:52 +01:00
Willy Tarreau	3343432fcd	MINOR: checks: add a flag to indicate what check is an agent Currently to know if a check is an agent, we compare its pointer to its servers' agent pointer. Better have a flag in its state to indicate this.	2013-12-14 16:02:20 +01:00
Willy Tarreau	33a08db932	MINOR: checks: add a PAUSED state for the checks Health checks can now be paused. This is the status they get when the server is put into maintenance mode, which is more logical than relying on the server's state at some places. It will be needed to allow agent checks to run when health checks are disabled (currently not possible).	2013-12-14 16:02:20 +01:00
Willy Tarreau	ff5ae35b9f	MINOR: checks: use check->state instead of srv->state & SRV_CHECKED Having the check state partially stored in the server doesn't help. Some functions such as srv_getinter() rely on the server being checked to decide what check frequency to use, instead of relying on the check being configured. So let's get rid of SRV_CHECKED and SRV_AGENT_CHECKED and only use the check's states instead.	2013-12-14 16:02:19 +01:00
Willy Tarreau	2e10f5a759	MINOR: checks: replace state DISABLED with CONFIGURED and ENABLED At the moment, health checks and agent checks are tied : no agent check is emitted if no health check is enabled. Other parameters are considered in the condition for letting checks run. It will help us selectively enable checks (agent and regular checks) to be know whether they're enabled/disabled and configured or not. Now we can already emit an error when trying to enable an unconfigured agent.	2013-12-14 16:02:19 +01:00
Willy Tarreau	2c115e5047	MINOR: checks: rename the state flags The flag CHK_STATE_RUNNING is misleading as one may believe it means the state is enabled (just like SRV_RUNNING). Let's rename these two flags CHK_ST_INPROGRESS and CHK_ST_DISABLED.	2013-12-14 16:02:19 +01:00
Willy Tarreau	6aaa1b87cf	MINOR: checks: use an enum instead of flags to report a check result We used to have up to 4 sets of flags which were almost all exclusive to report a check result. And the names were inherited from the old server states, adding to the confusion. Let's replace that with an enum handling only the possible combinations : SRV_CHK_UNKNOWN => CHK_RES_UNKNOWN SRV_CHK_FAILED => CHK_RES_FAILED SRV_CHK_PASSED => CHK_RES_PASSED SRV_CHK_PASSED \| SRV_CHK_DISABLE => CHK_RES_CONDPASS	2013-12-14 16:02:19 +01:00
Willy Tarreau	8e85ad5211	REORG: checks: retrieve the check-specific defines from server.h to checks.h After the move of checks from servers to autonomous checks, we need a massive cleanup and reordering as it's becoming increasingly difficult to find the definitions of types and enums. Nothing was changed, blocks were just moved.	2013-12-14 16:02:18 +01:00
Willy Tarreau	1a53a3af13	MINOR: checks: improve handling of the servers tracking chain Server tracking uses the same "tracknext" list for servers tracking another one and for the servers being tracked. This caused an issue which was fixed by commit `f39c71c` ([CRITICAL] fix server state tracking: it was O(n!) instead of O(n)), consisting in ensuring that a server is being checked before walking down the list, so that we don't propagate the up/down information via servers being part of the track chain. But the root cause is the fact that all servers share the same list. The correct solution consists in having a list head for the tracked servers and a list of next tracking servers. This simplifies the propagation logic, especially for the case where status changes might be passed to individual servers via the CLI.	2013-12-14 16:02:18 +01:00
Willy Tarreau	89efaed6b6	BUILD: definitely silence some stupid GCC warnings It's becoming increasingly difficult to ignore unwanted function returns in debug code with gcc. Now even when you try to work around it, it suggests a way to write your code differently. For example : src/frontend.c:187:65: warning: if statement has empty body [-Wempty-body] if (write(1, trash.str, trash.len) < 0) /* shut gcc warning */; ^ src/frontend.c:187:65: note: put the semicolon on a separate line to silence this warning 1 warning generated. This is totally unacceptable, this code already had to be written this way to shut it up in earlier versions. And now it comments the form ? What's the purpose of the C language if you can't write anymore the code that does what you want ? Emeric proposed to just keep a global variable to drain such useless results so that gcc stops complaining all the time it believes people who write code are monkeys. The solution is acceptable because the useless assignment is done only in debug code so it will not impact performance. This patch implements this, until gcc becomes even "smarter" to detect that we tried to cheat.	2013-12-13 15:21:36 +01:00
Willy Tarreau	5f3f15f618	BUILD: time: adapt the type of TV_ETERNITY to the local system Some systems use different types for tv_sec/tv_usec, some are signed others not. From time to time new warnings are reported about implicit casts being done. This patch ensures that TV_ETERNITY is cast to the appropriate type in assignments and conversions.	2013-12-13 09:22:23 +01:00
Willy Tarreau	975c1784c8	MINOR: sample: make sample_parse_expr() use memprintf() to report parse errors Doing so ensures that we're consistent between all the functions in the whole chain. This is important so that we can extract the argument parsing from this function.	2013-12-12 23:16:54 +01:00
Thierry FOURNIER	c0e0d7b7cf	MEDIUM: map: dynamic manipulation of maps This patch adds map manipulation commands to the socket interface. add map <map> <key> <value> Add the value <value> in the map <map>, at the entry corresponding to the key <key>. This command does not verify if the entry already exists. clear map <map> Remove entries from the map <map> del map <map> <key> Delete all the map entries corresponding to the <key> value in the map <map>. set map <map> <key> <value> Modify the value corresponding to each key <key> in a map <map>. The new value is <value>. show map [<map>] Dump info about map converters. Without argument, the list of all available maps are returned. If a <map> is specified, is content is dumped.	2013-12-12 15:58:30 +01:00
Thierry FOURNIER	01cdcd4a62	MINOR: pattern: add function to lookup a specific entry in pattern list This is used to dynamically delete or update map entry.	2013-12-12 15:50:01 +01:00
Thierry FOURNIER	b0c0a0f940	MINOR: map: export parse output sample functions This export is used to identify the parser used	2013-12-12 15:44:05 +01:00
Thierry FOURNIER	7609064fc3	MINOR: pattern: make the pattern matching function return a pointer to the matched element This feature will be used by the CLI to look up keys.	2013-12-12 15:44:05 +01:00
Thierry FOURNIER	0b2fe4a5cd	MINOR: pattern: add support for compiling patterns for lookups With this patch, patterns can be compiled for two modes : - match - lookup The match mode is used for example in ACLs or maps. The lookup mode is used to lookup a key for pattern maintenance. For example, looking up a network is different from looking up one address belonging to this network. A special case is made for regex. In lookup mode they return the input regex string and do not compile the regex.	2013-12-12 15:44:02 +01:00
Thierry FOURNIER	39e258fcee	MINOR: regex: Copy the original regex expression into string. This is useful for the debug or for search regex in maps.	2013-12-12 15:43:34 +01:00
Thierry FOURNIER	799c042daa	MINOR: regex: Change the struct containing regex This change permits to remove the typedef. The original regex structs are set in haproxy's struct.	2013-12-12 15:42:58 +01:00
Thierry FOURNIER	7148ce6ef4	MEDIUM: pattern: Extract the index process from the pat_parse_() functions Now, the pat_parse_() functions parses the incoming data. The input "pattern" struct can be preallocated. If the parser needs to add some buffers, it allocates memory. The function pattern_register() runs the call to the parser, process the key indexation and associate the "sample_storage" used by maps.	2013-12-12 15:42:11 +01:00
Thierry FOURNIER	e3ded59706	MEDIUM: acl: Last patch change the output type This patch remove the compatibility check from the input type and the match method. Now, it checks if a casts from the input type to output type exists and the pattern_exec_match() function apply casts before each pattern matching.	2013-12-12 15:42:11 +01:00
Thierry FOURNIER	cc0e0b3dbb	MINOR: pattern: Each pattern sets the expected input type This is used later for increasing the compability with incoming sample types. When multiple compatible types are supported, one is arbitrarily used (eg: UINT).	2013-12-12 11:07:33 +01:00
Thierry FOURNIER	2d4771ba17	MINOR: map: export map_get_reference() function This function is used to identify map with his reference into the CLI functions.	2013-12-11 22:05:03 +01:00
Willy Tarreau	9ba813cd69	CLEANUP: check: server port is unsigned Baptiste Assmann reported some confusing printf() output of the server port since it's declared signed. Better turn it to unsigned. There's no need to backport this, it's only used in 16-bit places.	2013-12-10 23:32:30 +01:00
Willy Tarreau	2d400bb931	MINOR: stream_interface: add reporting of ressouce allocation errors SSL and keep-alive will need to be able to fail on allocation errors, and the stream interface did not allow to report such a cause. The flag will then be "RC" as already documented.	2013-12-09 17:12:18 +01:00
Willy Tarreau	05efc0f33a	DIET/MINOR: task: reduce struct task size by 8 bytes Just by reordering the struct task, we could shrink it by 8 bytes from 120 to 112 bytes. A careful reordering allowed each part to be located closer to the hot parts it's used with, resulting in another performance increase of about 0.5%.	2013-12-09 16:06:22 +01:00
Willy Tarreau	5735d7e2a2	MINOR: http: use an enum for the auth method in http_auth_data This method now takes a single byte, with 7 bytes left to be used after it. No savings were gained but at least now we have an enum.	2013-12-09 16:06:22 +01:00
Willy Tarreau	3770f23a3a	MINOR: http: switch the http state to an enum This reduces its size which is not reused by anything else. However it will significantly improve the debugger's output since we'll now get real state values. The default case had to be enabled in the parsers because gcc tries to optimize the switch/case and noticed some values were missing from the enums and emitted a warning.	2013-12-09 16:06:22 +01:00
Willy Tarreau	c8987b3664	DIET/MINOR: http: reduce the size of struct http_txn by 8 bytes Here again we had some oversized and misaligned entries. The method and the status don't need 4 bytes each, and there was a hole after the status that does not exist anymore. That's 8 additional bytes saved from http_txn and as much for the session. Also some fields were slightly moved to present better memory access patterns resulting in a steady 0.5% performance increase.	2013-12-09 16:06:22 +01:00
Willy Tarreau	721854f0ac	DIET/MINOR: stream-int: rearrange a few fields in struct stream_interface to save 8 bytes The current and previous states are now packed enums instead of ints. This will also help in gdb. The flags have been turned to 16-bit instead of 32 since only 10 are used. This resulted in saving 8 bytes per streamm interface, or 16 per session.	2013-12-09 16:06:21 +01:00
Willy Tarreau	2518db4bfa	DIET/MINOR: session: reduce the struct session size by 8 bytes Move uniq_id upper to fill a hole and kill one. Another hole remains after store_count.	2013-12-09 16:06:21 +01:00
Willy Tarreau	8379c17adf	DIET/MINOR: proxy: rearrange a few fields in struct proxy to save 16 bytes Turn the proxy state to a packed enum (1 char), same for the proxy mode, and store the capabitilies as a char. These 3 ints can now fill the hole after obj_type and save 8 bytes in the proxy struct. Moving the maxconn value just after, which is frequently accessed and was in a block of 3 ints saved another 8 bytes.	2013-12-09 16:06:21 +01:00
Willy Tarreau	f6502c5062	DIET/MINOR: listener: rearrange a few fields in struct listener to save 16 bytes Pack the listener state to 1 char, store it as an enum instead of an int (more gdb-friendly), and move a few fields around to fill holes. The <nice> field can only be -1024..1024 so it was stored as a signed short and completes well with obj_type and li_state. Doing this has reduced the struct listener from 376 to 360 bytes (4.2%).	2013-12-09 16:06:21 +01:00
Willy Tarreau	ad5281ca04	DIET/MINOR: connection: rearrange a few fields to save 8 bytes in the struct By moving the error code to 8 bits the send_proxy_ofs to 16 bits, and moving them just after the obj_type, we can save 8 bytes in the struct connection, down from 328 to 320.	2013-12-09 16:06:15 +01:00
Willy Tarreau	939478d04d	DIET/MINOR: obj: pack the obj_type enum to 8 bits Taking 32-bit in each struct just to store an obj_type is a waste considering the very small amount of possible values. Let's force it to be as small as possible (1 char) and we'll be able to move some structs around to save some space.	2013-12-09 16:06:08 +01:00
Willy Tarreau	4171e9eef0	MEDIUM: stats: delay appctx initialization Now that the session handler can automatically initialize the appctx, let's not do it in stats_accept() anymore.	2013-12-09 15:40:23 +01:00
Willy Tarreau	0a23bcb8be	MAJOR: stream-interface: dynamically allocate the applet context From now on, a call to stream_int_register_handler() causes a call to si_alloc_appctx() and returns an initialized appctx for the current stream interface. If one was previously allocated, it is released. If the stream interface was attached to a connection, it is released as well. The appctx are allocated from the same pools as the connections, because they're substantially smaller in size, and we can't have both a connection and an appctx on an interface at any moment. In case of memory shortage, the call may return NULL, which is already handled by all consumers of stream_int_register_handler(). The field appctx was removed from the stream interface since we only rely on the endpoint now. On 32-bit, the stream_interface size went down from 108 to 44 bytes. On 64-bit, it went down from 144 to 64 bytes. This represents a memory saving of 160 bytes per session. It seems that a later improvement could be to move the call to stream_int_register_handler() to session.c for most cases.	2013-12-09 15:40:23 +01:00
Willy Tarreau	1fbe1c9ec8	MEDIUM: stream-int: return the allocated appctx in stream_int_register_handler() The task returned by stream_int_register_handler() is never used, however we always need to access the appctx afterwards. So make it return the appctx instead. We already plan for it to fail, which is the reason for the addition of a few tests and the possibility for the HTTP analyser to return a status code 500.	2013-12-09 15:40:23 +01:00
Willy Tarreau	7b4b499fde	MEDIUM: stream-int: replace occurrences of si->appctx with si_appctx() We're about to remove si->appctx, so first let's replace all occurrences of its usage with a dynamic extract from si->end. A lot of code was changed by search-n-replace, but the behaviour was intentionally not altered. The code surrounding calls to stream_int_register_handler() was slightly changed since we can only use si->end after the registration.	2013-12-09 15:40:23 +01:00
Willy Tarreau	57cd3e46b9	MEDIUM: connection: merge the send_proxy and local_send_proxy calls We used to have two very similar functions for sending a PROXY protocol line header. The reason is that the default one relies on the stream interface to retrieve the other end's address, while the "local" one performs a local address lookup and sends that instead (used by health checks). Now that the send_proxy_ofs is stored in the connection and not the stream interface, we can make the local_send_proxy rely on it and support partial sends. This also simplifies the code by removing the local_send_proxy function, making health checks use send_proxy_ofs, resulting in the removal of the CO_FL_LOCAL_SPROXY flag, and the associated test in the connection handler. The other flag, CO_FL_SI_SEND_PROXY was renamed without the "SI" part so that it is clear that it is not dedicated anymore to a usage with a stream interface.	2013-12-09 15:40:23 +01:00
Willy Tarreau	1ec74bf660	MINOR: connection: check for send_proxy during the connect(), not the SI It's cleaner to check for a pending send_proxy_ofs while establishing the connection (which already checks it anyway) and not in the stream interface.	2013-12-09 15:40:23 +01:00
Willy Tarreau	b8020cefed	MEDIUM: connection: move the send_proxy offset to the connection Till now the send_proxy_ofs field remained in the stream interface, but since the dynamic allocation of the connection, it makes a lot of sense to move that into the connection instead of the stream interface, since it will not be statically allocated for each session. Also, it turns out that moving it to the connection fils an alignment hole on 64 bit architectures so it does not consume more memory, and removing it from the stream interface was an opportunity to correctly reorder fields and reduce the stream interface's size from 160 to 144 bytes (-10%). This is 32 bytes saved per session.	2013-12-09 15:40:23 +01:00
Willy Tarreau	32e3c6a607	MAJOR: stream interface: dynamically allocate the outgoing connection The outgoing connection is now allocated dynamically upon the first attempt to touch the connection's source or destination address. If this allocation fails, we fail on SN_ERR_RESOURCE. As we didn't use si->conn anymore, it was removed. The endpoints are released upon session_free(), on the error path, and upon a new transaction. That way we are able to carry the existing server's address across retries. The stream interfaces are not initialized anymore before session_complete(), so we could even think about allocating them dynamically as well, though that would not provide much savings. The session initialization now makes use of conn_new()/conn_free(). This slightly simplifies the code and makes it more logical. The connection initialization code is now shorter by about 120 bytes because it's done at once, allowing the compiler to remove all redundant initializations. The si_attach_applet() function now takes care of first detaching the existing endpoint, and it is called from stream_int_register_handler(), so we can safely remove the calls to si_release_endpoint() in the application code around this call. A call to si_detach() was made upon stream_int_unregister_handler() to ensure we always free the allocated connection if one was allocated in parallel to setting an applet (eg: detect HTTP proxy while proceeding with stats maybe).	2013-12-09 15:40:23 +01:00
Willy Tarreau	2a6e8802c0	MEDIUM: stream-interface: introduce si_attach_conn to replace si_prepare_conn si_prepare_conn() is not appropriate in our case as it both initializes and attaches the connection to the stream interface. Due to the asymmetry between accept() and connect(), it causes some fields such as the control and transport layers to be reinitialized. Now that we can separately initialize these fields using conn_prepare(), let's break this function to only attach the connection to the stream interface. Also, by analogy, si_prepare_none() was renamed si_detach(), and si_prepare_applet() was renamed si_attach_applet().	2013-12-09 15:40:23 +01:00
Willy Tarreau	7abddb5c67	MINOR: connection: replace conn_assign with conn_attach We don't want to assign the control nor transport layers anymore at the same time as the data layer, because it prevents one from keeping existing settings when reattaching a connection to an existing stream interface. Let's have conn_attach() replace conn_assign() for this purpose. Thus, conn_prepare() + conn_attach() do exactly the same as the previous conn_assign().	2013-12-09 15:40:23 +01:00
Willy Tarreau	910c6aa5b7	MINOR: connection: reintroduce conn_prepare to set the protocol and transport Now that we can assign conn->xprt regardless of the initialization state, we can reintroduce conn_prepare() to set only the protocol, the transport layer and initialize the transport layer's state.	2013-12-09 15:40:23 +01:00
Willy Tarreau	3ed35ef05b	MINOR: stream-interface: introduce si_reset() and si_set_state() The first function is used to (re)initialize a stream interface and the second to force it into a known state. These are intended for cleaning up the stream interface initialization code in session.c and peers.c and avoiding future issues with missing initializations.	2013-12-09 15:40:23 +01:00
Willy Tarreau	f79c8171b2	MAJOR: connection: add two new flags to indicate readiness of control/transport Currently the control and transport layers of a connection are supposed to be initialized when their respective pointers are not NULL. This will not work anymore when we plan to reuse connections, because there is an asymmetry between the accept() side and the connect() side : - on accept() side, the fd is set first, then the ctrl layer then the transport layer ; upon error, they must be undone in the reverse order, then the FD must be closed. The FD must not be deleted if the control layer was not yet initialized ; - on the connect() side, the fd is set last and there is no reliable way to know if it has been initialized or not. In practice it's initialized to -1 first but this is hackish and supposes that local FDs only will be used forever. Also, there are even less solutions for keeping trace of the transport layer's state. Also it is possible to support delayed close() when something (eg: logs) tracks some information requiring the transport and/or control layers, making it even more difficult to clean them. So the proposed solution is to add two flags to the connection : - CO_FL_CTRL_READY is set when the control layer is initialized (fd_insert) and cleared after it's released (fd_delete). - CO_FL_XPRT_READY is set when the control layer is initialized (xprt->init) and cleared after it's released (xprt->close). The functions have been adapted to rely on this and not on the pointers anymore. conn_xprt_close() was unused and dangerous : it did not close the control layer (eg: the socket itself) but still marks the transport layer as closed, preventing any future call to conn_full_close() from finishing the job. The problem comes from conn_full_close() in fact. It needs to close the xprt and ctrl layers independantly. After that we're still having an issue : we don't know based on ->ctrl alone whether the fd was registered or not. For this we use the two new flags CO_FL_XPRT_READY and CO_FL_CTRL_READY. We now rely on this and not on conn->xprt nor conn->ctrl anymore to decide what remains to be done on the connection. In order not to miss some flag assignments, we introduce conn_ctrl_init() to initialize the control layer, register the fd using fd_insert() and set the flag, and conn_ctrl_close() which unregisters the fd and removes the flag, but only if the transport layer was closed. Similarly, at the transport layer, conn_xprt_init() calls ->init and sets the flag, while conn_xprt_close() checks the flag, calls ->close and clears the flag, regardless xprt_ctx or xprt_st. This also ensures that the ->init and the ->close functions are called only once each and in the correct order. Note that conn_xprt_close() does nothing if the transport layer is still tracked. conn_full_close() now simply calls conn_xprt_close() then conn_full_close() in turn, which do nothing if CO_FL_XPRT_TRACKED is set. In order to handle the error path, we also provide conn_force_close() which ignores CO_FL_XPRT_TRACKED and closes the transport and the control layers in turns. All relevant instances of fd_delete() have been replaced with conn_force_close(). Now we always know what state the connection is in and we can expect to split its initialization.	2013-12-09 15:40:23 +01:00
Willy Tarreau	b97f3b1abf	MINOR: connection: add conn_new() / conn_free() conn_new() will be a more convenient way of allocating and initializing a connection. It calls pool_alloc2() and conn_init() upon success. conn_free() is just a pool_free2() but is provided for symmetry with conn_new().	2013-12-09 15:40:23 +01:00
Willy Tarreau	c10aec299f	MINOR: get rid of si_takeover_conn() Since last commit, this function is an exact copy of si_prepare_conn().	2013-12-09 15:40:23 +01:00
Willy Tarreau	37213433a8	MEDIUM: connection: replace conn_prepare with conn_assign Everywhere conn_prepare() is used, the call to conn_init() has already been done. We can now safely replace all instances of conn_prepare() with conn_assign() which does not reset the transport layer, and remove conn_prepare().	2013-12-09 15:40:23 +01:00
Willy Tarreau	d015577428	MINOR: connection: add conn_init() to (re)initialize a connection This function will ease the initialization of new connections as well as their reuse. It initializes the obj_type and a few fields so that the connection is fresh again. It leaves the addresses and target untouched so it is suitable for use across connection retries.	2013-12-09 15:40:23 +01:00
Willy Tarreau	f8a49eab4f	MEDIUM: session: attach incoming connection to target on embryonic sessions In order to reduce the dependency over stream-interfaces, we now attach the incoming connection to the embryonic session's target instead of the stream-interface's connection. This means we won't need to initialize stream interfaces anymore after we implement dynamic connection allocation. The session's target is reset to NULL after the session has been converted to a complete session.	2013-12-09 15:40:22 +01:00
Willy Tarreau	b363a1f469	MAJOR: stream-int: stop using si->conn and use si->end instead The connection will only remain there as a pre-allocated entity whose goal is to be placed in ->end when establishing an outgoing connection. All connection initialization can be made on this connection, but all information retrieved should be applied to the end point only. This change is huge because there were many users of si->conn. Now the only users are those who initialize the new connection. The difficulty appears in a few places such as backend.c, proto_http.c, peers.c where si->conn is used to hold the connection's target address before assigning the connection to the stream interface. This is why we have to keep si->conn for now. A future improvement might consist in dynamically allocating the connection when it is needed.	2013-12-09 15:40:22 +01:00
Willy Tarreau	691b1f429e	CLEANUP: stream-int: remove obsolete si_ctrl function This function makes no sense anymore and will cause trouble to convert the remains of connection/applet to end points. Let's replace it now with its contents.	2013-12-09 15:40:22 +01:00
Willy Tarreau	cf644ed37a	MEDIUM: stream-int: make ->end point to the connection or the appctx The long-term goal is to have a context for applets as an alternative to the connection and not as a complement. At the moment, the context is still stored into the stream interface, and we only put a pointer to the applet's context in si->end, initialize the context with object type OBJ_TYPE_APPCTX, and this allows us not to allocate an entry when deciding to switch to an applet. A special care is taken to never dereference si->conn anymore when dealing with an applet. That's why it's important that si->end is always set to the proper type : si->end == NULL => not connected to anything si->end == OBJ_TYPE_APPCTX => connected to an applet si->end == OBJ_TYPE_CONN => real connection (server, proxy, ...) The session management code used to check the applet from the connection's target. Now it uses the stream interface's end point and does not touch the connection at all. Similarly, we stop checking the connection's addresses and file descriptors when reporting the applet's status in the stats dump.	2013-12-09 15:40:22 +01:00
Willy Tarreau	4a59f2f954	MAJOR: stream interface: remove the ->release function pointer Since last commit, we now have a pointer to the applet in the applet context. So we don't need the si->release function pointer anymore, it can be extracted from applet->applet.release. At many places, the ->release function was still tested for real connections while it is only limited to applets, so most of them were simply removed. For the remaining valid uses, a new inline function si_applet_release() was added to simplify the check and the call.	2013-12-09 15:40:22 +01:00
Willy Tarreau	48099c7a07	MEDIUM: stream-interface: set the pointer to the applet into the applet context In preparation for a later move of all the applet context outside of the stream interface, we'll need to have access to the applet itself from the context. Let's have a pointer to it inside the context.	2013-12-09 15:40:22 +01:00
Willy Tarreau	7d67d7b9e5	MINOR: stream-int: add a new pointer to the end point The end point will correspond to either an applet context or a connection, depending on the object type. For now the pointer remains null.	2013-12-09 15:40:22 +01:00
Willy Tarreau	372d6708fb	MINOR: stream-int: split si_prepare_embedded into si_prepare_none and si_prepare_applet si_prepare_embedded() was used both to attach an applet and to detach anything from a stream interface. Split it into si_prepare_none() to detach and si_prepare_applet() to attach an applet. si->conn->target is now assigned from within these two functions instead of their respective callers.	2013-12-09 15:40:22 +01:00
Willy Tarreau	9b6c2c721e	MINOR: stream-int: rename ->applet to ->appctx Since this is the applet context, call it ->appctx to avoid the confusion with the pointer to the applet. Many places were changed but it's only a renaming.	2013-12-09 15:40:22 +01:00
Willy Tarreau	0788f47cc1	MINOR: obj: introduce a new type appctx The object type was added to "struct appctx". The purpose will be to identify an appctx when the applet context is detached from the stream interface. For now, it's still attached, so this patch only adds the new type and does not replace its use.	2013-12-09 15:40:22 +01:00
Willy Tarreau	452d3bb0c4	MINOR: stream-interface: move the applet context to its own struct In preparation of making the applet context dynamically allocatable, we create a "struct appctx". Nothing else was changed, it's the same struct as the one which was in the stream interface.	2013-12-09 15:40:22 +01:00
Willy Tarreau	f4acee332b	MEDIUM: stream interface: move the peers' ptr into the applet context A long time ago when peers were introduced, there was no applet nor applet context. Applet contexts were introduced but the peers still did not make use of them and the "ptr" pointer remains present in every stream interface in addition to the other contexts. Simply move this pointer to its own location in the context. Note that this pointer is still a void* because its type and contents varies depending on the peers session state. Probably that this could be cleaned up in the future given that all other contexts already store much more than a single pointer.	2013-12-09 15:40:22 +01:00
Willy Tarreau	51c2184755	MINOR: connection: add a field to store an object type This will soon be used to differenciate connections from applet contexts. Object type "connection" has also been added.	2013-12-09 15:40:22 +01:00
Willy Tarreau	66337a0784	MINOR: obj: provide a safe and an unsafe access to pointed objects Most of the times, the caller of objt_<type>(ptr) will know that <ptr> is valid and of the correct type (eg: in an "if" condition). Let's provide an unsafe variant that does not perform the check again for these usages. The new functions are called "__objt_<type>".	2013-12-09 15:40:22 +01:00
Willy Tarreau	6fe1541285	MINOR: stream-int: make the shutr/shutw functions void This is to be more consistent with the other functions. The only reason why these functions used to return a value was to let the caller adjust polling by itself, but now their only callers were the si_shutr()/si_shutw() inline functions. Now these functions do not depend anymore on the connection. These connection variant of these functions now call conn_data_stop_recv()/conn_data_stop_send() before returning order not to require a return code anymore. The applet version does not need this at all.	2013-12-09 15:40:22 +01:00
Willy Tarreau	8b3d7dfd7c	MEDIUM: stream-int: split the shutr/shutw functions between applet and conn These functions induce a lot of ifs everywhere because they consider two different cases, one which is where the connection exists and has a file descriptor, and the other one which is the default case where at most an applet has to be notified. Let's have them in si_ops and automatically decide which one to use. The connection shutdown sequence has been slightly simplified, and we now clear the flags at the end. Also we remove SHUTR_NOW after a shutw with nolinger, as it's cleaner not to keep it.	2013-12-09 15:40:22 +01:00
Willy Tarreau	347a35d19e	MAJOR: stats: move the HTTP stats handling to its applet There is a big trouble with the way POST is handled for the admin stats page. The POST parameters are extracted from some http-request rules, and if not round they return zero hoping for being called again when more data passes. This results in the HTTP analyser being called several times and all the rules prior to the stats being executed multiple times as well. That includes rewrite rules. So instead of doing this, we now move all the processing of the stats into the stats applet. That way we just set the stats applet in the HTTP analyser when a stats request is detected, and the applet takes the time it needs to read the arguments and respond. We could even imagine improving the applet to support requests larger than a single buffer. The code was almost only moved and minimally changed. Several new HTTP states were added to the stats applet to emit headers, redirects and to read POST. It was necessary to do this because the headers sent depend on the parsing of the POST request. In the end it's beneficial because we removed two stream_int_retnclose() calls.	2013-12-09 15:40:22 +01:00
Willy Tarreau	96d44918f7	MEDIUM: stats: prepare the HTTP stats I/O handler to support more states In preparation for moving the POST processing to the applet, we first add new states to the HTTP I/O handler. Till now st0 was only 0/1 for start/end. We now replace it with an enum.	2013-12-09 15:40:22 +01:00
Willy Tarreau	9f68148321	MEDIUM: peers: don't rely on conn->xprt_ctx anymore We make the peers code use applet->ptr instead of conn->xprt_ctx to store the pointer to the current peer. That way it does not depend on a connection anymore.	2013-12-09 15:40:21 +01:00
Willy Tarreau	787add2932	MINOR: session: add a simple function to retrieve a session from a task This function only casts t->context to (struct session *). It will avoid some ugly and unsafe casts in upcoming changes.	2013-12-09 15:40:21 +01:00
Willy Tarreau	a94d2d7653	MEDIUM: stats: don't use conn->xprt_st anymore We're trying to move the applets out of the struct connection. So let's remove the dependence on xprt_st and introduce si->applet.st2 to store the missing contextual data instead.	2013-12-09 15:40:21 +01:00
Willy Tarreau	08382955fe	CLEANUP: stream_interface: remove unused field err_loc This field was still fed with a pointer to the server that caught an error but was not used anymore. Let's remove it.	2013-12-09 15:40:21 +01:00
Willy Tarreau	37e340ce4b	BUG/MEDIUM: stick: completely remove the unused flag from the store entries The store[] array in the session holds a flag which probably aimed to differenciate store entries learned from the request from those learned from the response, and allowing responses to overwrite only the request ones (eg: have a server set a response cookie which overwrites the request one). But this flag is set when a response data is stored, and is never cleared. So in practice, haproxy always runs with this flag set, meaning that responses prevent themselves from overriding the request data. It is desirable anyway to keep the ability not to override data, because the override is performed only based on the table and not on the key, so that would mean that it would be impossible to retrieve two different keys to store into a same table. For example, if a client sets a cookie and a server another one, both need to be updated in the table in the proper order. This is especially true when multiple keys may be tracked on each side into the same table (eg: list of IP addresses in a header). So the correct fix which also maintains the current behaviour consists in simply removing this flag and never try to optimize for the overwrite case. This fix also has the benefit of significantly reducing the session size, by 64 bytes due to alignment issues caused by this flag! The bug has been there forever (since 1.4-dev7), so a backport to 1.4 would be appropriate.	2013-12-06 23:14:53 +01:00
Willy Tarreau	98aec9ff47	BUG/MINOR: checks: tcp-check actions are enums, not flags In recent commit `5ecb77f` (MEDIUM: checks: add send/expect tcp based check), bitfields were mistakenly used at some places for the actions. Fortunately, the only two actions right now are 1 and 2 so they don't share any bit in common and the bug has no impact. No backport is needed.	2013-12-06 16:16:41 +01:00
Baptiste Assmann	5ecb77f4c7	MEDIUM: checks: add send/expect tcp based check This is a generic health check which can be used to match a banner or send a request and analyse a server response. It works in a send/expect ways and many exchange can be done between HAProxy and a server to decide the server status, making HAProxy able to speak the server's protocol. It can send arbitrary regular or binary strings and match content as a regular or binary string or a regex. Signed-off-by: Baptiste Assmann <bedis9@gmail.com>	2013-12-06 11:50:47 +01:00
Baptiste Assmann	bb77c8e26d	MINOR: tools: function my_memmem() to lookup binary contents This function simply looks for a memory block inside another one. Signed-off-by: Baptiste Assmann <bedis9@gmail.com>	2013-12-06 11:50:47 +01:00
Willy Tarreau	126d40691a	MINOR: tools: add a generic binary hex string parser We currently use such an hex parser in pat_parse_bin() to parse hex string patterns. We'll need another generic one so let's move it to standard.c and have pat_parse_bin() make use of it.	2013-12-06 11:50:47 +01:00
Thierry FOURNIER	0ffe78cfe3	MEDIUM: map: merge identical maps This patch permits to use the same struct pattern for two indentical maps. This permits to preserve memory, and permits to update only one "struct pattern" when the dynamic map update is supported.	2013-12-06 11:40:53 +01:00
Thierry FOURNIER	275db69c07	BUG/MINOR: map: The map list was declared in the map.h file This bug is harmless and post-dev19, it does not require any backport.	2013-12-06 11:37:28 +01:00
Thierry FOURNIER	d18cd0f110	MEDIUM: http: The redirect strings follows the log format rules. We handle "http-request redirect" with a log-format string now, but we leave "redirect" unaffected. Note that the control of the special "/" case is move from the runtime execution to the configuration parsing. If the format rule list is empty, the build_logline() function does nothing.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	d5f624dde7	MEDIUM: sample: add the "map" converter Add a new converter with the following prototype : map(<map_file>[,<default_value>]) map_<match_type>(<map_file>[,<default_value>]) map_<match_type>_<output_type>(<map_file>[,<default_value>]) It searches the for input value from <map_file> using the <match_type> matching method, and return the associated value converted to the type <output_type>. If the input value cannot be found in the <map_file>, the converter returns the <default_value>. If the <default_value> is not set, the converter fails and acts as if no input value could be fetched. If the <match_type> is not set, it defaults to "str". Likewise, if the <output_type> is not set, it defaults to "str". For convenience, the "map" keyword is an alias for "map_str" and maps a string to another string. The following array contains contains the list of all the map* converters. +----+----------+---------+-------------+------------+ \| `-_ out \| \| \| \| \| input `-_ \| str \| int \| ip \| \| / match `-_ \| \| \| \| +---------------+---------+-------------+------------+ \| str / str \| map_str \| map_str_int \| map_str_ip \| \| str / sub \| map_sub \| map_sub_int \| map_sub_ip \| \| str / dir \| map_dir \| map_dir_int \| map_dir_ip \| \| str / dom \| map_dom \| map_dom_int \| map_dom_ip \| \| str / end \| map_end \| map_end_int \| map_end_ip \| \| str / reg \| map_reg \| map_reg_int \| map_reg_ip \| \| int / int \| map_int \| map_int_int \| map_int_ip \| \| ip / ip \| map_ip \| map_ip_int \| map_ip_ip \| +---------------+---------+-------------+------------+ The names are intentionally chosen to reflect the same match methods as ACLs use.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	4b5e422759	MINOR: map: Define map types Define the types used with maps, and add new argument type that can reference the map. This pointer contains the map configuration entries.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	fdbf4842b6	MINOR: sample: add a private field to the struct sample_conv These flags will be used for maps, and possibly later to pass some extra information to other converters if needed.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	b805f71d1b	MEDIUM: sample: let the cast functions set their output type This patch allows each sample cast function to specify the sample output type. The goal is to be able to emit an output type IPv4 or IPv6 depending on what is found in the input if the next converter is able to process them both. The patch also adds a new pseudo type called "ADDR". This type is an alias for IPV4 and IPV6 which is only used as an input type by converters who want to express their compatibility with both address formats. It may not be emitted. The goal is to unify as much as possible the processing of IPv4 and IPv6 in order not to add extra keywords for the maps which act as converters, but will match samples like ACLs do with their patterns.	2013-12-02 23:31:33 +01:00
Willy Tarreau	6f8fe310cf	MINOR: pattern: import acl_find_match_name() into pattern.h It's only dedicated to pattern match lookups, so it was renamed pat_find_match_name().	2013-12-02 23:31:33 +01:00
Willy Tarreau	0cba607400	MINOR: acl/pattern: use types different from int to clarify who does what. We now have the following enums and all related functions return them and consume them : enum pat_match_res { PAT_NOMATCH = 0, /* sample didn't match any pattern / PAT_MATCH = 3, / sample matched at least one pattern / }; enum acl_test_res { ACL_TEST_FAIL = 0, / test failed / ACL_TEST_MISS = 1, / test may pass with more info / ACL_TEST_PASS = 3, / test passed / }; enum acl_cond_pol { ACL_COND_NONE, / no polarity set yet / ACL_COND_IF, / positive condition (after 'if') / ACL_COND_UNLESS, / negative condition (after 'unless') */ }; It's just in order to avoid doubts when reading some code.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	a65b343eee	MEDIUM: pattern: rename "acl" prefix to "pat" This patch just renames functions, types and enums. No code was changed. A significant number of files were touched, especially the ACL arrays, so it is likely that some external patches will not apply anymore. One important thing is that we had to split ACL_PAT_* into two groups : - ACL_TEST_{PASS\|MISS\|FAIL} - PAT_{MATCH\|UNMATCH} A future patch will enforce enums on all these places to avoid confusion.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	d163e1ce30	MEDIUM: pattern: create pattern expression This new structure contains the data needed for pattern matching. It's the first step to the complete independance of the pattern matching.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	ed66c297c2	REORG: acl/pattern: extract pattern matching from the acl file and create pattern.c This patch just moves code without any change. The ACL are just the association between sample and pattern. The pattern contains the match method and the parse method. These two things are different. This patch cleans the code by splitting it.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	dd69a04666	MEDIUM: acl: associate "struct sample_storage" to each "struct acl_pattern" This will be used later with maps. Each map will associate an entry with a sample_storage value. This patch changes the "parse" prototype and all the parsing methods. The goal is to associate "struct sample_storage" to each entry of "struct acl_pattern". Only the "parse" function can add the sample value into the "struct acl_pattern".	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	8ed9697064	MINOR: sample: Define new struct sample_storage This struct is used to store a sample constant. The size of this struct is less than the struct sample. This struct only contains a constant and doesn't need the "ctx" nor the "flags".	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	29d47b87c4	MINOR: acl: Extract the pattern matching function The map feature will need to match acl patterns. This patch extracts the matching function from the global ACL function "acl_exec_cond". The code was only moved to its own function, no functional changes were made.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	3a103c5a6b	MINOR: acl: Extract the pattern parsing and indexation from the "acl_read_patterns_from_file()" function With this split, the pattern indexation can apply to any source. The map feature needs this functionality because the map cannot be loaded with the same file format as the ones supported by acl_read_patterns_from_file(). The code was only moved to its own function, no functional changes were made.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	319e495a96	MINOR: acl: export acl arrays The map feature needs to use the acl parser and converters.	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	d559dd8390	MINOR: tools: Add a function to convert buffer to an ipv6 address The inet_pton function needs an input string with a final \0. This function copies the input string to a temporary buffer, adds the final \0 and converts to address.	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	9c1d67ecbd	MINOR: sample: provide the original sample_conv descriptor struct to the argument checker function. Note that this argument checker is still unused but will be used by maps.	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	348971ea28	MEDIUM: acl: use the fetch syntax 'fetch(args),conv(),conv()' into the ACL keyword If the acl keyword is a "fetch", the dedicated parsing function "sample_parse_expr()" is used. Otherwise, the acl parsing function "parse_acl_expr()" is extended to understand the syntax of a series of converters placed after the "fetch" keyword. Before this patch, each acl uses a "struct sample_fetch" and executes it with the "<fetch>->process()" function. Now, the dedicated function "sample_process()" is called. These syntax are now avalaible: acl bad req.hdr(host),lower -m str www http-request redirect prefix /go-away if bad acl bad hdr_beg(host),lower www http-request redirect prefix /go-away if bad	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	8af6ff12b5	MINOR: sample: export sample_casts just export the sample cast matrix "sample_casts" to prepare the generic sample conversion parser.	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	20f4996738	MINOR: sample: export the generic sample conversion parser just export function "find_sample_conv()" to prepare the generic sample conversion parser.	2013-12-02 23:31:32 +01:00
Willy Tarreau	34c2fb6f89	BUG/MINOR: config: report the correct track-sc number in tcp-rules When parsing track-sc* actions in tcp-request rules, we now automatically compute the track-sc identifier number using %d when displaying an error message. But the ID has become wrong since we introduced sc0, we continue to report id+1 in error messages causing some confusion. No backport is needed.	2013-12-02 23:31:32 +01:00
Willy Tarreau	830bf61815	BUG/MINOR: connection: fix typo in error message report "unknownn" -> "unknown"	2013-12-01 20:29:58 +01:00
Thierry FOURNIER	1c0054fe83	BUG/MINOR: arg: fix error reporting for add-header/set-header sample fetch arguments The 'add-header %[samples]' parsing errors associated to http-request and http-response are displayed with the wrong keyword. Configuration entry: http-request set-header mon-header %[res.hdr(user-agent)] Original error message: [WARNING] 323/150920 (16559) : parsing [haproxy.conf:36] : 'log-format' : sample fetch <res.hdr ... After commit error message: [WARNING] 323/150929 (16580) : parsing [haproxy.conf:36] : 'http-request' : sample fetch <res.hdr ...	2013-11-28 18:25:18 +01:00
Simon Horman	8c3d0be987	MEDIUM: Add DRAIN state and report it on the stats page Add a DRAIN sub-state for a server which will be shown on the stats page instead of UP if its effective weight is zero. Also, log if a server enters or leaves the DRAIN state as the result of an agent check. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-25 07:31:16 +01:00
Simon Horman	671b6f02b5	MEDIUM: Add enable and disable agent unix socket commands The syntax of this new commands are: enable agent <backend>/<server> disable agent <backend>/<server> These commands allow temporarily stopping and subsequently re-starting an auxiliary agent check. The effect of this is as follows: New checks are only initialised when the agent is in the enabled. Thus, disable agent will prevent any new agent checks from begin initiated until the agent re-enabled using enable agent. When an agent is disabled the processing of an auxiliary agent check that was initiated while the agent was set as enabled is as follows: All results that would alter the weight, specifically "drain" or a weight returned by the agent, are ignored. The processing of agent check is otherwise unchanged. The motivation for this feature is to allow the weight changing effects of the agent checks to be paused to allow the weight of a server to be configured using set weight without being overridden by the agent. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-25 07:31:16 +01:00
Simon Horman	58c32978b2	MEDIUM: Set rise and fall of agent checks to 1 This is achieved by moving rise and fall from struct server to struct check. After this move the behaviour of the primary check, server->check is unchanged. However, the secondary agent check, server->agent now has independent rise and fall values each of which are set to 1. The result is that receiving "fail", "stopped" or "down" just once from the agent will mark the server as down. And receiving a weight just once will allow the server to be marked up if its primary check is in good health. This opens up the scope to allow the rise and fall values of the agent check to be configurable, however this has not been implemented at this stage. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-25 07:31:16 +01:00
Simon Horman	d60d69138b	MEDIUM: checks: Add supplementary agent checks Allow an auxiliary agent check to be run independently of the regular a regular health check. This is enabled by the agent-check server setting. The agent-port, which specifies the TCP port to use for the agent's connections, is required. The agent-inter, which specifies the interval between agent checks and timeout of agent checks, is optional. If not set the value for regular checks is used. e.g. server web1_1 127.0.0.1:80 check agent-port 10000 If either the health or agent check determines that a server is down then it is marked as being down, otherwise it is marked as being up. An agent health check performed by opening a TCP socket and reading an ASCII string. The string should have one of the following forms: * An ASCII representation of an positive integer percentage. e.g. "75%" Values in this format will set the weight proportional to the initial weight of a server as configured when haproxy starts. * The string "drain". This will cause the weight of a server to be set to 0, and thus it will not accept any new connections other than those that are accepted via persistence. * The string "down", optionally followed by a description string. Mark the server as down and log the description string as the reason. * The string "stopped", optionally followed by a description string. This currently has the same behaviour as "down". * The string "fail", optionally followed by a description string. This currently has the same behaviour as "down". Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-25 07:31:16 +01:00
Willy Tarreau	d32c399747	MINOR: stats: report correct throttling percentage for servers in slowstart The column used to report the throttle percentage when a server is in slowstart is based on the time only. This is wrong, because server weights in slowstart are updated at most once a second, so the reported value is wrong at least fo rone second during each step, which means all the time when using short delays (< 20s). The second point is that it's disturbing to see a weight < 100% without any throttle at the end of the period (during the last second), because the effective weight has not yet been updated. Instead, we now compute the exact ratio between eweight and uweight and report it. It's always accurate and describes the value being used instead of using only the date. It can be backported to 1.4 though it's not particularly important.	2013-11-21 15:30:45 +01:00
Willy Tarreau	004e045f31	BUG/MAJOR: server: weight calculation fails for map-based algorithms A crash was reported by Igor at owind when changing a server's weight on the CLI. Lukas Tribus could reproduce a related bug where setting a server's weight would result in the new weight being multiplied by the initial one. The two bugs are the same. The incorrect weight calculation results in the total farm weight being larger than what was initially allocated, causing the map index to be out of bounds on some hashes. It's easy to reproduce using "balance url_param" with a variable param, or with "balance static-rr". It appears that the calculation is made at many places and is not always right and not always wrong the same way. Thus, this patch introduces a new function "server_recalc_eweight()" which is dedicated to this task of computing ->eweight from many other elements including uweight and current time (for slowstart), and all users now switch to use this function. The patch is a bit large but the code was not trivially fixable in a way that could guarantee this situation would not occur anymore. The fix is much more readable and has been verified to work with all algorithms, with both consistent and map-based hashes, and even with static-rr. Slowstart was tested as well, just like enable/disable server. The same bug is very likely present in 1.4 as well, so the patch will probably need to be backported eventhough it will not apply as-is. Thanks to Lukas and Igor for the information they provided to reproduce it.	2013-11-21 15:09:02 +01:00
Simon Horman	125d099662	MEDIUM: Move health element to struct check This is in preparation for associating a agent check with a server which runs as well as the server's existing check. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:36:07 +01:00
Simon Horman	cd5d7b678e	MEDIUM: Add state to struct check Add state to struct check. This is currently used to store one bit, CHK_RUNNING, which is set if a check is running and clear otherwise. This bit was previously SRV_CHK_RUNNING of the state element of struct server. This is in preparation for associating a agent check with a server which runs as well as the server's existing check. Signed-off-by: Simon Horman <horms+renesas@verge.net.au>	2013-11-19 09:36:04 +01:00
Simon Horman	4a741432be	MEDIUM: Paramatise functions over the check of a server Paramatise the following functions over the check of a server * set_server_down * set_server_up * srv_getinter * server_status_printf * set_server_check_status * set_server_disabled * set_server_enabled Generally the server parameter of these functions has been removed. Where it is still needed it is obtained using check->server. This is in preparation for associating a agent check with a server which runs as well as the server's existing check. By paramatising these functions they may act on each of the checks without further significant modification. Explanation of the SSP_O_HCHK portion of this change: * Prior to this patch SSP_O_HCHK serves a single purpose which is to tell server_status_printf() weather it should print the details of the check of a server or not. With the paramatisation that this patch adds there are two cases. 1) Printing the details of the check in which case a valid check parameter is needed. 2) Not printing the details of the check in which case the contents check parameter are unused. In case 1) we could pass SSP_O_HCHK and a valid check and; In case 2) we could pass !SSP_O_HCHK and any value for check including NULL. If NULL is used for case 2) then SSP_O_HCHK becomes supurfulous and as NULL is used for case 2) SSP_O_HCHK has been removed. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:35:54 +01:00
Simon Horman	28b5ffc76f	MEDIUM: Move result element to struct check Move result element from struct server to struct check This allows check results to be independent of the check's server. This is in preparation for associating a agent check with a server which runs as well as the server's existing check. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:35:52 +01:00
Simon Horman	6618300e13	MEDIUM: Split up struct server's check element This is in preparation for associating a agent check with a server which runs as well as the server's existing check. The split has been made by: * Moving elements of struct server's check element that will be shared by both checks into a new check_common element of struct server. * Moving the remaining elements to a new struct check and making struct server's check element a struct check. * Adding a server element to struct check, a back-pointer to the server element it is a member of. - At this time the server could be obtained using container_of, however, this will not be so easy once a second struct check element is added to struct server to accommodate an agent health check. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:35:48 +01:00
Simon Horman	c69d547638	CLEANUP: Remove unused 'last_slowstart_change' field from struct peer This was inadvertently added by "MEDIUM: checks: Add agent health check". It appears to have never been used. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 08:04:59 +01:00
Simon Horman	a360844735	CLEANUP: Make parameters of srv_downtime and srv_getinter const The parameters of srv_downtime and srv_getinter are not modified and thus may be const. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 08:04:58 +01:00
Willy Tarreau	a0f4271497	MEDIUM: backend: add support for the wt6 hash This function was designed for haproxy while testing other functions in the past. Initially it was not planned to be used given the not very interesting numbers it showed on real URL data : it is not as smooth as the other ones. But later tests showed that the other ones are extremely sensible to the server count and the type of input data, especially DJB2 which must not be used on numeric input. So in fact this function is still a generally average performer and it can make sense to merge it in the end, as it can provide an alternative to sdbm+avalanche or djb2+avalanche for consistent hashing or when hashing on numeric data such as a source IP address or a visitor identifier in a URL parameter.	2013-11-14 16:37:50 +01:00
Bhaskar Maddala	b6c0ac94a4	MEDIUM: backend: Implement avalanche as a modifier of the hashing functions. Summary: Avalanche is supported not as a native hashing choice, but a modifier on the hashing function. Note that this means that possible configs written after 1.5-dev4 using "hash-type avalanche" will get an informative error instead. But as discussed on the mailing list it seems nobody ever used it anyway, so let's fix it before the final 1.5 release. The default values were selected for backward compatibility with previous releases, as discussed on the mailing list, which means that the consistent hashing will still apply the avalanche hash by default when no explicit algorithm is specified. Examples (default) hash-type map-based Map based hashing using sdbm without avalanche (default) hash-type consistent Consistent hashing using sdbm with avalanche Additional Examples: (a) hash-type map-based sdbm Same as default for map-based above (b) hash-type map-based sdbm avalanche Map based hashing using sdbm with avalanche (c) hash-type map-based djb2 Map based hashing using djb2 without avalanche (d) hash-type map-based djb2 avalanche Map based hashing using djb2 with avalanche (e) hash-type consistent sdbm avalanche Same as default for consistent above (f) hash-type consistent sdbm Consistent hashing using sdbm without avalanche (g) hash-type consistent djb2 Consistent hashing using djb2 without avalanche (h) hash-type consistent djb2 avalanche Consistent hashing using djb2 with avalanche	2013-11-14 16:37:50 +01:00
Bhaskar	98634f0c7b	MEDIUM: backend: Enhance hash-type directive with an algorithm options Summary: In testing at tumblr, we found that using djb2 hashing instead of the default sdbm hashing resulted is better workload distribution to our backends. This commit implements a change, that allows the user to specify the hash function they want to use. It does not limit itself to consistent hashing scenarios. The supported hash functions are sdbm (default), and djb2. For a discussion of the feature and analysis, see mailing list thread "Consistent hashing alternative to sdbm" : http://marc.info/?l=haproxy&m=138213693909219 Note: This change does NOT make changes to new features, for instance, applying an avalance hashing always being performed before applying consistent hashing.	2013-11-14 16:37:50 +01:00
Thierry FOURNIER	a054d410db	BUILD/MINOR: missing header file In the header file "types/proto_http.h", the list are used but the header file "mini-clist.h" is not included.	2013-10-23 15:53:56 +02:00
Thierry FOURNIER	de6617b486	MINOR: http: some exported functions were not in the header file Export the following functions: - find_hdr_value_end - http_header_match2 - http_remove_header2 - http_header_add_tail2	2013-10-23 12:21:38 +02:00
Thierry FOURNIER	ef37a66628	CLEANUP: The function "regex_exec" needs the string length but in many case they expect null terminated char. If haproxy is compiled with the USE_PCRE_JIT option, the length of the string is used. If it is compiled without this option the function doesn't use the length and expects a null terminated string. The prototype of the function is ambiguous, and depends on the compilation option. The developer can think that the length is always used, and many bugs can be created. This patch makes sure that the length is used. The regex_exec function adds the final '\0' if it is needed.	2013-10-23 12:19:51 +02:00
Thierry FOURNIER	ed5a4aefae	CLEANUP: regex: Create regex_comp function that compiles regex using compilation options The current file "regex.h" define an abstraction for the regex. It provides the same struct name and the same "regexec" function for the 3 regex types supported: standard libc, basic pcre and jit pcre. The regex compilation function is not provided by this file. If the developper wants to use regex, he must write regex compilation code containing "#define JIT". This patch provides a unique regex compilation function according to the compilation options. In addition, the "regex.h" file checks the presence of the "#define PCRE_CONFIG_JIT" when "USE_PCRE_JIT" is enabled. If this flag is not present, the pcre lib doesn't support JIT and "#error" is emitted.	2013-10-14 14:42:50 +02:00
Thierry FOURNIER	e28f1ecf2b	BUILD/MINOR: missing header file In the header file "common/regex.h", the C keyword NULL is used. This keyword is referenced into the header file "stdlib.h", but this is not included.	2013-10-10 11:38:35 +02:00
Godbach	2b8fd54287	DOC: fix typo in comments Hi Willy, There is a patch to fix typo in comments, please check the attachment for you information. The commit log is as below: commit 9824d1b3740ac2746894f1aa611c795366c84210 Author: Godbach <nylzhaowei@gmail.com> Date: Mon Sep 30 11:05:42 2013 +0800 DOC: fix typo in comments 0x20000000 -> 0x40000000 vuf -> buf ethod -> Method Signed-off-by: Godbach <nylzhaowei@gmail.com> -- Best Regards, Godbach From 9824d1b3740ac2746894f1aa611c795366c84210 Mon Sep 17 00:00:00 2001 From: Godbach <nylzhaowei@gmail.com> Date: Mon, 30 Sep 2013 11:05:42 +0800 Subject: [PATCH] DOC: fix typo in comments 0x20000000 -> 0x40000000 vuf -> buf ethod -> Method Signed-off-by: Godbach <nylzhaowei@gmail.com>	2013-10-01 09:49:21 +02:00
Willy Tarreau	cc1e04b1e8	MINOR: tcp: add new "close" action for tcp-response This new action immediately closes the connection with the server when the condition is met. The first such rule executed ends the rules evaluation. The main purpose of this action is to force a connection to be finished between a client and a server after an exchange when the application protocol expects some long time outs to elapse first. The goal is to eliminate idle connections which take signifiant resources on servers with certain protocols.	2013-09-11 23:28:51 +02:00
Willy Tarreau	3a925c155d	MEDIUM: stick-tables: flush old entries upon soft-stop When a process with large stick tables is replaced by a new one and remains present until the last connection finishes, it keeps these data in memory for nothing since they will never be used anymore by incoming connections, except during syncing with the new process. This is especially problematic when dealing with long session protocols such as WebSocket as it becomes possible to stack many processes and eat a lot of memory. So the idea here is to know if a table still needs to be synced or not, and to purge all unused entries once the sync is complete. This means that after a few hundred milliseconds when everything has been synchronized with the new process, only a few entries will remain allocated (only the ones held by sessions during the restart) and all the remaining memory will be freed. Note that we carefully do that only after the grace period is expired so as not to impact a possible proxy that needs to accept a few more connections before leaving. Doing this required to add a sync counter to the stick tables, to know how many peer sync sessions are still in progress in order not to flush the entries until all synchronizations are completed.	2013-09-04 17:54:01 +02:00
Evan Broder	be55431f9f	MINOR: ssl: Add statement 'verifyhost' to "server" statements verifyhost allows you to specify a hostname that the remote server's SSL certificate must match. Connections that don't match will be closed with an SSL error.	2013-09-01 07:55:49 +02:00
Willy Tarreau	9f09521f2d	BUG/MEDIUM: unique_id: HTTP request counter must be unique! The HTTP request counter is incremented non atomically, which means that many requests can log the same ID. Let's increment it when it is consumed so that we avoid this case. This bug was reported by Patrick Hemmer. It's 1.5-specific and does not need to be backported.	2013-08-13 17:52:20 +02:00
Willy Tarreau	47060b6ae0	MINOR: cli: make it possible to enter multiple values at once with "set table" The "set table" statement allows to create new entries with their respective values. Till now it was limited to a single data type per line, requiring as many "set table" statements as the desired data types to be set. Since this is only a parser limitation, this patch gets rid of it. It also allows the creation of a key with no data types (all reset to their default values).	2013-08-01 21:17:19 +02:00
Willy Tarreau	b4c8493a9f	MINOR: session: make the number of stick counter entries more configurable In preparation of more flexibility in the stick counters, make their number configurable. It still defaults to 3 which is the minimum accepted value. Changing the value alone is not sufficient to get more counters, some bitfields still need to be updated and the TCP actions need to be updated as well, but this update tries to be easier, which is nice for experimentation purposes.	2013-08-01 21:17:14 +02:00
Willy Tarreau	cadd8c9ec3	MINOR: payload: split smp_fetch_rdp_cookie() This function is also called directly from backend.c, so let's stop building fake args to call it as a sample fetch, and have a lower layer more generic function instead.	2013-08-01 21:17:13 +02:00
Willy Tarreau	ef38c39287	MEDIUM: sample: systematically pass the keyword pointer to the keyword We're having a lot of duplicate code just because of minor variants between fetch functions that could be dealt with if the functions had the pointer to the original keyword, so let's pass it as the last argument. An earlier version used to pass a pointer to the sample_fetch element, but this is not the best solution for two reasons : - fetch functions will solely rely on the keyword string - some other smp_fetch_* users do not have the pointer to the original keyword and were forced to pass NULL. So finally we're passing a pointer to the keyword as a const char *, which perfectly fits the original purpose.	2013-08-01 21:17:13 +02:00
Godbach	a34bdc0ea4	BUG/MEDIUM: server: set the macro for server's max weight SRV_UWGHT_MAX to SRV_UWGHT_RANGE The max weight of server is 256 now, but SRV_UWGHT_MAX is still 255. As a result, FWRR will not work well when server's weight is 256. The description is as below: There are some macros related to server's weight in include/types/server.h: #define SRV_UWGHT_RANGE 256 #define SRV_UWGHT_MAX (SRV_UWGHT_RANGE - 1) #define SRV_EWGHT_MAX (SRV_UWGHT_MAX * BE_WEIGHT_SCALE) Since weight of server can be reach to 256 and BE_WEIGHT_SCALE equals to 16, the max eweight of server should be 25616 = 4096, it will exceed SRV_EWGHT_MAX which equals to SRV_UWGHT_MAXBE_WEIGHT_SCALE = 255*16 = 4080. When a server with weight 256 is insterted into FWRR tree during initialization, the key value of this server should be SRV_EWGHT_MAX - s->eweight = 4080 - 4096 = -16 which is closed to UINT_MAX in unsigned type, so the server with highest weight will be not elected as the first server to process request. In addition, it is a better choice to compare with SRV_UWGHT_MAX than a magic number 256 while doing check for the weight. The max number of servers for round-robin algorithm is also updated. Signed-off-by: Godbach <nylzhaowei@gmail.com>	2013-07-22 09:29:34 +02:00
Lukas Tribus	67db8df12b	MEDIUM: http: add IPv6 support for "set-tos" As per RFC3260 #4 and BCP37 #4.2 and #5.2, the IPv6 counterpart of TOS is "traffic class". Add support for IPv6 traffic class in "set-tos" by moving the "set-tos" related code to the new inline function inet_set_tos(), handling IPv4 (IP_TOS), IPv6 (IPV6_TCLASS) and IPv4-mapped sockets (IP_TOS, like ::ffff:127.0.0.1). Also define - if missing - the IN6_IS_ADDR_V4MAPPED() macro in include/common/compat.h for compatibility.	2013-06-23 18:01:38 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Godbach	430f291a99	CLEANUP: session: remove event_accept() which was not used anymore Remove event_accept() in include/proto/proto_http.h and use correct function name in other two files instead of event_accept(). Signed-off-by: Godbach <nylzhaowei@gmail.com>	2013-06-20 08:07:35 +02:00
Willy Tarreau	be4a3eff34	MEDIUM: counters: use sc0/sc1/sc2 instead of sc1/sc2/sc3 It was a bit inconsistent to have gpc start at 0 and sc start at 1, so make sc start at zero like gpc. No previous release was issued with sc3 anyway, so no existing setup should be affected.	2013-06-17 15:04:07 +02:00
Thierry FOURNIER	7eeb435494	CLEANUP: protect checks.h from multiple inclusions The types/checks.h include file isn't protected against multiple inclusions, so let's surround it with "#ifndef _TYPES_CHECKS_H/#endif" to fix this.	2013-06-14 19:53:02 +02:00
Thierry FOURNIER	d3879e8b57	CLEANUP: fix missing include <string.h> in proto/listener.h The file proto/listener.h makes use of strdup() but doesn't include <string.h> so it's sensible to include file ordering.	2013-06-14 19:52:17 +02:00
Willy Tarreau	4f0d919bd4	MEDIUM: tcp: add "tcp-request connection expect-proxy layer4" This configures the client-facing connection to receive a PROXY protocol header before any byte is read from the socket. This is equivalent to having the "accept-proxy" keyword on the "bind" line, except that using the TCP rule allows the PROXY protocol to be accepted only for certain IP address ranges using an ACL. This is convenient when multiple layers of load balancers are passed through by traffic coming from public hosts.	2013-06-11 20:40:55 +02:00
Willy Tarreau	51347ed94c	MEDIUM: http: add the "set-mark" action on http-request/http-response rules "set-mark" is used to set the Netfilter MARK on all packets sent to the client to the value passed in <mark> on platforms which support it. This value is an unsigned 32 bit value which can be matched by netfilter and by the routing table. It can be expressed both in decimal or hexadecimal format (prefixed by "0x"). This can be useful to force certain packets to take a different route (for example a cheaper network path for bulk downloads). This works on Linux kernels 2.6.32 and above and requires admin privileges.	2013-06-11 19:34:13 +02:00
Willy Tarreau	42cf39e3b9	MEDIUM: http: add support for "set-tos" in http-request/http-response This manipulates the TOS field of the IP header of outgoing packets sent to the client. This can be used to set a specific DSCP traffic class based on some request or response information. See RFC2474, 2597, 3260 and 4594 for more information.	2013-06-11 19:04:37 +02:00

... 3 4 5 6 7 ...

1706 Commits