haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-02-01 19:13:36 +00:00

Author	SHA1	Message	Date
Baptiste Assmann	bb77c8e26d	MINOR: tools: function my_memmem() to lookup binary contents This function simply looks for a memory block inside another one. Signed-off-by: Baptiste Assmann <bedis9@gmail.com>	2013-12-06 11:50:47 +01:00
Willy Tarreau	126d40691a	MINOR: tools: add a generic binary hex string parser We currently use such an hex parser in pat_parse_bin() to parse hex string patterns. We'll need another generic one so let's move it to standard.c and have pat_parse_bin() make use of it.	2013-12-06 11:50:47 +01:00
Thierry FOURNIER	d559dd8390	MINOR: tools: Add a function to convert buffer to an ipv6 address The inet_pton function needs an input string with a final \0. This function copies the input string to a temporary buffer, adds the final \0 and converts to address.	2013-12-02 23:31:32 +01:00
Simon Horman	58c32978b2	MEDIUM: Set rise and fall of agent checks to 1 This is achieved by moving rise and fall from struct server to struct check. After this move the behaviour of the primary check, server->check is unchanged. However, the secondary agent check, server->agent now has independent rise and fall values each of which are set to 1. The result is that receiving "fail", "stopped" or "down" just once from the agent will mark the server as down. And receiving a weight just once will allow the server to be marked up if its primary check is in good health. This opens up the scope to allow the rise and fall values of the agent check to be configurable, however this has not been implemented at this stage. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-25 07:31:16 +01:00
Willy Tarreau	a0f4271497	MEDIUM: backend: add support for the wt6 hash This function was designed for haproxy while testing other functions in the past. Initially it was not planned to be used given the not very interesting numbers it showed on real URL data : it is not as smooth as the other ones. But later tests showed that the other ones are extremely sensitive to the server count and the type of input data, especially DJB2 which must not be used on numeric input. So in fact this function is still a generally average performer and it can make sense to merge it in the end, as it can provide an alternative to sdbm+avalanche or djb2+avalanche for consistent hashing or when hashing on numeric data such as a source IP address or a visitor identifier in a URL parameter.	2013-11-14 16:37:50 +01:00
Bhaskar	98634f0c7b	MEDIUM: backend: Enhance hash-type directive with algorithm options Summary: In testing at tumblr, we found that using djb2 hashing instead of the default sdbm hashing resulted in better workload distribution to our backends. This commit implements a change that allows the user to specify the hash function they want to use. It does not limit itself to consistent hashing scenarios. The supported hash functions are sdbm (default), and djb2. For a discussion of the feature and analysis, see mailing list thread "Consistent hashing alternative to sdbm" : http://marc.info/?l=haproxy&m=138213693909219 Note: This change does NOT make changes to new features, for instance, applying an avalanche hashing always being performed before applying consistent hashing.	2013-11-14 16:37:50 +01:00
Thierry FOURNIER	ef37a66628	CLEANUP: The function "regex_exec" needs the string length but in many cases callers expect a null terminated char. If haproxy is compiled with the USE_PCRE_JIT option, the length of the string is used. If it is compiled without this option the function doesn't use the length and expects a null terminated string. The prototype of the function is ambiguous, and depends on the compilation option. The developer can think that the length is always used, and many bugs can be created. This patch makes sure that the length is used. The regex_exec function adds the final '\0' if it is needed.	2013-10-23 12:19:51 +02:00
Thierry FOURNIER	ed5a4aefae	CLEANUP: regex: Create regex_comp function that compiles regex using compilation options The current file "regex.h" defines an abstraction for the regex. It provides the same struct name and the same "regexec" function for the 3 regex types supported: standard libc, basic pcre and jit pcre. The regex compilation function is not provided by this file. If the developer wants to use regex, he must write regex compilation code containing "#define JIT". This patch provides a unique regex compilation function according to the compilation options. In addition, the "regex.h" file checks the presence of the "#define PCRE_CONFIG_JIT" when "USE_PCRE_JIT" is enabled. If this flag is not present, the pcre lib doesn't support JIT and "#error" is emitted.	2013-10-14 14:42:50 +02:00
Thierry FOURNIER	e28f1ecf2b	BUILD/MINOR: missing header file In the header file "common/regex.h", the C keyword NULL is used. This keyword is referenced into the header file "stdlib.h", but this is not included.	2013-10-10 11:38:35 +02:00
Willy Tarreau	47060b6ae0	MINOR: cli: make it possible to enter multiple values at once with "set table" The "set table" statement allows to create new entries with their respective values. Till now it was limited to a single data type per line, requiring as many "set table" statements as the desired data types to be set. Since this is only a parser limitation, this patch gets rid of it. It also allows the creation of a key with no data types (all reset to their default values).	2013-08-01 21:17:19 +02:00
Willy Tarreau	b4c8493a9f	MINOR: session: make the number of stick counter entries more configurable In preparation of more flexibility in the stick counters, make their number configurable. It still defaults to 3 which is the minimum accepted value. Changing the value alone is not sufficient to get more counters, some bitfields still need to be updated and the TCP actions need to be updated as well, but this update tries to be easier, which is nice for experimentation purposes.	2013-08-01 21:17:14 +02:00
Lukas Tribus	67db8df12b	MEDIUM: http: add IPv6 support for "set-tos" As per RFC3260 #4 and BCP37 #4.2 and #5.2, the IPv6 counterpart of TOS is "traffic class". Add support for IPv6 traffic class in "set-tos" by moving the "set-tos" related code to the new inline function inet_set_tos(), handling IPv4 (IP_TOS), IPv6 (IPV6_TCLASS) and IPv4-mapped sockets (IP_TOS, like ::ffff:127.0.0.1). Also define - if missing - the IN6_IS_ADDR_V4MAPPED() macro in include/common/compat.h for compatibility.	2013-06-23 18:01:38 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Willy Tarreau	2b57cb8f30	MEDIUM: protocol: implement a "drain" function in protocol layers Since commit `cfd97c6f` was merged into 1.5-dev14 (BUG/MEDIUM: checks: prevent TIME_WAITs from appearing also on timeouts), some valid health checks sometimes used to show some TCP resets. For example, this HTTP health check sent to a local server : 19:55:15.742818 IP 127.0.0.1.16568 > 127.0.0.1.8000: S 3355859679:3355859679(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742841 IP 127.0.0.1.8000 > 127.0.0.1.16568: S 1060952566:1060952566(0) ack 3355859680 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742863 IP 127.0.0.1.16568 > 127.0.0.1.8000: . ack 1 win 257 19:55:15.745402 IP 127.0.0.1.16568 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:15.745488 IP 127.0.0.1.8000 > 127.0.0.1.16568: FP 1:146(145) ack 23 win 257 19:55:15.747109 IP 127.0.0.1.16568 > 127.0.0.1.8000: R 23:23(0) ack 147 win 257 After some discussion with Chris Huang-Leaver, it appeared clear that what we want is to only send the RST when we have no other choice, which means when the server has not closed. So we still keep SYN/SYN-ACK/RST for pure TCP checks, but don't want to see an RST emitted as above when the server has already sent the FIN. The solution against this consists in implementing a "drain" function at the protocol layer, which, when defined, causes as much as possible of the input socket buffer to be flushed to make recv() return zero so that we know that the server's FIN was received and ACKed. On Linux, we can make use of MSG_TRUNC on TCP sockets, which has the benefit of draining everything at once without even copying data. On other platforms, we read up to one buffer of data before the close. If recv() manages to get the final zero, we don't disable lingering. Same for hard errors. Otherwise we do. In practice, on HTTP health checks we generally find that the close was pending and is returned upon first recv() call. 
The network trace becomes cleaner : 19:55:23.650621 IP 127.0.0.1.16561 > 127.0.0.1.8000: S 3982804816:3982804816(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650644 IP 127.0.0.1.8000 > 127.0.0.1.16561: S 4082139313:4082139313(0) ack 3982804817 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650666 IP 127.0.0.1.16561 > 127.0.0.1.8000: . ack 1 win 257 19:55:23.651615 IP 127.0.0.1.16561 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:23.651696 IP 127.0.0.1.8000 > 127.0.0.1.16561: FP 1:146(145) ack 23 win 257 19:55:23.652628 IP 127.0.0.1.16561 > 127.0.0.1.8000: F 23:23(0) ack 147 win 257 19:55:23.652655 IP 127.0.0.1.8000 > 127.0.0.1.16561: . ack 24 win 257 This change should be backported to 1.4 which is where Chris encountered this issue. The code is different, so probably the tcp_drain() function will have to be put in the checks only.	2013-06-10 20:33:23 +02:00
Willy Tarreau	bf43f6eb32	MINOR: defaults: allow REQURI_LEN and CAPTURE_LEN to be redefined Some users want to change these default settings but it was not convenient.	2013-06-10 10:30:09 +02:00
Pieter Baauw	1eb7592bba	MINOR: tproxy: add support for OpenBSD OpenBSD uses (SOL_SOCKET, SO_BINDANY) to enable transparent proxy on a socket. This patch adds support for the relevant setsockopt() calls.	2013-05-11 08:03:50 +02:00
Pieter Baauw	ff30b6667b	MINOR: tproxy: add support for FreeBSD FreeBSD uses (IPPROTO_IP, IP_BINDANY) and (IPPROTO_IPV6, IPV6_BINDANY) to enable transparent proxy on a socket. This patch adds support for the relevant setsockopt() calls.	2013-05-11 08:03:43 +02:00
Pieter Baauw	d551fb5a8d	REORG: tproxy: prepare the transparent proxy defines for accepting other OSes This patch does not change the logic of the code, it only changes the way OS-specific defines are tested. At the moment the transparent proxy code heavily depends on Linux-specific defines. This first patch introduces a new define "CONFIG_HAP_TRANSPARENT" which is set every time the defines used by transparent proxy are present. This also means that with an up-to-date libc, it should not be necessary anymore to force CONFIG_HAP_LINUX_TPROXY during the build, as the flags will automatically be detected. The CTTPROXY flags still remain separate because this older API doesn't work the same way. A new line has been added in the version output for haproxy -vv to indicate what transparent proxy support is available.	2013-05-11 08:03:37 +02:00
Willy Tarreau	dd11293e84	BUG/MINOR: acl: fix a double free during exit when using PCRE_JIT When freeing ACL regex, we don't want to perform the free() in regex_free() as it's already performed in free_pattern(). The double free only happens when using PCRE_JIT when freeing everything during exit so it's harmless but exhibits libc errors during a reload/restart. Bug reported by Seri.	2013-05-08 18:14:18 +02:00
de Lafond Guillaume	88c278fadf	MEDIUM: stats: add proxy name filtering on the statistic page This patch adds a "scope" box in the statistics page in order to display only proxies with a name that contains the requested value. The scope filter is preserved across all clicks on the page.	2013-04-15 22:50:33 +02:00
Lukas Tribus	0999f7662c	BUILD: add explicit support for TFO with USE_TFO TCP Fast Open is supported in server mode since Linux 3.7, but current libc's don't define TCP_FASTOPEN=23. Introduce the new USE flag USE_TFO to define it manually in compat.h. Also note this in the TFO related documentation.	2013-04-02 17:40:43 +02:00
Willy Tarreau	0161d62d23	OPTIM: http: improve branching in chunk size parser By tweaking a bit some conditions in http_parse_chunk_size(), we could improve the overall performance in the worst case by 15%.	2013-04-02 02:00:57 +02:00
Willy Tarreau	bf43927cd7	OPTIM: buffer: remove one jump in buffer_count() We can help gcc build an expression not involving a jump. This function is used a lot when parsing chunks.	2013-04-02 01:25:57 +02:00
Hiroaki Nakamura	7035132349	MEDIUM: regex: Use PCRE JIT in acl This is a patch for using PCRE JIT in acl. I noticed regexes are used in other places, but they are more complicated to modify to use PCRE APIs. So I focused on acl in the first try. BTW, I made a simple benchmark program for PCRE JIT beforehand. https://github.com/hnakamur/pcre-jit-benchmark I read the manual for PCRE JIT http://www.manpagez.com/man/3/pcrejit/ and wrote my benchmark program. https://github.com/hnakamur/pcre-jit-benchmark/blob/master/test-pcre.c	2013-04-02 00:02:54 +02:00
Willy Tarreau	dad36a3ee3	MAJOR: tools: support environment variables in addresses Now that all addresses are parsed using str2sa_range(), it becomes easy to add support for environment variables and use them everywhere an address is needed. Environment variables are used as $VAR or ${VAR} as in shell. Any number of variables may compose an address, allowing various fantasies such as "fd@${FD_HTTP}" or "${LAN_DC1}.1:80". These ones are usable in logs, bind, servers, peers, stats socket, source, dispatch, and check address.	2013-03-11 01:30:02 +01:00
Willy Tarreau	24709286fe	MEDIUM: tools: support specifying explicit address families in str2sa_range() This change allows one to force the address family in any address parsed by str2sa_range() by specifying it as a prefix followed by '@' then the address. Currently supported address prefixes are 'ipv4@', 'ipv6@', 'unix@'. This also helps forcing resolving for host names (when getaddrinfo is used), and force the family of the empty address (eg: 'ipv4@' = 0.0.0.0 while 'ipv6@' = ::). The main benefit is that unix sockets can now get a local name without being forced to begin with a slash. This is useful during development as it is no longer necessary to have stats socket sent to /tmp.	2013-03-10 22:46:55 +01:00
Willy Tarreau	c120c8d347	CLEANUP: minor cleanup in str2sa_range() and str2ip() Don't use a statically allocated address both for str2ip and str2sa_range, use the same. The inet and unix code paths have been split a little better to improve readability.	2013-03-10 21:36:31 +01:00
Willy Tarreau	add0ab1975	CLEANUP: tools: remove str2sun() which is not used anymore.	2013-03-08 14:04:54 +01:00
Willy Tarreau	d393a628bb	MINOR: tools: prepare str2sa_range() to accept a prefix We'll need str2sa_range() to support a prefix for unix sockets. Since we don't always want to use it (eg: stats socket), let's not take it unconditionally from global but let the caller pass it.	2013-03-08 14:04:54 +01:00
Willy Tarreau	df350f1f48	MINOR: tools: prepare str2sa_range() to return an error message We'll need str2sa_range() to return address parsing errors if we want to extend its functionalities. Let's do that now even though it's not used yet.	2013-03-08 14:04:53 +01:00
Emeric Brun	6924ef8b12	BUG/MEDIUM: ssl: ECDHE ciphers not usable without named curve configured. The fix consists in using prime256v1 as the default named curve to init ECDHE ciphers if none is configured.	2013-03-06 19:08:26 +01:00
Willy Tarreau	4f4b18b2ec	BUILD/MINOR: syscall: add definition of NR_accept4 for ARM This platform was not covered and older libc do not provide accept4().	2013-03-04 07:38:08 +01:00
Willy Tarreau	b26cc86b1c	BUG/MINOR: syscall: fix NR_accept4 system call on sparc/linux An invalid copy-paste called it NR_splice instead of NR_accept4. This does not lead to real issues because if this define is used, then the code cannot compile since NR_accept4 is still missing.	2013-03-04 07:31:08 +01:00
Willy Tarreau	d4448bc836	MEDIUM: tools: make str2sa_range support all address syntaxes Right now we have multiple methods for parsing IP addresses in the configuration. This is quite painful. This patch aims at adapting str2sa_range() to make it support all formats, so that the callers perform the appropriate tests on the return values. str2sa() was changed to simply return str2sa_range(). The output values are now the following ones (taken from the comment on top of the function). Converts <str> to a locally allocated struct sockaddr_storage, and a port range or offset consisting in two integers that the caller will have to check to find the relevant input format. The following formats are supported :
  String format | address | port    | low    | high
  addr          | <addr>  | 0       | 0      | 0
  addr:         | <addr>  | 0       | 0      | 0
  addr:port     | <addr>  | <port>  | <port> | <port>
  addr:pl-ph    | <addr>  | <pl>    | <pl>   | <ph>
  addr:+port    | <addr>  | <port>  | 0      | <port>
  addr:-port    | <addr>  | -<port> | <port> | 0
The detection of a port range or increment by the caller is made by comparing <low> and <high>. If both are equal, then port 0 means no port was specified. The caller may pass NULL for <low> and <high> if it is not interested in retrieving port ranges. Note that <addr> above may also be :
  - empty ("") => family will be AF_INET and address will be INADDR_ANY
  - "*" => family will be AF_INET and address will be INADDR_ANY
  - "::" => family will be AF_INET6 and address will be IN6ADDR_ANY
  - a host name => family and address will depend on host name resolving.	2013-02-20 17:29:30 +01:00
Simon Horman	5269cfb458	BUG/MINOR: Correct logic in cut_crlf() This corrects what appears to be logic errors in cut_crlf(). I assume that the intention of this function is to truncate a string at the first cr or lf. However, currently lf are ignored. Also use '\0' instead of 0 as the null character, a cosmetic change. Cc: Krzysztof Piotr Oledzki <ole@ans.pl> Signed-off-by: Simon Horman <horms@verge.net.au> [WT: this fix may be backported to 1.4 too]	2013-02-13 10:52:40 +01:00
Willy Tarreau	eab777c32e	BUG/MINOR: time: frequency counters are not totally accurate When a frontend is rate-limited to 1000 connections per second, the effective rate measured from the client is 999/s, and connections experience an average response time of 99.5 ms with a standard deviation of 2 ms. The reason for this inaccuracy is that when computing frequency counters, we use one part of the previous value proportional to the number of milliseconds remaining in the current second. But even the last millisecond still uses a part of the past value, which is wrong : since we have a 1ms resolution, the last millisecond must be dedicated only to filling the current second. So we slightly adjust the algorithm to use 999/1000 of the past value during the first millisecond, and 0/1000 of the past value during the last millisecond. We also slightly improve the computation by computing the remaining time instead of the current time in tv_update_date(), so that we don't have to negate the value in each frequency counter. Now with the fix, the connection rate measured by both the client and haproxy is a steady 1000/s, the average response time measured is 99.2ms and more importantly, the standard deviation has been divided by 3 to 0.6 millisecond. This fix should also be backported to 1.4 which has the same issue.	2012-12-29 21:50:07 +01:00
Willy Tarreau	56adcf2cc9	MINOR: tools: simplify the use of the int to ascii macros These macros (U2H, U2A, LIM2A, ...) have been used with an explicit index for the local storage variable, making it difficult to change log formats and causing a few issues from time to time. Let's have a single macro with a rotating index so that up to 10 conversions may be used in a single call.	2012-12-23 21:46:30 +01:00
Willy Tarreau	47ca54505c	MINOR: chunks: centralize the trash chunk allocation At the moment, we need trash chunks almost everywhere and the only correctly implemented one is in the sample code. Let's move this to the chunks so that all other places can use this allocator. Additionally, the get_trash_chunk() function now really returns two different chunks. Previously it used to always overwrite the same chunk and point it to a different buffer, which was a bit tricky because it's not obvious that two consecutive results do alias each other.	2012-12-23 21:46:07 +01:00
Willy Tarreau	37994f034c	MINOR: standard: add a simple popcount function This function returns the number of ones in a word.	2012-11-19 12:12:24 +01:00
Emeric Brun	4663577e24	MINOR: build: allow packagers to specify the ssl cache size This is done by passing the default value to SSLCACHESIZE in sessions. User can use tune.sslcachesize to change this value. By default, it is set to 20000 sessions as openssl internal cache size. Currently, a session entry size is between 592 and 616 bytes depending on the arch.	2012-11-15 10:52:19 +01:00
Willy Tarreau	e9f49e78fe	MAJOR: polling: replace epoll with sepoll and remove sepoll Now that all pollers make use of speculative I/O, there is no point having two epoll implementations, so replace epoll with the sepoll code and remove sepoll which has just become the standard epoll method.	2012-11-11 20:53:30 +01:00
Willy Tarreau	ed7f836f07	BUG/MINOR: stream_interface: don't loop over ->snd_buf() It is stupid to loop over ->snd_buf() because the snd_buf() itself already loops and stops when system buffers are full. But looping again onto it, we lose the information of the full buffers and perform one useless syscall. Furthermore, this causes issues when dealing with large uploads while waiting for a connection to establish, as it can report a server reject of some data as a connection abort, which is wrong. 1.4 does not have this issue as it loops maximum twice (once for each buffer half) and exits as soon as system buffers are full. So no backport is needed.	2012-10-29 23:30:33 +01:00
Willy Tarreau	7780473c3b	CLEANUP: replace chunk_printf() with chunk_appendf() This function's naming was misleading as it is used to append data at the end of a string, causing some surprises when used for the first time! Add a chunk_printf() function which does what its name suggests.	2012-10-29 16:14:26 +01:00
Willy Tarreau	c26ac9deea	MINOR: chunk: add a function to reset a chunk This is a first step in avoiding to constantly reinitialize chunks. It replaces the old chunk_reset() which was not properly named as it used to drop everything and was only used by chunk_destroy(). It has been renamed chunk_drop().	2012-10-29 13:33:42 +01:00
Yuxans Yao	4e25b015a7	MINOR: log: add '%Tl' to log-format The '%Tl' is similar to '%T', but using local timezone.	2012-10-29 11:55:26 +01:00
Willy Tarreau	422a0a5161	MINOR: tools: add a clear_addr() function to unset an address This will be used to unset a from address.	2012-10-26 20:04:26 +02:00
Willy Tarreau	3dd0c4e20e	OPTIM: tools: inline hex2i() This tiny function was not inlined because initially not much used. However it's been used in the chunk parser for a while and it became one of the biggest CPU-cycle eaters there. By inlining it, the chunk parser speed was increased by 74 %. We're almost 3 times faster than original with just the last 4 commits.	2012-10-26 01:13:24 +02:00
Willy Tarreau	ad8f8e8ffb	MINOR: chunk: provide string compare functions It's sometimes needed to be able to compare a zero-terminated string with a chunk, so we now have two functions to do that, one strcmp() equivalent and one strcasecmp() equivalent.	2012-10-19 15:18:06 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	7fca87fd9d	BUILD: accept4: move the socketcall declaration outside of accept4() Gcc 4.2.4 breaks on the syscall declared inside the function, move it outside and declare it static inline.	2012-10-10 17:42:39 +02:00

279 Commits