haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-30 16:42:03 +00:00

Author	SHA1	Message	Date
Willy Tarreau	ba2c4459a5	MINOR: tools: add simple word fingerprinting to find similar-looking words This introduces two functions, one which creates a fingerprint of a word, and one which computes a distance between two words fingerprints. The fingerprint is made by counting the transitions between one character and another one. Here we consider the 26 alphabetic letters regardless of their case, then any digit as a digit, and anything else as "other". We also consider the first and last locations as transitions from begin to first char, and last char to end. The distance is simply the sum of the squares of the differences between two fingerprints. This way, doubling/ missing a letter has the same cost, however some repeated transitions such as "e"->"r" like in "server" are very unlikely to match against situations where they do not exist. This is a naive approach but it seems to work sufficiently well for now. It may be refined in the future if needed.	2021-03-12 14:13:21 +01:00
Willy Tarreau	133c8c412e	CLEANUP: actions: the keyword must always be const from the rule There's no reason for a rule to want to modify an action keyword, let's make sure it is always const.	2021-03-12 14:13:21 +01:00
Christopher Faulet	77e376783e	BUG/MINOR: proxy/session: Be sure to have a listener to increment its counters It is possible to have a session without a listener. It happens for applets on the client side. Thus all accesses to the listener info from the session must be guarded. It was the purpose of the commit 36119de18 ("BUG/MEDIUM: session: NULL dereference possible when accessing the listener"). However, some tests on the session's listener existence are missing in proxy_inc_* functions. This patch should fix the issues #1171, #1172, #1173, #1174 and #1175. It must be backported with the above commit as far as 1.8.	2021-03-12 09:25:45 +01:00
Willy Tarreau	3b728a92bb	BUILD: atomic/arm64: force the register pairs to use in __ha_cas_dw() Since commit f8fb4f75f ("MINOR: atomic: implement a more efficient arm64 __ha_cas_dw() using pairs"), on some modern arm64 (armv8.1+) compiled with -march=armv8.1-a under gcc-7.5.0, a build error may appear on ev_poll.o : /tmp/ccHD2lN8.s:1771: Error: reg pair must start from even reg at operand 1 -- `casp x27,x28,x22,x23,[x12]' Makefile:927: recipe for target 'src/ev_poll.o' failed It appears that the compiler cannot always assign register pairs there for a structure made of two u64. It was possibly later addressed since gcc-9.3 never caused this, but there's no trivially available info on the subject in the changelogs. Unsuprizingly, using a u128 instead does fix this, but it significantly inflates the code (+4kB for just 6 places, very likely that it loaded some extra stubs) and the comparison is ugly, involving two slower conditional jumps instead of a single one and a conditional comparison. For example, ha_random64() grew from 144 bytes to 232. However, simply forcing the base register does work pretty well, and makes the code even cleaner and more efficient by further reducing it by about 4.5kB, possibly because it helps the compiler to pick suitable registers for the pair there. And the perf on 64-cores looks steadily 0.5% above the previous one, so let's do this. Note that the commit above was backported to 2.3 to fix scalability issues on AWS Graviton2 platform, so this one will need to be as well.	2021-03-12 06:26:22 +01:00
Fr�d�ric L�caille	c0ed91910a	BUG/MINOR: connection: Missing QUIC initialization The QUIC connection struct connection member was not initialized. This may make randomly haproxy handle TLS connections as QUIC ones only when QUIC support is enabled leading to such OpenSSL errors (captured from a reg test output, TLS Client-Hello callback failed): OpenSSL error[0x10000085] OPENSSL_internal: CONNECTION_REJECTED OpenSSL error[0x10000410] OPENSSL_internal: SSLV3_ALERT_HANDSHAKE_FAILURE OpenSSL error[0x1000009a] OPENSSL_internal: HANDSHAKE_FAILURE_ON_CLIENT_HELLO This patch should fix #1168 github issue.	2021-03-10 12:21:05 +01:00
Willy Tarreau	060a761248	OPTIM: task: automatically adjust the default runqueue-depth to the threads The recent default runqueue size reduction appeared to have significantly lowered performance on low-thread count configs. Testing various values runqueue values on different workloads under thread counts ranging from 1 to 64, it appeared that lower values are more optimal for high thread counts and conversely. It could even be drawn that the optimal value for various workloads sits around 280/sqrt(nbthread), and probably has to do with both the L3 cache usage and how to optimally interlace the threads' activity to minimize contention. This is much easier to optimally configure, so let's do this by default now.	2021-03-10 11:15:34 +01:00
Daniel Corbett	befef70e23	BUG/MINOR: sample: Rename SenderComID/TargetComID to SenderCompID/TargetCompID The recently introduced Financial Information eXchange (FIX) converters have some hard coded tags based on the specification that were misspelled. Specifically, SenderComID and TargetComID should be SenderCompID and TargetCompID according to the specification [1][2]. This patch updates all references, which includes the converters themselves, the regression test, and the documentation. [1] https://fiximate.fixtrading.org/en/FIX.5.0SP2_EP264/tag49.html [2] https://fiximate.fixtrading.org/en/FIX.5.0SP2_EP264/tag56.html	2021-03-10 10:44:20 +01:00
Emeric Brun	4c75195f5b	BUG/MEDIUM: resolvers: handle huge responses over tcp servers. Parameter "accepted_payload_size" is currently considered regardless the used nameserver is using TCP or UDP. It remains mandatory to annouce such capability to support e-dns, so a value have to be announced also in TCP. Maximum DNS message size in TCP is limited by protocol to 65535 and so for UDP (65507) if system supports such UDP messages. But the maximum value for this option was arbitrary forced to 8192. This patch change this maximum to 65535 to allow user to set bigger value for UDP if its system supports. It also sets accepted_payload_size in TCP allowing to retrieve huge responses if the configuration uses TCP nameservers. The request announcing the accepted_payload_size capability is currently built at resolvers level and is common to all used nameservers of the section regardess transport protocol used. A further patch should be made to at least specify a different payload size depending of the transport, and perhaps could be forced to 65535 in case of TCP and maximum would be forced back to 65507 matching UDP max. This patch is appliable since 2.4 version	2021-03-09 15:44:46 +01:00
Willy Tarreau	e89fae3a4e	CLEANUP: stream: rename a few remaining occurrences of "stream *sess" These are some leftovers from the ancient code where they were still called sessions, but these areas in the code remain confusing due to this naming. They were now called "strm" which will not even affect indenting nor alignment.	2021-03-09 15:44:33 +01:00
Willy Tarreau	c93638e1d1	BUILD: connection: do not use VAR_ARRAY in struct tlv It was brought by commit `c44b8de99` ("CLEANUP: connection: Use `VAR_ARRAY` in `struct tlv` definition") but breaks the build with clang. Actually it had already been done 6 months ago by commit `4987a4744` ("CLEANUP: tree-wide: use VAR_ARRAY instead of [0] in various definitions") then reverted by commit `441b6c31e` ("BUILD: connection: fix build on clang after the VAR_ARRAY cleanup") which explained the same thing but didn't place a comment in the code to justify this (in short it's just an end of struct marker).	2021-03-09 10:15:16 +01:00
Willy Tarreau	018251667e	CLEANUP: config: make the cfg_keyword parsers take a const for the defproxy The default proxy was passed as a variable to all parsers instead of a const, which is not without risk, especially when some timeout parsers used to make some int pointers point to the default values for comparisons. We want to be certain that none of these parsers will modify the defaults sections by accident, so it's important to mark this proxy as const. This patch touches all occurrences found (89).	2021-03-09 10:09:43 +01:00
Willy Tarreau	82a92743fc	BUILD: bug: refine HA_LINK_ERROR() to only be used on gcc and derivatives TCC happens to define __OPTIMIZE__ at -O2 but doesn't proceed with dead code elimination, resulting in ha_free() to always reference the link error symbol. Let's condition this test on __GCC__ which others like Clang also define.	2021-03-09 10:09:43 +01:00
Tim Duesterhus	615f81eb5a	MINOR: connection: Use a `struct ist` to store proxy_authority This makes the code cleaner, because proxy_authority can be handled like proxy_unique_id.	2021-03-09 09:24:32 +01:00
Tim Duesterhus	002bd77a6e	CLEANUP: connection: Use istptr / istlen for proxy_unique_id Don't access the ist's fields directly, use the helper functions instead.	2021-03-09 09:24:32 +01:00
Tim Duesterhus	e004c2beae	CLEANUP: connection: Remove useless test for NULL before calling `pool_free()` `pool_free()` is a noop when the given pointer is NULL. No need to test.	2021-03-09 09:24:32 +01:00
Tim Duesterhus	c44b8de995	CLEANUP: connection: Use `VAR_ARRAY` in `struct tlv` definition This is for consistency with `struct tlv_ssl`.	2021-03-09 09:24:32 +01:00
Olivier Houchard	7b00e31509	BUILD: Fix build when using clang without optimizing. ha_free() uses code that attempts to set a non-existant variable to provoke a link-time error, with the expectation that the compiler will not omit that if the code is unreachable. However, clang will emit it when compiling with no optimization, so only do that if __OPTIMIZE__ is defined.	2021-03-05 16:58:56 +01:00
Willy Tarreau	eef7f7fe68	CLEANUP: server: reorder some fields in the server struct to respect cache lines There's currently quite some thread contention in the server struct because frequently fields accessed fields are mixed with those being often written to by any thread. Let's split this a little bit to separate a few areas: - pure config / admin / operating status (almost never changes) - idle and queuing (fast changes, done almost together) - LB (fast changes, not necessarily dependent on the above) - counters (fast changes, at a different instant again)	2021-03-05 15:00:24 +01:00
Willy Tarreau	d4e78d873c	MINOR: server: move actconns to the per-thread structure The actconns list creates massive contention on low server counts because it's in fact a list of streams using a server, all threads compete on the list's head and it's still possible to see some watchdog panics on 48 threads under extreme contention with 47 threads trying to add and one thread trying to delete. Moving this list per thread is trivial because it's only used by srv_shutdown_streams(), which simply required to iterate over the list. The field was renamed to "streams" as it's really a list of streams rather than a list of connections.	2021-03-05 15:00:24 +01:00
Willy Tarreau	430bf4a483	MINOR: server: allocate a per-thread struct for the per-thread connections stuff There are multiple per-thread lists in the listeners, which isn't the most efficient in terms of cache, and doesn't easily allow to store all the per-thread stuff. Now we introduce an srv_per_thread structure which the servers will have an array of, and place the idle/safe/avail conns tree heads into. Overall this was a fairly mechanical change, and the array is now always initialized for all servers since we'll put more stuff there. It's worth noting that the Lua code still has to deal with its own deinit by itself despite being in a global list, because its server is not dynamically allocated.	2021-03-05 15:00:24 +01:00
Willy Tarreau	198e92a8e5	MINOR: server: add a global list of all known servers It's a real pain not to have access to the list of all registered servers, because whenever there is a need to late adjust their configuration, only those attached to regular proxies are seen, but not the peers, lua, logs nor DNS. What this patch does is that new_server() will automatically add the newly created server to a global list, and it does so as well for the 1 or 2 statically allocated servers created for Lua. This way it will be possible to iterate over all of them.	2021-03-05 15:00:24 +01:00
Willy Tarreau	90e9b8c8b6	CLEANUP: global: reorder some fields to respect cache lines Some entries are atomically updated by various threads, such as the global counters, and they're mixed with others which are read all the time like the mode. This explains why "perf" was seeing a huge access cost on global.mode in process_stream()! Let's reorder them so that the static config stuff is at the beginning and the live stuff is at the end.	2021-03-05 08:30:08 +01:00
Willy Tarreau	cc2672f48b	MINOR: server: don't read curr_used_conns multiple times This one is added atomically and we reread it just after this, causing a second memory load that is visible in the perf profile.	2021-03-05 08:30:08 +01:00
Willy Tarreau	4f8cd4397f	MINOR: xprt: add new xprt_set_idle and xprt_set_used methods These functions are used on the mux layer to indicate that the connection is becoming idle and that the xprt ought to be careful before checking the context or that it's not idle anymore and that the context is safe. The purpose is to allow a mux which is going to release a connection to tell the xprt to be careful when touching it. At the moment, the xprt are always careful and that's costly so we want to have the ability to relax this a bit. No xprt layer uses this yet.	2021-03-05 08:30:08 +01:00
Willy Tarreau	6fa8bcdc78	MINOR: task: add an application specific flag to the state: TASK_F_USR1 This flag will be usable by any application. It will be preserved across wakeups so the application can use it to do various stuff. Some I/O handlers will soon benefit from this.	2021-03-05 08:30:08 +01:00
Willy Tarreau	144f84a09d	MEDIUM: task: extend the state field to 32 bits It's been too short for quite a while now and is now full. It's still time to extend it to 32-bits since we have room for this without wasting any space, so we now gained 16 new bits for future flags. The values were not reassigned just in case there would be a few hidden u16 or short somewhere in which these flags are placed (as it used to be the case with stream->pending_events). The patch is tagged MEDIUM because this required to update the task's process() prototype to use an int instead of a short, that's quite a bunch of places.	2021-03-05 08:30:08 +01:00
Willy Tarreau	e0d5942ddd	MINOR: task: move the nice field to the struct task only The nice field isn't needed anymore for the tasklet so we can move it from the TASK_COMMON area into the struct task which already has a hole around the expire entry.	2021-03-05 08:30:08 +01:00
Willy Tarreau	db4e238938	MINOR: task: stop abusing the nice field to detect a tasklet It's cleaner to use a flag from the task's state to detect a tasklet and it's even cheaper. One of the best benefits is that this will allow to get the nice field out of the common part since the tasklet doesn't need it anymore. This commit uses the last task bit available but that's temporary as the purpose of the change is to extend this.	2021-03-05 08:30:08 +01:00
Willy Tarreau	06e69b556c	REORG: tools: promote the debug PRNG to more general use as a statistical one We frequently need to access a simple and fast PRNG for statistical purposes. The debug_prng() function did exactly this using a xorshift generator but its use was limited to debug only. Let's move this to tools.h and tools.c to make it accessible everywhere. Since it needs to be fast, its state is thread-local. An initialization function starts a different initial value for each thread for better distribution.	2021-03-05 08:30:08 +01:00
Ubuntu	6fa9225628	CLEANUP: stream: explain why we queue the stream at the head of the server list In stream_add_srv_conn() MT_LIST_ADD() is used instead of MT_LIST_ADDQ(), resulting in the stream being queued at the end of the server list. This has no particular effect since we cannot dump the streams on a server, and this is only used by "shutdown sessions" on a server. But it also turns out to be significantly faster due to the shorter recovery from the conflict with an adjacent MT_LIST_DEL(), thus it remains desirable to use it, but at least it deserves a comment. In addition to this, it's worth mentioning that this list should creates extreme contention with threads while almost never used. It should be made per-thread just like the global streams list.	2021-03-05 08:30:08 +01:00
Willy Tarreau	f587003fe9	MINOR: pools: double the local pool cache size to 1 MB The reason is that H2 can already require 32 16kB buffers for the mux output at once, which will deplete the local cache. Thus it makes sense to go further to leave some time to other connection to release theirs. In addition, the L2 cache on modern CPUs is already 1 MB, so this change is welcome in any case.	2021-03-05 08:30:08 +01:00
Willy Tarreau	0bae075928	MEDIUM: pools: add CONFIG_HAP_NO_GLOBAL_POOLS and CONFIG_HAP_GLOBAL_POOLS We've reached a point where the global pools represent a significant bottleneck with threads. On a 64-core machine, the performance was divided by 8 between 32 and 64 H2 connections only because there were not enough entries in the local caches to avoid picking from the global pools, and the contention on the list there was very high. It becomes obvious that we need to have an array of lists, but that will require more changes. In parallel, standard memory allocators have improved, with tcmalloc and jemalloc finding their ways through mainstream systems, and glibc having upgraded to a thread-aware ptmalloc variant, keeping this level of contention here isn't justified anymore when we have both the local per-thread pool caches and a fast process-wide allocator. For these reasons, this patch introduces a new compile time setting CONFIG_HAP_NO_GLOBAL_POOLS which is set by default when threads are enabled with thread local pool caches, and we know we have a fast thread-aware memory allocator (currently set for glibc>=2.26). In this case we entirely bypass the global pool and directly use the standard memory allocator when missing objects from the local pools. It is also possible to force it at compile time when a good allocator is used with another setup. It is still possible to re-enable the global pools using CONFIG_HAP_GLOBAL_POOLS, if a corner case is discovered regarding the operating system's default allocator, or when building with a recent libc but a different allocator which provides other benefits but does not scale well with threads.	2021-03-05 08:30:08 +01:00
Ubuntu	f8fb4f75f1	MINOR: atomic: implement a more efficient arm64 __ha_cas_dw() using pairs There finally is a way to support register pairs on aarch64 assembly under gcc, it's just undocumented, like many of the options there :-( As indicated below, it's possible to pass "%H" to mention the high part of a register pair (e.g. "%H0" to go with "%0"): https://patchwork.ozlabs.org/project/gcc/patch/59368A74.2060908@foss.arm.com/ By making local variables from pairs of registers via a struct (as is used in IST for example), we can let gcc choose the correct register pairs and avoid a few moves in certain situations. The code is now slightly more efficient than the previous one on AWS' Graviton2 platform, and noticeably smaller (by 4.5kB approx). A few tests on older releases show that even Linaro's gcc-4.7 used to support such register pairs and %H, and by then ATOMICS were not supported so this should not cause build issues, and as such this patch replaces the earlier implementation.	2021-03-05 08:30:08 +01:00
Willy Tarreau	46cca86900	MINOR: atomic: add armv8.1-a atomics variant for cas-dw This variant uses the CASP instruction available on armv8.1-a CPU cores, which is detected when __ARM_FEATURE_ATOMICS is set (gcc-linaro >= 7, mainline >= 9). This one was tested on cortex-A55 (S905D3) and on AWS' Graviton2 CPUs. The instruction performs way better on high thread counts since it guarantees some forward progress when facing extreme contention while the original LL/SC approach is light on low-thread counts but doesn't guarantee progress. The implementation is not the most optimal possible. In particular since the instruction requires to work on register pairs and there doesn't seem to be a way to force gcc to emit register pairs, we have to decide to force to use the pair (x0,x1) to store the old value, and (x2,x3) to store the new one, and this necessarily involves some extra moves. But at least it does improve the situation with 16 threads and more. See issue #958 for more context. Note, a first implementation of this function was making use of an input/output constraint passed using "+Q"((void*)target), which was resulting in smaller overall code than passing "target" as an input register only. It turned out that the cause was directly related to whether the function was inlined or not, hence the "forceinline" attribute. Any changes to this code should still pay attention to this important factor.	2021-03-05 08:30:08 +01:00
Willy Tarreau	168fc5332c	BUG/MINOR: mt-list: always perform a cpu_relax call on failure On highly threaded machines it is possible to occasionally trigger the watchdog on certain contended areas like the server's connection list, because while the mechanism inherently cannot guarantee a constant progress, it lacks CPU relax calls which are absolutely necessary in this situation to let a thread finish its job. The loop's "while (1)" was changed to use a "for" statement calling __ha_cpu_relax() as its continuation expression. This way the "continue" statements jump to the unique place containing the pause without excessively inflating the code. This was sufficient to definitely fix the problem on 64-core ARM Graviton2 machines. This patch should probably be backported once it's confirmed it also helps on many-cores x86 machines since some people are facing contention in these environments. This patch depends on previous commit "REORG: atomic: reimplement pl_cpu_relax() from atomic-ops.h". An attempt was made to first read the value before exchanging, and it significantly degraded the performance. It's very likely that this caused other cores to lose exclusive ownership on their line and slow down their next xchg operation. In addition it was found that MT_LIST_ADD is significantly faster than MT_LIST_ADDQ under high contention, because it fails one step earlier when conflicting with an adjacent MT_LIST_DEL(). It might be worth switching some operations' order to favor MT_LIST_ADDQ() instead.	2021-03-05 08:30:08 +01:00
Willy Tarreau	958ae26c35	REORG: atomic: reimplement pl_cpu_relax() from atomic-ops.h There is some confusion here as we need to place some cpu_relax statements in some loops where it's not easily possible to condition them on the use of threads. That's what atomic.h already does. So let's take the various pl_cpu_relax() implementations from there and place them in atomic.h under the name __ha_cpu_relax() and let them adapt to the presence or absence of threads and to the architecture (currently only x86 and aarch64 use a barrier instruction), though it's very likely that arm would work well with a cache flushing ISB instruction as well). This time they were implemented as expressions returning 1 rather than statements, in order to ease their placement as the loop condition or the continuation expression inside "for" loops. We should probably do the same with barriers and a few such other ones.	2021-03-05 08:30:08 +01:00
Amaury Denoyelle	8ede3db080	MINOR: backend: handle reuse for conns with no server as target If dispatch mode or transparent backend is used, the backend connection target is a proxy instead of a server. In these cases, the reuse of backend connections is not consistent. With the default behavior, no reuse is done and every new request uses a new connection. However, if http-reuse is set to never, the connection are stored by the mux in the session and can be reused for future requests in the same session. As no server is used for these connections, no reuse can be made outside of the session, similarly to http-reuse never mode. A different http-reuse config value should not have an impact. To achieve this, mark these connections as private to have a defined behavior. For this feature to properly work, the connection hash has been slightly adjusted. The server pointer as an input as been replaced by a generic target pointer to refer to the server or proxy instance. The hash is always calculated on connect_server even if the connection target is not a server. This also requires to allocate the connection hash node for every backend connections, not just the one with a server target.	2021-03-03 11:31:19 +01:00
Frédéric Lécaille	b28812af7a	BUILD: quic: Implicit conversion between SSL related enums. Fix such compilation issues: include/haproxy/quic_tls.h:157:10: error: implicit conversion from 'enum ssl_encryption_level_t' to 'enum quic_tls_enc_level' [-Werror=enum-conversion] 157 \| return ssl_encryption_application; \| ^~~~~~~~~~~~~~~~~~~~~~~~~~ src/xprt_quic.c: In function 'quic_conn_enc_level_init': src/xprt_quic.c:2358:13: error: implicit conversion from 'enum quic_tls_enc_level' to 'enum ssl_encryption_level_t' [-Werror=enum-conversion] 2358 \| qel->level = quic_to_ssl_enc_level(level); \| ^ Not detected by all the compilators.	2021-03-02 10:34:18 +01:00
Willy Tarreau	61cfdf4fd8	CLEANUP: tree-wide: replace free(x);x=NULL with ha_free(&x) This makes the code more readable and less prone to copy-paste errors. In addition, it allows to place some __builtin_constant_p() predicates to trigger a link-time error in case the compiler knows that the freed area is constant. It will also produce compile-time error if trying to free something that is not a regular pointer (e.g. a function). The DEBUG_MEM_STATS macro now also defines an instance for ha_free() so that all these calls can be checked. 178 occurrences were converted. The vast majority of them were handled by the following Coccinelle script, some slightly refined to better deal with "&*x" or with long lines: @ rule @ expression E; @@ - free(E); - E = NULL; + ha_free(&E); It was verified that the resulting code is the same, more or less a handful of cases where the compiler optimized slightly differently the temporary variable that holds the copy of the pointer. A non-negligible amount of {free(str);str=NULL;str_len=0;} are still present in the config part (mostly header names in proxies). These ones should also be cleaned for the same reasons, and probably be turned into ist strings.	2021-02-26 21:21:09 +01:00
Christopher Faulet	29e9326f2f	CLEANUP: hlua: Use net_addr structure internally to parse and compare addresses hlua_addr structure may be replaced by net_addr structure to parse and compare addresses. Both structures are similar.	2021-02-26 13:53:26 +01:00
Christopher Faulet	5d1def623a	MEDIUM: http-ana: Add IPv6 support for forwardfor and orignialto options A network may be specified to avoid header addition for "forwardfor" and "orignialto" option via the "except" parameter. However, only IPv4 networks/addresses are supported. This patch adds the support of IPv6. To do so, the net_addr structure is used to store the parameter value in the proxy structure. And ipcmp2net() function is used to perform the comparison. This patch should fix the issue #1145. It depends on the following commit: * c6ce0ab MINOR: tools: Add function to compare an address to a network address * 5587287 MINOR: tools: Add net_addr structure describing a network addess	2021-02-26 13:52:48 +01:00
Christopher Faulet	9553de7fec	MINOR: tools: Add function to compare an address to a network address ipcmp2net() function may be used to compare an addres (struct sockaddr_storage) to a network address (struct net_addr). Among other things, this function will be used to add support of IPv6 for "except" parameter of "forwardfor" and "originalto" options.	2021-02-26 13:52:06 +01:00
Christopher Faulet	01f02a4d84	MINOR: tools: Add net_addr structure describing a network addess The net_addr structure describes a IPv4 or IPv6 address. Its ip and mask are represented. Among other things, this structure will be used to add support of IPv6 for "except" parameter of "forwardfor" and "originalto" options.	2021-02-26 13:32:17 +01:00
Willy Tarreau	401135cee6	MINOR: task: add one extra tasklet class: TL_HEAVY This class will be used exclusively for heavy processing tasklets. It will be cleaner than mixing them with the bulk ones. For now it's allocated ~1% of the CPU bandwidth. The largest part of the patch consists in re-arranging the fields in the task_per_thread structure to preserve a clean alignment with one more list head. Since we're now forced to increase the struct past a second cache line, it now uses 4 cache lines (for easy multiplying) with the first two ones being exclusively used by local operations and the third one mostly by atomic operations. Interestingly, this better arrangement causes less stress and reduced the response time by 8 microseconds at 1 million requests per second.	2021-02-26 12:00:53 +01:00
Willy Tarreau	d8aa21a611	CLEANUP: server: rename srv_cleanup_{idle,toremove}_connections() These function names are unbearably long, they don't even fit into the screen in "show profiling", let's trim the "_connections" to "_conns", which happens to match the name of the lists there.	2021-02-26 00:30:22 +01:00
Willy Tarreau	74dea8caea	MINOR: task: limit the number of subsequent heavy tasks with flag TASK_HEAVY While the scheduler is priority-aware and class-aware, and consistently tries to maintain fairness between all classes, it doesn't make use of a fine execution budget to compensate for high-latency tasks such as TLS handshakes. This can result in many subsequent calls adding multiple milliseconds of latency between the various steps of other tasklets that don't even depend on this. An ideal solution would be to add a 4th queue, have all tasks announce their estimated cost upfront and let the scheduler maintain an auto- refilling budget to pick from the most suitable queue. But it turns out that a very simplified version of this already provides impressive gains with very tiny changes and could easily be backported. The principle is to reserve a new task flag "TASK_HEAVY" that indicates that a task is expected to take a lot of time without yielding (e.g. an SSL handshake typically takes 700 microseconds of crypto computation). When the scheduler sees this flag when queuing a tasklet, it will place it into the bulk queue. And during dequeuing, we accept only one of these in a full round. This means that the first one will be accepted, will not prevent other lower priority tasks from running, but if a new one arrives, then the queue stops here and goes back to the polling. This will allow to collect more important updates for other tasks that will be batched before the next call of a heavy task. Preliminary tests consisting in placing this flag on the SSL handshake tasklet show that response times under SSL stress fell from 14 ms before the patch to 3.0 ms with the patch, and even 1.8 ms if tune.sched.low-latency is set to "on".	2021-02-26 00:25:51 +01:00
Christopher Faulet	69beaa91d5	REORG: server: Export and rename some functions updating server info Some static functions are now exported and renamed to follow the same pattern of other exported functions. Here is the list : * update_server_fqdn: Renamed to srv_update_fqdn and exported * update_server_check_addr_port: renamed to srv_update_check_addr_port and exported * update_server_agent_addr_port: renamed to srv_update_agent_addr_port and exported * update_server_addr: renamed to srv_update_addr * update_server_addr_potr: renamed to srv_update_addr_port * srv_prepare_for_resolution: exported This change is mandatory to move all functions dealing with the server-state files in a separate file.	2021-02-25 10:02:39 +01:00
Christopher Faulet	ecfb9b9109	MEDIUM: server: Store parsed params of a server-state line in the tree Parsed parameters are now stored in the tree of server-state lines. This way, a line from the global server-state file is only parsed once. Before, it was parsed a first time to store it in the tree and one more time to load the server state. To do so, the server-state line object must be allocated before parsing a line. This means its size must no longer depend on the length of first parsed parameters (backend and server names). Thus the node type was changed to use a hashed key instead of a string.	2021-02-25 10:02:39 +01:00
Christopher Faulet	6d87c58fb4	CLEANUP: server: Rename state_line structure into server_state_line The structure used to store a server-state line in an eb-tree has a too generic name. Instead of state_line, the structure is renamed as server_state_line.	2021-02-25 10:02:39 +01:00
Christopher Faulet	fcb53fbb58	CLEANUP: server: Rename state_line node to node instead of name_name <state_line.name_name> field is a node in an eb-tree. Thus, instead of "name_name", we now use "node" to name this field. If is a more explicit name and not too strange.	2021-02-25 10:02:39 +01:00

1 2 3 4 5 ...

4890 Commits