haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-18 09:24:31 +00:00

Author	SHA1	Message	Date
Aurelien DARRAGON	72514a4467	MEDIUM: tools/ip: v4tov6() and v6tov4() rework v4tov6() and v6tov4() helper function were initially implemented in `4f92d3200` ("[MEDIUM] IPv6 support for stick-tables"). However, since `ceb4ac9c3` ("MEDIUM: acl: support IPv6 address matching") support for legacy ip6 to ip4 conversion formats were added, with the parsing logic directly performed in acl_match_ip (which later became pat_match_ip) The issue is that the original v6tov4() function which is used for sample expressions handling lacks those additional formats, so we could face inconsistencies whether we rely on ip4/ip6 conversions from an acl context or an expression context. To unify ip4/ip6 automatic mapping behavior, we reworked v4tov6 and v6tov4 functions so that they now behave like in pat_match_ip() function. Note: '6to4 (RFC3056)' and 'RFC4291 ipv4 compatible address' formats are still supported for legacy purposes despite being deprecated for a while now.	2023-09-21 09:50:55 +02:00
Christopher Faulet	d3e379b3ce	BUG/MEDIUM: http-ana: Try to handle response before handling server abort In the request analyser responsible to forward the request, we try to detect the server abort to stop the request forwarding. However, we must be careful to not block the response processing, if any. Indeed, it is possible to get the response and the server abort in same time. In this case, we must try to forward the response to the client first. So to fix the issue, in the request analyser we no longer handle the server abort if the response channel is not empty. In the end, the response analyser is able to detect the server abort if it is relevant. Otherwise, the stream will be woken up after the response forwarding and the server abort should be handled at this stage. This patch should be backported as far as 2.7 only because the risk of breakage is high. And it is probably a good idea to wait a bit before backporting it.	2023-09-21 09:36:37 +02:00
Willy Tarreau	cbbee15462	CLEANUP: ring: rename the ring lock "RING_LOCK" instead of "LOGSRV_LOCK" The ring lock was initially mostly used for the logs and used to inherit its name in lock stats. Now that it's exclusively used by rings, let's rename it accordingly.	2023-09-20 21:38:33 +02:00
Willy Tarreau	cec8b42cb3	MEDIUM: logs: atomically check and update the log sample index The log server lock is pretty visible in perf top when using log samples because it's taken for each server in turn while trying to validate and update the log server's index. Let's change this for a CAS, since we have the index and the range at hand now. This allow us to remove the logsrv lock. The test on 4 servers now shows a 3.7 times improvement thanks to much lower contention. Without log sampling a test producing 4.4M logs/s delivers 4.4M logs/s at 21 CPUs used, everything spent in the kernel. After enabling 4 samples (1:4, 2:4, 3:4 and 4:4), the throughput would previously drop to 1.13M log/s with 37 CPUs used and 75% spent in process_send_log(). Now with this change, 4.25M logs/s are emitted, using 26 CPUs and 22% in process_send_log(). That's a 3.7x throughput improvement for a 30% global CPU usage reduction, but in practice it mostly shows that the performance drop caused by having samples is much less noticeable (each of the 4 servers has its index updated for each log). Note that in order to even avoid incrementing an index for each log srv that is consulted, it would be more convenient to have a single index per frontend and apply the modulus on each log server in turn to see if the range has to be updated. It would then only perform one write per range switch. However the place where this is done doesn't have access to a frontend, so some changes would need to be performed for this, and it would require to update the current range independently in each logsrv, which is not necessarily easier since we don't know yet if we can commit it.	2023-09-20 21:38:33 +02:00
Willy Tarreau	e00470378b	MINOR: logs: use a single index to store the current range and index By using a single long long to store both the current range and the next index, we'll make it possible to perform atomic operations instead of locking. Let's only regroup them for now under a new "curr_rg_idx". The upper word is the range, the lower is the index.	2023-09-20 21:38:33 +02:00
Willy Tarreau	49ddc0138c	CLEANUP: logs: rename a confusing local variable "curr_rg" to "smp_rg" The variable curr_rg in process_send_log() is misleading because it is not related to the integer curr_rg that's used to calculate it, instead it's a pointer to the current smp_log_range from smp_rgs[], so let's call it "smp_rg" as a singular for this "smp_rgs" and put an end to this confusion.	2023-09-20 21:38:33 +02:00
Willy Tarreau	3f1284560f	MINOR: log: remove the unused curr_idx in struct smp_log_range This index is useless because it only serves to know when the global index reached the end, while the global one already knows it. Let's just drop it and perform the test on the global range. It was verified with the following config that the first server continues to take 1/10 of the traffic, the 2nd one 2/10, the 3rd one 3/10 and the 4th one 4/10: log 127.0.0.1:10001 sample 1:10 local0 log 127.0.0.1:10002 sample 2,5:10 local0 log 127.0.0.1:10003 sample 3,7,9:10 local0 log 127.0.0.1:10004 sample 4,6,8,10:10 local0	2023-09-20 21:38:33 +02:00
Willy Tarreau	4351364700	MINOR: logs: clarify the check of the log range The test of the log range is not very clear, in part due to the reuse of the "curr_idx" name that happens at two levels. The call to in_smp_log_range() applies to the smp_info's index to which 1 is added: it verifies that the next index is still within the current range. Let's just have a local variable "next_index" in process_send_log() that gets assigned the next index (current+1) and compare it to the current range's boundaries. This makes the test much clearer. We can then simply remove in_smp_log_range() that's no longer needed.	2023-09-20 21:38:33 +02:00
William Lallemand	61b6a4da6c	REGTESTS: ssl: skip generate-certificates test w/ wolfSSL WolfSSL does not seem to work correctly with the generate-certificates features. This patch disables it temporarly. ssl-max-ver TLSv1.2 seems to be a problem in the reg-test and wolfSSL but without it it's not able to generate correctly the cert: * h1 debug\|00000004:clear-lst.accept(0007)=0028 from [127.0.0.1:35956] ALPN=<none> * h1 debug\|00000004:clear-lst.clireq[0028:ffffffff]: GET / HTTP/1.1 * h1 debug\|00000004:clear-lst.clihdr[0028:ffffffff]: x-sni: unknown-sni.com * h1 debug\|00000004:clear-lst.clihdr[0028:ffffffff]: host: 127.0.0.1 * h1 debug\|fd[0x29] OpenSSL error[0x13d] : need the private key * h1 debug\|<134>Sep 20 15:42:58 haproxy[165743]: unix:1 [20/Sep/2023:15:42:58.042] ssl-lst/1: SSL handshake failure (need the private key) ** dT 1.072 * h1 debug\|fd[0x2a] OpenSSL error[0x13d] : need the private key * h1 debug\|<134>Sep 20 15:42:59 haproxy[165743]: unix:1 [20/Sep/2023:15:42:59.044] ssl-lst/1: SSL handshake failure (need the private key) dT 2.075 * h1 debug\|fd[0x29] OpenSSL error[0x13d] : need the private key * h1 debug\|<134>Sep 20 15:43:00 haproxy[165743]: unix:1 [20/Sep/2023:15:43:00.046] ssl-lst/1: SSL handshake failure (need the private key) dT 3.079 * h1 debug\|fd[0x29] OpenSSL error[0x13d] : need the private key * h1 debug\|<134>Sep 20 15:43:01 haproxy[165743]: unix:1 [20/Sep/2023:15:43:01.050] ssl-lst/1: SSL handshake failure (need the private key) dT 3.080 * h1 debug\|00000004:default_backend.clicls[0028:0023] * h1 debug\|00000004:default_backend.closed[0028:0023] * h1 debug\|<134>Sep 20 15:43:01 haproxy[165743]: 127.0.0.1:35956 [20/Sep/2023:15:42:58.042] clear-lst default_backend/s1 0/0/-1/-1/+3009 503 +217 - - SC-- 3/1/0/0/3 0/0 "GET / HTTP/1.1" 0/-/-/-/0 -/-/- ** c3 rxhdr\|HTTP/1.1 503 Service Unavailable\r c3 rxhdr\|content-length: 107\r c3 rxhdr\|cache-control: no-cache\r c3 rxhdr\|content-type: text/html\r ** c3 rxhdr\|\r	2023-09-20 16:02:16 +02:00
William Lallemand	64a4b44a44	REGTESTS: ssl: skip OCSP test w/ WolfSSL The OCSP test does not seem to be working correctly with wolfSSL. i2d_OCSP_CERTID(data->ocsp_cid, NULL); always returns 0. Skip it for now.	2023-09-20 15:23:32 +02:00
Aurelien DARRAGON	2c9bd3ae80	BUG/MINOR: server: add missing free for server->rdr_pfx rdr_pfx was not being free during server cleanup, leading to small memory leak when "redir" argument was used on a server line (HTTP only). This should be backported to every stable versions. [For 2.6 and 2.7: the free should be performed in srv_drop() directly. For older versions: free in deinit() function near the free for the cookie string]	2023-09-15 17:46:49 +02:00
Willy Tarreau	6cbb5a057b	Revert "MAJOR: import: update mt_list to support exponential back-off" This reverts commit `c618ed5ff4`. The list iterator is broken. As found by Fred, running QUIC single- threaded shows that only the first connection is accepted because the accepter relies on the element being initialized once detached (which is expected and matches what MT_LIST_DELETE_SAFE() used to do before). However while doing this in the quic_sock code seems to work, doing it inside the macro show total breakage and the unit test doesn't work anymore (random crashes). Thus it looks like the fix is not trivial, let's roll this back for the time it will take to fix the loop.	2023-09-15 17:13:43 +02:00
William Lallemand	694889ac2d	BUILD: quic: fix build on centos 8 and USE_QUIC_OPENSSL_COMPAT When using USE_QUIC_OPENSSL_COMPAT=1 on centos-8 the build fail this way: In file included from src/quic_openssl_compat.c:11: /usr/include/openssl/kdf.h:33:46: error: unknown type name 'va_list' int EVP_KDF_vctrl(EVP_KDF_CTX *ctx, int cmd, va_list args); This is because of openssl/kdf.h being include before openssl-compat.h	2023-09-14 16:26:58 +02:00
Christopher Faulet	89e20033c7	BUG/MAJOR: mux-h2: Report a protocol error for any DATA frame before headers If any DATA frame is received before all headers are fully received, a protocol error must be reported. It is required by the HTTP/2 RFC but it is also important because the HTTP analyzers expect the first HTX block is a start-line. It leads to a crash if this statement is not respected. For instance, it is possible to trigger a crash by sending an interim message with a DATA frame (It may be an empty DATA frame with the ES flag). AFAIK, only the server side is affected by this bug. To fix the issue, an protocol error is reported for the stream. This patch should fix the issue #2291. It must be backported as far as 2.2 (and probably to 2.0 too).	2023-09-14 11:39:39 +02:00
Willy Tarreau	e3b2704e26	BUG/MINOR: freq_ctr: fix possible negative rate with the scaled API In 1.9 with commit `627505d36` ("MINOR: freq_ctr: add swrate_add_scaled() to work with large samples") we got the ability to indicate when adding some values that they represent a number of samples. However there is an issue in the calculation which is that the number of samples that is added to the sum before the division in order to avoid fading away too fast, is multiplied by the scale. The problem it causes is that this is done in the negative part of the expression, and that as soon if the sum of old_sum and v*s is too small (e.g. zero), we end up with a negative value of -s. This is visible in "show pools" which occasionally report a very large value on "needed_avg" since 2.9, though the bug has been there for longer. Indeed in 2.9 since they're hashed in buckets, it suffices that any thread got one such error once for the sum to be wrong. One possible impact is memory usage not shrinking after a short burst due to pools refraining from releasing objects, believing they don't have enough. This must be backported to all versions. Note that the opportunistic version can be dropped before 2.8.	2023-09-14 11:09:07 +02:00
William Lallemand	148f145d32	DOC: configuration: add %[query] to %HQ add %[query] to the alternative sample fetch for the logs	2023-09-13 15:56:23 +02:00
Fr�d�ric L�caille	3921bf80c7	BUG/MINOR: quic: Leak of frames to send. In very rare cases, it is possible that packet are detected as lost, their frames requeued, then the connection is released without releasing for any reason (to be killed because of a sendto() fatal failure for instance. Such frames are lost and never release because the function which release their packet number spaces does not release the frames which are still enqueued to be send. Must be backported as far as 2.6.	2023-09-13 15:32:14 +02:00
William Lallemand	2a19f7c43b	DOC: configuration: add %[req.ver] sample to %HV add %[req.ver] to the alternative sample fetch for the logs	2023-09-13 14:54:50 +02:00
William Lallemand	c7424a1bac	MINOR: samples: implement bytes_in and bytes_out samples %[bytes_in] and %[bytes_out] are equivalent to %U and %B tags in log-format.	2023-09-13 14:54:50 +02:00
Willy Tarreau	5abbae2d3d	CLEANUP: pools: simplify the pool expression when no pool was matched in dump When dumping pool information, we make a special case of the condition where the pool couldn't be identified and we consider that it was the correct one. In the code arrangements brought by commit `efc46dede` ("DEBUG: pools: inspect pools on fatal error and dump information found"), a ternary expression for testing this depends on the "if" block condition so this can be simplified and will make Coverity happy. This was reported in GH #2290.	2023-09-13 13:31:41 +02:00
Willy Tarreau	c618ed5ff4	MAJOR: import: update mt_list to support exponential back-off The new mt_list code supports exponential back-off on conflict, which is important for use cases where there is contention on a large number of threads. The API evolved a little bit and required some updates: - mt_list_for_each_entry_safe() is now in upper case to explicitly show that it is a macro, and only uses the back element, doesn't require a secondary pointer for deletes anymore. - MT_LIST_DELETE_SAFE() doesn't exist anymore, instead one just has to set the list iterator to NULL so that it is not re-inserted into the list and the list is spliced there. One must be careful because it was usually performed before freeing the element. Now instead the element must be nulled before the continue/break. - MT_LIST_LOCK_ELT() and MT_LIST_UNLOCK_ELT() have always been unclear. They were replaced by mt_list_cut_around() and mt_list_connect_elem() which more explicitly detach the element and reconnect it into the list. - MT_LIST_APPEND_LOCKED() was only in haproxy so it was left as-is in list.h. It may however possibly benefit from being upstreamed. This required tiny adaptations to event_hdl.c and quic_sock.c. The test case was updated and the API doc added. Note that in order to keep include files small, the struct mt_list definition remains in list-t.h (par of the internal API) and was ifdef'd out in mt_list.h. A test on QUIC with both quictls 1.1.1 and wolfssl 5.6.3 on ARM64 with 80 threads shows a drastic reduction of CPU usage thanks to this and the refined memory barriers. Please note that the CPU usage on OpenSSL 3.0.9 is significantly higher due to the excessive use of atomic ops by openssl, but 3.1 is only slightly above 1.1.1 though: - before: 35 Gbps, 3.5 Mpps, 7800% CPU - after: 41 Gbps, 4.2 Mpps, 2900% CPU	2023-09-13 11:50:33 +02:00
Christopher Faulet	13fb7170be	BUG/MEDIUM: master/cli: Pin the master CLI on the first thread of the group 1 There is no reason to start the master CLI on several threads and on several groups. And in fact, it must not be done otherwise the same FD is inserted several times in the fdtab, leading to a crash during startup because of a BUG_ON(). It happens when several groups are configured. To fix the bug the master CLI is now pinned on the first thread of the first group. This patch should fix the issue #2259 and must be backported to 2.8.	2023-09-13 10:26:32 +02:00
Cedric Paillet	3790eb4f85	BUG/MINOR: promex: fix backend_agg_check_status When a server is in maintenance, the check.status is no longer updated. Therefore, we shouldn't consider check.status if the checks are not active. This check was properly implemented in the haproxy_server_check_status metric, but wasn't carried over to backend_agg_check_status, which introduced inconsistencies between the two metrics. [cf: This patch must be backported as far as 2.4]	2023-09-12 19:50:17 +02:00
Christopher Faulet	665703d456	BUG/MEDIUM: mux-fcgi: Don't swap trash and dbuf when handling STDERR records trahs chunks are buffers but not allocated from the buffers pool. And the "trash" chunk is static and thread-local. It is two reason to not swap it with a regular buffer allocated from the buffers pool. Unfortunatly, it is exactly what is performed in the FCGI mux when a STDERR record is handled. b_xfer() is used to copy data from the demux buffer to the trash to format the error message. A zeor-copy via a swap may be performed. In this case, this leads to a memory corruption and a crash because, some time later, the demux buffer is released because it is empty. And it is in fact the trash chunk. b_force_xfer() must be used instead. This function forces the copy. This patch must be backported as far as 2.2. For 2.4 and 2.2, b_force_xfer() does not exist. For these versions, the following commit must be backported too: * `c7860007cc` ("MINOR: buf: Add b_force_xfer() function")	2023-09-12 19:50:17 +02:00
Aurelien DARRAGON	1115fc348e	BUG/MINOR: hlua/init: coroutine may not resume itself It's not supported to call lua_resume with <L> and <from> designating the same lua coroutine. It didn't cause visible bugs so far because Lua 5.3 used to be more permissive about this, and moreover, yielding is not involved during the hlua init state. But this is wrong usage, and the doc clearly specifies that the <from> argument can be NULL when there is no such coroutine, which is the case here. This should be backported in every stable versions.	2023-09-12 19:50:17 +02:00
Aurelien DARRAGON	e7281f3f5d	BUG/MEDIUM: hlua: don't pass stale nargs argument to lua_resume() In hlua_ctx_resume(), we call lua_resume() function like this: lua_resume(lua->T, hlua_states[lua->state_id], lua->nargs) Once the call returns, we may call the function again with the same hlua context when E_YIELD is returned (the execution was interrupted and may be resumed through another lua_resume() call). The 3rd argument to lua_resume(), 'nargs', is a hint passed to Lua to know how many (optional) arguments were pushed on the stack prior to resuming the execution (arguments that Lua will then expose to the Lua script). But here is the catch: we never reset lua->nargs between successive lua_resume() calls, meaning that next lua_resume() calls will still inherit from the initial nargs value that was set in hlua ctx prior to calling hlua_ctx_resume() (our wrapper function) for the first time. This is problematic, because despite not being explicitly mentioned in the Lua documentation, passed arguments (to which `nargs` refer to), are already consumed once lua_resume() returns. This means that we cannot keep calling lua_resume() with non-zero nargs if we don't push new arguments on the stack prior to resuming lua after the initial call: nargs is proper to a single lua_resume() invocation. Despite improper use of lua_resume() for a long time, this didn't cause visible issues in the past with Lua 5.3, but it is particularly sensitive starting with Lua 5.4.3 due to debugging hooks improvements that led to some internal changes (see: lua/lua@58aa09a). Not using nargs properly now exposes us to undefined behavior when resuming after a yield triggered from a debugging hook, which may cause running scripts to crash unexpectedly: for instance with Lua raising errors and complaining about values being NULL where it should not be the case. For reference, this issue was initially raised on the Lua mailing list: http://lua-users.org/lists/lua-l/2023-09/msg00005.html In this patch, we immediately reset nargs when lua_resume() returns to prevent any misuse. It should be backported to every maintained versions.	2023-09-12 19:50:17 +02:00
Willy Tarreau	93c2ea0ec3	MEDIUM: pools: refine pool size rounding The pools sizes were rounded up a little bit too much with commit `30f931ead` ("BUG/MEDIUM: pools: fix the minimum allocation size"). The goal was in fact to make sure they were always at least large enough to store 2 list heads, and stuffing this into the alignment calculation resulted in the size being always rounded up to this size. This is problematic because it means that the appended tag at the end doesn't always catch potential overflows since more bytes than needed are allocated. Moreover, this test was later reinforced by commit `b5ba09ed5` ("BUG/MEDIUM: pools: ensure items are always large enough for the pool_cache_item"), proving that the first test was not always sufficient. This needs to be reworked to proceed correctly: - the two lists are needed when the object is in the cache, hence when we don't care about the tag, which means that the tag's size, if any, can easily cover for the missing bytes to reach that size. This is actually what was already being checked for. - the rounding should not be performed (beyond the size of a word to preserve pointer alignment) when pool tagging is enabled, otherwise we don't detect small overflows. It means that there will be less merging when proceeding like this. Tests show that we merge 93 pools into 36 without tags and 43 with tags enabled. - the rounding should not consider the extra size, since it's already done when calculating the allocated size later (i.e. don't round up twice). The difference is subtle but it's what makes sure the tag immediately follows the area instead of starting from the end. Thanks to this, now when writing one byte too many at the end of a struct stream, the error is instantly caught.	2023-09-12 18:14:05 +02:00
Willy Tarreau	61575769ac	DEBUG: pools: print the contents surrounding the expected tag location When no tag matches a known pool, we can inspect around to help figure what could have possibly overwritten memory. The contents are printed one machine word per line in hex, then using printable characters, and when they can be resolved to a pointer, either the pool's pointer name or a resolvable symbol with offset. The goal here is to help recognize what is easily identifiable in memory. For example applying the following patch to stream_free(): - pool_free(pool_head_stream, s); + pool_free(pool_head_stream, (void*)s+1); Causes the following dump to be emitted: FATAL: pool inconsistency detected in thread 1: tag mismatch on free(). caller: 0x59e968 (stream_free+0x6d8/0xa0a) item: 0x13df5c1 pool: 0x12782c0 ('stream', size 888, real 904, users 1) Tag does not match (0x4f00000000012782). Tag does not match any other pool. Contents around address 0x13df5c1+888=0x13df939: 0x13df918 [00 00 00 00 00 00 00 00] [........] 0x13df920 [00 00 00 00 00 00 00 00] [........] 0x13df928 [00 00 00 00 00 00 00 00] [........] 0x13df930 [00 00 00 00 00 00 00 00] [........] 0x13df938 [c0 82 27 01 00 00 00 00] [..'.....] [pool:stream] 0x13df940 [4f c0 59 00 00 00 00 00] [O.Y.....] [stream_new+0x4f/0xbec] 0x13df948 [49 46 49 43 41 54 45 2d] [IFICATE-] 0x13df950 [81 02 00 00 00 00 00 00] [........] 0x13df958 [df 13 00 00 00 00 00 00] [........] Other possible callers: (...) We notice that the tag references pool_head_stream with the allocation point in stream_new. Another benefit is that a caller may be figured from the tag even if the "caller" feature is not enabled, because upon a free() we always put the caller's location into the tag. This should be sufficient to debug most cases that normally require gdb.	2023-09-12 18:14:05 +02:00
Willy Tarreau	0f9a10c7f1	DEBUG: pools: also print the value of the tag when it doesn't match Sometimes the tag's value may reveal a recognizable pattern, so let's print it when it doesn't match a known pool.	2023-09-12 18:14:05 +02:00
Willy Tarreau	96c1a24224	DEBUG: pools: also print the item's pointer when crashing It's important to inspect a core or recognize some values to have the item pointer, it was not provided.	2023-09-12 18:14:05 +02:00
Fr�d�ric L�caille	84757e32e6	BUG/MEDIUM: quic: quic_cc_conn ->cntrs counters unreachable This bug arrived with this commit in 2.9-dev3: MEDIUM: quic: Allow the quic_conn memory to be asap released. When sending packets from quic_cc_conn_io_cb(), e.g. when the quic_conn object has been released and replaced by a lighter one (quic_cc_conn), some counters may have to be incremented. But they were not reachable because not shared between quic_conn and quic_cc_conn struct. To fix this, one has only to move the ->cntrs counters from quic_conn to QUIC_CONN_COMMON struct which is shared between both quic_cc_conn Thank you to Tristan for having reported this in GH #2247. No need to backport.	2023-09-12 18:13:36 +02:00
Willy Tarreau	efc46dede9	DEBUG: pools: inspect pools on fatal error and dump information found It's a bit frustrating sometimes to see pool checks catch a bug but not provide exploitable information without a core. Here we're adding a function "pool_inspect_item()" which is called just before aborting in pool_check_pattern() and POOL_DEBUG_CHECK_MARK() and which will display the error type, the pool's pointer and name, and will try to check if the item's tag matches the pool, and if not, will iterate over all pools to see if one would be a better candidate, then will try to figure the last known caller and possibly other likely candidates if the pool's tag is not sufficiently trusted. This typically helps better diagnose corruption in use-after-free scenarios, or freeing to a pool that differs from the one the object was allocated from, and will also indicate calling points that may help figure where an object was last released or allocated. The info is printed on stderr just before the backtrace. For example, the recent off-by-one test in the PPv2 changes would have produced the following output in vtest logs: * h1 debug\|FATAL: pool inconsistency detected in thread 1: tag mismatch on free(). * h1 debug\| caller: 0x62bb87 (conn_free+0x147/0x3c5) * h1 debug\| pool: 0x2211ec0 ('pp_tlv_256', size 304, real 320, users 1) * h1 debug\|Tag does not match. Possible origin pool(s): * h1 debug\| tag: @0x2565530 = 0x2216740 (pp_tlv_128, size 176, real 192, users 1) * h1 debug\|Recorded caller if pool 'pp_tlv_128': *** h1 debug\| @0x2565538 (+0184) = 0x62c76d (conn_recv_proxy+0x4cd/0xa24) A mismatch in the allocated/released pool is already visible, and the callers confirm it once resolved, where the allocator indeed allocates from pp_tlv_128 and conn_free() releases to pp_tlv_256: $ addr2line -spafe ./haproxy <<< $'0x62bb87\n0x62c76d' 0x000000000062bb87: conn_free at connection.c:568 0x000000000062c76d: conn_recv_proxy at connection.c:1177	2023-09-11 15:46:14 +02:00
Willy Tarreau	f6bee5a50b	DEBUG: pools: make pool_check_pattern() take a pointer to the pool This will be useful to report detailed bug traces.	2023-09-11 15:19:49 +02:00
Willy Tarreau	e92e96b00f	DEBUG: pools: pass the caller pointer to the check functions and macros In preparation for more detailed pool error reports, let's pass the caller pointers to the check functions. This will be useful to produce messages indicating where the issue happened.	2023-09-11 15:19:49 +02:00
Willy Tarreau	baf2070421	DEBUG: pools: always record the caller for uncached allocs as well When recording the caller of a pool_alloc(), we currently store it only when the object comes from the cache and never when it comes from the heap. There's no valid reason for this except that the caller's pointer was not passed to pool_alloc_nocache(), so it used to set NULL there. Let's just pass it down the chain.	2023-09-11 15:19:49 +02:00
Frédéric Lécaille	2dedbe76c9	BUG/MINOR: quic: fdtab array underflow access When using the listener socket as file descriptor, qc->fd value is -1. In this case one must not access fdtab[qc->fd] element to change its value. This bug could have been detected by asan with such a backtrace: ================================================================= ==402222==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x7fa8ecf417ex7fa8e915cf90 sp 0x7fa8e915cf88 WRITE of size 8 at 0x7fa8ecf417e8 thread T6 #0 0x55707a0bf18a in qc_new_cc_conn src/quic_conn.c:838 #1 0x55707a0c6dc0 in quic_conn_release src/quic_conn.c:1408 #2 0x55707a10916f in quic_close src/xprt_quic.c:35 #3 0x55707a0cec77 in conn_xprt_close include/haproxy/connection.h:153 #4 0x55707a0ceed0 in conn_full_close include/haproxy/connection.h:197 #5 0x55707a0ec253 in qcc_release src/mux_quic.c:2412 #6 0x55707a0ec7d0 in qcc_io_cb src/mux_quic.c:2443 #7 0x55707a63ff2a in run_tasks_from_lists src/task.c:596 #8 0x55707a641cc9 in process_runnable_tasks src/task.c:876 #9 0x55707a56f7b2 in run_poll_loop src/haproxy.c:2954 #10 0x55707a5705fd in run_thread_poll_loop src/haproxy.c:3153 #11 0x7fa8f9450ea6 in start_thread nptl/pthread_create.c:477 #12 0x7fa8f936ea2e in __clone (/lib/x86_64-linux-gnu/libc.so.6+0xfba2e) 0x7fa8ecf417e8 is located 24 bytes to the left of 134217728-byte region [0x7fa8e allocated by thread T0 here: #0 0x7fa8f9a37037 in __interceptor_calloc ../../../../src/libsanitizer/asan/ #1 0x55707a71a61d in init_pollers src/fd.c:1161 #2 0x55707a56cdf1 in init src/haproxy.c:2672 #3 0x55707a5714c2 in main src/haproxy.c:3298 #4 0x7fa8f9296d09 in __libc_start_main ../csu/libc-start.c:308 Thread T6 created by T0 here: #0 0x7fa8f99e22a2 in __interceptor_pthread_create ../../../../src/libsanitizpp:214 #1 0x55707a748a21 in setup_extra_threads src/thread.c:252 #2 0x55707a5735c9 in main src/haproxy.c:3844 #3 0x7fa8f9296d09 in __libc_start_main ../csu/libc-start.c:308 SUMMARY: AddressSanitizer: heap-buffer-overflow src/quic_conn.c:838 in qc_new_cc Shadow bytes around the buggy address: 0x0ff59d9e02a0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0ff59d9e02b0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0ff59d9e02c0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0ff59d9e02d0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0ff59d9e02e0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa =>0x0ff59d9e02f0: fa fa fa fa fa fa fa fa fa fa fa fa fa[fa]fa fa 0x0ff59d9e0300: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0ff59d9e0310: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0ff59d9e0320: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0ff59d9e0330: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0ff59d9e0340: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 Shadow byte legend (one shadow byte represents 8 application bytes): Addressable: 00 Partially addressable: 01 02 03 04 05 06 07 Heap left redzone: fa Freed heap region: fd Stack left redzone: f1 Stack mid redzone: f2 Stack right redzone: f3 Stack after return: f5 Stack use after scope: f8 Global redzone: f9 Global init order: f6 Poisoned by user: f7 Container overflow: fc Array cookie: ac Intra object redzone: bb ASan internal: fe Left alloca redzone: ca Right alloca redzone: cb Shadow gap: cc ==402222==ABORTING Aborted Thank you to @Tristan971 for having reported this bug in GH #2247. No need to backport.	2023-09-11 15:14:22 +02:00
Willy Tarreau	8b7841ff7a	[RELEASE] Released version 2.9-dev5 Released version 2.9-dev5 with the following main changes : - BUG/MEDIUM: mux-h2: fix crash when checking for reverse connection after error - BUILD: import: guard plock.h against multiple inclusion - BUILD: pools: import plock.h to build even without thread support - BUG/MINOR: ssl/cli: can't find ".crt" files when replacing a certificate - BUG/MINOR: stream: protect stream_dump() against incomplete streams - DOC: config: mention uid dependency on the tune.quic.socket-owner option - MEDIUM: capabilities: enable support for Linux capabilities - CLEANUP/MINOR: connection: Improve consistency of PPv2 related constants - MEDIUM: connection: Generic, list-based allocation and look-up of PPv2 TLVs - MEDIUM: sample: Add fetch for arbitrary TLVs - MINOR: sample: Refactor fc_pp_authority by wrapping the generic TLV fetch - MINOR: sample: Refactor fc_pp_unique_id by wrapping the generic TLV fetch - MINOR: sample: Add common TLV types as constants for fc_pp_tlv - MINOR: ssl_sock: avoid iterating realloc(+1) on stored context - DOC: ssl: add some comments about the non-obvious session allocation stuff - CLEANUP: ssl: keep a pointer to the server in ssl_sock_init() - MEDIUM: ssl_sock: always use the SSL's server name, not the one from the tid - MEDIUM: server/ssl: place an rwlock in the per-thread ssl server session - MINOR: server/ssl: maintain an index of the last known valid SSL session - MINOR: server/ssl: clear the shared good session index on failure - MEDIUM: server/ssl: pick another thread's session when we have none yet - MINOR: activity: report the current run queue size - BUG/MINOR: checks: do not queue/wake a bounced check - MINOR: checks: start the checks in sleeping state - MINOR: checks: pin the check to its thread upon wakeup - MINOR: check: remember when we migrate a check - MINOR: check/activity: collect some per-thread check activity stats - MINOR: checks: maintain counters of active checks per thread - MINOR: check: also consider the random other thread's active checks - MEDIUM: checks: search more aggressively for another thread on overload - MEDIUM: checks: implement a queue in order to limit concurrent checks - MINOR: checks: also consider the thread's queue for rebalancing - DEBUG: applet: Properly report opposite SC expiration dates in traces - BUG/MEDIUM: stconn: Update stream expiration date on blocked sends - BUG/MINOR: stconn: Don't report blocked sends during connection establishment - BUG/MEDIUM: stconn: Wake applets on sending path if there is a pending shutdown - BUG/MEDIUM: stconn: Don't block sends if there is a pending shutdown - BUG/MINOR: quic: Possible skipped RTT sampling - MINOR: quic: Add a trace to quic_release_frm() - BUG/MAJOR: quic: Really ignore malformed ACK frames. - BUG/MINOR: quic: Unchecked pointer to packet number space dereferenced - BUG/MEDIUM: connection: fix pool free regression with recent ppv2 TLV patches - BUG/MEDIUM: h1-htx: Ensure chunked parsing with full output buffer - BUG/MINOR: stream: further protect stream_dump() against incomplete sessions - DOC: configuration: update examples for req.ver - MINOR: properly mark the end of the CLI command in error messages - BUILD: ssl: Build with new cryptographic library AWS-LC - REGTESTS: ssl: skip ssl_dh test with AWS-LC - BUILD: bug: make BUG_ON() void to avoid a rare warning - BUILD: checks: shut up yet another stupid gcc warning - MINOR: cpuset: add ha_cpuset_isset() to check for the presence of a CPU in a set - MINOR: cpuset: add ha_cpuset_or() to bitwise-OR two CPU sets - MINOR: cpuset: centralize a reliable bound cpu detection - MEDIUM: threads: detect incomplete CPU bindings - MEDIUM: threads: detect excessive thread counts vs cpu-map - BUILD: quic: Compilation issue on 32-bits systems with quic_may_send_bytes() - BUG/MINOR: quic: Unchecked pointer to Handshake packet number space - MINOR: global: export the display_version() symbol - MEDIUM: mworker: display a more accessible message when a worker crash - MINOR: httpclient: allow to configure the retries - MINOR: httpclient: allow to configure the timeout.connect - BUG/MINOR: quic: Wrong RTT adjusments - BUG/MINOR: quic: Wrong RTT computation (srtt and rrt_var) - BUG/MINOR: stconn: Don't inhibit shutdown on connection on error - BUG/MEDIUM: applet: Fix API for function to push new data in channels buffer - BUG/MEDIUM: stconn: Report read activity when a stream is attached to front SC - BUG/MEDIUM: applet: Report an error if applet request more room on aborted SC - BUG/MEDIUM: stconn/stream: Forward shutdown on write timeout - NUG/MEDIUM: stconn: Always update stream's expiration date after I/O - BUG/MINOR: applet: Always expect data when CLI is waiting for a new command - BUG/MINOR: ring/cli: Don't expect input data when showing events - BUG/MINOR: quic: Dereferenced unchecked pointer to Handshke packet number space - BUG/MINOR: hlua/action: incorrect message on E_YIELD error - MINOR: http_ana: position the FINAL flag for http_after_res execution - CI: scripts: add support to build-ssl.sh to download and build AWS-LC - CI: add support to matrix.py to determine the latest AWS-LC release - CI: Update matrix.py so all code is contained in functions. - CI: github: Add a weekly CI run building with AWS-LC - MINOR: ring: add a function to compute max ring payload - BUG/MEDIUM: ring: adjust maxlen consistency check - MINOR: sink: simplify post_sink_resolve function - MINOR: log/sink: detect when log maxlen exceeds sink size - MINOR: sink: inform the user when logs will be implicitly truncated - MEDIUM: sink: don't perform implicit truncations when maxlen is not set - MINOR: log: move log-forwarders cleanup in log.c - MEDIUM: httpclient/logs: rely on per-proxy post-check instead of global one - MINOR: log: add dup_logsrv() helper function - MEDIUM: log/sink: make logsrv postparsing more generic - MEDIUM: fcgi-app: properly postresolve logsrvs - MEDIUM: spoe-agent: properly postresolve log rings - MINOR: sink: add helper function to deallocate sink struct - MEDIUM: sink/ring: introduce high level ring creation helper function - MEDIUM: sink: add sink_finalize() function - CLEANUP: log: remove unnecessary trim in __do_send_log - MINOR: cache: Change hash function in default normalizer used in case of "vary" - MINOR: tasks/stats: report the number of niced tasks in "show info" - CI: Update to actions/checkout@v4 - MINOR: ssl: add support for 'curves' keyword on server lines - BUG/MINOR: quic: Wrong cluster secret initialization - CLEANUP: quic: Remove useless free_quic_tx_pkts() function. - MEDIUM: init: initialize the trash earlier - MINOR: tools: add function read_line_to_trash() to read a line of a file - MINOR: cfgparse: use read_line_from_trash() to read from /sys - MEDIUM: cfgparse: assign NUMA affinity to cpu-maps - MINOR: cpuset: dynamically allocate cpu_map - REORG: cpuset: move parse_cpu_set() and parse_cpumap() to cpuset.c - CI: musl: highlight section if there are coredumps - CI: musl: drop shopt in workflow invocation	2023-09-08 19:21:45 +02:00
Ilya Shipitsin	d89734bcc5	CI: musl: drop shopt in workflow invocation "shopt" is bash specific, while musl uses bourne shell. /__w/_temp/1b0f5f5d-c71b-4a66-8be3-e1fe51c10993.sh: line 7: shopt: not found	2023-09-08 19:05:04 +02:00
Ilya Shipitsin	8382d71808	CI: musl: highlight section if there are coredumps previously, section was collapsed, thus it was harder to find that there's something to look at	2023-09-08 19:05:04 +02:00
Willy Tarreau	4a18d9e560	REORG: cpuset: move parse_cpu_set() and parse_cpumap() to cpuset.c These ones were still in cfgparse.c but they're not specific to the config at all and may actually be used even when parsing cpu list entries in /sys. Better move them where they can be reused.	2023-09-08 16:25:19 +02:00
Willy Tarreau	5119109e3f	MINOR: cpuset: dynamically allocate cpu_map cpu_map is 8.2kB/entry and there's one such entry per group, that's ~520kB total. In addition, the init code is still in haproxy.c enclosed in ifdefs. Let's make this a dynamically allocated array in the cpuset code and remove that init code. Later we may even consider reallocating it once the number of threads and groups is known, in order to shrink it a little bit, as the typical setup with a single group will only need 8.2kB, thus saving half a MB of RAM. This would require that the upper bound is placed in a variable though.	2023-09-08 16:25:19 +02:00
Willy Tarreau	b0f20ed79b	MEDIUM: cfgparse: assign NUMA affinity to cpu-maps Do not force affinity on the process, instead let's just apply it to cpu-map, it will automatically be used later in the init process. We can do this because we know that cpu-map was not set when we're using this detection code. This is much saner, as we don't need to manipulate the process' affinity at this point in time, and just update the info that the user omitted to set by themselves, which guarantees a better long-term consistency with the documented feature.	2023-09-08 16:25:19 +02:00
Willy Tarreau	809a49da96	MINOR: cfgparse: use read_line_from_trash() to read from /sys It's easier to use this function now to natively support variable fields in the file's path. This also removes read_file_from_trash() that was only used here and was static.	2023-09-08 16:25:19 +02:00
Willy Tarreau	1f2433fb6a	MINOR: tools: add function read_line_to_trash() to read a line of a file This function takes on input a printf format for the file name, making it particularly suitable for /proc or /sys entries which take a lot of numbers. It also automatically trims the trailing CR and/or LF chars.	2023-09-08 16:25:19 +02:00
Willy Tarreau	5f10176e2c	MEDIUM: init: initialize the trash earlier More and more utility functions rely on the trash while most of the init code doesn't have access to it because it's initialized very late (in PRE_CHECK for the initial one). It's a pool, and it purposely supports being reallocated, so let's initialize it in STG_POOL so that early STG_INIT code can at least use it.	2023-09-08 16:25:19 +02:00
Frédéric Lécaille	e3e218b98e	CLEANUP: quic: Remove useless free_quic_tx_pkts() function. This function define but no more used since this commit: BUG/MAJOR: quic: Really ignore malformed ACK frames.	2023-09-08 10:17:25 +02:00
Frédéric Lécaille	292dfdd78d	BUG/MINOR: quic: Wrong cluster secret initialization The function generate_random_cluster_secret() which initializes the cluster secret when not supplied by configuration is buggy. There 1/256 that the cluster secret string is empty. To fix this, one stores the cluster as a reduced size first 128 bits of its own SHA1 (160 bits) digest, if defined by configuration. If this is not the case, it is initialized with a 128 bits random value. Furthermore, thus the cluster secret is always initialized. As the cluster secret is always initialized, there are several tests which are for now on useless. This patch removes such tests (if(global.cluster_secret)) in the QUIC code part and at parsing time: no need to check that a cluster secret was initialized with "quic-force-retry" option. Must be backported as far as 2.6.	2023-09-08 09:50:58 +02:00
William Lallemand	15e591b6e0	MINOR: ssl: add support for 'curves' keyword on server lines This patch implements the 'curves' keyword on server lines as well as the 'ssl-default-server-curves' keyword in the global section. It also add the keyword on the server line in the ssl_curves reg-test. These keywords allow the configuration of the curves list for a server.	2023-09-07 23:29:10 +02:00
Tim Duesterhus	5c923f1869	CI: Update to actions/checkout@v4 No functional change, but we should keep this current. see `5f4ddb54b0`	2023-09-06 17:46:02 +02:00
Willy Tarreau	28ff1a5d56	MINOR: tasks/stats: report the number of niced tasks in "show info" We currently know the number of tasks in the run queue that are niced, and we don't expose it. It's too bad because it can give a hint about what share of the load is relevant. For example if one runs a Lua script that was purposely reniced, or if a stats page or the CLI is hammered with slow operations, seeing them appear there can help identify what part of the load is not caused by the traffic, and improve monitoring systems or autoscalers.	2023-09-06 17:44:44 +02:00

1 2 3 4 5 ...

20750 Commits