haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-19 01:44:33 +00:00

Author	SHA1	Message	Date
Christopher Faulet	8073094bf1	NUG/MEDIUM: stconn: Always update stream's expiration date after I/O It is a revert of following patches: * `d7111e7ac` ("MEDIUM: stconn: Don't requeue the stream's task after I/O") * `3479d99d5` ("BUG/MEDIUM: stconn: Update stream expiration date on blocked sends") Because the first one is reverted, the second one is useless and can be reverted too. The issue here is that I/O may be performed without stream wakeup. So if no expiration date was set on the last call to process_stream(), the stream is never rescheduled and no timeout can be detected. This especially happens on TCP streams because fast-forward is enabled very early. Instead of tracking all places where the stream's expiration data must be updated, it is now centralized in sc_notify(), as it was performed before the timeout refactoring. This patch must be backported to 2.8.	2023-09-06 09:29:27 +02:00
Christopher Faulet	b9c87f8082	BUG/MEDIUM: stconn/stream: Forward shutdown on write timeout The commit `7f59d68fe2` ("BUG/MEDIIM: stconn: Flush output data before forwarding close to write side") introduced a regression. When a write timeout is detected, the shutdown is no longer forwarded. Dependig on the channels state, it may block the processing, waiting the client or the server leaves. The commit above tries to avoid to truncate messages on shutdown but on write timeout, if the channel is not empty, there is nothing more we can do to send these data. It means the endpoint is unable to send data. In this case, we must forward the shutdown. This patch should be backported as far as 2.2.	2023-09-06 09:29:27 +02:00
Christopher Faulet	d18657ae11	BUG/MEDIUM: applet: Report an error if applet request more room on aborted SC If an abort was performed and the applet still request more room, it means the applet has not properly handle the error on its own. At least the CLI applet is concerned. Instead of reviewing all applets, the error is now handled in task_run_applet() function. Because of this bug, a session may be blocked infinitly and may also lead to a wakup loop. This patch must only be backported to 2.8 for now. And only to lower versions if a bug is reported because it is a bit sensitive and the code older versions are very different.	2023-09-06 09:29:27 +02:00
Christopher Faulet	34645a6365	BUG/MEDIUM: stconn: Report read activity when a stream is attached to front SC It only concerns the front SC. But it is important to report a read activity when a stream is created and attached to the front SC, especially in TCP. In HTTP, when this happens, the request was necessarily received. But in TCP, the client may open a connection without sending anything. We must still report a first read activity in this case to be able to properly report client timeout. This patch must be backported to 2.8.	2023-09-06 09:29:27 +02:00
Christopher Faulet	3ec156f027	BUG/MEDIUM: applet: Fix API for function to push new data in channels buffer All applets only check the -1 error value (need room) for applet_put* functions while the underlying functions may also return -2 if the input is closed or -3 if the data length is invalid. It means applets already handle other cases by their own. The API should be fixed but for now, to ease backports, we only fix applet_put* functions to always return -1 on error. This way, at least for the applets point of view, the API is consistent. This patch should be backported to 2.8. Probably not further. Except if we suspect it could fix a bug.	2023-09-06 09:29:27 +02:00
Christopher Faulet	015fec6a29	BUG/MINOR: stconn: Don't inhibit shutdown on connection on error In the SC function responsible to perform shutdown, there is a statement inhibiting the shutdown if an error was encountered on the SC. This statement is inherited from very old version and should in fact be removed. The error may be set from the stream. In this case the shutdown must be performed. In all cases, it is not a big deal if the shutdown is performed twice because underlying functions already handle multiple calls. This patch does not fix any bug. Thus there is no reason to backport it.	2023-09-06 09:29:27 +02:00
Fr�d�ric L�caille	fb4294be55	BUG/MINOR: quic: Wrong RTT computation (srtt and rrt_var) Due to the fact that several variable values (rtt_var, srtt) were stored as multiple of their real values, some calculations were less accurate as expected. Stop storing 4rtt_var values, and 8srtt values. Adjust all the impacted statements. Must be backported as far as 2.6.	2023-09-05 17:14:51 +02:00
Fr�d�ric L�caille	cf768f7456	BUG/MINOR: quic: Wrong RTT adjusments There was a typo in the test statement to check if the rtt must be adjusted (>= incorectly replaced by >). Must be backported as far as 2.6.	2023-09-05 17:14:51 +02:00
William Lallemand	6bc00a97da	MINOR: httpclient: allow to configure the timeout.connect When using the httpclient, one could be bothered with it returning after a very long time when failing. By default the httpclient has a retries of 3 and a timeout connect of 5s, which can results in pause of 20s upon failure. This patch allows the user to configure the "timeout connect" of the httpclient so it could reduce the time to return an error. This patch helps fixing part of the issue #2269. Could be backported in 2.7 if needed.	2023-09-05 16:42:27 +02:00
William Lallemand	c52948bd2c	MINOR: httpclient: allow to configure the retries When using the httpclient, one could be bothered with it returning after a very long time when failing. By default the httpclient has a retries of 3 and a timeout connect of 5s, which can results in pause of 20s upon failure. This patch allows the user to configure the retries of the httpclient so it could reduce the time to return an error. This patch helps fixing part of the issue #2269. Could be backported in 2.7 if needed.	2023-09-05 15:55:04 +02:00
William Lallemand	fcb080d8f9	MEDIUM: mworker: display a more accessible message when a worker crash Should fix issue #1034. Display a more accessible message when a worker crash about what to do. Example: $ ./haproxy -W -f haproxy.cfg [NOTICE] (308877) : New worker (308884) forked [NOTICE] (308877) : Loading success. [NOTICE] (308877) : haproxy version is 2.9-dev4-d90d3b-58 [NOTICE] (308877) : path to executable is ./haproxy [ALERT] (308877) : Current worker (308884) exited with code 139 (Segmentation fault) [WARNING] (308877) : A worker process unexpectedly died and this can only be explained by a bug in haproxy or its dependencies. Please check that you are running an up to date and maintained version of haproxy and open a bug report. HAProxy version 2.9-dev4-d90d3b-58 2023/09/05 - https://haproxy.org/ Status: development branch - not safe for use in production. Known bugs: https://github.com/haproxy/haproxy/issues?q=is:issue+is:open Running on: Linux 6.2.0-31-generic #31-Ubuntu SMP PREEMPT_DYNAMIC Mon Aug 14 13:42:26 UTC 2023 x86_64 [ALERT] (308877) : exit-on-failure: killing every processes with SIGTERM [WARNING] (308877) : All workers exited. Exiting... (139)	2023-09-05 15:31:04 +02:00
William Lallemand	d90d3bf894	MINOR: global: export the display_version() symbol Export the display_version() function which can be used elsewhere than in haproxy.c	2023-09-05 15:24:39 +02:00
Fr�d�ric L�caille	aeb2f28ca7	BUG/MINOR: quic: Unchecked pointer to Handshake packet number space It is possible that there are still Initial crypto data in flight without Handshake crypto data in flight. This is very rare but possible. This issue was reported by handshakeloss interop test with quic-go as client and @chipitsine in GH #2279. No need to backport.	2023-09-05 11:38:33 +02:00
Frédéric Lécaille	3afe54ed5b	BUILD: quic: Compilation issue on 32-bits systems with quic_may_send_bytes() quic_may_send_bytes() implementation arrived with this commit: MINOR: quic: Amplification limit handling sanitization. It returns a size_t. So when compared with QUIC_MIN() with qc->path->mtu there is no need to cast this latted anymore because it is also a size_t. Detected when compiled with -m32 gcc option.	2023-09-05 10:33:56 +02:00
Willy Tarreau	86854dd032	MEDIUM: threads: detect excessive thread counts vs cpu-map This detects when there are more threads bound via cpu-map than CPUs enabled in cpu-map, or when there are more total threads than the total number of CPUs available at boot (for unbound threads) and configured for bound threads. In this case, a warning is emitted to explain the problems it will cause, and explaining how to address the situation. Note that some configurations will not be detected as faulty because the algorithmic complexity to resolve all arrangements grows in O(N!). This means that having 3 threads on 2 CPUs and one thread on 2 CPUs will not be detected as it's 4 threads for 4 CPUs. But at least configs such as T0:(1,4) T1:(1,4) T2:(2,4) T3:(3,4) will not trigger a warning since they're valid.	2023-09-04 19:39:17 +02:00
Willy Tarreau	8357f950cb	MEDIUM: threads: detect incomplete CPU bindings It's very easy to mess up with some cpu-map directives and to leave some thread unbound. Let's add a test that checks that either all threads are bound or none are bound, but that we do not face the intermediary situation where some are pinned and others are left wandering around, possibly on the same CPUs as bound ones. Note that this should not be backported, or maybe turned into a notice only, as it appears that it will easily catch invalid configs and that may break updates for some users.	2023-09-04 19:39:17 +02:00
Willy Tarreau	e65f54cf96	MINOR: cpuset: centralize a reliable bound cpu detection Till now the CPUs that were bound were only retrieved in thread_cpus_enabled() in order to count the number of CPUs allowed, and it relied on arch-specific code. Let's slightly arrange this into ha_cpuset_detect_bound() that reuses the ha_cpuset struct and the accompanying code. This makes the code much clearer without having to carry along some arch-specific stuff out of this area. Note that the macos-specific code used in thread.c to only count online CPUs but not retrieve a mask, so for now we can't infer anything from it and can't implement it. In addition and more importantly, this function is reliable in that it will only return a value when the detection is accurate, and will not return incomplete sets on operating systems where we don't have an exact list, such as online CPUs.	2023-09-04 19:39:17 +02:00
Willy Tarreau	d3ecc67a01	MINOR: cpuset: add ha_cpuset_or() to bitwise-OR two CPU sets This operation was not implemented and will be needed later.	2023-09-04 19:39:17 +02:00
Willy Tarreau	eb10567254	MINOR: cpuset: add ha_cpuset_isset() to check for the presence of a CPU in a set This function will be convenient to test for the presence of a given CPU in a set.	2023-09-04 19:39:17 +02:00
Willy Tarreau	fca3fc0d90	BUILD: checks: shut up yet another stupid gcc warning gcc has always had hallucinations regarding value ranges, and this one is interesting, and affects branches 4.7 to 11.3 at least. When building without threads, the randomly picked new_tid that is reduced to a multiply by 1 shifted right 32 bits, hence a constant output of 0 shows this warning: src/check.c: In function 'process_chk_conn': src/check.c:1150:32: warning: array subscript [-1, 0] is outside array bounds of 'struct thread_ctx[1]' [-Warray-bounds] In file included from include/haproxy/thread.h:28, from include/haproxy/list.h:26, from include/haproxy/action.h:28, from src/check.c:31: or this one when trying to force the test to see that it cannot be zero(!): src/check.c: In function 'process_chk_conn': src/check.c:1150:54: warning: array subscript [0, 0] is outside array bounds of 'struct thread_ctx[1]' [-Warray-bounds] 1150 \| uint t2_act = _HA_ATOMIC_LOAD(&ha_thread_ctx[thr2].active_checks); \| ~~~~~~~~~~~~~^~~~~~ include/haproxy/atomic.h:66:40: note: in definition of macro 'HA_ATOMIC_LOAD' 66 \| #define HA_ATOMIC_LOAD(val) *(val) \| ^~~ src/check.c:1150:24: note: in expansion of macro '_HA_ATOMIC_LOAD' 1150 \| uint t2_act = _HA_ATOMIC_LOAD(&ha_thread_ctx[thr2].active_checks); \| ^~~~~~~~~~~~~~~ Let's just add an ALREADY_CHECKED() statement there, no other check seems to get rid of it. No backport is needed.	2023-09-04 19:38:51 +02:00
Willy Tarreau	17a7baca07	BUILD: bug: make BUG_ON() void to avoid a rare warning When building without threads, the recently introduced BUG_ON(tid != 0) turns to a constant expression that evaluates to 0 and that is not used, resulting in this warning: src/connection.c: In function 'conn_free': src/connection.c:584:3: warning: statement with no effect [-Wunused-value] This is because the whole thing is declared as an expression for clarity. Make it return void to avoid this. No backport is needed.	2023-09-04 19:38:51 +02:00
Andrew Hopkins	88988bb06c	REGTESTS: ssl: skip ssl_dh test with AWS-LC skip ssl_dh test when HAProxy is built with AWS-LC which does not support FFDH ciphersuites.	2023-09-04 18:21:01 +02:00
Andrew Hopkins	b3f94f8b3b	BUILD: ssl: Build with new cryptographic library AWS-LC This adds a new option for the Makefile USE_OPENSSL_AWSLC, and update the documentation with instructions to use HAProxy with AWS-LC. Update the type of the OCSP callback retrieved with SSL_CTX_get_tlsext_status_cb with the actual type for libcrypto versions greater than 1.0.2. This doesn't affect OpenSSL which casts the callback to void* in SSL_CTX_ctrl.	2023-09-04 18:19:18 +02:00
Miroslav Zagorac	3cfc30416c	MINOR: properly mark the end of the CLI command in error messages In several places in the file src/ssl_ckch.c, in the message about the incorrect use of the CLI command, the end of that CLI command is not correctly marked with the sign ' .	2023-09-04 18:13:43 +02:00
William Lallemand	637306c86d	DOC: configuration: update examples for req.ver Update the documentation for the req.ver sample fetch. Could be backported as far as 2.6.	2023-09-04 18:12:58 +02:00
Willy Tarreau	8547f5cfa2	BUG/MINOR: stream: further protect stream_dump() against incomplete sessions As found by Coverity in issue #2273, the fix in commit `e64bccab2` ("BUG/MINOR: stream: protect stream_dump() against incomplete streams") was still not enough, as scf/scb are still dereferenced to dump their flags and states. This should be backported to 2.8.	2023-09-04 15:32:17 +02:00
Chris Staite	3939e39479	BUG/MEDIUM: h1-htx: Ensure chunked parsing with full output buffer A previous fix to ensure that there is sufficient space on the output buffer to place parsed data (#2053) introduced an issue that if the output buffer is filled on a chunk boundary no data is parsed but the congested flag is not set due to the state not being H1_MSG_DATA. The check to ensure that there is sufficient space in the output buffer is actually already performed in all downstream functions before it is used. This makes the early optimisation that avoids the state transition to H1_MSG_DATA needless. Therefore, in order to allow the chunk parser to continue in this edge case we can simply remove the early check. This ensures that the state can progress and set the congested flag correctly in the caller. This patch fixes #2262. The upstream change that caused this logic error was backported as far as 2.5, therefore it makes sense to backport this fix back that far also.	2023-09-04 12:15:36 +02:00
Willy Tarreau	135c66f6cb	BUG/MEDIUM: connection: fix pool free regression with recent ppv2 TLV patches In commit `fecc573da` ("MEDIUM: connection: Generic, list-based allocation and look-up of PPv2 TLVs") there was a tiny mistake, elements of length <= 128 are allocated from pool_pp_128 but only those of length < 128 are released to this pool, other ones go to pool_pp_256. Because of this, elements of size exactly 128 are allocated from 128 and released to 256. It can be reproduced a few times by running sample_fetches/tlvs.vtc 1000 times with -DDEBUG_DONT_SHARE_POOLS -DDEBUG_MEMORY_POOLS -DDEBUG_EXPR -DDEBUG_STRICT=2 -DDEBUG_POOL_INTEGRITY -DDEBUG_POOL_TRACING -DDEBUG_NO_POOLS. Not sure why it doesn't reproduce more often though. No backport is needed. This should address github issues #2275 and #2274.	2023-09-04 11:45:37 +02:00
Fr�d�ric L�caille	d52466726f	BUG/MINOR: quic: Unchecked pointer to packet number space dereferenced It is possible that there are still Initial crypto data in flight without Handshake crypto data in flight. This is very rare but possible. This issue was reported by long-rtt interop test with quic-go as client and @chipitsine in GH #2276. No need to backport.	2023-09-04 11:29:35 +02:00
Fr�d�ric L�caille	9077f20251	BUG/MAJOR: quic: Really ignore malformed ACK frames. If not correctly parsed, an ACK frame must be ignored without any more treatment. Before this patch an ACK frame could be partially correctly parsed, then some errors could be detected which leaded newly acknowledged packets to be released in a wrong way calling free_quic_tx_pkts() called by qc_parse_ack_frm(). But there is no reason to release such packets because of a malformed ACK frame. This patch modifies qc_parse_ack_frm(). The newly acknowledged TX packets is done in two steps. It first collects the newly acknowledged packet calling qc_newly_acked_pkts(). Then proceed the same way as before for the treatments of haproxy TX packets acknowledged by the peer. If the ACK frame could not be fully parsed, the newly ackowledged packets are replaced back from where they were detached: the tree of TX packets for their encryption level. Must be backported as far as 2.6.	2023-09-04 11:29:35 +02:00
Fr�d�ric L�caille	7dad52bdbd	MINOR: quic: Add a trace to quic_release_frm() Display the address of the frame to be released as soon as entering into quic_release_frm() whose job is obviously to released the memory allocated for the frame <frm> passed as parameter.	2023-09-04 11:29:35 +02:00
Fr�d�ric L�caille	3c90c1ce6b	BUG/MINOR: quic: Possible skipped RTT sampling There are very few chances this bug may occur. Furthermore the consequences are not dramatic: an RTT sampling may be ignored. I guess this may happen when the now_ms global value wraps. Do not rely on the time variable value a packet was sent to decide if it is a newly acknowledged packet but on its presence or not in the tx packet ebtree. Must be backported as far as 2.6.	2023-09-04 11:29:35 +02:00
Christopher Faulet	b50a471adb	BUG/MEDIUM: stconn: Don't block sends if there is a pending shutdown For the same reason than the previous patch, we must not block the sends when there is a pending shutdown. In other words, we must consider the sends are allowed when there is a pending shutdown. This patch must slowly be backported as far as 2.2. It should partially fix issue #2249.	2023-09-01 14:18:26 +02:00
Christopher Faulet	0b93ff8c87	BUG/MEDIUM: stconn: Wake applets on sending path if there is a pending shutdown An applet is not woken up on sending path if it is not waiting for data or if it states it will not consume data. However, it is important to still wake it up if there is a pending shutdown. Otherwise, the event may be missed and some data may remain blocked in the channel's buffer. Because of this bug, it is possible to have a stream stuck if data are also blocked on the opposite channel. It is for instance possible to hit the buf with the stats applet and a client not consuming data. This patch must slowly be backported as far as 2.2. It should partially fix issue #2249.	2023-09-01 14:18:26 +02:00
Christopher Faulet	9e394d34e0	BUG/MINOR: stconn: Don't report blocked sends during connection establishment The server timeout must not be handled during the connection establishment to not superseed the connect timeout. To do so, we must not consider outgoing data are blocked during this stage. Concretly, it means the fsb time must not be updated during connection establishment. It is not an issue with regular clients because the server timeout is only defined when the connection is estalished. However, it may be an issue for the HTTP client, when the server timeout is lower than the connect timeout. In this case, an early 502 may be reported with no connection retries. This patch must be backported to 2.8.	2023-09-01 14:18:26 +02:00
Christopher Faulet	3479d99d5f	BUG/MEDIUM: stconn: Update stream expiration date on blocked sends When outgoing data are blocked, we must update the stream expiration date and requeue the task. It is important to be sure to properly handle write timeout, expecially if the stream cannot expire on reads. This bug was introduced when handling of channel's timeouts was refactored to be managed by the stream-connectors. It is an issue if there is no server timeout and the client does not consume the response (or the opposite but it is less common). It is also possible to trigger the same scenario with applets on server side because, most of time, there is no server timeout. This patch must be backported to 2.8.	2023-09-01 14:18:26 +02:00
Christopher Faulet	49ed83e948	DEBUG: applet: Properly report opposite SC expiration dates in traces The wrong label was used in trace to report expiration dates of the opposite SC. "sc" was used instead of "sco". This patch should be backported to 2.8.	2023-09-01 14:18:26 +02:00
Willy Tarreau	b0031d9679	MINOR: checks: also consider the thread's queue for rebalancing Let's also check for other threads when the current one is queueing, let's not wait for the load to be high. Now this totally eliminates differences between threads.	2023-09-01 14:00:04 +02:00
Willy Tarreau	844a3bc25b	MEDIUM: checks: implement a queue in order to limit concurrent checks The progressive adoption of OpenSSL 3 and its abysmal handshake performance has started to reveal situations where it simply isn't possible anymore to succesfully run health checks on many servers, because between the moment all the checks are started and the moment the handshake finally completes, the timeout has expired! This also has consequences on production traffic which gets significantly delayed as well, all that for lots of checks. While it's possible to increase the check delays, it doesn't solve everything as checks still take a huge amount of time to converge in such conditions. Here we take a different approach by permitting to enforce the maximum concurrent checks per thread limitation and implementing an ordered queue. Thanks to this, if a thread about to start a check has reached its limit, it will add the check at the end of a queue and it will be processed once another check is finished. This proves to be extremely efficient, with all checks completing in a reasonable amount of time and not being disturbed by the rest of the traffic from other checks. They're just cycling slower, but at the speed the machine can handle. One must understand however that if some complex checks perform multiple exchanges, they will take a check slot for all the required duration. This is why the limit is not enforced by default. Tests on SSL show that a limit of 5-50 checks per thread on local servers gives excellent results already, so that could be a good starting point.	2023-09-01 14:00:04 +02:00
Willy Tarreau	cfc0bceeb5	MEDIUM: checks: search more aggressively for another thread on overload When the current check is overloaded (more running checks than the configured limit), we'll try more aggressively to find another thread. Instead of just opportunistically looking for one half as loaded, now if the current thread has more than 1% more active checks than another one, or has more than a configured limit of concurrent running checks, it will search for a more suitable thread among 3 other random ones in order to migrate the check there. The number of migrations remains very low (~1%) and the checks load very fair across all threads (~1% as well). The new parameter is called tune.max-checks-per-thread.	2023-09-01 08:26:06 +02:00
Willy Tarreau	016e189ea3	MINOR: check: also consider the random other thread's active checks When checking if it's worth transferring a sleeping thread to another random thread, let's also check if that random other thread has less checks than the current one, which is another reason for transferring the load there. This commit adds a function "check_thread_cmp_load()" to compare two threads' loads in order to simplify the decision taking. The minimum active check count before starting to consider rebalancing the load was now raised from 2 to 3, because tests show that at 15k concurrent checks, at 2, 50% are evaluated for rebalancing and 30% are rebalanced, while at 3, this is cut in half.	2023-09-01 08:26:06 +02:00
Willy Tarreau	00de9e0804	MINOR: checks: maintain counters of active checks per thread Let's keep two check counters per thread: - one for "active" checks, i.e. checks that are no more sleeping and are assigned to the thread. These include sleeping and running checks ; - one for "running" checks, i.e. those which are currently executing on the thread. By doing so, we'll be able to spread the health checks load a bit better and refrain from sending too many at once per thread. The counters are atomic since a migration increments the target thread's active counter. These numbers are reported in "show activity", which allows to check per thread and globally how many checks are currently pending and running on the system. Ideally, we should only consider checks in the process of establishing a connection since that's really the expensive part (particularly with OpenSSL 3.0). But the inner layers are really not suitable to doing this. However knowing the number of active checks is already a good enough hint.	2023-09-01 08:26:06 +02:00
Willy Tarreau	3b7942a1c9	MINOR: check/activity: collect some per-thread check activity stats We now count the number of times a check was started on each thread and the number of times a check was adopted. This helps understand better what is observed regarding checks.	2023-09-01 08:26:06 +02:00
Willy Tarreau	e03d05c6ce	MINOR: check: remember when we migrate a check The goal here is to explicitly mark that a check was migrated so that we don't do it again. This will allow us to perform other actions on the target thread while still knowing that we don't want to be migrated again. The new READY bit combine with SLEEPING to form 4 possible states: SLP RDY State Description 0 0 - (reserved) 0 1 RUNNING Check is bound to current thread and running 1 0 SLEEPING Check is sleeping, not bound to a thread 1 1 MIGRATING Check is migrating to another thread Thus we set READY upon migration, and check for it before migrating, this is sufficient to prevent a second migration. To make things a bit clearer, the SLEEPING bit was switched with FASTINTER so that SLEEPING and READY are adjacent.	2023-09-01 08:26:06 +02:00
Willy Tarreau	3544c9f8a0	MINOR: checks: pin the check to its thread upon wakeup When a check leaves the sleeping state, we must pin it to the thread that is processing it. It's normally always the case after the first execution, but initial checks that start assigned to any thread (-1) could be assigned much later, causing problems with planned changes involving queuing. Thus better do it early, so that all threads start properly pinned.	2023-09-01 08:26:06 +02:00
Willy Tarreau	7163f95b43	MINOR: checks: start the checks in sleeping state The CHK_ST_SLEEPING state was introduced by commit `d114f4a68` ("MEDIUM: checks: spread the checks load over random threads") to indicate that a check was not currently bound to a thread and that it could easily be migrated to any other thread. However it did not start the checks in this state, meaning that they were not redispatchable on startup. Sometimes under heavy load (e.g. when using SSL checks with OpenSSL 3.0) the cost of setting up new connections is so high that some threads may experience connection timeouts on startup. In this case it's better if they can transfer their excess load to other idle threads. By just marking the check as sleeping upon startup, we can do this and significantly reduce the number of failed initial checks.	2023-09-01 08:26:06 +02:00
Willy Tarreau	48442b8b15	BUG/MINOR: checks: do not queue/wake a bounced check A small issue was introduced with commit `d114f4a68` ("MEDIUM: checks: spread the checks load over random threads"): when a check is bounced to another thread, its expiration time is set to TICK_ETERNITY. This makes it show as not expired upon first wakeup on the next thread, thus being detected as "woke up too early" and being instantly rescheduled. Only this after this next wakeup it will be properly considered. Several approaches were attempted to fix this. The best one seems to consist in resetting t->expire and expired upon wakeup, and changing the !expired test for !tick_is_expired() so that we don't trigger on this case. This needs to be backported to 2.7.	2023-09-01 08:26:06 +02:00
Willy Tarreau	338431ecb6	MINOR: activity: report the current run queue size While troubleshooting the causes of load spikes, it appeared that the length of individual run queues was missing, let's add it to "show activity".	2023-09-01 08:26:06 +02:00
Willy Tarreau	2cb896c4b0	MEDIUM: server/ssl: pick another thread's session when we have none yet The per-thread SSL context in servers causes a burst of connection renegotiations on startup, both for the forwarded traffic and for the health checks. Health checks have been seen to continue to cause SSL rekeying for several minutes after a restart on large thread-count machines. The reason is that the context is exlusively per-thread and that the more threads there are, the more likely it is for a new connection to start on a thread that doesn't have such a context yet. In order to improve this situation, this commit ensures that a thread starting an SSL connection to a server without a session will first look at the last session that was updated by another thread, and will try to use it. In order to minimize the contention, we're using a read lock here to protect the data, and the first-level index is an integer containing the thread number, that is always valid and may always be dereferenced. This way the session retrieval algorithm becomes quite simple: - if the last thread index is valid, then try to use the same session under a read lock ; - if any error happens, then atomically nuke the index so that other threads don't use it and the next one to update a connection updates it again And for the ssl_sess_new_srv_cb(), we have this: - update the entry under a write lock if the new session is valid, otherwise kill it if the session is not valid; - atomically update the index if it was 0 and the new one is valid, otherwise atomically nuke it if the session failed. Note that even if only the pointer is destroyed, the element will be re-allocated by the next thread during the sess_new_srv_sb(). Right now a session is picked even if the SNI doesn't match, because we don't know the SNI yet during ssl_sock_init(), but that's essentially a matter of API, since connect_server() figures the SNI very early, then calls conn_prepare() which calls ssl_sock_init(). Thus in the future we could easily imaging storing a number of SNI-based contexts instead of storing contexts per thread. It could be worth backporting this to one LTS version after some observation, though this is not strictly necessary. the current commit depends on the following ones: BUG/MINOR: ssl_sock: fix possible memory leak on OOM MINOR: ssl_sock: avoid iterating realloc(+1) on stored context DOC: ssl: add some comments about the non-obvious session allocation stuff CLEANUP: ssl: keep a pointer to the server in ssl_sock_init() MEDIUM: ssl_sock: always use the SSL's server name, not the one from the tid MEDIUM: server/ssl: place an rwlock in the per-thread ssl server session MINOR: server/ssl: maintain an index of the last known valid SSL session MINOR: server/ssl: clear the shared good session index on failure MEDIUM: server/ssl: pick another thread's session when we have none yet	2023-08-31 09:27:14 +02:00
Willy Tarreau	777f62cfb7	MINOR: server/ssl: clear the shared good session index on failure If we fail to set the session using SSL_set_session(), we want to quickly erase our index from the shared one so that any other thread with a valid session replaces it.	2023-08-31 08:50:01 +02:00

1 2 3 4 5 ...

20674 Commits