haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-05-16 22:58:02 +00:00

Author	SHA1	Message	Date
Willy Tarreau	576e47fb9a	BUG/MEDIUM: stick-table: always remove update before adding a new one Since commit `388539faa` ("MEDIUM: stick-tables: defer adding updates to a tasklet"), between the entry creation and its arrival in the updates tree, there is time for scheduling, and it now becomes possible for an stksess entry to be requeued into the list while it's still in the tree as a remote one. Only local updates were removed prior to being inserted. In this case we would re-insert the entry, causing it to appear as the parent of two distinct nodes or leaves, and to be visited from the first leaf during a delete() after having already been removed and freed, causing a crash, as Christian reported in issue #2959. There's no reason to backport this as this appeared with the commit above in 3.2-dev13.	2025-05-08 23:32:25 +02:00
Aurelien DARRAGON	f03e999912	MINOR: server: ensure server postparse tasks are run for dynamic servers commit `29b76cae4` ("BUG/MEDIUM: server/log: "mode log" after server keyword causes crash") introduced some postparsing checks/tasks for server Initially they were mainly meant for "mode log" servers postparsing, but we already have a check dedicated to "tcp/http" servers (ie: only tcp proto supported) However when dynamic servers are added they bypass _srv_postparse() since the REGISTER_POST_SERVER_CHECK() is only executed for servers defined in the configuration. To ensure consistency between dynamic and static servers, and ensure no post-check init routine is missed, let's manually invoke _srv_postparse() after creating a dynamic server added via the cli.	2025-05-08 02:03:50 +02:00
Aurelien DARRAGON	976e0bd32f	BUG/MINOR: cli: fix too many args detection for commands `d3f928944` ("BUG/MINOR: cli: Issue an error when too many args are passed for a command") added a new check to prevent the command to run when too many arguments are provided. In this case an error is reported. However it turns out this check (despite marked for backports) was ineffective prior to `20ec1de21` ("MAJOR: cli: Refacor parsing and execution of pipelined commands") as 'p' pointer was reset to the end of the buffer before the check was executed. Now since `20ec1de21`, the check works, but we have another issue: we may read past initialized bytes in the buffer because 'p' pointer is always incremented in a while loop without checking if we increment it past 'end' (This was detected using valgrind) To fix the issue introduced by `20ec1de21`, let's only increment 'p' pointer if p < end. For 3.2 this is it, now for older versions, since `d3f928944` was marked for backport, a sligthly different approach is needed: - conditional p increment must be done in the loop (as in this patch) - max arg check must moved above "fill unused slots" comment where p is assigned to the end of the buffer This patch should be backported with `d3f928944`.	2025-05-08 02:03:43 +02:00
Willy Tarreau	0cee7b5b8d	BUG/MEDIUM: stick-tables: close a tiny race in __stksess_kill() It might be possible not to see the element in the tree, then not to see it in the update list, thus not to take the lock before deleting. But an element in the list could have moved to the tree during the check, and be removed later without the updt_lock. Let's delete prior to checking the presence in the tree to avoid this situation. No backport is needed since this arrived in -dev13 with the update list.	2025-05-07 18:49:21 +02:00
Willy Tarreau	006a3acbde	BUG/MEDIUM: peers: hold the refcnt until updating ts->seen In peer_treat_updatemsg(), we call stktable_touch_remote() after releasing the write lock on the TS, asking it to decrement the refcnt, then we update ts->seen. Unfortunately this is racy and causes the issue that Christian reported in issue #2959. The sequence of events is very hard to trigger manually, but what happens is the following: T1. stktable_touch_remote(table, ts, 1); -> at this point the entry is in the mt_list, and the refcnt is zero. T2. stktable_trash_oldest() or process_table_expire() -> these can run, because the refcnt is now zero. The entry is cleanly deleted and freed. T1. HA_ATOMIC_STORE(&ts->seen, 1) -> we dereference freed memory. A first attempt at a fix was made by keeping the refcnt held during all the time the entry is in the mt_list, but this is expensive as such entries cannot be purged, causing lots of skips during trash_oldest_data(). This managed to trigger watchdogs, and was only hiding the real cause of the problem. The correct approach clearly is to maintain the ref_cnt until we touch ->seen. That's what this patch does. It does not decrement the refcnt, while calling stktable_touch_remote(), and does it manually after touching ->seen. With this the problem is gone. Note that a reproducer involves the following: - a config with 10 stick-ctr tracking the same table with a random key between 10M and 100M depending on the machine. - the expiration should be between 10 and 20s. http_req_cnt is stored and shared with the peers. - 4 total processes with such a config on the local machine, each corresponding to a different peer. 3 of the peers are bound to half of the cores (all threads) and share the same threads; the last process is bound to the other half with its own threads. - injecting at full load, ~256 conn, on the shared listening port. After ~2x expiration time to 1 minute the lone process should segfault in pools code due to a corrupted by_lru list. This problem already exists in earlier versions but the race looks narrower. Given how difficult it is to trigger on a given machine in its current form, it's likely that it only happens once in a while on stable branches. The fix must be backported wherever the code is similar, and there's no hope to reproduce it to validate the backport. Thanks again to Christian for his amazing help!	2025-05-07 18:49:21 +02:00
Amaury Denoyelle	4bc7aa548a	BUG/MINOR: quic: reject invalid max_udp_payload size Add a checks on received max_udp_payload transport parameters. As defined per RFC 9000, values below 1200 are invalid, and thus the connection must be closed with TRANSPORT_PARAMETER_ERROR code. Prior to this patch, an invalid value was silently ignored. This should be backported up to 2.6. Note that is relies on previous patch "MINOR: quic: extend return value on TP parsing".	2025-05-07 15:21:30 +02:00
Amaury Denoyelle	ffabfb0fc3	BUG/MINOR: quic: fix TP reject on invalid max-ack-delay Checks are implemented on some received transport parameter values, to reject invalid ones defined per RFC 9000. This is the case for max_ack_delay parameter. The check was not properly implemented as it only reject values strictly greater than the limit set to 2^14. Fix this by rejecting values of 2^14 and above. Also, the proper error code TRANSPORT_PARAMETER_ERROR is now set. This should be backported up to 2.6. Note that is relies on previous patch "MINOR: quic: extend return value on TP parsing".	2025-05-07 15:21:30 +02:00
Amaury Denoyelle	b60a17aad7	BUG/MINOR: quic: use proper error code on invalid received TP value As per RFC 9000, checks must be implemented to reject invalid values for received transport parameters. Such values are dependent on the parameter type. Checks were already implemented for ack_delay_exponent and active_connection_id_limit, accordingly with the QUIC specification. However, the connection was closed with an incorrect error code. Fix this to ensure that TRANSPORT_PARAMETER_ERROR code is used as expected. This should be backported up to 2.6. Note that is relies on previous patch "MINOR: quic: extend return value on TP parsing".	2025-05-07 15:21:30 +02:00
Amaury Denoyelle	10f1f1adce	BUG/MINOR: quic: reject retry_source_cid TP on server side Close the connection on error if retry_source_connection_id transport parameter is received. This is specified by RFC 9000 as this parameter must not be emitted by a client. Previously, it was silently ignored. This should be backported up to 2.6. Note that is relies on previous patch "MINOR: quic: extend return value on TP parsing".	2025-05-07 15:21:30 +02:00
Amaury Denoyelle	a54fdd3d92	BUG/MINOR: quic: use proper error code on invalid server TP This commit is similar to the previous one. It fixes the error code reported when dealing with invalid received transport parameters. This time, it handles reception of original_destination_connection_id, preferred_address and stateless_reset_token which must not be emitted by the client. This should be backported up to 2.6. Note that is relies on previous patch "MINOR: quic: extend return value on TP parsing".	2025-05-07 15:20:06 +02:00
Amaury Denoyelle	df6bd4909e	BUG/MINOR: quic: use proper error code on missing CID in TPs Handle missing received transport parameter value initial_source_connection_id / original_destination_connection_id. Previously, such case would result in an error reported via quic_transport_params_store(), which triggers a TLS alert converted as expected as a CONNECTION_CLOSE. The issue is that the error code reported in the frame was incorrect. Fix this by returning QUIC_TP_DEC_ERR_INVAL for such conditions. This is directly handled via quic_transport_params_store() which set the proper TRANSPORT_PARAMETER_ERROR code for the CONNECTION_CLOSE. However, no error is reported so the SSL handshake is properly terminated without a TLS alert. This is enough to ensure that the CONNECTION_CLOSE frame will be emitted as expected. This should be backported up to 2.6. Note that is relies on previous patch "MINOR: quic: extend return value on TP parsing".	2025-05-07 15:20:06 +02:00
Amaury Denoyelle	294bf26c06	MINOR: quic: extend return value during TP parsing Extend API used for QUIC transport parameter decoding. This is done via the introduction of a dedicated enum to report the various error condition detected. No functional change should occur with this patch, as the only returned code is QUIC_TP_DEC_ERR_TRUNC, which results in the connection closure via a TLS alert. This patch will be necessary to properly reject transport parameters with the proper CONNECTION_CLOSE error code. As such, it should be backported up to 2.6 with the following series.	2025-05-07 15:19:52 +02:00
Willy Tarreau	46b5dcad99	MINOR: stick-tables: add "ipv4" as an alias for the "ip" type However the doc purposely says the opposite, to encourage migrating away from "ip". The goal is that in the future we change "ip" to mean "ipv6", which seems to be what most users naturally expect. But we cannot break configurations in the LTS version so for now "ipv4" is the alias. The reason for not changing it in the table is that the type name is used at a few places (look for "].kw"): - dumps - promex We'd rather not change that output for 3.2, but only do it in 3.3. This way, 3.2 can be made future-proof by using "ipv4" in the config without any other side effect. Please see github issue #2962 for updates on this transition.	2025-05-07 10:11:55 +02:00
Willy Tarreau	697a531516	MINOR: debug: bump the dump buffer to 8kB Now with the improved backtraces, the lock history and details in the mux layers, some dumps appear truncated or with some chars alone at the beginning of the line. The issue is in fact caused by the limited dump buffer size (2kB for stderr, 4kB for warning), that cannot hold a complete trace anymore. Let's jump bump them to 8kB, this will be plenty for a long time.	2025-05-07 10:02:58 +02:00
Willy Tarreau	10e6d0bd57	BUG/MINOR: tools: only fill first empty arg when not out of range In commit `3f2c8af313` ("MINOR: tools: make parse_line() provide hints about empty args") we've added the ability to record the position of the first empty arg in parse_line(), but that check requires to access the args[] array for the current arg, which is not valid in case we stopped on too large an argument count. Let's just check the arg's validity before doing so. This was reported by OSS Fuzz: https://issues.oss-fuzz.com/issues/415850462 No backport is needed since this was in the latest dev branch.	2025-05-07 07:25:29 +02:00
William Lallemand	fbceabbccf	BUG/MINOR: ssl: can't use crt-store some certificates in ssl-f-use When declaring a certificate via the crt-store section, this certificate can then be used 2 ways in a crt-list: - only by using its name, without any crt-store options - or by using the exact set of crt-list option that was defined in the crt-store Since ssl-f-use is generating a crt-list, this is suppose to behave the same. To achieve this, ckch_conf_parse() will parse the keywords related to the ckch_conf on the ssl-f-use line and use ckch_conf_cmp() to compare it to the previous declaration from the crt-store. This comparaison is only done when any ckch_conf keyword are present. However, ckch_conf_parse() was done for the crt-list, and the crt-list does not use the "crt" parameter to declare the name of the certificate, since it's the first element of the line. So when used with ssl-f-use, ckch_conf_parse() will always see a "crt" keyword which is a ckch_conf one, and consider that it will always need to have the exact same set of paremeters when using the same crt in a crt-store and an ssl-f-use line. So a simple configuration like this: crt-store web load crt "foo.com.crt" key "foo.com.key" alias "foo" frontend mysite bind :443 ssl ssl-f-use crt "@web/foo" ssl-min-ver TLSv1.2 Would lead to an error like this: config : '@web/foo' in crt-list '(null)' line 0, is already defined with incompatible parameters: - different parameter 'key' : previously 'foo.com.key' vs '(null)' In order to fix the issue, this patch parses the "crt" parameter itself for ssl-f-use instead of using ckch_conf_parse(), so the keyword would never be considered as a ckch_conf keyword to compare. This patch also take care of setting the CKCH_CONF_SET_CRTLIST flag only if a ckch_conf keyword was found. This flag is used by ckch_conf_cmp() to know if it has to compare or not. No backport needed.	2025-05-06 21:36:29 +02:00
William Lallemand	b3b282d2ee	MINOR: ssl: add filename and linenum for ssl-f-use errors Fill cfg_crt_node with a filename and linenum so the post_section callback can use it to emit errors. This way the errors are emitted with the right filename and linenum where ssl-f-use is used instead of (null):0	2025-05-06 21:36:29 +02:00
Willy Tarreau	99f5be5631	BUG/MAJOR: queue: lock around the call to pendconn_process_next_strm() The extra call to pendconn_process_next_strm() made in commit `cda7275ef5` ("MEDIUM: queue: Handle the race condition between queue and dequeue differently") was performed after releasing the server queue's lock, which is incompatible with the calling convention for this function. The result is random corruption of the server's streams list likely due to picking old or incorrect pendconns from the queue, and in the end infinitely looping on apparently already locked mt_list objects. Just adding the lock fixes the problem. It's very difficult to reproduce, it requires low maxconn values on servers, stickiness on the servers (cookie), a long enough slowstart (e.g. 10s), and regularly flipping servers up/down to re-trigger the slowstart. No backport is needed as this was only in 3.2.	2025-05-06 18:59:54 +02:00
William Lallemand	e035f0c48e	DOC: configuration: add the "crt-store" keyword Add the "crt-store" keyword with its argument in the "3.12" section, so this could be detected by haproxy-dconv has a keyword and put in the keywords list. Must be backported as far as 3.0	2025-05-06 16:07:29 +02:00
William Lallemand	e516b14d36	DOC: configuration: add "acme" section to the keywords list Add the "acme" keyword with its argument in the "3.13" section, so this could be detected by haproxy-dconv has a keyword and put in the keywords list.	2025-05-06 15:34:39 +02:00
William Lallemand	b7c4a68ecf	MEDIUM: acme/ssl: remove 'acme ps' in favor of 'acme status' Remove the 'acme ps' command which does not seem useful anymore with the 'acme status' command. The big difference with the 'acme status' command is that it was only displaying the running tasks instead of the status of all certificate.	2025-05-06 15:27:29 +02:00
William Lallemand	48f1ce77b7	MINOR: acme/cli: 'acme status' show the status acme-configured certificates The "acme status" command, shows the status of every certificates configured with ACME, not only the running task like "acme ps". The IO handler loops on the ckch_store tree and outputs a line for each ckch_store which has an acme section set. This is still done under the ckch_store lock and doesn't support resuming when the buffer is full, but we need to change that in the future.	2025-05-06 15:27:29 +02:00
Christopher Faulet	a3ce7d7772	Revert "BUG/MEDIUM: mux-spop: Handle CLOSING state and wait for AGENT DISCONNECT frame" This reverts commit `53c3046898`. This patch introduced a regression leading to a loop on the frames demultiplexing because a frame may be ignore but not consumed. But outside this regression that can be fixed, there is a design issue that was not totally fixed by the patch above. The SPOP connection state is mixed with the status of the frames demultiplexer and this needlessly complexify the connection management. Instead of fixing the fix, a better solution is to revert it to work a a proper solution. For the record, the idea is to deal with the spop connection state onlu using 'state' field and to introduce a new field to handle the frames demultiplexer state. This should ease the closing state management. Another issue that must be fixed. We must take care to not abort a SPOP stream when an error is detected on a SPOP connection or when the connection is closed, if the ACK frame was already received for this stream. It is not a common case, but it can be solved by saving the last known stream ID that recieved a ACK. This patch must be backported if the commit above is backported.	2025-05-06 13:43:59 +02:00
Aurelien DARRAGON	b39825ee45	BUG/MINOR: proxy: only use proxy_inc_fe_cum_sess_ver_ctr() with frontends proxy_inc_fe_cum_sess_ver_ctr() was implemented in `9969adbc` ("MINOR: stats: add by HTTP version cumulated number of sessions and requests") As its name suggests, it is meant to be called for frontends, not backends Also, in `9969adbc`, when used under h1_init(), a precaution is taken to ensure that the function is only called with frontends. However, this precaution was not applied in h2_init() and qc_init(). Due to this, it remains possible to have proxy_inc_fe_cum_sess_ver_ctr() being called with a backend proxy as parameter. While it did not cause known issues so far, it is not expected and could result in bugs in the future. Better fix this by ensuring the function is only called with frontends. It may be backported up to 2.8	2025-05-06 11:01:39 +02:00
Willy Tarreau	3bb6eea6d5	DEBUG: threads: display held locks in threads dumps Based on the lock history, we can spot some locks that are still held by checking the last operation that happened on them: if it's not an unlock, then we know the lock is held. In this case we append the list after "locked:" with their label and state like below: U:QUEUE S:IDLE_CONNS U:IDLE_CONNS R:TASK_WQ U:TASK_WQ S:QUEUE S:QUEUE S:QUEUE locked: QUEUE(S) S:IDLE_CONNS U:IDLE_CONNS S:TASK_RQ U:TASK_RQ S:QUEUE U:QUEUE S:IDLE_CONNS locked: IDLE_CONNS(S) R:TASK_WQ S:TASK_WQ R:TASK_WQ S:TASK_WQ R:TASK_WQ S:TASK_WQ R:TASK_WQ locked: TASK_WQ(R) W:STK_TABLE W:STK_TABLE_UPDT U:STK_TABLE_UPDT W:STK_TABLE W:STK_TABLE_UPDT U:STK_TABLE_UPDT W:STK_TABLE W:STK_TABLE_UPDT locked: STK_TABLE(W) STK_TABLE_UPDT(W) The format is slightly different (label(status)) so as to easily differentiate them visually from the history.	2025-05-06 05:20:37 +02:00
Willy Tarreau	feaac66b5e	DEBUG: threads: merge successive idempotent lock operations in history In order to make the lock history a bit more useful, let's try to merge adjacent lock/unlock sequences that don't change anything for other threads. For this we can replace the last unlock with the new operation on the same label, and even just not store it if it was the same as the one before the unlock, since in the end it's the same as if the unlock had not been done. Now loops that used to be filled with "R:LISTENER U:LISTENER" show more useful info such as: S:IDLE_CONNS U:IDLE_CONNS S:PEER U:PEER S:IDLE_CONNS U:IDLE_CONNS R:LISTENER U:LISTENER U:STK_TABLE W:STK_SESS U:STK_SESS R:STK_TABLE U:STK_TABLE W:STK_SESS U:STK_SESS R:STK_TABLE R:STK_TABLE U:STK_TABLE W:STK_SESS U:STK_SESS W:STK_TABLE_UPDT U:STK_TABLE_UPDT S:PEER It's worth noting that it can sometimes induce confusion when recursive locks of the same label are used (a few exist on peers or stick-tables), as in such a case the two operations would be needed. However these ones are already undebuggable, so instead they will just have to be renamed to make sure they use a distinct label.	2025-05-05 18:36:12 +02:00
Willy Tarreau	743dce95d2	DEBUG: threads: don't keep lock label "OTHER" in the per-thread history Most threads are filled with "R:OTHER U:OTHER" in their history. Since anything non-important can use other it's not observable but it pollutes the history. Let's just drop OTHER entirely during the recording.	2025-05-05 18:10:57 +02:00
Willy Tarreau	1f51f1c816	BUG/MINOR: tools: make parseline report the required space for the trailing 0 The fix in commit `09a325a4de` ("BUG/MINOR: tools: always terminate empty lines") is insufficient. While it properly addresses the lack of trailing zero, it doesn't account for it in the returned outlen that is used to allocate a larger line. This happens at boot if the very first line of the test file is exactly a sharp with nothing else. In this case it will return a length 0 and the caller (parse_cfg()) will try to re-allocate an entry of size zero and will fail, bailing out a lack of memory. This time it should really be OK. It doesn't need to be backported, unless the patch above would be.	2025-05-05 17:58:04 +02:00
Willy Tarreau	09a325a4de	BUG/MINOR: tools: always terminate empty lines Since latest commit `7e4a2f39ef` ("BUG/MINOR: tools: do not create an empty arg from trailing spaces"), an empty line will no longer produce an arg and no longer append a trailing zero to them. This was not visible because one is already present in the input string, however all the trailing args are set to out+outpos-1, which now points one char before the buffer since nothing was emitted, and was noticed by ASAN, and/or when parsing garbage. Let's make sure to always emit the zero for empty lines as well to address this issue. No backport is needed unless the patch above gets backported.	2025-05-05 17:33:22 +02:00
Willy Tarreau	08d3caf30e	MINOR: cfgparse: visually show the input line on empty args Now when an empty arg is found on a line, we emit the sanitized input line and the position of the first empty arg so as to help the user figure the cause (likely an empty environment variable). Co-authored-by: Valentine Krasnobaeva <vkrasnobaeva@haproxy.com>	2025-05-05 16:17:24 +02:00
Willy Tarreau	3f2c8af313	MINOR: tools: make parse_line() provide hints about empty args In order to help parse_line() callers report the position of empty args to the user, let's decide that if no error is emitted, then we'll stuff the errptr with the position of the first empty arg without affecting the return value. Co-authored-by: Valentine Krasnobaeva <vkrasnobaeva@haproxy.com>	2025-05-05 16:17:24 +02:00
Willy Tarreau	9d14f2c764	MEDIUM: config: warn about the consequences of empty arguments on a config line For historical reasons, the config parser relies on the trailing '\0' to detect the end of the line being parsed. When the lines started to be tokenized into arguments, this principle has been preserved, and now all the parsers rely on *args[arg]='\0' to detect the end of a line. But as reported in issue #2944, while most of the time it breaks the parsing like below: http-request deny if { path_dir '' } it can also cause some elements to be silently ignored like below: acl bad_path path_sub '%2E' '' '%2F' This may also subtly happen with environment variables that don't exist or which are empty: acl bad_path path_sub '%2E' "$BAD_PATTERN" '%2F' Fortunately, parse_line() returns the number of arguments found, so it's easy from the callers to verify if any was empty. The goal of this commit is not to perform sensitive changes, it's only to mention when parsing a line that an empty argument was found and alert about its consequences using a warning. Most of the time when this happens, the config does not parse. But for examples as the ACLs above, there could be consequences that are better detected early. This patch depends on this previous fix: BUG/MINOR: tools: do not create an empty arg from trailing spaces Co-authored-by: Valentine Krasnobaeva <vkrasnobaeva@haproxy.com>	2025-05-05 16:17:24 +02:00
Willy Tarreau	7e4a2f39ef	BUG/MINOR: tools: do not create an empty arg from trailing spaces Trailing spaces on the lines of the config file create an empty arg which makes it complicated to detect really empty args. Let's first address this. Note that it is not user-visible but prevents from fixing user-visible issues. No backport is needed. The initial issue was introduced with this fix that already tried to address it: `8a6767d266` ("BUG/MINOR: config: don't count trailing spaces as empty arg (v2)") The current patch properly addresses leading and trailing spaces by only counting arguments if non-lws chars were found on the line. LWS do not cause a transition to a new arg anymore but they complete the current one. The whole new code relies on a state machine to detect when to create an arg (!in_arg->in_arg), and when to close the current arg. A special care was taken for word expansion in the form of "${ARGS[]}" which still continue to emit individual arguments past the first LWS. This example works fine: ARGS="100 check inter 1000" server name 192.168.1."${ARGS[]}" It properly results in 6 args: "server", "name", "192.168.1.100", "check", "inter", "1000" This fix should not have any visible user impact and is a bit tricky, so it's best not to backport it, at least for a while. Co-authored-by: Valentine Krasnobaeva <vkrasnobaeva@haproxy.com>	2025-05-05 16:16:54 +02:00
William Lallemand	af5bbce664	BUG/MINOR: acme/cli: don't output error on success Previous patch `7251c13c7` ("MINOR: acme: move the acme task init in a dedicated function") mistakenly returned the wrong error code when "acme renew" parsing was successful, and tried to emit an error message. This patch fixes the issue by returning 0 when the acme task was correctly scheduled to start. No backport needed.	2025-05-02 21:21:09 +02:00
Aurelien DARRAGON	0e6f968ee3	BUG/MEDIUM: stktable: fix sc_(<ctr>) BUG_ON() regression with ctx > 9 As reported in GH #2958, commit `6c9b315` caused a regression with sc_ fetches and tracked counter id > 9. As such, the below configuration would cause a BUG_ON() to be triggered: global log stdout format raw local0 tune.stick-counters 11 defaults log global mode http frontend www bind :8080 acl track_me bool(true) http-request set-var(txn.track_var) str("a") http-request track-sc10 var(txn.track_var) table rate_table if track_me http-request set-var(txn.track_var_rate) sc_gpc_rate(0,10,rate_table) http-request return status 200 backend rate_table stick-table type string size 1k expire 5m store gpc_rate(1,1m) While in `6c9b315` the src_fetch logic was removed from smp_fetch_sc_stkctr(), num > 9 is indeed not expected anymore as original num value. But what we didn't consider is that num is effectively re-assigned for generic sc_ variant. Thus the BUG_ON() is misplaced as it should only be evaluated for non-generic fetches. It explains why it triggers with valid configurations Thanks to GH user @tkjaer for his detailed report and bug analysis No backport needed, this bug is specific to 3.2.	2025-05-02 16:57:45 +02:00
Willy Tarreau	758e0818c3	[RELEASE] Released version 3.2-dev14 Released version 3.2-dev14 with the following main changes : - MINOR: acme: retry label always do a request - MINOR: acme: does not leave task for next request - BUG/MINOR: acme: reinit the retries only at next request - MINOR: acme: change the default max retries to 5 - MINOR: acme: allow a delay after a valid response - MINOR: acme: wait 5s before checking the challenges results - MINOR: acme: emit a log when starting - MINOR: acme: delay of 5s after the finalize - BUG/MEDIUM: quic: Let it be known if the tasklet has been released. - BUG/MAJOR: tasks: fix task accounting when killed - CLEANUP: tasks: use the local state, not t->state, to check for tasklets - DOC: acme: external account binding is not supported - MINOR: hlua: ignore "tune.lua.bool-sample-conversion" if set after "lua-load" - MEDIUM: peers: Give up if we fail to take locks in hot path - MEDIUM: stick-tables: defer adding updates to a tasklet - MEDIUM: stick-tables: Limit the number of old entries we remove - MEDIUM: stick-tables: Limit the number of entries we expire - MINOR: cfgparse-global: add explicit error messages in cfg_parse_global_env_opts - MINOR: ssl: add function to extract X509 notBefore date in time_t - BUILD: acme: need HAVE_ASN1_TIME_TO_TM - MINOR: acme: move the acme task init in a dedicated function - MEDIUM: acme: add a basic scheduler - MINOR: acme: emit a log when the scheduler can't start the task	2025-05-02 16:23:28 +02:00
William Lallemand	7ad501e6a1	MINOR: acme: emit a log when the scheduler can't start the task Emit an error log when the renewal scheduler can't start the task.	2025-05-02 16:12:41 +02:00
William Lallemand	7fe59ebb88	MEDIUM: acme: add a basic scheduler This patch implements a very basic scheduler for the ACME tasks. The scheduler is a task which is started from the postparser function when at least one acme section was configured. The scheduler will loop over the certificates in the ckchs_tree, and for each certificate will start an ACME task if the notAfter date is past curtime + (notAfter - notBefore) / 12, or 7 days if notBefore is not available. Once the lookup over all certificates is terminated, the task will sleep and will wakeup after 12 hours.	2025-05-02 16:01:32 +02:00
William Lallemand	7251c13c77	MINOR: acme: move the acme task init in a dedicated function acme_start_task() is a dedicated function which starts an acme task for a specified <store> certificate. The initialization code was move from the "acme renew" command parser to this function, in order to be called from a scheduler.	2025-05-02 16:01:32 +02:00
William Lallemand	878a3507df	BUILD: acme: need HAVE_ASN1_TIME_TO_TM Restrict the build of the ACME feature to libraries which provide ASN1_TIME_to_tm() function.	2025-05-02 16:01:32 +02:00
William Lallemand	626de9538e	MINOR: ssl: add function to extract X509 notBefore date in time_t Add x509_get_notbefore_time_t() which returns the notBefore date in time_t format.	2025-05-02 16:01:32 +02:00
Valentine Krasnobaeva	8a4b3216f9	MINOR: cfgparse-global: add explicit error messages in cfg_parse_global_env_opts When env variable name or value are not provided for setenv/presetenv it's not clear from the old error message shown at stderr, what exactly is missed. User needs to search in it's configuration. Let's add more explicit error messages about these inconsistencies. No need to be backported.	2025-05-02 15:37:45 +02:00
Olivier Houchard	994cc58576	MEDIUM: stick-tables: Limit the number of entries we expire In process_table_expire(), limit the number of entries we remove in one call, and just reschedule the task if there's more to do. Removing entries require to use the heavily contended update write lock, and we don't want to hold it for too long. This helps getting stick tables perform better under heavy load.	2025-05-02 15:27:55 +02:00
Olivier Houchard	d2d4c3eb65	MEDIUM: stick-tables: Limit the number of old entries we remove Limit the number of old entries we remove in one call of stktable_trash_oldest(), as we do so while holding the heavily contended update write lock, so we'd rather not hold it for too long. This helps getting stick tables perform better under heavy load.	2025-05-02 15:27:55 +02:00
Olivier Houchard	388539faa3	MEDIUM: stick-tables: defer adding updates to a tasklet There is a lot of contention trying to add updates to the tree. So instead of trying to add the updates to the tree right away, just add them to a mt-list (with one mt-list per thread group, so that the mt-list does not become the new point of contention that much), and create a tasklet dedicated to adding updates to the tree, in batchs, to avoid keeping the update lock for too long. This helps getting stick tables perform better under heavy load.	2025-05-02 15:27:55 +02:00
Olivier Houchard	b3ad7b6371	MEDIUM: peers: Give up if we fail to take locks in hot path In peer_send_msgs(), give up in order to retry later if we failed at getting the update read lock. Similarly, in __process_running_peer_sync(), give up and just reschedule the task if we failed to get the peer lock. There is an heavy contention on both those locks, so we could spend a lot of time trying to get them. This helps getting peers perform better under heavy load.	2025-05-02 15:27:55 +02:00
Aurelien DARRAGON	7a8d1a3122	MINOR: hlua: ignore "tune.lua.bool-sample-conversion" if set after "lua-load" tune.lua.bool-sample-conversion must be set before any lua-load or lua-load-per-thread is used for it to be considered. Indeed, lua-load directives are parsed on the fly and will cause some parts of the scripts to be executed during init already (script body/init contexts). As such, we cannot afford to have "tune.lua.bool-sample-conversion" set after some Lua code was loaded, because it would mean that the setting would be handled differently for Lua's code executed during or after config parsing. To avoid ambiguities, the documentation now states that the setting must be set before any lua-load(-per-thread) directive, and if the setting is met after some Lua was already loaded, the directive is ignored and a warning informs about that. It should fix GH #2957 It may be backported with `29b6d8af16` ("MINOR: hlua: rename "tune.lua.preserve-smp-bool" to "tune.lua.bool-sample-conversion"")	2025-05-02 14:38:37 +02:00
William Lallemand	6051a6e485	DOC: acme: external account binding is not supported Add a note on external account binding in the ACME section.	2025-05-02 12:04:07 +02:00
Willy Tarreau	1ed238101a	CLEANUP: tasks: use the local state, not t->state, to check for tasklets There's no point reading t->state to check for a tasklet after we've atomically read the state into the local "state" variable. Not only it's more expensive, it's also less clear whether that state is supposed to be atomic or not. And in any case, tasks and tasklets have their type forever and the one reflected in state is correct and stable.	2025-05-02 11:09:28 +02:00
Willy Tarreau	45e83e8e81	BUG/MAJOR: tasks: fix task accounting when killed After recent commit `b81c9390f` ("MEDIUM: tasks: Mutualize the TASK_KILLED code between tasks and tasklets"), the task accounting was no longer correct for killed tasks due to the decrement of tasks in list that was no longer done, resulting in infinite loops in process_runnable_tasks(). This just illustrates that this code remains complex and should be further cleaned up. No backport is needed, as this was in 3.2.	2025-05-02 11:09:28 +02:00

1 2 3 4 5 ...

24569 Commits