haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-01-02 10:12:03 +00:00

Author	SHA1	Message	Date
Aurelien DARRAGON	99f3409582	BUG/MINOR: proxy: fix server_id_hdr_name leak on deinit() proxy server_id_hdr_name member (used for "http-send-name-header" option) wasn't cleaned up in free_proxy(), resulting in small memory leak if "http-send-name-header" was used on a regular or default proxy. This may be backported to all stable versions.	2024-06-11 10:59:02 +02:00
Aurelien DARRAGON	e5ccfda9d3	MINOR: log: fix "http-send-name-header" ignore warning message Warning message to indicate that the "http-send-name-header" option is ignored for backend in "mode log" was referenced using its internal struct wording instead of public name (as seen in the documentation). Let's fix that. It may be backported with `c7783fb` ("MINOR: log/backend: prevent "http-send-name-header" use with LOG mode") in 2.9.	2024-06-11 10:58:55 +02:00
Christopher Faulet	7bff576ebb	BUG/MINOR: mux-h1: Use the right variable to set NEGO_FF_FL_EXACT_SIZE flag Instead of setting this flag on the ones used for the zero-copy negociation, it is set on the connection flags used for xprt->rcv_buf() call. Fortunately, there is no real consequence. The only visible effect is the chunk size that is written on 8 bytes for no reason. This patch is related to issue #2598. It must be backported to 3.0.	2024-06-10 14:06:35 +02:00
Christopher Faulet	e8cc8a60be	BUG/MAJOR: mux-h1: Properly copy chunked input data during zero-copy nego When data are transfered via zero-copy data forwarding, if some data were already received, we try to immediately tranfer it during the negociation step. If data are chunked and the chunk size is unknown, 10 bytes are reserved to write the chunk size during the done step. However, when input data are finally transferred, the offset is ignored. Data are copied into the output buffer. But the first 10 bytes are then crushed by the chunk size. Thus the chunk is truncated leading to a malformed message. This patch should fix the issue #2598. It must be backported to 3.0.	2024-06-10 14:06:35 +02:00
William Manley	52eb6b23f8	BUG/MEDIUM: stconn/mux-h1: Fix suspect change causing timeouts This fixes an issue I've had where if a connection was idle for ~23s it would get in a bad state. I don't understand this code, so I'm not sure exactly why it was failing. I discovered this by bisecting to identify the commit that caused the regression between 2.9 and 3.0. The commit is `d2c3f8dde7`: "MINOR: stconn/connection: Move shut modes at the SE descriptor level" - a part of v3.0-dev8. It seems to be an innocent renaming, so I looked through it and this stood out as suspect: - if (mode != CO_SHW_NORMAL) + if (mode & SE_SHW_NORMAL) It looks like the not went missing here, so this patch reverses that condition. It fixes my test. I don't quite understand what this is doing or is for so I can't write a regression test or decent commit message. Hopefully someone else will be able to pick this up from where I've left it. [CF: This inverts the condition to perform clean shutdowns. This means no clean shutdown are performed when it should do. This patch must be backported to 3.0]	2024-06-10 14:06:35 +02:00
Amaury Denoyelle	0ef94e2dff	BUG/MINOR: quic: ensure Tx buf is always purged quic_conn API for sending was recently refactored. The main objective was to regroup the different functions present for both handshake and application emission. After this refactoring, an optimization was introduced to avoid calling qc_send() if there was nothing new to emit. However, this prevent the Tx buffer to be purged if previous sending was interrupted, until new frames are finally available. To fix this, simply remove the optimization. qc_send() is thus now always called in quic_conn IO handlers. The impact of this bug should be minimal as it happens only on sending temporary error. However in this case, this could cause extra latency or even a complete sending freeze in the worst scenario. This must be backported up to 3.0.	2024-06-10 10:29:28 +02:00
Amaury Denoyelle	50470a5181	BUG/MINOR: quic: fix computed length of emitted STREAM frames qc_build_frms() is responsible to encode multiple frames in a single QUIC packet. It accounts for room left in the buffer packet for each newly encded frame. An incorrect computation was performed when encoding a STREAM frame in a single packet. Frame length was accounted twice which would reduce in excess the buffer packet room. This caused the remaining built frames to be reduced with the resulting packet not able to fill the whole MTU. The impact of this bug should be minimal. It is only present when multiple frames are encoded in a single packet after a STREAM. However in this case datagrams built are smaller than expecting, which is suboptimal for bandwith. This should be backported up to 2.6.	2024-06-10 10:24:02 +02:00
William Lallemand	711338e1ce	BUG/MEDIUM: ssl: bad auth selection with TLS1.2 and WolfSSL The ClientHello callback for WolfSSL introduced in haproxy 2.9, seems not to behave correctly with TLSv1.2. In TLSv1.2, this is the cipher that is used to chose the authentication algorithm (ECDSA or RSA), however an SSL client can send a signature algorithm. In TLSv1.3, the authentication is not part of the ciphersuites, and is selected using the signature algorithm. The mistake in the code is that the signature algorithm in TLSv1.2 are overwritting the auth that was selected using the ciphers. This must be backported as far as 2.9.	2024-06-07 15:47:15 +02:00
William Lallemand	93cc23a355	BUG/MEDIUM: ssl: wrong priority whem limiting ECDSA ciphers in ECDSA+RSA configuration The ClientHello Callback which is used for certificate selection uses both the signature algorithms and the ciphers sent by the client. However, when a client is announcing both ECDSA and RSA capabilities with ECSDA ciphers that are not available on haproxy side and RSA ciphers that are compatibles, the ECDSA certificate will still be used but this will result in a "no shared cipher" error, instead of a fallback on the RSA certificate. For example, a client could send 'ECDHE-ECDSA-AES128-CCM:ECDHE-RSA-AES256-SHA and HAProxy could be configured with only 'ECDHE-ECDSA-AES128-SHA:ECDHE-RSA-AES256-SHA'. This patch fixes the issue by validating that at least one ECDSA cipher is available on both side before chosing the ECDSA certificate. This must be backported on all stable versions.	2024-06-05 15:33:36 +02:00
Christopher Faulet	6697e87ae5	MINOR: mux-quic: Don't send an emtpy H3 DATA frame during zero-copy forwarding It may only happens when there is no data to forward but a last stream frame must be sent with the FIN bit. It is not invalid, but it is useless to send an empty H3 DATA frame in that case.	2024-06-05 07:28:10 +02:00
Christopher Faulet	9748df29ff	BUG/MEDIUM: mux-quic: Don't unblock zero-copy fwding if blocked during nego The previous fix (`792a645ec2` ["BUG/MEDIUM: mux-quic: Unblock zero-copy forwarding if the txbuf can be released"]) introduced a regression. The zero-copy data forwarding must only be unblocked if it was blocked by the producer, after a successful negotiation. It is important because during a negotiation, the consumer may be blocked for another reason. Because of the flow control for instance. In that case, there is not necessarily a TX buffer. And it unexpected to try to release an unallocated TX buf. In addition, the same may happen while a TX buf is still in-use. In that case, it must also not be released. So testing the TX buffer is not the right solution. To fix the issue, a new IOBUF flag was added (IOBUF_FL_FF_WANT_ROOM). It must be set by the producer if it is blocked after a sucessful negotiation because it needs more room. In that case, we know a buffer was provided by the consummer. In done_fastfwd() callback function, it is then possible to safely unblock the zero-copy data forwarding if this flag is set. This patch must be backported to 3.0 with the commit above.	2024-06-05 07:28:10 +02:00
Aurelien DARRAGON	2bde0d64dd	CLEANUP: hlua: simplify ambiguous lua_insert() usage in hlua_ctx_resume() 'lua_insert(lua->T, -lua_gettop(lua->T))' is actually used to rotate the top value with the bottom one, thus the code was overkill and the comment was actually misleading, let's fix that by using explicit equivalent form (absolute index). It may be backported with `5508db9a2` ("BUG/MINOR: hlua: fix unsafe lua_tostring() usage with empty stack") to all stable versions to ease code maintenance.	2024-06-04 16:31:38 +02:00
Aurelien DARRAGON	755c2daf0f	BUG/MINOR: hlua: fix leak in hlua_ckch_set() error path in hlua_ckch_commit_yield() and hlua_ckch_set(), when an error occurs, we enter the error path and try to raise an error from the <err> msg pointer which must be freed afterwards. However, the fact that luaL_error() never returns was overlooked, because of that <err> msg is never freed in such case. To fix the issue, let's use hlua_pushfstring_safe() helper to push the err on the lua stack and then free it before throwing the error using lua_error(). It should be backported up to 2.6 with 30fcca18 ("MINOR: ssl/lua: CertCache.set() allows to update an SSL certificate file")	2024-06-04 16:31:30 +02:00
Aurelien DARRAGON	2be94c008e	CLEANUP: hlua: get rid of hlua_traceback() security checks Thanks to the previous commit, we may now assume that hlua_traceback() won't LJMP, so it's safe to use it from unprotected environment without any precautions.	2024-06-04 16:31:22 +02:00
Aurelien DARRAGON	365ee28510	BUG/MINOR: hlua: prevent LJMP in hlua_traceback() Function is often used on error paths where no precaution is taken against LJMP. Since the function is used on error paths (which include out-of-memory error paths) the function lua_getinfo() could also raise a memory exception, causing the process to crash or improper error handling if the caller isn't prepared against that eventually. Since the function is only used on rare events (error handling) and is lacking the __LJMP prototype pefix, let's make it safe by protecting the lua_getinfo() call so that hlua_traceback() callers may use it safely now (the function will always succeed, output will be truncated in case of error). This could be backported to all stable versions.	2024-06-04 16:31:15 +02:00
Aurelien DARRAGON	f0e5b825cf	BUG/MINOR: hlua: fix unsafe hlua_pusherror() usage Following previous commit's logic: hlua_pusherror() is mainly used from cleanup paths where the caller isn't protected against LJMPs. Caller was tempted to think that the function was safe because func prototype was lacking the __LJMP prefix. Let's make the function really LJMP-safe by wrapping the sensitive calls under lua_pcall(). This may be backported to all stable versions.	2024-06-04 16:31:09 +02:00
Aurelien DARRAGON	c0a3c1281f	BUG/MINOR: hlua: don't use lua_pushfstring() when we don't expect LJMP lua_pushfstring() is used in multiple cleanup paths (upon error) to push the error message that will be raised by lua_error(). However this is often done from an unprotected environment, or in the middle of a cleanup sequence, thus we don't want the function to LJMP! (it may cause various issues ranging from memory leaks to crashing the process..) Hopefully this has very few chances of happening but since the use of lua_pushfstring() is limited to error reporting here, it's ok to use our own hlua_pushfstring_safe() implementation with a little overhead to ensure that the function will never LJMP. This could be backported to all stable versions.	2024-06-04 16:31:01 +02:00
Aurelien DARRAGON	6e484996c6	CLEANUP: hlua: use hlua_pusherror() where relevant In hlua_map_new(), when error occurs we use a combination of luaL_where, lua_pushfstring and lua_concat to build the error string before calling lua_error(). It turns out that we already have the hlua_pusherror() macro which is exactly made for that purpose so let's use it. It could be backported to all stable versions to ease code maintenance.	2024-06-04 16:30:55 +02:00
Amaury Denoyelle	f7ae84e7d1	BUG/MINOR: quic: prevent crash on qc_kill_conn() Ensure idle_timer task is allocated in qc_kill_conn() before waking it up. It can be NULL if idle timer has already fired but MUX layer is still present, which prevents immediate quic_conn release. qc_kill_conn() is only used on send() syscall fatal error to notify upper layer of an error and close the whole connection asap. This crash occurence is pretty rare as it relies on timing issues. It happens only if idle timer occurs before the MUX release (a bigger client timeout is thus required) and any send() syscall detected error. For now, it was only reproduced using GDB to interrupt haproxy longer than the idle timeout. This should be backported up to 2.6.	2024-06-04 14:59:24 +02:00
Christopher Faulet	792a645ec2	BUG/MEDIUM: mux-quic: Unblock zero-copy forwarding if the txbuf can be released In done_fastfwd() callback function, if nothing was forwarding while the SD is blocked, it means there is not enough space in the buffer to proceed. It may be because there are data to be sent. But it may also be data already sent waiting for an ack. In this case, no data to be sent by the mux. So the quic stream is not woken up when data are finally removed from the buffer. The data forwarding can thus be stuck. This happens when the stats page is requested in QUIC/H3. Only applets are affected by this issue and only with the QUIC multiplexer because it is the only mux with already sent data in the TX buf. To fix the issue, the idea is to release the txbuf if possible and then unblock the SD to perform a new zero-copy data forwarding attempt. Doing so, and thanks to the previous patch ("MEDIUM: applet: Be able to unblock zero-copy data forwarding from done_fastfwd"), the applet will be woken up. This patch should fix the issue #2584. It must be backported to 3.0.	2024-06-04 14:23:40 +02:00
Christopher Faulet	d2a2014f15	MEDIUM: stconn: Be able to unblock zero-copy data forwarding from done_fastfwd This part is only experienced by applet. When an applet try to forward data via an iobuf, it may decide to block for any reason even if there is free space in the buffer. For instance, the stats applet don't procude data if the buffer is almost full. However, in this case, it could be good to let the consumer decide a new attempt is possible because more space was made. So, if IOBUF_FL_FF_BLOCKED flag is removed by the consumer when done_fastfwd() callback function is called, the SE_FL_WANT_ROOM flag is removed on the producer sedesc. It is only done for applets. And thanks to this change, the applet can be woken up for a new attempt. This patch is required for a fix on the QUIC multiplexer.	2024-06-04 14:23:40 +02:00
Christopher Faulet	7c84ee71f7	BUG/MEDIUM: h1-htx: Don't state interim responses are bodyless Interim responses are by definition bodyless. But we must not set the corresponding HTX start-line flag, beecause the start-line of the final response is still expected. Setting the flag above too early may lead the multiplexer on the sending side to consider the message is finished after the headers of the interim message. It happens with the H2 multiplexer on frontend side if a "100-Continue" is received from the server. The interim response is sent and HTX_SL_F_BODYLESS_RESP flag is evaluated. Then, the headers of the final response are sent with ES flag, because HTX_SL_F_BODYLESS_RESP flag was seen too early, leading to a protocol error if the response has a body. Thanks to grembo for this analysis. This patch should fix the issue #2587. It must be backported as far as 2.9.	2024-06-04 14:23:40 +02:00
Aurelien DARRAGON	a63f2cde94	CLEANUP: hlua: fix CertCache class comment CLASS_CERTCACHE is used to declare CertCache global object, not Regex one This copy-paste typo introduced was in `30fcca18` ("MINOR: ssl/lua: CertCache.set() allows to update an SSL certificate file")	2024-06-03 17:00:06 +02:00
Aurelien DARRAGON	4f906a9c38	BUG/MINOR: hlua: use CertCache.set() from various hlua contexts Using CertCache.set() from init context wasn't explicitly supported and caused the process to crash: crash.lua: core.register_init(function() CertCache.set{filename="reg-tests/ssl/set_cafile_client.pem", ocsp=""} end) crash.conf: global lua-load crash.lua listen front bind localhost:9090 ssl crt reg-tests/ssl/set_cafile_client.pem ca-file reg-tests/ssl/set_cafile_interCA1.crt verify none ./haproxy -f crash.conf [NOTICE] (267993) : haproxy version is 3.0-dev2-640ff6-910 [NOTICE] (267993) : path to executable is ./haproxy [WARNING] (267993) : config : missing timeouts for proxy 'front'. \| While not properly invalid, you will certainly encounter various problems \| with such a configuration. To fix this, please ensure that all following \| timeouts are set to a non-zero value: 'client', 'connect', 'server'. [1] 267993 segmentation fault (core dumped) ./haproxy -f crash.conf This is because in hlua_ckch_set/hlua_ckch_commit_yield, we always consider that we're being called from a yield-capable runtime context. As such, hlua_gethlua() is never checked for NULL and we systematically try to wake hlua->task and yield every 10 instances. In fact, if we're called from the body or init context (that is, during haproxy startup), hlua_gethlua() will return NULL, and in this case we shouldn't care about yielding because it is ok to commit all instances at once since haproxy is still starting up. Also, when calling CertCache.set() from a non-yield capable runtime context (such as hlua fetch context), we kept doing as if the yield succeeded, resulting in unexpected function termination (operation would be aborted and the CertCache lock wouldn't be released). Instead, now we explicitly state in the doc that CertCache.set() cannot be used from a non-yield capable runtime context, and we raise a runtime error if it is used that way. These bugs were discovered by reading the code when trying to address Svace report documented by @Bbulatov GH #2586. It should be backported up to 2.6 with 30fcca18 ("MINOR: ssl/lua: CertCache.set() allows to update an SSL certificate file")	2024-06-03 17:00:00 +02:00
Aurelien DARRAGON	8860c22c00	MINOR: stktable: avoid ambiguous stktable_data_ptr() usage in cli_io_handler_table() As reported by @Bbulatov in GH #2586, stktable_data_ptr() return value is used without checking it isn't NULL first, which may happen if the given type is invalid or not stored in the table. However, since date_type is set by table_prepare_data_request() right before cli_io_handler_table() is invoked, date_type is not expected to be invalid: table_prepare_data_request() normally checked that the type is stored inside the table. Thus stktable_data_ptr() should not be failing at this point, so we add a BUG_ON() to indicate that.	2024-06-03 16:59:54 +02:00
Willy Tarreau	ba958fb230	BUG/MINOR: tools: fix possible null-deref in env_expand() on out-of-memory In GH issue #2586 @Bbulatov reported a theoretical null-deref in env_expand() in case there's no memory anymore to expand an environment variable. The function should return NULL in this case so that the only caller (str2sa_range) sees it. In practice it may only happen during boot thus is harmless but better fix it since it's easy. This can be backported to all versions where this applies.	2024-05-31 18:55:36 +02:00
Willy Tarreau	8a7afb6964	BUG/MINOR: tcpcheck: report correct error in tcp-check rule parser When parsing tcp-check expect-header, a copy-paste error in the error message causes the name of the header to be reporetd as the invalid format string instead of its value. This is really harmless but should be backported to all versions to help users understand the cause of the problem when this happens. This was reported in GH issue #2586 by @Bbulatov.	2024-05-31 18:37:56 +02:00
Willy Tarreau	d8194fab82	BUG/MINOR: cfgparse: remove the correct option on httpcheck send-state warning In GH issue #2586 @Bbulatov reported a bug where the http-check send-state flag is removed from options instead of options2 when http-check is disabled. It only has an effect when this option is set and http-check disabled, where it displays a warning indicating this will be ignored. The option removed instead is srvtcpka when this happens. It's likely that both options being so minor, nobody ever faced it. This can be backported to all versions.	2024-05-31 18:30:16 +02:00
Valentine Krasnobaeva	d5e43caaf5	BUG/MINOR: activity: fix Delta_calls and Delta_bytes count Thanks to the commit `5714aff4a6` "DEBUG: pool: store the memprof bin on alloc() and update it on free()", the amount of memory allocations and memory "frees" is shown now on the same line, corresponded to the caller name. This is very convenient to debug memory leaks (haproxy should run with -dMcaller option). The implicit drawback of this solution is that we count twice same free_calls and same free_tot (bytes) values in cli_io_handler_show_profiling(), when we've calculed tot_free_calls and tot_free_bytes, by adding them to the these totalizators for p_alloc, malloc and calloc allocator types. See the details about why this happens in a such way in __pool_free() implementation and also in the commit message for `5714aff4a6`. This double addition of free counters falses 'Delta_calls' and 'Delta_bytes', sometimes we even noticed that they show negative values. Same problem was with the calculation of average allocated buffer size for lines, where we show simultaneously the number of allocated and freed bytes.	2024-05-28 19:25:08 +02:00
Willy Tarreau	decb7c90df	CLEANUP: ssl_sock: move dirty openssl-1.0.2 wrapper to openssl-compat Valentine noticed this ugly SSL_CTX_get_tlsext_status_cb() macro definition inside ssl_sock.c that is dedicated to openssl-1.0.2 only. It would be better placed in openssl-compat.h, which is what this patch does. It also addresses a missing pair of parenthesis and removes an invalid extra semicolon.	2024-05-28 19:17:57 +02:00
Valentine Krasnobaeva	84380965a5	BUG/MINOR: ssl/ocsp: init callback func ptr as NULL In ssl_sock_load_ocsp() it is better to initialize local scope variable 'callback' function pointer as NULL, while we are declaring it. According to SSL_CTX_get_tlsext_status_cb() API, then we will provide a pointer to this 'on stack' variable in order to check, if the callback was already set before: OpenSSL 1.x.x and 3.x.x: long SSL_CTX_get_tlsext_status_cb(SSL_CTX ctx, int (callback)(SSL , void )); long SSL_CTX_set_tlsext_status_cb(SSL_CTX ctx, int (callback)(SSL , void )); WolfSSL 5.7.0: typedef int(tlsextStatusCb)(WOLFSSL* ssl, void); WOLFSSL_API int wolfSSL_CTX_get_tlsext_status_cb(WOLFSSL_CTX ctx, tlsextStatusCb* cb); WOLFSSL_API int wolfSSL_CTX_set_tlsext_status_cb(WOLFSSL_CTX* ctx, tlsextStatusCb cb); When this func ptr variable stays uninitialized, haproxy comipled with ASAN crushes in ssl_sock_load_ocsp(): ./haproxy -d -f haproxy.cfg ... AddressSanitizer:DEADLYSIGNAL ================================================================= ==114919==ERROR: AddressSanitizer: SEGV on unknown address 0x000000000008 (pc 0x5eab8951bb32 bp 0x7ffcdd6d8410 sp 0x7ffcdd6d82e0 T0) ==114919==The signal is caused by a READ memory access. ==114919==Hint: address points to the zero page. #0 0x5eab8951bb32 in ssl_sock_load_ocsp /home/vk/projects/haproxy/src/ssl_sock.c:1248:22 #1 0x5eab89510d65 in ssl_sock_put_ckch_into_ctx /home/vk/projects/haproxy/src/ssl_sock.c:3389:6 ... This happens, because callback variable is allocated on the stack. As not being explicitly initialized, it may contain some garbage value at runtime, due to the linked crypto library update or recompilation. So, following ssl_sock_load_ocsp code, SSL_CTX_get_tlsext_status_cb() may fail, callback will still contain its initial garbage value, 'if (!callback) {...' test will put us on the wrong path to access some ocsp_cbk_arg properties via its pointer, which won't be set and like this we will finish with segmentation fault. Must be backported in all stable versions. All versions does not have the ifdef, the previous cleanup patch is useful starting from the 2.7 version.	2024-05-28 18:14:26 +02:00
Valentine Krasnobaeva	fb7b46d267	CLEANUP: ssl/ocsp: readable ifdef in ssl_sock_load_ocsp Due to the support of different TLS/SSL libraries and its different versions, sometimes we are forced to use different internal typedefs and callback functions. We strive to avoid this, but time to time "#ifdef... #endif" become inevitable. In particular, in ssl_sock_load_ocsp() we define a 'callback' variable, which will contain a function pointer to our OCSP stapling callback, assigned further via SSL_CTX_set_tlsext_status_cb() to the intenal SSL context struct in a linked crypto library. If this linked crypto library is OpenSSL 1.x.x/3.x.x, for setting and getting this callback we have the following API signatures (see doc/man3/SSL_CTX_set_tlsext_status_cb.pod): long SSL_CTX_get_tlsext_status_cb(SSL_CTX ctx, int (callback)(SSL , void )); long SSL_CTX_set_tlsext_status_cb(SSL_CTX ctx, int (callback)(SSL , void )); If we are using WolfSSL, same APIs expect tlsextStatusCb function prototype, provided via the typedef below (see wolfssl/wolfssl/ssl.h): typedef int(tlsextStatusCb)(WOLFSSL* ssl, void); WOLFSSL_API int wolfSSL_CTX_get_tlsext_status_cb(WOLFSSL_CTX ctx, tlsextStatusCb* cb); WOLFSSL_API int wolfSSL_CTX_set_tlsext_status_cb(WOLFSSL_CTX* ctx, tlsextStatusCb cb); It seems, that in OpenSSL < 1.0.0, there was no support for OCSP extention, so no need to set this callback. Let's avoid #ifndef... #endif for this 'callback' variable definition to keep things clear. #ifndef... #endif are usually less readable, than straightforward "#ifdef... #endif".	2024-05-28 18:00:44 +02:00
Willy Tarreau	725fa0ecd2	BUILD: fd: errno is also needed without poll() When building without USE_POLL, fd.c fails on errno because that one is only included when USE_POLL is set. Let's move it outside of the ifdef.	2024-05-27 19:14:14 +02:00
Aurelien DARRAGON	435a9da267	MINOR: log: rename 'log-format tag' to 'log-format alias' In 2.9 we started to introduce an ambiguity in the documentation by referring to historical log-format variables ('%var') as log-format tags in `739c4e5b1e` ("MINOR: sample: accept_date / request_date return %Ts / %tr timestamp values") and `454c372b60` ("DOC: configuration: add sample fetches for timing events"). In fact, we've had this confusion between log-format tag and log-format var for more than 10 years now, but in 2.9 it was the first time the confusion was exposed in the documentation. Indeed, both 'log-format variable' and 'log-format tag' actually refer to the same feature (that is: '%B' and friends that can be used for direct access to some log-oriented predefined fetches instead of using %[expr] with generic sample expressions). This feature was first implemented in `723b73ad75` ("MINOR: config: Parse the string of the log-format config keyword") and later documented in `4894040fa` ("DOC: log-format documentation"). At that time, it was clear that we used to name it 'log-format variable'. But later the same year, 'log-format tag' naming started to appear in some commit messages (while still referring to the same feature), for instance with `ffc3fcd6d` ("MEDIUM: log: report SSL ciphers and version in logs using logformat %sslc/%sslv"). Unfortunately in 2.9 when we added (and documented) new log-format variables we officially started drifting to the misleading 'log-format tag' naming (perhaps because it was the most recent naming found for this feature in git log history, or because the confusion has always been there) Even worse, in 3.0 this confusion led us to rename all 'var' occurrences to 'tag' in log-format related code to unify the code with the doc. Hopefully William quickly noticed that we made a mistake there, but instead of reverting to historical naming (log-format variable), it was decided that we must use a different name that is less confusing than 'tags' or 'variables' (tags and variables are keywords that are already used to designate other features in the code and that are not very explicit under log-format context today). Now we refer to '%B' and friends as a logformat alias, which is essentially a handy way to print some log oriented information in the log string instead of leveraging '%[expr]' with generic sample expressions made of fetches and converters. Of course, there are some subtelties, such as a few log-format aliases that still don't have sample fetch equivalent for historical reasons, and some aliases that may be a little faster than their generic sample expression equivalents because most aliases are pretty much hardcoded in the log building function. But in general logformat aliases should be simply considered as an alternative to using expressions (with '%[expr']') Also, under log-format context, when we want to refer to either an alias ('%alias') or an expression ('%[expr]'), we should use the generic term 'logformat item', which in fact designates a single item within the logformat string provided by the user. Indeed, a logformat item (whether is is an alias or an expression) always starts with '%' and may accept optional flags / arguments Both the code and the documentation were updated in that sense, hopefully this will clarify things and prevent future confusions.	2024-05-27 17:03:48 +02:00
William Lallemand	0a00302fab	MINOR: sample: implement the uptime sample fetch 'uptime' returns the uptime of the current HAProxy worker in seconds.	2024-05-27 11:06:40 +02:00
Christopher Faulet	0d7c1bc6ab	BUG/MINOR: server: Don't reset resolver options on a new default-server line When a new "default-server" line is parsed, some resolver options are reset. Thus previously defined default options cannot be inherited. There is no reason to do so. First because other server options are inherited. And then because not all resolver options are reset. It is not consistent. This patch should fix issue #2559. It should be backported to all stable versions.	2024-05-24 16:31:01 +02:00
Christopher Faulet	8d2514e087	BUG/MINOR: http-htx: Support default path during scheme based normalization As stated in RFC3986, for an absolute-form URI, an empty path should be normalized to a path of "/". This is part of scheme based normalization rules. This kind of normalization is already performed for default ports. So we might as well deal with the case of empty path. The associated reg-tests was updated accordingly. This patch should fix the issue #2573. It may be backported as far as 2.4 if necessary.	2024-05-24 16:17:24 +02:00
Aurelien DARRAGON	c16eba8183	BUG/MEDIUM: server/dns: preserve server's port upon resolution timeout or error @boi4 reported in GH #2578 that since 3.0-dev1 for servers with address learned from A/AAAA records after a DNS flap server would be put out of maintenance with proper address but with invalid port (== 0), making it unusable and causing tcp checks to fail: [NOTICE] (1) : Loading success. [WARNING] (8) : Server mybackend/myserver1 is going DOWN for maintenance (DNS refused status). 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue. [ALERT] (8) : backend 'mybackend' has no server available! [WARNING] (8) : mybackend/myserver1: IP changed from '(none)' to '127.0.0.1' by 'myresolver/ns1'. [WARNING] (8) : Server mybackend/myserver1 ('myhost') is UP/READY (resolves again). [WARNING] (8) : Server mybackend/myserver1 administratively READY thanks to valid DNS answer. [WARNING] (8) : Server mybackend/myserver1 is DOWN, reason: Layer4 connection problem, info: "Connection refused", check duration: 0ms. 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue. @boi4 also mentioned that this used to work fine before. Willy suggested that this regression may have been introduced by `64c9c8e` ("BUG/MINOR: server/dns: use server_set_inetaddr() to unset srv addr from DNS") Turns out he was right! Indeed, in `64c9c8e` we systematically memset the whole server_inetaddr struct (which contains both the requested server's addr and port planned for atomic update) instead of only memsetting the addr part of the structure: except when SRV records are involved (SRV records provide both the address and the port unlike A or AAAA records), we must not reset the server's port upon DNS errors because the port may have been provided at config time and we don't want to lose its value. Big thanks to @boi4 for his well-documented issue that really helped us to pinpoint the bug right on time for the dev-13 release. No backport needed (unless `64c9c8e` gets backported).	2024-05-24 15:29:48 +02:00
Amaury Denoyelle	98ed11b0c5	BUG/MINOR: rhttp: initialize session origin after preconnect reversal Since the following commit, session is initialized early for rhttp preconnect. `12c40c25a9` MEDIUM: rhttp: create session for active preconnect Session origin member was not set. However, this prevents several session fetches to not work as expected. Worst, this caused a regression as previously session was created after reversal with origin member defined. This was reported by user William Manley on the mailing-list which rely on set-dst. One possible fix would be to set origin on session_new(). However, as this is done before reversal, some session members may be incorrectly initialized, in particular source and destination address. Thus, session origin is only set after reversal is completed. This ensures that session fetches have the same behavior on standard connections and reversable ones. This does not need to be backported.	2024-05-24 14:47:21 +02:00
Amaury Denoyelle	47168e217a	MEDIUM: connection: use pool-conn-name instead of sni on reuse Implement pool-conn-name support for idle connection reuse. It replaces SNI as arbitrary identifier for connections in the idle pool. Thus, every SNI reference in this context have been replaced. Main change occurs in connect_server() where pool-conn-name sample fetch is now prehash to generate idle connection identifier. SNI is now solely used in the context of SSL for ssl_sock_set_servername().	2024-05-24 14:47:21 +02:00
Amaury Denoyelle	be4f89f2b2	MINOR: server: define pool-conn-name keyword Define a new server keyword pool-conn-name. The purpose of this keyword will be to identify connections inside the idle connections pool, replacing SNI in case SSL is not wanted. This keyword uses a sample expression argument. It thus can reuse existing function parse_srv_expr() for parsing. In the future, it may be necessary to define a keyword variant which uses a logformat for extensability. This patch only implement parsing. Argument is stored inside new server field <pool_conn_name> and expression is generated in _srv_parse_finalize() into <pool_conn_name_expr>. If pool-conn-name is not set but SNI is, the latter is reused automatically as pool-conn-name via _srv_parse_finalize(). This ensures current reuse behavior remains compatible and idle connection reuse will not mix connections with different SNIs by mistake. Main usage will be for rhttp when SSL is not wanted between the two haproxy instances. Previously, it was possible to use "sni" keyword even without SSL on a server line which have a similar effect. However, having a dedicated "pool-conn-name" keyword is deemed clearer. Besides, it would allow for more complex configuration where pool-conn-name and SNI are use in parallel with different values.	2024-05-24 14:36:31 +02:00
Amaury Denoyelle	91001422b4	MINOR: server: generalize sni expr parsing Two functions exists for server sni sample expression parsing. This is confusing so this commit aims at clarifying this. Functions are renamed with the following identifiers. First function is named parse_srv_expr() and can be used during parsing. Besides expression parsing, it has ensure sample fetch validity in the context of a server line. Second function is renamed _parse_srv_expr() and is used internally by parse_srv_expr(). It only implements sample parsing without extra checks. It is already use for server instantiation derived from server-template as checks were already performed. Also, it is now used in http-client code as SNI is a fixed string. Finally, both functions are generalized to remove any reference to SNI. This will allow to reuse it to parse other server keywords which use an expression. This will be the case for the future keyword pool-conn-name.	2024-05-24 14:36:31 +02:00
Amaury Denoyelle	b9f67a46a2	MINOR: quic: clarify doc for quic_recv() Just highlight the fact that quic_recv() only receive a single datagram.	2024-05-24 14:36:31 +02:00
Amaury Denoyelle	5764bc50b5	BUG/MINOR: quic: adjust restriction for stateless reset emission Review RFC 9000 and ensure restriction on Stateless reset are properly enforced. After careful examination, several changes are introduced. First, redefine minimal Stateless Reset emitted packet length to 21 bytes (5 random bytes + a token). This is the new default length used in every case, unless received packet which triggered it is 43 bytes or smaller. Ensure every Stateless Reset packets emitted are at 1 byte shorter than the received packet which triggered it. No Stateless reset will be emitted if this falls under the above limit of 21 bytes. Thus this should prevent looping issues. This should be backported up to 2.6.	2024-05-24 14:36:31 +02:00
Amaury Denoyelle	f55748a422	MAJOR: config: prevent QUIC with clients privileged port by default Previous commit introduce new protection mechanism to forbid communications with clients which use a privileged source port. By default, this mechanism is disabled for every protocols. This patch changes the default value and activate the protection mechanism for QUIC protocol. This is justified as it is a probable sign of DNS/NTP amplification attack. This is labelled as major as it can be a breaking change with some network environments.	2024-05-24 14:36:31 +02:00
Amaury Denoyelle	45f40bac4c	MEDIUM: config: prevent communication with privileged ports This commit introduces a new global setting named harden.reject_privileged_ports.{tcp\|quic}. When active, communications with clients which use privileged source ports are forbidden. Such behavior is considered suspicious as it can be used as spoofing or DNS/NTP amplication attack. Value is configured per transport protocol. For each TCP and QUIC distinct code locations are impacted by this setting. The first one is in sock_accept_conn() which acts as a filter for all TCP based communications just after accept() returns a new connection. The second one is dedicated for QUIC communication in quic_recv(). In both cases, if a privileged source port is used and setting is disabled, received message is silently dropped. By default, protection are disabled for both protocols. This is to be able to backport it without breaking changes on stable release. This should be backported as it is an interesting security feature yet relatively simple to implement.	2024-05-24 14:36:31 +02:00
Amaury Denoyelle	4e632545f7	BUILD: trace: fix warning on null dereference Since a recent change on trace, the following compilation warning may occur : src/trace.c: In function ‘trace_parse_cmd’: src/trace.c:865:33: error: potential null pointer dereference [-Werror=null-dereference] 865 \| for (nd = src->decoding; nd->name && nd->desc; nd++) \| ~~~^~~~~~~~~~~~~~~ Fix this by rearranging code path to better highlight that only "quiet" verbosity is allowed if no trace source is specified. This was detected with GCC 14.1.	2024-05-24 14:36:03 +02:00
Aurelien DARRAGON	c9af6d5414	DEBUG: pollers/fd: add thread id suffix to per-thread memory areas name hints Willy reported that since `abb8412d2` ("DEBUG: pollers: add name hint for large memory areas used by pollers") and `22ec2ad8b` ("DEBUG: fd: add name hint for large memory areas") multiple maps with the same name could be found in /proc/<pid>/maps when haproxy process is started with multiple threads, which can be annoying. In fact this happens because some poller and fd-created memory areas are being created for each available thread, and since the naming was done using vma_set_name() with the same <type> and <name> inputs, the resulting name was the same for all threads. Thanks to the previous commit, we now use vma_set_name_id() for naming per-thread memory areas so that "-id" prefix is appended after the name name, where "id" equals to 'tid+1' (to match the thread numbering logic found in config file or in ha_panic() report), allowing to easily identify which haproxy thread owns the map in /proc/<pid>/maps: 7d3b26200000-7d3b26a01000 rw-p 00000000 00:00 0 [anon:ev_poll:poll_events-2] 7d3b26c00000-7d3b27001000 rw-p 00000000 00:00 0 [anon:fd:fd_updt-2] 7d3b27200000-7d3b27a01000 rw-p 00000000 00:00 0 [anon:ev_poll:poll_events-1] 7d3b34200000-7d3b34601000 rw-p 00000000 00:00 0 [anon:fd:fd_updt-1]	2024-05-24 12:07:18 +02:00
Aurelien DARRAGON	9d37c4b989	DEBUG: tools: add vma_set_name_id() helper Just like vma_set_name() from `51a8f134e` ("DEBUG: tools: add vma_set_name() helper"), but also takes <id> as parameter to append "-$id" suffix after the name in order to differentiate 2 areas that were named using the same <type> and <name> combination. example, using mmap + MAP_SHARED\|MAP_ANONYMOUS: 7364c4fff000-736508000000 rw-s 00000000 00:01 3540 [anon_shmem:type:name-id] Another example, using mmap + MAP_PRIVATE\|MAP_ANONYMOUS or using glibc/malloc() above MMAP_THRESHOLD: 7364c4fff000-736508000000 rw-s 00000000 00:01 3540 [anon:type:name-id]	2024-05-24 12:07:13 +02:00
Aurelien DARRAGON	23814a44e5	CLEANUP: tools: fix vma_set_name() function comment There was a typo in the example provided in vma_set_name(): maps named using the function will show up as "type:name", not "type.name", updating the comment to reflect the current behavior.	2024-05-24 12:07:07 +02:00

1 2 3 4 5 ...

17628 Commits