haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-02-22 13:46:52 +00:00

Author	SHA1	Message	Date
Remi Tricot-Le Breton	45a2ff0f4a	MINOR: shctx: Remove 'use_shared_mem' variable This global variable was used to avoid using locks on shared_contexts in the unlikely case of nbthread==1. Since the locks do not do anything when USE_THREAD is not defined, it will be more beneficial to simply remove this variable and the systematic test on its value in the shared context locking functions.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	4fe6c1365d	MINOR: shctx: Remove redundant arg from free_block callback The free_block callback does not get called on blocks that are not row heads anymore so we don't need too shared_block parameters.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	48f81ec09d	MAJOR: cache: Delay cache entry delete in reserve_hot function A reference counter on the cache_entry was added in a previous commit. Its value is atomically increased and decreased via the retain_entry and release_entry functions. This is needed because of the latest cache and shared_context modifications that introduced two separate locks instead of the preexisting single shctx_lock one. With the new logic, we have two main blocks competing for the two locks: - the one in the http_action_req_cache_use that performs a lookup in the cache tree (locked by the cache lock) and then tries to remove the corresponding blocks from the shared_context's 'avail' list until the response is sent to the client by the cache applet, - the shctx_row_reserve_hot that traverses the 'avail' list and gives them back to the caller, while removing previous row heads from the cache tree Those two blocks require the two locks but one of them would take the cache lock first, and the other one the shctx_lock first, which would end in a deadlock without the current patch. The way this conflict is resolved in this patch is by ensuring that at least one of those uses works without taking the two locks at the same time. The solution found was to keep taking the two locks in the cache_use case. We first lock the cache to lookup for an entry and we then take the shctx lock as well to detach the corresponding blocks from the 'avail' list. The subtlety is that between the cache lookup and the actual locking of the shctx, another thread might have called the reserve_hot function in which we only take the shctx lock. In this function we traverse the 'avail' list to remove blocks that are then given to the caller. If one of those blocks corresponds to a previous row head, we call the 'free_blocks' callback that used to delete the cache entry from the tree. We now avoid deleting directly the cache entries in reserve_hot and we rather set the cache entries 'complete' param to 0 so that no other thread tries to work with this entry. This way, when we release the shctx lock in reserve_hot, the first thread that had performed the cache lookup and had found an entry that we just gave to another thread will see that the 'complete' field is 0 and it won't try to work with this response. The actual removal of entries from the cache tree will now be performed in the new 'reserve_finish' callback called at the end of the shctx_row_reserve_hot function. It will iterate on all the row head that were inserted in a dedicated list in the 'free_block' callback and perform the actual delete.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	1cd91b4f2a	MINOR: shctx: Add new reserve_finish callback call to shctx_row_reserve_hot This patch adds a reserve_finish callback that can be defined by the subsystems that require a shared_context. It is called at the end of shctx_row_reserve_hot after the shared_context lock is released.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	11df806c88	MEDIUM: shctx: Descend shctx_lock calls into the shctx_row_reserve_hot Descend the shctx_lock calls into the shctx_row_reserve_hot so that the cases when we don't need to lock anything (enough space in the current row or not enough space in the 'avail' list) do not take the lock at all. In sh_ssl_sess_new_cb the lock had to be descended into sh_ssl_sess_store in order not to cover the shctx_row_reserve_hot call anymore.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	a29b073f26	MEDIUM: cache: Add refcount on cache_entry Add a reference counter on the cache_entry. Its value will be atomically increased and decreased via the retain_entry and release_entry functions. The release_entry function has two distinct versions, release_entry_locked and release_entry_unlocked that should be called when the cache lock is already taken in write mode or not (respectively). In the unlocked case the cache lock will only be taken in write mode on the last reference of the entry (before calling delete_entry). This allows to limit the amount of times when we need to take the cache lock during a release operation.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	ed35b9411a	MEDIUM: cache: Switch shctx spinlock to rwlock and restrict its scope Since a lock on the cache tree was added in the latest cache changes, we do not need to use the shared_context's lock to lock more than pure shared_context related data anymore. This already existing lock will now only cover the 'avail' list from the shared_context. It can then be changed to a rwlock instead of a spinlock because we might want to only run through the avail list sometimes. Apart form changing the type of the shctx lock, the main modification introduced by this patch is to limit the amount of code covered by the shctx lock. This lock does not need to cover any code strictly related to the cache tree anymore.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	a0d7c290ec	MINOR: cache: Use dedicated trash for "show cache" cli command After the latest changes in the cache/shared_context mechanism, the cache and shared_context logic were decorrelated and in some unlikely cases we might end up using the "show cache" command while some regular cache processing is occurring (a response being stored in the cache for instance). In such a case, because we used the same 'trash' buffer in those two contexts, we could end up with the contents of a response in the ouput of the "show cache" command. This patch fixes this problem by allocating a dedicated trash for the CLI command.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	3831d8454f	MEDIUM: shctx: Remove 'hot' list from shared_context The "hot" list stored in a shared_context was used to keep a reference to shared blocks that were currently being used and were thus removed from the available list (so that they don't get reused for another cache response). This 'hot' list does not ever need to be shared across threads since every one of them only works on their current row. The main need behind this 'hot' list was to detach the corresponding blocks from the 'avail' list and to have a known list root when calling list_for_each_entry_from in shctx_row_data_append (for instance). Since we actually never need to iterate over all members of the 'hot' list, we can remove it and replace the inc_hot/dec_hot logic by a detach/reattach one.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	bd24118212	MEDIUM: cache: Use rdlock on cache in cache_use When looking for a valid entry in the cache tree in http_action_req_cache_use, we do not need to delete an expired entry at once because even if an expired entry exists, since the request will be forwarded to the server, then the expired entry will be overwritten when the updated response is seen. We can then use a simpler rdlock during cache_use operation.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	0dfb57bbf9	MINOR: cache: Add option to avoid removing expired entries in lookup function Any lookup in the cache tree done through entry_exist or secondary_entry_exist functions could end up deleting the corresponding entry if it is expired which prevents from using a rdlock on code paths that would just perform a lookup on the tree (in http_action_req_cache_use for instance). Adding a 'delete_expired' boolean as a parameter allows for "pure" lookups and thus it will allow to perform operations on the tree that simply require a rdlock instead of a "heavier" wrlock.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	ff3cb6dad4	MINOR: cache: Remove expired entry delete in "show cache" command The "show cache" CLI command iterates over all the entries of the cache tree and it used this opportunity to remove expired entries from the cache. This behavior was completely undocumented and does not seem that necessary. By removing it we can take the cache lock in read mode only which limits the impact on the other threads.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	ac9c49b40d	MEDIUM: cache: Use dedicated cache tree lock alongside shctx lock Every use of the cache tree was covered by the shctx lock even when no operations were performed on the shared_context lists (avail and hot). This patch adds a dedicated RW lock for the cache so that blocks of code that work on the cache tree only can use this lock instead of the superseding shctx one. This is useful for operations during which the concerned blocks are already in the hot list. When the two locks need to be taken at the same time, in http_action_req_cache_use and in shctx_row_reserve_hot, the shctx one must be taken first. A new parameter needed to be added to the shared_context's free_block callback prototype so that cache_free_block can take the cache lock and release it afterwards.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	81d8014af8	MINOR: shctx: Remove explicit 'from' param from shctx_row_data_append This parameter is not necessary since the first element of a row always has a pointer to the row's tail.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	610b67fd8b	MEDIUM: shctx: Simplify shctx_row_reserve_hot loop The shctx_row_reserve_hot relied on two loop levels in order to first look for the first block of a preused row and then iterate on all the blocks of this row to reserve them for the new row. This was not the simplest nor the easiest to read way so this logic could be replaced by a single iteration on the avail list members. The two use cases of calling this function with or without a preexisting "first" member were a bit cumbersome as well and were replaced by a more straightforward approach.	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	eccb97f60e	MEDIUM: shctx: Move list between hot and avail list in O(1) Instead of iterating over all the elements of a given row when moving it between the hot and available lists, we can make use of the last_reserved pointer that already points to the last block of the list to perform the move in O(1).	2023-11-16 19:35:10 +01:00
Remi Tricot-Le Breton	55fbf82080	MINOR: shctx: Set last_append to NULL when reserving block in hot list Ensure that the last_append pointer is always set to NULL on first block of rows reserved by the subsystems using the shctx (cache for instance). This pointer will be used directly in shctx_row_data_append instead of the 'from' param which will simplify its uses.	2023-11-16 19:35:10 +01:00
Amaury Denoyelle	560cb1332a	MINOR: server: force add to idle on reverse A backend connection is inserted in server idle list via srv_add_to_idle_list(). This function has several conditions which may cause the connection to be rejected instead. One of this condition is based on the current estimate count of needed connections for the server. If the count of idle connections stored has already reached this estimation, the new connection is rejected. This is in opposition with the purpose of reverse HTTP. On active reverse, haproxy can instantiate several connections to properly serve the future traffic. However, the opposite passive haproxy will have only a low estimate of needed connection and will reject most of them. To fix this, simply check CO_FL_REVERSED connection flag on srv_add_to_idle_list(). If set, the connection is inserted without checking for estimate count. Note that all other conditions are not impacted, so it's still possible to reject a connection, for example if process FD limit is reached. This commit relies on recent patch which change CO_FL_REVERSED flag for connection after passive reverse.	2023-11-16 18:43:41 +01:00
Amaury Denoyelle	a1457296d5	BUG/MINOR: mux_h2: reject passive reverse conn if error on add to idle On passive reverse, H2 mux is responsible to insert the connection in the server idle list. This is done via srv_add_to_idle_list(). However, this function may fail for various reason, such as FD usage limit reached. Handle properly this error case. H2 mux flags the connection on error which will cause its release. Prior to this patch, the connection was only released on server timeout. This bug was found inspecting server curr_used_conns counter. Indeed, on connection reverse, this counter is first incremented. It is decremented just after on srv_add_to_idle_list() if insertion is validated. However, if insertion is rejected, the connection was not released which cause curr_used_conns to remains positive. This has the major downside to break the reusing of idle connection on rhttp causing spurrious 503 errors. No need to backport.	2023-11-16 18:43:32 +01:00
Amaury Denoyelle	8cc3fc73f1	MINOR: connection: update rhttp flags usage Change the flags used for reversed connection : * CO_FL_REVERSED is now put after reversal for passive connect. For active connect, it is delayed when accept is completed after reversal. * CO_FL_ACT_REVERSING replace the old CO_FL_REVERSED. It is put only for active connect on reversal and removes once accept is done. This allows to identify a connection as reversed during its whole lifetime. This should be useful to extend reverse connect.	2023-11-16 17:53:31 +01:00
Christopher Faulet	691f4cf449	BUG/MEDIUM: stream: Don't call mux .ctl() callback if not implemented The commit `5ff7d2276` ("BUG/MEDIUM: stream: Properly handle abortonclose when set on backend only") introduced a regression. Not all multiplexer implement the .ctl() callback function. Thus we must be sure this callback function is defined first to call it. This patch should fix a crash reported by Tristan in the issue #2095. It must be backported as far as 2.2, with the commit above.	2023-11-14 19:21:52 +01:00
William Lallemand	d76fa37534	BUG/MEDIUM: mworker: set the master variable earlier Since 2.7 and the mcli_reload_bind_conf (`56f73b21a5`), upon a reload failure because of a bind error, the mcli_reload_bind_conf go through a sock_unbind((). This is not supposed to do anything when a listener is RX_F_INHERITED in the master, but unfortunately this is done too early and provokes an exit of the master. We already suspected in the past that setting the 'master' variable this late could have negative impact. The fix sets the master variable earlier before the bind. This must be backported at least to 2.7. This could be backported earlier but better wait any feedbacks on the fix.	2023-11-14 14:32:39 +01:00
Willy Tarreau	a63e016d27	MINOR: activity: report profiling duration and age in "show profiling" Seeing counters in "show profiling" is not always very helpful without an indication of how long the analysis lasted nor if it's still active or not. Let's add a pair of start/stop timers for tasks and memory so that we can now indicate how long the measurements lasted and when they ended (or 0 if still running). Note that for tasks profiling set to "auto", the measurement is considered enabled since it can automatically switch on and off on a per-thread basis.	2023-11-14 11:46:37 +01:00
Christopher Faulet	af7db3a43c	REGTESTS: http: Improve script testing abortonclose option We now take care to properly handle the abortonclose close option if it is set on the backend and be sure we ignore it when it is set on the frontend (inherited from the defaults section).	2023-11-14 11:01:51 +01:00
Christopher Faulet	ec3ea6f698	MINOR: stconn: Use SC to detect frontend connections in sc_conn_recv() In sc_conn_recv(), instead of using the connection to know we are on the frontend side, we now use the SC flags. It changes nothing but it is cleaner.	2023-11-14 11:01:51 +01:00
Christopher Faulet	5ff7d22767	BUG/MEDIUM: stream: Properly handle abortonclose when set on backend only Since the 2.2 and the commit `dedd30610` ("MEDIUM: h1: Don't wake the H1 tasklet if we got the whole request."), we avoid to subscribe for reads if the H1 message is fully received. However, this broke the abortonclose option. To fix the issue, a CO_RFL flag was added to instruct the mux it should still wait for read events to properly handle read0. Only the H1 mux was concerned. But since then, most of time, the option is only handled if it is set on the frontend proxy because the request is fully received before selecting the backend. If the backend is selected before the end of the request there is no issue. But otherwise, because the backend is not known yet, we are unable to properly handle the option and we miss to subscribe for reads. Of course the option cannot be set on a frontend proxy. So concretly it means the option is properly handled if it is enabled in the defaults section (if common to frontend and backend) or a listen proxy, but it is ignored if it is set on backend only. Thanks to previous patches, we can now instruct the mux it should subscribe for reads if not already done. We use this mechanism in process_stream() when the connection is set up, ie when backend SC is set to SC_ST_REQ state. This patch relies on following patches: * MINOR: connection: Add a CTL flag to notify mux it should wait for reads again * MEDIUM: mux-h1: Handle MUX_SUBS_RECV flag in h1_ctl() and susbscribe for reads This patch should be the issue #2344. All the series must be backported as far as 2.2.	2023-11-14 11:01:51 +01:00
Christopher Faulet	450ff71c95	MEDIUM: mux-h1: Handle MUX_SUBS_RECV flag in h1_ctl() and susbscribe for reads The H1 mux now handle MUX_SUBS_RECV flag in h1_ctl(). If it is not already subscribed for reads, it does so. This patch will be mandatory to properly handle abortonclose option.	2023-11-14 11:01:51 +01:00
Christopher Faulet	e5cffa8ace	MINOR: connection: Add a CTL flag to notify mux it should wait for reads again MUX_SUBS_RECV ctl flag is added to instruct the mux it should wait for read events. This flag will be pretty useful to handle abortonclose option.	2023-11-14 11:01:51 +01:00
Christopher Faulet	9327e7efa7	BUG/MINOR: stconn: Handle abortonclose if backend connection was already set up abortonclose option is a backend option, it should not be handle on frontend side. Of course a frontend can also be a backend but the option should not be handled too early because it is not necessarily the selected backend (think about a listen proxy routing requests to another backend). It is especially an issue when the abortonclose option is enabled in the defaults section and disabled by the selected backend. Because in this case, the option may still be enabled while it should not. Thus, now we wait the backend connection was set up to handle the option. To do so, we check the backend SC state. The option is ignored if it is in ST_CS_INI state. For all other states, it means the backend was already selected. This patch could be backported as far as 2.2.	2023-11-14 11:01:51 +01:00
Willy Tarreau	6a4591c3d0	BUG/MEDIUM: connection: report connection errors even when no mux is installed An annoying issue was met when testing the reverse-http mechanism, by which failed connection attempts would apparently not be attempted again when there was no connect timeout. It turned out to be more generalized than the rhttp system, and actually affects all outgoing connections relying on NPN or ALPN to choose the mux, on which no mux is installed and for which the subscriber (ssl_sock) must be notified instead. The problem appeared during 2.2-dev1 development. First, commit `062df2c23` ("MEDIUM: backend: move the connection finalization step to back_handle_st_con()") broke the error reporting by testing CO_FL_ERROR only under CO_FL_CONNECTED. While it still worked OK for cases where a mux was present, it did not for this specific situation because no single error path would be considered when no mux was present. Changing the CO_FL_CONNECTED test to also include CO_FL_ERROR did work, until a few commits later with `477902bd2` ("MEDIUM: connections: Get ride of the xprt_done callback.") which removed the xprt_done callback that was used to indicate success or failure of the transport layer setup, since, as the commit explains, we can report this via the mux. What this last commit says is true, except when there is no mux. For this, however, the sock_conn_iocb() function (formerly conn_fd_handler) is called for such errors, evaluates a number of conditions, none of which is matched in this error condition case, since sock_conn_check() instantly reports an error causing a jump to the leave label. There, the mux is notified if installed, and the function returns. In other error condition cases, readiness and activity are checked for both sides, the tasklets woken up and the corresponding subscriber flags removed. This means that a sane (and safe) approach would consist in just notifying the subscriber in case of error, if such a subscriber still exists: if still there, it means the event hasn't been caught earlier, then it's the right moment to report it. And since this is done after conn_notify_mux(), it still leaves all control to the mux once it's installed. This commit should be progressively backported as far as 2.2 since it's where the problem was introduced. It's important to clearly check the error path in each function to make sure the fix still does what it's supposed to.	2023-11-14 08:49:23 +01:00
Frédéric Lécaille	3741e4bf90	BUG/MINOR: quic: maximum window limits do not match the doc This bug arrived with this commit: MINOR: quic: Add a max window parameter to congestion control algorithms The documentation was been modified with missing/wrong modifications in the code part. The 'g' suffix must be accepted to parse value in gigabytes. And exctly 4g is also accepted. No need to backport.	2023-11-13 19:56:28 +01:00
Frédéric Lécaille	8df7018736	DOC: quic: Maximum congestion control window configuration Document the optional parameter which may be supplied after the congestion control algorithm name to set the maximum congestion control window.	2023-11-13 18:17:43 +01:00
Frédéric Lécaille	d9bf1b6c41	DOC: quic: Wrong syntax for "quic-cc-algo" keyword. As the argument to "quic-cc-algo" is mandatory, brackets must be used here in the documentation. Must be backported as far as 2.6.	2023-11-13 18:14:16 +01:00
Frédéric Lécaille	9021e8935e	MINOR: quic: Maximum congestion control window for each algo Make all the congestion support the maximum congestion control window set by configuration. There is nothing special to explain. For each each algo, each time the window is incremented it is also bounded.	2023-11-13 17:53:18 +01:00
Frédéric Lécaille	028a55a1d0	MINOR: quic: Add a max window parameter to congestion control algorithms Add a new ->max_cwnd member to bind_conf struct to store the maximum congestion control window value for each QUIC binding. Modify the "quic-cc-algo" keyword parsing to add an optional parameter to its value: the maximum congestion window value between parentheses as follows: ex: quic-cc-algo cubic(10m) This value must be bounded, greater than 10k and smaller than 1g.	2023-11-13 17:53:18 +01:00
Frédéric Lécaille	840af0928b	BUG/MEDIUM: quic: Non initialized CRYPTO data stream deferencing This bug arrived with this commit: BUG/MINOR: quic: Useless use of non-contiguous buffer for in order CRYPTO data Before this commit qc->cstream was tested before entering qc_treat_rx_crypto_frms(). This patch restablishes this behavior. Furthermore, it simplyfies qc_ssl_provide_all_quic_data() which is a little bit ugly: the CRYPTO data frame may be freed asap in the list_for_each_entry_safe() block after having store its data pointer and length in local variables. Also interrupt the CRYPTO data process as soon as qc_ssl_provide_quic_data() or qc_treat_rx_crypto_frms() fail. No need to be backported.	2023-11-13 16:00:25 +01:00
William Lallemand	59b313832a	REGTESTS: startup: -conf-OK requires -V with current VTest Current version of VTest tests the output of "haproxy -c" instead of the return code. Since we don't output anymore when the configuration is valid, this broke the test. (`a06f621`). This fixes the issue by adding the -V when doing a -conf-OK. But this must fixed in VTest.	2023-11-13 14:57:26 +01:00
Christopher Faulet	cb560bf3d7	DOC: config: Fix name for tune.disable-zero-copy-forwarding global param "disable-" prefix was missing. This param was correctly named in the list of supported keywords in the global section, but not in the keyword description. No backport needed.	2023-11-13 14:31:14 +01:00
Amaury Denoyelle	954b5b756a	BUG/MEDIUM: quic: fix FD for quic_cc_conn Since following commit, quic_conn closes its owned socket before transition to quic_cc_conn for closing state. This allows to save FDs as quic_cc_conn could use the listener socket for their I/O. commit `150c0da889` MEDIUM: quic: release conn socket before using quic_cc_conn This patch is incomplete as it removes initialization of <fd> member for quic_cc_conn. Thus, if sending is done on closing state, <fd> value is undefined which in most cases will result in a crash. Fix this by simply initializing <fd> member with qc_init_fd() in qc_new_cc_conn(). This bug should fix recent issue from #2095. Thanks to Tristan for its reporting and then testing of this patch. No need to backport.	2023-11-13 11:55:07 +01:00
Amaury Denoyelle	78d244e9f7	BUG/MINOR: quic: fix decrement of half_open counter on qc alloc failure Half open counter is used to comptabilize QUIC connections waiting for address validation. It was recently reworked to adjust its scope. With each decrement operation, a BUG_ON() was added to ensure the counter never wraps. This BUG_ON() could be triggered if an allocation fails for one of quic_conn members in qc_new_conn(). This is because half open counter is incremented at the end of qc_new_conn(). However, in case of alloc failure, quic_conn_release() is called immediately to ensure the counter is decremented if a connection is freed before peer address has been validated. To fix this, increment half open counter early in qc_new_conn() prior to every quic_conn members allocations. This issue was reproduced using -dMfail argument. This issue has been introduced by commit `278808915b` MINOR: quic: reduce half open counters scope No need to backport.	2023-11-13 11:16:41 +01:00
Amaury Denoyelle	92da3accfd	BUG/MINOR: quic: fix crash on qc_new_conn alloc failure A new counter was recently introduced to comptabilize the current number of active QUIC handshakes. This counter is stored on the listener instance. This counter is incremented at the beginning of qc_new_conn() to check if limit is not reached prior to quic_conn allocation. If quic_conn or one of its inner member allocation fails, special care is taken to decrement the counter as the connection instance is released. However, it relies on <l> variable which is initialized too late to cover pool_head_quic_conn allocation failure. To fix this, simply initialize <l> at the beginning of qc_new_conn(). This issue was reproduced using -dMfail argument. This issue was introduced by the following commit commit `3df6a60113` MEDIUM: quic: limit handshake per listener No need to backport.	2023-11-13 11:16:41 +01:00
Aurelien DARRAGON	76acde9107	BUG/MINOR: log: keep the ref in dup_logger() This bug was introduced with `969e212` ("MINOR: log: add dup_logsrv() helper function") When duplicating an existing log entry, we must take care to inherit from its original ->ref if it is set, because not doing so would make `28ac0999` ("MINOR: log: Keep the ref when a log server is copied to avoid duplicate entries") ineffective given that global log directives will lose their original reference when duplicated resursively (at least twice), which is what happens when global log directives are first inherited to defaults which are then inherited to a regular proxy at the end of the chain. This can be easily reproduced using the following configuration: \|global \| log stdout format raw local0 \| \|defaults \| log global \| \|frontend test \| log global \| ... Logs from "test" proxy will be duplicated because test incorrectly inherited from global "log" directives twice, which `28ac0999` would normally detect and prevent. No backport needed unless `969e212` gets backported.	2023-11-13 11:06:05 +01:00
Christopher Faulet	33a1fc883a	BUG/MINOR: sample: Fix bytes converter if offset is bigger than sample length When the bytes converter was improved to be able to use variables (`915e48675` ["MEDIUM: sample: Enhances converter "bytes" to take variable names as arguments"]), the behavior of the sample slightly change. A failure is reported if the given offset is bigger than the sample length. Before, a empty binary sample was returned. This patch fixes the converter to restore the original behavior. The function was also refactored to properly handle failures by removing SMP_F_MAY_CHANGE flag. Because the converter now handles variables, the conversion to an integer may fail. In this case SMP_F_MAY_CHANGE flag must be removed to be sure the caller will not retry. This patch should fix the issue #2335. No backport needed except if commit above is backported.	2023-11-13 11:06:05 +01:00
William Lallemand	a06f6212c9	MEDIUM: startup: 'haproxy -c' is quiet when valid MODE_CHECK does not output "Configuration file is valid" by default anymore. To display this message the -V option must be used with -c. However the warning and errors are still output by default if they exist. This allows to clean the output of the systemd unit file with is doing a -c.	2023-11-13 09:59:34 +01:00
Willy Tarreau	cf07cb96be	BUG/MEDIUM: proxy: always initialize the default settings after init The proxy's initialization is rather odd. First, init_new_proxy() is called to zero all the lists and certain values, except those that can come from defaults, which are initialized by proxy_preset_defaults(). The default server settings are also only set there. This results in these settings not to be set for a number of internal proxies that do not explicitly call proxy_preset_defaults() after allocation, such as sink and log forwarders. This was revealed by last commit `79aa63823` ("MINOR: server: always initialize pp_tlvs for default servers") which crashes in log parsers when applied to certain proxies which did not initialize their default servers. In theory this should be backported, however it would be desirable to wait a bit before backporting it, in case certain parts would rely on these elements not being initialized.	2023-11-13 09:17:05 +01:00
Willy Tarreau	79aa638238	MINOR: server: always initialize pp_tlvs for default servers In commit `6f4bfed3a` ("MINOR: server: Add parser support for set-proxy-v2-tlv-fmt") a suspicious check for a NULL srv_tlv was placed in the list_for_each_entry(), that should not be needed. In practice, it's caused by the list head not being initialized, hence the first element is NULL, as shown by Alexander's reproducer below which crashes if the test in the loop is removed: backend dummy default-server send-proxy-v2 set-proxy-v2-tlv-fmt(0xE1) %[fc_pp_tlv(0xE1)] server dummy_server 127.0.0.1:2319 The right place to initialize this field is proxy_preset_defaults(). We'd really need a function to initialize a server :-/ The check in the loop was removed. No backport is needed.	2023-11-13 08:53:28 +01:00
Fr�d�ric L�caille	dfda884633	BUG/MINOR: quic: Useless use of non-contiguous buffer for in order CRYPTO data This issue could be reproduced with a TLS client certificate verificatio to generate enough CRYPTO data between the client and haproxy and with dev/udp/udp-perturb as network perturbator. Haproxy could crash thanks to a BUG_ON() call as soon as in disorder data were bufferized into a non-contiguous buffer. There is no need to pass a non NULL non-contiguous to qc_ssl_provide_quic_data() from qc_ssl_provide_all_quic_data() which handles in order CRYPTO data which have not been bufferized. If not, the first call to qc_ssl_provide_quic_data() to process the first block of in order data leads the non-contiguous buffer head to be advanced to a wrong offset, by <len> bytes which is the length of the in order CRYPTO frame. This is detected by a BUG_ON() as follows: FATAL: bug condition "ncb_ret != NCB_RET_OK" matched at src/quic_ssl.c:620 call trace(11): \| 0x5631cc41f3cc [0f 0b 8b 05 d4 df 48 00]: qc_ssl_provide_quic_data+0xca7/0xd78 \| 0x5631cc41f6b2 [89 45 bc 48 8b 45 b0 48]: qc_ssl_provide_all_quic_data+0x215/0x576 \| 0x5631cc3ce862 [48 8b 45 b0 8b 40 04 25]: quic_conn_io_cb+0x19a/0x8c2 \| 0x5631cc67f092 [e9 1b 02 00 00 83 45 e4]: run_tasks_from_lists+0x498/0x741 \| 0x5631cc67fb51 [89 c2 8b 45 e0 29 d0 89]: process_runnable_tasks+0x816/0x879 \| 0x5631cc625305 [8b 05 bd 0c 2d 00 83 f8]: run_poll_loop+0x8b/0x4bc \| 0x5631cc6259c0 [48 8b 05 b9 ac 29 00 48]: main-0x2c6 \| 0x7fa6c34a2ea7 [64 48 89 04 25 30 06 00]: libpthread:+0x7ea7 \| 0x7fa6c33c2a2f [48 89 c7 b8 3c 00 00 00]: libc:clone+0x3f/0x5a Thank you to @Tristan971 for having reported this issue in GH #2095. No need to backport.	2023-11-10 18:16:14 +01:00
Aurelien DARRAGON	078ebde870	CLEANUP: sink: useless leftover in sink_add_srv() Removing a useless leftover which has been introduced with `31e8a003a5` ("MINOR: sink: function to add new sink servers")	2023-11-10 17:49:57 +01:00
Aurelien DARRAGON	2694621151	CLEANUP: sink: bad indent in sink_new_from_logger() Fixing bad indent in sink_new_from_logger() which was recently introduced	2023-11-10 17:49:57 +01:00
Aurelien DARRAGON	d710dfbacc	BUG/MINOR: sink: don't learn srv port from srv addr Since `04276f3d` ("MEDIUM: server: split the address and the port into two different fields") we should not use srv->addr to store server's port and rely on srv->svc_port instead. For sink servers, we correctly set >svc_port upon server creation but we didn't use it when initializing address for the connection. As a result, FQDN resolution will not work properly with sink servers. Hopefully, this used to work by accident because sink servers were resolved using the PA_O_RESOLVE flag in str2sa_range(), which made the srv->addr contain the port in addition to the address. But this will fail to work when FQDN resolution is postponed because only ->svc_port will contain the proper server port upon resolution. For instance, FQDN resolution with servers from log backends (which are resolved as regular servers, that is, without the PA_O_RESOLVE) will fail to work because of this. This may be backported as far as 2.2 even though the bug didn't have noticeable effects for versions below 2.9 [In 2.2, sink_forward_session_init() didn't exist it should be applied in sink_forward_session_create()]	2023-11-10 17:49:57 +01:00

1 2 3 4 5 ...

21192 Commits