haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-27 07:02:11 +00:00

Author	SHA1	Message	Date
Willy Tarreau	ee1c92cf10	MINOR: ring: rename totlen vs msglen in ring_write() The ring_write() function uses confusing variable names: totlen is in fact the length of the message, not the total length that is going to be written. Let's rename it msglen and have a real "needed" that corresponds to the total size we're going to write. We also add a BUG_ON_HOT() to catch mistakes causing discrepancies.	2024-03-25 17:34:19 +00:00
Willy Tarreau	c222cb8389	MINOR: vecpair: add necessary functions to use vecpairss from/to ring APIs Many ring-based APIs need a tail and a head, with some extra assumption that the user takes care of not filling the ring so that tail==head is unambiguous. Vectors are particularly suited to this usage so here we create 4 functions to create vectors representing free room or data from a ring, as well as updating rings based on a pair of vectors that represents either free space or data.	2024-03-25 17:34:19 +00:00
Willy Tarreau	63261aae39	MINOR: vecpair: add new vector pair based data manipulation mechanisms The buffers API defines both a storage layout and how to handle the data. The storage is shared with the chunks API which only deals with non-wrapping messages while buffers support wrapping both of the data and of the free space. As such, most of the buffers code already makes special cases of two parts in a buffer, the first one before wrapping and the optional second one after the wrapping occurred. The thing is, there are plenty of other places (e.g. rings) where the code dealing with wrapping is desirable but with a different storage layout. Let's export the existing buffer handling code related to reading/writing wrapping data and make it work with arbitrary vector pairs instead. This will handle wrapping and holes in messages if desired, and it will be up to the caller to decide how its messages are arranged and to pass the relevant ptr,len elements. The code is limited to two vectors because this is sufficient to deal with wrapping without making the code needlessly complex. I.e. this will not reassemble an iovec. For vectors, since we already had the ist type, there's no point inventing a new type, and it's even possible that over time some callers will find benefits in using this unified API (i.e. no NOP translation layer). It also allows to pass inputs as direct arguments and outputs as pointers. Not only this is more efficient code-wise, but it also avoids the accidental use of a wrong function. It was indeed found that naming functions is even harder than with the buffer as the notion of from/to is even fuzzier here. The API will likely continue to evolve and some functions might get renamed to more explicit ones over time to limit confusion. For now the code provides anything needed to reset/create/fill/erase/read/peek or measure vector pairs and to manipulate chars/blocks/varints to/from there.	2024-03-25 17:34:19 +00:00
Willy Tarreau	0b1c17a2dd	MINOR: ring: reserve one special value for the readers count In order to support concurrent writers we'll need to lock areas in the buffer. For this we'll use one special value of the single-byte readers count. Let's reserve it now and use the macro instead of the hardcoded 255.	2024-03-25 17:34:19 +00:00
Willy Tarreau	0f611987da	MINOR: ring: make the ring reader use only absolute offsets The goal is to remove references to the buffer's head and tail in the fast path so that we can release the lock during some reads. This means no more comparisons with b_data() nor operations relative to b_head() will be possible anymore. As a first step we need to have an absolute offset in the buffer, and to use b_getblk_ofs() in the applet callbacks to retrieve the data based on this.	2024-03-25 17:34:19 +00:00
Willy Tarreau	63242a59c4	MINOR: buf: add b_getblk_ofs() that works relative to area and not head For some concurrently accessed buffers we can't rely on head/data etc, but sometimes the access patterns guarantees that the buffer contents are there. Let's implement a function to read contents from a fixed offset, which never checks head nor data, only the area and its size. It's the caller's job to get this offset.	2024-03-25 17:34:19 +00:00
Willy Tarreau	2f28981546	MINOR: buf: add b_putblk_ofs() to copy a block at a specific position This new function b_putblk_ofs() puts one full block of data of length <len> from <blk> into the buffer, starting from absolute offset <offset> after the buffer's area. As a convenience to avoid complex checks in callers, the offset is allowed to exceed a valid one by no more than one buffer size, and will automatically be wrapped. The caller is responsible for ensuring that <len> doesn't exceed the known length of the available room at this position, otherwise data may be overwritten. The buffer's length is not updated, so generally the caller will have updated it before calling this function. This is meant to be used on concurrently accessed buffers, so that a writer can append data while a reader is blocked by other means from reaching the current area The function guarantees never to use ->head nor ->data.	2024-03-25 17:34:19 +00:00
Willy Tarreau	c5004ccb36	MINOR: buf: add b_rel_ofs() to turn an absolute offset into a relative one It basically does the opposite of b_peek_ofs(). If x=b_peek_ofs(y), then y=b_rel_ofs(x).	2024-03-25 17:34:19 +00:00
Willy Tarreau	15e47b6a59	MINOR: buf: add b_add_ofs() to add a count to an absolute position This function is used to compute a new absolute buffer offset by adding a length to an existing valid offset. It will check for wrapping.	2024-03-25 17:34:19 +00:00
Willy Tarreau	8f3edf2ac6	MEDIUM: log/sink: make the log forwarder code use ring_dispatch_messages() This code becomes even simpler and almost does not need any knowledge of the structure of the ring anymore. It even highlighted that an old race had not been fixed due to code duplication, but that's now done.	2024-03-25 17:34:19 +00:00
Willy Tarreau	c262442b1a	MEDIUM: sink: move the generic ring forwarder code use ring_dispatch_messages() Now the code is much simpler than the ring forwarding function almost does not need any knowledge of the structure of the ring anymore.	2024-03-25 17:34:19 +00:00
Willy Tarreau	c62a2d540d	MEDIUM: ring: move the ring reader code to ring_dispatch_messages() This new function is made around the loop that scans a ring for new messages and dispatches them to a message handler. It also takes ring flags (WAIT, NEW, etc) and offset pointers that the caller will use to initialize/reuse/update the current processing offset. The caller is still responsible for presetting it to ~0 before the first call if it wants the function to automatically adjust it (or set it to the correct value). The function may also return the last_ofs that was known before releasing the lock so that the caller knows what to compare against and if it needs to restart processing or not. The context remains a void* so that should not necessarily depend on an appctx. The current "show ring" code was ported to this and it continues to work as expected.	2024-03-25 17:34:19 +00:00
Willy Tarreau	ad31e53287	REORG: dns/ring: split the ring between the generic one and the DNS one A ring is used for the DNS code but slightly differently from the generic one, which prevents some important changes from being made to the generic code without breaking DNS. As the use cases differ, it's better to just split them apart for now and have the DNS code use its own ring that we rename dns_ring and let the generic code continue to live on its own. The unused parts such as CLI registration were dropped, resizing and allocation from a mapped area were dropped. dns_ring_detach_appctx() was kept despite not being used, so as to stay consistent with the comments that say it must be called, despite the DNS code explicitly mentioning that it skips it for now (i.e. this may change in the future). Hopefully after the generic rings are converted the DNS code can migrate back to them, though this is really not necessary.	2024-03-25 17:34:19 +00:00
Willy Tarreau	8022ae326c	MEDIUM: ring/sink: use applet_append_line()/syslog_applet_append_event() for readers The rink reader code was duplicated as-is in 2.2 for the ring forwarding code in commits `494c505703` ("MEDIUM: ring: add server statement to forward messages from a ring") and `975564784f` ("MEDIUM: ring: add new srv statement to support octet counting forward") (which only differs by using a prefix instead of a suffix to delimit messages). Unfortunately, that makes it almost impossible to rework the core ring code because all these parts rely on it. This first commit aims at restoring a common structure for the core loop by just calling a distinct function based on the use case. The functions are either applet_append_line() when a whole line is to be emitted followed by an LF character, or syslog_applet_appent_event() when trying to send a TCP syslog line prepended with its size in decimal. There is no functional change beyond this.	2024-03-25 17:34:19 +00:00
Willy Tarreau	201c706330	MINOR: log/applet: add new function syslog_applet_append_event() This function takes a buffer on input, and offset and a length, and consumes the block from that buffer to send it to the appctx's output buffer. Contrary to its sibling applet_append_line(), instead of just appending an LF at the end of the line, it prepends the message size in decimal and a space before the message, as expected by syslog TCP implementaions. This will be used to simplify the ring reader code.	2024-03-25 17:34:19 +00:00
Willy Tarreau	6ae41dc510	MINOR: applet: add new function applet_append_line() This function takes a buffer on input, and offset and a length, and consumes the block from that buffer to send it to the appctx's output buffer. This will be used to simplify the ring reader code.	2024-03-25 17:34:19 +00:00
Willy Tarreau	c038ca8e8c	MINOR: atomic: add a read-specific variant of __ha_cpu_relax() Tests on various systems show that x86 prefers not to wait at all inside read loops while aarch64 prefers to wait a little bit. Instead of having to stuff ifdefs around __ha_cpu_relax() inside plenty of such loops waiting for a condition to appear, better implement a new variant that we call __ha_cpu_relax_for_read() which honors each architecture's preferences and is the same as __ha_cpu_relax() for other ones.	2024-03-25 17:34:19 +00:00
Willy Tarreau	5df0df96dd	MINOR: debug: add "debug dev trace" to flood with traces This new command, enabled only with "DEBUG_DEV", sends 2 or 20 traces per task wakeup (depending on the verbosity level), and stops after 1M wakeups per thread in order not to have to stop/start the process each time it's fired. We have two small messages and 18 larger ones from 20 to 270 bytes each, so that the average size is approx 213 bytes counting headers (the header adds approx 82 bytes), which matches what's generally observed on average when traces are enabled in all muxes. Typical figures show varations between 5.7M and 6.2M msg/s on an EPYC in a 3C6T setup (single CCX), and 2.12M - 2.22M in a 24C48T setup (across 8 CCX, with 8 thread groups).	2024-03-25 17:32:22 +00:00
Aurelien DARRAGON	db1cd8f881	OPTIM: http_ext: avoid useless copy in http_7239_extract_{ipv4,ipv6} In http_7239_extract_{ipv4,ipv6}, we declare a local buffer in order to use inet_pton() since it requires a valid destination argument (cannot be NULL). Then, if the caller provided <ip> argument, we copy inet_pton() result (from local buffer to <ip>). In fact when the caller provides <ip>, we may directly use <ip> as inet_pton() dst argument to avoid an useless copy. Thus the local buffer is only relevant when the user doesn't provide <ip>. While at it, let's add a missing testcase for the rfc7239_n2nn converter (to check that http_7239_extract_ipv4() with <ip> provided works properly) This could be backported in 2.8 with `b2bb925` ("MINOR: proxy/http_ext: introduce proxy forwarded option")	2024-03-25 16:24:15 +01:00
Aurelien DARRAGON	3de1acfb23	BUILD: server: fix build regression on old compilers (<= gcc-4.4) Willy reported that since `3ac79b504` ("MEDIUM: server: make server_set_inetaddr() updater serializable"), haproxy fails to compile on some older compilers such as gcc-4.4 with this kind of error: src/server.c: In function 'snr_resolution_cb': src/server.c:4471: error: unknown field 'dns_resolver' specified in initializer compilation terminated due to -Wfatal-errors. make: *** [Makefile:1006: src/server.o] Error 1 This is due to referencing a member inside anonymous union from a compound literal assignment. Apparently such use of anonymous union wasn't properly supported back then on older compilers. To fix the issue, we give "u" name to the parent union use this name to explicitly refer to the union where relevant in the code (only a few changes fortunately). The fix itself was verified to restore build compatibility with gcc 4.4 (and even 4.2). As `3ac79b504` is used as a prerequisite for `64c9c8ef3` ("BUG/MINOR: server/dns: use server_set_inetaddr() to unset srv addr from DNS"), please consider backporting this patch too if `64c9c8ef3` happens to be backported in 2.9.	2024-03-25 16:23:37 +01:00
Christopher Faulet	56c4b29ff1	BUG/MEDIUM: mux-fcgi: Properly handle EOM flag on end-of-trailers HTX block Trailers are skipped by the FCGI multiplexer. However empty chunked messages are not properly handled. It may be a chunked H1 request with no payload or a H2/H3 POST request with no payload. In that caes, the EOT HTX block is just ignored. The issue is that the EOM flag is thus ignored too. It means no empty STDIN record is sent to mark the end of the request to the server. To fix the issue, when a EOT htx block is found and it is the last HTX block of the message (and it should be), the EOM flag is tested. If it is found, an empty STDIN record is emitted. This patch should fix the issue #2499. It must be backported as far as 2.4.	2024-03-25 11:06:41 +01:00
Amaury Denoyelle	bd384a359b	BUG/MINOR: mux-quic: close all QCS before freeing QCC tasklet QUIC MUX is freed via qcc_release(). This in turn liberate all the remaining QCS instances. For each one of them, their corresponding stream-desc is released via qc_stream_desc_release(). This last function may itself notifies QUIC MUX when new buffers are available. This is useful when QCS are closed individually without the whole connection. However, when the connection is closed through qcc_release(), this may cause issue as some elements of QUIC MUX are already freed. In 2.9.6, a bug was detected directly linked to this. Indeed, QCC instance may be woken up on stream-desc release. If called through qcc_release(), this is an issue because QCC tasklet is freed before QCS instances. However, this bug is not systematic and relies on prior conditions : in particular, QUIC MUX must be under Tx buffers exhaustion prior to the qcc_release() invocation. The current dev tree is not impacted by this bug, thanks to QUIC MUX refactoring. Indeed, notifying accross layers have changed and now stream-desc release notifies individual QCS instances instead of the QCC element, which is a safer mechanism. However, to simplify backport process, bugfix is introduced in the current dev tree as it does not have any impact. Note that a proper fix would be to set quic-conn MUX state to QC_MUX_RELEASED. However, it is not possible to call quic_close() without having releasing all stream-desc elements first. The simpler solution was chosen to prevent other breaking issues during backports. This should fix github issue #2494. It should be backported up to 2.6. Note that prior to 2.7 qcc_release() was named qc_release().	2024-03-25 10:24:59 +01:00
Amaury Denoyelle	0d4273f04b	MEDIUM: server: close private idle connection before server deletion This commit similar to the following one : 65ae241dcfe710e1cdd3ec4e7a9bde38d2e4c116 MEDIUM: server: close idle conn before server deletion This patch implements a similar logic, this time to close private idle connections stored in sessions. The principle is identical to the above commit : conn_release() is used on idle connections after a takeover to ensure thread safety. An extra change was required to be able to execute takeover on such connections. Their original thread ID was unknown, contrary to non private connections which are stored in sharded lists. As such, a new tid member has been added under sess_priv_conns chaining element.	2024-03-22 17:12:27 +01:00
Amaury Denoyelle	5e8eb3661b	MEDIUM: mux: prepare for takeover on private connections When a backend connection is marked as idle, a special flag TASK_F_USR1 is set on MUX tasklet. When MUX tasklet is reactivated, extra checks are executed under this flag to ensure no takeover occurred in the meantime. Previously, only non private connections could be targetted by a takeover. However, this will change when implementing private idle connections closure on "delete server" CLI handler. As such, TASK_F_USR1 is now also set for private connections in MUX detach callbacks.	2024-03-22 17:10:06 +01:00
Amaury Denoyelle	6e0afb2e27	MEDIUM: server: close idle conn on server deletion To be able to delete a server, a number of preconditions must be validated to ensure it is not in used anymore. Previously, if idle connections were stored in the server, the deletion was cancelled. No action was implemented to force idle connection closure, the only solution was to wait for the periodic purging to be achieved. This is an extra burden to be able to delete a server. Indeed, idle connections are by definition inactive and can be closed prior to delete a server. This is the exact purpose of this patch. Idle connections removal is implemented inside "delete server" handler, once it has been determined that the server can be freely removed. A simple loop is run to call conn_release() over each idle connections. Takeover is also executed before conn_release() to ensure tasks/tasklets or any other sensible elements are not deleted from a foreign thread. This patch should reduce the occurence of rejected "delete server" execution, especially when connection reuse is high.	2024-03-22 16:59:02 +01:00
Amaury Denoyelle	f3862a9bc7	MINOR: connection: extend takeover with release option Extend takeover API both for MUX and XPRT with a new boolean argument <release>. Its purpose is to signal if the connection will be freed immediately after the takeover, rendering new resources allocation unnecessary. For the moment, release argument is always false. However, it will be set to true on delete server CLI handler to proactively close server idle connections.	2024-03-22 16:12:36 +01:00
Amaury Denoyelle	ff2e71ae24	MINOR: connection: implement conn_release() Several places reuse the same code to ensure a connection is properly freed, either via its MUX or by calling the proper set of functions. Factorize all of this in a new function conn_release(). This new function is now called via session_free() and session_accept_fd(). It will also be reused on delete server to proactively close idle connections.	2024-03-22 16:12:36 +01:00
Remi Tricot-Le Breton	10ece2cf66	REGTESTS: ssl: Add checks on ocsp-update log format Add checks on the ocsp-update's dedicated log format.	2024-03-20 16:12:11 +01:00
Remi Tricot-Le Breton	099b5c421c	CLEANUP: ssl: Remove undocumented ocsp fetches Those fetchess were undocumented and were just here so that the ocsp-update log could be made through a regular log format. But since the logging is now "handmade" (since BUG/MEDIUM: ssl: Fix crash in ocsp-update log function), we don't need those anymore.	2024-03-20 16:12:11 +01:00
Remi Tricot-Le Breton	328a893713	MINOR: ssl: Change level of ocsp-update logs The pure ocsp-update log used to be in log level "info" and it would be mixed with actual traffic logs. This patch changes it to level "notice".	2024-03-20 16:12:11 +01:00
Remi Tricot-Le Breton	d4eeaa4003	MEDIUM: ssl: Change output of ocsp-update log Since commit "BUG/MEDIUM: ssl: Fix crash in ocsp-update log function", some information from the log line are "faked" because they can be actually retrieved anymore (or never could). We should then remove them from the logline all along instead of providing some useless fields. We then only keep pure OCSP-update information in the log line: "<certname> <status> <status str> <fail count> <success count>"	2024-03-20 16:12:11 +01:00
Remi Tricot-Le Breton	d4e3be18df	BUG/MEDIUM: ssl: Fix crash in ocsp-update log function The ocsp-update logging mechanism was built around the 'sess_log' function which required to keep a pointer to the said session until the logging function could be called. This was made by keeping a pointer to the appctx returned by the 'httpclient_start' function. But this appctx lives its life on its own and might be destroyed before 'ssl_ocsp_send_log' is called, which could result in a crash (UAF). Fixing this crash requires to stop using the 'sess_log' function to emit the ocsp-update logs. The log line will then need to be built by hand out of the information actually available when 'ssl_ocsp_send_log' is called. Since we don't use the "regular" logging functions anymore, we don't need to use the error_logformat anymore. In order to keep a consistent behavior than before, we will keep the same format for the logs but replace the fields that required a 'sess' pointer by fake values (the %ci:%cp for instance, which was never filled anyway). This crash was raised in GitHub issue #2442. It should be backported up to branch 2.8.	2024-03-20 16:12:10 +01:00
Remi Tricot-Le Breton	5c25c577a0	BUG/MEDIUM: ssl: Fix crash when calling "update ssl ocsp-response" when an update is ongoing The CLI command "update ssl ocsp-response" was forcefully removing an OCSP response from the update tree regardless of whether it used to be in it beforehand or not. But since the main OCSP upate task works by removing the entry being currently updated from the update tree and then reinserting it when the update process is over, it meant that in the CLI command code we were modifying a structure that was already being used. These concurrent accesses were not properly locked on the "regular" update case because it was assumed that once an entry was removed from the update tree, the update task was the only one able to work on it. Rather than locking the whole update process, an "updating" flag was added to the certificate_ocsp in order to prevent the "update ssl ocsp-response" command from trying to update a response already being updated. An easy way to reproduce this crash was to perform two "simultaneous" calls to "update ssl ocsp-response" on the same certificate. It would then crash on an eb64_delete call in the main ocsp update task function. This patch can be backported up to 2.8. Wait a little bit before backporting.	2024-03-20 16:12:10 +01:00
Remi Tricot-Le Breton	3f04568ab1	REGTESTS: ssl: Add OCSP related tests Add tests that combine the OCSP update mechanism and the various preexisting commands that allow to manipulate certificates and crt-lists.	2024-03-20 16:12:10 +01:00
Remi Tricot-Le Breton	69071490ff	BUG/MAJOR: ocsp: Separate refcount per instance and per store With the current way OCSP responses are stored, a single OCSP response is stored (in a certificate_ocsp structure) when it is loaded during a certificate parsing, and each SSL_CTX that references it increments its refcount. The reference to the certificate_ocsp is kept in the SSL_CTX linked to each ckch_inst, in an ex_data entry that gets freed when the context is freed. One of the downsides of this implementation is that if every ckch_inst referencing a certificate_ocsp gets detroyed, then the OCSP response is removed from the system. So if we were to remove all crt-list lines containing a given certificate (that has an OCSP response), and if all the corresponding SSL_CTXs were destroyed (no ongoing connection using them), the OCSP response would be destroyed even if the certificate remains in the system (as an unused certificate). In such a case, we would want the OCSP response not to be "usable", since it is not used by any ckch_inst, but still remain in the OCSP response tree so that if the certificate gets reused (via an "add ssl crt-list" command for instance), its OCSP response is still known as well. But we would also like such an entry not to be updated automatically anymore once no instance uses it. An easy way to do it could have been to keep a reference to the certificate_ocsp structure in the ckch_store as well, on top of all the ones in the ckch_instances, and to remove the ocsp response from the update tree once the refcount falls to 1, but it would not work because of the way the ocsp response tree keys are calculated. They are decorrelated from the ckch_store and are the actual OCSP_CERTIDs, which is a combination of the issuer's name hash and key hash, and the certificate's serial number. So two copies of the same certificate but with different names would still point to the same ocsp response tree entry. The solution that answers to all the needs expressed aboved is actually to have two reference counters in the certificate_ocsp structure, one actual reference counter corresponding to the number of "live" pointers on the certificate_ocsp structure, incremented for every SSL_CTX using it, and one for the ckch stores. If the ckch_store reference counter falls to 0, the corresponding certificate must have been removed via CLI calls ('set ssl cert' for instance). If the actual refcount falls to 0, then no live SSL_CTX uses the response anymore. It could happen if all the corresponding crt-list lines were removed and there are no live SSL sessions using the certificate anymore. If any of the two refcounts becomes 0, we will always remove the response from the auto update tree, because there's no point in spending time updating an OCSP response that no new SSL connection will be able to use. But the certificate_ocsp object won't be removed from the tree unless both refcounts are 0. Must be backported up to 2.8. Wait a little bit before backporting.	2024-03-20 16:12:10 +01:00
Amaury Denoyelle	87b96cf3a5	BUG/MAJOR: connection: fix server used_conns with H2 + reuse safe By default, backend connections are accounted by the server. This allows to determine the number of idle connections to keep. A backend connection can also be marked as private to prevent its reuse. It is thus removed from server lists into the session list. As such, a private connection is not accounted into server : conn_set_private() uses srv_release_conn() to ensure this. When using HTTP/2 on backend side with default http-reuse safe, the above principle are mixed. Indeed, when a connection is first used, or switches from idle to used, it is moved into the session list but it is not flagged as private. This is done to prevent its sharing by different clients to prevent head-of-line blocking issue. When all streams are closed, the connection becomes idle again and is reinserted in the server list. This has been introduced by the following patch : `0d21deaded` MEDIUM: backend: add reused conn to sess if mux marked as HOL blocking When freeing a backend connection, special care is taken to ensure server used counter is decremented. This is implemented into conn_backend_deinit(). However, this function does this only if the connection is not present in a session list. This is valid for private connections. However, if a connection is non-private and present only temporarily into a session list, the decrement operation won't be executed despite the connection being accounted by the server. This bug has several impacts. The server used counter won't be able to reach its initial null value, even when all its connections are closed. This can result in a wrong estimation of necessary idle connections, which may cause unnecessary new connection usage. Also, this will prevent definitely the server from being removed via "delete server" CLI command. This should be backported up to 2.4. Note that conn_backend_deinit() was introduced in 2.9. For lesser versions, the change should be done directly into conn_free().	2024-03-20 14:26:57 +01:00
Amaury Denoyelle	fd3ce173aa	BUG/MEDIUM: http_ana: ignore NTLM for reuse aggressive/always and no H1 Backend connections can be marked as private to prevent their sharing by multiple clients. Now, this has become an exception as only two reasons for data traffic can trigger this (checks are ignored here) : * http-reuse never * HTTP response with NTLM header The first case is easy to manage as the connection is flagged as private since its inception. However, the second case is dynamic as the connection can be flagged anytime during its lifetime. When using a backend protocol such as HTTP/2 with reuse mode aggressive or always, we face a design issue as the connection would be marked as private, despite potentially being shared by several clients at the same time. This is conceptually invalid, but worst it can trigger crashes on MUX stream detach callback depending on the order of release of the streams, by calling session_check_idle_conn() with a NULL session. It could also be possible to have several NTLM responses on a single connection for different sessions. In this case, connection owner is still being updated without attaching the connection to its correct session, which ultimately would cause a crash on session_check_idle_conn with an invalid session. Here are two backtrace examples from GDB for such cases : Thread 1 (Thread 0x7ff73e9fc700 (LWP 648859)): #0 session_check_idle_conn (conn=0x7ff72f597800, sess=0x0) at include/haproxy/session.h:209 #1 h2_detach (sd=<optimized out>) at src/mux_h2.c:4520 #2 0x000056151742be24 in sc_detach_endp (scp=scp@entry=0x7ff73e9f0f18) at src/stconn.c:376 #3 0x000056151742c208 in sc_destroy (sc=<optimized out>) at src/stconn.c:444 #4 0x0000561517370871 in stream_free (s=s@entry=0x7ff72a2dbd80) at src/stream.c:728 #5 0x000056151737541f in process_stream (t=t@entry=0x7ff72d5e2620, context=0x7ff72a2dbd80, state=<optimized out>) at src/stream.c:2645 #6 0x0000561517456cbb in run_tasks_from_lists (budgets=budgets@entry=0x7ff73e9f10d0) at src/task.c:632 #7 0x00005615174576b9 in process_runnable_tasks () at src/task.c:876 #8 0x000056151742275a in run_poll_loop () at src/haproxy.c:2996 #9 0x0000561517422db1 in run_thread_poll_loop (data=<optimized out>) at src/haproxy.c:3195 #10 0x00007ff789e081ca in start_thread () from /lib64/libpthread.so.0 #11 0x00007ff789a39e73 in clone () from /lib64/libc.so.6 (gdb) Thread 1 (Thread 0x7ff52e7fc700 (LWP 681458)): #0 0x0000556ebd6e7e69 in session_check_idle_conn (conn=0x7ff5787ff100, sess=0x7ff51d2539a0) at include/haproxy/session.h:209 #1 h2_detach (sd=<optimized out>) at src/mux_h2.c:4520 #2 0x0000556ebd7f3e24 in sc_detach_endp (scp=scp@entry=0x7ff52e7f0f18) at src/stconn.c:376 #3 0x0000556ebd7f4208 in sc_destroy (sc=<optimized out>) at src/stconn.c:444 #4 0x0000556ebd738871 in stream_free (s=s@entry=0x7ff520e28200) at src/stream.c:728 #5 0x0000556ebd73d41f in process_stream (t=t@entry=0x7ff565783700, context=0x7ff520e28200, state=<optimized out>) at src/stream.c:2645 #6 0x0000556ebd81ecbb in run_tasks_from_lists (budgets=budgets@entry=0x7ff52e7f10d0) at src/task.c:632 #7 0x0000556ebd81f6b9 in process_runnable_tasks () at src/task.c:876 #8 0x0000556ebd7ea75a in run_poll_loop () at src/haproxy.c:2996 #9 0x0000556ebd7eadb1 in run_thread_poll_loop (data=<optimized out>) at src/haproxy.c:3195 #10 0x00007ff5752081ca in start_thread () from /lib64/libpthread.so.0 #11 0x00007ff574e39e73 in clone () from /lib64/libc.so.6 (gdb) To solve this issue, simply ignore NTLM responses when using a multiplexer with streams support and the connection is not already attached to the session. The connection is not marked as private and will continue to be shared freely accross clients. This is considered conceptually valid as NTLM usage (rfc 4559) with HTTP is broken and was designed only with HTTP/1.1 in mind. A side-effect of the change is that SESS_FL_PREFER_LAST is also not set anymore on NTLM detection, which allows following requests to be load-balanced accross several server instances. The original behavior is kept for HTTP/1 or if the connection is already attached to the session. This last case happens when using HTTP/2 with default http-reuse safe mode since the following patch : `0d21deaded` MEDIUM: backend: add reused conn to sess if mux marked as HOL blocking This should be backported up to all stable releases. Up until 2.4, it can be taken as-is. For lesser versions, above patch is not present. In this case the condition should be restricted only to HTTP/1 usage : if (srv_conn && strcmp(srv_conn->mux->name, "H1") == 0) {	2024-03-20 14:26:57 +01:00
Amaury Denoyelle	c130f74803	BUG/MINOR: session: ensure conn owner is set after insert into session A crash could occured if a session_add_conn() would temporarily failed when called via h2_detach(). In this case, connection owner is reset to NULL. However, if this wasn't the last connection stream, the connection won't be destroyed. When h2_detach() is recalled for another stream and this time session_add_conn() succeeds, a crash will occur due to session_check_idle_conn() invocation with a NULL connection owner. To fix this, ensure connection owner is always set after session_add_conn() success. This bug is considered as minor as the only failure reason for session_add_conn() is a pool allocation issue. This should be backported up to all stable releases.	2024-03-20 14:26:57 +01:00
Christopher Faulet	eb89e4f3e0	BUG/MEDIUM: spoe: Return an invalid frame on recv if size is too small Frames with a too small size must be detected on receive and an error must be triggered. It is especially important for frames of size 0. Otherwise, because the frame length is used as return value, the frame is ignored (0 is the return value to state the frame must be ignored). It is an issue because in this case, outgoing data, the 4 bytes representing the frame size, are never consumed. If the agent also closes the connection, this leads to a wakeup loop because outgoing data are stuck and a shutdown is pending. In addition, all pending outgoing data are systematcially skipped when the applet is in SPOE_APPCTX_ST_END state. The patch should fix the issue #2490. It must be backported to all stable versions.	2024-03-19 07:54:25 +01:00
Ilia Shipitsin	3a0fc8641b	CI: temporarily adjust kernel entropy to work with ASAN/clang clang runtime (shipped with clang14) is not compatible with recent Ubuntu kernels more details: https://github.com/actions/runner-images/issues/9491	2024-03-18 19:54:33 +01:00
Ilia Shipitsin	5fe02c33bc	CLEANUP: assorted typo fixes in the code and comments This is 40th iteration of typo fixes	2024-03-18 19:54:33 +01:00
Christopher Faulet	885e40494c	MINOR: spoe: Add SPOE filters in the exposed deprecated directives It is the first deprecated directive exposed via the 'expose-deprecated-directives' global option. This way, it is possible to silent the warning about the SPOE uses.	2024-03-15 11:31:48 +01:00
Christopher Faulet	189f74d4ff	MINOR: cfgparse: Add a global option to expose deprecated directives Similarly to "expose-exprimental-directives" option, there is no a global option to expose some deprecated directives. Idea is to have a way to silent warnings about deprecated directives when there is no alternative solution. Of course, deprecated directives covered by this option are not listed and may change. It is only a best effort to let users upgrade smoothly.	2024-03-15 11:31:48 +01:00
Christopher Faulet	dff9807188	MAJOR: spoe: Deprecate the SPOE filter As announced on the ML few weeks (months ?) ago and on several GH issues, the SPOE is now deprecated. Sadly, this filter should be refactored to work properly. It was implemented as a functionnal PoC for the 1.7 and since then, no time was invest to improve it and make it truly maintainable in time. Worst, other parts of HAProxy evolve, especially applets part, making maintenance ever more expensive. Instead of keeping the SPOE filter in a this state and always reply to users encountering issues or limitations that it is far from perfect but we cannot work on it for now, we decided to deprecate it. We can still change our mind before the 3.0.0 release if the situation evolves. Otherwise the filter will be removed or marked as unmaintained for the 3.1. If the situation does not change, it means the 3.0 will be the last version with a true SPOE support.	2024-03-15 11:29:39 +01:00
Christopher Faulet	6547b14292	BUG/MINOR: spoe: Be sure to be able to quickly close IDLE applets on soft-stop On soft-stop, we try, as far as possible, to process all pending messages before closing SPOE applets. However, in sync mode, when an applets waiting for a response receives the ACK frame, it is switched to IDLE state without checking if it may be closed. In this case, we will wait the idle timeout before closing de applet, delaying the soft-stop. To reduce this delay, on soft-stop, IDLE applets are woken up. On the next wakeup, the applet will try to process pending messages or will be closed. This patch should be backported to all stable versions.	2024-03-15 09:09:22 +01:00
Christopher Faulet	3c066b1e34	BUG/MEDIUM: spoe: Don't rely on stream's expiration to detect processing timeout On stream side, the SPOE filter relied on the stream's expiration date to be woken up and be able to detect processing timeout. However, the stream expiration date must not be updated this way. Mainly because it may be overwritten at the end of process_stream(). In the worst case, it is set to TICK_ETERNITY for any reason. In this case, it is impossible to detect the SPOE filter must time out and abort the processing. The right way to do is to set an analysis expiration date on the corresponding channel, depending on the direction. This expiration date will be used to compute the stream's expiration date at the end of process_stream(). This patch may be related to issue #2478. It must be backported to all stable versions.	2024-03-15 09:09:22 +01:00
Amaury Denoyelle	7dae3ceaa0	BUG/MAJOR: server: do not delete srv referenced by session A server can only be deleted if there is no elements which reference it. This is taken care via srv_check_for_deletion(), most notably for active and idle connections. A special case occurs for connections directly managed by a session. This is for so-called private connections, when using http-reuse never or H2 + http-reuse safe for example. In this case. server does not account these connections into its idle lists. This caused a bug as the server is deleted despite the session still being able to access it. To properly fix this, add a new referencing element into the server for these session connections. A mt_list has been chosen for this. On default http-reuse, private connections are typically not used so it won't make any difference. If using H2 servers, or more generally when dealing with private connections, insert/delete should typically occur only once per session lifetime so impact on performance should be minimal. This should be backported up to 2.4. Note that srv_check_for_deletion() was introduced in 3.0 dev tree. On backport, the extra condition in it should be placed in cli_parse_delete_server() instead.	2024-03-14 15:21:07 +01:00
Amaury Denoyelle	5ad801c058	MINOR: session: rename private conns elements By default, backend connections are attached to a server instance. This allows to implement connection reuse. However, in some particular cases, connection cannot be shared accross several clients. These connections are considered and private and are attached to the session instance instead. These private connections are also indexed by the target server to not mix them. All of this is implemented via a dedicated structure previously named struct sess_srv_list. Rename it to better reflect its usage to struct sess_priv_conns. Also rename its internal members and all of the associated functions. This commit is only a renaming, thus no functional impact is expected.	2024-03-14 15:21:02 +01:00
Christopher Faulet	f31a4e302e	BUG/MINOR: listener: Don't schedule frontend without task in listener_release() null pointer dereference was reported by Coverity in listener_release() function. Indeed, we must not try to schedule frontend without task when a limit is still blocking the frontend. This issue was introduced by commit `65ae1347c7` ("BUG/MINOR: listener: Wake proxy's mngmt task up if necessary on session release") This patch should fix issue #2488. It must be backported to all stable version with the commit above.	2024-03-14 09:34:36 +01:00
Christopher Faulet	65ae1347c7	BUG/MINOR: listener: Wake proxy's mngmt task up if necessary on session release When a session is released, listener_release() function is called to notify the listener. It is an opportunity to resume limited/full listeners. We first try to resume the listener owning the released session, then all limited listeners in the global queue and finally all limited listeners in the frontend's waiting queue. This last step is only performed if there is no limit applied on the frontend. Nothing is performed if the session rate is still limited. And it is an issue because if this happens for the last listener's session, there is no other event to wake the frontend's managment task up and the listener remains in the limited state. To fix the issue, when a limit is still applied on the frontent, we must compute the new wake up date from the sessions rate and schedule the frontend's managment task. It is easy to reproduce the issue in SSL by setting a maxconn and a rate limit on sessions. This patch should fix the issue #2476. It must be backported to all stable versions.	2024-03-13 15:20:06 +01:00

... 4 5 6 7 8 ...

22119 Commits