haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-01-20 12:40:46 +00:00

Author	SHA1	Message	Date
Amaury Denoyelle	53db43aff2	MINOR: mux-quic: hide traces when woken up on pacing only Previous commit aligned default and pacing emission. This is a cleaner and more robust code. However, it may disrupt traces analysis when pacing is rescheduled until timer expiration. Hide traces when qcc_io_cb() is woken up only due to pacing and timer is not yet expired. This is implemented by using special TASK_WOKEN_IO for pacing. This should be backported up to 3.1.	2024-12-18 09:52:16 +01:00
Amaury Denoyelle	9d155ca706	MINOR: trace: implement tracing disabling API Define a set of functions to temporarily disable/reactivate tracing for the current thread. This could be useful when wanting to quickly remove tracing output for some code parts. The API relies on a disable/resume set of functions, with a thread-local counter. This counter is tested under __trace_enabled(). It is a cumulative value so that the same count of resume must be issued after several disable usage. There is also the possibility to force reset the counter to 0 before restoring the old value. This should be backported up to 3.1.	2024-12-18 09:52:06 +01:00
Amaury Denoyelle	41f0472d96	MEDIUM: mux-quic: remove pacing specific code on qcc_io_cb Pacing was recently implemented by QUIC MUX. Its tasklet is rescheduled until next emission timer is reached. To improve performance, an alternate execution of qcc_io_cb was performed when rescheduled due to pacing. This was implemented using TASK_F_USR1 flag. However, this model is fragile, in particular when several events happened alongside pacing scheduling. This has caused some issue recently, most notably when MUX is subscribed on transport layer on receive for handshake completion while pacing emission is performed in parallel. MUX qcc_io_cb() would not execute the default code path, which means the reception event is silently ignored. Recent patches have reworked several parts of qcc_io_cb. The objective was to improve performance with better algorithm on send and receive part. Most notable, qcc frames list is only cleared when new data is available for emission. With this, pacing alternative code is now mostly unneeded. As such, this patch removes it. The following changes are performed : * TASK_F_USR1 is now not used by QUIC MUX. As such, tasklet_wakeup() default invokation can now replace obsolete wrappers qcc_wakeup/qcc_wakeup_pacing * qcc_purge_sending is removed. On pacing rescheduling, all qcc_io_cb() is executed. This is less error-prone, in particular when pacing is mixed with other events like receive handling. This renders the code less fragile, as it completely solves the described issue above. This should be backported up to 3.1.	2024-12-18 09:49:20 +01:00
Amaury Denoyelle	14710b5e6b	MEDIUM/OPTIM: mux-quic: do not rebuild frms list on every send A newly introduced frames list member has been defined into QCC instance with pacing implementation. This allowed to preserve STREAM frames built between different emission scheduled by pacing, without having to regenerate it if no new QCS data is available. Generalize this principle outside of pacing scheduling. Now, the frames list will be reused accross several qcc_io_send() usage. Frames list is only cleared when necessary. This will force its refreshing in the next qcc_io_send() via qcc_build_frms_list(). Frames list refreshing is performed in the following cases : * on successful transfer from stream snd_buf / done_ff / shut * on stream reset or read abort * on max_data/max_stream_data reception with window increase Note that the two first cases are in fact covered directly due to qcc_send_stream() usage when QCS is (re)inserted into the send_list. The main objective of this patch will be to remove QUIC MUX pacing specific code path. It could also provide better performance as emission of large frames may often be rescheduled due to transport layer, either on congestion or full socket buffer. When QUIC MUX is rescheduled, no new data is available and frames list can be reuse as-is, avoiding an unecessary loop over send_list. This should be backported up to 3.1.	2024-12-18 09:49:02 +01:00
Amaury Denoyelle	9ecc1a8e57	MINOR: mux-quic: split STREAM and RS/SS emission This commit is a follow-up of the previous one which defines function qcc_build_frms(). This function implements looping over qcc send_list, to both encode and send individually any STOP_SENDING and RESET_STREAM, but also encode STREAM frames as a preparator step. STREAM frames were then sent as a list outside of qcc_build_frms() via qcc_send_frames(). Extract STOP_SENDING/RESET_STREAM encoding and emission step into a new function qcc_emit_rs_ss(). The code is thus cleaner. In particular it highlights that an error during STOP_SENDING/RESET_STREAM emission stage is fatal and prevent any STREAM frames processing. This should be backported up to 3.1.	2024-12-18 09:40:21 +01:00
Amaury Denoyelle	244dc00b09	MINOR: mux-quic: extract code to build STREAM frames list Extracts code responsible to generate STREAM, RESET_STREAM and STOP_SENDING frames for each qcs instances registered in qcc send_list. It is moved from qcc_io_send() to its owned new function qcc_build_frms(). This commit does not bring functional change. It is a preparatory step to adapt QUIC MUX send mechanism to allow reusing of qcc frms list accross qcc_io_send() invokation. As a side change, qcc_tx_frms_free() is renamed to qcc_clear_frms(). This better highlights its relationship with qcc_build_frms(). This should be bkacported up to 3.1.	2024-12-18 09:38:19 +01:00
Amaury Denoyelle	e296585ae9	MEDIUM/OPTIM: mux-quic: implement purg_list This commit is part of the current serie which aims to refactor and improve overall performance of QUIC MUX I/O handler. qcc_io_process() is responsible to perform some internal operations on QUIC MUX after I/O completion. It is notably called on every qcc_io_cb() tasklet handler. The most intensive work on it is the purging of QCS instances after transfer completion. This was implemented by looping on QCC streams tree and inspecting the state of every QCS. The purpose of this commit is to optimize this processing. A new purg_list QCC member is defined. It is responsible to list every QCS instances whose transfer has been completed. It is thus safe to reuse <el_send> QCS list attach point. Stream purging will thus only loop on purg_list instead of every known QCS. This should be backported up to 3.1.	2024-12-18 09:33:52 +01:00
Amaury Denoyelle	4b42dd4ae0	MEDIUM/OPTIM: mux-quic: define a recv_list for demux resumption This commit is part of the current serie which aims to refactor and improve overall performance of QUIC MUX I/O handler. Define a recv_list element into qcc structure. This is used to registered every instance of qcs which are currently blocked on demuxing, which happen on no more space in <rx.appbuf>. The purpose of this patch is to reduce qcc_io_recv() CPU usage. Now, only recv_list iteration is performed, instead of the previous looping over every qcs instances. This is useful as qcc_io_recv() is called each time qcc_io_cb() is scheduled, even if only sending condition was the wakeup origin. A qcs is not inserted into recv_list immediately after blocking on demux full buffer. Instead, this is only done after unblocking via stream rcv_buf callback, which ensure that new buffer space is available. This should be backported up to 3.1.	2024-12-18 09:23:41 +01:00
Amaury Denoyelle	0a53a008d0	MINOR: mux-quic: refactor wait-for-handshake support This commit refactors wait-for-handshake support from QUIC MUX. The flag logic QC_CF_WAIT_HS is inverted : it is now positionned only if MUX is instantiated before handshake completion. When the handshake is completed, the flag is removed. The flag is now set directly on initialization via qmux_init(). Removal via qcc_wait_for_hs() is moved from qcc_io_process() to qcc_io_recv(). This is deemed more logical as QUIC MUX is scheduled on RECV to be notify by the transport layer about handshake termination. Moreover, qcc_wait_for_hs() is now called if recv subscription is still active. This commit is the first of a serie which aims to refactor QUIC MUX I/O handler and improves its overall performance. The ultimate objective is to be able to stream qcc_io_cb() by removing pacing specific code path via qcc_purge_sending(). This should be backported up to 3.1.	2024-12-18 09:23:41 +01:00
Amaury Denoyelle	9dcd2369e2	MINOR: quic: add traces Add some traces to better follow QUIC MUX scheduling, in particular with pacing interaction. This should be backported up to 3.1.	2024-12-18 09:20:20 +01:00
Amaury Denoyelle	17bfe93768	CLEANUP: mux-quic: remove unused qcc member send_retry_list Remove unused fields send_retry_list from qcc and its corresponding attach element el from qcs. This should be backported up to 3.1.	2024-12-18 09:20:20 +01:00
Amaury Denoyelle	2e3542bec6	BUG/MEDIUM: mux-quic: do not mix qcc_io_send() return codes with pacing With pacing implementation, qcc_send_frames() return code has been extended to report emission interruption due to pacing limitation. This is used only in qcc_io_send(). However, its invokation may be skipped using 'sent_done' label. This happens on emission failure of a STOP_SENDING or RESET_STREAM (either memory allocation failure, or transport layer rejection). In this case, return values are mixed as qcs_send() is wrongly compared against pacing interruption condition. This value corresponds to the length of the last built STREAM frames. If by mischance the last frame was 1 byte long, qcs_send() return value is equal to pacing interruption condition. This has several effects. If pacing is activated, it may lead to unneeded wakeup on QUIC MUX. Worst, if pacing is not used, a BUG_ON() crash will be triggered. Fix this by using a different variable dedicated to qcc_send_frames() return value. By default it is initialized to 0. This ensures that pacing code won't be activated in case qcc_send_frames() is not used. This must be backported up to 3.1.	2024-12-18 09:18:48 +01:00
Willy Tarreau	93d4e9d50f	CLEANUP: ssl-sock: drop two now unneeded ALREADY_CHECKED() In ssl_sock_bind_verifycbk() a BUG_ON() checks the validity of "ctx" and "bind_conf". There was a pair of ALREADY_CHECKED() macros after BUG_ON() for the case where DEBUG_STRICT=0. But this is now addressed so we can remove these two macros and rely on the BUG_ON() instead.	2024-12-17 17:47:57 +01:00
Willy Tarreau	7760e3a374	CLEANUP: quic: replace ALREADY_CHECKED() with ASSUME_NONNULL() at a few places There were 4 instances of ALREADY_CHECKED() used to tell the compiler that the argument couldn't be NULL by design. Let's change them to the cleaner ASSUME_NONNULL(). Functions like qc_snd_buf() were slightly reduced in size (-24 bytes). Apparently gcc-13 sees a potential case that others don't see, and it's likely a bug since depending what is masked, it will completely change the output warnings to the point of contradicting itself. After many attempts, it appears that just checking that CMSG_FIRSTHDR(msg) is not null suffices to calm it down, so the strange warnings might have been the result of an overoptimization based on a supposed UB in the first place. At least now all versions up to 13.2 as well as clang are happy.	2024-12-17 17:47:57 +01:00
Willy Tarreau	1f93622779	CLEANUP: stats: use ASSUME_NONNULL() to indicate that the first block exists In stats_scope_ptr(), the validity of blk() was assumed using ALREADY_CHECKED(blk), but we can now use the cleaner ASSUME_NONNULL(). In addition this simplifies the BUG_ON() check that follows.	2024-12-17 17:47:57 +01:00
Willy Tarreau	6dfd541ca8	CLEANUP: mux-fcgi: use ASSUME_NONNULL() to indicate that the first block exists In fcgi_snd_buf(), this was previously achieved using ALREADY_CHECKED(blk), but we can now fold it into the cleaner ASSUME_NONNULL().	2024-12-17 17:47:57 +01:00
Willy Tarreau	143a103696	CLEANUP: htx: use ASSUME_NONNULL() to mark the start line as non-null In http_replace_req_uri(), this assumption was previously made using ALREADY_CHECKED() but the new one is cleaner (and smaller, 24 bytes less).	2024-12-17 17:47:57 +01:00
Willy Tarreau	a4f50c69e4	CLEANUP: hlua: use ASSUME_NONNULL() instead of ALREADY_CHECKED() The purpose of the test in hlua_applet_tcp_new() was precisely to declare non-nullity. Let's just do it using ASSUME_NONNULL() now.	2024-12-17 17:47:57 +01:00
Willy Tarreau	29b2c5d4d4	CLEANUP: cache: use ASSUME_NONNULL() instead of DISGUISE() DISGUISE() was used to avoid a NULL warning. Using ASSUME_NONNULL() instead makes it clearer and made the function slightly shorter.	2024-12-17 17:42:11 +01:00
Willy Tarreau	7b6acb6a51	MINOR: bug: make BUG_ON() fall back to ASSUME When the strict level is zero and BUG_ON() is not implemented, some possible null-deref warnings are emitted again because some were covering for these cases. Let's make it fall back to ASSUME() so that the compiler continues to know that the tested expression never happens. It also allows to further optimize certain functions by helping the compiler eliminate certain tests for impossible values. However it requires that the expression is really evaluated before passing the result through ASSUME() otherwise it was shown that gcc-11 and above will fail to evaluate its implications and will continue to emit the null-deref warnings in case the expression is non-trivial (e.g. it has multiple terms). We don't do it for BUG_ON_HOT() however because the extra cost of evaluating the condition is generally not welcome in fast paths, particularly when that BUG_ON_HOT() was kept disabled for performance reasons.	2024-12-17 17:39:12 +01:00
Willy Tarreau	63798088b3	MINOR: compiler: add ASSUME_NONNULL() to tell the compiler a pointer is valid At plenty of places we have ALREADY_CHECKED() or DISGUISE() on a pointer just to avoid "possibly null-deref" warnings. These ones have the side effect of weakening optimizations by passing through an assembly step. Using ASSUME_NONNULL() we can avoid that extra step. And when the __builtin_unreachable() builtin is not present, we fall back to the old method using assembly. The macro returns the input value so that it may be used both as a declarative way to claim non-nullity or directly inside an expression like DISGUISE().	2024-12-17 16:46:46 +01:00
Willy Tarreau	2ce63b7b17	MINOR: compiler: also enable __builtin_assume() for ASSUME() Clang apparently has __builtin_assume() which does exactly the same as our macro, since at least v3.8. Let's enable it, in case it may even better detect assumptions vs unreachable code.	2024-12-17 16:46:46 +01:00
Willy Tarreau	efc897484b	MINOR: compiler: add a new "ASSUME" macro to help the compiler This macro takes an expression, tests it and calls an unreachable statement if false. This allows the compiler to know that such a combination does not happen, and totally eliminate tests that would be related to this condition. When the statement is not available in the compiler, we just perform a break from a do {} while loop so that the expression remains evaluated if needed (e.g. function call).	2024-12-17 16:46:46 +01:00
Willy Tarreau	41fc18b1d1	MINOR: compiler: rely on builtin detection for __builtin_unreachable() Due to __builtin_unreachable() only being associated to gcc 4.5 and above, it turns out it was not enabled for clang. It's not used that much but still a little bit, so let's enable it now. This reduces the code size by 0.2% and makes it a bit more efficient.	2024-12-17 16:46:46 +01:00
Willy Tarreau	96cfcb1df3	MINOR: compiler: add a __has_builtin() macro to detect features more easily We already have a __has_attribute() macro to detect when the compiler supports a specific attribute, but we didn't have the equivalent for builtins. clang-3 and gcc-10 have __has_builtin() for this. Let's just bring it using the same mechanism as __has_attribute(), which will allow us to simply define the macro's value for older compilers. It will save us from keeping that many compiler-specific tests that are incomplete (e.g. the __builtin_unreachable() test currently doesn't cover clang).	2024-12-17 16:46:46 +01:00
Willy Tarreau	4710ab5604	BUILD: debug: only dump/reset glitch counters when really defined If neither DEBUG_GLITCHES nor DEBUG_STRICT is set, we end up with no dbg_cnt section, resulting in debug_parse_cli_counters not building due to __stop_dbg_cnt and __start_dbg_cnt not being defined. Let's just condition the end of the function to these conditions. An alternate approach (less elegant) is to always declare a dummy entry of type DBG_COUNTER_TYPES in debug.c. This must be backported to 3.1 since it was brought with glitches.	2024-12-17 16:46:25 +01:00
Olivier Houchard	b3cd5a4b86	CLEANUP: queues: Remove pendconn_grab_from_px(). pendconn_grab_from_px() is now unused, so just remove it.	2024-12-17 16:05:44 +01:00
Olivier Houchard	111ea83ed4	BUG/MEDIUM: queues: Do not use pendconn_grab_from_px(). pendconn_grab_from_px() was called when a server was brought back up, to get some streams waiting in the proxy's queue and get them to run on the newly available server. It is very similar to process_srv_queue(), except it only goes through the proxy's queue, which can be a problem, because there is a small race condition that could lead us to add more streams to the server queue just as it's going down. If that happens, the server would just be ignored when back up by new streams, as its queue is not empty, and it would never try to process its queue. The other problem with pendconn_grab_from_px() is that it is very liberal with how it dequeues streams, and it is not very good at enforcing maxconn, it could lead to having 3*maxconn connections. For both those reasons, just get rid of pendconn_grab_from_px(), and just use process_srv_queue(). Both problems are easy to reproduce, especially on a 64 threads machine, set a maxconn to 100, inject in H2 with 1000 concurrent connections containing up to 100 streams each, and after a few seconds/minutes the max number of concurrent output streams will be much higher than maxconn, and eventually the server will stop processing connections. It may be related to github issue #2744. Note that it doesn't totally fix the problem, we can occasionally see a few more connections than maxconn, but the max that have been observed is 4 more connections, we no longer get multiple times maxconn. have more outgoing connections than maxconn, This should be backported up to 2.6.	2024-12-17 16:05:44 +01:00
Olivier Houchard	dc9ce9c264	BUG/MEDIUM: queues: Make sure we call process_srv_queue() when leaving In stream_free(), make sure we call process_srv_queue() each time we call sess_change_server(), otherwise a server may end up not dequeuing any stream when it could do so. In some extreme cases it could lead to an infinite loop, as the server would appear to be available, as its "served" parameter would be < maxconn, but would end up not being used, as there are elements still in its queue. This should be backported up to 2.6.	2024-12-17 16:05:44 +01:00
Christopher Faulet	4f32d03360	BUG/MEDIUM: stconn: Only consider I/O timers to update stream's expiration date In sc_notify(), it remained a case where it was possible to set an expiration date on the stream in the past, leading to a crash because of a BUG_ON(). This must never happen of course. In sc_notify(), The stream's expiration may be updated in case no wakeup conditions are encoutered. In that case, we must take care to never set an expiration date in the past. However, it appeared there was still a condition to do so. This code is based on an implicit postulate: the stream's expiration date must always be set when we leave process_stream(). It was true since the 2.9. But in 3.0, the buffer allocation mechanism was improved and on an alloc failure in process_stream(), the stream is inserted in a wait-list and its expiration date is set to TICK_ETERNITY. With the good timing, and an analysis expiration date set on a channel, it is possible to set the stream's expiration date in past. After analysis, it appeared that the proper way to fix the issue is to only evaluate I/O timers (read and write timeout) and not stream's timers (analase_exp or conn_exp) because only I/O timers may have changed since the last process_stream() call. This patch must be backported as far as 3.0 to fix the issue. But it is probably a good idea to also backported it as far as 2.8.	2024-12-16 17:47:25 +01:00
William Lallemand	e3b760ebcc	BUG/MINOR: ssl/cli: 'show ssl ca-file' escape the first '' of a filename When doing a 'show ssl ca-file <filename>', prefixing a filename with a '' allows to show the uncommited transaction asociated to this filename. However for people using '*' as the first character of their filename, there is no way to access this filename. This patch fixes the problem by allowing to escape the first character with \. This should be backported in every stable branches.	2024-12-16 17:09:34 +01:00
William Lallemand	82c83a11a1	BUG/MINOR: ssl/cli: 'show ssl crl-file' escape the first '' of a filename When doing a 'show ssl crl-file <filename>', prefixing a filename with a '' allows to show the uncommited transaction asociated to this filename. However for people using '*' as the first character of their filename, there is no way to access this filename. This patch fixes the problem by allowing to escape the first character with \. This should be backported in every stable branches.	2024-12-16 16:46:52 +01:00
William Lallemand	2ba4cf541b	BUG/MINOR: ssl/cli: 'show ssl cert' escape the first '' of a filename When doing a 'show ssl cert <filename>', prefixing a filename with a '' allows to show the uncommited transaction asociated to this filename. However for people using '*' as the first character of their filename, there is no way to access this filename. This patch fixes the problem by allowing to escape the first character with \. This should be backported in every stable branches.	2024-12-16 16:17:12 +01:00
William Lallemand	fd35b7fb97	MINOR: ssl/cli: add -A to the 'show ssl sni' command description Add [-A] to the 'show ssl sni' command description.	2024-12-16 15:22:27 +01:00
William Lallemand	7c8e38d4d6	MINOR: ssl/cli: allow to filter expired certificates with 'show ssl sni' -A option in 'show ssl sni' shows certificates that are past the notAfter date. The patch reworks the options parsing to get multiple.	2024-12-16 14:55:23 +01:00
William Lallemand	bb88f68cf7	MINOR: ssl: add utils functions to extract X509 notAfter date Add ASN1_to_time_t() which converts an ASN1_TIME to a time_t and x509_get_notafter_time_t() which returns the notAfter date in time_t format.	2024-12-16 14:54:53 +01:00
Valentine Krasnobaeva	fbc534a6fa	REORG: startup: move nofile limit checks in limits.c Let's encapsulate the code, which checks the applied nofile limit into a separate helper check_nofile_lim_and_prealloc_fd(). Let's keep in this new function scope the block, which tries to create a copy of FD with the highest number, if prealloc-fd is set in the configuration.	2024-12-16 10:44:01 +01:00
Valentine Krasnobaeva	14f5e00d38	REORG: startup: move code that applies limits to limits.c In step_init_3() we try to apply provided or calculated earlier haproxy maxsock and memmax limits. Let's encapsulate these code blocks in dedicated functions: apply_nofile_limit() and apply_memory_limit() and let's move them into limits.c. Limits.c gathers now all the logic for calculating and setting system limits in dependency of the provided configuration.	2024-12-16 10:44:01 +01:00
Valentine Krasnobaeva	1332e9b58d	REORG: startup: move global.maxconn calculations in limits.c Let's encapsulate the code, which calculates global.maxconn and global.maxsslconn into a dedicated function set_global_maxconn() and let's move this function in limits.c. In limits.c we keep helpers to calculate and check haproxy internal limits, based on the system nofile and memory limits.	2024-12-16 10:44:01 +01:00
Frederic Lecaille	949bc18f66	CLEANUP: quic: Rename some BBR functions in relation with bw probing Rename bbr_is_probing_bw() to bbr_is_in_a_probe_state() and bbr_is_accelerating_probing_bw() to bbr_is_probing_bw() to match the function names of the BBR v3 internet draft. Must be backported to 3.1 to ease any further backport to come.	2024-12-13 19:41:21 +01:00
Frederic Lecaille	0dc0c890ea	BUG/MINOR: quic: missing Startup accelerating probing bw states Startup state is also a probing with acceleration bandwidth state. This modification should have come with this previous one: BUG/MINOR: quic: reduce packet losses at least during ProbeBW_CRUISE (BBR) Must be backported to 3.1.	2024-12-13 19:41:21 +01:00
Valentine Krasnobaeva	ea4a148a7d	REGTESTS: ssl: add a PEM with mix of LF and CRLF line endings User tried to update a PEM, generated automatically. Part of this PEM has LF line endings, and another part (CA certificate), added by some API, has CRLF line endings. This has revealed a bug in cli_snd_buf(), see more details in issue GitHUB #2818. So, let's add an example of such PEM in our SSL regtest.	2024-12-13 18:13:42 +01:00
Valentine Krasnobaeva	d60c893991	BUG/MINOR: cli: cli_snd_buf: preserve \r\n for payload lines cli_snd_buf() analyzez input line by line. Before this patch it has always scanned a given line for the presence of '\r' followed by '\n'. This is only needed for strings, that contain the commands itself like "show ssl cert\n", "set ssl cert test.pem <<\n". In case of strings, which contain the command's payload, like "-----BEGIN CERTIFICATE-----\r\n", '\r\n' should be preserved as is. This patch fixes the GitHub issue #2818. This patch should be backported in v3.1 and in v3.0.	2024-12-13 18:13:42 +01:00
Frederic Lecaille	178109f608	BUG/MINOR: quic: too permissive exit condition for high loss detection in Startup (BBR) This bug fixes the 3rd condition used by bbr_check_startup_high_loss() to decide it has detected some high loss as mentioned by the BBR v3 RFC draft: 4.3.1.3. Exiting Startup Based on Packet Loss ... There are at least BBRStartupFullLossCnt=6 discontiguous sequence ranges lost in that round trip. where a <= operator was used in place of <. Must be backported to 3.1.	2024-12-13 14:42:43 +01:00
Frederic Lecaille	e61b418907	BUG/MINOR: quic: fix the wrong tracked recovery start time value bbr_congestion_event() role is to track the start time of recovery periods. This was done using <ts> passed as parameter. But this parameter is the time the newest lost packet has been sent. The timestamp value to store in ->recovery_start_ts is <now_ms>. Must be backported to 3.1.	2024-12-13 14:42:43 +01:00
Frederic Lecaille	e1d25cdbdd	CLEANUP: quic: remove a wrong comment about ->app_limited (drs) ->app_limited quic_drs struct member is not a boolean. This is the index of the last transmitted packet marked as application-limited, or 0 if the connection is not currently application-limited (see C.app_limited definition in BBR v3 draft).	2024-12-13 14:42:43 +01:00
Frederic Lecaille	eeaeb412dc	MINOR: quic: reduce the private data size of QUIC cc algos After these commits: BUG/MINOR: quic: remove max_bw filter from delivery rate sampling BUG/MINOR: quic: fix BBB max bandwidth oscillation issue where some members were removed from bbr struct, the private data size of QUIC cc algorithms may be reduced from 160 to 144 uint32_t. Should be easily backported to 3.1 alonside the commits mentioned above.	2024-12-13 14:42:43 +01:00
Frederic Lecaille	9813de0537	BUG/MINOR: quic: reduce packet losses at least during ProbeBW_CRUISE (BBR) Upon congestion events (for a instance packet loss), bbr_adapt_lower_bounds_from_congestion() role is to adapt some BBR internal variables in relation with the estimated bandwidth (BBR.bw). According to the BBR v3 draft, this function should do nothing if BBRIsProbingBW() pseudo-code returns true. That said, this function is not defined by the BBR v3 draft. But according to this part mentioned before defining the pseudo-code for BBRAdaptLowerBoundsFromCongestion(): 4.5.10.3. When not Probing for Bandwidth When not explicitly accelerating to probe for bandwidth (Drain, ProbeRTT, ProbeBW_DOWN, ProbeBW_CRUISE), BBR responds to loss by slowing down to some extent. This is because loss suggests that the available bandwidth and safe volume of in-flight data may have decreased recently, and the flow needs to adapt, slowing down toward the latest delivery process. BBR flows implement this response by reducing the short-term model parameters, BBR.bw_lo and BBR.inflight_lo. BBRIsProbingBW() should concern the accelerating probe for bandwidth states which are BBR_ST_PROBE_BW_REFILL and BBR_ST_PROBE_BW_UP. Adapt the code to match this latter assumption. At least this reduce drastically the packet loss volumes at least during ProbeBW_CRUISE. As an example, on a 100MBits/s internet link with ~94ms as RTT, before this patch, 4329640 sent packets were needed with 1617119 lost packets (!!!) to download a 3GB object. After this patch, 2843952 sent packets vs 144134 lost packets are needed. There may be some packet loss issue. I suspect the maximum bandwidth which may be overestimated. More this is the case, more the packet loss is big. That said, at this time, it remains below 5% depending on the size of the objects, 5% being for more than 2GB objects. Must be backported to 3.1.	2024-12-13 14:42:43 +01:00
Frederic Lecaille	ebfc301d5d	BUG/MINOR: quic: underflow issue for bbr_inflight_hi_from_lost_packet() Add a test to ensure that values of a local variable used by bbr_inflight_hi_from_lost_packet() is not be impacted by underflow issues when subtracting too big numbers and make this function return a correct value. Must be backported to 3.1.	2024-12-13 14:42:43 +01:00
Frederic Lecaille	22ab45a3a8	BUG/MINOR: quic: remove max_bw filter from delivery rate sampling This filter is no more needed after this commit: BUG/MINOR: quic: fix BBB max bandwidth oscillation issue. Indeed, one added this filter at delivery rate sampling level to filter the BBR max bandwidth estimations and was inspired from ngtcp2 code source when trying to fix the oscillation issue. But this BBR max bandwidth oscillation issue was fixed by the aforementioned commit. Furthermore this code tends to always increment the BBR max bandwidth. From my point of view, this is not a good idea at all. Must be backported to 3.1.	2024-12-13 14:42:43 +01:00

1 2 3 4 5 ...

23701 Commits