haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-03-05 19:10:45 +00:00

Author	SHA1	Message	Date
Willy Tarreau	7999bfbfd3	MEDIUM: buffers: make b_xfer() automatically swap buffers when possible Whenever it's possible to avoid a copy, b_xfer() will simply swap the buffer's heads without touching the data. This has brought the performance back from 140 kH/s to 202 kH/s on the test case.	2018-07-20 19:21:43 +02:00
Willy Tarreau	a56a6def91	MEDIUM: h2: move headers and data frame decoding to their respective parsers Now we entirely process the input frame before transfering it above, so that h2_rcv_buf() doesn't have to "speak" h2 anymore.	2018-07-20 19:21:43 +02:00
Willy Tarreau	454b57b347	MEDIUM: h2: centralize transfer of decoded frames in h2_rcv_buf() We still call the parser but it should soon not be needed anymore. The decode functions don't need the buffer nor the max size anymore. They must also not touch the CS_FL_EOS or CS_FL_RCV_MORE flags either, so this is done within h2_rcv_buf() after transmission. The "flags" argument to h2_frt_decode_headers() and h2_frt_transfer_data() has been removed since it's not used anymore.	2018-07-20 19:21:43 +02:00
Willy Tarreau	d755ea6c7d	MEDIUM: h2: make h2_frt_transfer_data() copy via an intermediary buffer The purpose here is also to ensure we can split the lower from the top layers. The way the CS_FL_MSG_MORE flag is set was updated so that it's set or cleared upon exit depending on the buffer's remaining contents.	2018-07-20 19:21:43 +02:00
Willy Tarreau	937f760e1e	MEDIUM: h2: make h2_frt_decode_headers() use an intermediary buffer The purpose is to decode to a temporary buffer and then to copy this buffer to the caller. This double-copy definitely has an impact on performance, the test code goes down from 220k to 140k req/s, but this memcpy() will disappear soon. The test on CO_RFL_BUF_WET has become irrelevant now since we only use the cs' rxbuf, so we cannot be blocked by "output" data that has to be forwarded first. Thus instead we don't start until the rxbuf is empty (it will be drained from any input data once the stream processes it).	2018-07-20 19:21:43 +02:00
Willy Tarreau	0b559071dd	MINOR: h2: make each H2 stream support an intermediary input buffer The purpose is to decode to a temporary buffer and then to copy this buffer to the caller upon request to avoid having to process frames on the fly when called from the higher level. For now the buffer is only initialized on stream creation via cs_new() and allocated if the buffer_wait's callback is called.	2018-07-20 19:21:43 +02:00
Willy Tarreau	67b1e78f68	MEDIUM: stream-int: automatically call si_cs_recv_cb() if the cs has data on wake() If the cs has data pending or shutdown and the input channel is still waiting for reads, let's simply call the recv() function from the wake() callback. This will allow the lower layers to simply wake the upper one up without having to consider the recv() nor anything else.	2018-07-20 19:21:43 +02:00
Willy Tarreau	11c9aa424e	MEDIUM: conn_stream: add cs_recv() as a default rcv_buf() function This function is generic and is able to automatically transfer data from a conn_stream's rx buffer to the destination buffer. It does this automatically if the mux doesn't define another rcv_buf() function.	2018-07-20 19:21:43 +02:00
Willy Tarreau	5e1cc5ea83	MINOR: conn_stream: add an rx buffer to the conn_stream In order to reorganize the connection layers, recv() operations will need to be retryable and to support partial transfers. This requires an intermediary buffer to hold the data coming from the mux. After a few attempts, it turns out that this buffer is best placed inside the conn_stream itself. For now it's only set to buf_empty and it will be up to the caller to allocate it if required.	2018-07-20 19:21:43 +02:00
Willy Tarreau	a3f7efe009	MINOR: conn_stream: add a new CS_FL_REOS flag This flag indicates that the mux layer has already detected an end of stream which will become CS_FL_EOS during a recv() once the rx buffer is empty.	2018-07-20 19:21:43 +02:00
Willy Tarreau	9382cdd8e1	DOC: add some design notes about the new layering model This explains how streams and connection should interact.	2018-07-20 19:21:43 +02:00
Willy Tarreau	f148888d19	MINOR: buffers: add b_xfer() to transfer data between buffers Instead of open-coding buffer-to-buffer transfers using blocks, let's have a dedicated function for this. It also adjusts the buffer counts.	2018-07-20 19:21:43 +02:00
Willy Tarreau	f7d0447376	MINOR: buffers: split b_putblk() into __b_putblk() The latter function is more suited to operations that don't require any check because the check has already been performed. It will be used by other b_* functions.	2018-07-20 19:21:43 +02:00
Willy Tarreau	ab322d4fd4	MINOR: buffers: simplify b_contig_space() This function is used a lot in block copies and is needlessly complicated since it still uses pointer arithmetic. Let's fall back to regular offsets and simplify it. This removed around 23 bytes from b_putblk() and it removed any conditional jump.	2018-07-20 19:21:43 +02:00
Olivier Houchard	f495fc460e	BUG/MEDIUM: mux_h2: Call h2_send() before updating polling. In h2_wake(), make sure we call h2_send() before we try to update the polling flags, and detect connection errors, or errors will never be detected.	2018-07-20 19:07:49 +02:00
Christopher Faulet	ddb6c16576	BUG/MEDIUM: threads: Fix the exit condition of the thread barrier In thread_sync_barrier, we exit when all threads have set their own bit in the barrier mask. It is done by comparing it to all_threads_mask. But we must not use a simple equality to do so, becaue all_threads_mask may change. Since commit `ba86c6c25` ("MINOR: threads: Be sure to remove threads from all_threads_mask on exit"), when a thread exit, its bit is removed from all_threads_mask. Instead, we must use a bitwise AND to test is all bits of all_threads_mask are set. This also requires that all_threads_mask is set to volatile if we want to catch changes. This patch must be backported in 1.8.	2018-07-20 14:24:41 +02:00
Christopher Faulet	20761453fb	MINOR: ist: Add the function isteqi This new function does the same as isteq, but ignoring the case.	2018-07-20 13:39:30 +02:00
Christopher Faulet	5f8ef13d5d	MINOR: debug: Add checks for conn_stream flags This may be carefully backported to 1.8 (a few flags don't exist there).	2018-07-20 13:39:30 +02:00
Christopher Faulet	aff9328739	MINOR: debug: Add check for CO_FL_WILL_UPDATE This could be backported to 1.8.	2018-07-20 13:39:30 +02:00
Tim Duesterhus	3ce3811a9c	BUILD: Generate sha256 checksums in publish-release Currently only md5 signatures are generated. While md5 still is not broken with regard to preimage attacks, sha256 clearly is the current secure solution. This patch should be backported to all supported branches.	2018-07-20 10:50:20 +02:00
Christopher Faulet	4507351a2f	BUG/MINOR: build: Fix compilation with debug mode enabled It remained some fragments of the old buffers API in debug messages, here and there. This was caused by the recent buffer API changes, no backport is needed.	2018-07-20 10:45:20 +02:00
Christopher Faulet	005e79e5dd	BUG/MINOR: http: Set brackets for the unlikely macro at the right place When test on the header "Early-Data" is made, the unlikely macro must encompass the condition. This patch must be backported in 1.8.	2018-07-20 10:42:53 +02:00
Willy Tarreau	8318885487	MINOR: connection: simplify subscription by adding a registration function This new function wl_set_waitcb() prepopulates a wait_list with a tasklet and a context and returns it so that it can be passed to ->subscribe() to be added to a connection or conn_stream's wait_list. The caller doesn't need to know all the insiders details anymore this way.	2018-07-19 18:31:07 +02:00
Olivier Houchard	910b2bc829	MEDIUM: connections/mux: Revamp the send direction. Totally nuke the "send" method, instead, the upper layer decides when it's time to send data, and if it's not possible, uses the new subscribe() method to be called when it can send data again.	2018-07-19 18:31:07 +02:00
Olivier Houchard	6ff2039d13	MINOR: connections/mux: Add a new "subscribe" method. Add a new "subscribe" method for connection, conn_stream and mux, so that upper layer can subscribe to them, to be called when the event happens. Right now, the only event implemented is "SUB_CAN_SEND", where the upper layer can register to be called back when it is possible to send data. The connection and conn_stream got a new "send_wait_list" entry, which required to move a few struct members around to maintain an efficient cache alignment (and actually this slightly improved performance).	2018-07-19 16:23:43 +02:00
Olivier Houchard	e17c2d3e57	MINOR: tasklets: Don't attempt to add a tasklet in the list twice. Don't try to add a tasklet to the run queue if it's already in there, or we might get an infinite loop.	2018-07-19 16:23:43 +02:00
Willy Tarreau	23d465c48c	DOC: buffers: remove obsolete docs about buffers A number of outdated docs dating 2012 about buffers implementation and management were totally irrelevant to the current code (and even to most 1.8 code as well). These docs have all been removed so that only the up to date documentation remains.	2018-07-19 16:23:43 +02:00
Willy Tarreau	9d752e96d4	DOC: buffers: document the new buffers API Most of the new functions have been documented in buffer-api.txt, with a diagram of what a buffer looks like and some hints to convert older code.	2018-07-19 16:23:43 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later permit to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the othe ones.	2018-07-19 16:23:43 +02:00
Willy Tarreau	bd1dba8a89	MINOR: buffer: rename the data length member to '->data' It used to be called 'len' during the reorganisation but strictly speaking it's not a length since it wraps. Also we already use '_data' as the suffix to count available data, and data is also what we use to indicate the amount of data in a pipe so let's improve consistency here. It was important to do this in two operations because data used to be the name of the pointer to the storage area.	2018-07-19 16:23:43 +02:00
Willy Tarreau	e3128024bf	MINOR: buffer: replace buffer_replace2() with b_rep_blk() This one is more generic and designed to work on a random block. It may later get a b_rep_ist() variant since many strings are already available as (ptr,len).	2018-07-19 16:23:43 +02:00
Willy Tarreau	4d893d440c	MINOR: buffers/channel: replace buffer_insert_line2() with ci_insert_line2() There was no point keeping that function in the buffer part since it's exclusively used by HTTP at the channel level, since it also automatically appends the CRLF. This further cleans up the buffer code.	2018-07-19 16:23:43 +02:00
Willy Tarreau	7b04cc4467	CLEANUP: buffer: minor cleanups to buffer.h Remove a few unused functions and add some comments to split the file parts in sections.	2018-07-19 16:23:43 +02:00
Willy Tarreau	911f7dd893	MINOR: buffers: remove b_putstr() It's not needed anymore.	2018-07-19 16:23:43 +02:00
Willy Tarreau	a094fde2b6	MINOR: checks: use b_putist() instead of b_putstr() This slightly simplifies the code.	2018-07-19 16:23:43 +02:00
Willy Tarreau	ea1b06d5bb	MINOR: buffer: add a new file for ist + buffer manipulation functions The new file istbuf.h links the indirect strings (ist) with the buffers. The purpose is to encourage addition of more standard buffer manipulation functions that rely on this in order to improve the overall ease of use along all the code. Just like ist.h and buf.h, this new file is not expected to depend on anything beyond these two files. A few functions were added and/or converted from buffer.h : - b_isteq() : indicates if a buffer and a string match - b_isteat() : consumes a string from the buffer if it matches - b_istput() : appends a small string to a buffer (all or none) - b_putist() : appends part of a large string to a buffer The equivalent functions were removed from buffer.h and changed at the various call places.	2018-07-19 16:23:43 +02:00
Willy Tarreau	55372f646f	MINOR: buffer: replace b{i,o}_put* with b_put* The two variants now do exactly the same (appending at the tail of the buffer) so let's not keep the distinction between these classes of functions and have generic ones for this. It's also worth noting that b{i,o}_putchk() wasn't used at all and was removed.	2018-07-19 16:23:43 +02:00
Willy Tarreau	72a100b386	MINOR: buffer: replace bi_fast_delete() with b_del() There's no distinction between in and out data now. The latter covers the needs of the former and supports wrapping. The extra cost is negligible given the locations where it's used.	2018-07-19 16:23:43 +02:00
Olivier Houchard	08afac0fd7	MEDIUM: buffers: move "output" from struct buffer to struct channel Since we never access this field directly anymore, but only through the channel's wrappers, it can now move to the channel. The buffers are now completely free from the distinction between input and output data.	2018-07-19 16:23:43 +02:00
Willy Tarreau	892f1dbe4f	MINOR: buffer: rename the "data" field to "area" Since we use "_data" for the amount of data at many places, as opposed to "_space" for the amount of space, let's rename the "data" field to "area" so that we can reuse "data" later for the amount of data in the buffer (currently called "len" despite not being contigous).	2018-07-19 16:23:43 +02:00
Willy Tarreau	f6dfd88a92	MINOR: buffer: b_set_data() doesn't truncate output data anymore b_set_data() is used : - in proto_http and hlua to trim input data (b_set_data(co_data())) - in SPOE to append data to a buffer while building a message In no case will this truncate a buffer so we can safely remove the test for len < b->output.	2018-07-19 16:23:43 +02:00
Willy Tarreau	abed1e7f34	MINOR: buffer: remove the check for output on b_del() b_del() is used in : - mux_h2 with the demux buffer : always processes input data - checks with output data though output is not considered at all there - b_eat() which is not used anywhere - co_skip() where the len is always <= output Thus the distinction for output data is not needed anymore and the decrement can be made inconditionally in co_skip().	2018-07-19 16:23:43 +02:00
Willy Tarreau	d54a8ceb97	MAJOR: start to change buffer API This is intentionally the minimal and safest set of changes, some cleanups area still required. These changes are quite tricky and cannot be independantly tested, so it's important to keep this patch as bisectable as possible. buf_empty and buf_wanted were changed and are now exactly similar since there's no <p> member in the structure anymore. Given that no test is ever made in the code to check that buf == &buf_wanted, it may be possible that we don't need to have two anymore, unless some buf_empty tests have precedence. This will have to be investigated. A significant part of this commit affects the HTTP compression code, which used to deeply manipulate the input and output buffers without any reasonable solution for a better abstraction. For this reason, if any regression is met and designates this patch as the culprit, it is important to run tests which specifically involve compression or which definitely don't use it in order to spot the issue. Cc: Olivier Houchard <ohouchard@haproxy.com>	2018-07-19 16:23:42 +02:00
Willy Tarreau	81521ed850	MINOR: buffer: adapt buffer_slow_realign() and buffer_dump() to the new API These are the last ones which need to be adapted.	2018-07-19 16:23:42 +02:00
Willy Tarreau	523cc5d506	MINOR: buffer: convert part bo_putblk() and bi_putblk() to the new API These functions are pretty similar and will be merged at the end of the migration. For now they still need to remain distinct.	2018-07-19 16:23:42 +02:00
Willy Tarreau	a79021af6f	MINOR: lua: use the wrappers instead of directly manipulating buffer states This replaces chn->buf->p with ci_head(chn), chn->buf->o with co_data(chn) and chn->buf->i with ci_data(chn). This is in order to help porting to the new buffer API.	2018-07-19 16:23:42 +02:00
Olivier Houchard	0b662843c8	MEDIUM: compression: start to move to the new buffer API This part is tricky, it passes a channel where we used to have a buffer, in order to reduce the API changes during the big switch. This way all the channel's wrappers to distinguish between input and output are available. It also makes sense given that the compression applies on a channel since it's in the forwarding path.	2018-07-19 16:23:42 +02:00
Willy Tarreau	f158937620	MINOR: flt_trace: adapt to the new buffer API The trace_hexdump() function now takes a count in argument to know where to start dumping from.	2018-07-19 16:23:42 +02:00

1 2 3 4 5 ...

7758 Commits