haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-01-02 02:02:03 +00:00

Author	SHA1	Message	Date
Willy Tarreau	a3b62d374a	MINOR: stream-int: add a new blocking condition on the remote connection There are some situations where we need to wait for the other side to be connected. None of the current blocking flags support this. It used to work more or less by accident using the old flags. Let's add a new flag to mention we're blocking on this, it's removed by si_chk_rcv() when a connection is established. It should be enough for now.	2018-12-06 15:24:01 +01:00
William Lallemand	27f3fa56f5	BUG/MEDIUM: mworker: stop every tasks in the master The master is not supposed to run (at the moment) any task before the polling loop. The created tasks should be run only in the workers; in the master they should be disabled or removed. No backport needed.	2018-12-06 14:12:58 +01:00
Christopher Faulet	aa75b3d2d5	CLEANUP: htx: Fix indentation here and there in HTX files	2018-12-05 17:33:14 +01:00
Christopher Faulet	b2aedea142	MEDIUM: channel/htx: Add functions for forward HTX data To ease the fast forwarding and the infinite forwarding on HTX proxies, 2 functions have been added to let the channel be almost aware of the way data are stored in its buffer. By calling these functions instead of legacy ones, we are sure to forward the right amount of data.	2018-12-05 17:29:30 +01:00
Christopher Faulet	27ba2dc6d6	MEDIUM: htx: Rework conversion from a buffer to an htx structure Now, the function htx_from_buf() will set the buffer's length to its size automatically. In return, the caller should call htx_to_buf() at the end to be sure to leave the buffer hosting the HTX message in the right state. The caller can also use the function htxbuf() to get the HTX message without any update on the underlying buffer.	2018-12-05 17:10:16 +01:00
Willy Tarreau	3906e22f6f	MINOR: htx: add buf_room_for_htx_data() to help optimize buffer transfers The small HTX overhead is enough to make the system perform multiple reads and unaligned memory copies. Here we provide a function whose purpose is to reduce the apparent room in a buffer by the size of the overhead for DATA blocks, which is the struct htx plus 2 blocks (one for DATA, one for the end of message so that small blocks can fit at once). The muxes using HTX will be encouraged to use this one instead of b_room() to compute the available buffer room and avoid filling their demux buf with more data than can fit at once into the HTX buffer.	2018-12-05 10:57:42 +01:00
Willy Tarreau	8ae4235f94	MINOR: htx: make htx_from_buf() adjust the size only on new buffers This one is used a lot during transfers, let's avoid resetting its size when there are already data in the buffer since it implies the size is correct.	2018-12-05 10:57:42 +01:00
Christopher Faulet	c59ff23804	MINOR: htx: Rename functions htx_*_to_str() to be H1 specific "_to_h1" suffix is now used because these functions produce H1 strings. It avoids any ambiguity on the output format.	2018-12-04 05:51:37 +01:00
Joseph Herlant	75a323f04e	CLEANUP: Fix a typo in the listener subsystem Fixes a typo in the code comment of the listener subsystem.	2018-12-02 18:43:28 +01:00
Joseph Herlant	f69b807fa4	CLEANUP: Fix typos in the file descriptor subsystem Fixes 2 typos in the code comment of the file descriptor subsystem.	2018-12-02 18:43:25 +01:00
Joseph Herlant	0b75e63dc5	CLEANUP: Fix a typo in the checks header file Fixes a typo in the code comments of the checks header file.	2018-12-02 18:43:21 +01:00
Joseph Herlant	eeac3c722f	CLEANUP: Fix a typo in the protocol header file Fixes a typo in the code comments of the header file holding the general protocol primitives.	2018-12-02 18:42:49 +01:00
Joseph Herlant	8a95a6e5ed	CLEANUP: Fix a typo in the connection subsystem Fixes a typo in the code comments of the connection subsystem.	2018-12-02 18:42:12 +01:00
Joseph Herlant	41abef77cb	CLEANUP: Fix a typo in the mini-clist header Fixes a typo in the code comments of the mini-clist header.	2018-12-02 18:38:15 +01:00
Joseph Herlant	30bc509c40	CLEANUP: Fix typos in the h1 subsystem Fixes typos in the code comments of the h1 subsystem.	2018-12-02 18:38:02 +01:00
Joseph Herlant	be7619aaca	CLEANUP: Fix typo in the chunk headers file Fix a typo detected in the chunk.h header file's code comments.	2018-12-02 18:37:56 +01:00
Joseph Herlant	c42c0e9969	CLEANUP: fix typos in the htx subsystem Fix typos detected in the code comments of the htx subsystem.	2018-12-02 18:37:50 +01:00
Olivier Houchard	0c18a6fe34	MEDIUM: servers: Add a way to keep idle connections alive. Add a new keyword for servers, "idle-timeout". If set, unused connections are kept alive until the timeout happens, and will be picked for reuse if no other connection is available.	2018-12-02 18:16:53 +01:00
Olivier Houchard	8defe4b51a	MINOR: mux: add a "max_streams" method. Add a new method to muxes, "max_streams", that returns the max number of streams the mux can handle. This will be used to know if a mux is in use or not.	2018-12-02 17:48:32 +01:00
Olivier Houchard	f3e65b086d	MINOR: connection: Fix a comment. Connections can now have an owner for outgoing connections, so update the comment to reflect that.	2018-12-02 17:48:28 +01:00
Willy Tarreau	1329b5be71	MINOR: h2: add new functions to produce an HTX message from an H2 response The new function h2_prepare_htx_stsline() produces an HTX response message from an H2 response presented as a list of header fields.	2018-12-02 13:30:17 +01:00
Willy Tarreau	3fbea1d8d0	MINOR: server: the mux_proto entry in the server is const Same as previous commit. We'll have to update this one soon, let's avoid any cast and mark it const as it really is.	2018-12-02 13:12:16 +01:00
Willy Tarreau	5fc311c001	MINOR: connection: create conn_get_best_mux_entry() We currently have conn_get_best_mux() to return the best mux for a given protocol name, side and proxy mode. But we need the mux entry as well in order to fix the bind_conf and servers at the end of the config parsing. Let's split the function in two parts. It's worth noting that the <conn> argument is never used anymore so this part is eligible for some cleanup.	2018-12-02 13:12:16 +01:00
Willy Tarreau	a004ae3e66	MINOR: listener: the mux_proto entry in the bind_conf is const We'll have to update this one soon, let's avoid any cast and mark it const as it really is.	2018-12-02 13:12:15 +01:00
Willy Tarreau	6deb4129de	MINOR: h2: implement H2->HTX request header frame transcoding Till now we could only produce an HTTP/1 request from a list of H2 request headers. Now the new function h2_make_htx_request() does the same but using the HTX encoding instead, while respecting the H2 semantics. The code is not much different from the first version, only the encoding differs. For now it's not used.	2018-12-01 17:38:32 +01:00
Christopher Faulet	75bc913d23	MAJOR: filters: Adapt filters API to be compatible with the HTX represenation First, to be called on HTX streams, a filter must explicitly be declared as compatible by setting the flag FLT_CFG_FL_HTX on the filter's config at HAProxy startup. This flag is checked when a filter implementation is attached to a stream. Then, some changes have been made on HTTP callbacks. The callback http_payload has been added to filter HTX data. It will be called on HTX streams only. It replaces the callbacks http_data, http_chunk_trailers and http_forward_data, called on legacy HTTP streams only and marked as deprecated. The documentation (once updated) will give all information to implement this new callback. Other HTTP callbacks will be called for HTX and HTTP legacy streams. So it is the filter's responsibility to know which kind of data it handles. The macro IS_HTX_STRM should be used in such cases. There is at least one noticeable change in the way data are forwarded. In HTX, after the call to the callback http_headers, all the headers are considered as forwarded. So, in http_payload, only the body and eventually the trailers will be filtered.	2018-12-01 17:37:27 +01:00
Christopher Faulet	e44769b4fa	MINOR: mux-h1: Capture bad H1 messages First of all, a dedicated error snapshot, h1_snapshot, has been added. It contains more or less the same info as http_snapshot but adapted for H1 messages. Then, the function h1_capture_bad_message() has been added to capture bad H1 messages. And finally, the function h1_show_error_snapshot() is used to dump these errors. Only header or data parsing errors are captured.	2018-12-01 17:37:27 +01:00
Christopher Faulet	a7b677cd0d	MEDIUM: proto_htx: Convert all HTTP error messages into HTX During startup, after the configuration parsing, all HTTP error messages (errorloc, errorfile or default messages) are converted into HTX messages and stored in dedicated buffers. We use it to return errors in the HTX analyzers instead of using ugly OOB blocks.	2018-12-01 17:37:27 +01:00
Christopher Faulet	b2db4fa016	MINOR: htx: Add BODYLESS flags on the HTX start-line and the HTTP message The flags HTX_SL_F_BODYLESS and HTTP_MSGF_BODYLESS have been added. These flags are set when the corresponding HTTP message has no body at all.	2018-12-01 17:37:27 +01:00
Christopher Faulet	f1ba18d7b3	MEDIUM: htx: Don't rely on h1_sl anymore except during H1 header parsing Instead, we now use the htx_sl coming from the HTX message. It avoids having too much H1-specific code in version-agnostic parts. Of course, the concept of the start-line is highly influenced by H1, but the structure htx_sl can be adapted, if necessary. And many things depend on a start-line during HTTP analysis. Using the structure htx_sl also avoids boring conversions between the HTX version and the H1 version.	2018-12-01 17:37:27 +01:00
Christopher Faulet	54483df5ba	MINOR: htx: Add the start-line offset for the HTX message in the HTX structure If there is no start-line, this offset is set to -1. Otherwise, it is the relative address where the start-line is stored in the data block. When the start-line is added, replaced or removed, this offset is updated accordingly. On remove, if the start-line is no longer set and if the next block is a start-line, the offset is updated. Finally, when an HTX structure is defragmented, the offset is also updated accordingly.	2018-12-01 17:37:27 +01:00
Christopher Faulet	570d1614fa	MEDIUM: htx: Change htx_sl to be a struct instead of an union The HTX start-line is now a struct. It will be easier to extend, if needed. Same info can be found, of course. In addition it is now possible to set flags on it. It will be used to set some infos about the message. Some macros and functions have been added in proto/htx.h to help accessing different parts of the start-line.	2018-12-01 17:37:27 +01:00
Christopher Faulet	14e88252f2	MINOR: htx: Add a function to find the HTX block corresponding to a data offset The function htx_find_blk() returns the HTX block containing data with a given offset, relatively to the beginning of the HTX message. It is a good way to skip outgoing data and find the first HTX block not already processed.	2018-12-01 17:37:27 +01:00
Christopher Faulet	d16b0a7b2d	MINOR: htx: Add function to iterate on an HTX message using HTX blocks The functions htx_get_next() and htx_get_prev() are used to iterate on an HTX message using block positions. With htx_get_next_blk() and htx_get_prev_blk(), it is possible to do the same, but with HTX blocks. Of course, internally, we rely on the position-based versions to do so. But it is handy for callers not to have to deal with block positions.	2018-12-01 17:37:27 +01:00
Christopher Faulet	24ed835129	MINOR: htx: Add function to add an HTX block just before another one The function htx_add_data_before() can be used to add an HTX block before another one. For instance, it could be used to add some data before the end-of-message marker.	2018-12-01 17:37:27 +01:00
Christopher Faulet	3bc1b11dae	MEDIUM: conn_stream: Add a way to get mux's info on a CS from the upper layer From time to time, the need arises to get some info owned by the multiplexer about a connection stream from the upper layer. Today we really need to get some dates and durations specific to the conn_stream. It is only true for the mux H1 and H2. Otherwise it will be impossible to have correct times reported in the logs. To do so, the structure cs_info has been defined to provide all info we ever need on a conn_stream from the upper layer. Of course, it is the first step. So this structure will certainly evolve. But for now, only the bare minimum is referenced. On the mux side, the callback get_cs_info() has been added in the structure mux_ops. Multiplexers can now implement it, if necessary, to return a pointer to a structure cs_info. And finally, the function si_get_cs_info() should be used from the upper layer. If the stream interface is not attached to a connection stream, this function returns NULL, likewise if the callback get_cs_info() is not defined for the corresponding mux.	2018-12-01 17:37:27 +01:00
Willy Tarreau	c01ed9ff20	MINOR: htx: add a function to cut the beginning of a DATA block htx_cut_data_blk() is used to cut the beginning of a DATA block after a part of it was transferred. It simply advances the address, reduces the advertised length and updates the htx's total data count.	2018-12-01 17:36:59 +01:00
Willy Tarreau	d3c49d17dc	BUG/MINOR: connection: report mux modes when HTX is supported It looks like we forgot to report HTX when listing the muxes and their respective protocols, leading to "NONE" being displayed. Let's report "HTX" and "HTTP|HTX" since both will exist. Also fix a minor typo in the output message.	2018-12-01 17:33:35 +01:00
Olivier Houchard	00cf70f28b	MAJOR: sessions: Store multiple outgoing connections in the session. Instead of just storing the last connection in the session, store all of the connections, for at most MAX_SRV_LIST (currently 5) targets. That way we can do keepalive on more than 1 outgoing connection when the client uses HTTP/2.	2018-12-01 10:47:18 +01:00
William Lallemand	4b58c80ee2	REORG: mworker: declare master variable in global.h This variable is used at several places, better declare it in global.h.	2018-11-27 19:34:00 +01:00
Willy Tarreau	7f0165e399	MEDIUM: memory: make the pool cache an array and not a thread_local Having a thread_local for the pool cache is messy as we need to initialize all elements upon startup, but we can't until the threads are created, and once created it's too late. For this reason, the allocation code used to check for the pool's initialization, and it was the release code which used to detect the first call and to initialize the cache on the fly, which is not exactly optimal. Now that we have initcalls, let's turn this into a per-thread array. This array is initialized very early in the boot process (STG_PREPARE) so that pools are always safe to use. This allows to remove the tests from the alloc/free calls. Doing just this has removed 2.5 kB of code on all cumulated pool_alloc() and pool_free() paths.	2018-11-26 19:50:32 +01:00
Willy Tarreau	b6b3df3ed3	MEDIUM: initcall: use initcalls for a few initialization functions signal_init(), init_log(), init_stream(), and init_task() all used to only preset some values and lists. This needs to be done very early to provide a reliable interface to all other users. The calls used to be explicit in haproxy.c:init(). Now they're placed in initcalls at the STG_PREPARE stage. The functions are not exported anymore.	2018-11-26 19:50:32 +01:00
Willy Tarreau	2455cebe00	MEDIUM: memory: use pool_destroy_all() to destroy all pools on deinit() Instead of exporting a number of pools and having to manually delete them in deinit() or to have dedicated destructors to remove them, let's simply kill all pools on deinit(). For this a new function pool_destroy_all() was introduced. As its name implies, it destroys and frees all pools (provided they don't have any user anymore of course). This allowed to remove 4 implicit destructors, 2 explicit ones, and 11 individual calls to pool_destroy(). In addition it properly removes the mux_pt_ctx pool which was not cleared on exit (no backport needed here since it's 1.9 only). The sig_handler pool doesn't need to be exported anymore and became static now.	2018-11-26 19:50:32 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	7107c8b494	MINOR: memory: add a callback function to create a pool The new function create_pool_callback() takes 3 args including the return pointer, and creates a pool with the specified name and size. In case of allocation error, it emits an error message and returns. The new macro REGISTER_POOL() registers a callback using this function and will be usable to request some pools creation and guarantee that the allocation will be checked. An even simpler approach is to use DECLARE_POOL() and DECLARE_STATIC_POOL() which declare and register the pool.	2018-11-26 19:50:32 +01:00
Willy Tarreau	e655251e80	MINOR: initcall: use initcalls for section parsers The two calls to cfg_register_section() and cfg_register_postparser() are now supported by initcalls. This allowed to remove two other constructors.	2018-11-26 19:50:32 +01:00
Willy Tarreau	172f5ce948	MINOR: initcall: use initcalls for most post_{check,deinit} and per_thread* Most calls to hap_register_post_check(), hap_register_post_deinit(), hap_register_per_thread_init(), hap_register_per_thread_deinit() can be done using initcalls and will not require a constructor anymore. Let's create a set of simplified macros for this, called respectively REGISTER_POST_CHECK, REGISTER_POST_DEINIT, REGISTER_PER_THREAD_INIT, and REGISTER_PER_THREAD_DEINIT. Some files were not modified because they wouldn't benefit from this or because they conditionally register (e.g. the pollers).	2018-11-26 19:50:32 +01:00
Willy Tarreau	8071338c78	MINOR: initcall: apply initcall to all register_build_opts() calls Most register_build_opts() calls use static strings. These ones were replaced with a trivial REGISTER_BUILD_OPTS() statement adding the string and its call to the STG_REGISTER section. A dedicated section could be made for this if needed, but there are very few such calls for this to be worth it. The calls made with computed strings however, like those which retrieve OpenSSL's version or zlib's version, were moved to a dedicated function to guarantee they are called late in the process. For example, the SSL call probably requires that SSL_library_init() has been called first.	2018-11-26 19:50:32 +01:00
Willy Tarreau	90fa97b65e	MINOR: threads: add new macros to declare self-initializing locks Using __decl_spinlock(), __decl_rwlock(), __decl_aligned_spinlock() and __decl_aligned_rwlock(), one can now simply declare a spinlock or an rwlock which will automatically be initialized at boot time by calling the ha_spin_init() or ha_rwlock_init() callback. The "aligned" variants enforce a 64-byte alignment on the lock.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a8ae77da61	MINOR: thread: provide a set of lock initialisers This patch adds ha_spin_init() and ha_rwlock_init() which are used as a callback to initialise locks at boot time. They perform exactly the same as HA_SPIN_INIT() or HA_RWLOCK_INIT() but from within a real function.	2018-11-26 19:50:32 +01:00
Willy Tarreau	d13a9281bd	MINOR: initcall: introduce a way to register init functions to call at boot We currently have to deal with multiple initialization stages in a way that can be confusing, because certain parts rely on others having been properly initialized. Most calls consist in adding lists to existing lists, whose heads are initialized in the declaration so this is easy. But some calls create new pools and require pools to be properly initialized. Pools currently are thread-local and as such cannot be pre-initialized, requiring run-time checks. All this could be simplified by using multiple boot stages and allowing functions to be registered at various stages. One approach might be to use gcc's constructor priorities, but this requires gcc >= 4.3 which eliminates a wide spectrum of working compilers, and some versions of certain compilers (like clang 3.0) are known to silently ignore these priorities. Instead we can use our own init function registration mechanism. A first attempt was made using register_function() calls in all constructors but this made the code more painful. This patch's approach is different. It creates sections containing arrays of pointers to "initcall" descriptors. An initcall contains a pointer to a function and an argument. Each section corresponds to a specific initialization stage. Each module creates such descriptors for various calls it requires. The main() function starts by scanning each of these sections in turn to process these initcalls. This will make it possible to remove many constructors from various modules, by simply placing initcalls for the requested functions next to the keyword lists that need to be called. A first attempt was made by placing the initcalls directly into the sections instead of creating an array of pointers, but it becomes sensitive to the array's alignment which depends on the compiler and the linker, so it seems too fragile.
For now we support 6 init stages : - STG_PREPARE : preset variables, tables and list heads - STG_LOCK : initialize spinlocks and rwlocks - STG_ALLOC : allocate the required structures - STG_POOL : create pools - STG_REGISTER : register static lists (keywords etc) - STG_INIT : subsystems normal initialization These ones are declared directly in the files where they are needed using one of the INITCALL* macros, passing 0 to 3 pointers as arguments. The API should possibly be extended to support a return value to give a status to the caller, and to support a unified API, possibly a bit more flexibility in the arguments. In this case it might make sense to support a set of macros to register functions having a different API and to pass the function type in the initcall itself. Special thanks to Olivier for showing how to scan sections as this is not something particularly well documented and exactly what I've been missing to achieve this.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a7280a1ec2	BUILD: buffers: buf.h requires unistd to get ssize_t on libmusl Building with musl and gcc-5.3 for MIPS returns this : include/common/buf.h: In function 'b_dist': include/common/buf.h:252:2: error: unknown type name 'ssize_t' ssize_t dist = to - from; ^ Including stdint or stddef is not sufficient there to get ssize_t, unistd is needed as well. It's likely that other platforms will have the same issue. This patch also addresses it in ist.h and memory.h.	2018-11-26 19:49:21 +01:00
Willy Tarreau	6689609090	BUILD: htx: fix fprintf format inconsistency on 32-bit platforms Building on 32 bits gives this : include/proto/htx.h: In function 'htx_dump': include/proto/htx.h:443:25: warning: format '%lu' expects argument of type 'long unsigned int', but argument 8 has type 'uint64_t {aka long long unsigned int}' [-Wformat=] fprintf(stderr, "htx:%p [ size=%u - data=%u - used=%u - wrap=%s - extra=%lu]\n", ^ In htx_dump(), fprintf() uses %lu but the value is an uint64_t so it doesn't match on 32-bit. Let's cast this to unsigned long long and use %llu instead.	2018-11-26 19:37:32 +01:00
Olivier Houchard	ee23b2a1e3	MEDIUM: servers: Store the connection in the SI until we have a mux. When we create a connection, if we have to defer the conn_stream and the mux creation until we can decide it (ie until the SSL handshake is done, and the ALPN is decided), store the connection in the stream_interface, so that we're sure we can destroy it if needed.	2018-11-23 19:11:14 +01:00
Olivier Houchard	201b9f4eb5	MAJOR: connections: Defer mux creation for outgoing connection if alpn is set. If an ALPN (or a NPN) was chosen for a server, defer choosing the mux until after the SSL handshake is done, and the ALPN/NPN has been negotiated, so that we know which mux to pick.	2018-11-22 19:52:23 +01:00
Olivier Houchard	c756600103	MINOR: server: Add "alpn" and "npn" keywords. Add new keywords to "server" lines, alpn and npn. If set, when connecting through SSL, those alpn/npn will be negotiated during the SSL handshake.	2018-11-22 19:50:08 +01:00
Willy Tarreau	beb859abce	MINOR: polling: add an option to support busy polling In some situations, especially when dealing with low latency on processors supporting a variable frequency or when running inside virtual machines, each time the process waits for an I/O using the poller, the processor goes back to sleep or is offered to another VM for a long time, and it causes excessively high latencies. A solution to this provided by this patch is to enable busy polling using a global option. When busy polling is enabled, the pollers never sleep and loop over themselves waiting for an I/O event to happen or for a timeout to occur. On multi-processor machines it can significantly overheat the processor but it usually results in much lower latencies. A typical test consisting in injecting traffic over a single connection at a time over the loopback shows a bump from 4640 to 8540 connections per second on forwarded connections, indicating a latency reduction of 98 microseconds for each connection, and a bump from 12500 to 21250 for locally terminated connections (redirects), indicating a reduction of 33 microseconds. It is only usable with epoll and kqueue because select() and poll()'s API is not convenient for such usages, and the level of performance they are used in doesn't benefit from this anyway. The option, which obviously remains disabled by default, can be turned on using "busy-polling" in the global section, and turned off later using "no busy-polling". Its status is reported in "show info" to help troubleshooting suspicious CPU spikes.	2018-11-22 19:47:30 +01:00
Willy Tarreau	48f8bc1368	MINOR: poller: move the call of tv_update_date() back to the pollers The reason behind this will be to be able to compute a timeout when busy polling.	2018-11-22 18:57:37 +01:00
Willy Tarreau	9efd7456e0	MEDIUM: tasks: collect per-task CPU time and latency Right now we measure for each task the cumulated time spent waiting for the CPU and using it. The timestamp uses a 64-bit integer to report a nanosecond-level date. This is only enabled when "profiling.tasks" is enabled, and consumes less than 1% extra CPU on x86_64 when enabled. The cumulated processing time and wait time are reported in "show sess". The task's counters are also reset when an HTTP transaction is reset since the HTTP part pretends to restart on a fresh new stream. This will make sure we always report correct numbers for each request in the logs.	2018-11-22 15:44:21 +01:00
Willy Tarreau	75c62c2793	MINOR: activity: add configuration and CLI support for "profiling.tasks" This is a new global setting which enables or disables CPU profiling per task. For now it only sets/resets the variable based on the global option "profiling.tasks" and supports showing it as well as setting it from the CLI using "show profiling" and "set profiling". The option will be used by a future commit. It was done in a way which should ease future addition of profiling options.	2018-11-22 11:48:51 +01:00
Willy Tarreau	baba82fe70	MINOR: activity: report the average loop time in "show activity" Since we know the time it takes to process everything between two poll() calls, we can use this as the max latency measurement any task will experience and average it. This code does this, and reports in "show activity" the average of this loop time over the last 1024 poll() loops, for each thread. It will vary quickly at high loads and slowly under low to moderate loads, depending on the rate at which poll() is called. The latency a task experiences is expected to be half of this on average.	2018-11-22 11:48:41 +01:00
Willy Tarreau	609aad9e73	REORG: time/activity: move activity measurements to activity.{c,h} At the moment the situation with activity measurement is quite tricky because the struct activity is defined in global.h and declared in haproxy.c, with operations made in time.h and relying on freq_ctr which are defined in freq_ctr.h which itself includes time.h. It's barely possible to touch any of these files without breaking all the circular dependency. Let's move all this stuff to activity.{c,h} and be done with it. The measurement of active and stolen time is now done in a dedicated function called just after tv_before_poll() instead of mixing the two, which used to be a lazy (but convenient) decision. No code was changed, stuff was just moved around.	2018-11-22 11:48:41 +01:00
Willy Tarreau	17306b905e	MINOR: cli: add a few missing includes in proto/cli.h Just found that proto/cli.h doesn't build if types/cli.h is not also included by the caller, as it uses cli_kw_list in arguments. But it's also true for a few other ones like mworker_proc, stream, and channel, so let's fix this.	2018-11-22 11:47:53 +01:00
William Lallemand	31a1c1d5e7	MEDIUM: signal: signal_unregister() removes every handlers The new function signal_unregister() removes every handler assigned to a signal. Once the handler list of the signal is empty, the signal is ignored with SIG_IGN.	2018-11-22 11:42:51 +01:00
William Lallemand	db6bdfbf68	MINOR: cli: add mworker_accept_wrapper to 'show fd' In the output of 'show fd', the worker CLI's socketpair was still handled by an "unknown" function. That can be really confusing during debug. Fixed it by showing "mworker_accept_wrapper" instead.	2018-11-22 11:42:51 +01:00
William Lallemand	9c56a22b20	MINOR: log: introduce ha_notice() It's like ha_warning() or ha_alert() but with a NOTICE prefix.	2018-11-21 19:02:23 +01:00
William Lallemand	944e619b64	MEDIUM: mworker: wait mode use standard init code path The mworker waitpid mode (which is used when a reload failed to apply the new configuration) was still using a specific initialisation path. That's a problem since we use a polling loop in the master now, the master proxy is not initialized and the master CLI is not activated. This patch removes the initialisation code of the wait mode and introduces MODE_MWORKER_WAIT in order to use the same init path as MODE_MWORKER with some exceptions. It allows the master proxy and the master CLI to be used during the waitpid mode.	2018-11-21 17:05:30 +01:00
William Lallemand	16dd1b3ead	MINOR: cli: show master information in 'show proc' Displays the master information in show proc.	2018-11-20 04:43:54 +01:00
William Lallemand	e368330128	MINOR: cli: displays uptime in `show proc` Displays the uptime of the workers in `show proc`	2018-11-20 04:43:54 +01:00
Willy Tarreau	3a1f5fda10	REORG: config: extract the proxy parser into cfgparse-listen.c This was the largest function of the whole file, taking a rough second to build alone. Let's move it to a distinct file along with a few dependencies. Doing so saved about 2 seconds on the total build time.	2018-11-19 06:47:09 +01:00
Willy Tarreau	36b9e222bb	REORG: config: extract the global section parser into cfgparse-global The config parser is the largest file to build and its build dominates the total project's build time. Let's start to split it into multiple smaller pieces by extracting the "global" section parser into a new file called "cfgparse-global.c". This removes 1/4th of the file's build time.	2018-11-19 06:41:57 +01:00
Joseph Herlant	32b8327266	CLEANUP: Fix typos in the standard subsystem Fix typos in the code comments of the standard subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	f7f6031184	CLEANUP: Fix typos in the spoe subsystem Fix typos in the code comments of the spoe subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	757f5ad73a	CLEANUP: Fix typos in the sample subsystem Fix some typos in the code comment of the sample subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	85b4059b82	CLEANUP: Fix typos in the log subsystem Fix some misspells in the code comments of the log subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	b35ea68081	CLEANUP: Fix typos in the filters subsystem Fix typos in the code comments of the filters subsystems.	2018-11-18 22:26:42 +01:00
Joseph Herlant	59dd295926	CLEANUP: fix typos in the proxy subsystem Fix typos in the code comments of the proxy subsystem.	2018-11-18 22:23:15 +01:00
Joseph Herlant	5ba8025976	CLEANUP: fix typos in the proto_http subsystem Fixes typos in the code comments of the proto_http subsystem.	2018-11-18 22:23:15 +01:00
Joseph Herlant	44466826b1	CLEANUP: fix a few typos in the comments of the server subsystem A few misspellings were detected in the server subsystem. This commit fixes them.	2018-11-18 22:23:15 +01:00
Joseph Herlant	42cf6395c4	CLEANUP: Fix typos in the dns subsystem Fix misspells in the code comments of the dns subsystem.	2018-11-18 22:23:15 +01:00
Christopher Faulet	ef453ed9b0	MINOR: http_fetch: Add smp_prefetch_htx It does the same as smp_prefetch_http but for HTX messages. It can be called from an HTTP proxy or a TCP proxy. For HTTP proxies, the parsing is handled by the mux, so it does nothing but wait. For TCP proxies, it tries to parse an HTTP message and to convert it into a temporary HTX message. Sample fetches will use this temporary variable to do their job.	2018-11-18 22:09:00 +01:00
Christopher Faulet	fefc73da34	MINOR: proto_htx: Add functions htx_perform_server_redirect It is more or less the same as the legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	64159df1fb	MINOR: proto_htx: Add functions htx_send_name_header It is more or less the same as the legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	25a02f65b1	MINOR: proto_htx: Add functions to check the cacheability of HTX messages It is more or less the same as the legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	8d8ac191a7	MINOR: proto_htx: Add functions htx_req_replace_stline and htx_res_set_status It is more or less the same as the legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch on the HTX code when applicable.	2018-11-18 22:08:56 +01:00
Christopher Faulet	7233352fe4	MINOR: proto_htx: Add functions htx_transform_header and htx_transform_header_str It is more or less the same as the legacy versions but adapted to be called from HTX analyzers.	2018-11-18 22:08:56 +01:00
Christopher Faulet	7ff1ceaa5e	MINOR: http_htx: Add functions to retrieve a specific occurrence of a header There are 2 functions. The first one considers any comma as a delimiter for distinct values. The second one considers full-line headers.	2018-11-18 22:08:55 +01:00
Christopher Faulet	e010c80753	MINOR: http_htx: Add functions to replace part of the start-line	2018-11-18 22:08:54 +01:00
Christopher Faulet	0f226958b7	MINOR: proto_htx: Add some functions to handle HTX messages More functions will come, but it is the minimum to switch HTX analyzers on the HTX internal representation.	2018-11-18 22:08:54 +01:00
Christopher Faulet	47596d3787	MINOR: http_htx: Add functions to manipulate HTX messages in http_htx.c This file will host all functions to manipulate HTTP messages using the HTX representation. Functions in this file will be able to be called from anywhere and are mainly related to the HTTP semantics.	2018-11-18 22:08:53 +01:00
Christopher Faulet	a3d2a16fad	MEDIUM: htx: Add API to deal with the internal representation of HTTP messages The internal representation of an HTTP message, called HTX, is a structured representation, unlike the old one which is a raw representation of messages. The idea is to have a version-agnostic representation of the HTTP messages, which can easily be used to handle HTTP/1, HTTP/2 and hopefully QUIC messages, and communication from one of them to another. In this patch, we add types to define the internal representation itself and the main functions to manipulate them.	2018-11-18 22:08:53 +01:00
Christopher Faulet	f2824e6e10	MAJOR: mux-h1/proto_htx: Handle keep-alive connections in the mux Now, the connection mode is detected in the mux and not in HTX analyzers anymore. Keep-alive connections are now managed by the mux. A new stream is created for each transaction. This removes the most important part of the synchronization between channels and the HTTP transaction cleanup. These changes only affect the HTX part (proto_htx.c). Legacy HTTP analyzers remain untouched for now. On the client-side, the mux is responsible for creating new streams when a new request starts. It is also responsible for parsing and updating the "Connection:" header of the response. On the server-side, the mux is responsible for parsing and updating the "Connection:" header of the request. Muxes on each side are independent. For now, there is no connection pool on the server-side, so it always closes the server connection.	2018-11-18 22:02:42 +01:00
Christopher Faulet	e0768ebabc	MEDIUM: proto_htx: Add HTX analyzers and use it when the mux H1 is used For now, these analyzers are just copies of the legacy HTTP analyzers. But, during the HTTP refactoring, it will be the main place where it will be visible. And in legacy analyzers, the macro IS_HTX_STRM is used to know if the HTX version should be called or not. Note: the following commits were applied to proto_http.c after this patch was developed and need to be studied to see if an adaptation to htx is required : `fd9b68c` BUG/MINOR: only mark connections private if NTLM is detected	2018-11-18 21:45:50 +01:00
Christopher Faulet	1d5b85aba2	MINOR: http: Add macros to check if a stream uses the HTX representation To prepare the refactoring of the code handling HTTP messages, these macros will help to use HTX functions instead of legacy ones when the new HTX internal representation is in use. To do so, for a given stream, we will check if its frontend has the option PR_O2_USE_HTX. It is useless to test backend options because it is not possible to mix the HTX representation and the legacy one (i.e, having an HTX frontend and a legacy backend or vice versa).	2018-11-18 21:45:50 +01:00
Christopher Faulet	effc3750cc	MINOR: conn_stream: Add a flag to notify the SI some data were received The flag CS_FL_READ_PARTIAL can be set by the mux on the conn_stream to notify the stream interface that some data were received. It is used in si_cs_recv to re-arm read timeout on the channel.	2018-11-18 21:45:49 +01:00
Christopher Faulet	27a3dc8fb2	MINOR: http: Call http_send_name_header with the stream instead of the txn This is just a minor change to ease integration of the HTX.	2018-11-18 21:45:49 +01:00
Christopher Faulet	8277ca72b1	MINOR: http: Add standalone functions to parse a start-line or a header These 2 functions are pretty naive. They only split a start-line into its 3 substrings or a header line into its name and value. Spaces before and after each part are skipped. No CRLF is expected at the end.	2018-11-18 21:45:49 +01:00
Christopher Faulet	72d9125efb	MINOR: conn_stream: Add a flag to notify the mux it must respect the reserve By setting the flag CO_RFL_KEEP_RSV when calling mux->rcv_buf, the stream-interface notifies the mux it must keep some space to preserve the buffer's reserve. This flag is only useful for multiplexers handling structured data, because in such case, the stream-interface cannot know the real amount of free space in the channel's buffer.	2018-11-18 21:45:48 +01:00
Christopher Faulet	c6618d6835	MINOR: conn_stream: Add a flag to notify the mux it should flush its buffers By setting the flag CO_RFL_BUF_FLUSH when calling mux->rcv_buf, the stream-interface notifies the mux it should flush its buffers without reading more data. This flag is set when the SI wants to use the kernel TCP splicing to forward data. Of course, the mux can respect it or not, depending on its state. It is only a hint.	2018-11-18 21:45:48 +01:00
Olivier Houchard	7c6f8b146d	MAJOR: connections: Detach connections from streams. Do not destroy the connection when we're about to destroy a stream. This prevents us from doing keepalive on server connections when the client is using HTTP/2, as a new stream is created for each request. Instead, the session is now responsible for destroying connections. When reusing connections, the attach() mux method is now used to create a new conn_stream.	2018-11-18 21:45:45 +01:00
Olivier Houchard	131fd89d5a	MINOR: sessions: Start to store the outgoing connection in sessions. Introduce a new field in session, "srv_conn", and a linked list of sessions in the connection. It will be used later when we'll switch connections from being managed by the stream, to being managed by the session.	2018-11-18 21:44:56 +01:00
Olivier Houchard	060ed43361	MINOR: mux: Add a destroy() method. Add a new method to muxes, destroy(), that is responsible for destroying the mux and the associated connection, to be used for server connections.	2018-11-18 21:44:53 +01:00
Olivier Houchard	d540b36e8a	MINOR: mux: Add a new "avail_streams" method. Add a new method for mux, avail_streams, that returns the number of streams still available for a mux. For the mux_pt, it'll return 1 if the connection is idle, or 0 otherwise. For the H2 mux, it'll return the max number of streams allowed, minus the number of streams currently in use.	2018-11-18 21:44:06 +01:00
Willy Tarreau	db398435aa	MINOR: stream-int: replace si_cant_put() with si_rx_room_{blk,rdy}() Remaining calls to si_cant_put() were all for lack of room and were turned to si_rx_room_blk(). A few places where SI_FL_RXBLK_ROOM was cleared by hand were converted to si_rx_room_rdy(). The now unused si_cant_put() function was removed.	2018-11-18 21:41:50 +01:00
Willy Tarreau	b26a6f9708	MEDIUM: stream-int: make use of si_rx_chan_{rdy,blk} to control the stream-int from the channel The channel can disable reading from the stream-interface using various methods, such as : - CF_DONT_READ - !channel_may_recv() - and possibly others Till now this was done by mangling SI_FL_RX_WAIT_EP which is not appropriate at all since it's not the stream interface which decides whether it wants to deliver data or not. Some places were also wrongly relying on SI_FL_RXBLK_ROOM since it was the only other alternative, but it's not suitable for CF_DONT_READ. Let's use the SI_FL_RXBLK_CHAN flag for this instead. It will properly prevent the stream interface from being woken up and reads from subscribing to more receipt without being accidentally removed. It is automatically reset if CF_DONT_READ is not set in stream_int_notify(). The code is not trivial because it splits the logic between everything related to buffer contents (channel_is_empty(), CF_WRITE_PARTIAL, etc) and buffer policy (CF_DONT_READ). Also it now needs to decide timeouts based on any blocking flag and not just SI_FL_RXBLK_ROOM anymore. It looks like this patch has caused a minor performance degradation on connection rate, which possibly deserves being investigated deeper as the test conditions are uncertain (e.g. slightly more subscribe calls?).	2018-11-18 21:41:49 +01:00
Willy Tarreau	abb5d4202f	MEDIUM: stream-int: use si_rx_shut_blk() to indicate the SI is closed Till now we were using si_done_put() upon shutr, but these flags could be reset upon next activity. Now let's switch to SI_FL_RXBLK_SHUT which doesn't go away. It's also set in stream_int_update() in case a shutr condition is detected. The now unused si_done_put() was removed.	2018-11-18 21:41:49 +01:00
Willy Tarreau	7f494d0c5e	MINOR: stream-int: make si_sync_recv() simply check ENDP before si_cs_recv() Instead of checking complex conditions to call si_cs_recv() upon first call, let's simply use si_rx_endp_ready() now that si_cs_recv() reports it accurately, and add si_rx_blocked() to cover any blocking situation.	2018-11-18 21:41:48 +01:00
Willy Tarreau	8bb2ffb831	MINOR: stream-int: replace si_{want,stop}_put() with si_rx_endp_{more,done}() Here it's only a 1-to-1 replacement.	2018-11-18 21:41:47 +01:00
Willy Tarreau	8be7cd7b92	MEDIUM: stream-int: use si_rx_buff_{rdy,blk} to report buffer readiness The stream interface used to conflate a missing buffer and lack of buffer space into SI_FL_WAIT_ROOM but this causes difficulties as these cannot be checked at the same moment and are not resolved at the same moment either. Now we instead mark the buffer as presumably available using si_rx_buff_rdy() and mark it as unavailable+requested using si_rx_buff_blk(). The call to si_alloc_buf() was moved after si_stop_put(). This makes sure that the SI_FL_RX_WAIT_EP flag is cleared on allocation failure so that the function is called again if the callee fails to do its work.	2018-11-18 21:41:47 +01:00
Willy Tarreau	32742fdf45	MINOR: stream-int: use si_rx_blocked()/si_tx_blocked() to check readiness This way we don't limit ourselves to random flags only and the code is more readable and safer for the long term.	2018-11-18 21:41:46 +01:00
Willy Tarreau	05b9b64afb	MINOR: stream-int: replace SI_FL_WANT_PUT with !SI_FL_RX_WAIT_EP The SI_FL_WANT_PUT flag is used in an awkward way, sometimes it's set by the stream-interface to mean "I have something to deliver", sometimes it's cleared by the channel to say "I don't want you to send what you have", and it has to be set back once CF_DONT_READ is cleared. This will have to be split between SI_FL_RX_WAIT_EP and SI_FL_RXBLK_CHAN. This patch only replaces all uses of the flag with its natural (but negated) replacement SI_FL_RX_WAIT_EP. The code is expected to be strictly equivalent. The now unused flag was completely removed.	2018-11-18 21:41:46 +01:00
Willy Tarreau	78dcacef5c	MINOR: stream-int: add new functions si_{rx,tx}_{blocked,endp_ready}() The first ones are used to figure if a direction is blocked on the stream interface for anything but the end point. The second ones are used to detect if the end point is ready to receive/transmit. They should be used instead of directly fiddling with the existing bits.	2018-11-18 21:41:46 +01:00
Willy Tarreau	94f7907d65	MINOR: stream-int: introduce new SI_FL_RXBLK flags The plan is to have the following flags to describe why a stream interface doesn't produce data : - SI_FL_RXBLK_CHAN : the channel doesn't want it to receive - SI_FL_RXBLK_BUFF : waiting for a buffer allocation to complete - SI_FL_RXBLK_ROOM : more room is required in the channel to receive - SI_FL_RXBLK_SHUT : input now closed, nothing new will come - SI_FL_RX_WAIT_EP : waiting for the endpoint to produce more data Applets like the CLI which consume complete commands at once and produce large chunks of responses will for example be able to stop being woken up by clearing SI_FL_WANT_GET and setting SI_FL_RXBLK_ROOM when the rx buffer is full. Once called they will unblock WANT_GET. The flags were moved together in readable form with the Rx bits using 2 hex digits and still have some room to do a similar operation on the Tx path later, with the WAIT_EP flag being represented alone on a digit.	2018-11-18 21:41:45 +01:00
Willy Tarreau	d0f5bbcd64	MINOR: stream-int: rename SI_FL_WAIT_ROOM to SI_FL_RXBLK_ROOM This flag is not enough to describe all blocking situations, as can be seen in each case we remove it. The muxes have taught us that using multiple blocking flags in parallel will be much easier, so let's start to do this now. This patch only renames this flag in order to make next changes more readable.	2018-11-18 21:41:45 +01:00
Willy Tarreau	a44e576f62	MINOR: stream-int: expand the flags to 32-bit We used to have enough of 16 bits, with 3 still available but it's not possible to add the rx/tx blocking bits there. Let's extend the format to 32 bits and slightly reorder the fields to maintain the struct size to 64 bytes. Nothing else was changed.	2018-11-18 21:41:45 +01:00
Willy Tarreau	fafd3984b9	MINOR: mux: implement a get_first_cs() method This method is used to retrieve the first known good conn_stream from the mux. It will be used to find the other end of a connection when dealing with the proxy protocol for example.	2018-11-18 21:29:20 +01:00
Willy Tarreau	ade6478a8c	MINOR: stream: move the conn_stream specific calls to the stream-int There are still some unwelcome synchronous calls to si_cs_recv() in process_stream(). Let's have a new function si_sync_recv() to perform a synchronous receive call on a stream interface regardless of the type of its endpoint, and move these calls there. For now it only implements conn_streams since it doesn't seem useful to support applets there. The function implements an extra check for the stream interface to be in an established state before attempting anything.	2018-11-17 19:53:45 +01:00
William Lallemand	c59f9884d7	MEDIUM: listeners: support unstoppable listener An unstoppable listener is a listener which won't be stopped during a soft stop. The unstoppable_jobs variable is incremented and the listener won't prevent the process from leaving properly. It is not a good idea to use this feature (the LI_O_NOSTOP flag) with a listener that needs to be bound again on another process during a soft reload.	2018-11-16 17:05:40 +01:00
William Lallemand	a719926cf8	MEDIUM: jobs: support unstoppable jobs for soft stop This patch allows a process to properly quit when some jobs are still active, this feature is handled by the unstoppable_jobs variable, which must be atomically incremented. During each new iteration of run_poll_loop() the break condition of the loop is now (jobs - unstoppable_jobs) == 0. The unique usage of this at the moment is to handle the socketpair CLI of the worker during the stopping of the process. During the soft stop, we could mark the CLI listener as an unstoppable job and still handle new connections till all other jobs are stopped.	2018-11-16 17:05:40 +01:00
Frédéric Lécaille	9ca51aa288	MINOR: http: Implement "early-hint" http request rules. This patch implements the http_apply_early_hint_rule() function, which is responsible for building HTTP 103 Early Hint responses each time an "early-hint" rule is matched.	2018-11-12 21:08:55 +01:00
Frédéric Lécaille	0ebbcb663c	MINOR: http: Make new "early-hint" http-request action really be parsed. This patch adds a "early_hint" struct to "arg" union of "act_rule" struct and parse "early-hint" http-request keyword with it using the same code as for "(add|set)-header" parser.	2018-11-12 21:08:55 +01:00
Frédéric Lécaille	a985e3875b	MINOR: http: Add new "early-hint" http-request action. This patch adds the new "early-hint" action to "http-request" rules parser. This action should be parsed the same way as "(add|set)-header" actions.	2018-11-12 21:08:55 +01:00
Willy Tarreau	7520e4ff57	MINOR: namespaces: don't build namespace.c if disabled When namespaces are disabled, support is still reported because the file is built with almost nothing in it but built anyway. Instead of extending the scope of the numerous ifdefs in this file, better avoid building it when namespaces are disabled. In this case we define my_socketat() as an inline function mapping directly to socket(). The struct netns_entry still needs to be defined because it's used by various other functions in the code.	2018-11-12 19:15:15 +01:00
Willy Tarreau	c1b0645dac	MEDIUM: log: add a new "raw" format This format is pretty similar to the previous "short" format except that it also removes the severity level. Thus only the raw message is sent. This is suitable for use in containers, where only the raw information is expected and where the severity is supposed to come from the file descriptor used.	2018-11-12 18:37:55 +01:00
Willy Tarreau	e8746a08b2	MEDIUM: log: support a new "short" format This format is meant to be used with local file descriptors. It emits messages only prefixed with a level, removing all the process name, system name, date and so on. It is similar to the printk() format used on Linux. It's suitable to be sent to a local logger compatible with systemd's output format. Note that the facility is still required but not used, hence it is suggested to use "daemon" to remind that it's a local logger. Example : log stdout format short daemon # send everything to stdout log stderr format short daemon notice # send important events to stderr	2018-11-12 18:37:55 +01:00
Willy Tarreau	13ef773722	MINOR: log: report the number of dropped logs in the stats It's easy to detect when logs on some paths are lost as sendmsg() will return EAGAIN. This is particularly true when sending to /dev/log, which often doesn't support a big logging capacity. Let's keep track of these and report the total number of dropped messages in "show info".	2018-11-12 18:37:55 +01:00
Willy Tarreau	d0d40ebf5e	CLEANUP: stream-int: remove the now unused si->update() function We exclusively use stream_int_update() now, the lower layers are not called anymore so let's remove them, as well as si_update() which used to be their wrapper.	2018-11-11 10:18:37 +01:00
Willy Tarreau	d14844a734	MINOR: stream-int: replace si_update() with si_update_both() The function used to be called in turn for each side of the stream, but since it's called exclusively from process_stream(), it prevents us from making use of the knowledge we have of the operations in progress for each side, resulting in having to go all the way through functions like stream_int_notify() which are not appropriate there. That patch creates a new function, si_update_both() which takes two stream interfaces expected to belong to the same stream, and processes their flags in a more suitable order, but for now doesn't change the logic at all. The next step will consist in trying to reinsert the rest of the socket layer-specific update code to ultimately update the flags correctly at the end of the operation.	2018-11-11 10:18:37 +01:00
Willy Tarreau	8fe516f08a	MEDIUM: stream-int: make si_chk_rcv() check that SI_FL_WAIT_ROOM is cleared After careful inspection, it now seems OK to call si_chk_rcv() only when SI_FL_WAIT_ROOM is cleared and SI_FL_WANT_PUT is set, since all identified call places have already taken care of this.	2018-11-11 10:18:37 +01:00
Willy Tarreau	abf531caa0	MEDIUM: stream-int: always call si_chk_rcv() when we make room in the buffer Instead of clearing the SI_FL_WAIT_ROOM flag and losing the information about the need from the producer to be woken up, we now call si_chk_rcv() immediately. This is cheap to do and it could possibly be further improved by only doing it when SI_FL_WAIT_ROOM was still set, though this will require some extra auditing of the code paths. The only remaining place where the flag was cleared without a call to si_chk_rcv() is si_alloc_ibuf(), but since this one is called from a receive path woken up from si_chk_rcv() or not having failed, the clearing was not necessary anymore either. And there was one place in stream_int_notify() where si_chk_rcv() was called with SI_FL_WAIT_ROOM still explicitly set so this place was adjusted in order to clear the flag prior to calling si_chk_rcv(). Now we don't have any situation where we randomly clear SI_FL_WAIT_ROOM without trying to wake the other side up, nor where we call si_chk_rcv() with the flag set, so this flag should accurately represent a failed attempt at putting data into the buffer.	2018-11-11 10:18:37 +01:00
Willy Tarreau	1f9de21c38	MEDIUM: stream-int: make SI_FL_WANT_PUT reflect CF_DONT_READ When CF_DONT_READ is set, till now we used to set SI_FL_WAIT_ROOM, which is not appropriate since it would lose the subscribe status. Instead let's clear SI_FL_WANT_PUT (just like applets do), and set the flag only when CF_DONT_READ is cleared. We have to do this in stream_int_update(), and in si_cs_io_cb() after returning from si_cs_recv() since it would be a bit invasive to hack this one for now. It must not be done in stream_int_notify() otherwise it would re-enable blocked applets. Last, when si_chk_rcv() is called, it immediately clears the flag before calling ->chk_rcv() so that we are not tempted to uselessly loop on the same call until the receive function is called. This is the same principle as what is done with the applet scheduler.	2018-11-11 10:18:37 +01:00
Willy Tarreau	1bdb598a55	MINOR: stream-int: factor the SI_ST_EST state test into si_chk_rcv() This test is made in each implementation of the function, better to merge it.	2018-11-11 10:18:37 +01:00
Willy Tarreau	96aadd5c55	MEDIUM: stream-int: temporarily make si_chk_rcv() take care of SI_FL_WAIT_ROOM This flag should already be cleared before calling the *chk_rcv() functions. Before adapting all call places, let's first make sure si_chk_rcv() clears it before calling them so that these functions do not have to check it again and so that they do not adjust it. This function will only call the lower layers if the SI_FL_WANT_PUT flag is present so that the endpoint can decide not to be called (as done with applets).	2018-11-11 10:18:37 +01:00
Willy Tarreau	57f08bb63b	MINOR: stream-int: make it clear that si_ops cannot be null There was an ambiguity in which functions of the si_ops struct could be null or not. only ->update doesn't exist in one of the si_ops (the embedded one), all others are always defined. ->shutr and ->shutw were never tested. However ->chk_rcv() and ->chk_snd() were tested, causing confusion about the proper way to wake the other side up if undefined (which never happens). Let's update the comments to state these functions are mandatory and remove the offending checks.	2018-11-11 10:18:37 +01:00
Willy Tarreau	af4f6f6d2f	MINOR: stream-int: use si_cant_put() instead of setting SI_FL_WAIT_ROOM We now do this on the si_cs_recv() path so that we always have SI_FL_WANT_PUT properly set when there's a need to receive and SI_FL_WAIT_ROOM upon failure.	2018-11-11 10:18:37 +01:00
Willy Tarreau	394970c297	MINOR: stream-int: add si_done_{get,put} to indicate that we won't do it anymore This is useful on close or stream aborts as it saves us from having to manipulate the (sometimes confusing) flags.	2018-11-11 10:18:37 +01:00
Willy Tarreau	0cd3bd628a	MINOR: stream-int: rename si_applet_{want|stop|cant}_{get|put} It doesn't make sense to limit this code to applets, as any stream interface can use it. Let's rename it by simply dropping the "applet_" part of the name. No other change was made except updating the comments.	2018-11-11 10:18:37 +01:00
Willy Tarreau	21028b5e7f	MEDIUM: appctx: check for allocation attempts in buffer allocation callbacks The buffer allocation callback appctx_res_wakeup() used to rely on old tricks to detect if a buffer was already granted to an appctx, namely by checking the task's state. Not only is this test not valid anymore, but it's inaccurate. Let's rely solely on SI_FL_WAIT_ROOM that is now set on allocation failure by the functions trying to allocate a buffer. The buffer is now allocated on the fly and the flag removed so that the consistency between the two remains granted. The patch also fixes minor issues such as the function being improperly declared inline(!) and the fact that using appctx_wakeup() sets the wakeup reason to TASK_WOKEN_OTHER while we try to use TASK_WOKEN_RES when waking up consecutive to a resource allocation such as a buffer.	2018-11-11 10:18:37 +01:00
Willy Tarreau	b882dd88cc	MEDIUM: stream: implement stream_buf_available() This function replaces stream_res_available(), which is used as a callback for the buffer allocator. It now carefully checks which stream interface was blocked on a buffer allocation, tries to allocate the input buffer to this stream interface, and wakes the task up once such a buffer was found. It will automatically remove the SI_FL_WAIT_ROOM flag upon success since the info this flag indicates becomes wrong as soon as the buffer is allocated. The code is still far from being perfect because if a call to si_cs_recv() fails to allocate a buffer, we'll still end up passing via process_stream() again, but this could be improved in the future by using finer-grained wake-up notifications.	2018-11-11 10:18:37 +01:00
Willy Tarreau	2d372c2aa1	MINOR: stats: report the number of currently connected peers The active peers output indicates both the number of established peers connections and the number of peers connection attempts. The new counter "ConnectedPeers" also indicates the number of currently connected peers. This helps detect that some peers cannot be reached for example. It's worth mentioning that this value changes over time because unused peers are often disconnected and reconnected. Most of the time it should be equal to ActivePeers.	2018-11-05 17:15:21 +01:00
Willy Tarreau	199ad24661	MINOR: stats: report the number of active peers in "show info" Peers are the last type of activity which can maintain a job present, so it's important to report that such an entity is still active to explain why the job count may be higher than zero. Here by "ActivePeers" we report peers sessions, which include both established connections and outgoing connection attempts.	2018-11-05 17:15:21 +01:00
Willy Tarreau	00098ea034	MINOR: stats: report the number of active jobs and listeners in "show info" When an haproxy process doesn't stop after a reload, it's because it still has some active "jobs", which mainly are active sessions, listeners, peers or other specific activities. Sometimes it's difficult to troubleshoot the cause of these issues (which generally are the result of a bug) only because some indicators are missing. This patch adds the number of listeners, the number of jobs, and the stopping status to the output of "show info". This way it becomes a bit easier to try to narrow down the cause of such an issue should it happen. A typical use case is to connect to the CLI before reloading, then issuing the "show info" command to see what happens. In the normal situation, stopping should equal 1, jobs should equal 1 (meaning only the CLI is still active) and listeners should equal zero. The patch is so trivial that it could make sense to backport it to 1.8 in order to help with troubleshooting.	2018-11-05 17:15:21 +01:00
Willy Tarreau	4698adf68f	MINOR: compat: automatically detect support for crypt_r() glibc >= 2.2 and FreeBSD >= 12.0 support crypt_r(), let's detect this and set a macro HA_HAVE_CRYPT_R for this.	2018-10-29 19:14:14 +01:00
Willy Tarreau	34d4b525a1	BUG/MEDIUM: auth/threads: use of crypt() is not thread-safe It was reported here that authentication may fail when threads are enabled : https://bugzilla.redhat.com/show_bug.cgi?id=1643941 While I couldn't reproduce the issue, it's obvious that there is a problem with the use of the non-reentrant crypt() function there. On Linux systems there's crypt_r() but not on the vast majority of other ones. Thus a first approach consists in placing a lock around this crypt() call. Another patch may relax it when crypt_r() is available. This fix must be backported to 1.8. Thanks to Ryan O'Hara for the quick notification.	2018-10-29 18:06:02 +01:00
Willy Tarreau	ce487aab46	BUG/MEDIUM: tools: fix direction of my_ffsl() Commit `27346b01a` ("OPTIM: tools: optimize my_ffsl() for x86_64") optimized my_ffsl() for intensive use cases in the scheduler, but as half of the times I got it wrong so it counted bits the reverse way. It doesn't matter for the scheduler nor fd cache but it broke cpu-map with threads which heavily relies on proper ordering. We should probably consider dropping support for gcc < 3.4 and switching to builtins for these ones, though often they are as ambiguous. No backport is needed.	2018-10-29 16:09:57 +01:00
Willy Tarreau	8e9f4531cb	BUG/MINOR: memory: make the thread-local cache allocator set the debugging link When building with DEBUG_MEMORY_POOLS, an element returned from the cache would not have its pool link initialized unless it's allocated using pool_alloc(). This is problematic for buffer allocators which use pool_alloc_dirty(), as freeing this object will make the code think it was allocated from another pool. This patch does two things : - make __pool_get_from_cache() set the link - remove the extra initialization from pool_alloc() since it's always done in either __pool_get_first() or __pool_refill_alloc() This patch is marked MINOR since it only affects code explicitly built for debugging. No backport is needed.	2018-10-28 20:12:31 +01:00
William Lallemand	90b1ca1ff5	MEDIUM: channel: reorder the channel analyzers for the cli Reorder the channel analyzers so the CLI analyzers are defined before the XFER_DATA ones.	2018-10-28 14:13:31 +01:00
William Lallemand	309dc9adec	MEDIUM: mworker: stop the master proxy in the workers The master proxy which handles the CLI should not be used or shown in the stats of the workers. This proxy is now disabled after the fork.	2018-10-28 14:03:31 +01:00
William Lallemand	cf62f7e3cb	MEDIUM: cli: implement 'mode cli' proxy analyzers This patch implements analysers for parsing the CLI and extra features for the master's CLI. For each command (sent alone, or separated by ; or \n) the request analyser will determine to which server it should send the request. The 'mode cli' proxy is able to parse a prefix for each command which is used to select the appropriate server. The prefix starts with @ and is followed by "master", the PID preceded by ! or the relative PID. (e.g. @master, @1, @!1234). The servers are not round-robined anymore. The command is sent with a SHUTW which forces the server to close the connection after sending its response. However the proxy allows a keepalive connection on the client side and does not close. The response analyser does not do much stuff, it only reinits the connection when it receives a close from the server, and forwards the response. It does not analyze the response data. The only guarantee of the end of the response is the close of the server, we can't rely on the double \n since it's not sent by every command. This could be reimplemented later as a filter.	2018-10-28 14:03:06 +01:00
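The prefix syntax described above (@master, @1, @!1234) could be classified roughly as follows. This is a sketch under stated assumptions: the enum, the function name `parse_cli_prefix` and its return convention are hypothetical and are not taken from the HAProxy sources.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical classification of a CLI command prefix. */
enum prefix_kind {
	PFX_NONE,      /* word does not start with '@' */
	PFX_MASTER,    /* "@master" */
	PFX_RELATIVE,  /* "@<n>": relative PID */
	PFX_PID,       /* "@!<n>": real PID */
	PFX_INVALID    /* starts with '@' but is malformed */
};

/* Parses <word> and stores the numeric part into <value> for the
 * PFX_RELATIVE and PFX_PID cases. <value> is left untouched otherwise. */
static enum prefix_kind parse_cli_prefix(const char *word, long *value)
{
	enum prefix_kind kind = PFX_RELATIVE;
	const char *p = word;
	char *end;

	if (*p != '@')
		return PFX_NONE;
	p++;

	if (strcmp(p, "master") == 0)
		return PFX_MASTER;

	if (*p == '!') {
		kind = PFX_PID;
		p++;
	}

	/* require at least one digit and nothing after the number */
	if (*p < '0' || *p > '9')
		return PFX_INVALID;

	*value = strtol(p, &end, 10);
	if (*end != '\0')
		return PFX_INVALID;

	return kind;
}
```

The caller would then map the result to a target server; that lookup is left out here because it depends on the process list, which this sketch does not model.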
William Lallemand	291810d8f8	MEDIUM: mworker: find the server ptr using a CLI prefix Add a struct server pointer in the mworker_proc struct so we can easily use it as a target for the mworker proxy. pcli_prefix_to_pid() is used to find the right PID of the worker when using a prefix in the CLI. (@master, @#<relative pid> , @<pid>) pcli_pid_to_server() is used to find the right target server for the CLI proxy.	2018-10-28 13:51:39 +01:00
William Lallemand	14721be11f	MEDIUM: cli: disable some keywords in the master The master process does not need all the keywords of the cli, add 2 flags to choose which keyword to use. It might be useful to activate some of them in a debug mode later...	2018-10-28 13:51:39 +01:00
William Lallemand	e736115d3a	MEDIUM: mworker: create CLI listeners from argv[] This patch introduces mworker_cli_proxy_new_listener() which allows the creation of new listeners for the CLI proxy. Using this function it is possible to create new listeners from the program arguments with -Sa <unix_socket>. It is allowed to create multiple listeners with several -Sa.	2018-10-28 13:51:39 +01:00
William Lallemand	8a02257d88	MEDIUM: mworker: proxy for the master CLI This patch implements a listen proxy within the master. It uses the sockpair of all the workers as servers. In the current state of the code, the proxy is only doing round robin on the CLI of the workers. A CLI mode will be needed to know to which CLI send the requests.	2018-10-28 13:51:39 +01:00
William Lallemand	6e0db2fa99	MEDIUM: mworker: add proc_list in global.h Add the process list in types/global.h so it could be accessed from anywhere.	2018-10-28 13:51:39 +01:00
William Lallemand	313bfd18c1	MINOR: server: export new_server() function The new_server() function will be useful to create a proxy for the master-worker.	2018-10-28 13:51:38 +01:00
William Lallemand	7e1299bb3a	REORG: mworker: move struct mworker_proc to global.h Move the definition of the mworker_proc structure in types/global.h.	2018-10-28 13:51:38 +01:00
William Lallemand	ce83b4a5dd	MEDIUM: mworker: each worker socketpair is a CLI listener The init code of the mworker_proc structs has been moved before the init of the listeners. Each socketpair is now connected to a CLI within the workers, which allows the master to access their CLI. The inherited flag of the worker side socketpair is removed so the socket can be closed in the master.	2018-10-28 13:51:38 +01:00
Willy Tarreau	85f890174a	MEDIUM: stream-int: make si_update() synchronize flag changes before the I/O With the new synchronous si_cs_send() at the end of process_stream(), we're seeing re-appear the I/O layer specific part of the stream interface which is supposed to deal with I/O event subscription. The only difference is that now we subscribe to I/Os only after having attempted (and failed) them. This patch brings a cleanup in this by reintroducing stream_int_update_conn() with the send code from process_stream(). However this alone would not be enough because the flags which are cleared afterwards would result in the loss of the possible events (write events only at the moment). So the flags clearing and stream-int state updates are also performed inside si_update() between the generic code and the I/O specific code. This definitely makes sense as after this call we can simply check again for channel and SI flag changes and decide to loop once again or not.	2018-10-28 13:47:00 +01:00
Willy Tarreau	0979916d3b	MINOR: stream-int: add si_alloc_ibuf() to ease input buffer allocation This will supersede channel_alloc_buffer() while relying on it. It will automatically adjust SI_FL_WAIT_ROOM on the stream-int depending on success or failure to allocate this buffer. It's worth noting that it could make sense to also set SI_FL_WANT_PUT each time we do this to further simplify the code at user places such as applets, but it would possibly not be easy to clean this flag everywhere an rx operation stops.	2018-10-28 13:47:00 +01:00
Willy Tarreau	ede3d884fc	MEDIUM: channel: merge back flags CF_WRITE_PARTIAL and CF_WRITE_EVENT The behaviour of the flag CF_WRITE_PARTIAL was modified by commit `95fad5ba4` ("BUG/MAJOR: stream-int: don't re-arm recv if send fails") due to a situation where it could trigger an immediate wake up of the other side, both acting in loops via the FD cache. This loss has caused the need to introduce CF_WRITE_EVENT as commit `c5a9d5bf`, to replace it, but both flags express more or less the same thing and this distinction creates a lot of confusion and complexity in the code. Since the FD cache now acts via tasklets, the issue worked around in the first patch no longer exists, so it's more than time to kill this hack and to restore CF_WRITE_PARTIAL's semantics (i.e.: there has been some write activity since we last left process_stream). This patch mostly reverts the two commits above. Only the part making use of CF_WROTE_DATA instead of CF_WRITE_PARTIAL to detect the loss of data upon connection setup was kept because it's more accurate and better suited.	2018-10-26 08:32:57 +02:00
Ioannis Cherouvim	1ff7633dd7	CLEANUP: tools: fix misleading comment above function LIM2A The function produces ASCII, but its comment was copied from U2H which produces HTML.	2018-10-26 05:00:48 +02:00
Frédéric Lécaille	b80bc273a3	MINOR: shctx: Change max. object size type to unsigned int. This change is there to prevent implicit conversions when comparing shctx maximum object sizes with other unsigned values.	2018-10-26 04:54:40 +02:00
Frédéric Lécaille	b7838afe6f	MINOR: shctx: Add a maximum object size parameter. This patch adds a new parameter to shctx_init() function to be used to limit the size of each shared object, -1 value meaning "no limit".	2018-10-24 04:39:44 +02:00
Frédéric Lécaille	8df65ae5e2	MINOR: cache: Larger HTTP objects caching. This patch makes the cache capable of storing HTTP objects larger than a buffer. It makes use of the new "block by block shared object allocation" shctx API. A new pointer to struct shared_block has been added to the cache applet context to memorize the next block to be used by the HTTP cache I/O handler http_cache_io_handler() to emit the data. Another member, named "sent", memorizes the number of bytes already sent by this handler. So, to send an object from cache, http_cache_io_handler() must be called until the "sent" counter reaches the size of this object.	2018-10-24 04:37:12 +02:00
Frédéric Lécaille	0bec807e08	MINOR: shctx: Shared objects block by block allocation. This patch makes shctx capable of storing objects in several parts, each part being made of several blocks. There is no more need to walk through until reaching the end of a row to append new blocks. A new pointer to a struct shared_block member, named last_reserved, has been added to struct shared_block to memorize the last block which was reserved by shctx_row_reserve_hot(). Same thing about the "last_append" pointer which is used to memorize the last block used by shctx_row_data_append() to store the data.	2018-10-24 04:35:53 +02:00
Willy Tarreau	68ad3a42f7	MINOR: proxy: add a new option "http-use-htx" This option makes a proxy use only HTX-compatible muxes instead of the HTTP-compatible ones for HTTP modes. It must be set on both ends, this is checked at parsing time.	2018-10-23 10:22:36 +02:00
Christopher Faulet	55d6be7d83	MINOR: h1: Export some functions parsing the value of some HTTP headers Functions parsing the value of "Connection:", "Transfer-encoding:" and "Content-length:" headers are now exported to be used by the mux-h1.	2018-10-23 10:22:36 +02:00
Willy Tarreau	627505d36a	MINOR: freq_ctr: add swrate_add_scaled() to work with large samples Some samples representing time will cover more than one sample at once if they are units of time per time. For this we'd need to have the ability to loop over swrate_add() multiple times but that would be inefficient. By developing the function elevated to power N, it's visible that some coefficients quickly disappear and that those which remain at the first order more or less compensate each other. Thus a simplified version of this function was added to provide a single value for a given number of samples. Tests with multiple values, window sizes and sample sizes have shown that it is possible to make it remain surprisingly accurate (typical error < 0.2% over various large window and sample sizes, even samples representing up to 1/4 of the window).	2018-10-22 08:13:57 +02:00
Olivier Houchard	3f03ab5b15	MINOR: connection: Add a SUB_CALL_UNSUBSCRIBE event. Add a SUB_CALL_UNSUBSCRIBE event, to let the caller know that the unsubscribe method should be called before destroying the object.	2018-10-21 06:00:04 +02:00
Olivier Houchard	53216e7db9	MEDIUM: connections: Don't directly mess with the polling from the upper layers. Avoid using conn_xprt_want_send/recv, and totally nuke cs_want_send/recv, from the upper layers. The polling is now directly handled by the connection layer, it is activated on subscribe(), and unactivated once we got the event and we woke the related task.	2018-10-21 05:58:40 +02:00
Olivier Houchard	1fddc9b7bb	BUG/MEDIUM: connections: Remove subscription if going in idle mode. Make sure we don't have any subscription when the connection is going in idle mode, otherwise there's a race condition when the connection is reused, if there are still old subscriptions, new ones won't be done. No backport is needed.	2018-10-21 05:55:20 +02:00
Olivier Houchard	62975a7740	BUG/MEDIUM: pools: Fix the usage of mmap() with DEBUG_UAF. When mapping memory with mmap(), we should use a fd of -1, not 0. 0 may work on linux, but it doesn't work on FreeBSD, and probably other OSes. It would be nice to backport this to 1.8 to help debugging there.	2018-10-21 05:43:33 +02:00
Willy Tarreau	4e7cc3381b	BUILD: compiler: rename __unreachable() to my_unreachable() Olivier reported that on FreeBSD __unreachable is already defined and causes build warnings. Let's rename it then.	2018-10-20 17:45:48 +02:00
Willy Tarreau	7a6ad88b02	BUILD: memory: fix free_list pointer declaration again for atomic CAS Commit `ac6c880` ("BUILD: memory: fix pointer declaration for atomic CAS") attempted to fix a build warning affecting the lock-free version of the pool allocator. But the fix tried to hide the cause instead of addressing it, thus clang still complains about (void ) not matching (void ). The real solution is to declare free_list (void *) and not to use a cast. Now this builds fine with gcc/clang with and without threads. No backport is needed.	2018-10-20 17:37:38 +02:00
Willy Tarreau	ed72d82827	MEDIUM: time: measure the time stolen by other threads The purpose is to detect if threads or processes are competing for the same CPU. This can happen when threads are incorrectly bound, or after a reload if the previous process still has an important activity. With threads this situation is problematic because a preempted thread holding a lock will block other ones waiting for this lock to be released. A first attempt consisted in measuring the cumulated lost time more precisely but the system's scheduler is smart enough to try to limit the thread preemption rate by mostly context switching during poll()'s blank periods, so most of the time lost is not seen. In essence this is good because it means a thread is not preempted with a lock held, and even regarding the rendez-vous point it cannot prevent the other ones from making progress. But still it happens tens to hundreds of times per second that a thread might be preempted, so it's still possible to detect that the situation is happening, thus it's interesting to measure and report its frequency. Each time we enter the poller, we check the CPU time spent working and see if we've lost time doing something else. To limit false positives, we're only interested in losses of 500 microseconds or more (i.e. half a clock tick on a 1 kHz system). If so, it indicates that some time was stolen by another thread or process. Note that we purposely store some sub-millisecond counters so that under heavy traffic with a 1 kHz clock, it's still possible to measure something without being subject to the risk of rounding errors (i.e. if exactly 1 ms is stolen it's possible that the time difference could often be slightly lower). This counter of lost CPU time slots time is reported in "show activity" in numbers of milliseconds of CPU lost per second, per 15s, and total over the process' life. 
By definition, the per-second counter cannot report values larger than 1000 per thread per second and the 15s one will be limited to 15000/s in the worst case, but it's possible that peak values exceed such thresholds after long pauses.	2018-10-19 08:51:59 +02:00
Willy Tarreau	5ceeb15002	MINOR: time: add now_mono_time() and now_cpu_time() These two functions retrieve respectively the monotonic clock time and the per-thread CPU time when available on the platform, or return zero. These syscalls may require to link with -lrt on certain libc, which is enabled in the Makefile with USE_RT=1 (default on Linux systems).	2018-10-18 16:39:48 +02:00
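The two clock readings described above can be sketched with POSIX clock_gettime(). This is an assumption about the underlying calls, suggested by the -lrt remark; the function names below are hypothetical and the nanosecond unit is a choice made for this example.

```c
#include <stdint.h>
#include <time.h>

/* Hypothetical helper: converts a timespec to nanoseconds. */
static uint64_t ts_to_ns(const struct timespec *ts)
{
	return (uint64_t)ts->tv_sec * 1000000000ULL + (uint64_t)ts->tv_nsec;
}

/* Returns the monotonic clock in nanoseconds, or 0 if unavailable. */
static uint64_t sketch_mono_time_ns(void)
{
#if defined(CLOCK_MONOTONIC)
	struct timespec ts;

	if (clock_gettime(CLOCK_MONOTONIC, &ts) == 0)
		return ts_to_ns(&ts);
#endif
	return 0;
}

/* Returns the CPU time consumed by the calling thread in nanoseconds,
 * or 0 if unavailable. */
static uint64_t sketch_thread_cpu_time_ns(void)
{
#if defined(CLOCK_THREAD_CPUTIME_ID)
	struct timespec ts;

	if (clock_gettime(CLOCK_THREAD_CPUTIME_ID, &ts) == 0)
		return ts_to_ns(&ts);
#endif
	return 0;
}
```

Comparing the growth of the two values over an interval where the thread was supposed to be busy gives an idea of how much wall-clock time was spent off the CPU, which is the kind of measurement the "stolen time" commit builds on.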
Willy Tarreau	ac6c8805be	BUILD: memory: fix pointer declaration for atomic CAS The calls to HA_ATOMIC_CAS() on the lockfree version of the pool allocator were mistakenly done on (void) for the old value instead of (void *). While this has no impact on "recent" gcc, it does have one for gcc < 4.7 since the CAS was open coded and it's not possible to assign a temporary variable of type "void". No backport is needed, this only affects 1.9.	2018-10-18 16:12:28 +02:00
Willy Tarreau	7e9c4ae4de	MINOR: poller: move time and date computation out of the pollers By placing this code into time.h (tv_entering_poll() and tv_leaving_poll()) we can remove the logic from the pollers and prepare for extending this to offer more accurate time measurements.	2018-10-17 19:59:43 +02:00
Willy Tarreau	f37ba94768	MINOR: fd: centralize poll timeout computation in compute_poll_timeout() The 4 pollers all contain the same code used to compute the poll timeout. This is pointless, let's centralize this into fd.h. This also gets rid of the useless SCHEDULER_RESOLUTION macro which used to work around a very old linux 2.2 bug causing select() to wake up slightly before the timeout.	2018-10-17 19:59:43 +02:00
Willy Tarreau	e18db9e984	MEDIUM: pools: implement a thread-local cache for pool entries Each thread now keeps the last ~512 kB of freed objects into a local cache. There are some heuristics involved so that a specific pool cannot use more than 1/8 of the total cache in number of objects. Tests have shown that 512 kB is an optimal size on a 24-thread test running on a dual-socket machine, resulting in an overall 7.5% performance increase and a cache miss ratio reducing from 19.2 to 17.7%. Anyway it seems pointless to keep more than an L2 cache, which probably explains why sizes between 256 and 512 kB are optimal. Cached objects appear in two lists, one per pool and one LRU to help with fair eviction. Currently there is no way to check each thread's cache state nor to flush it. This cache cannot be disabled and is enabled as soon as the lockless pools are enabled (i.e.: threads are enabled, no pool debugging is in use and the CPU supports a double word CAS).	2018-10-16 13:46:08 +02:00
Willy Tarreau	146794dc4f	MINOR: pools: split pool_free() in the lockfree variant This separates the validity tests from the code committing the object to the pool, in order to ease insertion of the thread-local cache.	2018-10-16 10:29:28 +02:00
Willy Tarreau	0a93b6413f	MINOR: pools: allocate most memory pools from an array For caching it will be convenient to have indexes associated with pools, without having to dereference the pool itself. One solution could consist in replacing all pool pointers with integers but this would limit the number of allocatable pools. Instead here we allocate the 32 first pools from a pre-allocated array whose base address is known so that it's trivial to convert a pool to an index in this array. Pools that cannot fit there will be allocated normally.	2018-10-16 10:29:26 +02:00
Bertrand Jacquin	d5e4de8e5f	DOC: Fix a few typos these are mostly spelling mistakes, some of them might be candidate for backporting as well.	2018-10-15 19:38:15 +02:00
Willy Tarreau	8d8747abe0	OPTIM: tasks: group all tree roots per cache line Currently we have per-thread arrays of trees and counts, but these ones unfortunately share cache lines and are accessed very often. This patch moves the task-specific stuff into a structure taking a multiple of a cache line, and has one such per thread. Just doing this has reduced the cache miss ratio from 19.2% to 18.7% and increased the 12-thread test performance by 3%. It starts to become visible that we really need a process-wide per-thread storage area that would cover more than just these parts of the tasks. The code was arranged so that it's easy to move the pieces elsewhere if needed.	2018-10-15 19:06:13 +02:00
Willy Tarreau	b20aa9eef3	MAJOR: tasks: create per-thread wait queues Now we still have a main contention point with the timers in the main wait queue, but the vast majority of the tasks are pinned to a single thread. This patch creates a per-thread wait queue and queues a task to the local wait queue without any locking if the task is bound to a single thread (the current one) otherwise to the shared queue using locking. This significantly reduces contention on the wait queue. A test with 12 threads showed 11 ms spent in the WQ lock compared to 4.7 seconds in the same test without this change. The cache miss ratio decreased from 19.7% to 19.2% on the 12-thread test, and its performance increased by 1.5%. Another indirect benefit is that the average queue size is divided by the number of threads, which roughly removes log(nbthreads) levels in the tree and further speeds up lookups.	2018-10-15 19:04:40 +02:00
Willy Tarreau	87d54a9a6d	MEDIUM: fd/threads: only grab the fd's lock if the FD has more than one thread The vast majority of FDs are only seen by one thread. Currently the lock on FDs costs a lot because it's touched often, though there should be very little contention. This patch ensures that the lock is only grabbed if the FD is shared by more than one thread, since otherwise the situation is safe. Doing so resulted in a 15% performance boost on a 12-threads test.	2018-10-15 13:25:06 +02:00
Willy Tarreau	98d334bd94	MINOR: tools: add a new function atleast2() to test masks for more than 1 bit For threads it's common to have to check if a mask contains more than one bit set. Let's have this "atleast2()" function report this.	2018-10-15 13:25:06 +02:00
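A test for "more than one bit set" is commonly written by clearing the lowest set bit and checking whether anything remains. The sketch below shows that idiom; the name `mask_has_atleast2` is hypothetical and this is not necessarily how atleast2() is written in HAProxy.

```c
#include <stdbool.h>

/* Returns true if <mask> has two or more bits set.
 * (mask & (mask - 1)) clears the lowest set bit, so the result is
 * non-zero only when at least one other bit was set. A zero mask
 * yields false as well, since 0 & anything is 0. */
static inline bool mask_has_atleast2(unsigned long mask)
{
	return (mask & (mask - 1)) != 0;
}
```

With a thread mask, this distinguishes an FD owned by a single thread from one shared by several, without counting all the bits.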
Willy Tarreau	d944344f01	BUILD: peers: check allocation error during peers_init_sync() peers_init_sync() doesn't check task_new()'s return value and doesn't return any result to indicate success or failure. Let's make it return an int and check it from the caller. This can be backported as far as 1.6.	2018-10-15 13:24:43 +02:00
Willy Tarreau	8d26f02e69	BUILD: compiler: add a new statement "__unreachable()" This statement is used as a hint for the compiler so that it knows that the location where it's placed cannot be reached. It will mostly be used after longjmp() or equivalent statements that deal with error processing and that the compiler doesn't know will not return on certain conditions, so that it doesn't complain about null dereferences on error paths.	2018-10-15 13:24:43 +02:00
Willy Tarreau	c1f40b38a6	MINOR: chunk: add chunk_cpy() and chunk_cat() Sometimes we need to concatenate constant chunks to existing ones, but no function currently exists to do this easily, hence these two new ones.	2018-10-12 16:58:01 +02:00
Christopher Faulet	25da9e34f1	MINOR: h1: Add the flag H1_MF_NO_PHDR to not add pseudo-headers during parsing Some pseudo-headers are added during the headers parsing, mainly for the mux H2. With this flag, it is possible to not add them. This avoids some boring filtering in the mux H1.	2018-10-12 16:15:18 +02:00
Christopher Faulet	1dc2b49556	MINOR: h1: Change the union h1_sl to use indirect strings to store infos Instead of using offsets relating to the parsed buffer to store start line infos, we now use indirect strings. So now, these infos remain valid only if the origin buffer remains untouched. But it's not a real problem because this union is used during the parsing and never stored to a later use.	2018-10-12 16:14:57 +02:00
Christopher Faulet	08088e77c6	MINOR: conn-stream: Add CL_FL_NOT_FIRST flag This flag will be used by multiplexers to warn a conn-stream (and, by transitivity, a stream) it is not the first one created by the mux. It will help mux H1 to handle keep-alive connections.	2018-10-12 16:09:26 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" has been removed. Now, "http_header_add_tail2" is always used. And the function "capture_headers" has been renamed to "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Christopher Faulet	702226c827	MINOR: stats: Add missing include "proto/stats.h" must include "types/stats.h".	2018-10-12 16:00:32 +02:00
Christopher Faulet	7e266c7936	MINOR: http: Move comment about some HTTP macros in the right header file HTTP_FLG_* and HTTP_IS_* were moved from "proto/proto_http.h" to "common/http.h" but the associated comment was forgotten during the move. This is 1.9-specific and should not be backported.	2018-10-12 16:00:24 +02:00
Olivier Houchard	4fdec7aafa	BUG/MEDIUM: stream: Make sure to unsubscribe before si_release_endpoint. Make sure we unsubscribe from events before si_release_endpoint destroys the conn_stream, or it will be never called. To do so, move the call to unsubscribe to si_release_endpoint() directly. This is 1.9-specific and shouldn't be backported.	2018-10-11 17:16:43 +02:00
Olivier Houchard	fa8aa867b9	MEDIUM: connections: Change struct wait_list to wait_event. When subscribing, we don't need to provide a list element, only the h2 mux needs it. So instead, Add a list element to struct h2s, and use it when a list is needed. This forces us to use the unsubscribe method, since we can't just unsubscribe by using LIST_DEL anymore. This patch is larger than it should be because it includes some renaming.	2018-10-11 15:34:39 +02:00
Olivier Houchard	83a0cd8a36	MINOR: connections: Introduce an unsubscribe method. As we don't know how subscriptions are handled, we can't just assume we can use LIST_DEL() to unsubscribe, so introduce a new method to mux and connections to do so.	2018-10-11 15:34:21 +02:00

3310 Commits