haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-01-31 10:31:46 +00:00

Author	SHA1	Message	Date
Olivier Houchard	88698d966d	MEDIUM: connections: Add a way to control the number of idling connections. As by default we add all keepalive connections to the idle pool, if we run into a pathological case, where the clients don't do keepalive, but the server does, and haproxy is configured to only reuse "safe" connections, we will soon find ourselves having lots of idling connections, unusable for new sessions, while we won't have any file descriptors available to create new connections. To fix this, add 2 new global settings, "pool-low-fd-ratio" and "pool-high-fd-ratio". pool-low-fd-ratio is the % of fds we're allowed to use (against the maximum number of fds available to haproxy) before we stop adding connections to the idle pool, and destroy them instead. The default is 20. pool-high-fd-ratio is the % of fds we're allowed to use (against the maximum number of fds available to haproxy) before we start killing idling connections in the event we have to create a new outgoing connection, and no reuse is possible. The default is 25.	2019-04-18 19:52:03 +02:00
Olivier Houchard	7c49d2e213	MINOR: fd: Add a counter of used fds. Add a new counter, ha_used_fds, that lets us know how many file descriptors we're currently using.	2019-04-18 19:19:59 +02:00
Ilya Shipitsin	8a9d55bb9b	MEDIUM: enable travis-ci builds currently only xenial/clang build is enabled. osx and xenial/gcc will be enabled later. travis-ci is cloud based continuous integration, builds will be started automatically if they are enabled for certain repo or fork. Signed-off-by: Ilya Shipitsin <chipitsine@gmail.com>	2019-04-18 18:39:55 +02:00
Emeric Brun	0bbec0fa34	MINOR: peers: adds counters on show peers about tasks calls. This patch adds a counter of calls on the orchestrator peers task and a counter on the tasks linked to applet i/o handler for each peer. Those two counters are useful to detect if a peer sync is active or frozen. This patch is related to the commit: "MINOR: peers: Add a new command to the CLI for peers." and should be backported with it.	2019-04-18 18:24:25 +02:00
Olivier Houchard	66a7b3302a	BUILD/medium: ssl: Fix build with OpenSSL < 1.1.0 Make sure it builds with OpenSSL < 1.1.0, a lot of the BIO_get/set methods were introduced with OpenSSL 1.1.0, so fall back to the old way of doing things if needed.	2019-04-18 15:58:58 +02:00
Olivier Houchard	a8955d57ed	MEDIUM: ssl: provide our own BIO. Instead of letting the OpenSSL code handle the file descriptor directly, provide a custom BIO, that will use the underlying XPRT to send/recv data. This will let us implement QUIC later, and probably clean the upper layer, if/when the SSL code provides its own subscribe code, so that the upper layers won't have to care if we're still waiting for the handshake to complete or not.	2019-04-18 14:56:24 +02:00
Olivier Houchard	e179d0e88f	MEDIUM: connections: Provide a xprt_ctx for each xprt method. For most of the xprt methods, provide a xprt_ctx. This will be useful later when we'll want to be able to stack xprts. The init() method now has to create and provide the said xprt_ctx if needed.	2019-04-18 14:56:24 +02:00
Olivier Houchard	df35784600	MEDIUM: ssl: provide its own subscribe/unsubscribe function. In order to prepare for the possibility of using different kinds of xprt with ssl, make the ssl code provide its own subscribe and unsubscribe functions. Right now it just calls conn_subscribe and conn_unsubscribe.	2019-04-18 14:56:24 +02:00
Olivier Houchard	7b5fd1ec26	MEDIUM: connections: Move some fields from struct connection to ssl_sock_ctx. Move xprt_st, tmp_early_data and sent_early_data from struct connection to struct ssl_sock_ctx, as they are only used in the SSL code.	2019-04-18 14:56:24 +02:00
Olivier Houchard	66ab498f26	MEDIUM: ssl: Give ssl_sock its own context. Instead of using directly a SSL * as xprt_ctx, give ssl_sock its own context. It's useless for now, but will be useful later when we'll want to be able to stack xprts.	2019-04-18 14:56:24 +02:00
Olivier Houchard	ed1a6a0d8a	MEDIUM: tasks: Use __ha_barrier_store after modifying global_tasks_mask. Now that we no longer use atomic operations to update global_tasks_mask, as it's always modified while holding the TASK_RQ_LOCK, we have to use __ha_barrier_store() instead of __ha_barrier_atomic_store() to ensure any modification of global_tasks_mask is seen before modifying active_tasks_mask. This should be backported to 1.9.	2019-04-18 14:14:10 +02:00
Willy Tarreau	d83b6c1ab3	BUG/MINOR: mworker: disable busy polling in the master process When enabling busy polling, we don't want the master to use it, or it wastes a dedicated processor to this! Must be backported to 1.9.	2019-04-18 11:34:41 +02:00
Christopher Faulet	769a92d86d	MINOR: contrib/prometheus-exporter: Follow best practices about metrics type In short, _total metrics are now counters and others are gauges. No backport needed. See issue #81 on github.	2019-04-18 10:27:16 +02:00
Christopher Faulet	8c8e4b1263	MINOR: contrib/prometheus-exporter: Rename some metrics to be more usable Some metrics have been renamed and their type adapted to be more usable in Prometheus: * haproxy_process_uptime_seconds -> haproxy_process_start_time_seconds * haproxy_process_max_memory -> haproxy_process_max_memory_bytes * haproxy_process_pool_allocated_total -> haproxy_process_pool_allocated_bytes * haproxy_process_pool_used_total -> haproxy_process_pool_used_bytes * haproxy_process_ssl_cache_lookups -> haproxy_process_ssl_cache_lookups_total * haproxy_process_ssl_cache_misses -> haproxy_process_ssl_cache_misses_total No backport needed. See issue #81 on github.	2019-04-18 10:27:16 +02:00
Christopher Faulet	c58fc0dec9	MINOR: contrib/prometheus-exporter: Remove useless rate metrics Following metrics have been removed: * haproxy_frontend_connections_rate_current (ST_F_CONN_RATE) * haproxy_frontend_http_requests_rate_current (ST_F_REQ_RATE) * haproxy_*_current_session_rate (ST_F_RATE) These rates can be deduced using the total value with this kind of formula: rate(haproxy_frontend_connections_total[1m]) No backport needed. See issue #81 on github.	2019-04-18 10:27:16 +02:00
Christopher Faulet	f782c23ec6	BUG/MINOR: contrib/prometheus-exporter: Fix a typo in the run-queue metric type No backport needed.	2019-04-18 10:27:16 +02:00
Olivier Houchard	1cfac37b65	MEDIUM: tasks: Don't account for a destroyed task as a task that ran. In process_runnable_tasks(), if the task we're about to run has been destroyed, and should be freed, don't account for it in the number of tasks we ran. We're only allowed a maximum number of tasks to run per call to process_runnable_tasks(), and freeing one shouldn't take the slot of a valid task.	2019-04-18 10:11:13 +02:00
Olivier Houchard	3f795f76e8	MEDIUM: tasks: Merge task_delete() and task_free() into task_destroy(). task_delete() was never used without calling task_free() just after, and task_free() was only used on error paths to destroy a just-created task, so merge them into task_destroy(), that will remove the task from the wait queue, and make sure the task is either destroyed immediately if it's not in the run queue, or destroyed when it's supposed to run.	2019-04-18 10:10:04 +02:00
Willy Tarreau	03dd029a5b	CLEANUP: task: remain consistent when using the task's handler A pointer "process" is assigned the task's handler in process_runnable_tasks(), we have no reason to use t->process right after it is assigned.	2019-04-17 22:32:27 +02:00
Willy Tarreau	8c12e2f785	MINOR: task/thread: factor out a wake-up condition The wakeup condition in task_wakeup() is redundant as it is already validated by the CAS. Better move the __task_wakeup() call there, it also has the merit of being easier to audit this way. This also reduces the code size by around 1.8 kB : $ size haproxy-? text data bss dec hex filename 2153806 100208 1307676 3561690 3658da haproxy-1 2152094 100208 1307676 3559978 36522a haproxy-2	2019-04-17 22:15:58 +02:00
Willy Tarreau	a70bfaaf8b	BUG/MAJOR: task: make sure never to delete a queued task Commit `0c7a4b6` ("MINOR: tasks: Don't set the TASK_RUNNING flag when adding in the tasklet list.") revealed a hole in the way tasks may be freed : they could be removed while in the run queue when the TASK_QUEUED flag was present but not the TASK_RUNNING one. But it seems the issue was emphasized by commit `cde7902` ("MEDIUM: tasks: improve fairness between the local and global queues") though the code it replaces was already affected given how late the TASK_RUNNING flag was set after removal from the global queue. At the moment the task is picked from the global run queue, if it is the last one, the global run queue lock is dropped, and then the TASK_RUNNING flag was added. In the mean time another thread might have performed a task_free(), and immediately after, the TASK_RUNNING flag was re-added to the task, which was then added to the tasklet list. The unprotected window was extremely faint but does definitely exist and inconsistent task lists have been observed a few times during very intensive tests over the last few days. From this point various options are possible, the task might have been re-allocated while running, and assigned state 0 and/or state QUEUED while it was still running, resulting in the task not being put back into the tree. This commit simply makes sure that tests on TASK_RUNNING before removing the task also cover TASK_QUEUED. It must be backported to 1.9 along with the previous ones touching that area.	2019-04-17 22:15:58 +02:00
Olivier Houchard	51205a1958	BUG/MEDIUM: applets: Don't use task_in_rq(). When deciding if we want to wake the task of an applet up, don't give up if task_in_rq returns 1, as there's a race condition and another thread may run it. Instead, always attempt to task_wakeup(), at worst the task is already in the run queue, and nothing will happen.	2019-04-17 19:30:23 +02:00
Olivier Houchard	0c7a4b6371	MINOR: tasks: Don't set the TASK_RUNNING flag when adding in the tasklet list. Now that TASK_QUEUED is enforced, there's no need to set TASK_RUNNING when removing the task from the runqueue to add it to the tasklet list. The flag will only be set right before we run the task.	2019-04-17 19:28:01 +02:00
Olivier Houchard	4a1be0c6d6	MEDIUM: tasks: No longer use rq.node.leaf_p as a lock. Now that we have the guarantee that a task won't be added in the runqueue while the TASK_QUEUED or the TASK_RUNNING flag is set, don't bother trying to lock the task by setting leaf_p to 0x1 while inserting it in the runqueue or having it in the tasklet_list, as nobody else will attempt to add it.	2019-04-17 19:28:01 +02:00
Olivier Houchard	5c964f7b42	MINOR: tasks: Don't consider we can wake task with tasklet_wakeup(). In tasklet_wakeup(), don't bother checking if the tasklet is really a task, calling tasklet_wakeup() with a task is invalid.	2019-04-17 19:28:01 +02:00
Olivier Houchard	de82aeaa26	BUG/MEDIUM: tasks: Make sure we modify global_tasks_mask with the rq_lock. When modifying global_tasks_mask, make sure we hold the rq_lock, or we might remove the bit while it has been re-set by somebody else, and we may not be woken up when needed.	2019-04-17 19:28:01 +02:00
Willy Tarreau	b038007ae8	BUG/MEDIUM: tasks: Make sure we set TASK_QUEUED before adding a task to the rq. Make sure we set TASK_QUEUED in every case before adding the task to the run queue. task_wakeup() now checks if either TASK_QUEUED or TASK_RUNNING is set, and if neither is set, add TASK_QUEUED and effectively add the task to the runqueue. No longer use __task_wakeup() anywhere except in task_wakeup(), always use task_wakeup() instead. With the old code, process_runnable_task() may re-add a task in the runqueue without setting the TASK_QUEUED flag, and there were race conditions that could lead to a task having the TASK_QUEUED flag but not in the runqueue, thus being unschedulable. This should be backported to 1.9.	2019-04-17 19:28:01 +02:00
Christopher Faulet	46575cd392	BUG/MINOR: http_fetch/htx: Use HTX versions if the proxy enables the HTX mode Because the HTX is now the default mode for all proxies (HTTP and TCP), it is better to match on the proxy options to know if the HTX is enabled or not. This way, if a TCP proxy explicitly disables the HTX mode, the legacy version of HTTP fetches will be used. No backport needed except if the patch activating the HTX by default for all proxies is backported.	2019-04-17 15:12:27 +02:00
Christopher Faulet	5ec8bcb021	BUG/MINOR: http_fetch/htx: Allow permissive sample prefetch for the HTX As for smp_prefetch_http(), there is now a way to successfully perform a prefetch in HTX, even if the message forwarding has already begun. It is used for the sample fetches "req.proto_http" and "method". This patch must be backported to 1.9.	2019-04-17 15:12:27 +02:00
Christopher Faulet	89dc499359	BUG/MAJOR: http_fetch: Get the channel depending on the keyword used All HTTP samples are buggy because the channel tested in the prefetch functions (HTX and legacy HTTP) is chosen depending on the sample direction and not the keyword really used. It means the request channel is used if the sample is called during the request analysis and the response channel is used if it is called during the response analysis, regardless of the sample really called. For instance, if you use the sample "req.ver" in an http-response rule, the response channel will be prefetched because it is called during the response analysis, while the request channel should have been used instead. So some assumptions on the validity of the sample may be made on the wrong channel. It is the first bug. Then the same error is made in some samples themselves. So fetches are performed on the wrong channel. For instance, the header extraction (req.fhdr, res.fhdr, req.hdr, res.hdr...). If the sample "req.hdr" is used in an http-response rule, then the matching is done on the response headers and not the request ones. It is the second bug. Finally, last but not least, in some samples, the right channel is used. But because the prefetch was done on the wrong one, this channel may be in an undefined state. For instance, using the sample "req.ver" in an http-response rule leads to a matching on a possibly released buffer. To fix all these bugs, the right channel is now chosen in sample fetches, before the prefetch. If the same function is used to fetch request and response elements, then the keyword is used to choose the right one. This channel is then used by the functions smp_prefetch_htx() and smp_prefetch_http(). Of course, it is also used by the samples themselves to extract information. This patch must be backported to all supported versions. For version 1.8 and prior, it must be totally refactored. First because there is no HTX in these versions. Then the buffers API has changed in HAProxy 1.9. The files http_fetch.{ch} don't exist on old versions.	2019-04-17 15:12:27 +02:00
Christopher Faulet	3a4d1bea61	BUG/MEDIUM: htx: Don't return the start-line if the HTX message is empty In the function htx_get_stline(), NULL must be returned if the HTX message doesn't contain any element. This patch must be backported to 1.9.	2019-04-17 15:12:27 +02:00
Christopher Faulet	038ad8123b	MINOR: mux-h1: Handle read0 during TCP splicing It avoids a roundtrip with underlying I/O callbacks to do so. If a read0 is handled at the end of h1_rcv_pipe(), the flag CS_FL_REOS is set on the conn_stream. And if there is no data in the pipe, the flag CS_FL_EOS is also set. This patch may be backported to 1.9.	2019-04-17 14:52:31 +02:00
Christopher Faulet	e18777b79d	BUG/MEDIUM: mux-h1: Enable TCP splicing to exchange data only Use the TCP splicing only when the input parser is in the state H1_MSG_DATA or H1_MSG_TUNNEL and don't transfer more than the known expected length for these data (unlimited for the tunnel mode). In other states or when all data are transferred, the TCP splicing is disabled. This patch must be backported to 1.9.	2019-04-17 14:52:31 +02:00
Christopher Faulet	f7d5ff37e0	BUG/MEDIUM: mux-h1: Notify the stream waiting for TCP splicing if ibuf is empty When a stream-interface wants to use the TCP splicing to forward its data, it notifies the mux h1. We will then flush the input buffer and don't read more data. So the stream-interface will not be notified for read anymore, except if an error or a read0 is detected. It is a problem every time the receive I/O callback is called again. It happens when the pipe is full or when no data are received on the pipe. It also happens when the input buffer is freshly flushed. Because the TCP splicing is enabled, nothing is done in h1_recv() and the stream-interface is never woken up. So, now, in h1_recv(), if the TCP splicing is used and the input buffer is empty, the stream-interface is notified for read. This patch must be backported to 1.9.	2019-04-17 14:52:31 +02:00
Christopher Faulet	2f320ee59c	BUG/MINOR: mux-h1: Don't switch the parser in busy mode if other side has done There is no reason to switch the input parser in busy mode if all the output has been processed. This patch must be backported to 1.9.	2019-04-17 14:52:31 +02:00
Christopher Faulet	91f77d5999	BUG/MINOR: mux-h1: Process input even if the input buffer is empty It is required, at least, to add the EOM block and finish the message when the TCP splicing was used to send all data. Otherwise, there is no way to finish the parsing. This patch must be backported to 1.9.	2019-04-17 14:52:31 +02:00
Ilya Shipitsin	9ab3138d71	REGTESTS: exclude tests that require ssl, pcre if no such feature is enabled Signed-off-by: Ilya Shipitsin <chipitsine@gmail.com>	2019-04-17 11:01:58 +02:00
William Lallemand	74f0ec3894	BUG/MINOR: mworker: ensure that we still quit with SIGINT Since the fix "BUG/MINOR: mworker: don't exit with an ambiguous value" we are leaving with an EXIT_SUCCESS upon a SIGINT. We still need to quit with a SIGINT when a worker leaves with a SIGINT. This is done this way because vtest expects a 130 during the process stop, haproxy without mworker returns a 130, so it should be the same in mworker mode. This should be backported in 1.9, with the previous patch ("BUG/MINOR: mworker: don't exit with an ambiguous value"). Code has moved, mworker_catch_sigchld() is in haproxy.c.	2019-04-16 18:14:29 +02:00
William Lallemand	4cf4b33744	BUG/MINOR: mworker: don't exit with an ambiguous value When the sigchld handler is called and waitpid() returns -1, the behavior of waitpid() with the status variable is undefined. It is not a good idea to exit with the value contained in it. Since this exit path does not use the exitcode variable, it means that this is an expected and successful exit. This should be backported in 1.9, code has moved, mworker_catch_sigchld() is in haproxy.c.	2019-04-16 18:14:29 +02:00
William Lallemand	32b6901550	BUG/MINOR: mworker: mworker_kill should apply to all children Commit `3f12887` ("MINOR: mworker: don't use children variable anymore") introduced a regression. The previous behavior was to send a signal to all children, whether or not they are former children. Instead of this, we only send a signal to the current children, so we don't try to kill -INT or -TERM all processes during a reload. No backport needed.	2019-04-16 18:14:29 +02:00
Willy Tarreau	85d0424b20	BUG/MINOR: listener/mq: correctly scan all bound threads under low load When iterating on the CLI using "show activity" and no other load, it was visible that the last thread was always skipped. This was caused by the way the thread bits were walking : t1 was updated after t2 to make sure it never equals t2 (thus it skips t2), and in case of a tie we choose t1. This results in the chosen thread never to equal t2 unless the other ones already have one connection. In addition to this, t2 was recalculated upon each pass due to the fact that only the 31st bit was looked at instead of looking at the t2'th bit. This patch fixes this by updating t2 after t1 so that t1 is free to walk over all positions under equal load. No measurable performance gains are expected from this though, but it at least removes one strange indicator which could lead to some suspicion. No backport is needed.	2019-04-16 18:09:13 +02:00
Willy Tarreau	636848aa86	MINOR: init: add a "set-dumpable" global directive to enable core dumps It's always a pain to get a core dump when enabling user/group setting (which disables the dumpable flag on Linux), when using a chroot and/or when haproxy is started by a service management tool which requires complex operations to just raise the core dump limit. This patch introduces a new "set-dumpable" global directive to work around these troubles by doing the following : - remove file size limits (equivalent of ulimit -f unlimited) - remove core size limits (equivalent of ulimit -c unlimited) - mark the process dumpable again (equivalent of suid_dumpable=1) Some of these will depend on the operating system. This way it becomes much easier to retrieve a core file. Temporarily moving the chroot to a user-writable place is generally enough.	2019-04-16 14:31:23 +02:00
William Lallemand	482f9a9a2f	MINOR: mworker: export HAPROXY_MWORKER=1 when running in mworker mode Export HAPROXY_MWORKER=1 in an environment variable when running in mworker mode.	2019-04-16 13:26:43 +02:00
William Lallemand	620072bc0d	MINOR: cli: don't add a semicolon at the end of HAPROXY_CLI Only add the semicolon when there are several CLIs in HAPROXY_CLI and HAPROXY_MASTER_CLI.	2019-04-16 13:26:43 +02:00
William Lallemand	9a37fd0f19	MEDIUM: mworker/cli: export the HAPROXY_MASTER_CLI variable It works the same way as the HAPROXY_CLI variable, it exports the listeners addresses separated by semicolons.	2019-04-16 13:26:43 +02:00
William Lallemand	8f7069a389	CLEANUP: mworker: remove the type field in mworker_proc Since the introduction of the options field, we can use it to store the type of process. type = 'm' is replaced by PROC_O_TYPE_MASTER type = 'w' is replaced by PROC_O_TYPE_WORKER type = 'e' is replaced by PROC_O_TYPE_PROG The old values are still used in the HAPROXY_PROCESSES environment variable to pass the information during a reload.	2019-04-16 13:26:43 +02:00
William Lallemand	bd3de3efb7	MEDIUM: mworker-prog: implements 'option start-on-reload' This option is already the default, but its opposite 'no option start-on-reload' allows the master to keep a previous instance of a program and not start a new one upon a reload. The old program will then appear as a current one in "show proc" and could also trigger an exit-on-failure upon a segfault.	2019-04-16 13:26:43 +02:00
William Lallemand	4528611ed6	MEDIUM: mworker: store the leaving state of a process Previously we were assuming that a process was in a leaving state when its number of reloads was greater than 0. With mworker programs it's not the case anymore so we need to store a leaving state.	2019-04-16 13:26:43 +02:00
Willy Tarreau	9df86f997e	BUG/MAJOR: lb/threads: fix insufficient locking on round-robin LB Maksim Kupriianov reported very strange crashes in fwrr_update_position() which didn't make sense because of an apparent divide overflow except that the value was not null in the core. It happens that while the locking is correct in all the functions' call graph, the uppermost one (fwrr_get_next_server()) incorrectly expected that its target server was already locked when called. This stupid assumption caused the server lock not to be held when calling the other ones, explaining how it was possible to change the server's eweight by calling srv_lb_commit_status() under the server lock yet collide with its unprotected usage. This commit makes sure that fwrr_get_server_from_group() retrieves a locked server and that fwrr_get_next_server() is responsible for unlocking the server before returning it. There is one subtlety in this function which is that it builds a list of avoided servers that were full while scanning the tree, and all of them are queued in a full state so they must be unlocked upon return. Many thanks to Maksim for providing detailed info allowing to narrow down this bug. This fix must be backported to 1.9. In 1.8 the lock seems much wider and changes to the server's state are performed under the rendez-vous point so it doesn't seem possible that it happens there.	2019-04-16 11:21:14 +02:00
Frédéric Lécaille	21dde5053a	DOC: update for "show peers" CLI command. Add the documentation for the new "show peers" CLI command which comes with this commit "MINOR: peers: Add a new command to the CLI for peers.".	2019-04-16 09:58:40 +02:00
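
Note on 88698d966d: the two thresholds described in that commit can be read as a pair of percentage checks against the fd limit. The C sketch below is only an illustration under stated assumptions: the struct and function names are hypothetical and are not taken from the haproxy sources, and whether the real comparisons are strict or inclusive is a guess. Only the defaults (20 and 25) come from the commit message.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical container for the settings described in 88698d966d. */
struct fd_limits {
    unsigned int max_fds;        /* maximum number of fds available to the process */
    unsigned int low_fd_ratio;   /* pool-low-fd-ratio, in percent (default 20) */
    unsigned int high_fd_ratio;  /* pool-high-fd-ratio, in percent (default 25) */
};

/* Number of fds that corresponds to a given percentage of the maximum. */
static unsigned int fd_threshold(unsigned int max_fds, unsigned int ratio)
{
    assert(ratio <= 100);
    return (unsigned int)(((unsigned long long)max_fds * ratio) / 100);
}

/* Below the low threshold, a keepalive connection may join the idle pool;
 * at or above it (assumed inclusive), the connection is destroyed instead. */
bool may_add_to_idle_pool(const struct fd_limits *l, unsigned int used_fds)
{
    return used_fds < fd_threshold(l->max_fds, l->low_fd_ratio);
}

/* At or above the high threshold (assumed inclusive), an idle connection
 * should be killed when a new outgoing connection is needed and no reuse
 * is possible. */
bool should_kill_idle_conn(const struct fd_limits *l, unsigned int used_fds)
{
    return used_fds >= fd_threshold(l->max_fds, l->high_fd_ratio);
}
```

With a limit of 1000 fds and the default ratios, this sketch stops pooling at 200 used fds and starts killing idle connections at 250.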
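
Note on b038007ae8 and a70bfaaf8b: the wake-up rule described in those commits (queue a task only when neither TASK_QUEUED nor TASK_RUNNING is set, and never free a task while either flag is set) can be sketched with a compare-and-swap loop. This is a minimal sketch using C11 atomics, not the haproxy implementation: the flag names come from the commit messages, but their values, the struct layout and the function names are assumptions made for illustration.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Flag names appear in the commit messages; the values are assumed. */
#define TASK_RUNNING 0x0001u
#define TASK_QUEUED  0x0002u

/* Hypothetical, reduced task: only the state word matters here. */
struct sketch_task {
    atomic_uint state;
};

/* Try to mark the task as queued. Returns true only for the single caller
 * that set TASK_QUEUED, which is then responsible for inserting the task
 * into the run queue. Returns false if it was already queued or running. */
bool sketch_task_wakeup(struct sketch_task *t)
{
    unsigned int old;

    assert(t != NULL);
    old = atomic_load(&t->state);
    do {
        if (old & (TASK_QUEUED | TASK_RUNNING))
            return false;
        /* On failure the CAS reloads 'old', so the check above is redone. */
    } while (!atomic_compare_exchange_weak(&t->state, &old, old | TASK_QUEUED));
    return true;
}

/* A task may be released immediately only when it is neither queued nor
 * running; otherwise its destruction has to be deferred. */
bool sketch_task_can_free_now(struct sketch_task *t)
{
    assert(t != NULL);
    return (atomic_load(&t->state) & (TASK_QUEUED | TASK_RUNNING)) == 0;
}
```

Because the flag is set by the same atomic operation that checks it, two threads calling the wake-up function concurrently cannot both believe they have to insert the task.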
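
Note on c58fc0dec9: the removed rate metrics can be derived from the corresponding _total counters, as the formula in that commit suggests. The C sketch below shows the basic idea with two samples of a counter. It is a simplification with hypothetical names: the PromQL rate() function also handles counter resets and extrapolates over the requested window, which this sketch does not attempt.

```c
#include <assert.h>

/* One scrape of a monotonically increasing counter (hypothetical type). */
struct counter_sample {
    double timestamp;  /* seconds */
    double value;      /* counter value at that time */
};

/* Average per-second increase of the counter between two samples. */
double per_second_rate(struct counter_sample earlier, struct counter_sample later)
{
    assert(later.timestamp > earlier.timestamp);
    return (later.value - earlier.value) / (later.timestamp - earlier.timestamp);
}
```

For example, a counter that goes from 100 to 160 over 60 seconds yields a rate of one event per second.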

9573 Commits