haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-19 01:54:37 +00:00

Author	SHA1	Message	Date
David Carlier	a92c5cec2d	BUILD/MEDIUM: threads: rename thread_info struct to ha_thread_info On Darwin, the thread_info name exists as a standard function thus we need to rename our array to ha_thread_info to fix this conflict.	2019-10-17 07:15:17 +02:00
Willy Tarreau	9a1f57351d	MEDIUM: threads: add thread_sync_release() to synchronize steps This function provides an alternate way to leave a critical section run under thread_isolate(). Currently, a thread may remain in thread_release() without having the time to notice that the rdv mask was released and taken again by another thread entering thread_isolate() (often the same that just released it). This is because threads wait in harmless mode in the loop, which is compatible with the conditions to enter thread_isolate(). It's not possible to make them wait with the harmless bit off or we cannot know when the job is finished for the next thread to start in thread_isolate(), and if we don't clear the rdv bit when going there, we create another race on the start point of thread_isolate(). This new synchronous variant of thread_release() makes use of an extra mask to indicate the threads that want to be synchronously released. In this case, they will be marked harmless before releasing their sync bit, and will wait for others to release their bit as well, guaranteeing that thread_isolate() cannot be started by any of them before they all left thread_sync_release(). This allows to construct synchronized blocks like this : thread_isolate() /* optionally do something alone here */ thread_sync_release() /* do something together here */ thread_isolate() /* optionally do something alone here */ thread_sync_release() And so on. This is particularly useful during initialization where several steps have to be respected and no thread must start a step before the previous one is completed by other threads. This one must not be placed after any call to thread_release() or it would risk to block an earlier call to thread_isolate() which the current thread managed to leave without waiting for others to complete, and end up here with the thread's harmless bit cleared, blocking others. This might be improved in the future.	2019-06-10 09:42:43 +02:00
Willy Tarreau	31cba0d3e0	MINOR: threads: avoid clearing harmless twice in thread_release() thread_release() is to be called after thread_isolate(), i.e. when the thread already has its harmless bit cleared. No need to clear it twice, thus avoid calling thread_harmless_end() and directly check the rdv bits then loop on them.	2019-06-09 08:47:35 +02:00
Willy Tarreau	2beaaf7d46	MINOR: threads: implement ha_tkill() and ha_tkillall() These functions are used respectively to signal one thread or all threads. When multithreading is disabled, it's always the current thread which is signaled.	2019-05-22 11:50:48 +02:00
Willy Tarreau	05ed14cfc4	CLEANUP: threads: really move thread_info to hathreads.c Commit `5a6e2245f` ("REORG: threads: move the struct thread_info from global.h to hathreads.h") didn't hold its promise well, as the thread_info struct was still declared and initialized in haproxy.c in addition to being in hathreads.c. Let's move it for real now.	2019-05-22 11:50:48 +02:00
Willy Tarreau	8323a375bc	MINOR: threads: add a thread-local thread_info pointer "ti" Since we're likely to access this thread_info struct more frequently in the future, let's reserve the thread-local symbol to access it directly and avoid always having to combine thread_info and tid. This pointer is set when tid is set.	2019-05-20 21:14:12 +02:00
Willy Tarreau	5a6e2245fa	REORG: threads: move the struct thread_info from global.h to hathreads.h It doesn't make sense to keep this struct thread_info in global.h, it causes difficulties to access its contents from hathreads.h, let's move it to the threads where it ought to have been created.	2019-05-20 20:00:25 +02:00
Willy Tarreau	38171daf21	MINOR: thread: implement ha_thread_relax() At some places we're using a painful ifdef to decide whether to use sched_yield() or pl_cpu_relax() to relax in loops, this is hardly exportable. Let's move this to ha_thread_relax() instead and use this one only.	2019-05-17 17:16:20 +02:00
Olivier Houchard	46453d3f7d	MINOR: threads: Implement thread_cpus_enabled() for FreeBSD. Use cpuset_getaffinity() to implement thread_cpus_enabled() on FreeBSD, so that we can know the number of CPUs available, and automatically launch as many threads if nbthread isn't specified.	2019-04-11 00:09:22 +02:00
Olivier Houchard	b23a61f78a	MEDIUM: threads: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Willy Tarreau	149ab779cc	MAJOR: threads: enable one thread per CPU by default Threads have long matured by now, still for most users their usage is not trivial. It's about time to enable them by default on platforms where we know the number of CPUs bound. This patch does this, it counts the number of CPUs the process is bound to upon startup, and enables as many threads by default. Of course, "nbthread" still overrides this, but if it's not set the default behaviour is to start one thread per CPU. The default number of threads is reported in "haproxy -vv". Simply using "taskset -c" is now enough to adjust this number of threads so that there is no more need for playing with cpu-map. And thanks to the previous patches on the listener, the vast majority of configurations will not need to duplicate "bind" lines with the "process x/y" statement anymore either, so a simple config will automatically adapt to the number of processors available.	2019-02-27 14:51:50 +01:00
Willy Tarreau	fc647360e0	CLEANUP: threads: use nbits to calculate the thread mask It's pointless to do arithmetics by hand, we have a function for this.	2019-02-02 17:48:39 +01:00
Willy Tarreau	f5809cde7a	MINOR: threads: make MAX_THREADS configurable at build time There's some value in being able to limit MAX_THREADS, either to save precious resources in embedded environments, or to protect certain deployments against accidentally incorrect settings. With this patch, if MAX_THREADS is defined at build time, it will be used. However, given that LONGBITS is not a macro but is defined according to sizeof(long), we can't check the value range at build time and instead we need to perform the check at early boot time. However, the compiler is able to optimize away the constant comparisons and doesn't even emit the check code when values are correct. The output message regarding threading support was improved to report the number of threads.	2019-01-26 13:37:48 +01:00
Willy Tarreau	8459f25e03	BUILD: thread: properly report multi-thread support When refactoring the build option strings in 1.9, the thread support was placed outside of the ifdef block resulting in threads always being mentioned even if that was not true. Let's fix this and also mention when threads are disabled to help troubleshooting.	2018-12-15 16:48:14 +01:00
Willy Tarreau	8071338c78	MINOR: initcall: apply initcall to all register_build_opts() calls Most register_build_opts() calls use static strings. These ones were replaced with a trivial REGISTER_BUILD_OPTS() statement adding the string and its call to the STG_REGISTER section. A dedicated section could be made for this if needed, but there are very few such calls for this to be worth it. The calls made with computed strings however, like those which retrieve OpenSSL's version or zlib's version, were moved to a dedicated function to guarantee they are called late in the process. For example, the SSL call probably requires that SSL_library_init() has been called first.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a8ae77da61	MINOR: thread: provide a set of lock initialisers This patch adds ha_spin_init() and ha_rwlock_init() which are used as a callback to initialise locks at boot time. They perform exactly the same as HA_SPIN_INIT() or HA_RWLOCK_INIT() but from within a real function.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a9c0252b2e	BUG/MEDIUM: threads: fix thread_release() at the end of the rendez-vous point There is a bug in this function used to release other threads. It leaves the current thread marked as harmless. If after this another thread does a thread_isolate(), but before the first one reaches poll(), the second thread will believe it's alone while it's not. This must be backported to 1.8 since the rendez-vous point was merged into 1.8.14.	2018-10-16 17:03:16 +02:00
Willy Tarreau	647c70b681	MINOR: threads: remove the previous synchronization point It's not needed anymore as it is fully covered by the new rendez-vous point. This also removes the pipe and its polling.	2018-08-08 09:57:45 +02:00
Willy Tarreau	ab657ce251	BUG/MEDIUM: threads: fix the no-thread case after the change to the sync point In commit `0c026f4` ("MINOR: threads: add more consistency between certain variables in no-thread case"), we ensured that we don't have all_threads_mask zeroed anymore. But one test was missed for the write() to the sync pipe. This results in a situation where when running single-threaded, once a server status changes, a wake-up message is written to the pipe and never consumed, showing a 100% CPU usage. No backport is needed.	2018-08-07 10:07:15 +02:00
Willy Tarreau	60b639ccbe	MEDIUM: hathreads: implement a more flexible rendez-vous point The current synchronization point enforces certain restrictions which are hard to work around in certain areas of the code. The fact that the critical code can only be called from the sync point itself is a problem for some callback-driven parts. The "show fd" command for example is fragile regarding this. Also it is expensive in terms of CPU usage because it wakes every other thread just to be sure all of them join the rendez-vous point. It's a problem because the sleeping threads would not need to be woken up just to know they're doing nothing. Here we implement a different approach. We keep track of harmless threads, which are defined as those either doing nothing, or doing harmless things. The rendez-vous is used "for others" as a way for a thread to isolate itself. A thread then requests to be alone using thread_isolate() when approaching the dangerous area, and then waits until all other threads are either doing the same or are doing something harmless (typically polling). The function only returns once the thread is guaranteed to be alone, and the critical section is terminated using thread_release().	2018-08-02 17:51:45 +02:00
Willy Tarreau	0c026f49e7	MINOR: threads: add more consistency between certain variables in no-thread case When threads are disabled, some variables such as tid and tid_bit are still checked everywhere, the MAX_THREADS_MASK macro is ~0UL while MAX_THREADS is 1, and the all_threads_mask variable is replaced with a macro forced to zero. The compiler cannot optimize away all this code involving checks on tid and tid_bit, and we end up in special cases where all_threads_mask has to be specifically tested for being zero or not. It is not even certain the code paths are always equivalent when testing without threads and with nbthread 1. Let's change this to make sure we always present a single thread when threads are disabled, and have the relevant values declared as constants so that the compiler can optimize all the tests away. Now we have MAX_THREADS_MASK set to 1, all_threads_mask set to 1, tid set to zero and tid_bit set to 1. Doing just this has removed 4 kB of code in the no-thread case. A few checks for all_threads_mask==0 have been removed since it never happens anymore.	2018-08-02 17:48:09 +02:00
Willy Tarreau	0ccd32285f	MINOR: threads: move "nbthread" parsing to hathreads.c The purpose is to make sure that all variables which directly depend on this nbthread argument are set at the right moment. For now only all_threads_mask needs to be set. It used to be set while calling thread_sync_init() which is called too late for certain checks. The same function handles threads and non-threads, which removes the need for some thread-specific knowledge from cfgparse.c.	2018-07-30 11:10:46 +02:00
Olivier Houchard	3e12304ae0	BUG/MINOR: threads: Handle nbthread == MAX_THREADS. If nbthread is MAX_THREADS, the shift operation needed to compute all_threads_mask fails in thread_sync_init(). Instead pass a number of threads to this function and let it compute the mask without overflowing. This should be backported to 1.8.	2018-07-27 17:18:22 +02:00
Willy Tarreau	3ea2490b48	BUG/MEDIUM: threads/sync: use sched_yield when available There is a corner case with the sync point which can significantly degrade performance. The reason is that it forces all threads to busy spin there, and that if there are fewer CPUs available than threads, this busy activity from some threads will force others to wait longer in epoll() or to simply be scheduled out while doing something else, and will increase the time needed to reach the sync point. Given that the sync point is not expected to be stressed that much, better call sched_yield() while waiting there to release the CPU and offer it to waiting threads. On a simple test with 4 threads bound to two cores using "maxconn 1" on the server line, the performance was erratic before the recent scheduler changes (between 40 and 200 conn/s with hundreds of ms response time), and it jumped to 7200 with 12ms response time with this fix applied. It should be backported to 1.8 since 1.8 is affected as well.	2018-07-27 07:54:08 +02:00
Christopher Faulet	ddb6c16576	BUG/MEDIUM: threads: Fix the exit condition of the thread barrier In thread_sync_barrier, we exit when all threads have set their own bit in the barrier mask. It is done by comparing it to all_threads_mask. But we must not use a simple equality to do so, because all_threads_mask may change. Since commit `ba86c6c25` ("MINOR: threads: Be sure to remove threads from all_threads_mask on exit"), when a thread exits, its bit is removed from all_threads_mask. Instead, we must use a bitwise AND to test if all bits of all_threads_mask are set. This also requires that all_threads_mask is set to volatile if we want to catch changes. This patch must be backported in 1.8.	2018-07-20 14:24:41 +02:00
Olivier Houchard	6b96f7289c	BUG/MEDIUM: pollers: Use a global list for fd shared between threads. With the old model, any fd shared by multiple threads, such as listeners or dns sockets, would only be updated on one thread, so that could lead to missed events, or spurious wakeups. To avoid this, add a global list for fd that are shared, using the same implementation as the fd cache, and only remove entries from this list when every thread has updated its poller. [wt: this will need to be backported to 1.8 but differently so this patch must not be backported as-is]	2018-05-06 06:27:09 +02:00
Christopher Faulet	148b16e1ce	BUG/MEDIUM: threads: Fix the sync point for more than 32 threads In the sync point, to know if a thread has requested a synchronization, we call the function thread_need_sync(). It should return 1 if yes, otherwise it should return 0. It is intended to return a signed integer. But internally, instead of returning 0 or 1, it returns 0 or tid_bit (threads_want_sync & tid_bit). So tid_bit is cast to an integer. For the first 32 threads, it's ok, because we always check if thread_need_sync() returns something else than 0. But this is a problem if HAProxy is started with more than 32 threads, because for threads 33 to 64 (so for tid 32 to 63), their tid_bit, once cast to an integer, evaluates to 0. So the sync point does not work for more than 32 threads. Now, the function thread_need_sync() respects its contract, returning 0 or 1. The function thread_no_sync() has also been updated to avoid any ambiguities. This patch must be backported in HAProxy 1.8.	2018-05-02 17:58:36 +02:00
Willy Tarreau	e96e61cadc	BUILD/MINOR: threads: always export thread_sync_io_handler() Otherwise it doesn't build again without threads.	2018-03-29 18:54:33 +02:00
Willy Tarreau	4037a3f904	MINOR: cli/threads: make "show fd" report thread_sync_io_handler instead of "unknown" The output was confusing when the sync point's dummy handler was shown. This patch should be backported to 1.8 to help with troubleshooting.	2018-03-28 18:06:47 +02:00
Willy Tarreau	a9786b6f04	MINOR: fd: pass the iocb and owner to fd_insert() fd_insert() is currently called just after setting the owner and iocb, but proceeding like this prevents the operation from being atomic and requires a lock to protect the maxfd computation in another thread from meeting an incompletely initialized FD and computing a wrong maxfd. Fortunately for now all fdtab[].owner are set before calling fd_insert(), and the first lock in fd_insert() enforces a memory barrier so the code is safe. This patch moves the initialization of the owner and iocb to fd_insert() so that the function will be able to properly arrange its operations and remain safe even when modified to become lockless. There's no other change beyond the internal API.	2018-01-29 16:07:25 +01:00
Willy Tarreau	c20d737338	BUG/MINOR: threads: always set an owner to the thread_sync pipe The owner of the fd used by the synchronization pipe was set to NULL, making it ignored by maxfd computation. The risk would be that some synchronization events get delayed between threads when using poll() or select(). However this is only theoretical since the pipe is created before listeners are bound so normally its FD should be lower and this should normally not happen. The only possible situation would be if all listeners are bound to inherited FDs which are lower than the pipe's. This patch must be backported to 1.8.	2018-01-25 07:31:08 +01:00
Christopher Faulet	81991d3285	BUG/MAJOR: thread: Be sure to request a sync between threads only once at a time The first thread requesting a synchronization is responsible for writing to the "sync" pipe to notify all others. But we must write only once in the pipe between two synchronizations to have exactly one character in the pipe. It is important because we only read 1 character in return when the last thread exits from the sync-point. Here there is a bug. If two threads request a synchronization, only the first writes in the pipe. But, if the same thread requests a synchronization several times before entering the sync-point (because, for instance, it detects many servers down), it writes as many characters in the pipe. And only one of them will be read. Repeating this bug many times will block HAProxy on the write because the pipe is full. To fix the bug, we just check if the current thread has already requested a synchronization before trying to notify all others. The patch must be backported in 1.8	2017-12-02 14:31:01 +01:00
Christopher Faulet	e9a896e09e	BUG/MINOR: threads: tid_bit must be an unsigned long This is specific to threads, no backport is needed.	2017-11-14 18:11:28 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This removes any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Willy Tarreau	6dbd3e963b	BUG/MEDIUM: threads: don't try to free build option message on exit Commit `0493149` ("MINOR: thread: report multi-thread support in haproxy -vv") added information about thread support in haproxy -vv output but accidentally marked the message as "must_free" while it's a constant. This causes a segv on the old process on clean exit if threads are enabled. It doesn't affect the stability during operations however.	2017-11-05 11:51:48 +01:00
Willy Tarreau	0493149ac3	MINOR: thread: report multi-thread support in haproxy -vv Otherwise it's hard to know if it was enabled or not.	2017-11-03 23:39:25 +01:00
Christopher Faulet	209d02a257	WIP: SQUASH WITH SYNC POINT	2017-10-31 13:58:30 +01:00
Christopher Faulet	36716a7fec	MEDIUM: threads/fd: Initialize the process mask during the call to fd_insert Listeners will allow any threads to process the corresponding fd. But for other FDs, we limit the processing to the current thread.	2017-10-31 13:58:30 +01:00
Christopher Faulet	339fff8a18	MEDIUM: threads: Adds a set of functions to handle sync-point A sync-point is a protected area where you have the guarantee that no concurrent access is possible. It is implemented as a thread barrier to enter the sync-point and another one to exit from it. Inside the sync-point, all threads that must do some synchronous processing will be called one after the other while all other threads will wait. All threads will then exit from the sync-point at the same time. A sync-point will be evaluated only when necessary because it is a costly operation. To limit the waiting time of each thread, we must have a mechanism to wake up all threads. This is done with a pipe shared by all threads. By writing to this pipe, we will interrupt all threads blocked on a poller. The pipe is then flushed before exiting from the sync-point.	2017-10-31 13:58:29 +01:00
Christopher Faulet	1a2b56ea8e	MEDIUM: threads: Add hathreads header file This file contains all functions and macros used to deal with concurrency in HAProxy. It contains all high-level functions to do atomic operations (HA_ATOMIC_*). Note, for now, we rely on "__atomic" GCC builtins to do atomic operations. So HAProxy can be compiled with the thread support iff these builtins are available. It also contains wrappers around plocks to use spin or read/write locks. These wrappers are used to abstract the internal representation of the locking system and to add information to help debugging, when compiled with suitable options. To add extra info on locks, you need to add the DEBUG=-DDEBUG_THREAD or DEBUG=-DDEBUG_FULL compilation option. In addition to timing info on locks, we keep info on where a lock was acquired the last time (function name, file and line). There are also the thread id and a flag to know if it is still locked or not. This will be useful to debug deadlocks.	2017-10-31 13:58:23 +01:00
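
The truncation described in commit 148b16e1ce ("Fix the sync point for more than 32 threads") can be reproduced in isolation. The following is a minimal sketch, not HAProxy's code: the names need_sync_truncating() and need_sync_boolean() are hypothetical, and a fixed 64-bit mask type is assumed so that the effect does not depend on the platform's size of long.

```c
#include <stdint.h>

/* Hypothetical stand-in for the pre-fix behaviour: the masked value is
 * returned through an int, so bits 32..63 are lost in the conversion.
 * The conversion is implementation-defined in C; GCC and Clang keep the
 * low 32 bits, which is the case the commit message describes.
 */
static int need_sync_truncating(uint64_t want_sync, uint64_t tid_bit)
{
	return (int)(want_sync & tid_bit);
}

/* Hypothetical stand-in for the fix: normalise the result to 0 or 1
 * before it is narrowed to int.
 */
static int need_sync_boolean(uint64_t want_sync, uint64_t tid_bit)
{
	return !!(want_sync & tid_bit);
}
```

With a tid_bit above bit 31, the first variant reports "no sync requested" even though the bit is set, while the second one still returns 1.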

40 Commits
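
Commit 3e12304ae0 ("Handle nbthread == MAX_THREADS") mentions that the shift used to compute all_threads_mask fails when the thread count equals the width of the mask. The log does not show how the fix computes the mask, so the sketch below is only one possible way to avoid the full-width shift. The helper name thread_mask_for() is hypothetical, and LONGBITS is redefined locally as a macro for the sake of a self-contained example.

```c
#include <limits.h>

/* Number of bits in an unsigned long (local definition for this sketch). */
#define LONGBITS ((int)(sizeof(unsigned long) * CHAR_BIT))

/* Hypothetical helper: returns a mask with the low <nbthread> bits set.
 * It shifts an all-ones word to the right, so nbthread == LONGBITS results
 * in a shift by zero instead of the undefined full-width left shift that
 * (1UL << nbthread) - 1 would perform.
 * Values outside 1..LONGBITS yield 0; that is a choice made for this sketch.
 */
static unsigned long thread_mask_for(int nbthread)
{
	if (nbthread < 1 || nbthread > LONGBITS)
		return 0;
	return ~0UL >> (LONGBITS - nbthread);
}
```

The shift count stays within 0..LONGBITS-1 for every accepted input, which is what keeps the expression well defined.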