haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-02-20 04:37:04 +00:00

Author	SHA1	Message	Date
Willy Tarreau	5c0128d942	IMPORT: ebtree: make string_equal_bits() return an unsigned It used to return ssize_t for -1 but in fact we're using this -1 as the largest possible value and the result is generally cast to signed to check if the end was reached, so better make it clearly return an unsigned value here. This is cbtree commit e1e58a2b2ced2560d4544abaefde595273089704. This is ebtree commit d7531a7475f8ba8e592342ef1240df3330d0ab47.	2024-01-06 13:35:42 +01:00
Willy Tarreau	b7068b3152	IMPORT: ebtree: use unsigned ints for flznz() There's no reason to return signed values there. And it turns out that the compiler manages to improve the performance by ~2%. This is cbtree commit ab3fd53b8d6bbe15c196dfb4f47d552c3441d602. This is ebtree commit 0ebb1d7411d947de55fa5913d3ab17d089ea865c.	2024-01-06 13:35:42 +01:00
Willy Tarreau	2a14f99dbb	IMPORT: ebtree: make string_equal_bits turn back to unsigned char With flsnz() instead of flsnz_long() we're now getting a better performance on both x86 and ARM. The difference is that previously we were relying on a function that was forcing the use of register %eax for the 8-bit version and that was preventing the compiler from keeping the code optimized. The gain is roughly 5% on ARM and 1% on x86. This is cbtree commit 19cf39b2514bea79fed94d85e421e293be097a0e. This is ebtree commit a9aaf2d94e2c92fa37aa3152c2ad8220a9533ead.	2024-01-06 13:35:42 +01:00
Willy Tarreau	1c46a07460	IMPORT: ebtree: rework the fls macros to better deal with arch-specific ones The definitions were a bit of a mess and there wasn't even a fall back to __builtin_clz() on compilers supporting it. Now we instead define a macro for each implementation that is set on an arch-dependent case by case, and add the fall back ones only when not defined. This also allows the flsnz8() to automatically fall back to the 32-bit arch-specific version if available. This shows a consistent 33% speedup on arm for strings. This is cbtree commit c6075742e8d0a6924e7183d44bd93dec20ca8049. This is ebtree commit f452d0f83eca72f6c3484ccb138d341ed6fd27ed.	2024-01-06 13:35:42 +01:00
Willy Tarreau	fc421e5b3d	IMPORT: ebtree: switch the sizes and offsets to size_t and ssize_t Let's use these in order to avoid 32-64 bit casts on 64 bit platforms. This is cbtree commit e4f4c10fcb5719b626a1ed4f8e4e94d175468c34. This is ebtree commit cc10507385c784d9a9e74ea9595493317d3da99e.	2024-01-06 13:35:13 +01:00
Willy Tarreau	9afe3b59a7	IMPORT: ebtree: implement and use flsnz_long() to count bits The asm code shows multiple conversions. Gcc has always been terribly bad at dealing with chars, which are constantly converted to ints for every operation and zero-extended after each operation. But here in addition there are conversions before and after the flsnz(). Let's just mark the variables as long and use flsnz_long() to process them without any conversion. This shortens the code and makes it slightly faster. Note that the fls operations could make use of __builtin_clz() on gcc 4.6 and above, and it would be useful to implement native support for ARM as well. This is cbtree commit 1f0f83ba26f2279c8bba0080a2e09a803dddde47. This is ebtree commit 9c38dcae22a84f0b0d9c5a56facce1ca2ad0aaef.	2024-01-06 13:35:13 +01:00
Willy Tarreau	9b7a617a0e	REORG: ebtree: split structures into their own file ebtree-t.h ebtree is one piece using a lot of inlines and each tree root or node definition needed by many of our structures requires to parse and compile all these includes, which is large and painfully slow. Let's move the very basic definitions to their own file and include it from ebtree.h.	2021-10-07 01:41:14 +02:00
Ilya Shipitsin	01881087fc	CLEANUP: assorted typo fixes in the code and comments This is 25th iteration of typo fixes	2021-08-16 12:37:59 +02:00
Remi Tricot-Le Breton	2608e348be	BUG/MEDIUM: ebtree: Invalid read when looking for dup entry The first item inserted into an ebtree will be inserted directly below the root, which is a simple struct eb_root which only holds two branch pointers (left and right). If we try to find a duplicated entry to this first leaf through a ebmb_next_dup, our leaf_p pointer will point to the eb_root instead of a complete eb_node so we cannot look for the bit part of our leaf_p since it would try to cast our eb_root into an eb_node and perform an out of bounds access when reading "eb_root_to_node(eb_untag(t,EB_LEFT)))->bit". This bug was found by address sanitizer running on a CRL hot update VTC test. Note that the bug has been there since the import of the eb_next_dup() and eb_prev_dup() function in 1.5-dev19 by commit 2b5702030 ("MINOR: ebtree: add new eb_next_dup/eb_prev_dup() functions to visit duplicates"). It can be backported to all stable branches.	2021-05-18 19:26:21 +02:00
Willy Tarreau	853926a9ac	BUG/MEDIUM: ebtree: use a byte-per-byte memcmp() to compare memory blocks As reported in issue #689, there is a subtle bug in the ebtree code used to compared memory blocks. It stems from the platform-dependent memcmp() implementation. Original implementations used to perform a byte-per-byte comparison and to stop at the first non-matching byte, as in this old example: https://www.retro11.de/ouxr/211bsd/usr/src/lib/libc/compat-sys5/memcmp.c.html The ebtree code has been relying on this to detect the first non-matching byte when comparing keys. This is made so that a zero-terminated string can fail to match against a longer string. Over time, especially with large busses and SIMD instruction sets, multi-byte comparisons have appeared, making the processor fetch bytes past the first different byte, which could possibly be a trailing zero. This means that it's possible to read past the allocated area for a string if it was allocated by strdup(). This is not correct and definitely confuses address sanitizers. In real life the problem doesn't have visible consequences. Indeed, multi-byte comparisons are implemented so that aligned words are loaded (e.g. 512 bits at once to process a cache line at a time). So there is no way such a multi-byte access will cross a page boundary and end up reading from an unallocated zone. This is why it was never noticed before. This patch addresses this by implementing a one-byte-at-a-time memcmp() variant for ebtree, called eb_memcmp(). It's optimized for both small and long strings and guarantees to stop after the first non-matching byte. It only needs 5 instructions in the loop and was measured to be 3.2 times faster than the glibc's AVX2-optimized memcmp() on short strings (1 to 257 bytes), since that latter one comes with a significant setup cost. The break-even seems to be at 512 bytes where both version perform equally, which is way longer than what's used in general here. This fix should be backported to stable versions and reintegrated into the ebtree code.	2020-06-16 11:30:33 +02:00
Willy Tarreau	4c7e4b7738	REORG: include: update all files to use haproxy/api.h or api-t.h if needed All files that were including one of the following include files have been updated to only include haproxy/api.h or haproxy/api-t.h once instead: - common/config.h - common/compat.h - common/compiler.h - common/defaults.h - common/initcall.h - common/tools.h The choice is simple: if the file only requires type definitions, it includes api-t.h, otherwise it includes the full api.h. In addition, in these files, explicit includes for inttypes.h and limits.h were dropped since these are now covered by api.h and api-t.h. No other change was performed, given that this patch is large and affects 201 files. At least one (tools.h) was already freestanding and didn't get the new one added.	2020-06-11 10:18:42 +02:00
Willy Tarreau	8d2b777fe3	REORG: ebtree: move the include files from ebtree to include/import/ This is where other imported components are located. All files which used to directly include ebtree were touched to update their include path so that "import/" is now prefixed before the ebtree-related files. The ebtree.h file was slightly adjusted to read compiler.h from the common/ subdirectory (this is the only change). A build issue was encountered when eb32sctree.h is loaded before eb32tree.h because only the former checks for the latter before defining type u32. This was addressed by adding the reverse ifdef in eb32tree.h. No further cleanup was done yet in order to keep changes minimal.	2020-06-11 09:31:11 +02:00

12 Commits