haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2025-03-06 11:28:00 +00:00

Author	SHA1	Message	Date
Willy Tarreau	57a374131c	MINOR: backend: add a new "path-only" option to "balance uri" Since we've fixed the way URIs are handled in 2.1, some users have started to experience inconsistencies in "balance uri" between requests received over H1 and the same ones received over H2. This is caused by the fact that H1 rarely uses absolute URIs while H2 always uses them. Similar issues were reported already around replace-uri etc, leading to "pathq" recently being introduced, so this isn't new. Here what this patch does is add a new option to "balance uri" to indicate that the hashing should only start at the path and not cover the authority. This makes H1 relative URIs and H2 absolute URI hashes equally again. Some extra options could be added to normalize URIs by always hashing the authority (or host) in front of them, which would make sure that both absolute and relative requests provide the same hash. This is left for later if needed.	2020-09-23 08:56:29 +02:00
Willy Tarreau	3d1119d225	MINOR: backend: make the "whole" option of balance uri take only one bit We'll want to add other boolean options on "balance uri", so let's make some room aside "whole" and make it take only one bit and not one int.	2020-09-23 08:05:47 +02:00
Amaury Denoyelle	36b536652f	BUG/MINOR: config: Fix memory leak on config parse listen This memory leak happens if there is two or more defaults section. When the default proxy is reinitialized, the structure member containing the config filename must be freed. Fix github issue #851. Should be backported as far as 1.6.	2020-09-18 16:17:09 +02:00
Eric Salama	1aab911017	BUG/MINOR: Fix memory leaks cfg_parse_peers When memory allocation fails in cfg_parse_peers or when an error occurs while parsing a stick-table, the temporary table and its id must be freed. This fixes github issue #854. It should be backported as far as 2.0.	2020-09-18 12:06:08 +02:00
Christopher Faulet	d2414a23c4	BUG/MINOR: http-fetch: Don't set the sample type during the htx prefetch A subtle bug was introduced by the commit `a6d9879e6` ("BUG/MEDIUM: htx: smp_prefetch_htx() must always validate the direction"), for the "method" sample fetch only. The sample data type and the method id are always overwritten because smp_prefetch_htx() function is called later in the sample fetch evaluation. The bug is in the smp_prefetch_htx() function but it is only visible for the "method" sample fetch, for an unknown method. In fact, when smp_prefetch_htx() is called, the sample object is altered. The data type is set to SMP_T_BOOL and, on success, the data value is set to 1. Thus, if the caller has already set some infos into the sample object, they may be lost. AFAIK, there is no reason to do so. It is inherited from the legacy HTTP code and I honestely don't known why it was done this way. So, instead of fixing the "method" sample fetch to set useful info after the call to smp_prefetch_htx() function, I prefer to not alter the sample object in smp_prefetch_htx(). This patch must be backported as far as 2.0. On the 2.0, only the HTX part must be fixed.	2020-09-18 11:06:24 +02:00
Willy Tarreau	3ca2365904	BUG/MEDIUM: h2: report frame bits only for handled types As part of his GREASE experiments on Chromium, Bence B�ky reported in https://lists.w3.org/Archives/Public/ietf-http-wg/2020JulSep/0202.html and https://bugs.chromium.org/p/chromium/issues/detail?id=1127060 that a certain combination of frame type and frame flags was causing an error on app.slack.com. It turns out that it's haproxy that is causing this issue because the frame type is wrongly assumed to support padding, the frame flags indicate padding is present, and the frame is too short for this, resulting in an error. The reason why only some frame types are affected is due to the frame type being used in a bit shift to match against a mask, and where the 5 lower bits of the frame type only are used to compute the frame bit. If the resulting frame bit matches a DATA, HEADERS or PUSH_PROMISE frame bit, then padding support is assumed and the test is enforced, resulting in a PROTOCOL_ERROR or FRAME_SIZE_ERROR depending on the payload size. We must never match any such bit for unsupported frame types so let's add a check for this. This must be backported as far as 1.8. Thanks to Cooper Bethea for providing enough context to help narrow the issue down and to Bence B�ky for creating a simple reproducer.	2020-09-18 08:05:03 +02:00
Willy Tarreau	bba7a4dafd	BUG/MINOR: h2/trace: do not display "stream error" after a frame ACK When sending a frame ACK, the parser state is not equal to H2_CS_FRAME_H and we used to report it as an error, which is not true. In fact we should only indicate when we skip remaining data. This may be backported as far as 2.1.	2020-09-18 07:41:28 +02:00
Willy Tarreau	8520d87198	MINOR: h2/trace: also display the remaining frame length in traces It's often missing when debugging, even though it's often zero for control frames or after data are consumed.	2020-09-18 07:39:29 +02:00
Willy Tarreau	f2cda10b1d	BUILD: sock_inet: include errno.h I was careful to have it for sock_unix.c but missed it for sock_inet which broke with commit `36722d227` ("MINOR: sock_inet: report the errno string in binding errors") depending on the build options. No backport is needed.	2020-09-17 14:02:01 +02:00
Willy Tarreau	3cd58bf805	MINOR: sock_unix: report the errno string in binding errors Just like with previous patch, let's report UNIX socket binding errors in plain text. we can now see for example: [ALERT] 260/083531 (13365) : Starting frontend f: cannot switch final and temporary UNIX sockets (Operation not permitted) [/tmp/root.sock] [ALERT] 260/083640 (13375) : Starting frontend f: cannot change UNIX socket ownership (Operation not permitted) [/tmp/root.sock]	2020-09-17 08:35:38 +02:00
Willy Tarreau	36722d2274	MINOR: sock_inet: report the errno string in binding errors With the socket binding code cleanup it becomes easy to add more info to error messages. One missing thing used to be the error string, which is now added after the generic one, for example: [ALERT] 260/082852 (12974) : Starting frontend f: cannot bind socket (Permission denied) [0.0.0.0:4] [ALERT] 260/083053 (13292) : Starting frontend f: cannot bind socket (Address already in use) [0.0.0.0:4444] [ALERT] 260/083104 (13298) : Starting frontend f: cannot bind socket (Cannot assign requested address) [1.1.1.1:4444]	2020-09-17 08:32:17 +02:00
Willy Tarreau	eb8cfe6723	BUILD: sock_unix: add missing errno.h It builds fine when openssl is enabled, but fails otherwise. No backport is needed.	2020-09-16 22:15:40 +02:00
Willy Tarreau	af9609b4d1	MINOR: tools: drop listener detection hack from str2sa_range() We used to resort to a trick to detect whether the caller was a listener or an outgoing socket in order never to present an AF_CUST_UDP* socket to a log server nor a nameserver. This is no longer necessary, the socket type alone will be enough.	2020-09-16 22:08:08 +02:00
Willy Tarreau	2b5e0d8b6a	MEDIUM: proto_udp: replace last AF_CUST_UDP* with AF_INET* We don't need to cheat with the sock_domain anymore, we now always have the SOCK_DGRAM sock_type as a complementary selector. This patch restores the sock_domain to AF_INET* in the udp* protocols and removes all traces of the now unused AF_CUST_*.	2020-09-16 22:08:08 +02:00
Willy Tarreau	b2ffc99bbd	MEDIUM: tools: make str2sa_range() use protocol_lookup() By doing so we can remove the hard-coded mapping from AF_INET to AF_CUST_UDP but we still need to keep the test on the listeners as long as these dummy families remain present in the code.	2020-09-16 22:08:08 +02:00
Willy Tarreau	910c64da96	MEDIUM: protocol: store the socket and control type in the protocol array The protocol array used to be only indexed by socket family, which is very problematic with UDP (requiring an extra family) and with the forthcoming QUIC (also requiring an extra family), especially since that binds them to certain families, prevents them from supporting dgram UNIX sockets etc. In order to address this, we now start to register the protocols with more info, namely the socket type and the control type (either stream or dgram). This is sufficient for the protocols we have to deal with, but could also be extended further if multiple protocol variants were needed. But as is, it still fits nicely in an array, which is convenient for lookups that are instant.	2020-09-16 22:08:08 +02:00
Willy Tarreau	a54553f74f	MINOR: protocol: add the control layer type in the protocol struct This one will be needed to more accurately select a protocol. It may differ from the socket type for QUIC, which uses dgram at the socket layer and provides stream at the control layer. The upper level requests a control layer only so we need this field.	2020-09-16 22:08:08 +02:00
Willy Tarreau	65ec4e3ff7	MEDIUM: tools: make str2sa_range() check that the protocol has ->connect() Most callers of str2sa_range() need the protocol only to check that it provides a ->connect() method. It used to be used to verify that it's a stream protocol, but it might be a bit early to get rid of it. Let's keep the test for now but move it to str2sa_range() when the new flag PA_O_CONNECT is present. This way almost all call places could be cleaned from this. There's a strange test in the server address parsing code that rechecks the family from the socket which seems to be a duplicate of the previously removed tests. It will have to be rechecked.	2020-09-16 22:08:08 +02:00
Willy Tarreau	5fc9328aa2	MINOR: tools: make str2sa_range() directly return the protocol We'll need this so that it can return pointers to stacked protocol in the future (for QUIC). In addition this removes a lot of tests for protocol validity in the callers. Some of them were checked further apart, or after a call to str2listener() and they were simplified as well. There's still a trick, we can fail to return a protocol in case the caller accepts an fqdn for use later. This is what servers do and in this case it is valid to return no protocol. A typical example is: server foo localhost:1111	2020-09-16 22:08:08 +02:00
Willy Tarreau	9b3178df23	MINOR: listener: pass the chosen protocol to create_listeners() The function will need to use more than just a family, let's pass it the selected protocol. The caller will then be able to do all the fancy stuff required to pick the best protocol.	2020-09-16 22:08:08 +02:00
Willy Tarreau	5e1779abbf	MEDIUM: config: make str2listener() not accept datagram sockets anymore str2listener() was temporarily hacked to support datagram sockets for the log-forward listeners. This has has an undesirable side effect that "bind udp@1.2.3.4:5555" was silently accepted as TCP for a bind line. We don't need this hack anymore since the only user (log-forward) now relies on str2receiver(). Now such an address will properly be rejected.	2020-09-16 22:08:08 +02:00
Willy Tarreau	26ff5dabc0	MINOR: log-forward: use str2receiver() to parse the dgram-bind address Thanks to this we don't need to specify "udp@" as it's implicitly a datagram type listener that is expected, so any AF_INET/AF_INET4 address will work.	2020-09-16 22:08:08 +02:00
Willy Tarreau	aa333123f2	MINOR: cfgparse: add str2receiver() to parse dgram receivers This is at least temporary, as the migration at once is way too difficuly. For now it still creates listeners but only allows DGRAM sockets. This aims at easing the split between listeners and receivers.	2020-09-16 22:08:08 +02:00
Willy Tarreau	62a976cd44	MINOR: tools: remove the central test for "udp" in str2sa_range() Now we only rely on dgram type associated with AF_INET/AF_INET6 to infer UDP4/UDP6. We still keep the hint based on PA_O_SOCKET_FD to detect that the caller is a listener though. It's still far from optimal but UDP remains rooted into the protocols and needs to be taken out first.	2020-09-16 22:08:08 +02:00
Willy Tarreau	3baec249b1	MEDIUM: tools: make str2sa_range() only report AF_CUST_UDP on listeners For now only listeners can make use of AF_CUST_UDP and it requires hacks in the DNS and logsrv code to remap it to AF_INET. Make str2sa_range() smarter by detecting that it's called for a listener and only set these protocol families for listeners. This way we can get rid of the hacks.	2020-09-16 22:08:08 +02:00
Willy Tarreau	e835bd8f91	MINOR: tools: start to distinguish stream and dgram in str2sa_range() The parser now supports a socket type for the control layer and a possible other one for the transport layer. Usually they are the same except for protocols like QUIC which will provide a stream transport layer based on a datagram control layer. The default types are preset based on the caller's expectations, and may be refined using "stream+" and "dgram+" prefixes. For now they were not added to the docuemntation because other changes will probably happen around UDP as well. It is conceivable that "tcpv4@" or "udpv6@" will appear later as aliases for "stream+ipv4" or "dgram+ipv6".	2020-09-16 22:08:08 +02:00
Willy Tarreau	a215be282d	MEDIUM: tools: make str2sa_range() check for the sockpair's FD usability Just like for inherited sockets, we want to make sure that FDs that are mentioned in "sockpair@" are actually usable. Right now this test is performed by the callers, but not everywhere. Typically, the following config will fail if fd #5 is not bound: frontend bind sockpair@5 But this one will pass if fd #6 is not bound: backend server s1 sockpair@6 Now both will return an error in such a case: - 'bind' : cannot use file descriptor '5' : Bad file descriptor. - 'server s1' : cannot use file descriptor '6' : Bad file descriptor. As such the test in str2listener() is not needed anymore (and it was wrong by the way, as it used to test for the socket by overwriting the local address with a new address that's made of the FD encoded on 16 bits and happens to still be at the same place, but that strictly depends on whatever the kernel wants to put there).	2020-09-16 22:08:08 +02:00
Willy Tarreau	804f11fdf8	MINOR: config: do not test an inherited socket again Since previous patch we know that a successfully bound fd@XXX socket is returned as its own protocol family from str2sa_range() and not as AF_CUST_EXISTING_FD anymore o we don't need to check for that case in str2listener().	2020-09-16 22:08:08 +02:00
Willy Tarreau	6edc722093	MEDIUM: tools: make str2sa_range() resolve pre-bound listeners When str2sa_range() is invoked for a bind or log line, and it gets a file descriptor number, it will immediately resolve the socket's address (when it's a socket) so that the address family, address and port are correctly set. This will later allow to resolve some transport protocols that are attached to existing FDs. For raw FDs (e.g. logs) and for socket pairs, the FD number is still returned in the address, because we need the underlying address management to complete the bind/listen/connect/whatever needed. One immediate benefit is that passing a bad FD will now result in one of these errors: 'bind' : cannot use file descriptor '3' : Socket operation on non-socket. 'bind' : socket on file descriptor '3' is of the wrong type. Note that as of now, we never return a listening socket with a family of AF_CUST_EXISTING_FD. The only case where this family is seen is for a raw FD (e.g. logs).	2020-09-16 22:08:08 +02:00
Willy Tarreau	895992619d	MINOR: log: detect LOG_TARGET_FD from the fd and not from the syntax Now that we have the FD value reported we don't need to cheat and detect "fd@" in the address, we can safely rely on the FD value.	2020-09-16 22:08:08 +02:00
Willy Tarreau	a93e5c7fae	MINOR: tools: make str2sa_range() optionally return the fd If a file descriptor was passed, we can optionally return it. This will be useful for listening sockets which are both a pre-bound FD and a ready socket.	2020-09-16 22:08:08 +02:00
Willy Tarreau	909c23b086	MINOR: listener: remove the inherited arg to create_listener() This argument can now safely be determined from fd != -1, let's just drop it.	2020-09-16 22:08:08 +02:00
Willy Tarreau	328199348b	MINOR: tools: add several PA_O_* flags in str2sa_range() callers These flags indicate whether the call is made to fill a bind or a server line, or even just send/recv calls (like logs or dns). Some special cases are made for outgoing FDs (e.g. pipes for logs) or socket FDs (e.g external listeners), and there's a distinction between stream or dgram usage that's expected to significantly help str2sa_range() proceed appropriately with the input information. For now they are not used yet.	2020-09-16 22:08:08 +02:00
Willy Tarreau	8b0fa8f0ab	MEDIUM: config: remove all checks for missing/invalid ports/ranges Now that str2sa_range() checks for appropriate port specification, we don't need to implement adhoc test cases in every call place, if the result is valid, the conditions are met otherwise the error message is appropriately filled.	2020-09-16 22:08:08 +02:00
Willy Tarreau	7f96a8474c	MEDIUM: tools: make str2sa_range() validate callers' port specifications Now str2sa_range() will enforce the caller's port specification passed using the PA_O_PORT_* flags, and will return an error on failure. For optional ports, values 0-65535 will be enforced. For mandatory ports, values 1-65535 are enforced. In case of ranges, it is also verified that the upper bound is not lower than the lower bound, as this used to result in empty listeners. I couldn't find an easy way to test this using VTC since the purpose is to trigger parse errors, so instead a test file is provided as tests/ports.cfg with comments about what errors are expected for each line.	2020-09-16 22:08:08 +02:00
Willy Tarreau	809587635e	MINOR: tools: add several PA_O_PORT_* flags in str2sa_range() callers These flags indicate what is expected regarding port specifications. Some callers accept none, some need fixed ports, some have it mandatory, some support ranges, and some take an offset. Each possibilty is reflected by an option. For now they are not exploited, but the goal is to instrument str2sa_range() to properly parse that.	2020-09-16 22:08:07 +02:00
Willy Tarreau	cd3a5591f6	MINOR: tools: make str2sa_range() take more options than just resolve We currently have an argument to require that the address is resolved but we'll soon add more, so let's turn it into a bit field. The old "resolve" boolean is now PA_O_RESOLVE.	2020-09-16 22:08:07 +02:00
Willy Tarreau	5a7beed67b	CLEANUP: tools: make str2sa_range() less awful for fd@ and sockpair@ The code is built to match prefixes at one place and to parse the address as a second step, except for fd@ and sockpair@ where the test first passes via AF_UNSPEC that is changed again. This is ugly and confusing, so let's proceed like for the other ones.	2020-09-16 22:08:07 +02:00
Willy Tarreau	a5b325f92c	MINOR: protocol: add a real family for existing FDs At some places (log fd@XXX, bind fd@XXX) we support using an explicit file descriptor number, that is placed into the sockaddr for later use. The problem is that till now it was done with an AF_UNSPEC family, which is also used for other situations like missing info or rings (for logs). Let's create an "official" family AF_CUST_EXISTING_FD for this case so that we are certain the FD can be found in the address when it is set.	2020-09-16 22:08:07 +02:00
Willy Tarreau	1e984b73f0	CLEANUP: protocol: remove family-specific fields from struct protocol This removes the following fields from struct protocol that are now retrieved from the protocol family instead: .sock_family, .sock_addrlen, .l3_addrlen, .addrcmp, .bind, .get_src, .get_dst. This also removes the UDP-specific udp{,6}_get_{src,dst}() functions which were referenced but not used yet. Their goal was only to remap the original AF_INET* addresses to AF_CUST_UDP*. Note that .sock_domain is still there as it's used as a selector for the protocol struct to be used.	2020-09-16 22:08:07 +02:00
Willy Tarreau	f1f660978c	MINOR: protocol: retrieve the family-specific fields from the family We now take care of retrieving sock_family, l3_addrlen, bind(), addrcmp(), get_src() and get_dst() from the protocol family and not just the protocol itself. There are very few places, this was only seldom used. Interestingly in sock_inet.c used to rely on ->sock_family instead of ->sock_domain, and sock_unix.c used to hard-code PF_UNIX instead of using ->sock_domain. Also it appears obvious we have something wrong it the protocol selection algorithm because sock_domain is the one set to the custom protocols while it ought to be sock_family instead, which would avoid having to hard-code some conversions for UDP namely.	2020-09-16 22:08:07 +02:00
Willy Tarreau	b0254cb361	MINOR: protocol: add a new proto_fam structure for protocol families We need to specially handle protocol families which regroup common functions used for a given address family. These functions include bind(), addrcmp(), get_src() and get_dst() for now. Some fields are also added about the address family, socket domain (protocol family passed to the socket() syscall), and address length. These protocol families are referenced from the protocols but not yet used.	2020-09-16 22:08:07 +02:00
Willy Tarreau	ad33acf838	MEDIUM: protocol: do not call proto->bind() anymore from bind_listener() All protocol's listeners now only take care of themselves and not of the receiver anymore since that's already being done in proto_bind_all(). Now it finally becomes obvious that UDP doesn't need a listener, as the only thing it does is to set the listener's state to LI_LISTEN!	2020-09-16 22:08:07 +02:00
Willy Tarreau	fc974887ce	MEDIUM: protocol: explicitly start the receiver before the listener Now protocol_bind_all() starts the receivers before their respective listeners so that ultimately we won't need the listeners for non- connected protocols. We still have to resort to an ugly trick to set the I/O handler in case of syslog over UDP because for now it's still not set in the receiver, so we hard-code it.	2020-09-16 22:08:07 +02:00
Willy Tarreau	9eda7a6d62	MEDIUM: proto_sockpair: make use of sockpair_bind_receiver() Now we rely on the address family's receiver instead of binding everything ourselves.	2020-09-16 22:08:07 +02:00
Willy Tarreau	62292b28a3	MEDIUM: sockpair: implement sockpair_bind_receiver() Note that for now we don't have a sockpair.c file to host that unusual family, so the new function was placed directly into proto_sockpair.c. It's no big deal given that this family is currently not shared with multiple protocols. The function does almost nothing but setting up the receiver. This is normal as the socket the FDs are passed onto are supposed to have been already created somewhere else, and the only usable identifier for such a socket pair is the receiving FD itself. The function was assigned to sockpair's ->bind() and is not used yet.	2020-09-16 22:08:07 +02:00
Willy Tarreau	cd5e5eaf50	MEDIUM: uxst: make use of sock_unix_bind_receiver() This removes all the AF_UNIX-specific code from uxst_bind_listener() and now simply relies on sock_unix_bind_listener() to do the same job. As mentionned in previous commit, the only difference is that now an unlikely failure on listen() will not result in a roll back of the temporary socket names since they will have been renamed during the bind() operation (as expected). But such failures do not correspond to any normal case and mostly denote operating system issues so there's no functionality loss here.	2020-09-16 22:08:07 +02:00
Willy Tarreau	1e0a860099	MEDIUM: sock_unix: implement sock_unix_bind_receiver() This function performs all the bind-related stuff for UNIX sockets that was previously done in uxst_bind_listener(). There is a very tiny difference however, which is that previously, in the unlikely event where listen() would fail, it was still possible to roll back the binding and rename the backup to the original socket. Now we have to rename it before calling returning, hence it will be done before calling listen(). However, this doesn't cover any particular use case since listen() has no reason to fail there (and the rollback is not done for inherited sockets), that was just done that way as a generic error processing path. The code is not used yet and is referenced in the uxst proto's ->bind().	2020-09-16 22:08:07 +02:00
Willy Tarreau	2f7687d0e8	MEDIUM: udp: make use of sock_inet_bind_receiver() This removes all the AF_INET-specific code from udp_bind_listener() and now simply relies on sock_inet_bind_listener() to do the same job. The function is now basically just a wrapper around sock_inet_bind_receiver().	2020-09-16 22:08:07 +02:00
Willy Tarreau	af9a7f5bb0	MEDIUM: tcp: make use of sock_inet_bind_receiver() This removes all the AF_INET-specific code from tcp_bind_listener() and now simply relies on sock_inet_bind_listener() to do the same job. The function was now roughly cut in half and its error path significantly simplified.	2020-09-16 22:08:07 +02:00

1 2 3 4 5 ...

12773 Commits