haproxy

mirror of http://git.haproxy.org/git/haproxy.git/ synced 2024-12-16 08:24:42 +00:00

Author	SHA1	Message	Date
Willy Tarreau	e5843b383d	BUG/MEDIUM: peers: recent applet changes broke peers updates scheduling Since appctx are scheduled out of streams, it's pointless to wake up the task managing the stream to push updates, they won't be seen. In fact unit tests work because silent sessions are restarted after 5s of idle and the exchange is correctly scheduled during startup! So we need to notify the appctx instead. For this we add a pointer to the appctx in the peer session. No backport is needed of course.	2015-04-27 18:42:17 +02:00
Willy Tarreau	eb406dc73c	MINOR: stream-int: add two flags to indicate an applet's wishes regarding I/O Currently we have a problem. There are some cases where a sleeping applet is not woken up (eg: show sess during an injection). The reason is that the applet is marked WAIT_DATA and is not woken up when WAIT_ROOM leaves, because we wait for both flags to be cleared in order to call it. And if we wait for either flag, then we have the opposite situation, which is that we're not waiting for room in the output buffer so we're spinning calling the applet to do nothing. What is missing is an indication of what the applet needs. Since it only manipulates the WAIT_ROOM/WAIT_DATA which are overwritten later, that cannot work. In the case of connections, the problem doesn't happen because the connection maintains these extra states. Ideally we'd need to have similar states for each appctx and to store those information there. But it would be overcomplicated given that an applet doesn't exist alone without a stream-int, so we can safely put these information into the stream int and make the code simpler. With this patch we introduce two new flags in the stream interface : - SI_FL_WANT_PUT : the applet wants to put something into the buffer - SI_FL_WANT_GET : the applet wants to get something from the buffer We also have the new functions si_applet_{stop\|want\|cant}_{get\|put} to make the code look similar to the connection code. For now these flags are not used yet.	2015-04-23 17:56:17 +02:00
Willy Tarreau	e5f8649102	MEDIUM: stream-int: add a new function si_applet_done() This is the equivalent of si_conn_wake() but for applets. It will be called after changes to the stream interface are brought by the applet I/O handler. Ultimately it will release buffers and may be even wake the stream's task up if some important changes are detected. It would be nice to be able to merge it with the connection's wake function since it mostly manipulates the stream interface, but there are minor differences (such as how to enable/disable polling on a fd vs applet) and some specificities to applets (eg: don't wake the applet up until the output is empty) which would require abstract functions which would slow down everything.	2015-04-23 17:56:16 +02:00
Willy Tarreau	3c595ac3ad	MEDIUM: applet: implement a run queue for active appctx The new function is called for each round of polling in order to call any active appctx. For now we pick the stream interface from the appctx's owner. At the moment there's no appctx queued yet, but we have everything needed to queue them and remove them.	2015-04-23 17:56:16 +02:00
Willy Tarreau	81f38d6f57	MEDIUM: applet: add basic support for an applet run queue This will be needed so that we can schedule applets out of the streams. For now nothing calls the queue yet.	2015-04-23 17:56:16 +02:00
Willy Tarreau	d45b9f8991	REORG: stream-int: create si_applet_ops dedicated to applets These functions are dedicated to applets so that we don't use the default ones anymore in this case.	2015-04-23 17:56:16 +02:00
Willy Tarreau	3057645b37	CLEANUP: applet: rename struct si_applet to applet Since this one does not depend on stream_interface anymore, remove the "si_" prefix.	2015-04-23 17:56:16 +02:00
Willy Tarreau	8a8d83b85c	REORG: applet: move the applet definitions out of stream_interface We're tidying the definitions so that appctx lives on its own. A new set of applet.h files has been added for this purpose.	2015-04-23 17:56:16 +02:00
Willy Tarreau	00a37f0029	MEDIUM: applet: make the applet not depend on a stream interface anymore Now that applet's functions only take an appctx in argument, not a stream interface. This slightly simplifies the code and will be needed to take the appctx out of the stream interface.	2015-04-23 17:56:16 +02:00
Willy Tarreau	19c8161b3d	MINOR: applet: add a new "owner" pointer in the appctx This pointer indicates what stream-interface the appctx belongs to, just like we have for the connections.	2015-04-23 17:56:16 +02:00
Willy Tarreau	7365dad40f	BUG/MEDIUM: stream-int: always reset si->ops when si->end is nullified It happened after changing the stream interface deinitialization sequence that we got random crashes with si_shutw() being called on NULL si->end. The reason was that si->ops was not reset after a call to si_release_endpoint() which is sometimes called directly. Thus we now move the resetting of si->ops just after any si->end assignment. It happens that si_detach() is now just the same as si_release_endpoint() and stream_int_unregister_handler(). Some cleanup will have to be performed there. It's not sure whether this problem can impact 1.5 since in 1.5 applets are part of the default embedded stream handler. The only way it could cause some trouble is if it's used with a connection, which doesn't seem possible at first glance.	2015-04-21 14:15:22 +02:00
Willy Tarreau	152b81e7b2	BUG/MAJOR: tcp/http: fix current_rule assignment when restarting over a ruleset Commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions") introduced the ability to interrupt and restart processing in the middle of a TCP/HTTP ruleset. But it doesn't do it in a consistent way : it checks current_rule_list, immediately dereferences current_rule, which is only set in certain cases and never cleared. So that broke the tcp-request content rules when the processing was interrupted due to missing data, because current_rule was not yet set (segfault) or could have been inherited from another ruleset if it was used in a backend (random behaviour). The proper way to do it is to always set current_rule before dereferencing it. But we don't want to set it for all rules because we don't want any action to provide a checkpointing mechanism. So current_rule is set to NULL before entering the loop, and only used if not NULL and if current_rule_list matches the current list. This way they both serve as a guard for the other one. This fix also makes the current rule point to the rule instead of its list element, as it's much easier to manipulate. No backport is needed, this is 1.6-specific.	2015-04-20 13:46:20 +02:00
CJ Ess	108b1dd69d	MEDIUM: http: configurable http result codes for http-request deny This patch adds support for error codes 429 and 405 to Haproxy and a "deny_status XXX" option to "http-request deny" where you can specify which code is returned with 403 being the default. We really want to do this the "haproxy way" and hope to have this patch included in the mainline. We'll be happy address any feedback on how this is implemented.	2015-04-11 10:34:54 +02:00
Willy Tarreau	73b65acd46	MINOR: stream: pass the pointer to the origin explicitly to stream_new() We don't pass sess->origin anymore but the pointer to the previous step. Now it should be much easier to chain elements together once applets are moved out of streams. Indeed, the session is only used for configuration and not for the dynamic chaining anymore.	2015-04-08 18:26:29 +02:00
Willy Tarreau	1b90511eeb	CLEANUP: namespaces: fix protection against multiple inclusions The include file did not protect correctly against multiple inclusions, as it didn't define the file name after checking for it. That's currently harmless as the file is only included from .c but that could change.	2015-04-08 17:31:40 +02:00
Thierry FOURNIER	3def393f8d	MINOR: lua: map system integration in Lua This patch cretes a new Map class that permits to do some lookup in HAProxy maps. This Map class is integration in the HAProxy update system, so we can modify the map throught the socket.	2015-04-07 15:56:21 +02:00
Willy Tarreau	d414f8e8b7	CLEANUP: stream-int: swap stream-int and appctx declarations This is just in order to remove two forward declarations of si_applet and stream_interface that are not needed once properly ordered.	2015-04-06 11:43:45 +02:00
Willy Tarreau	02d863866d	MEDIUM: stream: return the stream upon accept() The function was called stream_accept_session(), let's rename it stream_new() and make it return the newly allocated pointer. It's more convenient for some callers who need it.	2015-04-06 11:37:34 +02:00
Willy Tarreau	c38f71cfcd	MINOR: session: introduce session_new() This one creates a new session and does the minimum initialization.	2015-04-06 11:37:33 +02:00
Willy Tarreau	a7513f5d00	MINOR: stream-int: make appctx_new() take the applet in argument Doing so simplifies the initialization of a new appctx. We don't need appctx_set_applet() anymore.	2015-04-06 11:37:32 +02:00
Willy Tarreau	9903f0e1a2	REORG: session: move the session parts out of stream.c This concerns everythins related to accepting a new session and expiring the embryonic session. There's still a hard-coded call to stream_accept_session() which could be set somewhere in the frontend, but for now it's not a problem.	2015-04-06 11:37:32 +02:00
Willy Tarreau	32990b531b	MEDIUM: session: remove the task pointer from the session Now that the previous changes were made, we can add a struct task pointer to stream_complete() and get rid of it in struct session. The new relation between connection, session and task are like this : orig -- sess <-- context \| \| v \| conn -- owner ---> task Some session-specific parts should now move away from stream.	2015-04-06 11:37:32 +02:00
Willy Tarreau	02a0c0e407	MAJOR: stream: don't initialize the stream anymore in stream_accept The function now only initializes a session, calls the tcp req connection rules, and calls stream_complete() to finish initialization. If a handshake is needed, it is done without allocating the stream at all. Temporarily, in order to limit the amount of changes, the task allocated is put into sess->task, and it is used by the connection for the handshake or is offered to the stream. At this point we set the relation between sess/task/conn this way : orig -- sess <-- context \| ^ +- task -+ \| v \| v \| conn -- owner task The task must not remain in the session and ultimately it is planned to remove this task pointer from the session because it can be found by having conn->owner = task, and looping back from sess to conn, and to find the session from the connection via the task.	2015-04-06 11:37:32 +02:00
Willy Tarreau	e73ef85a63	MAJOR: tcp: make tcp_exec_req_rules() only rely on the session It passes a NULL wherever a stream was needed (acl_exec_cond() and action_ptr mainly). It can still track the connection rate correctly and block based on ACLs.	2015-04-06 11:37:31 +02:00
Willy Tarreau	bb2ef12a60	MEDIUM: session: update the session's stick counters upon session_free() Whenever session_free() is called, any possible stick counter stored in the session will be synchronized.	2015-04-06 11:37:31 +02:00
Willy Tarreau	8b7f8688ee	MEDIUM: streams: support looking up stkctr in the session In order to support sessions tracking counters, we first ensure that there is no overlap between streams' stkctr and sessions', and we allow an automatic lookup into the session's counters when the stream doesn't have a counter or when the stream doesn't exist during an access via a sample fetch. The functions used to update the stream counters only update them and not the session counters however.	2015-04-06 11:37:31 +02:00
Willy Tarreau	7698c9080a	REORG: stktable: move the stkctr_* functions from stream to sticktable These ones are not stream-specific at all and will be needed outside of stream, so let's move them to stick_tables where struct stkctr is defined.	2015-04-06 11:37:30 +02:00
Willy Tarreau	b2bf8331fb	MINOR: session: add stick counters to the struct session The stick counters in the session will be used for everything not related to contents, hence the connections / concurrent sessions / etc. They will be usable by "tcp-request connection" rules even without a stream. For now they're just allocated and initialized.	2015-04-06 11:37:30 +02:00
Willy Tarreau	11c3624c32	MINOR: session: implement session_free() and use it everywhere We want to call this one everywhere we have to kill a session so that future parts we move to the session can be released from there.	2015-04-06 11:37:30 +02:00
Willy Tarreau	7ea671b914	MINOR: session: store the session's accept date Doing so ensures we don't need to use the stream anymore to prepare the log information to report a failed handshake on an embryonic session. Thus, prepare_mini_sess_log_prefix() now takes a session in argument.	2015-04-06 11:37:30 +02:00
Willy Tarreau	1f52bb27ec	CLEANUP: stream: don't set ->target to the incoming connection anymore Now that we have sess->origin to carry that information along, we don't need to put that into strm->target anymore, so we remove one dependence on the stream in embryonic connections.	2015-04-06 11:37:29 +02:00
Willy Tarreau	d0d8da989b	MINOR: stream: provide a few helpers to retrieve frontend, listener and origin Expressions are quite long when using strm_sess(strm)->whatever, so let's provide a few helpers : strm_fe(), strm_li(), strm_orig().	2015-04-06 11:37:29 +02:00
Willy Tarreau	192252e2d8	MAJOR: sample: pass a pointer to the session to each sample fetch function Many such function need a session, and till now they used to dereference the stream. Once we remove the stream from the embryonic session, this will not be possible anymore. So as of now, sample fetch functions will be called with this : - sess = NULL, strm = NULL : never - sess = valid, strm = NULL : tcp-req connection - sess = valid, strm = valid, strm->txn = NULL : tcp-req content - sess = valid, strm = valid, strm->txn = valid : http-req / http-res	2015-04-06 11:37:25 +02:00
Willy Tarreau	987e3fb868	MEDIUM: http: remove the now useless http_txn from {req/res} rules The registerable http_req_rules / http_res_rules used to require a struct http_txn at the end. It's redundant with struct stream and propagates very deep into some parts (ie: it was the reason for lua requiring l7). Let's remove it now.	2015-04-06 11:35:53 +02:00
Willy Tarreau	f3bf3050a1	CLEANUP: lua: remove unused hlua_smp->l7 and hlua_txn->l7 Since last commit, we don't retrieve the HTTP transaction from there anymore, so these entries can go.	2015-04-06 11:35:53 +02:00
Willy Tarreau	15e91e1b36	MAJOR: sample: don't pass l7 anymore to sample fetch functions All of them can now retrieve the HTTP transaction if it exists from the stream and be sure to get NULL there when called with an embryonic session. The patch is a bit large because many locations were touched (all fetch functions had to have their prototype adjusted). The opportunity was taken to also uniformize the call names (the stream is now always "strm" instead of "l4") and to fix indent where it was broken. This way when we later introduce the session here there will be less confusion.	2015-04-06 11:35:53 +02:00
Willy Tarreau	eee5b51248	MAJOR: http: move http_txn out of struct stream Now this one is dynamically allocated. It means that 280 bytes of memory are saved per TCP stream, but more importantly that it will become possible to remove the l7 pointer from fetches and converters since it will be deduced from the stream and will support being null. A lot of care was taken because it's easy to forget a test somewhere, and the previous code used to always trust s->txn for being valid, but all places seem to have been visited. All HTTP fetch functions check the txn first so we shouldn't have any issue there even when called from TCP. When branching from a TCP frontend to an HTTP backend, the txn is properly allocated at the same time as the hdr_idx.	2015-04-06 11:35:52 +02:00
Willy Tarreau	cb7dd015be	MEDIUM: http: move header captures from http_txn to struct stream The header captures are now general purpose captures since tcp rules can use them to capture various contents. That removes a dependency on http_txn that appeared in some sample fetch functions and in the order by which captures and http_txn were allocated. Interestingly the reset of the header captures were done at too many places as http_init_txn() used to do it while it was done previously in every call place.	2015-04-06 11:35:52 +02:00
Willy Tarreau	40606ab976	MINOR: session: add a pointer to the session's origin A session's origin is the entity that was responsible for creating the session. It can be an applet or a connection for now.	2015-04-06 11:23:58 +02:00
Willy Tarreau	e36cbcb3b0	MEDIUM: stream: move the frontend's pointer to the session Just like for the listener, the frontend is session-wide so let's move it to the session. There are a lot of places which were changed but the changes are minimal in fact.	2015-04-06 11:23:58 +02:00
Willy Tarreau	fb0afa77c9	MEDIUM: stream: move the listener's pointer to the session The listener is session-specific, move it there.	2015-04-06 11:23:57 +02:00
Willy Tarreau	b1ec8c4a59	MINOR: session: start to reintroduce struct session There is now a pointer to the session in the stream, which is NULL for now. The session pool is created as well. Some parts will move from the stream to the session now.	2015-04-06 11:23:57 +02:00
Willy Tarreau	e7dff02dd4	REORG/MEDIUM: stream: rename stream flags from SN_* to SF_* This is in order to keep things consistent.	2015-04-06 11:23:57 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved overthere. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection \| applet There will be at most one http_txn per stream, and a same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
Willy Tarreau	418b8c0c41	MAJOR: compression: integrate support for libslz This library is designed to emit a zlib-compatible stream with no memory usage and to favor resource savings over compression ratio. While zlib requires 256 kB of RAM per compression context (and can only support 4000 connections per GB of RAM), the stateless compression offered by libslz does not need to retain buffers between subsequent calls. In theory this slightly reduces the compression ratio but in practice it does not have that much of an effect since the zlib window is limited to 32kB. Libslz is available at : http://git.1wt.eu/web?p=libslz.git It was designed for web compression and provides a lot of savings over zlib in haproxy. Here are the preliminary results on a single core of a core2-quad 3.0 GHz in 32-bit for only 300 concurrent sessions visiting the home page of www.haproxy.org (76 kB) with the default 16kB buffers : BW In BW Out BW Saved Ratio memory VSZ/RSS zlib 237 Mbps 92 Mbps 145 Mbps 2.58 84M / 69M slz 733 Mbps 380 Mbps 353 Mbps 1.93 5.9M / 4.2M So while the compression ratio is lower, the bandwidth savings are much more important due to the significantly lower compression cost which allows to consume even more data from the servers. In the example above, zlib became the bottleneck at 24% of the output bandwidth. Also the difference in memory usage is obvious. More tests run on a single core of a core i5-3320M, with 500 concurrent users and the default 16kB buffers : At 100% CPU (no limit) : BW In BW Out BW Saved Ratio memory VSZ/RSS hits/s zlib 480 Mbps 188 Mbps 292 Mbps 2.55 130M / 101M 744 slz 1700 Mbps 810 Mbps 890 Mbps 2.10 23.7M / 9.7M 2382 At 85% CPU (limited) : BW In BW Out BW Saved Ratio memory VSZ/RSS hits/s zlib 1240 Mbps 976 Mbps 264 Mbps 1.27 130M / 100M 1738 slz 1600 Mbps 976 Mbps 624 Mbps 1.64 23.7M / 9.7M 2210 The most important benefit really happens when the CPU usage is limited by "maxcompcpuusage" or the BW limited by "maxcomprate" : in order to preserve resources, haproxy throttles the compression ratio until usage is within limits. Since slz is much cheaper, the average compression ratio is much higher and the input bandwidth is quite higher for one Gbps output. Other tests made with some reference files : BW In BW Out BW Saved Ratio hits/s daniels.html zlib 1320 Mbps 163 Mbps 1157 Mbps 8.10 1925 slz 3600 Mbps 580 Mbps 3020 Mbps 6.20 5300 tv.com/listing zlib 980 Mbps 124 Mbps 856 Mbps 7.90 310 slz 3300 Mbps 553 Mbps 2747 Mbps 5.97 1100 jquery.min.js zlib 430 Mbps 180 Mbps 250 Mbps 2.39 547 slz 1470 Mbps 764 Mbps 706 Mbps 1.92 1815 bootstrap.min.css zlib 790 Mbps 165 Mbps 625 Mbps 4.79 777 slz 2450 Mbps 650 Mbps 1800 Mbps 3.77 2400 So on top of saving a lot of memory, slz is constantly 2.5-3.5 times faster than zlib and results in providing more savings for a fixed CPU usage. For links smaller than 100 Mbps, zlib still provides a better compression ratio, at the expense of a much higher CPU usage. Larger input files provide slightly higher bandwidth for both libs, at the expense of a bit more memory usage for zlib (it converges to 256kB per connection).	2015-03-29 03:32:06 +02:00
Willy Tarreau	7b21877888	CLEANUP: compression: remove unused reset functions It's unclear what purpose these functions used to server, however they are not used anywhere, one good reason to remove them.	2015-03-28 22:08:25 +01:00
Willy Tarreau	9787efa97c	MEDIUM: compression: split deflate_flush() into flush and finish This function used to take a zlib-specific flag as argument to indicate whether a buffer flush or end of contents was met, let's split it in two so that we don't depend on zlib anymore.	2015-03-28 19:17:31 +01:00
Willy Tarreau	615105e7e8	MEDIUM: compression: add a distinction between UA- and config- algorithms Thanks to MSIE/IIS, the "deflate" name is ambigous. According to the RFC it's a zlib-wrapped deflate stream, but IIS used to send only a raw deflate stream, which is the only format MSIE understands for "deflate". The other widely used browsers do support both formats. For this reason some people prefer to emit a raw deflate stream on "deflate" to serve more users even it that means violating the standards. Haproxy only follows the standard, so they cannot do this. This patch makes it possible to have one algorithm name in the configuration and another one in the protocol. This will make it possible to have a new configuration token to add a different algorithm so that users can decide if they want a raw deflate or the standard one.	2015-03-28 16:46:38 +01:00
Willy Tarreau	9f640a1eab	CLEANUP: compression: statify all algo-specific functions There's no reason for exporting identity_* nor deflate_*, they're only used in the same file. Mark them static, it will make it easier to add other algorithms.	2015-03-28 15:46:00 +01:00
Willy Tarreau	15530d28a4	MEDIUM: compression: don't send leading zeroes with chunk size Till now we used to rely on a fixed maximum chunk size. Thanks to last commit we're now free to adjust the chunk's length before sending the data, so we don't have to use 6 digits all the time anymore, and if one wants buffers larger than 16 MB it is now possible.	2015-03-28 12:05:47 +01:00

1 2 3 4 5 ...

1804 Commits