mirror of
http://git.haproxy.org/git/haproxy.git/
synced 2025-01-02 02:02:03 +00:00
BUG/MAJOR: connection: always disable ready events once reported
This effectively reverts the two following commits:6f95f6e11
("OPTIM: connection: disable receiving on disabled events when the run queue is too high")065a02561
("MEDIUM: connection: don't stop receiving events in the FD handler") The problem as reported in issue #662 is that when the events signals the readiness of input data that has to be forwarded over a congested stream, the mux will read data and wake the stream up to forward them, but the buffer full condition makes this impossible immediately, then nobody in the chain will be able to disable the event after it was first reported. And given we don't know at the connection level whether an event was already reported or not, we can't decide anymore to forcefully stop it if for any reason its processing gets delayed. The problem is magnified in issue #662 by the fact that a shutdown is reported with pending data occupying the buffer. The shutdown will strike in loops and cause the upper layer stream to be notified until it's handled, but with a buffer full it's not possible to call cs_recv() hence to purge the event. All this can only be handled optimally by implementing a lower layer, direct mux-to-mux forwarding that will not require any scheduling. This was no wake up will be needed and the event will be instantly handled or paused for a long time. For now let's simply revert these optimizations. Running a 1 MB transfer test over H2 using 8 connections having each 32 streams with a limited link of 320 Mbps shows the following profile before this fix: calls syscall (100% CPU) ------ ------- 259878 epoll_wait 519759 clock_gettime 17277 sendto 17129 recvfrom 672 epoll_ctl And the following one after the fix: calls syscall (2-3% CPU) ------ ------- 17201 sendto 17357 recvfrom 2304 epoll_wait 4609 clock_gettime 1200 epoll_ctl Thus the behavior is much better. No backport is needed as these patches were only in 2.2-dev. Many thanks to William Dauchy for reporting a lot of details around this difficult issue.
This commit is contained in:
parent
b25970f896
commit
4cabfc18a3
@ -133,15 +133,7 @@ void conn_fd_handler(int fd)
|
||||
if (!conn->subs->events)
|
||||
conn->subs = NULL;
|
||||
}
|
||||
else if (tasks_run_queue_cur >= 16*global.tune.runqueue_depth) {
|
||||
/* In order to save syscalls especially with epoll, we
|
||||
* prefer *not* to disable receiving and instead let
|
||||
* the handler do its job. But if the run queue becomes
|
||||
* high, the excess of events may cause extra wakeups
|
||||
* and in this case we'd rather flow-control ourselves.
|
||||
*/
|
||||
fd_stop_recv(fd);
|
||||
}
|
||||
fd_stop_recv(fd);
|
||||
}
|
||||
|
||||
leave:
|
||||
|
Loading…
Reference in New Issue
Block a user