haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-14 02:57:01 +02:00

Author	SHA1	Message	Date
Jerome Magnin	b8bd6d7efd	BUILD: pattern: include errno.h Commit `3c79d4bdc` introduced the use of errno in pattern.c without including errno.h. If we build haproxy without any option errno is not defined and the build fails.	2020-01-17 18:30:06 +01:00
Willy Tarreau	3381bf89e3	MEDIUM: connection: get rid of CO_FL_CURR_* flags These ones used to serve as a set of switches between CO_FL_SOCK_* and CO_FL_XPRT_, and now that the SOCK layer is gone, they're always a copy of the last know CO_FL_XPRT_ ones that is resynchronized before I/O events by calling conn_refresh_polling_flags(), and that are pushed back to FDs when detecting changes with conn_xprt_polling_changes(). While these functions are not particularly heavy, what they do is totally redundant by now because the fd_want_/fd_stop_() actions already perform test-and-set operations to decide to create an entry or not, so they do the exact same thing that is done by conn_xprt_polling_changes(). As such it is pointless to call that one, and given that the only reason to keep CO_FL_CURR_* is to detect changes there, we can now remove them. Even if this does only save very few cycles, this removes a significant complexity that has been responsible for many bugs in the past, including the last one affecting FreeBSD. All tests look good, and no performance regressions were observed.	2020-01-17 17:45:12 +01:00
Willy Tarreau	93c9f59a9c	MINOR: stream-int: remove dependency on CO_FL_WAIT_ROOM for rcv_buf() The only case where this made sense was with mux_h1 but Since we introduced CS_FL_MAY_SPLICE, we don't need to rely on this anymore, thus we don't need to clear it either when we do not splice. There is a last check on this flag used to determine if the rx channel is full and that cannot go away unless it's changed to use the CS instead but for now this wouldn't add any benefit so better not do it yet.	2020-01-17 17:24:30 +01:00
Willy Tarreau	e2a0eeca77	MINOR: connection: move the CO_FL_WAIT_ROOM cleanup to the reader only CO_FL_WAIT_ROOM is set by the splicing function in raw_sock, and cleared by the stream-int when splicing is disabled, as well as in conn_refresh_polling_flags() so that a new call to ->rcv_pipe() could be attempted by the I/O callbacks called from conn_fd_handler(). This clearing in conn_refresh_polling_flags() makes no sense anymore and is in no way related to the polling at all. Since we don't call them from there anymore it's better to clear it before attempting to receive, and to set it again later. So let's move this operation where it should be, in raw_sock_to_pipe() so that it's now symmetric. It was also placed in raw_sock_to_buf() so that we're certain that it gets cleared if an attempt to splice is replaced with a subsequent attempt to recv(). And these were currently already achieved by the call to conn_refresh_polling_flags(). Now it could theorically be removed from the stream-int.	2020-01-17 17:19:27 +01:00
Jerome Magnin	3c79d4bdc4	BUG/MINOR: pattern: handle errors from fgets when trying to load patterns We need to do some error handling after we call fgets to make sure everything went fine. If we don't users can be fooled into thinking they can load pattens from directory because cfgparse doesn't flinch. This applies to acl patterns map files. This should be backported to all supported versions.	2020-01-17 17:09:50 +01:00
Willy Tarreau	17ccd1a356	BUG/MEDIUM: connection: add a mux flag to indicate splice usability Commit `c640ef1a7d` ("BUG/MINOR: stream-int: avoid calling rcv_buf() when splicing is still possible") fixed splicing in TCP and legacy mode but broke it badly in HTX mode. What happens in HTX mode is that the channel's to_forward value remains set to CHN_INFINITE_FORWARD during the whole transfer, and as such it is not a reliable signal anymore to indicate whether more data are expected or not. Thus, when data are spliced out of the mux using rcv_pipe(), even when the end is reached (that only the mux knows about), the call to rcv_buf() to get the final HTX blocks completing the message were skipped and there was often no new event to wake this up, resulting in transfer timeouts at the end of large objects. All this goes down to the fact that the channel has no more information about whether it can splice or not despite being the one having to take the decision to call rcv_pipe() or not. And we cannot afford to call rcv_buf() inconditionally because, as the commit above showed, this reduces the forwarding performance by 2 to 3 in TCP and legacy modes due to data lying in the buffer preventing splicing from being used later. The approach taken by this patch consists in offering the muxes the ability to report a bit more information to the upper layers via the conn_stream. This information could simply be to indicate that more data are awaited but the real need being to distinguish splicing and receiving, here instead we clearly report the mux's willingness to be called for splicing or not. Hence the flag's name, CS_FL_MAY_SPLICE. The mux sets this flag when it knows that its buffer is empty and that data waiting past what is currently known may be spliced, and clears it when it knows there's no more data or that the caller must fall back to rcv_buf() instead. The stream-int code now uses this to determine if splicing may be used or not instead of looking at the rcv_pipe() callbacks through the whole chain. And after the rcv_pipe() call, it checks the flag again to decide whether it may safely skip rcv_buf() or not. All this bitfield dance remains a bit complex and it starts to appear obvious that splicing vs reading should be a decision of the mux based on permission granted by the data layer. This would however increase the API's complexity but definitely need to be thought about, and should even significantly simplify the data processing layer. The way it was integrated in mux-h1 will also result in no more calls to rcv_pipe() on chunked encoded data, since these ones are currently disabled at the mux level. However once the issue with chunks+splice is fixed, it will be important to explicitly check for curr_len\|CHNK to set MAY_SPLICE, so that we don't call rcv_buf() after each chunk. This fix must be backported to 2.1 and 2.0.	2020-01-17 17:00:12 +01:00
Jerome Magnin	bee00ad080	BUG/MINOR: stream: don't mistake match rules for store-request rules In process_sticking_rules() we only want to apply the first store-request rule for a given table, but when doing so we need to make sure we only count actual store-request rules when we list the sticking rules. Failure to do so leads to not being able to write store-request and match sticking rules in any order as a match rule after a store-request rule will be ignored. The following configuration reproduces the issue: global stats socket /tmp/foobar defaults mode http frontend in bind :8080 default_backend bar backend bar server s1 127.0.0.1:21212 server s2 127.0.0.1:21211 stick store-request req.hdr(foo) stick match req.hdr(foo) stick-table type string size 10 listen foo bind :21212 bind *:21211 http-request deny deny_status 200 if { dst_port 21212 } http-request deny This patch fixes issue #448 and should be backported as far as 1.6.	2020-01-16 18:07:52 +01:00
William Lallemand	d308c5e0ce	CLEANUP: cli: deduplicate the code in _getsocks Since the fix `5fd3b28` ("BUG/MEDIUM: cli: _getsocks must send the peers sockets") for bug #443. The code which sends the socket for the peers and the proxies is duplicated. This patch move this code in a separated function.	2020-01-16 16:26:41 +01:00
William Lallemand	5fd3b28c9c	BUG/MEDIUM: cli: _getsocks must send the peers sockets This bug prevents to reload HAProxy when you have both the seamless reload (-x / expose-fd listeners) and the peers. Indeed the _getsocks command does not send the FDs of the peers listeners, so if no reuseport is possible during the bind, the new process will fail to bind and exits. With this feature, it is not possible to fallback on the SIGTTOU method if we didn't receive all the sockets, because you can't close() the sockets of the new process without closing those of the previous process, they are the same. Should fix bug #443. Must be backported as far as 1.8.	2020-01-16 16:25:22 +01:00
Willy Tarreau	ec9ac54982	REGTEST: add sample_fetches/hashes.vtc to validate hashes This regtest validates all hashes that we support, on all input bytes from 0x00 to 0xFF. Those supporting avalanche are tested as well. It also tests len(), hex() and base64(). It purposely does not enable sha2() because this one relies on OpenSSL and there's no point in validating that OpenSSL knows how to hash, what matters is that we can test our hashing functions in all cases. However since the tests were written, they're still present and commented out in case that helps. It may be backported to supported versions, possibly dropping a few algos that were not supported (e.g. crc32c requires 1.9 minimum). Note that this test will fail on crc32/djb2/sdbm/wt6 unless patches "BUG/MINOR: stream: init variables when the list is empty" and "BUG/MAJOR: hashes: fix the signedness of the hash inputs" are included.	2020-01-16 08:45:27 +01:00
Willy Tarreau	340b07e868	BUG/MAJOR: hashes: fix the signedness of the hash inputs Wietse Venema reported in the thread below that we have a signedness issue with our hashes implementations: due to the use of const char* for the input key that's often text, the crc32, sdbm, djb2, and wt6 algorithms return a platform-dependent value for binary input keys containing bytes with bit 7 set. This means that an ARM or PPC platform will hash binary inputs differently from an x86 typically. Worse, some algorithms are well defined in the industry (like CRC32) and do not provide the expected result on x86, possibly causing interoperability issues (e.g. a user-agent would fail to compare the CRC32 of a message body against the one computed by haproxy). Fortunately, and contrary to the first impression, the CRC32c variant used in the PROXY protocol processing is not affected. Thus the impact remains very limited (the vast majority of input keys are text-based, such as user-agent headers for exmaple). This patch addresses the issue by fixing all hash functions' prototypes (even those not affected, for API consistency). A reg test will follow in another patch. The vast majority of users do not use these hashes. And among those using them, very few will pass them on binary inputs. However, for the rare ones doing it, this fix MAY have an impact during the upgrade. For example if the package is upgraded on one LB then on another one, and the CRC32 of a binary input is used as a stick table key (why?) then these CRCs will not match between both nodes. Similarly, if "hash-type ... crc32" is used, LB inconsistency may appear during the transition. For this reason it is preferable to apply the patch on all nodes using such hashes at the same time. Systems upgraded via their distros will likely observe the least impact since they're expected to be upgraded within a short time frame. And it is important for distros NOT to skip this fix, in order to avoid distributing an incompatible implementation of a hash. This is the reason why this patch is tagged as MAJOR, eventhough it's extremely unlikely that anyone will ever notice a change at all. This patch must be backported to all supported branches since the hashes were introduced in 1.5-dev20 (commit `98634f0c`). Some parts may be dropped since implemented later. Link to Wietse's report: https://marc.info/?l=postfix-users&m=157879464518535&w=2	2020-01-16 08:23:42 +01:00
William Dauchy	bb9da0b8e2	CLEANUP: proxy: simplify proxy_parse_rate_limit proxy checks rate-limits are valid for both frontend and listen, but not backend; so we can simplify this check in a similar manner as it is done in e.g max-keep-alive-queue. this should fix github issue #449 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-16 07:04:05 +01:00
Olivier Houchard	ac8147446c	BUG/MEDIUM: raw_sock: Make sur the fd and conn are sync. Commit `08fa16e397` made sure we let the fd layer we didn't want to poll anymore if we failed to send and sendto() returne EAGAIN. However, just disabling the polling with fd_stop_send() while not notifying the connection layer means the connection layer may believe the polling is activated and nothing needs to be done when it is wrong. A better fix is to revamp that whole code, for the time being, just make sure the fd and connection layer are properly synchronised. This should fix the problem recently reported on FreeBSD.	2020-01-15 19:16:23 +01:00
Olivier Houchard	68787ef70a	BUG/MEDIUM: mux_h1: Don't call h1_send if we subscribed(). In h1_snd_buf(), only attempt to call h1_send() if we haven't already subscribed. It makes no sense to do it if we subscribed, as we know we failed to send before, and will create a useless call to sendto(), and in 2.2, the call to raw_sock_from_buf() will disable polling if it is enabled. This should be backported to 2.2, 2.1, 2.0 and 1.9.	2020-01-15 19:13:32 +01:00
William Dauchy	8602394040	CLEANUP: compression: remove unused deinit_comp_ctx section Since commit `27d93c3f94` ("BUG/MAJOR: compression/cache: Make it really works with these both filters"), we no longer use section deinit_comp_ctx. This should fix github issue #441 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-15 10:58:17 +01:00
William Dauchy	238ea3bd75	DOC: clarify crt-base usage crt-base is also used after "crt" directive. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-15 10:55:43 +01:00
William Lallemand	24c928c8bd	BUG/MEDIUM: mworker: remain in mworker mode during reload If you reload an haproxy started in master-worker mode with "master-worker" in the configuration, and no "-W" argument, the new process lost the fact that is was in master-worker mode resulting in weird behaviors. The bigest problem is that if it is reloaded with an bad configuration, the master will exits instead of remaining in waitpid mode. This problem was discovered in bug #443. Should be backported in every version using the master-worker mode. (as far as 1.8)	2020-01-14 18:10:29 +01:00
William Lallemand	25b5693021	REGTEST: mcli/mcli_start_progs: start 2 programs This regtest tests the issue #446 by starting 2 programs and checking if they exist in the "show proc" of the master CLI. Should be backported as far as 2.0.	2020-01-14 15:42:38 +01:00
William Lallemand	a31b09e982	BUG/MINOR: cli/mworker: can't start haproxy with 2 programs When trying to start HAProxy with the master CLI and more than one program in the configuration, it refuses to start with: [ALERT] 013/132926 (1378) : parsing [cur--1:0] : proxy 'MASTER', another server named 'cur--1' was already defined at line 0, please use distinct names. [ALERT] 013/132926 (1378) : Fatal errors found in configuration. The problem is that haproxy tries to create a server for the MASTER proxy but only the worker are supposed to be in the server list. Fix issue #446. Must be backported as far as 2.0.	2020-01-14 15:42:38 +01:00
Willy Tarreau	c7ce4e3e7f	BUG/MEDIUM: mux-h2: don't stop sending when crossing a buffer boundary In version 2.0, after commit `9c218e7521` ("MAJOR: mux-h2: switch to next mux buffer on buffer full condition."), the H2 mux started to use a ring buffer for the output data in order to reduce competition between streams. However, one corner case was suboptimally covered: when crossing a buffer boundary, we have to shrink the outgoing frame size to the one left in the output buffer, but this shorter size is later used as a signal of incomplete send due to a buffer full condition (which used to be true when using a single buffer). As a result, function h2s_frt_make_resp_data() used to return less than requested, which in turn would cause h2_snd_buf() to stop sending and leave some unsent data in the buffer, and si_cs_send() to subscribe for sending more later. But it goes a bit further than this, because subscribing to send again causes the mux's send_list not to be empty anymore, hence extra streams can be denied the access to the mux till the first stream is woken again. This causes a nasty wakeup-sleep dance between streams that makes it totally impractical to try to remove the sending list. A test showed that it was possible to observe 3 million h2_snd_buf() giveups for only 100k requests when using 100 concurrent streams on 20kB objects. It doesn't seem likely that a stream could get blocked and time out due to this bug, though it's not possible either to demonstrate the opposite. One risk is that incompletely sent streams do not have any blocking flags so they may not be identified as blocked. However on first scan of the send_list they meet all conditions for a wakeup. This patch simply allows to continue on a new frame after a partial frame. with only this change, the number of failed h2_snd_buf() was divided by 800 (4% of calls). And by slightly increasing the H2C_MBUF_CNT size, it can go down to zero. This fix must be backported to 2.1 and 2.0.	2020-01-14 13:55:04 +01:00
Willy Tarreau	f31af9367e	MEDIUM: lua: don't call the GC as often when dealing with outgoing connections In order to properly close connections established from Lua in case a Lua context dies, the context currently automatically gets a flag HLUA_MUST_GC set whenever an outgoing connection is used. This causes the GC to be enforced on the context's death as well as on yield. First, it does not appear necessary to do it when yielding, since if the connections die they are already cleaned up. Second, the problem with the flag is that even if a connection gets properly closed, the flag is not removed and the GC continues to be called on the Lua context. The impact on performance looks quite significant, as noticed and diagnosed by Sadasiva Gujjarlapudi in the following thread: https://www.mail-archive.com/haproxy@formilux.org/msg35810.html This patch changes the flag for a counter so that each created connection increments it and each cleanly closed connection decrements it. That way we know we have to call the GC on the context's death only if the count is non-null. As reported in the thread above, the Lua performance gain is now over 20% by doing this. Thanks to Sada and Thierry for the design discussion and tests that led to this solution.	2020-01-14 10:12:31 +01:00
William Dauchy	9a8ef7f51d	CLEANUP: ssl: remove opendir call in ssl_sock_load_cert Since commit `3180f7b554` ("MINOR: ssl: load certificates in alphabetical order"), `readdir` was replaced by `scandir`. We can indeed replace it with a check on the previous `stat` call. This micro cleanup can be a good benefit when you have hundreds of bind lines which open TLS certificates directories in terms of syscall, especially in a case of frequent reloads. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-13 19:51:52 +01:00
Willy Tarreau	70c5b0e5fd	BUG/MEDIUM: mux-h2: fix missing test on sending_list in previous patch Previous commit `989539b048` ("BUG/MINOR: mux-h2: use a safe list_for_each_entry in h2_send()") accidently lost its sending_list test, resulting in some elements to be woken up again while already in the sending_list and h2_unsubscribe() crashing on integrity tests (only when built with DEBUG_DEV). If the fix above is backported this one must be as well.	2020-01-10 18:20:15 +01:00
Willy Tarreau	989539b048	BUG/MINOR: mux-h2: use a safe list_for_each_entry in h2_send() h2_send() uses list_for_each_entry() to scan paused streams and resume them, but happily deletes any leftover from a previous failed unsubscribe, which is obviously not safe and would corrupt the list. In practice this is a proof that this doesn't happen, but it's not the best way to prove it. In order to fix this and reduce the maintenance burden caused by code duplication (this list walk exists at 3 places), let's introduce a new function h2_resume_each_sending_h2s() doing exactly this and use it at all 3 places. This bug was introduced as a side effect of fix `998410a41b` ("BUG/MEDIUM: h2: Revamp the way send subscriptions works.") so it should be backported as far as 1.9.	2020-01-10 17:18:32 +01:00
Olivier Houchard	3c4f40acbf	BUG/MEDIUM: tasks: Use the MT macros in tasklet_free(). In tasklet_free(), to attempt to remove ourself, use MT_LIST_DEL, we can't just use LIST_DEL(), as we theorically could be in the shared tasklet list. This should be backported to 2.1.	2020-01-10 16:56:59 +01:00
Christopher Faulet	48726b78e5	BUG/MINOR: stream-int: Don't trigger L7 retry if max retries is already reached When an HTTP response is received, at the stream-interface level, if a L7 retry must be triggered because of the status code, the response is trashed and a read error is reported on the response channel. Then the stream handles this error and perform the retry. Except if the maximum connection retries is reached. In this case, an error is reported. Because the server response was already trashed by the stream-interface, a generic 502 error is returned to the client instead of the server's one. Now, the stream-interface triggers a L7 retry only if the maximum connection retries is not already reached. Thus, at the end, the last server's response is returned. This patch must be backported to 2.1 and 2.0. It should fix the issue #439.	2020-01-09 15:39:06 +01:00
Ilya Shipitsin	3e34ac7d09	REGTEST: set_ssl_cert.vtc: replace "echo" with "printf" "echo -e" for some reason does not work on travis-ci, so let us switch to "printf" Fixes: #423	2020-01-09 05:56:04 +01:00
William Dauchy	7675c720f8	CLEANUP: server: remove unused err section in server_finalize_init Since commit `980855bd95` ("BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end"), we no longer use err section. This should fix github issue #438 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-09 05:54:48 +01:00
Willy Tarreau	eeea8082a8	BUG/MAJOR: listener: do not schedule a task-less proxy Apparently seamingless commit `0591bf7deb` ("MINOR: listener: make the wait paths cleaner and more reliable") caused a nasty regression and revealed a rare race that hits regtest stickiness/lb-services.vtc about 4% of the times for 8 threads. The problem is that when a multi-threaded listener wakes up on an incoming connection, several threads can receive the event, especially when idle. And all of them will race to accept the connections in parallel, adjusting the listener's nbconn and proxy's feconn until one reaches the proxy's limit and declines. At this step the changes are cancelled, the listener is marked "limited", and when the threads exit the function, one of them will unlimit the listener/proxy again so that it can accept incoming connections again. The problem happens when many threads connect to a small peers section because its maxconn is very limited (typically 6 for 2 peers), and it's sometimes possible for enough competing threads to hit the limit and one of them will limit the listener and queue the proxy's task... except that peers do not initialize their proxy task since they do not use rate limiting. Thus the process crashes when doing task_schedule(p->task). Prior to the cleanup patch above, this didn't happen because the error path that was dedicated to only limiting the listener did not call task_schedule(p->task). Given that the proxy's task is optional, and that the expire value passed there is always TICK_ETERNITY, it's sufficient and reasonable to avoid calling this task_schedule() when expire is not set. And for long term safety we can also avoid to do it when the task is not set. A first fix consisted in allocating a task for the peers proxies but it's never used and would eat resources for reason. No backport is needed as this commit was only merged into 2.2.	2020-01-08 19:39:09 +01:00
Ilya Shipitsin	d9514b6e79	BUILD: cirrus-ci: choose proper openssl package name freebsd-11.3 and 12.1 comes with different openssl naming let us add proper switch to cirrus-ci script	2020-01-08 16:26:11 +01:00
William Dauchy	cd7fa3dcfc	CLEANUP: mux-h2: remove unused goto "out_free_h2s" Since commit `fa8aa867b9` ("MEDIUM: connections: Change struct wait_list to wait_event.") we no longer use this section. this should fix github issue #437 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-08 16:16:19 +01:00
Florian Tham	9205fea13a	MINOR: http: Add 404 to http-request deny This patch adds http status code 404 Not Found to http-request deny. See issue #80.	2020-01-08 16:15:23 +01:00
Florian Tham	272e29b5cc	MINOR: http: Add 410 to http-request deny This patch adds http status code 410 Gone to http-request deny. See issue #80.	2020-01-08 16:15:23 +01:00
Willy Tarreau	08fa16e397	MINOR: raw_sock: make sure to disable polling once everything is sent Analysing traces revealed a rare but surprizing pattern : connect() = -1 EAGAIN send() = success epoll_ctl(ADD, EPOLLOUT) epoll_wait() recvfrom() = success close() What happens is that the failed connect() creates an FD update for pollout, but the successful synchronous send() doesn't disable it because polling was only disabled in the FD handler. But a successful synchronous connect() cancellation is a good opportunity to disable polling before it's effectively enabled in the next loop, so better disable it when reaching the end. The cost is very low if it was already disabled anyway (one atomic op). This only affects local connections but with this the typical number of epoll_ctl() calls per connection dropped from ~4.2 to ~3.8 for plain TCP and 10k transfers.	2020-01-08 09:59:40 +01:00
Willy Tarreau	0eae6323bf	MEDIUM: dns: implement synchronous send In dns_send_query(), there's no point in first waking up the FD, to get called back by the poller to send the request and sleep. Instead let's simply send the request as soon as it's known and only subscribe to the poller when the socket buffers are full and it's required to poll (i.e. almost never). This significantly reduces the number of calls to the poller. A large config sees the number of epoll_ctl() calls reduced from 577 to 7 over 10 seconds, the number of recvfrom() from 1533 to 582 and the number of sendto() from 369 to 162. It also has the extra benefit of building each requests only once per resolution and sending it to multiple resolvers instead of rebuilding it for each and every resolver. This will reduce the risk of seeing situations similar to bug #416 in the future.	2020-01-08 06:10:38 +01:00
Willy Tarreau	e5891ca6c1	BUG/MEDIUM: session: do not report a failure when rejecting a session In session_accept_fd() we can perform a synchronous call to conn_complete_session() and if it succeeds the connection is accepted and turned into a session. If it fails we take it as an error while it is not, in this case, it's just that a tcp-request rule has decided to reject the incoming connection. The problem with reporting such an event as an error is that the failed status is passed down to the listener code which decides to disable accept() for 100ms in order to leave some time for transient issues to vanish, and that's not what we want to do here. This fix must be backported as far as 1.7. In 1.7 the code is a bit different as tcp_exec_l5_rules() is called directly from within session_new_fd() and ret=0 must be assigned there.	2020-01-07 18:15:32 +01:00
Christopher Faulet	584348be63	BUG/MINOR: channel: inject output data at the end of output In co_inject(), data must be inserted at the end of output, not the end of input. For the record, this function does not take care of input data which are supposed to not exist. But the caller may reset input data after or before the call. It is its own choice. This bug, among other effects, is visible when a redirect is performed on the response path, on legacy HTTP mode (so for HAProxy < 2.1). The redirect response is appended after the server response when it should overwrite it. Thanks to Kevin Zhu <ip0tcp@gmail.com> to report the bug. It must be backported as far as 1.9.	2020-01-07 10:51:15 +01:00
Kevin Zhu	96b363963f	BUG/MEDIUM: http-ana: Truncate the response when a redirect rule is applied When a redirect rule is executed on the response path, we must truncate the received response. Otherwise, the redirect is appended after the response, which is sent to the client. So it is obviously a bug because the redirect is not performed. With bodyless responses, it is the "only" bug. But if the response has a body, the result may be invalid. If the payload is not fully received yet when the redirect is performed, an internal error is reported. It must be backported as far as 1.9.	2020-01-07 10:50:28 +01:00
Christopher Faulet	47a7210b9d	BUG/MINOR: proxy: Fix input data copy when an error is captured In proxy_capture_error(), input data are copied in the error snapshot. The copy must take care of the data wrapping. But the length of the first block is wrong. It should be the amount of contiguous input data that can be copied starting from the input's beginning. But the mininum between the input length and the buffer size minus the input length is used instead. So it is a problem if input data are wrapping or if more than the half of the buffer is used by input data. This patch must be backported as far as 1.9.	2020-01-06 13:58:30 +01:00
Christopher Faulet	1703478e2d	BUG/MINOR: h1: Report the right error position when a header value is invalid During H1 messages parsing, when the parser has finished to parse a full header line, some tests are performed on its value, depending on its name, to be sure it is valid. The content-length is checked and converted in integer and the host header is also checked. If an error occurred during this step, the error position must point on the header value. But from the parser point of view, we are already on the start of the next header. Thus the effective reported position in the error capture is the beginning of the unparsed header line. It is a bit confusing when we try to figure out why a message is rejected. Now, the parser state is updated to point on the invalid value. This way, the error position really points on the right position. This patch must be backported as far as 1.9.	2020-01-06 13:58:21 +01:00
Olivier Houchard	7f4f7f140f	MINOR: ssl: Remove unused variable "need_out". The "need_out" variable was used to let the ssl code know we're done reading early data, and we should start the handshake. Now that the handshake function is responsible for taking care of reading early data, all that logic has been removed from ssl_sock_to_buf(), but need_out was forgotten, and left. Remove it know. This patch was submitted by William Dauchy <w.dauchy@criteo.com>, and should fix github issue #434. This should be backported to 2.0 and 2.1.	2020-01-05 16:45:14 +01:00
William Dauchy	3894d97fb8	MINOR: config: disable busy polling on old processes in the context of seamless reload and busy polling, older processes will create unecessary cpu conflicts; we can assume there is no need for busy polling for old processes which are waiting to be terminated. This patch is not a bug fix itself but might be a good stability improvment when you are un the context of frequent seamless reloads with a high "hard-stop-after" value; for that reasons I think this patch should be backported in all 2.x versions. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-02 10:29:49 +01:00
Ilya Shipitsin	125ad1f343	BUILD: CI: modernize cirrus-ci use freebsd-12.1 instead of freebsd-12.0, add freebsd-11.3 to build matrix, install socat in order to run modern reg-tests	2020-01-02 10:27:36 +01:00
Olivier Houchard	140237471e	BUG/MEDIUM: connections: Hold the lock when wanting to kill a connection. In connect_server(), when we decide we want to kill the connection of another thread because there are too many idle connections, hold the toremove_lock of the corresponding thread, othervise, there's a small race condition where we could try to add the connection to the toremove_connections list while it has already been free'd. This should be backported to 2.0 and 2.1.	2019-12-30 18:18:28 +01:00
Olivier Houchard	37d7897aaf	BUG/MEDIUM: checks: Only attempt to do handshakes if the connection is ready. When creating a new check connection, only attempt to add an handshake connection if the connection has fully been initialized. It can not be the case if a DNS resolution is still pending, and thus we don't yet have the address for the server, as the handshake code assumes the connection is fully initialized and would otherwise crash. This is not ideal, the check shouldn't probably run until we have an address, as it leads to check failures with "Socket error". While I'm there, also add an xprt handshake if we're using socks4, otherwise checks wouldn't be able to use socks4 properly. This should fix github issue #430 This should be backported to 2.0 and 2.1.	2019-12-30 15:18:16 +01:00
Willy Tarreau	eaf05be0ee	OPTIM: polling: do not create update entries for FD removal In order to reduce the number of poller updates, we can benefit from the fact that modern pollers use sampling to report readiness and that under load they rarely report the same FD multiple times in a row. As such it's not always necessary to disable such FDs especially when we're almost certain they'll be re-enabled again and will require another set of syscalls. Now instead of creating an update for a (possibly temporary) removal, we only perform this removal if the FD is reported again as ready while inactive. In addition this is performed via another update so that alternating workloads like transfers have a chance to re-enable the FD without any syscall during the loop (typically after the data that filled a buffer have been sent). However we only do that for single- threaded FDs as the other ones require a more complex setup and are not on the critical path. This does cause a few spurious wakeups but almost totally eliminates the calls to epoll_ctl() on connections seeing intermitent traffic like HTTP/1 to a server or client. A typical example with 100k requests for 4 kB objects over 200 connections shows that the number of epoll_ctl() calls doesn't depend on the number of requests anymore but most exclusively on the number of established connections: Before: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 57.09 0.499964 0 654361 321190 recvfrom 38.33 0.335741 0 369097 1 epoll_wait 4.56 0.039898 0 44643 epoll_ctl 0.02 0.000211 1 200 200 connect ------ ----------- ----------- --------- --------- ---------------- 100.00 0.875814 1068301 321391 total After: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 59.25 0.504676 0 657600 323630 recvfrom 40.68 0.346560 0 374289 1 epoll_wait 0.04 0.000370 0 620 epoll_ctl 0.03 0.000228 1 200 200 connect ------ ----------- ----------- --------- --------- ---------------- 100.00 0.851834 1032709 323831 total As expected there is also a slight increase of epoll_wait() calls since delaying de-activation of events can occasionally cause one spurious wakeup.	2019-12-27 16:38:47 +01:00
Willy Tarreau	5d7dcc2a8e	OPTIM: epoll: always poll for recv if neither active nor ready The cost of enabling polling in one direction with epoll is very high because it requires one syscall per FD and per direction change. In addition we don't know about input readiness until we either try to receive() or enable polling and watch the result. With HTTP keep-alive, both are equally expensive as it's very uncommon to see the server instantly respond (unless it's a second stage of the same process on localhost, which has become much less common with threads). But when a connection is established it's also quite usual to have to poll for sending (except on localhost or UNIX sockets where it almost always instantly works). So this cost of polling could be factored out with the second step if both were enabled together. This is the idea behind this patch. What it does is to always enable polling for Rx if it's not ready and at least one direction is active. This means that if it's not explicitly disabled, or if it was but in a state that causes the loss of the information (rx ready cannot be guessed), then let's take any opportunity for a polling change to enable it at the same time, and learn about rx readiness for free. In addition the FD never gets unregistered for Rx unless it's ready and was blocked (buffer full). This avoids a lot of the flip-flop behaviour at beginning and end of requests. On a test with 10k requests in keep-alive, the difference is quite noticeable: Before: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 83.67 0.010847 0 20078 epoll_ctl 16.33 0.002117 0 2231 epoll_wait 0.00 0.000000 0 20 20 connect ------ ----------- ----------- --------- --------- ---------------- 100.00 0.012964 22329 20 total After: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 96.35 0.003351 1 2644 epoll_wait 2.36 0.000082 4 20 20 connect 1.29 0.000045 0 66 epoll_ctl ------ ----------- ----------- --------- --------- ---------------- 100.00 0.003478 2730 20 total It may also save a recvfrom() after connect() by changing the following sequence, effectively saving one epoll_ctl() and one recvfrom() : before \| after -----------------------------+---------------------------- - connect() \| - connect() - epoll_ctl(add,out) \| - epoll_ctl(add, in\|out) - sendto() \| - epoll_wait() = out - epoll_ctl(mod,in\|out) \| - send() - epoll_wait() = out \| - epoll_wait() = in\|out - recvfrom() = EAGAIN \| - recvfrom() = OK - epoll_ctl(mod,in) \| - recvfrom() = EAGAIN - epoll_wait() = in \| - epoll_ctl(mod, in) - recvfrom() = OK \| - epoll_wait() - recvfrom() = EAGAIN \| - epoll_wait() \| (...) Now on a 10M req test on 16 threads with 2k concurrent conns and 415kreq/s, we see 190k updates total and 14k epoll_ctl() only.	2019-12-27 16:38:47 +01:00
Willy Tarreau	19689882e6	MINOR: poller: do not call the IO handler if the FD is not active For now this almost never happens but with subsequent patches it will become more important not to uselessly call the I/O handlers if the FD is not active.	2019-12-27 16:38:47 +01:00
Willy Tarreau	0fbc318e24	CLEANUP: connection: merge CO_FL_NOTIFY_DATA and CO_FL_NOTIFY_DONE Both flags became equal in commit `82967bf9` ("MINOR: connection: adjust CO_FL_NOTIFY_DATA after removal of flags"), which already predicted the overlap between xprt_done_cb() and wake() after the removal of the DATA specific flags in 1.8. Let's simply remove CO_FL_NOTIFY_DATA since the "_DONE" version already covers everything and explains the intent well enough.	2019-12-27 16:38:47 +01:00
Willy Tarreau	cbcf77edb7	MINOR: connection: remove the double test on xprt_done_cb() The conn_fd_handler used to have one possible call to this function to notify about end of handshakes, and another one to notify about connection setup or error. But given that we're now only performing wakeup calls after connection validation, we don't need to keep two places to run this test since the conditions do not change in between. This patch merges the two tests into a single one and moves the CO_FL_CONNECTED test appropriately as well so that it's called even on the error path if needed.	2019-12-27 16:38:47 +01:00

... 4 5 6 7 8 ...

11307 Commits