haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-10-31 00:21:00 +01:00

Author	SHA1	Message	Date
William Lallemand	e1533f5790	MINOR: cache: disable cache if shctx_row_data_append fail Disable the cache if the append of data failed, it should never happen because the allocated row size is at least equal to the size of the object to allocate.	2017-11-14 15:20:44 +01:00
William Lallemand	10935bc547	MINOR: cache: forward data with headers Forward the remaining headers with the data in the first call of cache_store_http_forward_data(). Previously the headers were forwarded first, and the function left, implying an additionnal call to cache_store_http_forward_data() for the data. Cc: Christopher Faulet <cfaulet@haproxy.com>	2017-11-14 15:20:44 +01:00
William Lallemand	9d5f54daad	BUG/MEDIUM: cache: use msg->sov to forward header Use msg->sov to forward headers instead of msg->eoh. It can causes some problem because eoh does not contains the last \r\n, and the filter does not support to send the headers partially. Cc: Christopher Faulet <cfaulet@haproxy.com>	2017-11-14 15:20:44 +01:00
Tim Duesterhus	0436ab7841	BUG/MEDIUM: mworker: Fix re-exec when haproxy is started from PATH If haproxy is started using the name of the binary only (i.e. not using a relative or absolute path) the `execv` in `mworker_reload` fails with `ENOENT`, because it does not examine the `PATH`: [WARNING] 315/161139 (7) : Reexecuting Master process [WARNING] 315/161139 (7) : Cannot allocate memory [WARNING] 315/161139 (7) : Failed to reexecute the master processs [7] The error messages are misleading, because the return value of `execv` is not checked. This should be fixed in a separate commit. Once this happened the master process ignores any further signals sent by the administrator. Replace `execv` with `execvp` to establish the expected behaviour. This bug was introduced in commit 73b85e75b3963086be889e1fb40a59e7ef2ad63b.	2017-11-14 15:11:24 +01:00
Christopher Faulet	9dcf9b6f03	MINOR: threads: Use __decl_hathreads to declare locks This macro should be used to declare variables or struct members depending on the USE_THREAD compile option. It avoids the encapsulation of such declarations between #ifdef/#endif. It is used to declare all lock variables.	2017-11-13 11:38:17 +01:00
Christopher Faulet	600d37edda	BUG/MINOR: spoe: check buffer size before acquiring or releasing it In spoe_acquire_buffer and spoe_release_buffer, instead of checking the buffer against buf_empty, we now check its size. It is important because when an allocation fails, it will be set to buf_wanted. In both cases, the size is 0. It is a proactive bug fix, no real problem was observed till now. It cannot be backported as is in 1.7 because of all changes made on the SPOE in 1.8.	2017-11-13 11:38:12 +01:00
Willy Tarreau	3e5e417060	BUILD: thread/pipe: fix build without threads Marcus R�ckert reported that commit d8b3b65 ("BUG/MEDIUM: splice/threads: pipe reuse list was not protected.") broke threadless support. Add the required #ifdef.	2017-11-11 18:00:24 +01:00
William Lallemand	18f133adb3	BUG/MEDIUM: cache: does not cache if no Content-Length In the case of Transfer-Encoding: chunked, there is no Content-Length which causes the cache to allocate a too small shctx row for the data. It's not possible to allocate a shctx row for the chunks, we need to be able to allocate on-the-fly the shctx blocks during the data transfer.	2017-11-11 14:01:21 +01:00
Willy Tarreau	916597903c	MEDIUM: http: always reject the "PRI" method This method was reserved for the HTTP/2 connection preface, must never be used and must be rejected. In normal situations it doesn't happen, but it may be visible if a TCP frontend has alpn "h2" enabled, and forwards to an HTTP backend which tries to parse the request. Before this patch it would pass the wrong request to the backend server, now it properly returns 400 bad req. This patch should probably be backported to stable versions.	2017-11-10 19:38:10 +01:00
Willy Tarreau	387bd4f69f	CLEANUP: global: introduce variable pid_bit to avoid shifts with relative_pid At a number of places, bitmasks are used for process affinity and to map listeners to processes. Every time 1UL<<(relative_pid-1) is used. Let's create a "pid_bit" variable corresponding to this value to clean this up.	2017-11-10 19:08:14 +01:00
Willy Tarreau	9a398beac3	BUG/MEDIUM: stream: don't ignore res.analyse_exp anymore It happens that no single analyser has ever needed to set res.analyse_exp, so that process_stream() didn't consider it when computing the next task expiration date. Since Lua actions were introduced in 1.6, this can be needed on http-response actions for example, so let's ensure it's properly handled. Thanks to Nick Dimov for reporting this bug. The fix needs to be backported to 1.7 and 1.6.	2017-11-10 17:14:23 +01:00
Willy Tarreau	5d9846f4b3	MINOR: cli: make "show fd" report the fd's thread mask This is useful to know what thread(s) an fd is scheduled to be handled on. It's worth noting that at the moment the "show fd"d doesn't seem totally thread-safe.	2017-11-10 16:53:09 +01:00
Willy Tarreau	28b55c6fed	CLEANUP: mux: remove the unused "release()" function In commit 53a4766 ("MEDIUM: connection: start to introduce a mux layer between xprt and data") we introduced a release() function which ends up never being used. Let's get rid of it now.	2017-11-10 16:43:05 +01:00
Willy Tarreau	7ce3f09513	BUG/MEDIUM: threads/cli: fix "show sess" locking on release The recent thread updates on the CLI broke "show sess" by unlocking the stream twice instead of lock+unlock. No backport is needed.	2017-11-10 16:24:41 +01:00
Willy Tarreau	22cf59bbba	BUG/MEDIUM: h2: support orphaned streams When a stream_interface performs a shutw() then a shutr(), the stream is marked closed. Then cs_destroy() calls h2_detach() and it cannot fail since we're on the leaving path of the caller. The problem is that in order to close streams we usually have to send either an emty DATA frame with the ES flag set or an RST_STREAM frame, and the mux buffer might already be full, forcing the stream to be queued. The forced removal of this stream causes this last message to silently disappear, and the client to wait forever for a response. This commit ensures we can detach the conn_stream from the h2 stream if the stream is blocked, effectively making the h2 stream an orphan, ensures that the mux can deal with orphaned streams after processing them, and that the demux can kill them upon receipt of GOAWAY.	2017-11-10 11:48:15 +01:00
Willy Tarreau	8c0ea7d21a	BUG/MEDIUM: h2: split the function to send RST_STREAM There is an issue with how the RST_STREAM frames are sent. Some of them are sent from the demux, either for valid or for closed streams, and some are sent from the mux always for valid streams. At the moment the demux stream ID is used, which is wrong for all streams being muxed, and sometimes results in certain bad HTTP responses causing the emission of an RST_STREAM referencing stream zero. In addition, the stream's blocked flags could be updated even if the stream was the closed or idle ones. We really need to split the function for the two distinct use cases where one is used to send an RST on a condition detected at the connection level (such as a closed stream) and the other one is used to send an RST for a condition detected at the stream level. The first one is used only in the demux, and the other one only by a valid stream.	2017-11-10 10:05:24 +01:00
Christopher Faulet	09fdf4b112	BUG/MINOR: pattern: Rely on the sample type to copy it in pattern_exec_match To be thread safe, the function pattern_exec_match copy data (the pattern and the inner sample) in thread-local variables. But when the sample is duplicated, we must check its type and not the pattern one. This is specific to threads, no backport is needed.	2017-11-09 17:19:20 +01:00
Christopher Faulet	c5a9d5bf23	BUG/MEDIUM: stream-int: Don't loss write's notifs when a stream is woken up When a write activity is reported on a channel, it is important to keep this information for the stream because it take part on the analyzers' triggering. When some data are written, the flag CF_WRITE_PARTIAL is set. It participates to the task's timeout updates and to the stream's waking. It is also used in CF_MASK_ANALYSER mask to trigger channels anaylzers. In the past, it was cleared by process_stream. Because of a bug (fixed in commit 95fad5ba4 ["BUG/MAJOR: stream-int: don't re-arm recv if send fails"]), It is now cleared before each send and in stream_int_notify. So it is possible to loss this information when process_stream is called, preventing analyzers to be called, and possibly leading to a stalled stream. Today, this happens in HTTP2 when you call the stat page or when you use the cache filter. In fact, this happens when the response is sent by an applet. In HTTP1, everything seems to work as expected. To fix the problem, we need to make the difference between the write activity reported to lower layers and the one reported to the stream. So the flag CF_WRITE_EVENT has been added to notify the stream of the write activity on a channel. It is set when a send succedded and reset by process_stream. It is also used in CF_MASK_ANALYSER. finally, it is checked in stream_int_notify to wake up a stream and in channel_check_timeouts. This bug is probably present in 1.7 but it seems to have no effect. So for now, no needs to backport it.	2017-11-09 15:16:05 +01:00
Willy Tarreau	a87f202b49	BUG/MEDIUM: h2: reject non-3-digit status codes If the H1 parser would report a status code length not consisting in exactly 3 digits, the error case was confused with a lack of buffer room and was causing the parser to loop infinitely.	2017-11-09 11:23:00 +01:00
Willy Tarreau	1b4cf9b754	BUG/MINOR: h1: the HTTP/1 make status code parser check for digits The H1 parser used by the H2 gateway was a bit lax and could validate non-numbers in the status code. Since it computes the code on the fly it's problematic, as "30:" is read as status code 310. Let's properly check that it's a number now. No backport needed.	2017-11-09 11:15:45 +01:00
Willy Tarreau	ddfbd83780	BUILD: shctx: do not depend on openssl anymore The build breaks on a machine without openssl/crypto.h because shctx still loads openssl-compat.h while it doesn't need it anymore since the code was moved : In file included from src/shctx.c:20:0: include/proto/openssl-compat.h:3:28: fatal error: openssl/crypto.h: No such file or directory #include <openssl/crypto.h> Just remove include openssl-compat from shctx.	2017-11-08 14:33:36 +01:00
Willy Tarreau	46c9d3e6cb	BUILD: ssl: fix build of backend without ssl Commit 522eea7 ("MINOR: ssl: Handle sending early data to server.") added a dependency on SRV_SSL_O_EARLY_DATA which only exists when USE_OPENSSL is defined (which is probably not the best solution) and breaks the build when ssl is not enabled. Just add an ifdef USE_OPENSSL around the block for now.	2017-11-08 14:28:08 +01:00
Olivier Houchard	522eea7110	MINOR: ssl: Handle sending early data to server. This adds a new keyword on the "server" line, "allow-0rtt", if set, we'll try to send early data to the server, as long as the client sent early data, as in case the server rejects the early data, we no longer have them, and can't resend them, so the only option we have is to send back a 425, and we need to be sure the client knows how to interpret it correctly.	2017-11-08 14:11:10 +01:00
Olivier Houchard	cfdef2e312	MINOR: ssl: Spell 0x10101000L correctly. Issue added in 1.8-dev by c2aae74 ("MEDIUM: ssl: Handle early data with OpenSSL 1.1.1"), no impact on older versions.	2017-11-08 14:10:02 +01:00
Olivier Houchard	bd84ac8737	MINOR: ssl: Handle session resumption with TLS 1.3 With TLS 1.3, session aren't established until after the main handshake has completed. So we can't just rely on calling SSL_get1_session(). Instead, we now register a callback for the "new session" event. This should work for previous versions of TLS as well.	2017-11-08 14:08:07 +01:00
Olivier Houchard	35a63cc1c7	BUG/MINOR; ssl: Don't assume we have a ssl_bind_conf because a SNI is matched. We only have a ssl_bind_conf if crt-list is used, however we can still match a certificate SNI, so don't assume we have a ssl_bind_conf.	2017-11-08 14:08:07 +01:00
Willy Tarreau	9e45b33f7e	BUG/MAJOR: threads/tasks: fix the scheduler again My recent change in commit ce4e0aa ("MEDIUM: task: change the construction of the loop in process_runnable_tasks()") was bogus as it used to keep the rq_next across an unlock/lock sequence, occasionally leading to crashes for tasks that are eligible to any thread. We must use the lookup call for each new batch instead. The problem is easily triggered with such a configuration : global nbthread 4 listen check mode http bind 0.0.0.0:8080 redirect location / option httpchk GET / server s1 127.0.0.1:8080 check inter 1 server s2 127.0.0.1:8080 check inter 1 Thanks to Olivier for diagnosing this one. No backport is needed.	2017-11-08 14:05:19 +01:00
Willy Tarreau	ecd2e15919	BUG/MINOR: stream-int: don't set MSG_MORE on closed request path Commit 4ac4928 ("BUG/MINOR: stream-int: don't set MSG_MORE on SHUTW_NOW without AUTO_CLOSE") was incomplete. H2 reveals another situation where the input stream is marked closed with the request and we set MSG_MORE, causing a delay before the request leaves. Better avoid setting the flag on the request path for close cases in general.	2017-11-07 15:07:25 +01:00
Emeric Brun	11f5886e5c	BUG/MINOR: comp: fix compilation warning compiling without compression. This is specific to threads, no backport is needed.	2017-11-07 14:48:13 +01:00
Emeric Brun	d8b3b65faa	BUG/MEDIUM: splice/threads: pipe reuse list was not protected. The list is now protected using a global spinlock.	2017-11-07 14:47:28 +01:00
Willy Tarreau	926fa4c098	BUG/MINOR: h2: don't send GOAWAY on failed response As part of the detection for intentional closes, we can kill the connection if a shutw() happens before the headers. But it can also happen that an invalid response is not properly parsed, preventing any headers frame from being sent and making the function believe it was an abort. Now instead we check if any response was received from the stream, regardless of the fact that it was properly converted.	2017-11-07 14:47:04 +01:00
Willy Tarreau	c4312d3dfd	MINOR: h2: add new stream flag H2_SF_OUTGOING_DATA This one indicates whether we've received data to mux out. It helps make the difference between a clean close and a an erroneous one.	2017-11-07 14:47:04 +01:00
Willy Tarreau	58e3208714	BUG/MINOR: h2: correctly check for H2_SF_ES_SENT before closing In h2_shutw() we must not send another empty frame (nor RST) after one has been sent, as the stream is already in HLOC/CLOSED state.	2017-11-07 14:47:04 +01:00
Willy Tarreau	6d8b682f9a	BUG/MEDIUM: h2: properly set H2_SF_ES_SENT when sending the final frame When sending DATA+ES, it's important to set H2_SF_ES_SENT as we don't want to emit is several times nor to send an RST afterwards.	2017-11-07 14:47:04 +01:00
Willy Tarreau	e6ae77f64f	MINOR: h2: don't re-enable the connection's task when we're closing It's pointless to requeue the task when we're closing, so swap the order of the task_queue() and h2_release(). It also matches what was written in the comment regarding re-arming the timer.	2017-11-07 14:47:04 +01:00
Willy Tarreau	83906c2f91	BUG/MEDIUM: h2: don't close the connection is there are data left h2_detach() is called after a stream was closed, and it evaluates if it's worth closing the connection. The issue there is that the connection is closed too early in case there's demand for closing after the last stream, even if some data remain in the mux. Let's change the condition to check for this.	2017-11-07 14:47:04 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Willy Tarreau	7d8e4af46a	BUG/MEDIUM: h2: fix some wrong error codes on connections When the assignment of the connection state was moved into h2c_error(), 3 of them were missed because they were wrong, using H2_SS_ERROR instead. This resulted in the connection's state being set to H2_CS_ERROR2 in fact, so the error was not properly sent.	2017-11-07 11:08:28 +01:00
Willy Tarreau	721c974e5e	MEDIUM: h2: remove the H2_SS_RESET intermediate state This one was created to maintain the knowledge that a stream was closed after having sent an RST_STREAM frame but that's not needed anymore and it confuses certain conditions on the error processing path. It's time to get rid of it.	2017-11-07 11:05:42 +01:00
Willy Tarreau	319994a2e9	BUG/MEDIUM: h2: don't try (and fail) to send non-existing data in the mux The call to xprt->snd_buf() was not conditionned on the presence of data in the buffer, resulting in snd_buf() returning 0 and never disabling the polling. It was revealed by the previous bug on error processing but must properly be handled.	2017-11-07 11:03:56 +01:00
Willy Tarreau	3eabe9b174	BUG/MEDIUM: h2: properly send the GOAWAY frame in the mux A typo on a condition prevented H2_CS_ERROR from being processed, leading to an infinite loop on connection error.	2017-11-07 11:03:01 +01:00
Willy Tarreau	c6795ca7c1	BUG/MEDIUM: h2: properly send an RST_STREAM on mux stream error Some stream errors are detected on the MUX path (eg: H1 response encoding). The ones forgot to emit an RST_STREAM frame, causing the client to wait and/or to see the connection being immediately closed. This is now fixed.	2017-11-07 09:43:06 +01:00
Willy Tarreau	6743420778	BUG/MINOR: h2: set the "HEADERS_SENT" flag on stream, not connection This flag was added after the GOAWAY flags were introduced and mistakenly placed in the connection, but that doesn't make sense as it's specific to the stream. The main impact is the risk of returning a DATA0+ES frame for an error instead of an RST_STREAM.	2017-11-06 20:20:51 +01:00
Olivier Houchard	283810773a	BUG/MINOR: dns: Don't lock the server lock in snr_check_ip_callback(). snr_check_ip_callback() may be called with the server lock, so don't attempt to lock it again, instead, make sure the callers always have the lock before calling it.	2017-11-06 18:34:42 +01:00
Olivier Houchard	55dcdf4c39	BUG/MINOR: dns: Don't try to get the server lock if it's already held. dns_link_resolution() can be called with the server lock already held, so don't attempt to lock it again in that case.	2017-11-06 18:34:24 +01:00
Willy Tarreau	f0c531ab55	MEDIUM: tasks: implement a lockless scheduler for single-thread usage The scheduler is complex and uses local queues to amortize the cost of locks. But all this comes with a cost that is quite observable with single-thread workloads. The purpose of this patch is to reimplement the much simpler scheduler for the case where threads are not used. The code is very small and simple. It doesn't impact the multi-threaded performance at all, and provides a nice 10% performance increase in single-thread by reaching 606kreq/s on the tests that showed 550kreq/s before.	2017-11-06 11:20:11 +01:00
Willy Tarreau	9d4b56b88e	MINOR: tasks: only visit filled task slots after processing them process_runnable_tasks() needs to requeue or wake up tasks after processing them in batches. By only refilling the existing ones, we avoid revisiting all the queue. The performance gain is measurable starting with two threads, where the request rate climbs to 657k/s compared to 644k.	2017-11-06 11:20:11 +01:00
Willy Tarreau	ce4e0aa7f3	MEDIUM: task: change the construction of the loop in process_runnable_tasks() This patch slightly rearranges the loop to pack the locked code a little bit, and to try to concentrate accesses to the tree together to benefit more from the cache. It also fixes how the loop handles the right margin : now that is guaranteed that the retrieved nodes are filtered to only match the current thread, we don't need to rewind every 16 entries. Instead we can rewind each time we reach the right margin again. With this change, we now achieve the following performance for 10 H2 conns each containing 100 streams : 1 thread : 550kreq/s 2 thread : 644kreq/s 3 thread : 598kreq/s	2017-11-06 11:20:11 +01:00
Willy Tarreau	b992ba16ef	MINOR: task: simplify wake_expired_tasks() to avoid unlocking in the loop This function is sensitive, let's make it shorter by factoring out the unlock and leave code. This reduced the function's size by a few tens of bytes and increased the overall performance by about 1%.	2017-11-06 11:20:11 +01:00
Willy Tarreau	8d38805d3d	MAJOR: task: make use of the scope-aware ebtree functions Currently the task scheduler suffers from an O(n) lookup when skipping tasks that are not for the current thread. The reason is that eb32_lookup_ge() has no information about the current thread so it always revisits many tasks for other threads before finding its own tasks. This is particularly visible with HTTP/2 since the number of concurrent streams created at once causes long series of tasks for the same stream in the scheduler. With only 10 connections and 100 streams each, by running on two threads, the performance drops from 640kreq/s to 11.2kreq/s! Lookup metrics show that for only 200000 task lookups, 430 million skips had to be performed, which means that on average, each lookup leads to 2150 nodes to be visited. This commit backports the principle of scope lookups for ebtrees from the ebtree_v7 development tree. The idea is that each node contains a mask indicating the union of the scopes for the nodes below it, which is fed during insertion, and used during lookups. Then during lookups, branches that do not contain any leaf matching the requested scope are simply ignored. This perfectly matches a thread mask, allowing a thread to only extract the tasks it cares about from the run queue, and to always find them in O(log(n)) instead of O(n). Thus the scheduler uses tid_bit and task->thread_mask as the ebtree scope here. Doing this has recovered most of the performance, as can be seen on the test below with two threads, 10 connections, 100 streams each, and 1 million requests total : Before After Gain test duration : 89.6s 4.73s x19 HTTP requests/s (DEBUG) : 11200 211300 x19 HTTP requests/s (PROD) : 15900 447000 x28 spin_lock time : 85.2s 0.46s /185 time per lookup : 13us 40ns /325 Even when going to 6 threads (on 3 hyperthreaded CPU cores), the performance stays around 284000 req/s, showing that the contention is much lower. A test showed that there's no benefit in using this for the wait queue though.	2017-11-06 11:20:11 +01:00

... 6 7 8 9 10 ...

5905 Commits