haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-10-26 14:10:59 +01:00

Author	SHA1	Message	Date
Willy Tarreau	33982cbdc0	BUG/MAJOR: stream: ensure analysers are always called upon close A recent issue affecting HTTP/2 + redirect + cache has uncovered an old problem affecting all existing versions regarding the way events are reported to analysers. It happens that when an event is reported, analysers see it and may decide to temporarily pause processing and prevent other analysers from processing the same event. Then the event may be cleared and upon the next call to the analysers, some of them will never see it. This is exactly what happens with CF_READ_NULL if it is received before the request is processed, like during redirects : the first time, some analysers see it, pause, then the event may be converted to a SHUTW and cleared, and on next call, there's nothing to process. In practice it's hard to get the CF_READ_NULL flag during the request because requests have CF_READ_DONTWAIT, preventing the read0 from happening. But on HTTP/2 it's presented along with any incoming request. Also on a TCP frontend the flag is not set and it's possible to read the NULL before the request is parsed. This causes a problem when filters are present because flt_end_analyse needs to be called to release allocated resources and remove the CF_FLT_ANALYZE flag. And the loss of this event prevents the analyser from being called and from removing itself, preventing the connection from ever ending. This problem just shows that the event processing needs a serious revamp after 1.8. In the mean time we can deal with the really problematic case which is that we want to call analysers if CF_SHUTW is set on any side ad it's the last opportunity to terminate a processing. It may occasionally result in some analysers being called for nothing in half- closed situations but it will take care of the issue. An example of problematic configuration triggering the bug in 1.7 is : frontend tcp bind :4445 default_backend http backend http redirect location / compression algo identity Then submitting requests which immediately close will have for effect to accumulate streams which will never be freed : $ printf "GET / HTTP/1.1\r\n\r\n" >/dev/tcp/0/4445 This fix must be backported to 1.7 as well as any version where commit c0c672a ("BUG/MINOR: http: Fix conditions to clean up a txn and to handle the next request") was backported. This commit didn't cause the bug but made it much more likely to happen.	2017-11-20 15:58:22 +01:00
Willy Tarreau	e223e3bc85	BUG/MEDIUM: stream: don't automatically forward connect nor close Upon stream instanciation, we used to enable channel auto connect and auto close to ease TCP processing. But commit 9aaf778 ("MAJOR: connection : Split struct connection into struct connection and struct conn_stream.") has revealed that it was a bad idea because this commit enables reading of the trailing shutdown that may follow a small requests, resulting in a read and a shutr turned into shutw before the stream even has a chance to apply the filters. This causes an issue with impossible situations where the backend stream interface is still in SI_ST_INI with a closed output, which blocks some streams for example when performing a redirect with filters enabled. Let's change this so that we only enable these two flags if there is no analyser on the stream. This way process_stream() has a chance to let the analysers decide whether or not to allow the shutdown event to be transferred to the other side. It doesn't seem possible to trigger this issue before 1.8, so for now it is preferable not to backport this fix.	2017-11-20 15:58:22 +01:00
David Carlier	91a88b0c25	BUG/MEDIUM: deviceatlas: ignore not valuable HTTP request data A customer reported a crash when within the HTTP request some headers were not set leading to the module to crash. So the module ignore them since empty data have no value for the detection. Needs to be backported to 1.7.	2017-11-17 10:41:40 +01:00
Olivier Houchard	e9bed53486	MINOR: ssl: Make sure we don't shutw the connection before the handshake. Instead of trying to finish the handshake in ssl_sock_shutw, which may fail, try not to shutdown until the handshake is finished.	2017-11-16 19:04:10 +01:00
Olivier Houchard	e6060c5d87	MINOR: SSL: Store the ASN1 representation of client sessions. Instead of storing the SSL_SESSION pointer directly in the struct server, store the ASN1 representation, otherwise, session resumption is broken with TLS 1.3, when multiple outgoing connections want to use the same session.	2017-11-16 19:03:32 +01:00
Christopher Faulet	f02050662b	MINOR: stream: Add thread-mask of tasks/FDs/applets in "show sess all" command	2017-11-16 11:19:46 +01:00
Christopher Faulet	b4a4d9aed4	MEDIUM: applets: Don't process more than 200 active applets at once Now, we process at most 200 active applets per call to applet_run_active. We use the same limit as the tasks. With the cache filter and the SPOE, the number of active applets can now be huge. So, it is important to limit the number of applets processed in applet_run_active.	2017-11-16 11:19:46 +01:00
Christopher Faulet	7163056dc5	MAJOR: polling: Use active_appels_mask instead of applets_active_queue applets_active_queue is the active queue size. It is a global variable. So it is underoptimized because we may be lead to consider there are active applets for a thread while in fact all active applets are assigned to the otherthreads. So, in such cases, the polling loop will be evaluated many more times than necessary. Instead, we now check if the thread id is set in the bitfield active_applets_mask. This is specific to threads, no backport is needed.	2017-11-16 11:19:46 +01:00
Christopher Faulet	595d7b72a6	MINOR: applets: Use a bitfield to track applets activity per-thread a bitfield has been added to know if there are runnable applets for a thread. When an applet is woken up, the bits corresponding to its thread_mask are set. When all active applets for a thread is get to be processed, the thread is removed from active ones by unsetting its tid_bit from the bitfield.	2017-11-16 11:19:46 +01:00
Christopher Faulet	8a48f67526	MAJOR: polling: Use active_tasks_mask instead of tasks_run_queue tasks_run_queue is the run queue size. It is a global variable. So it is underoptimized because we may be lead to consider there are active tasks for a thread while in fact all active tasks are assigned to the other threads. So, in such cases, the polling loop will be evaluated many more times than necessary. Instead, we now check if the thread id is set in the bitfield active_tasks_mask. Another change has been made in process_runnable_tasks. Now, we always limit the number of tasks processed to 200. This is specific to threads, no backport is needed.	2017-11-16 11:19:46 +01:00
Christopher Faulet	3911ee85df	MINOR: tasks: Use a bitfield to track tasks activity per-thread a bitfield has been added to know if there are runnable tasks for a thread. When a task is woken up, the bits corresponding to its thread_mask are set. When all tasks for a thread have been evaluated without any wakeup, the thread is removed from active ones by unsetting its tid_bit from the bitfield.	2017-11-16 11:19:46 +01:00
Christopher Faulet	96d4483df7	BUG/MINOR: Allocate the log buffers before the proxies startup Since the commit cd7879adc ("BUG/MEDIUM: threads: Run the poll loop on the main thread too"), the log buffers are allocated after the proxies startup. So log messages produced during this startup was ignored. To fix the bug, we restore the initialization of these buffers before proxies startup. This is specific to threads, no backport is needed.	2017-11-16 11:19:46 +01:00
William Lallemand	75ea0a06b0	BUG/MEDIUM: mworker: does not close inherited FD At the end of the master initialisation, a call to protocol_unbind_all() was made, in order to close all the FDs. Unfortunately, this function closes the inherited FDs (fd@), upon reload the master wasn't able to reload a configuration with those FDs. The create_listeners() function now store a flag to specify if the fd was inherited or not. Replace the protocol_unbind_all() by mworker_cleanlisteners() + deinit_pollers()	2017-11-15 19:53:33 +01:00
William Lallemand	fade49d8fb	BUG/MEDIUM: mworker: does not deinit anymore Does not use the deinit() function during a reload, it's dangerous and might be subject to double free, segfault and hazardous behavior if it's called twice in the case of a execvp fail.	2017-11-15 19:53:31 +01:00
William Lallemand	2f8b31c2c6	BUG/MEDIUM: mworker: wait again for signals when execvp fail After execvp fails, the signals were ignored, preventing to try a reload again. It is now fixed by reaching the top of the mworker_wait() function once the execvp failed.	2017-11-15 19:52:06 +01:00
William Lallemand	722d4ca0dd	MINOR: mworker: display an accurate error when the reexec fail When the master worker fail the execvp, it returns the wrong error "Cannot allocate memory". We now display the accurate error corresponding to the errno value.	2017-11-15 19:52:06 +01:00
Willy Tarreau	9c1e15d8cd	MINOR: tools: emphasize the node being worked on in the tree dump Now we can show in dotted red the node being removed or surrounded in red a node having been inserted, and add a description on the graph related to the operation in progress for example.	2017-11-15 19:43:05 +01:00
Willy Tarreau	6c7f4deb21	MINOR: tools: improve the DOT dump of the ebtree Use a smaller and cleaner fixed font, use upper case to indicate sides on branches, remove the useless node/leaf markers on branches since the colors already indicate them, and show the node's key as it helps spot the matching leaf.	2017-11-15 19:43:05 +01:00
Willy Tarreau	ed3cda02ae	MINOR: tools: add a function to dump a scope-aware tree to a file It emits a dump in DOT format for graphing purposes during debugging sessions. It's convenient to dump the run queue.	2017-11-15 16:07:15 +01:00
Christopher Faulet	99bca65f53	BUG/MEDIUM: standard: itao_str/idx and quote_str/idx must be thread-local This bug has an impact on the stats applet and easily leads to a crash of HAProxy. This is specific to threads, no backport is needed.	2017-11-14 18:11:57 +01:00
Christopher Faulet	919b739862	CLEANUP: tasks: Remove useless double test on rq_next No backport is needed, this is purely 1.8-specific.	2017-11-14 18:11:34 +01:00
Christopher Faulet	e9a896e09e	BUG/MINOR: threads: tid_bit must be a unsigned long This is specific to threads, no backport is needed.	2017-11-14 18:11:28 +01:00
William Lallemand	e1533f5790	MINOR: cache: disable cache if shctx_row_data_append fail Disable the cache if the append of data failed, it should never happen because the allocated row size is at least equal to the size of the object to allocate.	2017-11-14 15:20:44 +01:00
William Lallemand	10935bc547	MINOR: cache: forward data with headers Forward the remaining headers with the data in the first call of cache_store_http_forward_data(). Previously the headers were forwarded first, and the function left, implying an additionnal call to cache_store_http_forward_data() for the data. Cc: Christopher Faulet <cfaulet@haproxy.com>	2017-11-14 15:20:44 +01:00
William Lallemand	9d5f54daad	BUG/MEDIUM: cache: use msg->sov to forward header Use msg->sov to forward headers instead of msg->eoh. It can causes some problem because eoh does not contains the last \r\n, and the filter does not support to send the headers partially. Cc: Christopher Faulet <cfaulet@haproxy.com>	2017-11-14 15:20:44 +01:00
Tim Duesterhus	0436ab7841	BUG/MEDIUM: mworker: Fix re-exec when haproxy is started from PATH If haproxy is started using the name of the binary only (i.e. not using a relative or absolute path) the `execv` in `mworker_reload` fails with `ENOENT`, because it does not examine the `PATH`: [WARNING] 315/161139 (7) : Reexecuting Master process [WARNING] 315/161139 (7) : Cannot allocate memory [WARNING] 315/161139 (7) : Failed to reexecute the master processs [7] The error messages are misleading, because the return value of `execv` is not checked. This should be fixed in a separate commit. Once this happened the master process ignores any further signals sent by the administrator. Replace `execv` with `execvp` to establish the expected behaviour. This bug was introduced in commit 73b85e75b3963086be889e1fb40a59e7ef2ad63b.	2017-11-14 15:11:24 +01:00
Christopher Faulet	9dcf9b6f03	MINOR: threads: Use __decl_hathreads to declare locks This macro should be used to declare variables or struct members depending on the USE_THREAD compile option. It avoids the encapsulation of such declarations between #ifdef/#endif. It is used to declare all lock variables.	2017-11-13 11:38:17 +01:00
Christopher Faulet	600d37edda	BUG/MINOR: spoe: check buffer size before acquiring or releasing it In spoe_acquire_buffer and spoe_release_buffer, instead of checking the buffer against buf_empty, we now check its size. It is important because when an allocation fails, it will be set to buf_wanted. In both cases, the size is 0. It is a proactive bug fix, no real problem was observed till now. It cannot be backported as is in 1.7 because of all changes made on the SPOE in 1.8.	2017-11-13 11:38:12 +01:00
Willy Tarreau	3e5e417060	BUILD: thread/pipe: fix build without threads Marcus R�ckert reported that commit d8b3b65 ("BUG/MEDIUM: splice/threads: pipe reuse list was not protected.") broke threadless support. Add the required #ifdef.	2017-11-11 18:00:24 +01:00
William Lallemand	18f133adb3	BUG/MEDIUM: cache: does not cache if no Content-Length In the case of Transfer-Encoding: chunked, there is no Content-Length which causes the cache to allocate a too small shctx row for the data. It's not possible to allocate a shctx row for the chunks, we need to be able to allocate on-the-fly the shctx blocks during the data transfer.	2017-11-11 14:01:21 +01:00
Willy Tarreau	916597903c	MEDIUM: http: always reject the "PRI" method This method was reserved for the HTTP/2 connection preface, must never be used and must be rejected. In normal situations it doesn't happen, but it may be visible if a TCP frontend has alpn "h2" enabled, and forwards to an HTTP backend which tries to parse the request. Before this patch it would pass the wrong request to the backend server, now it properly returns 400 bad req. This patch should probably be backported to stable versions.	2017-11-10 19:38:10 +01:00
Willy Tarreau	387bd4f69f	CLEANUP: global: introduce variable pid_bit to avoid shifts with relative_pid At a number of places, bitmasks are used for process affinity and to map listeners to processes. Every time 1UL<<(relative_pid-1) is used. Let's create a "pid_bit" variable corresponding to this value to clean this up.	2017-11-10 19:08:14 +01:00
Willy Tarreau	9a398beac3	BUG/MEDIUM: stream: don't ignore res.analyse_exp anymore It happens that no single analyser has ever needed to set res.analyse_exp, so that process_stream() didn't consider it when computing the next task expiration date. Since Lua actions were introduced in 1.6, this can be needed on http-response actions for example, so let's ensure it's properly handled. Thanks to Nick Dimov for reporting this bug. The fix needs to be backported to 1.7 and 1.6.	2017-11-10 17:14:23 +01:00
Willy Tarreau	5d9846f4b3	MINOR: cli: make "show fd" report the fd's thread mask This is useful to know what thread(s) an fd is scheduled to be handled on. It's worth noting that at the moment the "show fd"d doesn't seem totally thread-safe.	2017-11-10 16:53:09 +01:00
Willy Tarreau	28b55c6fed	CLEANUP: mux: remove the unused "release()" function In commit 53a4766 ("MEDIUM: connection: start to introduce a mux layer between xprt and data") we introduced a release() function which ends up never being used. Let's get rid of it now.	2017-11-10 16:43:05 +01:00
Willy Tarreau	7ce3f09513	BUG/MEDIUM: threads/cli: fix "show sess" locking on release The recent thread updates on the CLI broke "show sess" by unlocking the stream twice instead of lock+unlock. No backport is needed.	2017-11-10 16:24:41 +01:00
Willy Tarreau	22cf59bbba	BUG/MEDIUM: h2: support orphaned streams When a stream_interface performs a shutw() then a shutr(), the stream is marked closed. Then cs_destroy() calls h2_detach() and it cannot fail since we're on the leaving path of the caller. The problem is that in order to close streams we usually have to send either an emty DATA frame with the ES flag set or an RST_STREAM frame, and the mux buffer might already be full, forcing the stream to be queued. The forced removal of this stream causes this last message to silently disappear, and the client to wait forever for a response. This commit ensures we can detach the conn_stream from the h2 stream if the stream is blocked, effectively making the h2 stream an orphan, ensures that the mux can deal with orphaned streams after processing them, and that the demux can kill them upon receipt of GOAWAY.	2017-11-10 11:48:15 +01:00
Willy Tarreau	8c0ea7d21a	BUG/MEDIUM: h2: split the function to send RST_STREAM There is an issue with how the RST_STREAM frames are sent. Some of them are sent from the demux, either for valid or for closed streams, and some are sent from the mux always for valid streams. At the moment the demux stream ID is used, which is wrong for all streams being muxed, and sometimes results in certain bad HTTP responses causing the emission of an RST_STREAM referencing stream zero. In addition, the stream's blocked flags could be updated even if the stream was the closed or idle ones. We really need to split the function for the two distinct use cases where one is used to send an RST on a condition detected at the connection level (such as a closed stream) and the other one is used to send an RST for a condition detected at the stream level. The first one is used only in the demux, and the other one only by a valid stream.	2017-11-10 10:05:24 +01:00
Christopher Faulet	09fdf4b112	BUG/MINOR: pattern: Rely on the sample type to copy it in pattern_exec_match To be thread safe, the function pattern_exec_match copy data (the pattern and the inner sample) in thread-local variables. But when the sample is duplicated, we must check its type and not the pattern one. This is specific to threads, no backport is needed.	2017-11-09 17:19:20 +01:00
Christopher Faulet	c5a9d5bf23	BUG/MEDIUM: stream-int: Don't loss write's notifs when a stream is woken up When a write activity is reported on a channel, it is important to keep this information for the stream because it take part on the analyzers' triggering. When some data are written, the flag CF_WRITE_PARTIAL is set. It participates to the task's timeout updates and to the stream's waking. It is also used in CF_MASK_ANALYSER mask to trigger channels anaylzers. In the past, it was cleared by process_stream. Because of a bug (fixed in commit 95fad5ba4 ["BUG/MAJOR: stream-int: don't re-arm recv if send fails"]), It is now cleared before each send and in stream_int_notify. So it is possible to loss this information when process_stream is called, preventing analyzers to be called, and possibly leading to a stalled stream. Today, this happens in HTTP2 when you call the stat page or when you use the cache filter. In fact, this happens when the response is sent by an applet. In HTTP1, everything seems to work as expected. To fix the problem, we need to make the difference between the write activity reported to lower layers and the one reported to the stream. So the flag CF_WRITE_EVENT has been added to notify the stream of the write activity on a channel. It is set when a send succedded and reset by process_stream. It is also used in CF_MASK_ANALYSER. finally, it is checked in stream_int_notify to wake up a stream and in channel_check_timeouts. This bug is probably present in 1.7 but it seems to have no effect. So for now, no needs to backport it.	2017-11-09 15:16:05 +01:00
Willy Tarreau	a87f202b49	BUG/MEDIUM: h2: reject non-3-digit status codes If the H1 parser would report a status code length not consisting in exactly 3 digits, the error case was confused with a lack of buffer room and was causing the parser to loop infinitely.	2017-11-09 11:23:00 +01:00
Willy Tarreau	1b4cf9b754	BUG/MINOR: h1: the HTTP/1 make status code parser check for digits The H1 parser used by the H2 gateway was a bit lax and could validate non-numbers in the status code. Since it computes the code on the fly it's problematic, as "30:" is read as status code 310. Let's properly check that it's a number now. No backport needed.	2017-11-09 11:15:45 +01:00
Willy Tarreau	ddfbd83780	BUILD: shctx: do not depend on openssl anymore The build breaks on a machine without openssl/crypto.h because shctx still loads openssl-compat.h while it doesn't need it anymore since the code was moved : In file included from src/shctx.c:20:0: include/proto/openssl-compat.h:3:28: fatal error: openssl/crypto.h: No such file or directory #include <openssl/crypto.h> Just remove include openssl-compat from shctx.	2017-11-08 14:33:36 +01:00
Willy Tarreau	46c9d3e6cb	BUILD: ssl: fix build of backend without ssl Commit 522eea7 ("MINOR: ssl: Handle sending early data to server.") added a dependency on SRV_SSL_O_EARLY_DATA which only exists when USE_OPENSSL is defined (which is probably not the best solution) and breaks the build when ssl is not enabled. Just add an ifdef USE_OPENSSL around the block for now.	2017-11-08 14:28:08 +01:00
Olivier Houchard	522eea7110	MINOR: ssl: Handle sending early data to server. This adds a new keyword on the "server" line, "allow-0rtt", if set, we'll try to send early data to the server, as long as the client sent early data, as in case the server rejects the early data, we no longer have them, and can't resend them, so the only option we have is to send back a 425, and we need to be sure the client knows how to interpret it correctly.	2017-11-08 14:11:10 +01:00
Olivier Houchard	cfdef2e312	MINOR: ssl: Spell 0x10101000L correctly. Issue added in 1.8-dev by c2aae74 ("MEDIUM: ssl: Handle early data with OpenSSL 1.1.1"), no impact on older versions.	2017-11-08 14:10:02 +01:00
Olivier Houchard	bd84ac8737	MINOR: ssl: Handle session resumption with TLS 1.3 With TLS 1.3, session aren't established until after the main handshake has completed. So we can't just rely on calling SSL_get1_session(). Instead, we now register a callback for the "new session" event. This should work for previous versions of TLS as well.	2017-11-08 14:08:07 +01:00
Olivier Houchard	35a63cc1c7	BUG/MINOR; ssl: Don't assume we have a ssl_bind_conf because a SNI is matched. We only have a ssl_bind_conf if crt-list is used, however we can still match a certificate SNI, so don't assume we have a ssl_bind_conf.	2017-11-08 14:08:07 +01:00
Willy Tarreau	9e45b33f7e	BUG/MAJOR: threads/tasks: fix the scheduler again My recent change in commit ce4e0aa ("MEDIUM: task: change the construction of the loop in process_runnable_tasks()") was bogus as it used to keep the rq_next across an unlock/lock sequence, occasionally leading to crashes for tasks that are eligible to any thread. We must use the lookup call for each new batch instead. The problem is easily triggered with such a configuration : global nbthread 4 listen check mode http bind 0.0.0.0:8080 redirect location / option httpchk GET / server s1 127.0.0.1:8080 check inter 1 server s2 127.0.0.1:8080 check inter 1 Thanks to Olivier for diagnosing this one. No backport is needed.	2017-11-08 14:05:19 +01:00
Willy Tarreau	ecd2e15919	BUG/MINOR: stream-int: don't set MSG_MORE on closed request path Commit 4ac4928 ("BUG/MINOR: stream-int: don't set MSG_MORE on SHUTW_NOW without AUTO_CLOSE") was incomplete. H2 reveals another situation where the input stream is marked closed with the request and we set MSG_MORE, causing a delay before the request leaves. Better avoid setting the flag on the request path for close cases in general.	2017-11-07 15:07:25 +01:00

... 32 33 34 35 36 ...

7227 Commits