haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-12-16 15:11:01 +01:00

Author	SHA1	Message	Date
Christopher Faulet	35ab9b8c6d	DEBUG: mux-h1: Add debug counters to track some errors Debug counters are added to track errors about wrong the payload length during the message formatting (on the sending path). Aborts are also concerned. connection shutdowns and errors while the end of the message was not reached are now tracked. On the sending path, shutdown performed while all the message was not forwarded are tracked too.	2024-10-22 17:39:32 +02:00
Christopher Faulet	f98feda53f	MINOR: mux-h1: Add a trace on shutdown when keep-alive is not possible When the stream is shut down, some tests are performed to know if the connection must also be closed or not. There are trace messages for all cases, except for the default one: Abort or close-mode. Thanks to this patch, there is now a message too in this case.	2024-10-17 13:53:40 +02:00
Christopher Faulet	2c82ca60c6	MINOR: mux-h1: Show the SD iobuf in trace messages on stream send events Info about the SD iobuf are now dumped in trace messages when a stream send event is processed. It is a useful information to debug zero-copy forwarding issues.	2024-10-17 13:53:40 +02:00
Willy Tarreau	d288ddb575	CLEANUP: muxes: remove useless inclusion of ebmbtree.h Since 2.7 with commit 8522348482 ("BUG/MAJOR: conn-idle: fix hash indexing issues on idle conns"), we've been using eb64 trees and not ebmb trees anymore, and later we dropped all that to centralize the operations in the server. Let's remove the ebmbtree.h includes from the muxes that do not use them.	2024-10-12 16:29:15 +02:00
Christopher Faulet	267ba1d889	MINOR: mux-h1: Use a dedicated function to conditionnaly set EOI flag on SE The same conditions are evaluated in h1_process_demux() and h1_fastfwd() to know if SE_FL_EOI flag must be set or not on the sedesc. So now, a dedicated function is used.	2024-10-02 10:22:51 +02:00
Christopher Faulet	6b39e245e1	BUG/MINOR: mux-h1: Fix condition to set EOI on SE during zero-copy forwarding During zero-copy data forwarding, the producer must set the EOI flag on the SE when end of the message is reached. It is already done but there is a case where this flag is set while it should not. When a request wants to perform a protocol upgrade and it is waiting for the server response, the flag must not be set because the HTTP message is finished but some data are possibly still expected, depending on the server response. On a 101-switching-protocol, more data will be sent because the producer is switch to TUNNEL state. So, now, the right condition is used. In DONE state, SE_FL_EOI flag is set on the sedesc iff: - it is the response - it is the request and the response is also in DONNE state - it is a request but no a protocol upgrade nor a CONNECT This patch must be backported as far as 2.9.	2024-10-02 10:22:51 +02:00
Christopher Faulet	1900ca475f	MEDIUM: h1: Accept invalid T-E values with accept-invalid-http-response option Since the 2.6, A parsing error is reported when the chunked encoding is found twice. As stated in RFC9112, A sender must not apply the chunked transfer coding more than once to a message body. It means only one chunked coding must be found. In addition, empty values are also rejected becaues it is forbidden by RFC9110. However, in both cases, it may be useful to relax the rules for trusted legacy servers when accept-invalid-http-response option is set. Especially because it was accepted on 2.4 and older. In addition, T-E header is now sanitized before sending it. It is not a problem Because it is a hop-by-hop header Note that it remains invalid on client side because there is no good reason to relax the parsing on this side. We can argue a server is trusted so we can decide to support some legacy behavior. It is not true on client side and it is highly suspicious if a client is sending an invalid T-E header. Note also we continue to reject unsupported T-E values (so all codings except "chunked"). Because the "TE" header is sanitized and cannot contain other value than "Trailers", there is absolutely no reason for a server to use something else. This patch should fix the issue #2677. It could probably be backported as far as 2.6 if necessary.	2024-09-12 09:21:57 +02:00
Christopher Faulet	f6e193f1b0	BUG/MAJOR: mux-h1: Wake SC to perform 0-copy forwarding in CLOSING state When the mux is woken up on I/O events, if the zero-copy forwarding is enabled, receives are blocked. In this case, the SC is woken up to be able to perform 0-copy forwarding to the other side. This works well, except for the H1C in CLOSING state. Indeed, in that case, in h1_process(), the SC is not woken up because only RUNNING H1 connections are considered. As consequence, the mux will ignore connection closure. The H1 connection remains blocked, waiting for the shutdown timeout. If no timeout is configured, the H1 connection is never closed leading to a leak. This patch should fix leak reported by Damien Claisse in the issue #2697. It should be backported as far as 2.8.	2024-09-09 19:01:47 +02:00
Christopher Faulet	001fb1a548	BUG/MEDIUM: mux-h1/mux-h2: Reject upgrades with payload on H2 side only Since 1d2d77b27 ("MEDIUM: mux-h1: Return a 501-not-implemented for upgrade requests with a body"), it is no longer possible to perform a protocol upgrade for requests with a payload. The main reason was to be able to support protocol upgrade for H1 client requesting a H2 server. In that case, the upgrade request is converted to a CONNECT request. So, it is not possible to convey a payload in that case. But, it is a problem for anyone wanting to perform upgrades on H1 server using requests with a payload. It is uncommon but valid. So, now, it is the H2 multiplexer responsibility to reject upgrade requests, on server side, if there is a payload. An INTERNAL_ERROR is returned for the H2S in that case. On H1 side, the upgrade is now allowed, but only if the server waits for the end of the request to return the 101-Switching-protocol response. Indeed, it is quite hard to synchronise the frontend side and the backend side in that case. Asking to servers to fully consume the request payload before returned the response seems reasonable. This patch should fix the issue #2684. It could be backported after a period of observation, as far as 2.4 if possible. But only if it is not too hard. It depends on "MINOR: mux-h1: Set EOI on SE during demux when both side are in DONE state".	2024-09-06 09:16:18 +02:00
Christopher Faulet	ad1ef94612	MINOR: mux-h1: Set EOI on SE during demux when both side are in DONE state For now, this case is already handled for all requests except for those waiting for a tunnel establishment (CONNECT and protocol upgrades). It is not an issue because only bodyless requests are supported in these cases. So the request is always finished at the end of headers and therefore before the response. However, to relax conditions for full H1 protocol upgrades (H1 client and server), this case will be necessary. Indeed, the idea is to be able to perform protocol upgrades for requests with a payload. Today, the "Upgrade:" header is removed before sending the request to the server. But to support this case, this patch is required to properly finish transaction when the server does not perform the upgrade.	2024-09-06 09:00:13 +02:00
Christopher Faulet	0d4271cdae	BUG/MEDIUM: mux-h1: Properly handle empty message when an error is triggered When a 400/408/500/501 error is returned by the H1 multiplexer, we first try to get the error message of the proxy before using the default one. This may be configured to be mapped on /dev/null or on an empty file. In that case, no message is emitted, as expected. But everything is handled as the error was successfully sent. However, there is an bug here. In h1_send_error() function, this case is not properly handled. The flag H1C_F_ABRTED is not set on the H1 connection as it should be and h1_close() function is not called, leaving the H1 connection in an undefined state. It is especially an issue when a "empty" 408-Request-Time-out error is emitted while there are data blocked in the output buffer. In that case, the connection remains openned until the client closes and a "cR--"/408 is logged repeatedly, every time the client timeout is reached. This patch must backported as far as 2.8.	2024-09-03 14:28:42 +02:00
Willy Tarreau	cc12d1b253	MINOR: mux-h1/trace: add a state trace on stream creation/upgrade Logging below the developer level doesn't always yield very convenient traces as we don't know well where streams are allocated nor released. Let's just make that more explicit by using state-level traces. Note that h1s destruction was already logged as closing connection or switching to idle mode.	2024-08-07 16:02:59 +02:00
Willy Tarreau	adfe0a30e1	MINOR: mux-h1: add a trace context filling helper This helper is able to find a connection, a session, a stream, a frontend or a backend from its args.	2024-08-07 16:02:59 +02:00
Christopher Faulet	760d26a862	BUG/MEDIUM: mux-pt/mux-h1: Release the pipe on connection error on sending path When data are sent using the kernel splicing, if a connection error occurred, the pipe must be released. Indeed, in that case, no more data can be sent and there is no reason to not release the pipe. But it is in fact an issue for the stream because the channel will appear are not empty. This may prevent the stream to be released. This happens on 2.8 when a filter is also attached on it. On 2.9 and upper, it seems there is not issue. But it is hard to be sure and the current patch remains valid is all cases. On 2.6 and lower, the code is not the same and, AFAIK, there is no issue. This patch must be backported to 2.8. However, on 2.8, there is no zero-copy data forwarding. The patch must be adapted. There is no done_ff/resume_ff callback functions for muxes. The pipe must released in sc_conn_send() when an error flag is set on the SE, after the call to snd_pipe callback function.	2024-07-30 09:05:25 +02:00
Willy Tarreau	2dab1ba84b	MEDIUM: h1: allow to preserve keep-alive on T-E + C-L In 2.5-dev9, commit 631c7e866 ("MEDIUM: h1: Force close mode for invalid uses of T-E header") enforced a recently arrived new security rule in the HTTP specification aiming at preventing a class of content-smuggling attacks involving HTTP/1.0 agents. It consists in handling the very rare T-E + C-L requests or responses in close mode. It happens it does have an impact of a rare few and very old clients (probably running insecure TLS stacks by the way) that continue to send both with their POST requests. The impact is that for each and every request they'll have to reconnect, possibly negotiating a full TLS handshake that becomes harmful to the machine in terms of CPU computation. This commit adds a new option "h1-do-not-close-on-insecure-transfer-encoding" that does exactly what it says, it just asks not to close on such messages, even though the message continues to be sanitized and C-L dropped. It means that the risk is only between the sender and haproxy, which is limited, and might be the only acceptable solution for such environments having to deal with broken implementations. The cases are so rare that it should not need to be backported, or in the worst case, to the latest LTS if there is any demand.	2024-07-26 15:59:35 +02:00
Christopher Faulet	51ebf644e5	MINOR: stconn: Use a dedicated function to get the opposite sedesc se_opposite() function is added to let an endpoint retrieve the opposite endpoint descriptor. Muxes supportng the zero-copy forwarding can now use it. The se_shutdown() function too. This will be use by the SPOP multiplexer to be able to retrieve the SPOE agent configuration attached to the applet on client side. The related issue is #2502.	2024-07-12 15:27:04 +02:00
Christopher Faulet	4b8098bf48	MINOR: connection: No longer include stconn type header in connection-t.h It is a small change, but it is cleaner to no include stconn-t.h header in connection-t.h, mainly to avoid circular definitions. The related issue is #2502.	2024-07-12 15:27:04 +02:00
Christopher Faulet	0e09cce0fd	BUG/MAJOR: mux-h1: Prevent any UAF on H1 connection after draining a request Since 2.9, it is possible to drain the request payload from the H1 multiplexer in case of early reply. When this happens, the upper stream is detached but the H1 stream is not destroyed. Once the whole request is drained, the end of the detach stage is finished. So the H1 stream is destroyed and the H1 connection is ready to be reused, if possible, otherwise it is released. And here is the issue. If some data of the next request are received with last bytes of the drained one, parsing of the next request is immediately started. The previous H1 stream is destroyed and a new one is created to handle the parsing. At this stage the H1 connection may be released, for instance because of a parsing error. This case was not properly handled. Instead of immediately exiting the mux, it was still possible to access the released H1 connection to refresh its timeouts, leading to a UAF issue. Many thanks to Annika for her invaluable help on this issue. The patch should fix the issue #2602. It must be backported as far as 2.9.	2024-06-12 16:12:47 +02:00
Christopher Faulet	7bff576ebb	BUG/MINOR: mux-h1: Use the right variable to set NEGO_FF_FL_EXACT_SIZE flag Instead of setting this flag on the ones used for the zero-copy negociation, it is set on the connection flags used for xprt->rcv_buf() call. Fortunately, there is no real consequence. The only visible effect is the chunk size that is written on 8 bytes for no reason. This patch is related to issue #2598. It must be backported to 3.0.	2024-06-10 14:06:35 +02:00
Christopher Faulet	e8cc8a60be	BUG/MAJOR: mux-h1: Properly copy chunked input data during zero-copy nego When data are transfered via zero-copy data forwarding, if some data were already received, we try to immediately tranfer it during the negociation step. If data are chunked and the chunk size is unknown, 10 bytes are reserved to write the chunk size during the done step. However, when input data are finally transferred, the offset is ignored. Data are copied into the output buffer. But the first 10 bytes are then crushed by the chunk size. Thus the chunk is truncated leading to a malformed message. This patch should fix the issue #2598. It must be backported to 3.0.	2024-06-10 14:06:35 +02:00
William Manley	52eb6b23f8	BUG/MEDIUM: stconn/mux-h1: Fix suspect change causing timeouts This fixes an issue I've had where if a connection was idle for ~23s it would get in a bad state. I don't understand this code, so I'm not sure exactly why it was failing. I discovered this by bisecting to identify the commit that caused the regression between 2.9 and 3.0. The commit is d2c3f8dde7c2474616c0ea51234e6ba9433a4bc1: "MINOR: stconn/connection: Move shut modes at the SE descriptor level" - a part of v3.0-dev8. It seems to be an innocent renaming, so I looked through it and this stood out as suspect: - if (mode != CO_SHW_NORMAL) + if (mode & SE_SHW_NORMAL) It looks like the not went missing here, so this patch reverses that condition. It fixes my test. I don't quite understand what this is doing or is for so I can't write a regression test or decent commit message. Hopefully someone else will be able to pick this up from where I've left it. [CF: This inverts the condition to perform clean shutdowns. This means no clean shutdown are performed when it should do. This patch must be backported to 3.0]	2024-06-10 14:06:35 +02:00
Christopher Faulet	2fc9e6fa39	MEDIUM: mux-h1: Support C-L/T-E header suppressions when sending messages During the 2.9 dev cycle, to be able to support zero-copy data forwarding, a change on the H1 mux was performed to ignore the headers modifications about payload representation (Content-Length and Transfer-Encoding headers). It appears there are some use-cases where it could be handy to change values of these headers or just remove them. For instance, we can imagine to remove these headers on a server response to force the old HTTP/1.0 close mode behavior. So thaks to this patch, the rules are relaxed. It is now possible to remove these headers. When this happens, the following rules are applied: * If "Content-Length" header is removed but a "Transfer-Encoding: chunked" header is found, no special processing is performed. The message remains chunked. However the close mode is not forced. * If "Transfer-Encoding" header is removed but a "Content-Length" header is found, no special processing is performed. The payload length must comply to the specified content length. * If one of them is removed and the other one is not found, a response is switch the close mode and a "Content-Length: 0" header is forced on a request. With these rules, we fit the best to the user expectations. This patch depends on the following commit: * MINOR: mux-h1: Add a flag to ignore the request payload This patch should fix the issue #2536. It should be backported it to 2.9 with the commit above.	2024-05-17 16:33:53 +02:00
Christopher Faulet	8e55d29109	MINOR: mux-h1: Add a flag to ignore the request payload There was a flag to skip the response payload on output, if any, by stating it is bodyless. It is used for responses to HEAD requests or for 204/304 responses. This allow rewrites during analysis. For instance a HEAD request can be rewrite to a GET request for any reason (ie, a server not supporting HEAD requests). In this case, the server will send a response with a payload. On frontend side, the payload will be skipped and a valid response (without payload) will be sent to the client. With this patch we introduce the corresponding flag for the request. It will be used to skip the request payload. In addition, when payload must be skipped for a request or a response, The zero-copy data forwarding is now disabled.	2024-05-17 16:33:53 +02:00
Willy Tarreau	821a04377d	BUG/MEDIUM: muxes: enforce buf_wait check in takeover() The ->takeover() is quite tricky. It didn't take care of the possibility that the original thread's connection handler had been woken up to handle an event (e.g. read0), failed to get a buffer, registered against its own thread's buffer_wait queue and left the connection in an idle state. A new thread could then come by, perform a takeover(), and when a buffer was available, the new thread's tasklet would be woken up by the old one via _buf_available(), causing all sort of problems. These problems are easy to reproduce, by running with shared backend connections and few buffers (tune.buffers.limit=20, 8 threads, 500 connections, transfer 64kB objects and wait 2-5s for a crash to appear). A first estimated solution consisted in removing the connection from the idle list but it turns out that it would be worse for the delete stuff (the connection no longer appearing as idle, making it impossible to find it in order to close it). Also, idle counts wouldn't match anymore the list's state, and the special case of private connections could be difficult to handle as the connection could be forcefully re-added to the idle list after allocation despite being private. After multiple attempts to address the problem in various ways, it appears that the only reliable solution for now (without starting to turn many lists to mt_lists) is to have the takeover() function handle the buf_wait detection or unregistration itself: - when doing a regular takeover aiming at finding an idle connection for a new request, connections that are blocked in a buffer_wait queue are quite rare and not interesting at all (since not immediately usable), so skipping them is sufficient. For this we detect that the desired connection belongs to a buffer_wait list by checking its buf_wait.list element. Note that this check is not* thread-safe! The LIST_DEL_INIT() is performed by __offer_buffers() after the callback was called. But this is sufficient as it is now because the only way for the element to be seen as not in a list is after the element was last touched by __offer_buffers(), so the situation for this connection will not change in a different way later. - when doing a server delete, we're running under thread isolation. The connection might get taken over to be killed. The only trick is that private connections not belonging to any idle list may also experience this, and in this case even the idle_conns lock will not offer any protection against anything. But since we're run under thread isolation, we're certain not to compete with the other thread, so it's safe to directly unregister the connection from its owner thread. Normally this is already handled by conn_release() in cli_parse_delete_server(), which calls mux->destroy(), but this would actually update the current thread's queue instead of the origin thread's, thus we do need to perform an explicit dequeue before completing the takeover. With this, the problem now looks solved for HTTP/1, HTTP/2 and FCGI, though extensive tests were essentially run on HTTP/1 and HTTP/2. While the problem has been there for a very long time, there should be no reason to backport it since buffer_wait didn't practically work before 3.0-dev and the process used to freeze hard very quickly before we'd even have a chance to meet that race.	2024-05-15 19:37:12 +02:00
Willy Tarreau	47665be083	MEDIUM: mux-h1: allocate without queuing when retrying Now when trying to allocate a buffer, we can check if we've been notified of availability via the callback, in which case we should not consult the queue, or if we're doing a first allocation and check the queue. At this point it still doesn't change much since the stream still doesn't make use of it but some progress is expected.	2024-05-10 17:18:13 +02:00
Willy Tarreau	f552f79ba5	MINOR: mux-h1: report that a buffer allocation succeeded When the buffer allocation callback is notified of a buffer availability, it will now set a MAYALLOC flag in addition to clearing the ALLOC one, for each of the 3 levels where we may fail an allocation. The flag will be cleared upon a successful allocation. This will soon be used to decide to re-allocate without waiting again in the queue. For now it has no effect. There's just a trick, we need to clear the various *_ALLOC flags before testing h1_recv_allowed() otherwise it will return false!	2024-05-10 17:18:13 +02:00
Willy Tarreau	a160b3c50c	MEDIUM: dynbuf/mux-h1: do not allocate the buffers in the callback One of the problematic designs with the buffer_wait mechanism is that the callbacks pre-allocate the buffers and stay in the run queue for a while, resulting in all of the few buffers being assigned to waiting tasks instead of being all available to one task that needs them all at once. Here we simply stop doing this, the callback clears the waiting flags and wakes the task up so that it has a chance of still finding some buffers.	2024-05-10 17:18:13 +02:00
Willy Tarreau	c510e81a3f	MINOR: dynbuf/mux-h1: use different criticalities for buffer allocations While it could certainly still be improved, this first approach consists in assigning buffers like this in the H1 mux: - h1c->obuf : DB_MUX_TX - h1c->ibuf : DB_MUX_RX - h1s->rxbuf: DB_SE_RX That's done via 3 distinct functions for better code clarity, and it also allowed to move the missing buffer flags assignment there. Among possible improvements would be to take into consideration the state of the parser (i.e. no data yet vs data, or headers vs payload) so that even server beginning of response or pure payload can be lowered in priority.	2024-05-10 17:18:13 +02:00
Willy Tarreau	f5566afec6	MEDIUM: dynbuf: generalize the use of b_dequeue() to detach buffer_wait Now thanks to this the bufq_map field is expected to remain accurate.	2024-05-10 17:18:13 +02:00
Willy Tarreau	a214197ce7	MINOR: dynbuf: use the b_queue()/b_requeue() functions everywhere The code places that were used to manipulate the buffer_wq manually now just call b_queue() or b_requeue(). This will simplify the multiple list management later.	2024-05-10 17:18:13 +02:00
Willy Tarreau	72d0dcda8e	MINOR: dynbuf: pass a criticality argument to b_alloc() The goal is to indicate how critical the allocation is, between the least one (growing an existing buffer ring) and the topmost one (boot time allocation for the life of the process). The 3 tcp-based muxes (h1, h2, fcgi) use a common allocation function to try to allocate otherwise subscribe. There's currently no distinction of direction nor part that tries to allocate, and this should be revisited to improve this situation, particularly when we consider that mux-h2 can reduce its Tx allocations if needed. For now, 4 main levels are planned, to translate how the data travels inside haproxy from a producer to a consumer: - MUX_RX: buffer used to receive data from the OS - SE_RX: buffer used to place a transformation of the RX data for a mux, or to produce a response for an applet - CHANNEL: the channel buffer for sync recv - MUX_TX: buffer used to transfer data from the channel to the outside, generally a mux but there can be a few specificities (e.g. http client's response buffer passed to the application, which also gets a transformation of the channel data). The other levels are a bit different in that they don't strictly need to allocate for the first two ones, or they're permanent for the last one (used by compression).	2024-05-10 17:18:13 +02:00
Christopher Faulet	eca9831ec8	MINOR: muxes: Add ctl commands to get info on streams for a connection There are 2 new ctl commands that may be used to retrieve the current number of streams openned for a connection and its limit (the maximum number of streams a mux connection supports). For the PT and H1 muxes, the limit is always 1 and the current number of streams is 0 for idle connections, otherwise 1 is returned. For the H2 and the FCGI muxes, info are already available in the mux connection. For the QUIC mux, the limit is also directly available. It is the maximum initial sub-ID of bidirectional stream allowed for the connection. For the current number of streams, it is the number of SC attached on the connection and the number of not already attached streams present in the "opening_list" list.	2024-05-06 22:00:00 +02:00
Christopher Faulet	96f8b7ad08	MEDIUM: stconn/muxes: Add an abort reason for SE shutdowns on muxes A reason is now passed as parameter to muxes shutdowns to pass additional info about the abort, if any. No info means no abort or only generic one. For now, the reason is composed of 2 32-bits integer. The first on represents the abort code and the other one represents the info about the code (for instance the source). The code should be interpreted according to the associated info. One info is the source, encoding on 5 bits. Other bits are reserverd for now. For now, the muxes are the only supported source. But we can imagine to extend it to applets, streams, health-checks... The current design is quite simple and will most probably evolved.. But the idea is to let the opposite side forward some errors and let's a mux know why its stream was aborted. At first glance, a abort reason must only be evaluated if SE_SHW_SILENT flag is set. The main goal at short term, is to forward some H2 RST_STREAM codes because it is mandatory for gRPC applications, mainly to forward gRPC cancellation from an H2 client to an H2 server. But we can imagine to alter this reason at the applicative level to enrich it. It would also be used to report more accurate errors in logs.	2024-05-06 22:00:00 +02:00
Amaury Denoyelle	65624876f2	MINOR: stats: introduce a more expressive stat definition method Previously, statistics were simply defined as a list of name_desc, as for example "stat_cols_px" for proxy stats. No notion of type was fixed for each stat definition. This correspondance was done individually inside stats_fill_*_line() functions. This renders the process to define new statistics tedious. Implement a more expressive stat definition method via a new API. A new type "struct stat_col" for stat column to replace name_desc usage is defined. It contains a field to store the stat nature and format. A <cap> field is also defined to be able to define a proxy stat only for certain type of objects. This new type is also further extended to include counter offsets. This allows to define a method to automatically generate a stat value field from a "struct stat_col". This will be the subject of a future commit. New type "struct stat_col" is fully compatible full name_desc. This allows to gradually convert stats definition. The focus will be first for proxies counters to implement statistics preservation on reload.	2024-04-26 10:20:57 +02:00
Christopher Faulet	fbc0850d36	MEDIUM: muxes: Use one callback function to shut a mux stream mux-ops .shutr and .shutw callback functions are merged into a unique functions, called .shut. The shutdown mode is still passed as argument, muxes are responsible to test it. Concretly, .shut() function of each mux is now the content of the old .shutw() followed by the content of the old .shutr().	2024-04-19 16:33:40 +02:00
Christopher Faulet	d2c3f8dde7	MINOR: stconn/connection: Move shut modes at the SE descriptor level CO_SHR_* and CO_SHW_* modes are in fact used by the stream-connectors to instruct the muxes how streams must be shut done. It is then the mux responsibility to decide if it must be propagated to the connection layer or not. And in this case, the modes above are only tested to pass a boolean (clean or not). So, it is not consistant to still use connection related modes for information set at an upper layer and never used by the connection layer itself. These modes are thus moved at the sedesc level and merged into a single enum. Idea is to add more modes, not necessarily mutually exclusive, to pass more info to the muxes. For now, it is a one-for-one renaming.	2024-04-19 16:24:46 +02:00
Amaury Denoyelle	5e8eb3661b	MEDIUM: mux: prepare for takeover on private connections When a backend connection is marked as idle, a special flag TASK_F_USR1 is set on MUX tasklet. When MUX tasklet is reactivated, extra checks are executed under this flag to ensure no takeover occurred in the meantime. Previously, only non private connections could be targetted by a takeover. However, this will change when implementing private idle connections closure on "delete server" CLI handler. As such, TASK_F_USR1 is now also set for private connections in MUX detach callbacks.	2024-03-22 17:10:06 +01:00
Amaury Denoyelle	f3862a9bc7	MINOR: connection: extend takeover with release option Extend takeover API both for MUX and XPRT with a new boolean argument <release>. Its purpose is to signal if the connection will be freed immediately after the takeover, rendering new resources allocation unnecessary. For the moment, release argument is always false. However, it will be set to true on delete server CLI handler to proactively close server idle connections.	2024-03-22 16:12:36 +01:00
Christopher Faulet	60fcc27577	MEDIUM: htx/http-ana: No longer close connection on early HAProxy response When a response was returned by HAProxy, a dedicated HTX flag was set. Thanks to this flag, it was possible to add a "connection: close" header to the response if the request was not fully received and to close the connection. In the same way, when a redirect rule was applied, keep-alive was forcefully disabled for unfinished requests. All these mechanisms are now useless because the H1 mux is able to drain the response. So HTX_FL_PROXY_RESP flag is removed and no special processing is performed on HAProxy response when the request is unfinished.	2024-02-28 16:02:33 +01:00
Christopher Faulet	077906da14	MAJOR: mux-h1: Drain requests on client side before shut a stream down unlike for H2 and H3, there is no mechanism in H1 to notify the client it must stop to upload data when a response is replied before the end of the request without closing the connection. There is no RST_STREAM frame equivalent. Thus, there is only two ways to deal with this situation: closing the connection or draining the request. Until now, HAProxy didn't support draining H1 messages. Closing the connection in this case has however a major drawback. It leads to send a TCP reset, dropping this way all in-fly data. There is no warranty the client has fully received the response. Draining H1 messages was never implemented because in old versions it was a bit tricky to implement. However, it is now far simplier to support this feature because it is possible to have a H1 stream without any applicative stream. It is the purpose of this patch. Now, when a shutdown is requested and the stream is detached from the connection, if the request is unfinished while the response was fully sent, the request in drained. To do so, in this case the shutdown and the detach are delayed. From the upper layer point of view, there is no changes. The endpoint is shut down and detached as usual. But on H1 mux point of view, the H1 stream is still alive and is being able to drain data. However the stream-endpoint descriptor is orphan. Once the request is fully received (and drained), the connection is shut down if it cannot be reused for a new transaction and the H1 stream is destroyed.	2024-02-28 16:02:33 +01:00
Christopher Faulet	14db433db9	MINOR: mux-h1: Move all stuff to detach a stream in an internal function All code from h1_detach() function was moved in a internal function, h1s_finish_detach(). It will be used to defer the detach and be able to drain the requests payload.	2024-02-28 15:31:07 +01:00
Christopher Faulet	7ae280d091	MINOR: mux-h1: Move checks performed before a shutdown in a dedicated function Checks performed in h1_shutw() to determine if the connection must be shutdown now or not was move in a dedicated function. This will be used to be able to drain the requests payload.	2024-02-28 15:31:07 +01:00
Christopher Faulet	81f75d32b2	BUG/MINOR: mux-h1: Properly report when mux is blocked during a nego During a zero-copy forwarding negociation, if the H1 mux is blocked for any reason, the IOBUF_FL_FF_BLOCKED flag must be set on its iobuf to notfiy the producer it must wait. However, there were two places where it was not performed: when the output buffer allocation failed and when the chunk formatting failed. This patch fixes the issue. It must be backported to 2.9.	2024-02-28 15:31:07 +01:00
Christopher Faulet	489e583ac5	BUG/MEDIUM: mux-h1: Fix again 0-copy forwarding of chunks with an unknown size There is still an issue with zero-copy forwarding of chunks with an unknown size. It is possible for a producer to fill the sapce reserved for the CRLF at the end of the chunk. The root cause is that this space is not accounted in the iobuf offset. So, from the producer point of view, the space may be used. We can also argue the current design for iobuf is not well suited for this case. Instead of using a pointer on the consumer's buffer, it could be easier to use a custom buffer built on top of the consumer one, via a call to b_make(), with the size, head and data field reflecting the avaialble space the producer can use. By the way, because of this bug, it is possible to trigger a BUG_ON() when we try to write the CRLF at the end of the chunk because the buffer is full. It is unexpected. Only the stats applet may hit this bug. To fix the issue, instead of writting this CRLF when the current chunk is consumed, it is written before consuming the next one. This way, all space reserved to create the chunk formatting is always placed before forwarding data. No backport needed.	2024-02-28 15:31:07 +01:00
Christopher Faulet	1a2a196fcf	BUG/MEDIUM: mux-h1: Don't emit 0-CRLF chunk in h1_done_ff() when iobuf is empty A chunk message transferred via zero-copy forwarding in H1 may be corrupted. This only happens when the chunk size is not known during the nego stage and when there is nothing to forward when h1_donn_ff() is called. In this case, we always emit a chunk. Because there is nothing to forward, a 0-CRLF is emitted in the middle of the message. The issue occurred with the HTTP stats applet only. A simple fix is to check the size of data in the iobuf before emitting a new chunk in h1_done_ff(). However, we still try to send outgoing data because when this happens, it is most of time because the H1 output buffer is almost full. This patch should fix the issue #2453. No backport needed.	2024-02-21 11:49:58 +01:00
Christopher Faulet	081022a0c5	MINOR: muxes/applet: Simplify checks on options to disable zero-copy forwarding Global options to disable for zero-copy forwarding are now tested outside callbacks responsible to perform the forwarding itself. It is cleaner this way because we don't try at all zero-copy forwarding if at least one side does not support it. It is equivalent to what was performed before, but it is simplier this way.	2024-02-14 15:41:04 +01:00
Christopher Faulet	e2921ffad1	MINOR: muxes: Announce support for zero-copy forwarding on consumer side It is unused for now, but the muxes announce their support of the zero-copy forwarding on consumer side. All muxes, except the fgci one, are supported it.	2024-02-14 15:15:10 +01:00
Christopher Faulet	7598c0ba69	MINOR: stconn: Rename SE_FL_MAY_FASTFWD and reorder bitfield To fix a bug, a flag to announce the capabitlity to support the zero-copy forwarding on the consumer side will be added on the SE descriptor. So the old flag SE_FL_MAY_FASTFWD is renamed to indicate it concerns the producer side. It is now SE_FL_MAY_FASTFWD_PROD. And to prepare addition of the new flag, the bitfield is a bit reordered.	2024-02-14 15:00:32 +01:00
Christopher Faulet	3ee3a7937a	BUG/MAJOR: mux-h1: Fix zero-copy forwarding when sending chunks of unknown size Commit 91b77c1632 ("MEDIUM: mux-h1: Support zero-copy forwarding for chunks with an unknown size") was recently pushed but it contains 3 bugs. The first one is during the nego. The extra size reserved for the CRLF at the end of the chunk must not be added to the offset value. Indeed, the CRLF will be appended after the data and not prepended to them. The second one, still during the nego, is an integer overflow when the available room in the output buffer is computed. Finally, the last one is when the chunk itself is formatted. This part was totally buggy if the output buffer was not empty at the beginning. No backport needed.	2024-02-14 14:22:36 +01:00
Christopher Faulet	91b77c1632	MEDIUM: mux-h1: Support zero-copy forwarding for chunks with an unknown size Till now, for chunked messages, the H1 mux used the size requested during the zero-copy forwarding negotiation as the chunk size. And till now, this was accurate because the requested size was indeed the chunk size on the producer side. But this will be a problem to implement the zero-copy forwarding on some applets because the content size is not known during the nego but only when it is produced. Thanks to previous patches, it is now possible to know the requested size is not exact and we are able to reserve a larger space to write the chunk size later, in h1_done_ff(), with some padding.	2024-02-07 15:04:44 +01:00

1 2 3 4 5 ...

717 Commits