haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-12 10:06:58 +02:00

Author	SHA1	Message	Date
Willy Tarreau	e1c8bfd0ed	BUG/MEDIUM: mux-h2: refine connection vs stream error on headers Commit `7021a8c4d8` ("BUG/MINOR: mux-h2: also count streams for refused ones") addressed stream counting issues on some error cases but not completely correctly regarding the conn_err vs stream_err case. Indeed, contrary to the initial analysis, h2c_dec_hdrs() can set H2_CS_ERROR when facing some unrecoverable protocol errors, and it's not correct to send it to strm_err which will only send the RST_STREAM frame and the subsequent GOAWAY frame is in fact the result of the read timeout. The difficulty behind this lies on the sequence of output validations because h2c_dec_hdrs() returns two results at once: - frame processing status (done/incomplete/failed) - connection error status The original ordering requires to write 2 exemplaries of the exact same error handling code disposed differently, which the patch above tried to factor to one. After careful inspection of h2c_dec_hdrs() and its comments, it's clear that it always returns -1 on failure, including connection errors. This means we can rearrange the test to get rid of the missing data first, and immediately enter the no-return zone where both the stream and connection errors can be checked at the same place, making sure to consistently maintain error counters. This is way better because we don't have to update stream counters on the error path anymore. h2spec now passes the test much faster. This will need to be backported to the same branches as the commit above, which was already backported to 2.9.	2024-01-18 17:21:02 +01:00
Willy Tarreau	7021a8c4d8	BUG/MINOR: mux-h2: also count streams for refused ones There are a few places where we can reject an incoming stream based on technical errors such as decoded headers that are too large for the internal buffers, or memory allocation errors. In this case we send an RST_STREAM to abort the request, but the total stream counter was not incremented. That's not really a problem, until one starts to try to enforce a total stream limit using tune.h2.fe.max-total-streams, and which will not count such faulty streams. Typically a client that learns too large cookies and tries to replay them in a way that overflows the maximum buffer size would be rejected and depending on how they're implemented, they might retry forever. This patch removes the stream count increment from h2s_new() and moves it instead to the calling functions, so that it translates the decision to process a new stream instead of a successfully decoded stream. The result is that such a bogus client will now be blocked after reaching the total stream limit. This can be validated this way: global tune.h2.fe.max-total-streams 128 expose-experimental-directives trace h2 sink stdout trace h2 level developer trace h2 verbosity complete trace h2 start now frontend h bind :8080 mode http redirect location / Sending this will fill frames with 15972 bytes of cookie headers that expand to 16500 for storage+index once decoded, causing "message too large" events: (dev/h2/mkhdr.sh -t p;dev/h2/mkhdr.sh -t s; for sid in {0..1000}; do dev/h2/mkhdr.sh -t h -i $((sid*2+1)) -f es,eh \ -R "828684410f7777772e6578616d706c652e636f6d \ $(for i in {1..66}; do echo -n 60 7F 73 433d $(for j in {1..24}; do echo -n 2e313233343536373839; done); done) "; done) \| nc 0 8080 Now it properly stops after sending 128 streams. This may be backported wherever commit `983ac4397` ("MINOR: mux-h2: support limiting the total number of H2 streams per connection") is present, since without it, that commit is less effective.	2024-01-12 18:59:59 +01:00
Willy Tarreau	e19334a343	CLEANUP: mux-h2: remove the printfs from previous commit on h2 streams limit. After thinking about them all the time at the end, I managed to remove them while editing the commit and to forget to push them :-(	2024-01-05 19:19:10 +01:00
Willy Tarreau	983ac4397d	MINOR: mux-h2: support limiting the total number of H2 streams per connection This patch introduces a new setting: tune.h2.fe.max-total-streams. It sets the HTTP/2 maximum number of total streams processed per incoming connection. Once this limit is reached, HAProxy will send a graceful GOAWAY frame informing the client that it will close the connection after all pending streams have been closed. In practice, clients tend to close as fast as possible when receiving this, and to establish a new connection for next requests. Doing this is sometimes useful and desired in situations where clients stay connected for a very long time and cause some imbalance inside a farm. For example, in some highly dynamic environments, it is possible that new load balancers are instantiated on the fly to adapt to a load increase, and that once the load goes down they should be stopped without breaking established connections. By setting a limit here, the connections will have a limited lifetime and will be frequently renewed, with some possibly being established to other nodes, so that existing resources are quickly released. The default value is zero, which enforces no limit beyond those implied by the protocol (2^30 ~= 1.07 billion). Values around 1000 were found to already cause frequent enough connection renewal without causing any perceptible latency to most clients. One notable exception here is h2load which reports errors for all requests that were expected to be sent over a given connection after it receives a GOAWAY. This is an already known limitation: https://github.com/nghttp2/nghttp2/issues/981 The patch was made in two parts inside h2_frt_handle_headers(): - the first one, at the end of the function, which verifies if the configured limit was reached and if it's needed to emit a GOAWAY ; - the second, just before decoding the stream frame, which verifies if a previously configured limit was ignored by the client, and closes the connection if this happens. Indeed, one reason for a connection to stay alive for too long definitely comes from a stupid bot that periodically fetches the same resource, scans lots of URLs or tries to brute-force something. These ones are more likely to just ignore the last stream ID advertised in GOAWAY than a regular browser, or a well-behaving client such as curl which respects it. So in order to make sure we can close the connection we need to enforce the advertised limit. Note that a regular client will not face a problem with that because in the worst case it will have max_concurrent_streams in flight and this limit is taken into account when calculating the advertised last acceptable stream ID. Just a note: it may also be possible to move the first part above to h2s_frt_stream_new() instead so that it's not processed for trailers, though it doesn't seem to be more interesting, first because it has two return points. This is something that may be backported to 2.9 and 2.8 to offer more control to those dealing with dynamic infrastructures, especially since for now we cannot force a connection to be cleanly closed using rules (e.g. github issues #946, #2146).	2024-01-05 18:49:11 +01:00
Christopher Faulet	d9eb6d6680	BUG/MEDIUM: mux-h2: Don't report error on SE for closed H2 streams An error on the H2 connection was always reported as an error to the stream-endpoint descriptor, independently on the H2 stream state. But it is a bug to do so for closed streams. And indeed, it leads to report "SD--" termination state for some streams while the response was fully received and forwarded to the client, at least for the backend side point of view. Now, errors are no longer reported for H2 streams in closed state. This patch is related to the three previous ones: * "BUG/MEDIUM: mux-h2: Don't report error on SE for closed H2 streams" * "BUG/MEDIUM: mux-h2: Don't report error on SE if error is only pending on H2C" * "BUG/MEDIUM: mux-h2: Only Report H2C error on read error if demux buffer is empty" The series should fix a bug reported in issue #2388 (#2388#issuecomment-1855735144). The series should be backported to 2.9 but only after a period of observation. In theory, older versions are also affected but this part is pretty sensitive. So don't backport it further except if someone ask for it.	2023-12-18 21:15:32 +01:00
Christopher Faulet	580ffd6123	BUG/MEDIUM: mux-h2: Don't report error on SE if error is only pending on H2C In h2s_wake_one_stream(), we must not report an error on the stream-endpoint descriptor if the error is not definitive on the H2 connection. A pending error on the H2 connection means there are potentially remaining data to be demux. It is important to not truncate a message for a stream. This patch is part of a series that should fix a bug reported in issue #2388 (#2388#issuecomment-1855735144). Backport instructions will be shipped in the last commit of the series.	2023-12-18 21:15:32 +01:00
Christopher Faulet	19fb19976f	BUG/MEDIUM: mux-h2: Only Report H2C error on read error if demux buffer is empty It is similar to the previous fix ("BUG/MEDIUM: mux-h2: Don't report H2C error on read error if dmux buffer is not empty"), but on receive side. If the demux buffer is not empty, an error on the TCP connection must not be immediately reported as an error on the H2 connection. We must be sure to have tried to demux all data first. Otherwise, messages for one or more streams may be truncated while all data were already received and are waiting to be demux. This patch is part of a series that should fix a bug reported in issue #2388 (#2388#issuecomment-1855735144). Backport instructions will be shipped in the last commit of the series.	2023-12-18 21:15:32 +01:00
Christopher Faulet	5b78cbae77	BUG/MEDIUM: mux-h2: Switch pending error to error if demux buffer is empty When an error on the H2 connection is detected when sending data, only a pending error is reported, waiting for an error or a shutdown on the read side. However if a shutdown was already received, the pending error is switched to a definitive error. At this stage, we must also wait to have flushed the demux buffer. Otherwise, if some data must still be demux, messages for one or more streams may be truncated. There is already the flag H2_CF_END_REACHED to know a shutdown was received and we no longer progress on demux side (buffer empty or data truncated). On sending side, we should use this flag instead to report a definitive error. This patch is part of a series that should fix a bug reported in issue #2388 (#2388#issuecomment-1855735144). Backport instructions will be shipped in the last commit of the series.	2023-12-18 21:15:32 +01:00
Christopher Faulet	682f73b4fa	BUG/MEDIUM: mux-h2: Report too large HEADERS frame only when rxbuf is empty During HEADERS frames decoding, if a frame is too large to fit in a buffer, an internal error is reported and a RST_STREAM is emitted. On the other hand, we wait to have an empty rxbuf to decode the frame because we cannot retry a failed HPACK decompression. When we are decoding headers, it is valid to return an error if dbuf buffer is full because no data can be blocked in the rxbuf (which hosts the HTX message). However, during the trailers decoding, it is possible to have some data not sent yet for the current stream in the rxbug and data for another stream fully filling the dbuf buffer. In this case, we don't decode the trailers but we must not return an error. We must wait to empty the rxbuf first. Now, a HEADERS frame is considered as too large if the dbuf buffer is full and if the rxbuf is empty (the HTX message to be accurate). This patch should fix the issue #2382. It must be backported to all stable versions.	2023-12-13 16:45:29 +01:00
Christopher Faulet	6da0429e75	MINOR: mux-h2: Add global option to enable/disable zero-copy forwarding tune.h2.zero-copy-fwd-send can now be used to enable or disable the zero-copy fast-forwarding for the H2 mux only, for sends. For now, there is no option to disable it for receives because it is not supported yet. It is enabled ('on') by default.	2023-12-04 15:33:34 +01:00
Christopher Faulet	fd8ce788a5	MINOR: muxes: Implement ->sctl() callback for muxes and return the stream id All muxes now implements the ->sctl() callback function and are able to return the stream ID. For the PT multiplexer, it is always 0. For the H1 multiplexer it is the request count for the current H1 connection (added for this purpose). The FCGI, H2 and QUIC muxes, the stream ID is returned. The stream ID is returned as a signed 64 bits integer.	2023-11-29 11:11:12 +01:00
Christopher Faulet	d982a37e4c	MINOR: muxes: Rename mux_ctl_type values to use MUX_CTL_ prefix Instead of the generic MUX_, we now use MUX_CTL_ prefix for all mux_ctl_type value. This will avoid any ambiguities with other enums, especially with a new one that will be added to get information on mux streams.	2023-11-29 11:11:12 +01:00
Christopher Faulet	af733ef6e4	BUG/MEDIUM: mux-h2: Remove H2_SF_NOTIFIED flag for H2S blocked on fast-forward When a H2 stream is blocked during data fast-forwarding, we must take care to remove H2_SF_NOTIFIED flag. This was only performed when data fast-forward was attempted. However, if the H2 stream was blocked for any reason, this flag was not removed. During our tests, we found it was possible to infinitely block a connection because one of its streams was in the send_list with the flag set. In this case, the stream was no longer woken up to resume the sends, blocking all other streams. No backport needed.	2023-11-28 14:01:56 +01:00
Willy Tarreau	d656ac7e13	OPTIM: mux-h2/zero-copy: don't allocate more buffers per connections than streams It's the exact same as commit `0a7ab7067` ("OPTIM: mux-h2: don't allocate more buffers per connections than streams"), but for the zero-copy case this time. Previously it was only done on the regular snd_buf() path, but this one is needed as well. A transfer on 16 parallel streams now consumes half of the memory, and a single stream consumes much less. An alternate approach would be worth investigating in the future, based on the same principle as the CF_STREAMER_FAST at the higher level: in short, by monitoring how many mux buffers we write at once before refilling them, we would get an idea of how much is worth keeping in buffers max, given that anything beyond would just waste memory. Some tests show that a single buffer already seems almost as good, except for single-stream transfers, which is why it's worth spending more time on this.	2023-11-28 09:15:26 +01:00
Ilya Shipitsin	80813cdd2a	CLEANUP: assorted typo fixes in the code and comments This is 37th iteration of typo fixes	2023-11-23 16:23:14 +01:00
Willy Tarreau	4f02e3da67	BUG/MEDIUM: mux-h2: fail earlier on malloc in takeover() Connection takeover was implemented for H2 in 2.2 by commit `cd4159f03` ("MEDIUM: mux_h2: Implement the takeover() method."). It does have one corner case related to memory allocation failure: in case the task or tasklet allocation fails, the connection gets released synchronously. Unfortunately the situation is bad there, because the lower layers are already switched to the new thread while the tasklet is either NULL or still the old one, and calling h2_release() will also result in h2_process() and h2_process_demux() that may process any possibly pending frames. Even the session remains the old one on the old thread, so that some sess_log() that are called when facing certain demux errors will be associated with the previous thread, possibly accessing a number of elements belonging to another thread. There are even code paths where the thread will try to grab the lock of its own idle conns list, believing the connection is there while it has no useful effect. However, if the owner thread was doing the same at the same moment, and ended up trying to pick from the current thread (which could happen if picking a connection for a different name), the two could even deadlock. The risk is extremely low, but Fred managed to reproduce use-after-free errors in conn_backend_get() after a takeover() failed by playing with -dMfail, indicating that h2_release() had been successfully called. In practise it's sufficient to have h2 on the server side with reuse-always and to inject lots of request on it with -dMfail. This patch takes a simple but radically different approach. Instead of starting to migrate the connection before risking to face allocation failures, it first pre-allocates a new task and tasklet, then assigns them to the connection if the migration succeeds, otherwise it just frees them. This way it's no longer needed to manipulate the connection until it's fully migrated, and as a bonus this means the connection will continue to exist and the use-after-free condition is solved at the same time. This should be backported to 2.2. Thanks to Fred for the initial analysis of the problem!	2023-11-17 18:10:16 +01:00
Amaury Denoyelle	a1457296d5	BUG/MINOR: mux_h2: reject passive reverse conn if error on add to idle On passive reverse, H2 mux is responsible to insert the connection in the server idle list. This is done via srv_add_to_idle_list(). However, this function may fail for various reason, such as FD usage limit reached. Handle properly this error case. H2 mux flags the connection on error which will cause its release. Prior to this patch, the connection was only released on server timeout. This bug was found inspecting server curr_used_conns counter. Indeed, on connection reverse, this counter is first incremented. It is decremented just after on srv_add_to_idle_list() if insertion is validated. However, if insertion is rejected, the connection was not released which cause curr_used_conns to remains positive. This has the major downside to break the reusing of idle connection on rhttp causing spurrious 503 errors. No need to backport.	2023-11-16 18:43:32 +01:00
Willy Tarreau	0a7ab7067f	OPTIM: mux-h2: don't allocate more buffers per connections than streams When an H2 mux works with a slow downstream connection and without the mux-mux mode, it is possible that a single stream will allocate all 32 buffers in the connection. This is not desirable at all because 1) it brings no value, and 2) it allocates a lot of memory per connection, which, in addition to using a lot of memory, tends to degrade performance due to cache thrashing. This patch improves the situation by refraining from sending data frames over a connection when more mbufs than streams are allocated. On a test featuring 10k connections each with a single stream reading from the cache, this patch reduces the RAM usage from ~180k buffers to ~20k bufs, and improves the bandwidth. This may even be backported later to recent versions to improve memory usage. Note however that it is efficient only when combined with `e16762f8a` ("OPTIM: mux-h2: call h2_send() directly from h2_snd_buf()"), and tends to slightly reduce the single-stream performance without it, so in case of a backport, the two need to be considered together.	2023-11-09 17:24:00 +01:00
Christopher Faulet	84d26bcf3f	MINOR: stconn/mux-h2: Use a iobuf flag to report EOI to consumer side during FF IOBUF_FL_EOI iobuf flag is now set by the producer to notify the consumer that the end of input was reached. Thanks to this flag, we can remove the ugly ack in h2_done_ff() to test the opposite SE flags. Of course, for now, it works and it is good enough. But we must keep in mind that EOI is always forwarded from the producer side to the consumer side in this case. But if this change, a new CO_RFL_ flag will have to be added to instruct the producer if it can forward EOI or not.	2023-11-08 21:14:07 +01:00
Christopher Faulet	4be0c7c655	MEDIUM: stconn/muxes: Loop on data fast-forwarding to forward at least a buffer In the mux-to-mux data forwarding, we now try, as far as possible to send at least a buffer. Of course, if the consumer side is congested or if nothing more can be received, we leave. But the idea is to retry to fast-forward data if less than a buffer was forwarded. It is only performed for buffer fast-forwarding, not splicing. The idea behind this patch is to optimise the forwarding, when a first forward was performed to complete a buffer with some existing data. In this case, the amount of data forwarded is artificially limited because we are using a non-empty buffer. But without this limitation, it is highly probable that a full buffer could have been sent. And indeed, with H2 client, a significant improvement was observed during our test. To do so, .done_fastfwd() callback function must be able to deal with interim forwards. Especially for the H2 mux, to remove H2_SF_NOTIFIED flags on the H2S on the last call only. Otherwise, the H2 stream can be blocked by itself because it is in the send_list. IOBUF_FL_INTERIM_FF iobuf flag is used to notify the consumer it is not the last call. This flag is then removed on the last call.	2023-11-08 21:14:07 +01:00
Christopher Faulet	141b489291	BUG/MEDIUM: stconn: Report send activity during mux-to-mux fast-forward When data are directly forwarded from a mux to the opposite one, we must not forget to report send activity when data are successfully sent or report a blocked send with data are blocked. It is important because otherwise, if the transfer is quite long, longer than the client or server timeout, an error may be triggered because the write timeout is reached. H1, H2 and PT muxes are concerned. To fix the issue, The done_fastword() callback now returns the amount of data consummed. This way it is possible to update/reset the FSB data accordingly. No backport needed.	2023-11-07 10:30:01 +01:00
Willy Tarreau	e16762f8a8	OPTIM: mux-h2: call h2_send() directly from h2_snd_buf() This allows to eliminate full buffers very quickly and to recycle them much faster, resulting in higher transfer rates and lower memory usage at the same time. We just wake the tasklet up if it succeeded so that h2_process() and friends are called to finalize what needs to. For regular buffer sizes, the performance level becomes quite close to the one obtained with the zero-copy mechanism (zero-copy remains much faster with non-default buffer sizes). The memory savings are huge with default buffer size: at 64c * 100 streams on a single thread, we used to forward 4.4 Gbps of traffic using 10400 buffers. After the change, the performance reaches 5.9 Gbps with only 22-24 buffers, since they are quickly recycled. That's asaving of 160 MB of RAM. A concern was an increase in the number of syscalls but this is not the case, the numbers remained exactly the same before and after. Some experimentations were made to try to cork data and not send incomplete buffers, and that always voided these changes. One explanation might be that keeping a first buffer with only headers frames is sufficient to prevent a zero-copy of the data coming in a next snd_buf() call. This still needs to be studied anyway.	2023-11-04 08:34:23 +01:00
Willy Tarreau	0fa5adee3b	MINOR: mux-h2: always use h2_send() in h2_done_ff(), not h2_process() By calling h2_process(), the code would theoretically make it possible for a synchronous ->wake() call to provoke an indirect call to h2_snd_buf() while we're in h2_done_ff(), which could be quite bad. The current conditions do not permit it right now but this could easily break by accident. Better use h2_send() and wake the task up if needed. Precise performance tests showed no change.	2023-11-04 08:12:17 +01:00
Amaury Denoyelle	f76e94d231	MINOR: backend: refactor insertion in avail conns tree Define a new function srv_add_to_avail_list(). This function is used to centralize connection insertion in available tree. It reuses a BUG_ON() statement to ensure the connection is not present in the idle list.	2023-10-25 10:33:06 +02:00
Willy Tarreau	380f115a4a	BUG/MINOR: mux-h2: update tracked counters with req cnt/req err Originally H2 would transfer everything to H1 and parsing errors were handled there, so that if there was a track-sc rule in effect, the counters would be updated as well. As we started to add more and more HTTP-compliance checks at the H2 layer, then switched to HTX, we progressively lost this ability. It's a bit annoying because it means we will not maintain accurate error counters for a given source, for example. This patch adds the calls to session_inc_http_req_ctr() and session_inc_http_err_ctr() when needed (i.e. when failing to parse an HTTP request since all other cases are handled by the stream), just like mux-h1 does. The same should be done for mux-h3 by the way. This can be backported to recent stable versions. It's not exactly a bug, rather a missing feature in that we had never updated this counter for H2 till now, but it does make sense to do it especially based on what the doc says about its usage.	2023-10-20 21:09:12 +02:00
Willy Tarreau	250b630fb9	BUG/MINOR: mux-h2: commit the current stream ID even on reject The H2 spec says that a HEADERS frame turns an idle stream to the open state, and it may then turn to half-closed(remote) on ES, then to close, all at once, if we respond with RST (e.g. on error). Due to the fact that we process a complete frame at once since h2_dec_hdrs() may reassemble CONTINUATION frames until everything is complete, the state was only committed after the frame was completley valid (otherwise multiple passes could result in subsequent frames being rejected as the stream ID would be equal to the highest one). However this is not correct because it means that a client may retry on the same ID as a previously failed one, which technically is forbidden (for example the client couldn't know which of them a WINDOW_UPDATE or RST_STREAM frame is for). In practice, due to the error paths, this would only be possible when failing to decode HPACK while leaving the HPACK stream intact, thus when the valid decoded HPACK stream cannot be turned into a valid HTTP representation, e.g. when the resulting headers are too large for example. The solution to avoid this consists in committing the stream ID on this error path as well. h2spec continues to be happy. Thanks to Annika Wickert and Tim Windelschmidt for reporting this issue. This fix must be backported to all stable versions.	2023-10-20 21:09:12 +02:00
Willy Tarreau	08f3bb5bd5	MINOR: mux-h2/traces: clarify the "rejected H2 request" event In h2_frt_handle_headers() all failures lead to a generic message saying "rejected H2 request". It's quite inexpressive while there are a few distinct tests that are made before jumping there: - trailers on closed stream - unparsable request - refused stream Let's emit the traces from these call points instead so that we get more info about what happened. Since these are user-level messages, we take care of keeping them aligned as much as possible. For example before it would say: [04\|h2\|1\|mux_h2.c:2859] rejected H2 request : h2c=0x7f5d58036fd0(F,FRE) [04\|h2\|5\|mux_h2.c:2860] h2c_frt_handle_headers(): leaving on error : h2c=0x7f5d58036fd0(F,FRE) dsi=1 h2s=0x9fdb60(0,CLO) And now it says: [04\|h2\|1\|mux_h2.c:2817] rcvd unparsable H2 request : h2c=0x7f55f8037160(F,FRH) dsi=1 h2s=CLO [04\|h2\|5\|mux_h2.c:2875] h2c_frt_handle_headers(): leaving on error : h2c=0x7f55f8037160(F,FRE) dsi=1 h2s=CLO	2023-10-20 21:09:12 +02:00
Willy Tarreau	1deac6f99a	MINOR: mux-h2/traces: explicitly show the error/refused stream states Sometimes it's unclear whether a stream is still open or closed when certain traces are emitted, for example when the stream was refused, because the reported pointer and ID in fact correspond to the refused stream. And for closed streams, no pointer/name is printed, leaving some confusion about the state. This patch makes the situation easier to analyse by explicitly reporting "h2s=CLO" on closed/error/refused streams so that we don't waste time comparing pointers and we instantly know the stream is closed. Now instead of emitting: [03\|h2\|5\|mux_h2.c:2874] h2c_frt_handle_headers(): leaving on error : h2c=0x7fdfa8026820(F,FRE) dsi=201 h2s=0x9fdb60(0,CLO) It will emit: [03\|h2\|5\|mux_h2.c:2874] h2c_frt_handle_headers(): leaving on error : h2c=0x7fdfa8026820(F,FRE) dsi=201 h2s=CLO	2023-10-20 21:09:12 +02:00
Willy Tarreau	3dd963b35f	BUG/MINOR: mux-h2: fix http-request and http-keep-alive timeouts again Stefan Behte reported that since commit `f279a2f14` ("BUG/MINOR: mux-h2: refresh the idle_timer when the mux is empty"), the http-request and http-keep-alive timeouts don't work anymore on H2. Before this patch, and since 3e448b9b64 ("BUG/MEDIUM: mux-h2: make sure control frames do not refresh the idle timeout"), they would only be refreshed after stream frames were sent (HEADERS or DATA) but the patch above that adds more refresh points broke these so they don't expire anymore as long as there's some activity. We cannot just revert the fix since it also addressed an isse by which sometimes the timeout would trigger too early and provoque truncated responses. The right approach here is in fact to only use refresh the idle timer when the mux buffer was flushed from any such stream frames. In order to achieve this, we're now setting a flag on the connection whenever we write a stream frame, and we consider that flag when deciding to refresh the buffer after it's emptied. This way we'll only clear that flag once the buffer is empty and there were stream data in it, not if there were no such stream data. In theory it remains possible to leave the flag on if some control data is appended after the buffer and it's never cleared, but in practice it's not a problem as a buffer will always get sent in large blocks when the window opens. Even a large buffer should be emptied once in a while as control frames will not fill it as much as data frames could. Given the patch above was backported as far as 2.6, this patch should also be backported as far as 2.6.	2023-10-18 17:17:58 +02:00
Christopher Faulet	203211f4cb	REORG: stconn/muxes: Rename init step in fast-forwarding Instead of speaking of an initialisation stage for each data fast-forwarding, we now use the negociate term. Thus init_ff/init_fastfwd functions were renamed nego_ff/nego_fastfwd.	2023-10-18 12:46:55 +02:00
Christopher Faulet	11c05c516a	MEDIUM: mux-h2: Add consumer-side fast-forwarding support The H2 multiplexer now implements callbacks to consume fast-forwarded data. It is the most usful case: A H2 client getting data from a H1 server. It is also the easiest case to implement. The producer side is trickier because of multiplexing. It is not obvious this case would be improved with data fast-forwarding.	2023-10-17 18:51:13 +02:00
Christopher Faulet	1fdfa4f9ba	BUG/MEDIUM: mux-h2: Don't report an error on shutr if a shutw is pending If a shutw is blocked because the mux is full or busy, we must defer the shutr. In this case, the H2 stream is not in H2_SS_CLOSED state because the shutw is also deferred. If the shutr is performed, this will lead to a error. Concretly, when the mux is unblocked, a RST_STREAM is sent while in some cases, an empty DATA frame with ES flag set could be sent. This patch should be backported to all stable versions.	2023-10-17 18:51:13 +02:00
Willy Tarreau	68d02e5fa9	BUG/MINOR: mux-h2: make up other blocked streams upon removal from list An interesting issue was met when testing the mux-to-mux forwarding code. In order to preserve fairness, in h2_snd_buf() if other streams are waiting in send_list or fctl_list, the stream that is attempting to send also goes to its list, and will be woken up by h2_process_mux() or h2_send() when some space is released. But on rare occasions, there are only a few (or even a single) streams waiting in this list, and these streams are just quickly removed because of a timeout or a quick h2_detach() that calls h2s_destroy(). In this case there's no even to wake up the other waiting stream in its list, and this will possibly resume processing after some client WINDOW_UPDATE frames or even new streams, so usually it doesn't last too long and it not much noticeable, reason why it was left that long. In addition, measures have shown that in heavy network-bound benchmark, this exact situation happens on less than 1% of the streams (reached 4% with mux-mux). The fix here consists in replacing these LIST_DEL_INIT() calls on h2s->list with a function call that checks if other streams were queued to the send_list recently, and if so, which also tries to resume them by calling h2_resume_each_sending_h2s(). The detection of late additions is made via a new flag on the connection, H2_CF_WAIT_INLIST, which is set when a stream is queued due to other streams being present, and which is cleared when this is function is called. It is particularly difficult to reproduce this case which is particularly timing-dependent, but in a constrained environment, a test involving 32 conns of 20 streams each, all downloading a 10 MB object previously showed a limitation of 17 Gbps with lots of idle CPU time, and now filled the cable at 25 Gbps. This should be backported to all versions where it applies.	2023-10-17 16:43:44 +02:00
Willy Tarreau	5798b5bb14	BUG/MAJOR: connection: make sure to always remove a connection from the tree Since commit `5afcb686b` ("MAJOR: connection: purge idle conn by last usage") in 2.9-dev4, the test on conn->toremove_list added to conn_get_idle_flag() in 2.8 by commit `3a7b539b1` ("BUG/MEDIUM: connection: Preserve flags when a conn is removed from an idle list") becomes misleading. Indeed, now both toremove_list and idle_list are shared by a union since the presence in these lists is mutually exclusive. However, in conn_get_idle_flag() we check for the presence in the toremove_list to decide whether or not to delete the connection from the tree. This test now fails because instead it sees the presence in the idle or safe list via the union, and concludes the element must not be removed. Thus the element remains in the tree and can be found later after the connection is released, causing crashes that Tristan reported in issue #2292. The following config is sufficient to reproduce it with 2 threads: defaults mode http timeout client 5s timeout server 5s timeout connect 1s listen front bind :8001 server next 127.0.0.1:8002 frontend next bind :8002 timeout http-keep-alive 1 http-request redirect location / Sending traffic with a few concurrent connections and some short timeouts suffices to instantly crash it after ~10k reqs: $ h2load -t 4 -c 16 -n 10000 -m 1 -w 1 http://0:8001/ With Amaury we analyzed the conditions in which the function is called in order to figure a better condition for the test and concluded that ->toremove_list is never filled there so we can safely remove that part from the test and just move the flag retrieval back to what it was prior to the 2.8 patch above. Note that the patch is not reverted though, as the parts that would drop the unexpected flags removal are unchanged. This patch must NOT be backported. The code in 2.8 works correctly, it's only the change in 2.9 that makes it misbehave.	2023-10-12 14:20:03 +02:00
Amaury Denoyelle	337c71423f	MINOR: connection: define mux flag for reverse support Add a new MUX flag MX_FL_REVERSABLE. This value is used to indicate that MUX instance supports connection reversal. For the moment, only HTTP/2 multiplexer is flagged with it. This allows to dynamically check if reversal can be completed during MUX installation. This will allow to relax requirement on config writing for 'tcp-request session attach-srv' which currently cannot be used mixed with non-http/2 listener instances, even if used conditionnally with an ACL.	2023-09-29 18:09:08 +02:00
Christopher Faulet	89e20033c7	BUG/MAJOR: mux-h2: Report a protocol error for any DATA frame before headers If any DATA frame is received before all headers are fully received, a protocol error must be reported. It is required by the HTTP/2 RFC but it is also important because the HTTP analyzers expect the first HTX block is a start-line. It leads to a crash if this statement is not respected. For instance, it is possible to trigger a crash by sending an interim message with a DATA frame (It may be an empty DATA frame with the ES flag). AFAIK, only the server side is affected by this bug. To fix the issue, an protocol error is reported for the stream. This patch should fix the issue #2291. It must be backported as far as 2.2 (and probably to 2.0 too).	2023-09-14 11:39:39 +02:00
Willy Tarreau	a7b9baa2cc	BUG/MEDIUM: mux-h2: fix crash when checking for reverse connection after error If the connection is closed in h2_release(), which is indicated by ret<0, we must not dereference conn anymore. This was introduced in 2.9-dev4 by commit `5053e8914` ("MEDIUM: h2: prevent stream opening before connection reverse completed") and detected after a few hours of runtime thanks to running with pool integrity checks and caller enabled. No backport is needed.	2023-08-26 17:05:19 +02:00
Amaury Denoyelle	61fc9568fb	MINOR: server: move idle tree insert in a dedicated function Define a new function _srv_add_idle(). This is a simple wrapper to insert a connection in the server idle tree. This is reserved for simple usage and require to idle_conns lock. In most cases, srv_add_to_idle_list() should be used. This patch does not have any functional change. However, it will help with the next patch as idle connection will be always inserted in a list as secondary storage along with idle/safe trees.	2023-08-25 15:57:48 +02:00
Amaury Denoyelle	77ac8eb4a6	MINOR: connection: simplify removal of idle conns from their trees Small change of API for conn_delete_from_tree(). Now the connection instance is taken as argument instead of its inner node. No functional change introduced with this commit. This simplifies slightly invocation of conn_delete_from_tree(). The most useful changes is that this function will be extended in the next patch to be able to remove the connection from its new idle list at the same time as in its idle tree.	2023-08-25 15:57:48 +02:00
Amaury Denoyelle	6bd994d5d7	BUG/MINOR: h2: fix reverse if no timeout defined h2c.task is not allocated in h2_init() if timeout client/server is not defined depending on the connection side. This caused crash on connection reverse due to systematic requeuing of h2c.task in h2_conn_reverse(). To fix this, check h2c.task in h2_conn_reverse(). If old timeout was undefined but new one is, h2c.task must be allocated as it was not in h2_init(). On the opposite situation, if old timeout was defined and new one is not, h2c.task is freed. In this case, or if neither timeout are defined, skip the task requeuing. This bug is easily reproduced by using reverse bind or server with undefined timeout client/server depending on the connection reverse direction. This bug has been introduced by reverse connect support. No need to backport it.	2023-08-24 17:58:14 +02:00
Amaury Denoyelle	5053e89142	MEDIUM: h2: prevent stream opening before connection reverse completed HTTP/2 demux must be handled with care for active reverse connection. Until accept has been completed, it should be forbidden to handle HEADERS frame as session is not yet ready to handle streams. To implement this, use the flag H2_CF_DEM_TOOMANY which blocks demux process. This flag is automatically set just after conn_reverse() invocation. The flag is removed on rev_accept_conn() callback via a new H2 ctl enum. H2 tasklet is woken up to restart demux process. As a side-effect, reporting in H2 mux may be blocked as demux functions are used to convert error status at the connection level with CO_FL_ERROR. To ensure error is reported for a reverse connection, check h2c_is_dead() specifically for this case in h2_wake(). This change also has its own side-effect : h2c_is_dead() conditions have been adjusted to always exclude !h2c->conn->owner condition which is always true for reverse connection or else H2 mux may kill them unexpectedly.	2023-08-24 17:03:08 +02:00
Amaury Denoyelle	6820b9b393	MEDIUM: h2: implement active connection reversal Implement active reverse on h2_conn_reverse(). Only minimal steps are done here : HTTP version session counters are incremented on the listener instance. Also, the connection is inserted in the mux_stopping_list to ensure it will be actively closed on process shutdown/listener suspend.	2023-08-24 17:03:08 +02:00
Amaury Denoyelle	b781a1bb09	MINOR: connection: prepare init code paths for active reverse When an active reverse connection is initialized, it has no stream-conn attached to it contrary to other backend connections. This forces to add extra check on stream existence in conn_create_mux() and h2_init(). There is also extra checks required for session_accept_fd() after reverse and accept is done. This is because contrary to other frontend connections, reversed connections have already initialized their mux and transport layers. This forces us to skip the majority of session_accept_fd() initialization part. Finally, if session_accept_fd() is interrupted due to an early error, a reverse connection cannot be freed directly or else mux will remain alone. Instead, the mux destroy callback is used to free all connection elements properly.	2023-08-24 17:02:37 +02:00
Amaury Denoyelle	4fb538d4b6	MEDIUM: h2: reverse connection after SETTINGS reception Reverse connection after SETTINGS reception if it was set as reversable. This operation is done in a new function h2_conn_reverse(). It regroups common changes which are needed for both reversal direction : H2_CF_IS_BACK is set or unset and timeouts are inverted. For the moment, only passive reverse is fully implemented. Once done, the connection instance is directly inserted in its targetted server pool. It can then be used immediately for future transfers using this server.	2023-08-24 14:49:03 +02:00
Willy Tarreau	d93a00861d	MINOR: h2: pass accept-invalid-http-request down the request parser We're adding a new argument "relaxed" to h2_make_htx_request() so that we can control its level of acceptance of certain invalid requests at the proxy level with "option accept-invalid-http-request". The goal will be to add deactivable checks that are still desirable to have by default. For now no test is subject to it.	2023-08-08 19:10:54 +02:00
Willy Tarreau	db97bb42d9	MINOR: mux-h2/traces: also suggest invalid header upon parsing error Historically the parsing error used to apply only to too large headers, so this is what has been reported in traces. But nowadays we can also reject invalid characters, and when this happens the trace is a bit misleading, so let's mention "or invalid".	2023-08-08 19:02:24 +02:00
Christopher Faulet	ef2b15998c	BUG/MINOR: htx/mux-h1: Properly handle bodyless responses when splicing is used There is a mechanisme in the H1 and H2 multiplexer to skip the payload when a response is returned to the client when it must not contain any payload (response to a HEAD request or a 204/304 response). However, this does not work when the splicing is used. The H2 multiplexer does not support the splicing, so there is no issue. But with the mux-h1, when data are sent using the kernel splicing, the mux on the server side is not aware the client side should skip the payload. And once the data are put in a pipe, there is no way to stop the sending. It is a defect of the current design. This will be easier to deal with this case when the mux-to-mux forwarding will be implemented. But for now, to fix the issue, we should add an HTX flag on the start-line to pass the info from the client side to the server side and be able to disable the splicing in necessary. The associated reg-test was improved to be sure it does not fail when the splicing is configured. This patch should be backported as far as 2.4..	2023-08-02 12:05:05 +02:00
Willy Tarreau	f279a2f148	BUG/MINOR: mux-h2: refresh the idle_timer when the mux is empty There's a rare case where on long fat pipes, we can see the keep-alive timeout trigger before the end of the transfer of the last large object, and the connection closed a bit quickly after the end of the transfer because a GOAWAY is queued. The data are not destroyed, except that the WINDOW_UPDATES from the client arriving late while the last data are being drained by the socket buffers may at some point trigger a reset, and some clients might choke a bit too early on these. Let's make sure we only arm the idle_start timestamp once the output buffer is empty. Of course it will still not cover for the data pending in the socket buffers but it will at least let those in the buffer leave in peace. More elaborate options can be used to protect the data in the kernel buffers, such as the one described in GH issue #5. It's very likely that this old issue was emphasized by the following commit in 2.6: `15a4733d5` ("BUG/MEDIUM: mux-h2: make use of http-request and keep-alive timeouts") and the behavior probably changed again with this one in 2.8, which was backported to 2.7 and scheduled for 2.6: `d38d8c6cc` ("BUG/MEDIUM: mux-h2: make sure control frames do not refresh the idle timeout") As such this patch should be backported to 2.6 after some observation period.	2023-05-31 10:45:30 +02:00
Christopher Faulet	c2f1d0ee5e	BUG/MEDIUM: mux-h2: Propagate termination flags when frontend SC is created We must evaluate if EOS/EOI/ERR_PENDING/ERROR flags must be set on the SE when the frontend SC is created because the rxbuf is transferred to the steeam at this stage. It means the call to h2_rcv_buf() may be skipped on some circumstances. And indeed, it happens when HAproxy quickly replies, for instance because of a deny rule. In this case, depending on the scheduling, the abort may block the receive attempt from the SC. In this case if SE flags were not properly set earlier, there is no way to terminate the request and the session may be freezed. For now, I can't explain why there is no timeout when this happens but it remains an issue because here we should not rely on timeouts to close the stream. This patch relies on following commits: * MINOR: mux-h2: Add a function to propagate termination flags from h2s to SE * MINOR: mux-h2: Set H2_SF_ES_RCVD flag when decoding the HEADERS frame The issue was encountered on the 2.8 but it seems the bug exists since the 2.4. But it is probably a good idea to only backport the series to 2.7 only and wait for a bug report on earlier versions. This patch should solve the issue #2147.	2023-05-24 16:06:11 +02:00
Christopher Faulet	531dd050ff	MINOR: mux-h2: Add a function to propagate termination flags from h2s to SE The function h2s_propagate_term_flags() was added to check the H2S state and evaluate when EOI/EOS/ERR_PENDING/ERROR flags must be set on the SE. It is not the only place where those flags are set. But it centralizes the synchro between the H2 stream and the SC. For now, this function is only used at the end of h2_rcv_buf(). But it will be used to fix a bug.	2023-05-24 16:06:11 +02:00
Christopher Faulet	1a60a66306	MINOR: mux-h2: Set H2_SF_ES_RCVD flag when decoding the HEADERS frame The flag H2_SF_ES_RCVD is set on the H2 stream when the ES flag is found in a frame. On HEADERS frame, it was set in function processing the frame. It is moved in the function decoding the frame. Fundamentally, this changes nothing. But it will be useful to have this information earlier when a client H2 stream is created.	2023-05-24 16:06:11 +02:00
Christopher Faulet	78b1eb2b04	BUG/MINOR: mux-h2: Check H2_SF_BODY_TUNNEL on H2S flags and not demux frame ones In h2c_frt_stream_new(), H2_SF_BODY_TUNNEL flags was tested on demux frame flags (h2c->dff) instead of the h2s flags. By chance, it is a noop test becasue H2_SF_BODY_TUNNEL value, once converted to an int8_t, is 0. It is a 2.8-specific issue. No backport needed.	2023-05-24 16:06:11 +02:00
Willy Tarreau	d38d8c6ccb	BUG/MEDIUM: mux-h2: make sure control frames do not refresh the idle timeout Christopher found as part of the analysis of Tim's issue #1891 that commit `15a4733d5` ("BUG/MEDIUM: mux-h2: make use of http-request and keep-alive timeouts") introduced in 2.6 incompletely addressed a timeout issue in the H2 mux. The problem was that the http-keepalive and http-request timeouts were not applied before it. With that commit they are now considered, but if a GOAWAY is sent (or even attempted to be sent), then they are not used anymore again, because the way the code is arranged consists in applying the client-fin timeout (if set) to the current date, and falling back to the client timeout, without considering the idle_start period. This means that a config having a "timeout http-keepalive" would still not close the connection quickly when facing a client that periodically sends PING, PRIORITY or whatever other frame types. In addition, after the GOAWAY was attempted to be sent, there was no check for pending data in the output buffer, meaning that it would be possible to truncate some responses in configs involving a very short client-fin timeout. Finally the spreading of the closures during the soft-stop brought in 2.6 by commit `b5d968d9b` ("MEDIUM: global: Add a "close-spread-time" option to spread soft-stop on time window") didn't consider the particular case of an idle "pre-connect" connection, which would also live long if a browser failed to deliver a valid request for a long time. All of this indicates that the conditions must be reworked so as not to have that level of exclusion between conditions, but rather stick to the rules from the doc that are already enforced on other muxes: - timeout client always applies if there are data pending, and is relative to each new I/O ; - timeout http-request applies before the first complete request and is relative to the entry in idle state ; - timeout http-keepalive applies between idle and the next complete request and is relative to the entry in idle state ; - timeout client-fin applies when in idle after a shut was sent (here the shut is the GOAWAY). The shut may only be considered as sent if the buffer is empty and the flags indicate that it was successfully sent (or failed) but not if it's still waiting for some room in the output buffer for example. This implies that this timeout may then lower the http-keepalive/http-request ones. This is what this patch implements. Of course the client timeout still applies as a fallback when all the ones above are not set or when their conditions are not met. It would seem reasoanble to backport this to 2.7 first, then only after one or two releases to 2.6.	2023-05-15 12:01:20 +02:00
Amaury Denoyelle	25cf19d5c8	MINOR: htx: add function to set EOM reliably Implement a new HTX utility function htx_set_eom(). If the HTX message is empty, it will first add a dummy EOT block. This is a small trick needed to ensure readers will detect the HTX buffer as not empty and retrieve the EOM flag. Replace the H2 code related by a htx_set_eom() invocation. QUIC also has the same code which will be replaced in the next commit. This should be backported up to 2.7 before the related QUIC patch.	2023-05-12 15:29:28 +02:00
Christopher Faulet	34f81d5815	BUG/MINOR: mux-h2: Also expect data when waiting for a tunnel establishment When a client H2 stream is waiting for a tunnel establishment, it must state it expects data from server. It is the second fix that should fix regressions of the commit 2722c04b ("MEDIUM: mux-h2: Don't expect data from server as long as request is unfinished") It is a 2.8-specific bug. No backport needed.	2023-05-04 16:58:33 +02:00
Christopher Faulet	4403cdf653	BUG/MEDIUM: mux-h2: Properly handle end of request to expect data from server The commit 2722c04b ("MEDIUM: mux-h2: Don't expect data from server as long as request is unfinished") introduced a regression in the H2 multiplexer. The end of the request is not systematically handled to state a H2 stream on client side now expexts data from the server. Indeed, while the client is uploading its request, the H2 stream warns it does not expect data from the server. This way, no server timeout is applied at this stage. When end of the request is detected, the H2 stream must state it now expects the server response. This enables the server timeout. However, it was only performed at one place while the end of the request can be handled at different places. First, during a zero-copy in h2_rcv_buf(). Then, when the SC is created with the full request. Because of this bug, it is possible to totally disable the server timeout for H2 streams. In h2_rcv_buf(), we now rely on h2s flags to detect the end of the request, but only when the rxbuf was emptied. It is a 2.8-specific bug. No backport needed.	2023-05-04 16:29:27 +02:00
Willy Tarreau	69530f59ae	MEDIUM: clock: replace timeval "now" with integer "now_ns" This puts an end to the occasional confusion between the "now" date that is internal, monotonic and not synchronized with the system's date, and "date" which is the system's date and not necessarily monotonic. Variable "now" was removed and replaced with a 64-bit integer "now_ns" which is a counter of nanoseconds. It wraps every 585 years, so if all goes well (i.e. if humanity does not need haproxy anymore in 500 years), it will just never wrap. This implies that now_ns is never nul and that the zero value can reliably be used as "not set yet" for a timestamp if needed. This will also simplify date checks where it becomes possible again to do "date1<date2". All occurrences of "tv_to_ns(&now)" were simply replaced by "now_ns". Due to the intricacies between now, global_now and now_offset, all 3 had to be turned to nanoseconds at once. It's not a problem since all of them were solely used in 3 functions in clock.c, but they make the patch look bigger than it really is. The clock_update_local_date() and clock_update_global_date() functions are now much simpler as there's no need anymore to perform conversions nor to round the timeval up or down. The wrapping continues to happen by presetting the internal offset in the short future so that the 32-bit now_ms continues to wrap 20 seconds after boot. The start_time used to calculate uptime can still be turned to nanoseconds now. One interrogation concerns global_now_ms which is used only for the freq counters. It's unclear whether there's more value in using two variables that need to be synchronized sequentially like today or to just use global_now_ns divided by 1 million. Both approaches will work equally well on modern systems, the difference might come from smaller ones. Better not change anyhting for now. One benefit of the new approach is that we now have an internal date with a resolution of the nanosecond and the precision of the microsecond, which can be useful to extend some measurements given that timestamps also have this resolution.	2023-04-28 16:08:08 +02:00
Willy Tarreau	ad5a5f6779	MEDIUM: tree-wide: replace timeval with nanoseconds in tv_accept and tv_request Let's get rid of timeval in storage of internal timestamps so that they are no longer mistaken for wall clock time. These were exclusively used subtracted from each other or to/from "now" after being converted to ns, so this patch removes the tv_to_ns() conversion to use them natively. Two occurrences of tv_isge() were turned to a regular wrapping subtract.	2023-04-28 16:08:08 +02:00
Willy Tarreau	76d343d3d3	MINOR: time: replace calls to tv_ms_elapsed() with a linear subtract Instead of operating on {sec, usec} now we convert both operands to ns then subtract them and convert to ms. This is a first step towards dropping timeval from these timestamps. Interestingly, tv_ms_elapsed() and tv_ms_remain() are no longer used at all and could be removed.	2023-04-28 16:08:08 +02:00
Tim Duesterhus	b1ec21d259	CLEANUP: Stop checking the pointer before calling `tasklet_free()` Changes performed with this Coccinelle patch: @@ expression e; @@ - if (e != NULL) { tasklet_free(e); - } @@ expression e; @@ - if (e) { tasklet_free(e); - } @@ expression e; @@ - if (e) tasklet_free(e); @@ expression e; @@ - if (e != NULL) tasklet_free(e); See GitHub Issue #2126	2023-04-23 00:28:25 +02:00
Willy Tarreau	ca1027c22f	MINOR: mux-h2: make the max number of concurrent streams configurable per side For a long time the maximum number of concurrent streams was set once for both sides (front and back) while the impacts are different. This commit allows it to be configured separately for each side. The older settings remains the fallback choice when other ones are not set.	2023-04-18 15:58:55 +02:00
Willy Tarreau	9d7abda787	MINOR: mux-h2: make the initial window size configurable per side For a long time the initial window size (per-stream size) was set once for both directions, frontend and backend, resulting in a tradeoff between upload speed and download fairness. This commit allows it to be configured separately for each side. The older settings remains the fallback choice when other ones are not set.	2023-04-18 15:58:55 +02:00
Christopher Faulet	c202c740b5	BUG/MEDIUM: mux-h2: Never set SE_FL_EOS without SE_FL_EOI or SE_FL_ERROR When end-of-stream is reported by a H2 stream, we must take care to also report an error is end-of-input was not reported. Indeed, it is now mandatory to set SE_FL_EOI or SE_FL_ERROR flags when SE_FL_EOS is set. It is a 2.8-specific issue. No backport needed.	2023-04-11 08:59:10 +02:00
Ilya Shipitsin	07be66d21b	CLEANUP: assorted typo fixes in the code and comments This is 35th iteration of typo fixes	2023-04-01 18:33:40 +02:00
Christopher Faulet	21fb6bdab4	BUG/MEDIUM: mux-h2: Be able to detect connection error during handshake When a backend H2 connection is waiting the connection is fully established, nothing is sent. However, it remains useful to detect connection error at this stage. It is especially important to release H2 connection on connect error. Be able to set H2_CF_ERR_PENDiNG or H2_CF_ERROR flags when the underlying connection is not fully established will exclude the H2C to be inserted in a idle list in h2_detach(). Without this fix, an H2C in PREFACE state and relying on a connection in error can be inserted in the safe list. Of course, it will be purged if not reused. But in the mean time, it can be reused. When this happens, the connection remains in error and nothing happens. At the end a connection error is returned to the client. On low traffic, we can imagine a scenario where this dead connection is the only idle connection. If it is always reused before being purged, no connection to the server is possible. In addition, h2c_is_dead() is updated to declare as dead any H2 connection with a pending error if its state is PREFACE or SETTINGS1 (thus if no SETTINGS frame was received yet). This patch should fix the issue #2092. It must be backported as far as 2.6.	2023-03-28 14:52:42 +02:00
Willy Tarreau	389ab0d4b4	BUG/MEDIUM: mux-h2: erase h2c->wait_event.tasklet on error path On the allocation error path in h2_init() we may check if h2c->wait_event.tasklet needs to be released but it has not yet been zeroed. Let's do this before jumping to the freeing location. This needs to be backported to all maintained versions.	2023-03-20 19:58:38 +01:00
Willy Tarreau	bcdc6cc15b	BUG/MEDIUM: mux-h2: do not try to free an unallocated h2s->sd In h2s_close() we may dereference h2s->sd to get the sc, but this function may be called on allocation error paths, so we must check for this specific condition. Let's also update the comment to make it explicitly permitted. This needs to be backported to 2.6.	2023-03-20 19:58:38 +01:00
Willy Tarreau	464fa06e9a	MINOR: mux-h2: set CO_SFL_MSG_MORE when sending multiple buffers Traces show that sendto() rarely has MSG_MORE on H2 despite sending multiple buffers. The reason is that the loop iterating over the buffer ring doesn't have this info and doesn't pass it down. But now we know how many buffers are left to be sent, so we know whether or not the current buffer is the last one. As such we can set this flag for all buffers but the last one.	2023-03-17 16:43:51 +01:00
Willy Tarreau	14ea98af73	BUG/MINOR: mux-h2: set CO_SFL_STREAMER when sending lots of data Emeric noticed that h2 bit-rate performance was always slightly lower than h1 when the CPU is saturated. Strace showed that we were always data in 2kB chunks, corresponding to the max_record size. What's happening is that when this mechanism of dynamic record size was introduced, the STREAMER flag at the stream level was relied upon. Since all this was moved to the muxes, the flag has to be passed as an argument to the snd_buf() function, but the mux h2 did not use it despite a comment mentioning it, probably because before the multi-buf it was not easy to figure the status of the buffer. The solution here consists in checking if the mbuf is congested or not, by checking if it has more than one buffer allocated. If so we set the CO_SFL_STREAMER flag, otherwise we don't. This way moderate size exchanges continue to be made over small chunks, but downloads will be able to use the large ones. While it could be backported to all supported versions, it would be better to limit it to the last LTS, so let's do it for 2.7 and 2.6 only. This patch requires previous commit "MINOR: buffer: add br_single() to check if a buffer ring has more than one buf".	2023-03-16 18:45:46 +01:00
Willy Tarreau	93c5511af8	BUG/MEDIUM: mux-h2: only restart sending when mux buffer is decongested During performance tests, Emeric faced a case where the wakeups of sc_conn_io_cb() caused by h2_resume_each_sending_h2s() was multiplied by 5-50 and a lot of CPU was being spent doing this for apparently no reason. The culprit is h2_send() not behaving well with congested buffers and small SSL records. What happens when the output is congested is that all buffers are full, and data are emitted in 2kB chunks, which are sufficient to wake all streams up again to ask them to send data again, something that will obviously only work for one of them at best, and waste a lot of CPU in wakeups and memcpy() due to the small buffers. When this happens, the performance can be divided by 2-2.5 on large objects. Here the chosen solution against this is to keep in mind that as long as there are still at least two buffers in the ring after calling xprt->snd_buf(), it means that the output is congested and there's no point trying again, because these data will just be placed into such buffers and will wait there. Instead we only mark the buffer decongested once we're back to a single allocated buffer in the ring. By doing so we preserve the ability to deal with large concurrent bursts while not causing a thundering herd by waking all streams for almost nothing. This needs to be backported to 2.7 and 2.6. Other versions could benefit from it as well but it's not strictly necessary, and we can reconsider this option if some excess calls to sc_conn_io_cb() are faced. Note that this fix depends on this recent commit: MINOR: buffer: add br_single() to check if a buffer ring has more than one buf	2023-03-16 18:45:46 +01:00
Willy Tarreau	3fb2c6d5b4	BUG/MINOR: mux-h2: make sure the h2c task exists before refreshing it When detaching a stream, if it's the last one and the mbuf is blocked, we leave without freeing the stream yet. We also refresh the h2c task's timeout, except that it's possible that there's no such task in case there is no client timeout, causing a crash. The fix just consists in doing this when the task exists. This bug has always been there and is extremely hard to meet even without a client timeout. This fix has to be backported to all branches, but it's unlikely anyone has ever met it anyay.	2023-03-16 18:45:46 +01:00
Christopher Faulet	3a7b539b12	BUG/MEDIUM: connection: Preserve flags when a conn is removed from an idle list The commit `5e1b0e7bf` ("BUG/MEDIUM: connection: Clear flags when a conn is removed from an idle list") introduced a regression. CO_FL_SAFE_LIST and CO_FL_IDLE_LIST flags are used when the connection is released to properly decrement used/idle connection counters. if a connection is idle, these flags must be preserved till the connection is really released. It may be removed from the list but not immediately released. If these flags are lost when it is finally released, the current number of used connections is erroneously decremented. If means this counter may become negative and the counters tracking the number of idle connecitons is not decremented, suggesting a leak. So, the above commit is reverted and instead we improve a bit the way to detect an idle connection. The function conn_get_idle_flag() must now be used to know if a connection is in an idle list. It returns the connection flag corresponding to the idle list if the connection is idle (CO_FL_SAFE_LIST or CO_FL_IDLE_LIST) or 0 otherwise. But if the connection is scheduled to be removed, 0 is also returned, regardless the connection flags. This new function is used when the connection is temporarily removed from the list to be used, mainly in muxes. This patch should fix #2078 and #2057. It must be backported as far as 2.2.	2023-03-16 15:34:20 +01:00
Christopher Faulet	5e1b0e7bf8	BUG/MEDIUM: connection: Clear flags when a conn is removed from an idle list When a connection is removed from the safe list or the idle list, CO_FL_SAFE_LIST and CO_FL_IDLE_LIST flags must be cleared. It is performed when the connection is reused. But not when it is moved into the toremove_conns list. It may be an issue because the multiplexer owning the connection may be woken up before the connection is really removed. If the connection flags are not sanitized, it may think the connection is idle and reinsert it in the corresponding list. From this point, we can imagine several bugs. An UAF or a connection reused with an invalid state for instance. To avoid any issue, the connection flags are sanitized when an idle connection is moved into the toremove_conns list. The same is performed at right places in the multiplexers. Especially because the connection release may be delayed (for h2 and fcgi connections). This patch shoudld fix the issue #2057. It must carefully be backported as far as 2.2. Especially on the 2.2 where the code is really different. But some conflicts should be expected on the 2.4 too.	2023-02-28 18:36:29 +01:00
Christopher Faulet	72722c04b0	MEDIUM: mux-h2: Don't expect data from server as long as request is unfinished As for the H1 stream, the H2 stream now states it does not expect data from the server as long as the request is unfinished. The aim is the same. We must be sure to not trigger a read timeout on server side if the client is still uploading data. From the moment the end of the request is received and forwarded to upper layer, the H2 stream reports it expects to receive data from the opposite endpoint. This re-enables read timeout on the server side.	2023-02-27 17:45:45 +01:00
Willy Tarreau	0d6e5d271f	MINOR: mux-h2/traces: add a missing TRACE_LEAVE() in h2s_frt_handle_headers() Traces from this function would miss a TRACE_LEAVE() on the success path, which had for consequences, 1) that it was difficult to figure where the function was left, and 2) that we never had the allocated stream ID clearly visible (actually the one returned by h2c_frt_stream_new() is the right one but it's not obvious). This can be backported to 2.7 and 2.6.	2023-02-20 17:22:03 +01:00
Willy Tarreau	f9f4499429	MINOR: mux-h2/traces: do not log h2s pointer for dummy streams Functions which are called with dummy streams pass it down the traces and that leads to somewhat confusing "h2s=0x1234568(0,IDL)" for example while the nature of the called function makes this stream useless at that place. Better not report a random pointer, especially since it always requires to look at the code before remembering how this should be interpreted. Now what we're doing is that the idle stream only prints "h2s=IDL" which is shorter and doesn't report a pointer, closed stream do not report anything since the stream ID 0 already implies it, and other ones are reported normally. This could be backported to 2.7 and 2.6 as it improves traces legibility.	2023-02-20 17:22:03 +01:00
Fr�d�ric L�caille	9969adbcdc	MINOR: stats: add by HTTP version cumulated number of sessions and requests Add cum_sess_ver[] new array of counters to count the number of cumulated HTTP sessions by version (h1, h2 or h3). Implement proxy_inc_fe_cum_sess_ver_ctr() to increment these counter. This function is called each a HTTP mux is correctly initialized. The QUIC must before verify the application operations for the mux is for h3 before calling proxy_inc_fe_cum_sess_ver_ctr(). ST_F_SESS_OTHER stat field for the cumulated of sessions others than HTTP sessions is deduced from ->cum_sess_ver counter (for all the session, not only HTTP sessions) from which the HTTP sessions counters are substracted. Add cum_req[] new array of counters to count the number of cumulated HTTP requests by version and others than HTTP requests. This new member replace ->cum_req. Modify proxy_inc_fe_req_ctr() which increments these counters to pass an HTTP version, 0 special values meaning "other than an HTTP request". This is the case for instance for syslog.c from which proxy_inc_fe_req_ctr() is called with 0 as version parameter. ST_F_REQ_TOT stat field compputing for the cumulated number of requests is modified to count the sum of all the cum_req[] counters. As this patch is useful for QUIC, it must be backported to 2.7.	2023-02-03 17:55:49 +01:00
Christopher Faulet	c254516c53	BUG/MINOR: mux-h2: Fix possible null pointer deref on h2c in _h2_trace_header() As reported by Coverity, this function may be called with no h2c. Thus, the pointer must always be checked before any access. One test was missing in TRACE_PRINTF_LOC(). This patch should fix the issue #2015. No backport needed, except if the commit `11e8a8c2a` ("MEDIUM: mux-h2/trace: add tracing support for headers") is backported.	2023-01-30 08:26:12 +01:00
Willy Tarreau	7cfbb81c85	CLEANUP: mux-h2/trace: shorten the name of the header enc/dec functions The functions in charge of processing headers have their names in the traces and they're among the longest of the mux_h2.c file, while even containing some redundancy. These names are not used outside, let's shorten them: - h2c_decode_headers -> h2c_dec_hdrs - h2s_bck_make_req_headers -> h2s_snd_bhdrs - h2s_frt_make_resp_headers -> h2s_snd_fhdrs Now the traces are a bit more readable: [00\|h2\|5\|mux_h2.c:4822] h2c_dec_hdrs(): h2c=0x1870510(F,FRP) dsi=1 rcvh :method: GET [00\|h2\|5\|mux_h2.c:4822] h2c_dec_hdrs(): h2c=0x1870510(F,FRP) dsi=1 rcvh :path: / [00\|h2\|5\|mux_h2.c:4822] h2c_dec_hdrs(): h2c=0x1870510(F,FRP) dsi=1 rcvh :scheme: http [00\|h2\|5\|mux_h2.c:4822] h2c_dec_hdrs(): h2c=0x1870510(F,FRP) dsi=1 rcvh :authority: localhost:14446 [00\|h2\|5\|mux_h2.c:4822] h2c_dec_hdrs(): h2c=0x1870510(F,FRP) dsi=1 rcvh user-agent: curl/7.54.1 [00\|h2\|5\|mux_h2.c:4822] h2c_dec_hdrs(): h2c=0x1870510(F,FRP) dsi=1 rcvh accept: /	2023-01-26 16:05:51 +01:00
Willy Tarreau	11e8a8c2ac	MEDIUM: mux-h2/trace: add tracing support for headers Now we can make use of TRACE_PRINTF() to iterate over headers as they are received or dumped. It's worth noting that the dumps may occasionally be interrupted due to a buffer full or a realign, but in this case it will be visible because the trace will restart from the first one. All these headers (and trailers) may be interleaved with other connections' so they're all preceeded by the pointer to the connection and optionally the stream (or alternately the stream ID) to help discriminating them. Since it's not easy to read the header directions, sent headers are prefixed with "sndh" and received headers are prefixed with "rcvh", both of which are rare enough in the traces to conveniently support a quick grep. In order to avoid code duplication, h2_encode_headers() was implemented as a wrapper on top of hpack_encode_header(), which optionally emits the header to the trace if the trace is active. In addition, for headers that are encoded using a different method, h2_trace_header() was added as well. Header names are truncated to 256 bytes and values to 1024 bytes. If the lengths are larger, they will be truncated and suffixed with "(... +xxx)" where "xxx" is the number of extra bytes. Example of what an end-to-end H2 request gives: [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh :method: GET [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh :path: / [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh :scheme: http [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh :authority: localhost:14446 [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh user-agent: curl/7.54.1 [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh accept: / [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c13120(F,FRP) dsi=1 rcvh cookie: blah [00\|h2\|5\|mux_h2.c:5491] h2s_bck_make_req_headers(): h2c=0x1c1cd90(B,FRH) h2s=0x1c1e3d0(1,IDL) sndh :method: GET [00\|h2\|5\|mux_h2.c:5572] h2s_bck_make_req_headers(): h2c=0x1c1cd90(B,FRH) h2s=0x1c1e3d0(1,IDL) sndh :authority: localhost:14446 [00\|h2\|5\|mux_h2.c:5596] h2s_bck_make_req_headers(): h2c=0x1c1cd90(B,FRH) h2s=0x1c1e3d0(1,IDL) sndh :path: / [00\|h2\|5\|mux_h2.c:5647] h2s_bck_make_req_headers(): h2c=0x1c1cd90(B,FRH) h2s=0x1c1e3d0(1,IDL) sndh user-agent: curl/7.54.1 [00\|h2\|5\|mux_h2.c:5647] h2s_bck_make_req_headers(): h2c=0x1c1cd90(B,FRH) h2s=0x1c1e3d0(1,IDL) sndh accept: / [00\|h2\|5\|mux_h2.c:5647] h2s_bck_make_req_headers(): h2c=0x1c1cd90(B,FRH) h2s=0x1c1e3d0(1,IDL) sndh cookie: blah [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c1cd90(B,FRP) dsi=1 rcvh :status: 200 [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c1cd90(B,FRP) dsi=1 rcvh content-length: 0 [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c1cd90(B,FRP) dsi=1 rcvh x-req: size=102, time=0 ms [00\|h2\|5\|mux_h2.c:4818] h2c_decode_headers(): h2c=0x1c1cd90(B,FRP) dsi=1 rcvh x-rsp: id=dummy, code=200, cache=1, size=0, time=0 ms (0 real) [00\|h2\|5\|mux_h2.c:5210] h2s_frt_make_resp_headers(): h2c=0x1c13120(F,FRH) h2s=0x1c1c780(1,HCR) sndh :status: 200 [00\|h2\|5\|mux_h2.c:5231] h2s_frt_make_resp_headers(): h2c=0x1c13120(F,FRH) h2s=0x1c1c780(1,HCR) sndh content-length: 0 [00\|h2\|5\|mux_h2.c:5231] h2s_frt_make_resp_headers(): h2c=0x1c13120(F,FRH) h2s=0x1c1c780(1,HCR) sndh x-req: size=102, time=0 ms [00\|h2\|5\|mux_h2.c:5231] h2s_frt_make_resp_headers(): h2c=0x1c13120(F,FRH) h2s=0x1c1c780(1,HCR) sndh x-rsp: id=dummy, code=200, cache=1, size=0, time=0 ms (0 real) At some point the frontend/backend names would be useful but that's a more general comment than just the H2 traces.	2023-01-26 15:51:30 +01:00
Willy Tarreau	17c630b846	BUG/MINOR: mux-h2: add missing traces on failed headers decoding In case HPACK cannot be decoded, logs are emitted but there's no info in the H2 traces, so let's add them. This may be backported to all supported versions.	2023-01-20 00:02:21 +01:00
Willy Tarreau	f43f36da5b	BUG/MINOR: mux-h2: make sure to produce a log on invalid requests As reported by Dominik Froehlich in github issue #1968, some H2 request parsing errors do not result in a log being emitted. This is annoying for debugging because while an RST_STREAM is correctly emitted to the client, there's no way without enabling traces to find it on the haproxy side. After some testing with various abnormal requests, a few places were found where logs were missing and could be added. In this case, we simply use sess_log() so some sample fetch functions might not be available since the stream is not created. But at least there will be a BADREQ in the logs. A good eaxmple of this consists in sending forbidden headers or header syntax (e.g. presence of LF in value). Some quick tests can be done this way: - protocol error (LF in value): curl -iv --http2-prior-knowledge -H "$(printf 'a:b\na')" http://0:8001/ - too large header block after decoding: curl -v --http2-prior-knowledge -H "a:$(perl -e "print('a'x10000)")" -H "a:$(perl -e "print('a'x10000)")" http://localhost:8001/ This should be backported where needed, most likely 2.7 and 2.6 at least for a start, and progressively to other versions.	2023-01-19 23:37:00 +01:00
Willy Tarreau	35c4dd0005	CLEANUP: stconn: always use se_fl_set_error() to set the pending error In mux-h2 and mux-quic we still had two places manually setting SE_FL_ERR_PENDING or SE_FL_ERROR depending on the EOS state, instead of using se_fl_set_error() which takes care of the condition. Better use the specialized function for this, it will allow to centralize the conditions. Note that this will be needed to fix a bug.	2023-01-17 16:25:29 +01:00
Christopher Faulet	2e47e3a1cf	MINOR: htx: Add an HTX value for the extra field is payload length is unknown When the payload length cannot be determined, the htx extra field is set to the magical vlaue ULLONG_MAX. It is not obvious. This a dedicated HTX value is now used. Now, HTX_UNKOWN_PAYLOAD_LENGTH must be used in this case, instead of ULLONG_MAX.	2023-01-13 11:51:11 +01:00
Christopher Faulet	462f52260c	BUG/MEDIUM: mux-h2: Don't send CANCEL on shutw when response length is unkown Since commit `473e0e54` ("BUG/MINOR: mux-h2: send a CANCEL instead of ES on truncated writes"), a CANCEL may be reported when the response length is unkown. It happens for H1 reponses without "Content-lenght" or "Transfer-encoding" header. Indeed, in this case, the end of the reponse is detected when the server connection is closed. On the fontend side, the H2 multiplexer handles this event as an abort and sensd a RST_STREAM frame with CANCEL error code. The issue is not with the above commit but with the commit `4877045f1` ("MINOR: mux-h2: make streams know if they need to send more data"). The H2_SF_MORE_HTX_DATA flag must only be set if the payload length can be determined. This patch should fix the issue #1992. It must be backported to 2.7.	2023-01-13 11:28:32 +01:00
Christopher Faulet	827a6299e6	BUG/MEDIUM: mux-h2: Refuse interim responses with end-stream flag set As state in RFC9113#8.1, HEADERS frame with the ES flag set that carries an informational status code is malformed. However, there is no test on this condition. On 2.4 and higher, it is hard to predict consequences of this bug because end of the message is only reported with a flag. But on 2.2 and lower, it leads to a crash because there is an unexpected extra EOM block at the end of an interim response. Now, when a ES flag is detected on a HEADERS frame for an interim message, a stream error is sent (RST_STREAM/PROTOCOL_ERROR). This patch should solve the issue #1972. It should be backported as far as 2.0.	2022-12-22 13:46:21 +01:00
Willy Tarreau	f8c7709013	MINOR: mux-h2: add the expire task and its expiration date in "show fd" Some issues such as #1929 seem to involve a task without timeout but we can't find the condition to reproduce this in the code. However, not having this info in the output doesn't help, so this patch adds the task pointer and its timeout (when the task is non-null). It may be useful to backport it.	2022-11-29 15:29:00 +01:00
Christopher Faulet	68ee7845cf	CLEANUP: mux-h2: Remove unused fields in h2c structures Some fields in h2c structures are not used: .mfl, .mft and .mff. Just remove them. .msi field is also removed. It is tested but never set, except when a H2 connection is initialized. It also means h2c_mux_busy() function is useless because it always returns 0 (.msi is always -1). And thus, by transitivity, H2_CF_DEM_MBUSY is also useless because it is never set. So .msi field, h2c_mux_busy() function and H2C_MUX_BUSY flag are removed.	2022-11-17 14:33:15 +01:00
Christopher Faulet	ff7925dce0	MEDIUM: mux-h2: Introduce flags to deal with connection read/write errors Similarly to the H1 multiplexer, H2_CF_ERR_PENDING is now used to report an error when we try to send data and H2_CF_ERROR to report an error when we try to read data. In other funcions, we rely on these flags instead of connection ones. Only H2_CF_ERROR is considered as a final error. H2_CF_ERR_PENDING does not block receive attempt. In addition, we rely on H2_CF_RCVD_SHUT flag to test if a read0 was received or not.	2022-11-17 14:33:15 +01:00
Willy Tarreau	8522348482	BUG/MAJOR: conn-idle: fix hash indexing issues on idle conns Idle connections do not work on 32-bit machines due to an alignment issue causing the connection nodes to be indexed with their lower 32-bits set to zero and the higher 32 ones containing the 32 lower bitss of the hash. The cause is the use of ebmb_node with an aligned data, as on this platform ebmb_node is only 32-bit aligned, leaving a hole before the following hash which is a uint64_t: $ pahole -C conn_hash_node ./haproxy struct conn_hash_node { struct ebmb_node node; /* 0 20 / / XXX 4 bytes hole, try to pack / int64_t hash; / 24 8 / struct connection conn; /* 32 4 / / size: 40, cachelines: 1, members: 3 / / sum members: 32, holes: 1, sum holes: 4 / / padding: 4 / / last cacheline: 40 bytes */ }; Instead, eb64 nodes should be used when it comes to simply storing a 64-bit key, and that is what this patch does. For backports, a variant consisting in simply marking the "hash" member with a "packed" attribute on the struct also does the job (tested), and might be preferable if the fix is difficult to adapt. Only 2.6 and 2.5 are affected by this.	2022-10-03 12:06:36 +02:00
Willy Tarreau	6c0fadfb7d	REORG: mux-h2: extract flags and enums into mux_h2-t.h Originally in 1.8 we wanted to have an independent mux that could possibly be disabled and would not impose dependencies on the outside. Everything would fit into a single C file and that was fine. Nowadays muxes are unavoidable, and not being able to easily inspect them from outside is sometimes a bit of a pain. In particular, the flags utility still cannot be used to decode their flags. As a first step towards this, this patch moves the flags and enums to mux_h2-t.h, as well as the two state decoding inline functions. It also dropped the H2_SS_*_BIT defines that nobody uses. The mux_h2.c file remains the only one to include that for now.	2022-09-12 19:33:07 +02:00
Willy Tarreau	7051f73efe	MINOR: mux-h2: insert line breaks in "show sess all" output for legibility h2s and h2c were extremely long in the "show sess all" output, around 300 chars each. This adds a few line breaks to improve legibility, there are now 3 lines for each, which are around the same length as the other ones while keeping a natural arrangement. E.g (lines highlighted with '>'): 0x7faad8144f80: [02/Sep/2022:15:49:40.171620] id=105283 proto=tcpv4 source=127.0.0.1:42942 flags=0x100c4a, conn_retries=0, conn_exp=<NEVER> conn_et=0x000 srv_conn=0x1f44b20, pend_pos=(nil) waiting=0 epoch=0 frontend=decrypt (id=2 mode=http), listener=? (id=3) addr=127.0.0.1:8001 backend=decrypt (id=2 mode=http) addr=127.0.0.1:18144 server=httpterm (id=1) addr=127.0.0.1:8000 task=0x7faad812b7c0 (state=0x00 nice=0 calls=2 rate=0 exp=4s tid=7(1/7) age=0s) txn=0x7faad81453e0 flags=0x43000 meth=1 status=200 req.st=MSG_DONE rsp.st=MSG_DATA req.f=0x4c rsp.f=0x0d scf=0x7faad81625d0 flags=0x00000080 state=EST endp=CONN,0x7faad811d380,0x02805001 sub=1 > h2s=0x7faad811d380 h2s.id=2113 .st=HCR .flg=0x207001 .rxbuf=0@(nil)+0/0 > .sc=0x7faad81625d0(.flg=0x00000080 .app=0x7faad8144f80) .sd=0x7faad8119dc0(.flg=0x02805001) > .subs=0x7faad81625e0(ev=1 tl=0x7faad86d6500 tl.calls=4 tl.ctx=0x7faad81625d0 tl.fct=sc_conn_io_cb) > h2c=0x7faad802c640 h2c.st0=FRH .err=0 .maxid=2157 .lastid=-1 .flg=0x0600 .nbst=1 .nbsc=1 > .fctl_cnt=0 .send_cnt=0 .tree_cnt=1 .orph_cnt=0 .sub=1 .dsi=2157 .dbuf=0@(nil)+0/0 > .msi=-1 .mbuf=[6..6\|32],h=[0@(nil)+0/0],t=[0@(nil)+0/0] co0=0x7faae402efc0 ctrl=tcpv4 xprt=RAW mux=H2 data=STRM target=LISTENER:0x1f43c40 flags=0x00000300 fd=95 fd.state=121 updt=0 fd.tmask=0x80 scb=0x7faad8145370 flags=0x00000011 state=EST endp=CONN,0x7faad8115630,0x02840001 sub=1 co1=0x7faad86c0730 ctrl=tcpv4 xprt=RAW mux=H1 data=STRM target=SERVER:0x1f44b20 flags=0x00000300 fd=1656 fd.state=10121 updt=0 fd.tmask=0x80 req=0x7faad8144fa0 (f=0x49c40000 an=0x8000 pipe=0 tofwd=0 total=110) an_exp=<NEVER> rex=<NEVER> wex=<NEVER> buf=0x7faad8144fa8 data=(nil) o=0 p=0 i=0 size=0 htx=0xdd90a0 flags=0x0 size=0 data=0 used=0 wrap=NO extra=0 res=0x7faad8145000 (f=0x80040202 an=0x4000000 pipe=0 tofwd=-1 total=60365) an_exp=<NEVER> rex=<NEVER> wex=<NEVER> buf=0x7faad8145008 data=(nil) o=0 p=0 i=0 size=0 htx=0xdd90a0 flags=0x0 size=0 data=0 used=0 wrap=NO extra=0	2022-09-02 16:43:03 +02:00
Willy Tarreau	bf4ec6f4a0	MINOR: mux-h2: provide a "show_sd" helper to output stream debugging info With this, it now becomes possible to see the state of each H2 stream from "show sess all". Lines are still too long and need to be split, but that's for another patch.	2022-09-02 15:48:50 +02:00
Willy Tarreau	4e97bcc76b	MINOR: mux-h2: extract the connection dump function out of h2_show_fd() The function will be reusable to dump connections, so let's extract it.	2022-09-02 15:48:10 +02:00
Willy Tarreau	90bffa2ce3	MINOR: mux-h2: extract the stream dump function out of h2_show_fd() The function will be reusable to dump streams, so let's extract it. Note that due to "last_h2s" being originally printed as a prefix for the stream dump, now the pointer is displayed by the caller instead.	2022-09-02 15:48:10 +02:00
Willy Tarreau	ba7657ca0f	BUG/MINOR: mux-h2: fix the "show fd" dest buffer for the subscriber Commit `98e40b981` ("MINOR: mux-h2: make the "show fd" helper also decode the h2s subscriber when known") improved the output of "show fd" for the H2 mux, but the output is sent to the trash buffer instead of the msg argument. It turns out that this has no effect right now as the caller passes the trash but this is risky. This should be backported to 2.4.	2022-09-02 14:23:56 +02:00
Willy Tarreau	473e0e54f5	BUG/MINOR: mux-h2: send a CANCEL instead of ES on truncated writes If a POST upload is cancelled after having advertised a content-length, or a response body is truncated after a content-length, we're not allowed to send ES because in this case the total body length must exactly match the advertised value. Till now that's what we were doing, and that was causing the other side (possibly haproxy) to respond with an RST_STREAM PROTOCOL_ERROR due to "ES on DATA frame before content-length". We can behave a bit cleaner here. Let's detect that we haven't sent everything, and send an RST_STREAM(CANCEL) instead, which is designed exactly for this purpose. This patch could be backported to older versions but only a little bit of exposure to make sure it doesn't wake up a bad behavior somewhere. It relies on the following previous commit: "MINOR: mux-h2: make streams know if they need to send more data"	2022-08-19 08:03:53 +02:00
Willy Tarreau	4877045f1d	MINOR: mux-h2: make streams know if they need to send more data H2 streams do not even know if they are expected to send more data or not, which is problematic when closing because we don't know if we're closing too early or not. Let's start by adding a new stream flag "H2_SF_MORE_HTX_DATA" to indicate this on the tx path.	2022-08-19 08:03:53 +02:00
Willy Tarreau	ed2b9d9f27	MINOR: mux-h2/traces: report transition to SETTINGS1 before not after Traces indicating "switching to XXX" generally apply before the transition so that the current connection state is visible in the trace. SETTINGS1 was incorrect in this regard, with the trace being emitted after. Let's fix this. No need to backport this, as this is purely cosmetic.	2022-08-19 08:03:53 +02:00
Willy Tarreau	0f45871344	BUG/MEDIUM: mux-h2: do not fiddle with ->dsi to indicate demux is idle When switching to H2_CS_FRAME_H, we do not want to present the previous frame's state, flags, length etc in traces, or we risk to confuse the analysis, making the reader think that the header information presented is related to the new frame header being analysed. A naive approach could have consisted in simply relying on the current parser state (FRAME_H being that state), but traces are emitted before switching the state, so traces cannot rely on this. This was initially addressed by commit `73db434f7` ("MINOR: h2/trace: report the frame type when known") which used to set dsi to -1 when the connection becomes idle again, but was accidentally broken by commit `5112a603d` ("BUG/MAJOR: mux_h2: Don't consume more payload than received for skipped frames") which moved dsi after calling the trace function. But in both cases there's problem with this approach. If an RST or WU frame cannot be uploaded due to a busy mux, and at the same time we complete processing on a perfect end of frame with no single new frame header, we can leave the demux loop with dsi=-1 and with RST or WU to be sent, and these ones will be sent for stream ID -1. This is what was reported in github issue #1830. This can be reproduced with a config chaining an h1->h2 proxy to an empty h2 frontend, and uploading a large body such as below: $ (printf "POST / HTTP/1.1\r\nContent-length: 1000000000\r\n\r\n"; cat /dev/zero) \| nc 0 4445 > /dev/null This shows that we must never affect ->dsi which must always remain valid, and instead we should set "something else". That something else could be served by the demux frame type, but that one also needs to be preserved for the RST_STREAM case. Instead, let's just add a connection flag to say that the demuxing is in progress. This will be set once a new demux header is set and reset after the end of a frame. This way the trace subsystem can know that dft/dfl must not be displayed, without affecting the logic relying on such values. Given that the commits above are old and were backported to 1.8, this new one also needs to be backported as far as 1.8. Many thanks to David le Blanc (@systemmonkey42) for spotting, reporting, capturing and analyzing this bug; his work permitted to quickly spot the problem.	2022-08-19 08:03:53 +02:00

1 2 3 4 5 ...

956 Commits