haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-20 18:31:02 +01:00

Author	SHA1	Message	Date
Amaury Denoyelle	b849ee5fa3	BUILD: quic: fix overflow in global tune A new global option was recently introduced to disable pacing. However, the value used (1<<31) caused issue with some compiler as options field used for storage is declared as int. Move pacing deactivation flag outside into the newly defined quic_tune to fix this. This should be backported up to 3.1 after a period of observation. Note that it relied on the previous patch which defined new quic_tune type.	2025-01-30 18:12:53 +01:00
Amaury Denoyelle	0c8b54b2d1	MINOR: quic: transform pacing settings into a global option Pacing support was previously activated on each bind line individually, via an optional argument of quic-cc-algo keyword. Remove this optional argument and introduce a global setting to enable/disable pacing. Pacing activation is still flagged as experimental. One important change is that previously BBR usage automatically activated pacing support. This is not the case anymore, so users should now always explicitely activate pacing if BBR is selected. A new warning message will be displayed if this is not the case. Another consequence of this change is that now pacing_inter callback is always defined for every quic_cc_algo types. As such, QUIC MUX uses global.tune.options to determine if pacing is required. This should be backported up to 3.1, after a period of observation.	2025-01-30 17:19:38 +01:00
Amaury Denoyelle	8098be1fdc	MEDIUM: mux-quic: reduce pacing CPU usage with passive wait Pacing algorithm has been revamped in the previous commit to implement a credit based solution. This is a far more adaptative solution, in particular which allow to catch up in case pause between pacing emission was longer than expected. This allows QMUX to remove the active loop based on tasklet wake-up. Instead, a new task is used when emission should be paced. The main advantage is that CPU usage is drastically reduced. New pacing task timer is reset each time qcc_io_send() is invoked. Timer will be set only if pacing engine reports that emission must be interrupted. In this case timer is set via qcc_wakeup_pacing() to the delay reported by congestion algorithm, or 1ms if delay is too short. At the end of qcc_io_cb(), pacing task is queued if timer has been set. Pacing task execution is simple enough : it immediately wakes up QCC I/O handler. Note that to have decent performance, it requires to have a large enough burst defined in configuration of quic-cc-algo. However, this value is common to every listener clients, which may cause too much loss under network conditions. This will be address in a future patch. This should be backported up to 3.1.	2025-01-23 17:40:22 +01:00
Amaury Denoyelle	4489a61585	MEDIUM: quic: implement credit based pacing Implement a new method for QUIC pacing emission based on credit. This represents the number of packets which can be emitted in a single burst. After emission, decrement from the credit the number of emitted packets. Several emission can be conducted in the same sequence until the credit is completely decremented. When a new emission sequence is initiated (i.e. under a new QMUX tasklet invokation), credit is refilled according to the delay which occured between the last and current emission context. This new mechanism main advantage is that it allows to conduct several emission in the same task context without having to wait between each invokation. Wait is only forced if pacing is expired, which is now equivalent to having a null credit. Furthermore, if delay between two emissions sequence would have been smaller than expected, credit is only partially refilled. This allows to restart emission without having to wait for the whole credit to be available. On the implementation side, a new field <credit> is avaiable in quic_pacer structure. It is automatically decremented on quic_pacing_sent_done() invokation. Also, a new function quic_pacing_reload() must be used by QUIC MUX when a new emission sequence is initiated to refill credit. <next> field from quic_pacer has been removed. For the moment, credit is based on the burst configured via quic-cc-algo keyword, or directly reported by BBR. This should be backported up to 3.1.	2025-01-23 17:40:20 +01:00
Amaury Denoyelle	9d8589f0de	MINOR: mux-quic: increment pacing retry counter on expired A field <paced_sent_ctr> from quic_pacer structure is used to report the number of occurences where emission has been interrupted due to pacing. However, it was not incremented when QUIC MUX had to pause immediately emission as pacing was still not yet expired. Fix this by incrementing <paced_sent_ctr> in qcc_io_send() prior to emission if pacing is expired. Note that incrementation is only done once if the tasklet is then repeatdely woken up until the timer is expired. This should be backported up to 3.1.	2025-01-23 17:29:14 +01:00
Amaury Denoyelle	7c0820892f	MINOR: quic: rename pacing_rate cb to pacing_inter Rename one of the congestion algorithms pacing callback from pacing_rate to pacing_inter. This better reflects that this function returns a delay (in nanoseconds) which should be applied between each packet emission to fill the congestion window with a perfectly smoothed emission. This should be backported up to 3.1.	2025-01-23 14:49:35 +01:00
Amaury Denoyelle	801e39e1cc	BUG/MINOR: mux-quic: handle closure of uni-stream This commit is a direct follow-up to the previous one. As already described, a previous fix was merged to prevent streamdesc attach operation on already completed QCS instances scheduled for purging. This was implemented by skipping app proto decoding. However, this has a bad side-effect for remote uni-directional stream. If receiving a FIN stream frame on such a stream, it will considered as complete because streamdesc are never attached to a uni stream. Due to the mentionned new fix, this prevent analysis of this last frame for every uni stream. To fix this, do not skip anymore app proto decoding for completed QCS. Update instead qcs_attach_sc() to transform it as a noop function if QCS is already fully closed before streamdesc instantiation. However, success return value is still used to prevent an invalid decoding error report. The impact of this bug should be minor. Indeed, HTTP3 and QPACK uni streams are never closed by the client as this is invalid due to the spec. The only issue was that this prevented QUIC MUX to close the connection with error H3_ERR_CLOSED_CRITICAL_STREAM. This must be backported along the previous patch, at least to 3.1, and eventually to 2.8 if mentionned patches are merged there.	2025-01-03 17:21:19 +01:00
Amaury Denoyelle	af00be8e0f	MINOR: mux-quic: change return value of qcs_attach_sc() A recent fix was introduced to ensure that a streamdesc instance won't be attached to an already completed QCS which is eligible to purging. This was performed by skipping application protocol decoding if a QCS is in such a state. Here is the patch responsible for this change. caf60ac696a29799631a76beb16d0072f65eef12 BUG/MEDIUM: mux-quic: do not attach on already closed stream However, this is too restrictive, in particular for unidirection stream where no streamdesc is never attached. To fix this behavior, first qcs_attach_sc() API has been modified. Instead of returning a streamdesc instance, it returns either 0 on success or a negative error code. There should be no functional changes with this patch. It is only to be able to extend qcs_attach_sc() with the possibility of skipping streamdesc instantiation while still keeping a success return value. This should be backported wherever the above patch has been merged. For the record, it was scheduled for immediate backport on 3.1, plus merging on older releases up to 2.8 after a period of observation.	2025-01-03 17:19:21 +01:00
Amaury Denoyelle	4f2554903b	BUG/MINOR: mux-quic: fix wakeup on qcc_set_error() The following patch was a major refactoring of QUIC MUX. It removes pacing specific code path. In particular, qcc_wakeup() utility function was removed and replaced by its tasklet_wakup() usage. 41f0472d967b2deb095d5adc8a167da973fbee3d MEDIUM: mux-quic: remove pacing specific code on qcc_io_cb However, an incorrect substitution was performed in qcc_set_error(). As such, there was no explicit wakeup in case an error is detected by QUIC MUX or the app protocol layer. This may lead to missing error reporting to clients. Fix this by re-add tasklet_wakup() usage into qcc_set_error(). This must be backported up to 3.1 where above patch is scheduled.	2025-01-03 10:39:49 +01:00
Amaury Denoyelle	caf60ac696	BUG/MEDIUM: mux-quic: do not attach on already closed stream Due to QUIC packet reordering, a stream may be opened via a new RESET_STREAM or STOP_SENDING frame. This would cause either Tx or Rx channel to be immediately closed. This can cause an issue with current QUIC MUX implementation with QCS purging. QCS are inserted into QCC purge list when transfer could be considered as completed. In most cases, this happens after full request/response exchange. However, it can also happens after request reception if RESET_STREAM/STOP_SENDING are received first. A BUG_ON() crash will occur if a STREAM frame is received after. In this case, streamdesc instance will be attached via qcs_attach_sc() to handle the new request. However, QCS is already considered eligible to purging. It could cause it to be released while its streamdesc instance remains. A BUG_ON() crash detects this problem in qcc_purge_streams(). To fix this, extend qcc_decode_qcs() to skip app proto rcv_buf invokation if QCS is considered completed. A similar condition was already implemented when read was previously aborted after a STOP_SENDING emission by QUIC MUX. This crash was reproduced on haproxy.org. Here is the output of the backtrace : Core was generated by `./haproxy-dev -db -f /etc/haproxy/haproxy-current.cfg -sf 16495'. Program terminated with signal SIGILL, Illegal instruction. #0 0x00000000004e442b in qcc_purge_streams (qcc=0x774cca0) at src/mux_quic.c:2661 2661 BUG_ON_HOT(!qcs_is_completed(qcs)); [Current thread is 1 (LWP 1457)] [ ## gdb ## ] bt #0 0x00000000004e442b in qcc_purge_streams (qcc=0x774cca0) at src/mux_quic.c:2661 #1 0x00000000004e4db7 in qcc_io_process (qcc=0x774cca0) at src/mux_quic.c:2744 #2 0x00000000004e5a54 in qcc_io_cb (t=0x7f71193940c0, ctx=0x774cca0, status=573504) at src/mux_quic.c:2886 #3 0x0000000000b4f792 in run_tasks_from_lists (budgets=0x7ffdcea1e670) at src/task.c:603 #4 0x0000000000b5012f in process_runnable_tasks () at src/task.c:883 #5 0x00000000007de4a3 in run_poll_loop () at src/haproxy.c:2771 #6 0x00000000007deb9f in run_thread_poll_loop (data=0x1335a00 <ha_thread_info>) at src/haproxy.c:2985 #7 0x00000000007dfd8d in main (argc=6, argv=0x7ffdcea1e958) at src/haproxy.c:3570 This BUG_ON() crash can only happen since 3.1 refactoring. Indeed, purge list was only implemented on this version. As such, please backport it on 3.1 immediately. However, a logic issue remains for older version as a stream could be attached on a fully closed QCS. Thus, it should be backported up to 2.8, this time after a period of observation.	2025-01-02 11:25:40 +01:00
Amaury Denoyelle	4a997e5a93	MINOR: mux-quic: add traces on sd attach Add traces into qcs_attach_sc(). This function is called when a request is received on a QCS stream and a streamdesc instance is attached. This will be useful to facilitate debugging.	2025-01-02 11:25:40 +01:00
Amaury Denoyelle	ddfd8031f8	BUG/MAJOR: mux-quic: properly fix BUG_ON on empty STREAM emission Properly fix BUG_ON() occurence when QUIC MUX emits only empty STREAM frames. This was addressed by a previous patch but it causes another regression so a revert was needed. BUG_ON() on qcc_build_frms() return value is invalid. Indeed, qcc_build_frms() may return 0, but this does not imply that frame list is empty, as encoded frames can have a zero length payload. As such, simply remove this invalid BUG_ON(). This must be backported up to 3.1.	2025-01-02 11:25:40 +01:00
Amaury Denoyelle	85e27f1e92	Revert "BUG/MAJOR: mux-quic: fix BUG_ON on empty STREAM emission" This reverts commit 98064537423fafe05b9ddd97e81cedec8b6b278d. Above patch tried to fix a BUG_ON() occurence when MUX only emitted empty STREAM frames via qcc_build_frms(). Return value of qcs_send() was changed from the payload STREAM frame to the whole frame length. However, this is invalid as this return value is used to ensure connection flow-control is not exceeded on sending retry. This causes occurence of BUG_ON() crash in qcc_io_send() as send-list is not properly purged after QCS emission. Reverts this incorrect fix. The original issue will be properly dealt in the next commit. This commit must be backported to 3.1 if reverted commit was already applied on it.	2025-01-02 11:00:25 +01:00
Amaury Denoyelle	9806453742	BUG/MAJOR: mux-quic: fix BUG_ON on empty STREAM emission A BUG_ON() is present in qcc_io_send() to ensure that encoded frame list is empty if qcc_build_frms() previously returned 0. This BUG_ON() may be triggered if empty STREAM frame is encoded for standalone FIN. Indeed, qcc_build_frms() returns the sum of all STREAM payload length. In case only empty STREAM frames are generated, return value will be 0, despite new frames encoded and inserted into frame list. To fix this, change return value of qcs_send(). This now returns the whole STREAM frame length, both header and payload included. This ensures that qcc_build_frms() won't return a nul value if new frames are encoded, even empty ones. This must be backported up to 3.1.	2024-12-31 16:39:53 +01:00
Amaury Denoyelle	4490df57a6	CLEANUP: mux-quic: remove dead err label in qcc_build_frms() STREAM frames emission in qcc_build_frms() has been splitted from RESET_STREAM/STOP_SENDING into qcc_emit_rs_ss(). Now, the former cannot fail, as such err label can be removed as it is unreachable. This should be backported up to 3.1. This should fix github issue #2824.	2024-12-19 16:36:33 +01:00
Amaury Denoyelle	7edb2ffae7	BUG/MEDIUM: mux-quic: prevent BUG_ON() by refreshing frms on MAX_DATA QUIC MUX emission has been optimized recently by recycling STREAM frames list between emission cycles. This is done via qcc frms list member. If new data is available, frames list must be cleared before the next emission to force the encoding of new STREAM frames. If a refresh frames list is missed, it would lead to incomplete data emission on the next transfer. In most cases, this is detected via a BUG_ON() inside qcc_io_send(), as qcs instances remains in send_list after a qcc_send_frames() full emission. A bug was recently found which causes this BUG_ON() crash. This is directly related to flow control. Indeed, when sending credit is increased on the connection or a stream, frames list should be cleared as new larger STREAM frames could be encoded. This was already performed on MAX_DATA/MAX_STREAM_DATA reception but only if flow-control limit was unblocked. However this is not the proper condition and it may lead to insufficient frames refresh and thus this BUG_ON() crash. Fix this by adjusting the condition for frames refresh on flow control credit increase. Now, frames list is cleared if real offset is not blocked and soft offset was equal or greater to the previous limit. Indeed, this is the only case in which frames refreshing is necessary as it would result in bigger encoded STREAM frames. This bug was detected on QUIC interop with go-x-net client. It can also be reproduced, albeit not systematically, using the following command : $ ngtcp2-client -q --no-quic-dump --no-http-dump \ --exit-on-all-streams-close --max-data 10 \ 127.0.0.1 20443 -n10 "http://127.0.0.1:20443/?s=10k" This bug appeared with the following patch. As it is scheduled for 3.1 backporting, the current fix should be backported with it. 14710b5e6bf76834343d58db22e00b72590b16fe MEDIUM/OPTIM: mux-quic: do not rebuild frms list on every send	2024-12-19 16:36:28 +01:00
Amaury Denoyelle	53db43aff2	MINOR: mux-quic: hide traces when woken up on pacing only Previous commit aligned default and pacing emission. This is a cleaner and more robust code. However, it may disrupt traces analysis when pacing is rescheduled until timer expiration. Hide traces when qcc_io_cb() is woken up only due to pacing and timer is not yet expired. This is implemented by using special TASK_WOKEN_IO for pacing. This should be backported up to 3.1.	2024-12-18 09:52:16 +01:00
Amaury Denoyelle	41f0472d96	MEDIUM: mux-quic: remove pacing specific code on qcc_io_cb Pacing was recently implemented by QUIC MUX. Its tasklet is rescheduled until next emission timer is reached. To improve performance, an alternate execution of qcc_io_cb was performed when rescheduled due to pacing. This was implemented using TASK_F_USR1 flag. However, this model is fragile, in particular when several events happened alongside pacing scheduling. This has caused some issue recently, most notably when MUX is subscribed on transport layer on receive for handshake completion while pacing emission is performed in parallel. MUX qcc_io_cb() would not execute the default code path, which means the reception event is silently ignored. Recent patches have reworked several parts of qcc_io_cb. The objective was to improve performance with better algorithm on send and receive part. Most notable, qcc frames list is only cleared when new data is available for emission. With this, pacing alternative code is now mostly unneeded. As such, this patch removes it. The following changes are performed : * TASK_F_USR1 is now not used by QUIC MUX. As such, tasklet_wakeup() default invokation can now replace obsolete wrappers qcc_wakeup/qcc_wakeup_pacing * qcc_purge_sending is removed. On pacing rescheduling, all qcc_io_cb() is executed. This is less error-prone, in particular when pacing is mixed with other events like receive handling. This renders the code less fragile, as it completely solves the described issue above. This should be backported up to 3.1.	2024-12-18 09:49:20 +01:00
Amaury Denoyelle	14710b5e6b	MEDIUM/OPTIM: mux-quic: do not rebuild frms list on every send A newly introduced frames list member has been defined into QCC instance with pacing implementation. This allowed to preserve STREAM frames built between different emission scheduled by pacing, without having to regenerate it if no new QCS data is available. Generalize this principle outside of pacing scheduling. Now, the frames list will be reused accross several qcc_io_send() usage. Frames list is only cleared when necessary. This will force its refreshing in the next qcc_io_send() via qcc_build_frms_list(). Frames list refreshing is performed in the following cases : * on successful transfer from stream snd_buf / done_ff / shut * on stream reset or read abort * on max_data/max_stream_data reception with window increase Note that the two first cases are in fact covered directly due to qcc_send_stream() usage when QCS is (re)inserted into the send_list. The main objective of this patch will be to remove QUIC MUX pacing specific code path. It could also provide better performance as emission of large frames may often be rescheduled due to transport layer, either on congestion or full socket buffer. When QUIC MUX is rescheduled, no new data is available and frames list can be reuse as-is, avoiding an unecessary loop over send_list. This should be backported up to 3.1.	2024-12-18 09:49:02 +01:00
Amaury Denoyelle	9ecc1a8e57	MINOR: mux-quic: split STREAM and RS/SS emission This commit is a follow-up of the previous one which defines function qcc_build_frms(). This function implements looping over qcc send_list, to both encode and send individually any STOP_SENDING and RESET_STREAM, but also encode STREAM frames as a preparator step. STREAM frames were then sent as a list outside of qcc_build_frms() via qcc_send_frames(). Extract STOP_SENDING/RESET_STREAM encoding and emission step into a new function qcc_emit_rs_ss(). The code is thus cleaner. In particular it highlights that an error during STOP_SENDING/RESET_STREAM emission stage is fatal and prevent any STREAM frames processing. This should be backported up to 3.1.	2024-12-18 09:40:21 +01:00
Amaury Denoyelle	244dc00b09	MINOR: mux-quic: extract code to build STREAM frames list Extracts code responsible to generate STREAM, RESET_STREAM and STOP_SENDING frames for each qcs instances registered in qcc send_list. It is moved from qcc_io_send() to its owned new function qcc_build_frms(). This commit does not bring functional change. It is a preparatory step to adapt QUIC MUX send mechanism to allow reusing of qcc frms list accross qcc_io_send() invokation. As a side change, qcc_tx_frms_free() is renamed to qcc_clear_frms(). This better highlights its relationship with qcc_build_frms(). This should be bkacported up to 3.1.	2024-12-18 09:38:19 +01:00
Amaury Denoyelle	e296585ae9	MEDIUM/OPTIM: mux-quic: implement purg_list This commit is part of the current serie which aims to refactor and improve overall performance of QUIC MUX I/O handler. qcc_io_process() is responsible to perform some internal operations on QUIC MUX after I/O completion. It is notably called on every qcc_io_cb() tasklet handler. The most intensive work on it is the purging of QCS instances after transfer completion. This was implemented by looping on QCC streams tree and inspecting the state of every QCS. The purpose of this commit is to optimize this processing. A new purg_list QCC member is defined. It is responsible to list every QCS instances whose transfer has been completed. It is thus safe to reuse <el_send> QCS list attach point. Stream purging will thus only loop on purg_list instead of every known QCS. This should be backported up to 3.1.	2024-12-18 09:33:52 +01:00
Amaury Denoyelle	4b42dd4ae0	MEDIUM/OPTIM: mux-quic: define a recv_list for demux resumption This commit is part of the current serie which aims to refactor and improve overall performance of QUIC MUX I/O handler. Define a recv_list element into qcc structure. This is used to registered every instance of qcs which are currently blocked on demuxing, which happen on no more space in <rx.appbuf>. The purpose of this patch is to reduce qcc_io_recv() CPU usage. Now, only recv_list iteration is performed, instead of the previous looping over every qcs instances. This is useful as qcc_io_recv() is called each time qcc_io_cb() is scheduled, even if only sending condition was the wakeup origin. A qcs is not inserted into recv_list immediately after blocking on demux full buffer. Instead, this is only done after unblocking via stream rcv_buf callback, which ensure that new buffer space is available. This should be backported up to 3.1.	2024-12-18 09:23:41 +01:00
Amaury Denoyelle	0a53a008d0	MINOR: mux-quic: refactor wait-for-handshake support This commit refactors wait-for-handshake support from QUIC MUX. The flag logic QC_CF_WAIT_HS is inverted : it is now positionned only if MUX is instantiated before handshake completion. When the handshake is completed, the flag is removed. The flag is now set directly on initialization via qmux_init(). Removal via qcc_wait_for_hs() is moved from qcc_io_process() to qcc_io_recv(). This is deemed more logical as QUIC MUX is scheduled on RECV to be notify by the transport layer about handshake termination. Moreover, qcc_wait_for_hs() is now called if recv subscription is still active. This commit is the first of a serie which aims to refactor QUIC MUX I/O handler and improves its overall performance. The ultimate objective is to be able to stream qcc_io_cb() by removing pacing specific code path via qcc_purge_sending(). This should be backported up to 3.1.	2024-12-18 09:23:41 +01:00
Amaury Denoyelle	9dcd2369e2	MINOR: quic: add traces Add some traces to better follow QUIC MUX scheduling, in particular with pacing interaction. This should be backported up to 3.1.	2024-12-18 09:20:20 +01:00
Amaury Denoyelle	2e3542bec6	BUG/MEDIUM: mux-quic: do not mix qcc_io_send() return codes with pacing With pacing implementation, qcc_send_frames() return code has been extended to report emission interruption due to pacing limitation. This is used only in qcc_io_send(). However, its invokation may be skipped using 'sent_done' label. This happens on emission failure of a STOP_SENDING or RESET_STREAM (either memory allocation failure, or transport layer rejection). In this case, return values are mixed as qcs_send() is wrongly compared against pacing interruption condition. This value corresponds to the length of the last built STREAM frames. If by mischance the last frame was 1 byte long, qcs_send() return value is equal to pacing interruption condition. This has several effects. If pacing is activated, it may lead to unneeded wakeup on QUIC MUX. Worst, if pacing is not used, a BUG_ON() crash will be triggered. Fix this by using a different variable dedicated to qcc_send_frames() return value. By default it is initialized to 0. This ensures that pacing code won't be activated in case qcc_send_frames() is not used. This must be backported up to 3.1.	2024-12-18 09:18:48 +01:00
Amaury Denoyelle	7885a3b3e1	MINOR: mux-quic: clean up zero-copy done_ff callback Recently, an issue was found with QUIC zero-copy forwarding on 3.0 version. A desynchronization could occur internally in QCS Tx bytes counters which would cause a BUG_ON() crash on qcs_destroy() when the stream is detached. It was silently fixed in version 3.1 by the following patch. As it was considered as an optimization, it was not scheduled yet for backport. 6697e87ae5e1f569dc87cf690b5ecfc049c4aab0 MINOR: mux-quic: Don't send an emtpy H3 DATA frame during zero-copy forwarding This mistake has been caused due to some counter-intuitive manipulation in QUIC zero-copy implementation. Try to streamline this in QUIC MUX done_ff callback and its application protocol counterpart. Especially for values exchanged between MUX and application on one side, and MUX and stconn layer as done_fastfwd return value. First, application done_ff callback now returns the length of the wholly encoded frame. For HTTP/3, it means header length + payload length h3 frame. This value can then be reused as qcc_send_stream() argument to increase QCS Tx soft offset. As previously, special care has been taken to ensure that QUIC MUX done_ff only return the transferred data bytes. Thus, any extra offset for HTTP/3 header is properly excluded. This is mandatory for stconn layer to consider the transfer has completed. Secondly, remove duplicated code in application done_ff to reset iobuf info. This is now factorize in QUIC MUX done_ff itself. This patch is related to github issue #2678.	2024-12-05 16:57:31 +01:00
Amaury Denoyelle	08f557f0c4	BUG/MEDIUM: mux-quic: remove pacing status when everything is sent TASK_F_USR1 is used by MUX tasklet when emission has been interrupted due to pacing. When the tasklet runs again, only qcc_purge_sending() will be called as an optimization. Pacing status is only removed via qcc_wakeup(). Until then, TASK_F_USR1 is not cleared. This causes an issue after emission with pacing completion if the MUX tasklet is woken up for a recv subscribe, as qcc_wakeup() is not used by quic-conn layer. The tasklet will incorrectly run only for pacing emission, without handling reception process. Worst, a crash will occur if QCC tx frames list is empty, due to a BUG_ON() in qcc_purge_sending(). Recv subscribe is only used for 0-RTT, when QUIC MUX is instantiated before quic-conn handshake completion. Thus, this bug can only be reproduced with 0-rtt. Furthermore, MUX must already have emitted at least a few response bytes with pacing, before QUIC handshake completion. It cannot easily be reproduced, at least with CLI clients where the handshake is always already completed before MUX exchanges. To fix this, remove TASK_F_USR1 when pacing emission has been completed. At least, this prevents BUG_ON() on qcc_purge_sending() as it won't be called with an empty QCC Tx frame list anymore. However, this bug has revealed that MUX tasklet architecture is not suitable when both handling reception and emission part. This will be improved in a future serie of patches. This should fix github issue #2796. This must be backported up to 3.1.	2024-12-05 11:04:06 +01:00
Amaury Denoyelle	9d4c26ebaa	BUG/MEDIUM: quic: prevent stream freeze on pacing On snd_buf completion, QUIC MUX tasklet is scheduled if newly data has been transferred from the stream layer. Thanks to qcc_wakeup(), pacing status is removed from tasklet, which ensure next emission will reset Tx frames and use the new data. Tasklet is not scheduled if MUX is already subscribed on send due to a previous blocking condition. This is an optimization to prevent an unneeded IO handler execution. However, this causes a bug if an emission is currently delayed due to pacing. As pacing status is not removed on snd_buf, next emission process will continue emission with older data without refreshing the newly transferred one. This causes a transfer freeze. Unless there is some activity on the connection, the transfer will be eventually aborted due to idle timeout. To fix this, remove TASK_F_USR1 if tasklet wakeup is not called due to send subscription. Note that this code is also duplicated in done_ff for zero-copy transfer. This must be backported up to 3.1.	2024-11-29 14:35:10 +01:00
Amaury Denoyelle	22bd92a87f	MINOR: mux-quic: use sched call time for pacing QUIC pacing was recently implemented to limit burst and improve overall bandwidth. This is used only for MUX STREAM emission. Pacing requires nanosecond resolution. As such, it used now_cpu_time() which relies on clock_gettime() syscall. The usage of clock_gettime() has several drawbacks : * it is a syscall and thus requires a context-switch which may hurt performance * it is not be available on all systems * timestamp is retrieved multiple times during a single task execution, thus yielding different values which may tamper pacing calculation Improve this by using task_mono_time() instead. This returns task call time from the scheduler thread context. It requires the flag TASK_F_WANTS_TIME on QUIC MUX tasklet to force the scheduler to update call time with now_mono_time(). This solves every limitations listed above : * syscall invokation is only performed once before tasklet execution, thus reducing context-switch impact * on non compatible system, a millisecond timer is used as a fallback which should ensure that pacing works decently for them * timer value is now guaranteed to be fixed duing task execution	2024-11-25 11:21:45 +01:00
Amaury Denoyelle	3704e0e174	BUG/MINOR: mux-quic: fix show quic report of QCS prepared bytes On show quic, each MUX streams are listed with their various indicator for buffering on Rx and Tx. In particular, txoff displays in parenthesis the current level of data prepared by the upper stream instance not yet emitted by QUIC transport layer. This value is only accessible after a substract operation. However, there was a typo which caused the result to be always 0. Fix this by reusing the correct offsets in the calculation. This should be backported up to 3.0.	2024-11-25 11:21:28 +01:00
Amaury Denoyelle	5a29fd6c61	MINOR: mux_quic/pacing: display pacing info on show quic To improve debugging, extend "show quic" output to report if pacing is activated on a connection. Two values will be displayed for pacing : * a new counter paced_sent_ctr is defined in QCC structure. It will be incremented each time an emission is interrupted due to pacing. * pacing engine now saves the number of datagrams sent in the last paced emission. This will be helpful to ensure burst parameter is valid.	2024-11-19 16:21:05 +01:00
Amaury Denoyelle	796446a15e	MAJOR: mux-quic: support pacing emission Support pacing emission for STREAM frames at the QUIC MUX layer. This is implemented by adding a quic_pacer engine into QCC structure. The main changes have been written into qcc_io_send(). It now differentiates cases when some frames have been rejected by transport layer. This can occur as previously due to congestion or FD buffer full, which requires subscribing on transport layer. The new case is when emission has been interrupted due to pacing timing. In this case, QUIC MUX I/O tasklet is rescheduled to run with the flag TASK_F_USR1. On tasklet execution, if TASK_F_USR1 is set, all standard processing for emission and reception is skipped. Instead, a new function qcc_purge_sending() is called. Its purpose is to retry emission with the saved STREAM frames list. Either all remaining frames can now be send, subscribe is done on transport error or tasklet must be rescheduled for pacing purging. In the meantime, if tasklet is rescheduled due to other conditions, TASK_F_USR1 is reset. This will trigger a full regeneration of STREAM frames. In this case, pacing expiration must be check before calling qcc_send_frames() to ensure emission is now allowed.	2024-11-19 16:16:48 +01:00
Amaury Denoyelle	ede4cd4c2e	MINOR: mux-quic: encapsulate QCC tasklet wakeup QUIC MUX will be responsible to drive emission with pacing. This will be implemented via setting TASK_F_USR1 before I/O tasklet wakeup. To prepare this, encapsulate each I/O tasklet wakeup into a new function qcc_wakeup(). This commit is purely refactoring prior to pacing implementation into QUIC MUX.	2024-11-19 16:16:48 +01:00
Amaury Denoyelle	4a94a018f0	MINOR: mux-quic: define a tx STREAM frame list member For STREAM emission, MUX QUIC previously used a local list defined under qcc_io_send(). This was suitable as either all frames were sent, or emission must be interrupted due to transport congestion or fatal error. In the latter case, the list was emptied anyway and a new frame list was built on future qcc_io_send() invokation. For pacing, MUX QUIC may have to save the frame list if pacing should be applied across emission. This is necessary to avoid to unnecessarily rebuilt stream frame list between each paced emission. To support this, STREAM list is now stored as a member of QCC structure. Ensure frame list is always deleted, even on QCC release, using newly defined utility function qcc_tx_frms_free().	2024-11-19 16:16:48 +01:00
Amaury Denoyelle	8039fe43e6	MINOR: quic/pacing: support pacing emission on quic_conn layer Pacing will be implemented for STREAM frames emission. As such, qc_send_mux() API has been extended to add an argument to a quic_pacer engine. If non NULL, engine will be used to pace emission. In short, no more than one datagram will be emitted for each qc_send_mux() invokation. Pacer is then notified about the emission and a timer for a future emission is calculated. qc_send_mux() will return PACING error value, to inform QUIC MUX layer that it will be responsible to retry emission after some delay.	2024-11-19 16:16:48 +01:00
Amaury Denoyelle	7fd48a5723	MINOR: quic: extend qc_send_mux() return type with a dedicated enum This commit is part of a adjustment on QUIC transport send API to support pacing. Here, qc_send_mux() return type has been changed to use a new enum quic_tx_err. This is useful to explain different failure causes of emission. For now, only two values have been defined : NONE and FATAL. When pacing will be implemented, a new value would be added to specify that emission was interrupted on pacing. This won't be a fatal error as this allows to retry emission but not immediately.	2024-11-19 16:16:48 +01:00
Willy Tarreau	f66bfcff96	BUG/MINOR: mux_quic: make sure to always apply offsets to now_ms in expiration Now_ms can be zero nowadays, so it's not suitable for direct assignment to t->expire, as there's a risk that the timer never wakes up once assigned (TICK_ETERNITY). Let's use tick_add(now_ms, 0) for an immediate wakeup instead. The impact looks nul since the task is also woken up, but better not leave such tasks in the timer tree anyway. This should be backported where it applies.	2024-11-15 15:41:21 +01:00
Willy Tarreau	4fd6d15344	MINOR: mux-quic/h3: count glitches when they're reported The qcc_report_glitch() function is now replaced with a macro to support enumerating counters for each individual glitch line. For now this adds 36 such counters. The macro supports an optional description, though that is not being used for now. As a reminder, this requires to build with -DDEBUG_GLITCHES=1.	2024-11-14 20:43:33 +01:00
Amaury Denoyelle	dcf334168c	MINOR: quic: move qc_send_mux() prototype into quic_tx.h qc_send_mux() is defined in quic_tx.c. As such, its prototype is moved from quic_conn.h to quic_tx.h.	2024-10-31 15:35:31 +01:00
Amaury Denoyelle	68c8c91023	BUG/MINOR: mux-quic: do not close STREAM with empty FIN if no data sent A stream may be shut without any HTX EOM reported to report a proper closure. This is the case for QCS instances flagged with QC_SF_UNKNOWN_PL_LENGTH. Shut is performed with an empty FIN emission instead of a RESET_STREAM. This has been implemented since the following patch : 24962dd1784dd22babc8da09a5fc8769617f89e3 BUG/MEDIUM: mux-quic: do not emit RESET_STREAM for unknown length However, in case of HTTP/3, an empty FIN should only be done after a full message is emitted, which requires at least a HEADERS frame. If an empty FIN is emitted without it, client may interpret this as invalid and close the connection. To prevent this, fallback to a RESET_STREAM emission if no data were emitted on the stream. This was reproduced using ngtcp2-client with 10% loss (-r 0.1) on a remote host, with httpterm request "/?s=100k&C=1&b=0&P=400". An error ERR_H3_FRAME_UNEXPECTED is returned by ngtcp2-client when the bug occurs. Note that this change is incomplete. The message validity depends solely on the application protocol in use. As such, a new app_ops callback should be implemented to ensure the stream is closed accordingly. However, this first patch ensures that at least HTTP/3 case is valid while keeping a minimal backport process. This should be backported up to 2.8.	2024-10-21 11:24:38 +02:00
Amaury Denoyelle	b200d3d80b	MINOR: mux-quic: simplify sending of empty STREAM FIN An empty STREAM frame can be emitted by QUIC MUX to notify about a delayed FIN when there is no data left to transmit. This requires a tedious comparison on stream offset in qmux_ctrl_send() to ensure an empty stream frame is not always considered as retransmitted, which is necessary to locally close the QCS instance. Simplify this by unsubscribe from streamdesc layer when the QCS is locally closed on FIN transmission notification. This prevents all future retransmitted frames to be reported to the QCS instance, especially any potentially retransmitted empty FIN.	2024-10-21 11:21:07 +02:00
Amaury Denoyelle	0918c41ef6	BUG/MEDIUM: quic: support wait-for-handshake wait-for-handshake http-request action was completely ineffective with QUIC protocol. This commit implements its support for QUIC. QUIC MUX layer is extended to support wait-for-handshake. A new function qcc_handle_wait_for_hs() is executed during qcc_io_process(). It detects if MUX processing occurs after underlying QUIC handshake completion. If this is the case, it indicates that early data may be received. As such, connection is flagged with CO_FL_EARLY_SSL_HS, which is necessary to block stream processing on wait-for-handshake action. After this, qcc subscribs on quic_conn layer for RECV notification. This is used to detect QUIC handshake completion. Thus, qcc_handle_wait_for_hs() can be reexecuted one last time, to remove CO_FL_EARLY_SSL_HS and notify every streams flagged as SE_FL_WAIT_FOR_HS. This patch must be backported up to 2.6, after a mandatory period of observation. Note that it relies on the backport of the two previous patches : - MINOR: quic: notify connection layer on handshake completion - BUG/MINOR: stream: unblock stream on wait-for-handshake completion	2024-10-16 11:51:35 +02:00
Amaury Denoyelle	5a5950e42d	MINOR: quic: notify connection layer on handshake completion Wake up connection layer on QUIC handshake completion via quic_conn_io_cb. Select SUB_RETRY_RECV as this was previously unused by QUIC MUX layer. For the moment, QUIC MUX never subscribes for handshake completion. However, this will be necessary for features such as the delaying of early data forwarding via wait-for-handshake. This patch will be necessary to implement wait-for-handshake support for QUIC. As such, it must be backported with next commits up to 2.6, after a mandatory period of observation.	2024-10-16 11:42:06 +02:00
Amaury Denoyelle	232083c3e5	BUG/MEDIUM: mux-quic: ensure timeout server is active for short requests If a small request is received on QUIC MUX frontend, it can be transmitted directly with the FIN on attach operation. rcv_buf is skipped by the stream layer. Thus, it is necessary to ensure that there is similar behavior when FIN is reported either on attach or rcv_buf. One difference was that se_expect_data() was called only for rcv_buf but not on attach. This most obvious effect is that stream timeout was deactivated for this request : client timeout was disabled on EOI but server one not armed due to previous se_expect_no_data(). This prevents the early closure of too long requests. To fix this, add an invokation of se_expect_data() on attach operation. This bug can simply be detected using httpterm with delay request (for example /?t=10000) and using smaller client/server timeouts. The bug is present if the request is not aborted on timeout but instead continue until its proper HTTP 200 termination. This has been introduced by the following commit : 85eabfbf672c57e4ed082da1b96c95348b331320 MEDIUM: mux-quic: Don't expect data from server as long as request is unfinished This must be backported up to 2.8.	2024-10-10 17:20:39 +02:00
Amaury Denoyelle	e7578084b0	MINOR: quic: implement dedicated type for out-of-order stream ACK QUIC streamdesc layer is responsible to handle reception of ACK for streams. It removes stream data from the underlying buffers on ACK reception. Streamdesc layer treats ACK in order at the stream level. Out of order ACKs are buffered in a tree until they can be handled on older data acknowledgement reception. Previously, qf_stream instance which comes from the quic_tx_packet was used as tree node to buffer such ranges. Introduce a new type dedicated to represent out of order stream ack data range. This type is named qc_stream_ack. It contains minimal infos only relative to the acknowledged stream data range. This allows to reduce size of frequently used quic_frame with the removal of tree node from qf_stream. Another side effect of this change is that now quic_frame are always released immediately on ACK reception, both in-order and out-of-order. This allows to also release the quic_tx_packet instance which should reduce memory consumption. The drawback of this change is that qc_stream_ack instance must be allocated on out-of-order ACK reception. As such, qc_stream_desc_ack() may fail if an error happens on allocation. For the moment, such error is silenly recovered up to qc_treat_rx_pkts() with the dropping of the received packet containing the ACK frame. In the future, it may be useful to close the connection as this error may only happens on low memory usage.	2024-10-04 17:56:45 +02:00
Amaury Denoyelle	c1d714156e	BUG/MAJOR: mux-quic: do not crash on empty STREAM frame emission Most of the time STREAM frames emitted by QUIC MUX have some data in it. However, it is possible to use an empty frame when a delayed FIN must be transferred. Recently, QUIC MUX send callback notification has been refactored. Now, this callback is blindly called by quic_conn lower layer each time a STREAM frame is built into a newly Tx packet. QUIC MUX is responsible to ensure the notified frame corresponds to newly emitted data or retransmission. Offsets are used for this comparison, but this requires special care for empty FIN frames. Sadly, the comparison written to determine if an empty FIN frame was sent for the first time or retransmitted is not correct. This caused such frame to always be dismissed as retransmission in QUIC MUX sent callback. This prevented the related QCS instance to be removed from the send_list, causing qcc_io_send() to retry a new emission. This was finally interrupted by the BUG_ON() assertion to prevent an infinite loop. Fix this crash by updating the condition in QUIC MUX send callback. For empty STREAM frame, it is sufficient to check if QC_SF_FIN_STREAM was already removed or not to detect a retransmission. Indeed, empty STREAM frames are never used outside of delayed FIN reporting. No need to backport. This crash was introduced in the current dev branch by the following commit. d7f4e5abf0b7129329d0ea716c104474fd934bc6 MEDIUM: quic: strengthen MUX send notification	2024-10-04 11:31:11 +02:00
Amaury Denoyelle	58b7a72d07	BUG/MINOR: mux-quic: fix crash on qcc_init() early return qcc_release() may be used in case qcc_init() cannot complete. In this case, connection instance is NULL. As such, it cannot be dereferenced without testing it first. This should fix github coverity report #2739. No backport needed.	2024-10-02 17:06:31 +02:00
Amaury Denoyelle	943e48dadd	MINOR: quic: store streambuf in a streamdesc tree qc_stream_desc layer is used by QUIC MUX to store emitted STREAM data until their acknowledgement. Each stream with Tx capability can allocate its own qc_stream_desc. In turn, each stream desc can have one or multiple data buffers. This is useful when a MUX stream releases a buffer and allocate a new one, to preserve bandwith without waiting to receive all acknowledgement of the previous buffer. Each buffer is encapsulated in a qc_stream_buf structure. Previously, it was stored as a list into qc_stream_desc. Change this storage to use a tree instead. Each buffer is indexed by their offset. This commit does not introduce functional changes. However, this rearchitecture will be necessary for future commit to extend ACK management which require fetching individual buffer instance, not just the first or last element of a streamdesc, by their offset.	2024-10-01 16:19:41 +02:00
Amaury Denoyelle	db68f8ed86	MINOR: quic: refactor STREAM room notification qc_stream_desc is an intermediary layer between QUIC MUX and quic_conn. It is a facility which permits to store data to emit and keep them for retransmission until acknowledgment. This layer is responsible to notify QUIC MUX each time a buffer is freed. This is necessary as MUX buffer allocation is limited by the underlying congestion window size. Refactor this to use a mechanism similar to send notification. A new callback notify_room can now be registered to qc_stream_desc instance. This is set by QUIC MUX to qmux_ctrl_room(). On MUX QUIC free, special care is now taken to reset notify_room callback to NULL. Thanks to this refactoring, further adjustment have been made to refine the architecture. One of them is the removal of qc_stream_desc QC_SD_FL_OOB_BUF, which is now converted to a MUX layer flag QC_SF_TXBUF_OOB.	2024-10-01 16:19:25 +02:00

1 2 3 4 5 ...

483 Commits