haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2026-05-08 22:46:10 +02:00

Author	SHA1	Message	Date
Willy Tarreau	b52a0e6782	BUG/MINOR: h2: only accept :protocol with extended CONNECT As reported by Huangbin Zhan in github issue #3355, we're too lax on the :protocol pseudo header. It is currently accepted with regular CONNECT as well as non-CONNECT methods while it only ought to be accepted with extended CONNECT (i.e. CONNECT after the connection negotiated the RFC8441 extension). Let's refine the check in H2 by leveraging the new flag H2_MSGF_EXT_CONN_OK that is passed by the caller when the connection supports the extension. This is sufficient to sort the various cases. The proto upgrade regtest was updated to verify that CONNECT with :protocol without nego and another method with nego and :protocol both fail. Thanks to Huangbin Zhan (@zhanhb) for the report and helpful reproducer. This needs to be backported to all versions. It relies on these patches first: REGTESTS: http-messaging: always send RFC8441 client settings to use ext connect BUG/MINOR: mux-h2: condition the processing of 8441 extension to global setting MINOR: mux-h2: add a new message flag to indicate ext connect support	2026-05-05 14:10:36 +02:00
Willy Tarreau	96f7ff4fdd	MINOR: mux-h2: add a new message flag to indicate ext connect support The new message flag H2_MSGF_EXT_CONN_OK indicates that the connection supports extended connect. This will be used in a subsequent fix.	2026-05-05 14:09:49 +02:00
Willy Tarreau	9986ad65a4	BUG/MINOR: mux-h2: condition the processing of 8441 extension to global setting When rfc8441 (extended connect) is disabled via h2-workaround-bogus-websocket-clients, we properly refrain from advertising support for extended connect, but we should also ignore the incoming setting, otherwise it can remain enabled if the client advertises it. This should be backported to stable versions.	2026-05-05 14:09:49 +02:00
Willy Tarreau	a950c4b72d	BUG/MINOR: h2: add decoding for :protocol in traces Function h2_phdr_to_list() was missing the decoding for the :protocol header and would emit :UNKNOWN in this case. It's only used in traces so it's not important. The fix can be backported in all versions.	2026-05-05 14:09:49 +02:00
Willy Tarreau	47bb725d88	REGTESTS: http-messaging: always send RFC8441 client settings to use ext connect The tests were validating extended connect without sending the setting in the client settings frame. It currently works due to a bug, so let's fix the vtc first.	2026-05-05 14:09:49 +02:00
William Lallemand	166dfcbbee	BUG/MINOR: mworker/cli: check ci_insert() return value in pcli_parse_request() All ci_insert() calls in pcli_parse_request() were ignoring the return value, silently miscounting the bytes to forward if an insertion failed. Add a check on each call and return -1 on failure. In practice this has no impact: the channel receive machinery enforces a maxrewrite reserve, so there is always sufficient room in the buffer for these small prefixes by the time pcli_parse_request() is called. Must be backported in every maintained versions, the list of available commands might change.	2026-05-04 19:03:48 +02:00
William Lallemand	946921c199	BUG/MEDIUM: mworker/cli: fix user and operator permission via @@<pid> in master CLI When @@<pid> is matched in pcli_parse_request(), no "operator -" or "user -" is being sent before the command, like it's done for @<pid>. It leads to privileges not being respected and commands are sent as admin. Fix this by applying the access-level downgrade in the @@<pid> path, like it's done for @<pid>. Must be backported to 3.2. Reported-by: Omkhar Arasaratnam <omkhar@linkedin.com>	2026-05-04 19:03:48 +02:00
Willy Tarreau	dae302d479	REGTESTS: add a regtest to validate various NTLM transitions This test first performs two successive requests over the same connection where reuse is expected, then perform two 401 which must both work, testing both the transition from null->sess, and sess->sess. This test could be backported to detect changes related to private sessions. Thanks to Omkhar Arasaratnam for the test.	2026-05-04 18:58:19 +02:00
Willy Tarreau	4fc1e39371	BUG/MAJOR: http-ana: fix private session retrieval on NTLM During the architectural review leading to commit 90b2154d93 ("MEDIUM: muxes: always set conn->owner to the session that owns the connection"), we wondered whether srv_conn->owner could ever be NULL or even invalid, or if it ought to be changed to sess, and the projections of various use cases, as well as a number of attempts to fool it led us to conclude that it was always valid since the connection is private. So it was considered safer not to start fiddling with the pointer in case it could still match a previous session after a reuse, which would match the scenario described in the session_add_conn() comment. Actually there was exactly one case where a NULL could be met, and that was covered by the preliminary call to conn_set_owner() that was removed in that patch, precisely related to the one that the next patch tried to address: in http-reuse always, after the second request on a connection releases the connection, the owner can now become NULL, so if an NTLM header is seen at this point, it will crash. Interestingly, after the immediately following commit was merged, d93c53b0df ("MEDIUM: session: always reset the conn->owner on backend when installing mux"), the problem became immediate as the conn's owner is now null during operation if the connection is not private, and now the first response in NTLM is sufficient to crash the process. On the other hand, thanks to the two patches above, we're now certain never to meet a different session, which was the sought goal: either the session is normal and it has no owner, or it's private and it has <sess> as owner. Also with HTTP/1 (since the code explicitly checks for H1), there may be a single request at a time on a connection so the owner should either be the session or NULL. So this patch finally implements the original plan, to pass <sess> to session_add_conn(). The call is idempotent if the owner is already set, but at least the function performs some preliminary sanity checks which are quite welcome, so better continue to always call it. Note that this is only for 3.4 or any branch that has exactly the two patches above. And if the patches above were to ever be backported (together), this one would be needed as well. Thanks to Omkhar Arasaratnam for reporting this regression.	2026-05-04 18:57:15 +02:00
William Lallemand	726aa2dfb2	BUG/MEDIUM: ssl/sample: check output buffer size in aes_cbc_enc converter AES-CBC uses a 16-byte block size, and PKCS padding always adds at least one byte (up to a full 16 bytes when input is already block-aligned), so the encrypted output is always larger than the input. Without checking that the output buffer can hold the padded result, encryption could overflow it. Add a pre-encryption guard for block cipher (blksize > 1) that rejects the operation when the output buffer is too small. No backport needed. Reported-by: Omkhar Arasaratnam <omkhar@linkedin.com>	2026-05-04 17:38:15 +02:00
Willy Tarreau	b9028ee1e4	BUG/MAJOR: net_helper: also fix tcp_options_list for OOB write loop The exact same issue that was fixed for ip.fp with commit dbf471f99a ("BUG/MAJOR: net_helper: ip.fp infinite loop on malformed tcp options") also exists for tcp_options_list: an option with a zero size will prevent the offset from making any progress and will loop forever. In this case, since we produce output, trash->data gets incremented on each loop and the byte at ofs is used to fill the memory there, quickly overflowing the area. The exact same fix as above was applied here again, including the uchar cast to make sure we don't go back-and-forth between two positions. Such TCP options are not valid and will not be reported by the TCP stack, however they can happen in case options are passed in headers by a first proxy, or are offered in a return directive fed from a query string as a means of providing a debugging tool for admins. Thanks to Omkhar Arasaratnam for reporting this issue. This fix must be backported to 3.2 where the commits above were backported.	2026-05-04 17:26:44 +02:00
Christopher Faulet	1d54b9f70e	BUG/MINOR: resolvers: Free opts on parse error in resolv_parse_do_resolve() The error handler at do_resolve_parse_error freed varname and resolvers_id but missed freeing rule->arg.resolv.opts (allocated via calloc). Added ha_free(&rule->arg.resolv.opts) to the cleanup path. This patch could be backported to all stable branches.	2026-05-04 16:33:56 +02:00
Christopher Faulet	3b1db89283	BUG/MINOR: resolvers: Fix lookup for a hostname in the state-file tree In resolv_check_response(), the condition 'if (!srvrq->named_servers)' was inverted - it ran the named server lookup when the tree was empty/NULL instead of when it was populated. This prevented proper hostname-based matching of SRV records to servers from the state file. The issue was introduced when tree of servers was replaced by a cebis tree (fdf6fd5b45). This patch must be backported to 3.3.	2026-05-04 16:33:56 +02:00
Christopher Faulet	baf0924a50	BUG/MINOR: resolvers: Free new requester on error when linking a resolution If an error is triggered while the requester was allocated, it is not immediately released. It is not really a memory leak because the requester will be reused laster, on a next attempt, or released on deinit. For a do-resolv action, the requester is released with the stream. So no leak here. But it is probably not expected to keep a newly allocated requester in case of error. This patch could be backported as far as 2.6.	2026-05-04 16:33:56 +02:00
Christopher Faulet	95ee47fd32	CLEANUP: resolvers: Remove duplicated line when resolvers proxy is initialized In resolvers_setup_proxy(), "px->conn_retries" was initialized twice. Let's remove the line with no comment.	2026-05-04 16:33:56 +02:00
Christopher Faulet	e46090f424	BUG/MINOR: tcpcheck: Properly report error for http health-checks When an error is reported on an expect rule for tcp and http health-checks, a dedicated message is reported with details about the wrong match. However, this was never performed for HTTP health-check because of the wrong test on the check type. In addition, when an error was reported on an "expect hdr" rule, a break statement was missing. So the error message could be truncated (but never emitted because of the issue above). This patch should be backported to all stable versions.	2026-05-04 16:33:56 +02:00
Willy Tarreau	4ef31cb5c2	BUG/MINOR: dns: always validate the source address in responses When we removed the use of connect() to reach DNS servers in 3.3 with commit 2c7e05f80e ("MEDIUM: dns: don't call connect to dest socket for AF_INET*"), we accidentally lost a check on the server's address in responses, opening the possibility of spoofed DNS responses for someone who knows both haproxy's IP:port and the transaction ID. In practice, the impact is very low, because: - DNS servers IP addresses are almost always known, and often even among the widely used ones (1.1.1.1, 8.8.8.8 etc), and their port is 53. - all DNS "security" relies on the ignorance of the transaction ID and is possible source port, so if either the attacker is on-path and sees them, or it's off-path and has to guess them, but in any case it's trivial to spoof the known server in responses, with or without the check. Regardless, let's not further weaken the protocol and do the check. Thanks to Omkhar Arasaratnam for reporting this issue. An interesting observation while testing this fix was that the code does support UNIX dgram sockets (via connect()) but that since we don't bind to a local UNIX socket to send requests, the server's recvfrom() doesn't get any address and has nowhere to respond to. So in practice while the code is designed to deal with UNIX sockets, these cannot work by design. This fix must be backported to 3.2 where the commit above was backported.	2026-05-04 16:27:26 +02:00
William Lallemand	4153aae932	DOC: acme: document missing acme-vars and provider-name keywords Both keywords are used with dns-01 and dns-persist-01 challenges to pass information to an external DNS provisioning tool (e.g. the dataplaneAPI) via the "dpapi" sink. provider-name sets the DNS provider identifier and acme-vars passes arbitrary tool-specific variables. Thanks to @oliwer for reporting the issue. Must be backported to 3.2, however previous version don't have "dns-persist-01".	2026-05-04 14:44:53 +02:00
Willy Tarreau	8ffb4b5a09	DOC: otel: update the filter's status and URL in the docs The docs (readme and configuration.txt) still used to mention that OTEL was under development. Now that it's released, let's indicate that it's ready with the download URL.	2026-05-04 14:38:35 +02:00
Willy Tarreau	7e09e2a916	BUG/MAJOR: mux-h2: preset MSGF_BODY_CL on H2_SF_DATA_CLEN in h2c_dec_hdrs() Commit d12edebe4a ("BUG/MAJOR: mux-h2: detect incomplete transfers on HEADERS frames as well") tried to enforce strict matching between advertised content-length and transferred data when dealing with ES on a headers frame. It purposely arranged the code so that it would cover both headers and trailers. The problem is, in h2c_dec_hdrs() we preset message flags (msgf) based on the current state and knowledge related to the stream being processed, then we pass these flags to the headers parser and use their final state to perform some extra checks. MSGF_BODY_CL was set by the parsers themselves when processing a content-length header, but when parsing a trailers frame, it will not be set, and due to this the matching between the remaining expected content-length and the transferred data is not verified, so the fix above doesn't work for trailers. This patch sets MSGF_BODY_CL to the same value as H2_SF_DATA_CLEN so that during headers it remains zero, but it matches what was learned during headers when processing trailers. It is sufficient to re-enable the check that was attempted in the commit above. The impact remains the same as the one indicated in the commit above: in practice this can be used to force subsequent requests to fail, or when running with "http-reuse never" or when running with a totally idle server, to perform a request smuggling by constructing specially crafted request pairs where the first one is used to trigger an early response and hide parts of or all headers of the second one, to instead use a second embedded one that was not subject to analysis. As such, the risk remains moderate given the low prevalence of "http-reuse never" in production environments, and of idle servers. Again, a temporary alternative to the fix is to disable HTTP/2 by specifying "alpn http/1.1" on "bind" lines, and adding "option disable-h2-upgrade" in HTTP frontends. Many thanks to Pratham Gupta / alchemy1729 for spotting and analyzing this problem, and for providing a lightweight reproducer to illustrate the problem! This fix must be backported to all versions where the fix above was backported (i.e. all). Note that it depends on this previous commit otherwise trailers will always break: BUG/MEDIUM: mux-h2: fix the body_len to check when parsing request trailers As a side note, it's worth noting that these temporary message flags have reached a level of pain and fragility that really warrants a complete rework. Ideally we should have a pair of such flags in the h2s (one per direction) and the callers of the parsers should point to them so that they are always up to date. And by having generic HTTP flags instead of H2, we could better unify the h1/h2/h3/fcgi processors (and maybe avoid some HTX conversion). One flag could even indicate that trailers are being parsed (since they're last) so as to ease this detection down the chain.	2026-05-04 14:33:22 +02:00
Willy Tarreau	b6995d25d1	BUG/MEDIUM: mux-h2: fix the body_len to check when parsing request trailers The h2 content-length validation in commit d12edebe4a ("BUG/MAJOR: mux-h2: detect incomplete transfers on HEADERS frames as well") was insufficient. The content-length check is still ineffective on request trailers and it could not work by default due to the fact that the default body_len is used in h2c_frt_handle_headers() when processing trailers, instead of passing h2s->body_len, which was necessarily parsed before reaching trailers. Let's fix this point first, otherwise fixing the second issue would break trailers. Many thanks to Pratham Gupta / alchemy1729 for spotting and analyzing this problem, and for providing a lightweight reproducer to illustrate the problem! This fix must be backported to all versions where the fix above was backported (i.e. all).	2026-05-04 14:33:22 +02:00
William Lallemand	a8b1af277f	CLEANUP: otel: move opentelemetry outside haproxy sources The opentelemetry addons now live outside haproxy sources and is available at https://github.com/haproxytech/haproxy-opentelemetry/ The addon must be built using the EXTRA_MAKE option from HAProxy Makefile: $ PKG_CONFIG_PATH=/opt/lib/pkgconfig make -j8 TARGET=linux-glibc EXTRA_MAKE="/tmp/a/a/haproxy-opentelemetry" OTEL_DEBUG=1 OTEL_USE_VARS=1	2026-05-04 14:18:40 +02:00
Miroslav Zagorac	e16b1a44f8	BUILD: otel: removed USE_OTEL, addon is now built via EXTRA_MAKE The OpenTelemetry filter has been moved out of the haproxy source tree and now lives and is developed in the haproxy-opentelemetry repository as an external addon. Removed all hard-coded references to USE_OTEL and to the addons/otel directory from the main Makefile, as the addon is now plugged in through the generic EXTRA_MAKE hook added in the previous commit.	2026-05-04 14:15:17 +02:00
William Lallemand	66bccb7a3a	BUILD: add an EXTRA_MAKE option to build addons easily Allow to call an external Makefile called Makefile.inc in order to build complex addons. make TARGET=linux-glibc ... EXTRA_MAKE="/path/to/addon1" \ EXTRA_MAKE+="/path/to/addon2"	2026-05-04 13:49:26 +02:00
Willy Tarreau	0896ea43fc	CLEANUP: mux-h2: remove the outdated condition to release h2c on timeout The historical code dealing with timeout was left with a confusing condition that is always true but always requires some analysis. It would check if some streams were left pending before deciding to release the connection. It could indeed be problematic to leave with no timeout and an active connection! As Christopher figured, the situation cannot exist because a first check ensures there's no more h2c via h2c_may_expire(), then a call to h2_wake_some_streams() will call h2s_wake_one_stream() for each of the h2s, and all those with no h2c are purged. Thus on return from h2_wake_some_streams() we're guaranteed to have an empty tree. Let's just remove the condition and clean up the code.	2026-05-04 12:01:57 +02:00
Amaury Denoyelle	95ea3c5510	MINOR: quic: fix trace spacing when datagram is displayed Adjust spacing between individual arguments for the QUIC trace QUIC_EV_CONN_RCV when a datagram is displayed.	2026-05-04 11:18:47 +02:00
Amaury Denoyelle	a0a510f0d2	BUG/MINOR: quic: fix trace crash on datagram receive Recently, datagram reception architecture has been completely reworked to improve performance. A regression has been introduced when using traces in qc_rcv_buf() : datagram argument is uninitialized after recv syscall. This may cause a crash as CIDs buffer is dereferenced. Fix this by removing dgram argument from the affected trace. A new trace is added after quic_dgram_init() to keep the ability to display the received content. This issue has caused failure of all QUIC interop testing. No need to backport.	2026-05-04 11:18:35 +02:00
William Lallemand	71267bc6a5	BUG/MEDIUM: acme: fix stalled renewal when opportunistic DNS check fails In ACME_INITIAL_RSLV_READY, when the opportunistic DNS propagation check fails and the code falls back to ACME_CLI_WAIT, ACME_RDY_INITIAL_DNS was left set in cond_ready. Since the CLI-wait path only ever sets ACME_RDY_CLI on auth->ready, the readiness check in ACME_CLI_WAIT could never be satisfied, permanently stalling certificate renewal. Fix this by stripping ACME_RDY_INITIAL_DNS from cond_ready before falling back to the regular CLI-wait flow. Also replace the &= with a plain assignment in the success path to make the intent explicit. No backport needed, 3.4 only.	2026-05-04 11:06:12 +02:00
Amaury Denoyelle	63f853957a	BUG/MINOR: quic: fix buffer overflow with sockaddr_in46 New type sockaddr_in46 has been recently introduced. It serves as a union which can store either an IPv4 or IPv6 address. The objective is to reduce the storage size for QUIC datagrams which previously uses a sockaddr_storage field. On qc_new_conn(), source and destination addresses from the datagram are passed to the function as sockaddr_storage so that they are copied into the newly built quic_conn instance. However, the involved memcpy() is producing a buffer overflow as sockaddr_in46 is smaller than sockaddr_storage type. This patch fixes this by defining a new helper function in46un_to_addr(). This allows to convert safely sockaddr_in46 to a plain sockaddr type. The function is now used before invoking qc_new_conn(). Note that there is still other several places where union sockaddr_in46 is casted as sockaddr_storage type. However, these should be safe as in these cases sockaddr fields are accessed individually after checking ss_family. The memory issue only exists when plain memcpy is used. This bug was detected using ASAN. It generates the following traces when a QUIC connection is instantiated. ==37474==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x7c3bb9a61100 at pc 0x5631f52c3946 bp 0x7ffc83e45b50 sp 0x7ffc83e45310 READ of size 128 at 0x7c3bb9a61100 thread T0 #0 0x5631f52c3945 in __asan_memcpy (/home/amaury/work/haproxy-quic-dev/haproxy+0x3ae945) (BuildId: a7ccfd74b7a71a869b8ff8d13f6dcde8c82c1487) #1 0x5631f55f9e34 in qc_new_conn /home/amaury/work/haproxy-quic-dev/src/quic_conn.c:1311:2 #2 0x5631f558d5c3 in quic_rx_pkt_retrieve_conn /home/amaury/work/haproxy-quic-dev/src/quic_rx.c:1875:10 #3 0x5631f558330b in quic_dgram_parse /home/amaury/work/haproxy-quic-dev/src/quic_rx.c:2463:29 #4 0x5631f5625da6 in quic_lstnr_dghdlr /home/amaury/work/haproxy-quic-dev/src/quic_sock.c:206:3 #5 0x5631f6a64173 in run_tasks_from_lists /home/amaury/work/haproxy-quic-dev/src/task.c:660:26 #6 0x5631f6a6ba1e in process_runnable_tasks /home/amaury/work/haproxy-quic-dev/src/task.c:913:9 #7 0x5631f5e984c3 in run_poll_loop /home/amaury/work/haproxy-quic-dev/src/haproxy.c:2982:3 #8 0x5631f5e9a715 in run_thread_poll_loop /home/amaury/work/haproxy-quic-dev/src/haproxy.c:3212:2 #9 0x5631f5e9f732 in main /home/amaury/work/haproxy-quic-dev/src/haproxy.c:3853:2 #10 0x7f2bba8276c0 (/usr/lib/libc.so.6+0x276c0) (BuildId: ca0db5ab57a36507d61bbcf4988d344974331f19) #11 0x7f2bba8277f8 in __libc_start_main (/usr/lib/libc.so.6+0x277f8) (BuildId: ca0db5ab57a36507d61bbcf4988d344974331f19) #12 0x5631f51be594 in _start (/home/amaury/work/haproxy-quic-dev/haproxy+0x2a9594) (BuildId: a7ccfd74b7a71a869b8ff8d13f6dcde8c82c1487) No need to backport.	2026-05-04 10:49:49 +02:00
William Lallemand	9b722520a9	CI: github: add DEBUG_STRICT=2 to ASAN jobs Add an DEBUG_STRICT=2 option to the ASAN jobs in order to trigger the BUG_ON_HOT() conditions.	2026-04-30 17:46:30 +02:00
Willy Tarreau	e6b3dd7b34	CLEANUP: acl: remove duplicate test in parse_acl_expr() and unused variable aclkw->match_type was compared twice to PAT_MATCH_DOM in parse_acl_expr(). After auditing the involved types, it's only a copy-paste mistake, as no other matching method is missing, so let's drop it to avoid the confusion. Also drop variable ckw which is assigned NULL and passed to free(), and is clearly a leftover from a previous version. No backport needed.	2026-04-30 17:39:26 +02:00
Willy Tarreau	dc93168f22	BUG/MINOR: pattern: release the reference on failure to load from file In pattern_read_from_file(), in case of failure to load from the file, the newly allocated reference remains attached to the list and is never freed. Let's just do it. This can be backported to all versions since it arrived in 1.5 with commit 1e00d3853b ("MAJOR: pattern/map: Extends the map edition system in the patterns").	2026-04-30 17:39:26 +02:00
Willy Tarreau	ecb901ff6c	CLEANUP: map/cli: fix some map-related help messages Some CLI help messages used to mention "acl" instead of "map", others had unbalanced brackets. This can be backported if desired.	2026-04-30 17:39:26 +02:00
Willy Tarreau	67fdcebd79	BUG/MINOR: map: do not leak a map descriptor on load error Maps can leak a map descriptor in sample_load_map() on error. This is harmless since the process won't start after this, and this cannot be used at run time. but it's cleaner to fix it. This can be backported.	2026-04-30 17:39:26 +02:00
Willy Tarreau	3fe3a189c2	BUG/MINOR: acl: fix a possible arg corruption in smp_fetch_acl_parse() smp_fetch_acl_parse() first places the newly allocated ACL sample into the first argument to be parsed, before parsing it. The type is not changed so the first argument remains of type string. In case of error, the allocated sample is released and release_sample_expr() will call release_sample_arg() to release the argument, possibly freeing the string present there. And here's the catch: by overwriting the first arguments's ->ptr entry, it happens to be located over the ->str.size location, to not be null and to still be freed, but by pure chance thanks to aliasing. A slight reorder of the args or buffer fields could place it in the ->area and provoke a double-free, or even always make the first argument's parsing fail. Let's move the assignment after the loop has succeeded instead, and properly set type=ARGT_PTR so that we never try to free it. The bug appeared with the "acl()" sample fetch in 2.9 with commit 7fccccccea ("MINOR: acl: add acl() sample fetch") so it can be backported to 3.0.	2026-04-30 17:39:26 +02:00
Amaury Denoyelle	2d6ebd0410	MINOR: h3/hq_interop: implement stream reset on shut abort/kill-conn Adjust QUIC mux stream shut procedure when abort or kill-conn is performed. Changes are implemented directly into lclose callback for h3/h09 protocols. On abort, the stream is resetted as previously. The only change is that now a proper error code can be used, with REQUEST_CANCELLED specified for HTTP/3 protocol. Kill-conn is requested when a tcp-requect connection reject rule has been executed. In this case the stream is resetted, and the connection is also closed. This is identical to the H2 multiplexer. HTTP/3 protocol uses EXCESSIVE_LOAD as error code in this case.	2026-04-30 17:18:20 +02:00
Amaury Denoyelle	d0bd6a946b	MEDIUM: mux-quic: extend shut to app proto layer Previously, shut callback was entirely implemented in QUIC mux layer. However, this operation depends on the above application protocol, as it may define its own closure procedure and error codes. This is the case notably with HTTP/3 specification. This patch defines a stream shut API between QUIC mux and application protocol layers via the new qcc_app_ops callback lclose(). The closure reason is specified via an enum argument. Application protcol can then perform the stream closure as intended. This patch is only an architecture adjustment but should not have any functional impact. Stream closure logic was moved identically from QUIC mux into h3 and h09 lclose callback.	2026-04-30 17:18:20 +02:00
Alexander Stephan	64383e655b	BUG/MEDIUM: cli: fix master CLI connection slot leak on client disconnect In master-worker mode the master CLI proxy (mworker_proxy) has a hardcoded maxconn of 10. When a client connects to the master CLI socket and issues a command that gets forwarded to an unresponsive worker (e.g. one that is stuck or very slow), the connection hangs waiting for the worker's response. If the client then disconnects (timeout, Ctrl-C, etc.), the connection slot is never released because the client-side FIN is never acknowledged by the unresponsive worker. After 10 such leaked slots the master CLI socket becomes completely unreachable, returning "Resource temporarily unavailable" to any new connection attempt. In containerized deployments this means readiness probes start failing and the pod gets restarted. The fix adds a timeout server-fin of 1s to the mworker_proxy. When the client disconnects while waiting for a worker response, this timeout ensures the dangling backend connection is cleaned up after 1s, freeing the connection slot. This does not affect normal CLI operations since the timeout only starts after the client has already closed its side of the connection. A regression test is included that blocks the worker CLI thread using "debug dev delay" with nbthread 1, fills all 10 master CLI slots, waits for client-side timeouts, then verifies a new connection still succeeds. This fixes GH issue #3351. This should be backported to all stable branches. Co-authored-by: Martin Strenge <github@trixer.net> Co-authored-by: William Lallemand <wlallemand@haproxy.com>	2026-04-30 17:06:19 +02:00
Maxime Henrion	55ba951f3d	BUG/MINOR: quic: handle cases where we don't have an address It is possible to not have addresses in certain conditions, so we now also allow the address family to be AF_UNSPEC in quic_dgram_init(). No backport necessary since this code is not yet in a release.	2026-04-30 16:54:28 +02:00
Maxime Henrion	cc231f3468	OPTIM: quic: reduce the size of struct quic_dgram The QUIC code can only handle IPv4 or IPv6 addresses, so using two sockaddr_storage structs wastes a lot of space in the quic_dgram struct. This is a very large overhead since this structure is written in the MPSC ring buffers before every datagram, while many of those datagrams are only 50 bytes or less. Using an union instead saves 200 bytes per datagram, increasing the capacity of the buffers significantly.	2026-04-30 15:33:07 +02:00
Maxime Henrion	df0614b177	MINOR: quic: store the DCID as an offset Using an offset instead of a pointer into the datagram buffer is less error-prone as we do not have to manually fixup that pointer when the datagram is moved somewhere else in memory.	2026-04-30 15:33:07 +02:00
Maxime Henrion	9b5f11cd3d	OPTIM: quic: rework the QUIC RX code Use an MPSC ring buffer to hold data for each datagram handler. Holding this data in a per-handler buffer avoids the HoL blocking we experienced when we had per-listener buffers with data from all threads mixed up in them. This also gets rid of the mt_list contention we were suffering before, that was causing some threads to be stuck for a significant amount of time, causing warnings and even crashes in some cases.	2026-04-30 15:33:07 +02:00
Maxime Henrion	57d8f06215	MINOR: add an MPSC ring buffer implementation This is to be used in the QUIC code, where the multiple producers are the listener threads, and the single consumer is the datagram handler thread. Entries are variable-length with a size header, and are kept contiguous in the buffer, so padding is inserted at the end when an entry would otherwise wrap around. The size field is overloaded to also mark padding (-1) and entries that are still free or not yet ready for reads (0). Headers and payloads are aligned on 8 bytes. Aligning on 16 bytes might be beneficial on some architectures to let memcpy() use 128-bit SIMD instructions. The head and tail offsets are 64-bit unsigned integers, making ABA issues from integer overflow impossible on current or near-future hardware. Reservation uses a CAS rather than FAA because of the need to insert padding to keep entries contiguous.	2026-04-30 15:33:07 +02:00
Willy Tarreau	66b00543c5	BUG/MINOR: hpack: validate idx > 0 in hpack_valid_idx() HPACK indices start at 1, so idx=0 is invalid. The function only checked the upper bound before, allowing idx=0 to pass as valid. This is harmless as the code properly checks for existing name and values everywhere, but then due to the call to hpack_idx_to_phdr(), index 0 will be taken for :authority. Let's just make sure it's never zero. This can be backported.	2026-04-30 09:33:31 +02:00
Willy Tarreau	2464c73526	BUG/MINOR: vars: only print first invalid char in fill_desc() The variable name is passed as (ptr,len) suggesting that certain callers might not always have 0-terminated strings (e.g. when used in arguments where closing parenthesis or commas might follow), the error message carefully tries to designate the first invalid character, yet by mistake it prints the whole string. This mistake has been there since commit 4834bc773c ("MEDIUM: vars: adds support of variables") in 1.6. Let's properly report the char as promised instead. This could be backported but is totally unimportant.	2026-04-30 09:19:53 +02:00
Willy Tarreau	37b1f15d85	BUG/MINOR: vars: don't store the variable twice with set-var-fmt In 2.5, commit 9a621ae76d ("MEDIUM: vars: add a new "set-var-fmt" action") introduced the set-var-fmt action. However the storage (by then sample_store_stream() now var_set()) was added for this specific branch without any return, leaving the sample copied again over the variable via the final call, meaning that the variable name is looked up twice and for proc scope, the lock is taken twice for each call to set-var-fmt. This patch removes the first call. Tests show that proc operations now jump from 1.1M to 1.67M/s on a 64-core CPU (lower lock contention), while other scopes only observe a modest improvement with few vars (10 goes from 43.3M to 44M/s). This could be backported.	2026-04-30 09:11:05 +02:00
Willy Tarreau	2b30330cdc	BUG/MINOR: vars: make parse_store() return error on var_set() failure In 2.5, variables in the scope "proc" were pre-created with commit df8eeb1619 ("MEDIUM: vars: pre-create parsed SCOPE_PROC variables as permanent ones"). However one test on var_set() was copy-pasted from vars_check_args() into parse_store(), and the former returns 0 on error while for the latter it's a success. This means that some errors on variables of scope "proc" (typically alloc failure) can be missed at boot time, probably either making that variable invisible or causing a crash during boot. Let's return ACT_RET_PRS_ERR instead. This can be backported.	2026-04-30 08:37:10 +02:00
Willy Tarreau	64e5489864	CLEANUP: net_helper: fix incorrect const pointers in writev_n16() It's interesting to see that output pointers p1 and p2 were declared as const, and that thisremained unnoticed due to the explicit casts to u8 when writing to them. The function is currently not used, but better clean it up to avoid surprises.	2026-04-30 08:21:33 +02:00
Willy Tarreau	76d894956f	BUG/MINOR: sink: do not free existing sinks on allocation error In 3.1 with commit 1bdf6e884a ("MEDIUM: sink: implement sink_find_early()") sink creation started to support plugging to an existing sink and only updating the description. However if the strdup() for the new description failed, it would go down the error path releasing everything, while leaving the released entry in the sink list and leaving other users still attached to it. Here we take a different approach, we first allocate the new description, and release the old one on success, otherwise we leave the entry unchanged. And we only release new sinks, not old ones. This way allocation errors just do not change what is already referenced elsewhere, so that error unrolling remains clean. This remains of low importance anyway since it's only a case of boot error aggravation. Not really needed to be backported.	2026-04-30 08:01:24 +02:00
William Lallemand	19e17fd6e2	BUG/MINOR: acme: skip auth/challenge steps when newOrder returns a certificate When an ACME server returns a certificate URL directly in the newOrder response (order already validated), parse it and transition straight to ACME_CERTIFICATE, bypassing the auth/challenge steps. This needs to be backported to 3.2.	2026-04-29 18:22:45 +02:00

1 2 3 4 5 ...

27090 Commits