haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-09-20 21:31:28 +02:00

Author	SHA1	Message	Date
Frederic Lecaille	d753f24096	BUG/MINOR: quic: reorder fragmented RX CRYPTO frames by their offsets This issue impacts the QUIC listeners. It is the same as the one fixed by this commit: BUG/MINOR: quic: repeat packet parsing to deal with fragmented CRYPTO Like Chrome, the ngtcp2 client decided to fragment its CRYPTO frames, but in a much more aggressive way. This could be fixed with a list local to qc_parse_pkt_frms() to please Chrome thanks to the commit above. But this is not sufficient for ngtcp2, which often splits its ClientHello message into more than 10 fragments, some of them very small. This leads the packet parser to interrupt the CRYPTO frames parsing due to the ncbuf gap size limit. To fix this, this patch proceeds in approximately the same way but with an ebtree to reorder the CRYPTO frames by their offsets. These frames are directly inserted into a local ebtree. Then this ebtree is reused to provide the reordered CRYPTO data to the underlying ncbuf (non-contiguous buffer). This way there are far fewer chances for the ncbufs used to store CRYPTO data to reach an overly fragmented state. Must be backported as far as 2.6.	2025-08-27 16:14:19 +02:00
Frederic Lecaille	729196fbed	BUG/MEDIUM: quic-be: avoid crashes when releasing Initial pktns This bug arrived with this fix: BUG/MINOR: quic-be: missing Initial packet number space discarding leading to crashes when dereferencing ->ipktns. Such crashes could be reproduced with -dMfail option. To reach them, the memory allocations must fail. So, this is relatively rare, except on systems with limited memory. To fix this, do not call quic_pktns_discard() if ->ipktns is NULL. No need to backport.	2025-08-27 16:14:19 +02:00
William Lallemand	c36e4fb17f	DOC: configuration: reword 'generate-certificates' Reword the 'generate-certificates' keyword documentation to clarify what's happening upon error. This was discussed in ticket #3082.	2025-08-27 13:42:29 +02:00
Aurelien DARRAGON	2cd0afb430	MINOR: proxy: handle shared listener counters preparation from proxy_postcheck() We used to allocate and prepare listener counters from check_config_validity() all at once. But it isn't correct, since at that time listeners's guid are not inserted yet, thus counters_fe_shared_prepare() cannot work correctly, and so does shm_stats_file_preload() which is meant to be called even earlier. Thus in this commit (and to prepare for upcoming shm shared counters preloading patches), we handle the shared listener counters prep in proxy_postcheck(), which means that between the allocation and the prep there is the proper window for listener's guid insertion and shm counters preloading. No change of behavior expected when shm shared counters are not actually used.	2025-08-27 12:54:25 +02:00
Aurelien DARRAGON	cdb97cb73e	MEDIUM: server: split srv_init() in srv_preinit() + srv_postinit() We actually need more granularity to split srv postparsing init tasks: Some of them are required to be run BEFORE the config is checked, and some of them AFTER the config is checked. Thus we push the logic from 368d0136 ("MEDIUM: server: add and use srv_init() function") a little bit further and split the function in two distinct ones, one of them executed under check_config_validity() and the other one using REGISTER_POST_SERVER_CHECK() hook. SRV_F_CHECKED flag was removed because it is no longer needed, srv_preinit() is only called once, and so is srv_postinit().	2025-08-27 12:54:19 +02:00
Aurelien DARRAGON	9736221e90	MINOR: haproxy: abort config parsing on fatal errors for post parsing hooks When pre-check and post-check postparsing hooks are evaluated in step_init_2(), potential fatal errors are ignored during the iteration and are only taken into account at the end of the loop. This is not ideal because some errors (e.g. memory errors) could cause multiple alert messages in a row, which could make troubleshooting harder for the user. Let's stop as soon as a fatal error is encountered for post parsing hooks, as we do everywhere else.	2025-08-27 12:54:13 +02:00
Christopher Faulet	49db9739d0	BUG/MEDIUM: spoe: Improve error detection in SPOE applet on client abort It is possible to interrupt a SPOE applet without reporting an error. For instance, when the client of the parent stream aborts. Thanks to this patch, we take care to report an error on the SPOE applet to be sure to interrupt the processing. It is especially important if the connection to the agent is queued. Thanks to 886a248be ("BUG/MEDIUM: mux-spop: Reject connection attempts from a non-spop frontend"), it is no longer an issue. But there is no reason to continue to process if the parent stream is gone. In addition, in the SPOE filter, if the processing is interrupted when the filter is destroyed, no specific status code was set. It is not a big deal because it cannot be logged at this stage. But it can be used to notify the SPOE applet. So better to set it. This patch should be backported as far as 3.1.	2025-08-26 16:12:18 +02:00
William Lallemand	7a30c10587	REGTESTS: jwt: create dynamically "cert.ecdsa.pem" Stop declaring "cert.ecdsa.pem" in a crt-store, and add it dynamically over the stats socket instead. This way we fully verify a JWS signature with a certificate which never existed at HAProxy startup.	2025-08-25 16:44:24 +02:00
Christopher Faulet	886a248be4	BUG/MEDIUM: mux-spop: Reject connection attempts from a non-spop frontend It is possible to crash the process by initializing a connection to a SPOP server from a non-spop frontend. It is of course unexpected and invalid. And there are some checks to prevent that when the configuration is loaded. However, it is not possible to handle all cases, especially the "use_backend" rules relying on log-format strings. It could be good to improve the backend selection by checking the mode compatibility (for now, it is only performed for the HTTP). But at the end, this can also be handled by the SPOP multiplexer when it is initialized. If the opposite SD is not attached to an SPOE agent, we should fail the mux initialization and return an internal error. This patch must be backported as far as 3.1.	2025-08-25 11:11:05 +02:00
Christopher Faulet	b4a92e7cb1	MEDIUM: applet: Set .rcv_buf and .snd_buf functions on default ones if not set Based on the applet flags, it is possible to set .rcv_buf and .snd_buf callback functions if necessary. If these functions are not defined for an applet using the new API, it means the default functions must be used. We also take care to choose the raw version or the htx version, depending on the applet flags.	2025-08-25 11:11:05 +02:00
Christopher Faulet	71c01c1010	MINOR: applet: Make some applet functions HTX aware applet_output_room() and applet_input_data() are now HTX aware. These functions automatically rely on htx versions if APPLET_FL_HTX flag is set for the applet.	2025-08-25 11:11:05 +02:00
Christopher Faulet	927884a3eb	MINOR: applet: Add a flag to know an applet is using HTX buffers Multiplexers already explicitly announce their HTX support. Now it is possible to set flags on applet, it could be handy to do the same. So, now, HTX aware applets must set the APPLET_FL_HTX flag.	2025-08-25 11:11:05 +02:00
Christopher Faulet	1c76e4b2e4	MINOR: applet: Add function to test applet flags from the appctx appctx_app_test() function can now be used to test the applet flags using an appctx. This slightly simplifies tests on applet flags. For now, this function is used to test APPLET_FL_NEW_API flag.	2025-08-25 11:11:05 +02:00
Christopher Faulet	3de6c375aa	MINOR: applet: Rely on applet flag to detect the new api Instead of setting a flag on the applet context by checking the defined callback functions of the applet to know if an applet is using the new API or not, we can now rely on the applet flags itself. By checking APPLET_FL_NEW_API flag, it does the job. APPCTX_FL_INOUT_BUFS flag is thus removed.	2025-08-25 11:11:05 +02:00
Aurelien DARRAGON	3da1d63749	BUG/MEDIUM: http_ana: handle yield for "stats http-request" evaluation stats http-request rules evaluation is handled separately in http_process_req_common(). Because of that, if a rule requires yielding, the evaluation is interrupted as (F)YIELD verdict return values are not handled there. Since 3.2 with the introduction of costly ruleset interruption in 0846638 ("MEDIUM: stream: interrupt costly rulesets after too many evaluations"), the issue started being more visible because stats http-request rules would be interrupted when the evaluation counters reached tune.max-rules-at-once, but the evaluation would never be resumed, and the request would continue to be handled as if the evaluation was complete. Note however that the issue already existed in the past for actions that could return ACT_RET_YIELD such as "pause" for instance. This issue was reported by GH user @Wahnes in #3087, thanks to him for providing useful repro and details. To fix the issue, we merge rule verdict handling in http_process_req_common() so that "stats http-request" evaluation benefits from all return values already supported for the current ruleset. It should be backported to 3.2 with 0846638 ("MEDIUM: stream: interrupt costly rulesets after too many evaluations"), and probably even further (all stable versions) if the patch adaptation is not too complex (before HTTP_RULE_RES_FYIELD was introduced) because it is still relevant.	2025-08-25 10:59:16 +02:00
Aurelien DARRAGON	f9b227ebff	MINOR: http_ana: fix typo in http_res_get_intercept_rule HTTP_RULE_RES_YIELD was used where HTTP_RULE_RES_FYIELD should be used. Hopefully, aside from debug traces, both return values were treated equally. Let's fix that to prevent confusion and from causing bugs in the future. It may be backported in 3.2 with 0846638 ("MEDIUM: stream: interrupt costly rulesets after too many evaluations") if it easily applies	2025-08-25 10:59:08 +02:00
Amaury Denoyelle	1529ec1a25	MINOR: quic: centralize padding for HP sampling on packet building The below patch has simplified INITIAL padding on emission. Now, qc_prep_pkts() is responsible to activate padding for this case, and there is no more special case in qc_do_build_pkt() needed. commit 8bc339a6ad4702f2c39b2a78aaaff665d85c762b BUG/MAJOR: quic: fix INITIAL padding with probing packet only However, qc_do_build_pkt() may still activate padding on its own, to ensure that a packet is big enough so that header protection decryption can be performed by the peer. HP decryption is performed by extracting a sample from the ciphered packet, starting 4 bytes after PN offset. Sample length is 16 bytes as defined by TLS algos used by QUIC. Thus, a QUIC sender must ensure that the length of the packet number plus payload fields is at least 4 bytes. This is enough given that each packet is completed by a 16-byte AEAD tag which can be part of the HP sample. This patch simplifies qc_do_build_pkt() by centralizing padding for this case in a single location. This is performed at the end of the function after payload is completed. The code is thus simpler. This is not a bug. However, it may be interesting to backport this patch up to 2.6, as qc_do_build_pkt() is a tedious function, in particular when dealing with padding generation, thus it may benefit greatly from simplification.	2025-08-25 08:48:24 +02:00
Amaury Denoyelle	7d554ca629	BUG/MINOR: quic: don't coalesce probing and ACK packet of same type Haproxy QUIC stack suffers from a limitation: it's not possible to emit a packet which contains probing data and an ACK frame in it. Thus, in case qc_do_build_pkt() is invoked with both values as true, probing has the priority and ACK is ignored. However, this has the undesired side-effect of possibly generating two coalesced packets of the same type in the same datagram: the first one with the probing data and the second with an ACK frame. This is caused by qc_prep_pkts() loop which may call qc_do_build_pkt() multiple times with the same QEL instance. This case is normally used when a full datagram has been built but there is still content to emit on the current encryption level. To fix this, alter qc_prep_pkts() loop: if both probing and ACK are requested, force the datagram to be written after packet encoding. This will result in a datagram containing the packet with probing data as final entry. A new datagram is started for the next packet, which can contain the ACK frame. This also has some impact on INITIAL padding. Indeed, if the packet must be the last due to probing emission, qc_prep_pkts() will also activate padding to ensure the final datagram is at least 1200 bytes long. Note that coalescing two packets of the same type is not invalid according to QUIC RFC. However it could cause issues with some shaky implementations, so it is considered as a bug. This must be backported up to 2.6.	2025-08-22 18:20:42 +02:00
Amaury Denoyelle	8bc339a6ad	BUG/MAJOR: quic: fix INITIAL padding with probing packet only A QUIC datagram that contains an INITIAL packet must be padded to 1200 bytes to prevent any deadlock due to anti-amplification protection. This is implemented by encoding a PADDING frame on the last packet of the datagram if necessary. Previously, qc_prep_pkts() was responsible to activate padding when calling qc_do_build_pkt(), as it knows which packet is the last to encode. However, this has the side-effect of preventing PING emission for probing with no data as this case was handled in an else-if branch after padding. This was fixed by the below commit 217e467e89d15f3c22e11fe144458afbf718c8a8 BUG/MINOR: quic: fix malformed probing packet building Above logic was altered to fix the PING case: padding was set to false explicitly in qc_prep_pkts(). Padding was then added in a specific block dedicated to the PING case in qc_do_build_pkt() itself for INITIAL packets. However, the fix is incorrect if the last QEL used to build a packet is not the initial one and probing is used with PING frame only. In this case, the specific block in qc_do_build_pkt() does not add padding. This causes a BUG_ON() crash in qc_txb_store() which catches these packets as irregularly formed. To fix this while also properly handling PING emission, revert to the original padding logic: qc_prep_pkts() is responsible to activate INITIAL padding. To not interfere with PING emission, qc_do_build_pkt() body is adjusted so that PING block is moved up in the function and detached from the padding condition. The main benefit from this patch is that INITIAL padding decision in qc_prep_pkts() is clearer now. Note that padding can also be activated by qc_do_build_pkt(), as packets should be big enough for header protection decipher. However, this case is different from INITIAL padding, so it is not covered by this patch. This should be backported up to 2.6.	2025-08-22 18:12:32 +02:00
Amaury Denoyelle	0376e66112	BUG/MINOR: quic: do not emit probe data if CONNECTION_CLOSE requested If connection closing is activated, qc_prep_pkts() can only build a datagram with a single packet. This is because we consider that only a single CONNECTION_CLOSE frame is relevant at this stage. This is handled both by qc_prep_pkts(), which ensures that only a single packet datagram is built, and also by qc_do_build_pkt(), which prevents the invocation of qc_build_frms() if <cc> is set. However, there is an inconsistency for probing. First, qc_prep_pkts() deactivates it if connection closing is requested. But qc_do_build_pkt() may still emit a probing frame as it does not check its <probe> argument but rather <pto_probe> QEL field directly. This can result in a packet mixing a PING and a CONNECTION_CLOSE frame, which is useless. Fix this by adjusting qc_do_build_pkt(): closing argument is also checked on PING probing emission. Note that there is still shaky code here as qc_do_build_pkt() should rely only on <probe> argument to ensure this. This should be backported up to 2.6.	2025-08-22 18:06:43 +02:00
Amaury Denoyelle	fc3ad50788	BUG/MEDIUM: quic: reset padding when building GSO datagrams qc_prep_pkts() encodes input data into QUIC packets in a loop into one or several datagrams. It supports GSO, which requires building a series of multiple datagrams of the same length. Each packet encoding is performed via a call to qc_do_build_pkt(). This function has an argument to specify if output packet must be completed with a PADDING frame. This option is activated when qc_prep_pkts() encodes the last packet of a datagram with at least one INITIAL packet in it. Padding is reset each time a new datagram is started. However, this was not performed if GSO is used to build the next datagram. This patch fixes it by properly resetting padding in this case also. The impact of this bug is unknown. It may have several effects, one of the most obvious being the insertion of unnecessary padding in packets. It could also potentially trigger an infinite loop in qc_prep_pkts(), although this has never been encountered so far. This must be backported up to 3.1.	2025-08-22 16:22:01 +02:00
Valentine Krasnobaeva	0dc8d8d027	MINOR: dns: dns_connect_nameserver: fix fd leak at error path This fixes the commit 2c7e05f80e3b ("MEDIUM: dns: don't call connect to dest socket for AF_INET*"). If we fail to bind AF_INET sockets or the address family of the nameserver protocol isn't what we expect, we need to close the fd obtained by connect. This fixes GitHub issue #3085. This must be backported along with the commit 2c7e05f80e3b.	2025-08-22 10:50:47 +02:00
Christopher Faulet	a498e527b4	BUG/MAJOR: stream: Remove READ/WRITE events on channels after analysers eval It is possible to miss a synchronous write event in process_stream() if the stream was woken up on a write event. In that case, it is possible to freeze the stream until the next I/O event or timeout. Concretely, the stream is woken up with CF_WRITE_EVENT on a channel. This flag is removed from the channel when we leave process_stream(). But before leaving process_stream(), when a synchronous send is tried on this channel, the flag is removed and eventually set again on success. But this event is masked by the previous one, and the channel is not resynced as it should be. To fix the bug, CF_READ_EVENT and CF_WRITE_EVENT flags are removed from a channel after the corresponding analysers evaluation. This way, we will be able to detect a successful synchronous send to restart analysers evaluation based on the new channel state. It is safe (or it should be) to do so because these flags are only used by analysers and tested to resync the stream inside process_stream(). It is a very old bug and I guess all versions are affected. It was observed on 2.9 and higher, and with the master/worker only. But it could affect any stream. It is tagged as MAJOR because this area is really sensitive to any change. This patch should fix the issue #3070. It should probably be backported to all stable versions, but only after a period of observation and with special care because this area is really sensitive to changes. It is probably reasonable to backport it as far as 3.0 and wait for older versions. Thanks to Valentine for the help on this issue!	2025-08-21 20:15:18 +02:00
William Lallemand	7b3b3d7146	BUG/MEDIUM: ssl: apply ssl-f-use on every "ssl" bind This patch introduces a change of behavior in the configuration parsing. Previously the "ssl-f-use" lines were only applied on "ssl" bind lines that do not have any "crt" configured. Since there is no warning and you could mix bind lines with and without crt, this is really confusing. This patch applies the "ssl-f-use" lines on every "ssl" bind line. This was discussed in ticket #3082. Must be backported to 3.2.	2025-08-21 14:58:06 +02:00
Frederic Lecaille	e513620c72	BUG/MEDIUM: quic-be: crash after backend CID allocation failures This bug impacts only the QUIC backends. It arrived with this commit: MINOR: quic-be: QUIC connection allocation adaptation (qc_new_conn()) which was supposed to be fixed by: BUG/MEDIUM: quic: crash after quic_conn allocation failures but this commit was not sufficient. Such a crash could be reproduced with -dMfail option. To reach it, the <conn_id> object allocation must fail (from qc_new_conn()). So, this is relatively rare, except on systems with limited memory. No need to backport.	2025-08-21 14:24:31 +02:00
Frederic Lecaille	9a22770ac5	BUG/MINOR: quic-be: missing Initial packet number space discarding A QUIC client must discard the Initial packet number space as soon as it first sends a Handshake packet. This patch implements this packet number space discarding, which was missing.	2025-08-21 14:24:31 +02:00
Amaury Denoyelle	901de11157	BUG/MEDIUM: mux-h2: fix crash on idle-ping due to unwanted ABORT_NOW An ABORT_NOW() was used while debugging idle-ping but was not removed from the final code. This may cause a crash, in particular when mixing idle-ping with shorter http-request/http-keep-alive values. Fix this situation by removing the ABORT_NOW() statement. This should fix github issue #3079. This must be backported up to 3.2.	2025-08-21 14:21:11 +02:00
Willy Tarreau	82b002a225	[RELEASE] Released version 3.3-dev7 Released version 3.3-dev7 with the following main changes : - MINOR: quic: duplicate GSO unsupp status from listener to conn - MINOR: quic: define QUIC_FL_CONN_IS_BACK flag - MINOR: quic: prefer qc_is_back() usage over qc->target - BUG/MINOR: cfgparse: immediately stop after hard error in srv_init() - BUG/MINOR: cfgparse-listen: update err_code for fatal error on proxy directive - BUG/MINOR: proxy: avoid NULL-deref in post_section_px_cleanup() - MINOR: guid: add guid_get() helper - MINOR: guid: add guid_count() function - MINOR: clock: add clock_set_now_offset() helper - MINOR: clock: add clock_get_now_offset() helper - MINOR: init: add REGISTER_POST_DEINIT_MASTER() hook - BUILD: restore USE_SHM_OPEN build option - BUG/MINOR: stick-table: cap sticky counter idx with tune.nb_stk_ctr instead of MAX_SESS_STKCTR - MINOR: sock: update broken accept4 detection for older hardwares. - CI: vtest: add os name to OT cache key - CI: vtest: add Ubuntu arm64 builds - BUG/MEDIUM: ssl: Fix 0rtt to the server - BUG/MEDIUM: ssl: fix build with AWS-LC - MEDIUM: acme: use lowercase for challenge names in configuration - BUG/MINOR: init: Initialize random seed earlier in the init process - DOC: management: clarify usage of -V with -c - MEDIUM: ssl/cli: relax crt insertion in crt-list of type directory - MINOR: tools: implement ha_aligned_zalloc() - CLEANUP: fd: make use of ha_aligned_alloc() for the fdtab - MINOR: pools: distinguish the requested alignment from the type-specific one - MINOR: pools: permit to optionally specify extra size and alignment - MINOR: pools: always check that requested alignment matches the type's - DOC: api: update the pools API with the alignment and typed declarations - MEDIUM: tree-wide: replace most DECLARE_POOL with DECLARE_TYPED_POOL - OPTIM: tasks: align task and tasklet pools to 64 - OPTIM: buffers: align the buffer pool to 64 - OPTIM: queue: align the pendconn pools to 64 - OPTIM: connection: align connection pools to 64 - OPTIM: server: start to use aligned allocs in server - DOC: management: fix typo in commit f4f93c56 - DOC: config: recommend single quoting passwords - MINOR: tools: also implement ha_aligned_alloc_typed() - MEDIUM: server: introduce srv_alloc()/srv_free() to alloc/free a server - MINOR: server: align server struct to 64 bytes - MEDIUM: ring: always allocate properly aligned ring structures - CI: Update to actions/checkout@v5 - MINOR: quic: implement qc_ssl_do_hanshake() - BUG/MEDIUM: quic: listener connection stuck during handshakes (OpenSSL 3.5) - BUG/MINOR: mux-h1: fix wrong lock label - MEDIUM: dns: don't call connect to dest socket for AF_INET* - BUG/MINOR: spoe: Properly detect and skip empty NOTIFY frames - BUG/MEDIUM: cli: Report inbuf is no longer full when a line is consumed - BUG/MEDIUM: quic: crash after quic_conn allocation failures - BUG/MEDIUM: quic-be: do not initialize ->conn too early - BUG/MEDIUM: mworker: more verbose error upon loading failure - MINOR: xprt: Add recvmsg() and sendmsg() parameters to rcv_buf() and snd_buf(). - MINOR: ssl: Add a "flags" field to ssl_sock_ctx. - MEDIUM: xprt: Add a "get_capability" method. - MEDIUM: mux_h1/mux_pt: Use XPRT_CAN_SPLICE to decide if we should splice - MINOR: cfgparse: Add a new "ktls" option to bind and server. - MINOR: ssl: Define HAVE_VANILLA_OPENSSL if openssl is used. - MINOR: build: Add a new option, USE_KTLS. - MEDIUM: ssl: Add kTLS support for OpenSSL. - MEDIUM: splice: Don't consider EINVAL to be a fatal error - MEDIUM: ssl: Add splicing with SSL. - MEDIUM: ssl: Add ktls support for AWS-LC. - MEDIUM: ssl: Add support for ktls on TLS 1.3 with AWS-LC - MEDIUM: ssl: Handle non-Application data record with AWS-LC - MINOR: ssl: Add a way to globally disable ktls. v3.3-dev7	2025-08-20 21:52:39 +02:00
Olivier Houchard	6f21c5631a	MINOR: ssl: Add a way to globally disable ktls. Add a new global option, "noktls", as well as a command line option, "-dT", to totally disable ktls usage, even if it is activated on servers or binds in the configuration. That makes it easier to quickly figure out if a problem is related to ktls or not.	2025-08-20 18:33:11 +02:00
Olivier Houchard	5da3540988	MEDIUM: ssl: Handle non-Application data record with AWS-LC Handle receiving and sending TLS records that are not application data records. When receiving, we ignore new session tickets records, we handle close notify as a read0, and we consider any other records as a connection error. For sending, we're just sending close notify, so that the TLS connection is properly closed.	2025-08-20 18:33:11 +02:00
Olivier Houchard	fefc1cce20	MEDIUM: ssl: Add support for ktls on TLS 1.3 with AWS-LC AWS-LC added a new API in AWS-LC 1.54 that allows the user to retrieve the keys for TLS 1.3 connections with SSL_get_read_traffic_secret(), so use it to be able to use ktls with TLS 1.3 too.	2025-08-20 18:33:11 +02:00
Olivier Houchard	5c8fa50966	MEDIUM: ssl: Add ktls support for AWS-LC. Add ktls support for AWS-LC. As it does not know anything about ktls, it means extracting keys from the ssl lib, and provide them to the kernel. At which point we can use regular recvmsg()/sendmsg() calls. This patch only provides support for TLS 1.2, AWS-LC provides a different way to extract keys for TLS 1.3. Note that this may work with BoringSSL too, but it has not been tested.	2025-08-20 18:33:11 +02:00
Olivier Houchard	a903004a1a	MEDIUM: ssl: Add splicing with SSL. Implement the splicing methods to the SSL xprt (which will just call the raw_sock methods if kTLS is enabled on the socket), and properly report that a connection supports splicing if kTLS is configured on that connection. For OpenSSL, if the upper layer indicated that it wanted to start using splicing by adding the CO_FL_WANT_SPLICING flag, make sure we don't read any more data from the socket, and just drain what may be in the internal OpenSSL buffers, before allowing splicing	2025-08-20 18:33:11 +02:00
Olivier Houchard	755436920d	MEDIUM: splice: Don't consider EINVAL to be a fatal error Don't consider that EINVAL is a fatal error, when calling splice(). When doing splicing from a kTLS socket, splice() will set errno to EINVAL if the next record to be read is not an application data record. This is not a fatal error, it just means we have to use recvmsg() to read it, and potentially we can then resume using splicing. It is unfortunate that EINVAL was used for that case, but we should never get any other case of receiving EINVAL from splice(), so it should be safe to treat it as non-fatal.	2025-08-20 18:33:11 +02:00
Olivier Houchard	ed7d20afc8	MEDIUM: ssl: Add kTLS support for OpenSSL. Modify the SSL code to enable kTLS with OpenSSL. It mostly requires our internal BIO to be able to handle the various kTLS-specific controls in ha_ssl_ctrl(), as well as being able to use recvmsg() and sendmsg() from ha_ssl_read() and ha_ssl_write().	2025-08-20 18:33:11 +02:00
Olivier Houchard	6270073072	MINOR: build: Add a new option, USE_KTLS. Add a new define, USE_KTLS, that enables using kTLS in haproxy. It will only work for Linux with a kernel >= 4.17.	2025-08-20 18:33:11 +02:00
Olivier Houchard	7836fe8fe3	MINOR: ssl: Define HAVE_VANILLA_OPENSSL if openssl is used. If we're using OpenSSL as our crypto library, add a define, HAVE_VANILLA_OPENSSL, to make it easier to differentiate between the various crypto libs.	2025-08-20 18:33:10 +02:00
Olivier Houchard	e8674658ae	MINOR: cfgparse: Add a new "ktls" option to bind and server. Add a new "ktls" option to bind and server. Valid values are "on" and "off". It currently does nothing, but once kTLS is implemented, it will enable or disable kTLS for the corresponding sockets. It is marked as experimental for now.	2025-08-20 18:33:10 +02:00
Olivier Houchard	075e753802	MEDIUM: mux_h1/mux_pt: Use XPRT_CAN_SPLICE to decide if we should splice In both mux_h1 and mux_pt, use the new XPRT_CAN_SPLICE capability to decide if we should attempt to use splicing or not. If we receive XPRT_CONN_CAN_MAYBE_SPLICE, add a new flag on the connection, CO_FL_WANT_SPLICING, to let the xprt know that we'd love to be able to do splicing, so that it may get ready for that. This should have no effect right now, and is required work for adding kTLS support.	2025-08-20 18:33:10 +02:00
Olivier Houchard	5731b8a19c	MEDIUM: xprt: Add a "get_capability" method. Add a new method to xprts, get_capability, that can be used to query if an xprt supports something or not. The first capability implemented is XPRT_CAN_SPLICE, to know if the xprt will be able to use splicing for the provided connection. The possible answers are XPRT_CONN_CAN_NOT_SPLICE, which indicates splicing will never be possible for that connection, XPRT_CONN_COULD_SPLICE, which indicates that splicing is not usable right now, but may be in the future, and XPRT_CONN_CAN_SPLICE, that means we can splice right away.	2025-08-20 18:33:10 +02:00
Olivier Houchard	2623b7822e	MINOR: ssl: Add a "flags" field to ssl_sock_ctx. Instead of adding more separate fields in ssl_sock_ctx, add a "flags" one. Convert the "can_send_early_data" to the flag SSL_SOCK_F_EARLY_ENABLED. More flags will be added for kTLS support.	2025-08-20 17:28:03 +02:00
Olivier Houchard	3d685fcb7d	MINOR: xprt: Add recvmsg() and sendmsg() parameters to rcv_buf() and snd_buf(). In rcv_buf() and snd_buf(), use sendmsg/recvmsg instead of send and recv, and add two new optional parameters to provide msg_control and msg_controllen. Those are unused for now, but will be used later for kTLS.	2025-08-20 17:28:03 +02:00
William Lallemand	67cb6aab90	BUG/MEDIUM: mworker: more verbose error upon loading failure When a worker crashes during its configuration parsing and without emitting any messages, the master will emit the message "Failed to load worker!". However, that gives us neither the PID of the worker nor the status code. This patch fixes the problem by emitting a more verbose error. Must be backported as far as 3.1.	2025-08-20 17:15:52 +02:00
Frederic Lecaille	ca5511f022	BUG/MEDIUM: quic-be: do not initialize ->conn too early This bug arrived with this commit: BUG/MEDIUM: quic: do not release BE quic-conn prior to upper conn which added a BUG_ON(qc->conn) statement at the beginning of quic_conn_release(). It is triggered if the connection is not released before releasing the quic_conn. But this is always the case for a backend quic_conn when its allocation from qc_new_conn() fails. Such crashes could be reproduced with -dMfail option. To reach them, the memory allocations must fail. So, this is relatively rare, except on systems with limited memory. To fix this, simply set ->conn quic_conn struct member to a not null value (the one passed as parameter) after the quic_conn allocation has succeeded. No backport needed.	2025-08-20 16:25:51 +02:00
Frederic Lecaille	8514647849	BUG/MEDIUM: quic: crash after quic_conn allocation failures This regression arrived with this commit: MINOR: quic-be: QUIC connection allocation adaptation (qc_new_conn()) where qc_new_conn() was modified. The ->cids allocation was moved without checking if a quic_conn_release() call could lead to crashes due to uninitialized quic_conn members. Indeed, if qc_new_conn() fails, then quic_conn_release() is called. This bug could impact both QUIC servers and clients. Such crashes could be reproduced with -dMfail option. To reach them, the memory allocations must fail. So, this is relatively rare, except on systems with limited memory. This patch ensures all the quic_conn members which could lead to a crash from quic_conn_release() are initialized before any remaining memory allocations required for the quic_conn. The <conn_id> variable allocated by the client is no longer attached to the connection during its allocation, but after the ->cids tree is allocated. No backport needed.	2025-08-20 16:25:51 +02:00
Christopher Faulet	c6c2ef1f11	BUG/MEDIUM: cli: Report inbuf is no longer full when a line is consumed When the command line parsing was refactored (20ec1de21 "MAJOR: cli: Refacor parsing and execution of pipelined commands"), a regression was introduced. When input data are consumed, the information about the applet's input buffer is no longer updated to state that it is no longer full. So it is possible to freeze the CLI applet. And a spinning loop may be encountered if a client shutdown is detected in this state. The fix is obvious. When data are consumed from the applet's input buffer, the APPCTX_FL_INBLK_FULL flag is removed to notify that the input buffer is no longer full and more data can be sent to the CLI applet. This patch should fix the issue #3064. It must be backported to 3.2.	2025-08-20 16:01:50 +02:00
Christopher Faulet	dc6e8dde23	BUG/MINOR: spoe: Properly detect and skip empty NOTIFY frames Since the SPOE was refactored, the detection of empty NOTIFY frames is broken. So it is possible to send a NOTIFY frame to an agent with no message at all. The bug happens because the frame type is now added to the buffer before the messages are encoded. So the buffer is never really empty. To fix the issue, the condition to detect empty frames was adapted. This patch must be backported as far as 3.1.	2025-08-20 16:01:50 +02:00
Valentine Krasnobaeva	2c7e05f80e	MEDIUM: dns: don't call connect to dest socket for AF_INET* When we perform a connect() call on a datagram socket used to send DNS requests, we set its default destination address to a given nameserver. Then we simply use send(), as the destination address is already set. In some use cases described in GitHub issues #3001 and #2654, this approach becomes inefficient: when nameservers change their IP addresses dynamically, this triggers DNS resolution errors. To fix this, let's perform the bind() on the wildcard address for the datagram AF_INET* client socket. This allocates a port for it. Then let's use sendto() instead of send(). If the nameserver is local and is listening on a UNIX domain socket, we continue to use the existing approach (connect() and then send()). This fixes issues #3001 and #2654. This may be backported to all stable versions.	2025-08-19 11:26:02 +02:00
Amaury Denoyelle	8ac54cafcd	BUG/MINOR: mux-h1: fix wrong lock label Wrong lock label is used when manipulating idle lock on h1_timeout_task. Fix this by replacing OTHER_LOCK by IDLE_CONNS_LOCK. This only concerns thread debugging statistics. This must be backported up to 2.4.	2025-08-14 16:31:25 +02:00
Frederic Lecaille	878a72d001	BUG/MEDIUM: quic: listener connection stuck during handshakes (OpenSSL 3.5) This issue was reported in GH #3071 by @famfo where a wireshark capture reveals that some handshakes could not complete after having received two Initial packets. This could happen when the packets were parsed in two passes, calling qc_ssl_provide_all_quic_data() two times. This is due to the crypto data stream counter which was incremented two times from qc_ssl_provide_all_quic_data() (see cstream->rx.offset += data statement around line 1223 in quic_ssl.c). One time by the callback which "receives" the crypto data, and one time by qc_ssl_provide_all_quic_data(). Then when parsing the second crypto data frame, the parser detected that the crypto data had already been provided. To fix this, one could comment out the code which increments the crypto data stream counter by <data>. That said, when using the OpenSSL 3.5 QUIC API one should not modify the crypto data stream outside of the OpenSSL 3.5 QUIC API. So, this patch stops calling qc_ssl_provide_all_quic_data() and qc_ssl_provide_quic_data() and only calls qc_ssl_do_hanshake() after having received some crypto data. In addition to this, as these functions are no longer called when building haproxy against OpenSSL 3.5, this patch disables their compilation (with #ifndef HAVE_OPENSSL_QUIC). This patch depends on this previous one: MINOR: quic: implement qc_ssl_do_hanshake() Thank you to @famfo for this report. Must be backported to 3.2.	2025-08-14 14:54:47 +02:00
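
The socket pattern described in commit 2c7e05f80e (MEDIUM: dns) can be sketched outside of haproxy as follows. This is only an illustration in Python of the two approaches named in that message (connect() then send(), versus bind() on the wildcard address then sendto()). It is not haproxy code, and the function names send_connected and send_unconnected are hypothetical.

```python
import socket
from typing import Tuple


def send_connected(payload: bytes, dest: Tuple[str, int]) -> socket.socket:
    """Earlier pattern: connect() stores a default peer, send() reuses it."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.connect(dest)
    sock.send(payload)
    return sock


def send_unconnected(payload: bytes, dest: Tuple[str, int]) -> socket.socket:
    """Pattern from the commit message: bind() to the wildcard, then sendto()."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    # Port 0 on the wildcard address lets the kernel allocate a local port.
    sock.bind(("0.0.0.0", 0))
    # The destination is given on each call, so no default peer is stored
    # on the socket and a later datagram may target a different address.
    sock.sendto(payload, dest)
    return sock
```

With the unconnected variant, the caller is free to pass a different destination on each call, which is the property the commit message relies on when nameserver addresses change.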


25229 Commits