haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-09-20 13:21:29 +02:00

Author	SHA1	Message	Date
Amaury Denoyelle	f645cd3c74	MINOR: quic: restore QUIC_HP_SAMPLE_LEN constant The below patch fixes padding emission for small packets, which is required to ensure that header protection removal can be performed by the recipient. commit d7dea408c64c327cab6aebf4ccad93405b675565 BUG/MINOR: quic: too short PADDING frame for too short packets In addition to the proper fix, constant QUIC_HP_SAMPLE_LEN was removed and replaced by QUIC_TLS_TAG_LEN. However, it still makes sense to have a dedicated constant which represent the size of the sample used for header protection. Thus, this patch restores it. Special instructions for backport : above patch mentions that no backport is needed. However, this is incorrect, as bug is introduced by another patch scheduled for backport up to 2.6. Thus, it is first mandatory to schedule d7dea408c64c327cab6aebf4ccad93405b675565 after it. Then, this patch can also be used for the sake of code clarity.	2025-09-08 14:49:03 +02:00
Amaury Denoyelle	c20c71a079	TESTS: quic: add unit-tests for QUIC TX part Define a new "quic_tx" unit-test which is used to test QUIC TX module. For the moment, a single test is performed on qc_do_build_pkt(). It checks that PADDING is correctly added for HP sampling in case of a small packet.	2025-09-08 14:49:03 +02:00
Amaury Denoyelle	fb8c6e2030	CLEANUP: quic: fix typo in quic_tx trace Fix trace in qc_may_build_pkt(). This can be backported up to 3.0.	2025-09-08 14:49:03 +02:00
Aurelien DARRAGON	b9ef55d56d	MINOR: stats-file: use explicit unsigned integer bitshift for user slots As reported in GH #3104, there remained a place where (1 << shift was used to set or remove bits from uint64_t users bitfield. It is incorrect and could lead to bugs for values > 32 bits. Instead, let's use 1ULL to ensure the operation remains 64bits consistent. No backport needed.	2025-09-08 13:38:49 +02:00
Aurelien DARRAGON	9272b8ce74	BUG/MEDIUM: proxy: fix crash with stop_proxy() called during init Willy reported that the following config would segfault right after the "removing incomplete section 'peer' is emitted: peers peers bind :2300 server n10 127.0.0.1:2310 listen dummy bind localhost:9999 This is caused by the fact that stop_proxy(), which tries to read shared counters, is called during early init while shared counters are not yet initialized. To fix the crash, let's check if we're still during starting phase, in which case we assume the counters are not initialized and we assume 0 value instead. No backport needed unless 16eb0fab31 ("MAJOR: counters: dispatch counters over thread groups") is.	2025-09-08 13:38:38 +02:00
Frederic Lecaille	6f9fccec1f	MINOR: quic: SSL session reuse for QUIC Mimic the same behavior as the one for SSL/TCP connetion to implement the SSL session reuse. Extract the code which try to reuse the SSL session for SSL/TCP connections to implement ssl_sock_srv_try_reuse_sess(). Call this function from QUIC ->init() xprt callback (qc_conn_init()) as this done for SSL/TCP connections.	2025-09-08 11:46:26 +02:00
Olivier Houchard	b3e685ac3d	BUG/MEDIUM: ssl: Properly initialize msg_controllen. When kTLS is compiled in, make sure msg_controllen is initialized to 0. If we're not actually kTLS, then it won't be set, but we'll check that it is non-zero later to check if we ancillary data. This does not need to be backported. This should fix CID 1620865, as reported in github issue #3106.	2025-09-06 14:19:48 +02:00
Willy Tarreau	75bd9255dd	BUG/MINOR: cpu_topo: work around a small bug in musl's CPU_ISSET() As found in GH issue #3103, CPU_ISSET() on musl 1.25 doesn't match the man page which says it's returning an int. The reason is pretty simple, it's a macro that operates on the bits directly and returns the result of the bit field applied to the mask as an unsigned long. Bits above 31 will simply be dropped if returned as an int, which causes CPUs 32..63 to appear as absent from cpu_sets. The fix is trivial, it consists in just comparing the result against zero (i.e. turning it to a boolean), but before it's merged and deployed we'll have to face such deployments, so better implement the same workaround in the code here since we have access to the raw long value. This workaround should be backported to 3.0.	2025-09-06 11:05:52 +02:00
Frederic Lecaille	d7dea408c6	BUG/MINOR: quic: too short PADDING frame for too short packets This bug arrvived with this commit: MINOR: quic: centralize padding for HP sampling on packet building What was missed is the fact that at the centralization point for the PADDING frame to add for too short packet, <len> payload length already includes <*pn_len> the packet number field length value. So when computing the length of the PADDING frame, the packet field length must not be considered and added to the payload length (<len>). This bug leaded too short PADDING frame to too short packets. This was the case, most of times with Application level packets with a 1-byte packet number field followed by a 1-byte PING frame. A 1-byte PADDING frame was added in this case in place of a correct 2-bytes PADDINF frame. The header packet protection of such packet could not be removed by the clients as for instance for ngtcp2 with such traces: I00001828 0x5a135c81e803f092c74bac64a85513b657 pkt could not decrypt packet number As the header protection could no be removed, the header keyupdate bit could also not be read by packet analyzers such as pyshark used during the keyupdate tests. No need to backport.	2025-09-05 16:17:11 +02:00
Frederic Lecaille	71336bdd08	MINOR: quic: add useful trace about padding params values When adding a PADDING frame for too short packets, add a trace about variable values whose this PADDING frame length depends on.	2025-09-05 16:17:11 +02:00
Christopher Faulet	cc8af125be	REGTESTS: ssl: Add a script to test the automatic SNI selection The script reg-tests/ssl/ssl_sni_auto.vtc tests the automatic SNI selection for regular server connections and for health-check ones. It rely on a 3.3-dev8 feature (in fact, it was pushed just after the dev8).	2025-09-05 15:56:42 +02:00
Christopher Faulet	f9a6ae727c	OPTIM: tcpcheck: Reorder tcpchek_connect structure fields to fill holes Thanks to this patch, two 4-bytes holes are now filled in the tcpchek_connect structure.	2025-09-05 15:56:42 +02:00
Christopher Faulet	ffc1f096e0	MEDIUM: httpcheck/ssl: Base the SNI value on the HTTP host header by default Similarly to the automic SNI selection for regulat SSL traffic, the SNI of health-checks HTTPS connection is now automatically set by default by using the host header value. "check-sni-auto" and "no-check-sni-auto" server settings were added to change this behavior. Only implicit HTTPS health-checks can take advantage of this feature. In this case, the host header value from the "option httpchk" directive is used to extract the SNI. It is disabled if http-check rules are used. So, the SNI must still be explicitly specified via a "http-check connect" rule. This patch with should paritally fix the issue #3081.	2025-09-05 15:56:42 +02:00
Christopher Faulet	668916c1a2	MEDIUM: server/ssl: Base the SNI value to the HTTP host header by default For HTTPS outgoing connections, the SNI is now automatically set using the Host header value if no other value is already set (via the "sni" server keyword). It is now the default behavior. It could be disabled with the "no-sni-auto" server keyword. And eventually "sni-auto" server keyword may be used to reset any previous "no-sni-auto" setting. This option can be inherited from "default-server" settings. Finally, if no connection name is set via "pool-conn-name" setting, the selected value is used. The automatic selection of the SNI is enabled by default for all outgoing connections. But it is concretely used for HTTPS connections only. The expression used is "req.hdr(host),host_only". This patch should paritally fix the issue #3081. It only covers the server part. Another patch will add the feature for HTTP health-checks.	2025-09-05 15:56:42 +02:00
Christopher Faulet	58555b8653	BUG/MINOR: tcpcheck: Don't use sni as pool-conn-name for non-SSL connections When we try to ruse connection to perform an healtcheck, the SNI, from the tcpcheck connection or the healthcheck itself, must not be used as connection name for non-SSL connections. This patch must be backported to 3.2.	2025-09-05 15:56:42 +02:00
Christopher Faulet	eb3d4eb59f	OPTIM: tcpcheck: Don't set SNI and ALPN for non-ssl connections There is no reason to set the SNI and ALPN for non-ssl connections. It is not really an issue because ssl_sock_set_servername() and ssl_sock_set_alpn() functions will do nothing. But it is cleaner this way and this could avoid bugs in future. No backport needed, because there is no bug.	2025-09-05 15:56:42 +02:00
Christopher Faulet	ef07d3511a	OPTIM: proto_rhttp: Don't set SNI for non-ssl connections There is no reason to set the SNI for non-ssl connections. It is not really an issue because ssl_sock_set_servername() function will do nothing. But there is no reason to uselessly evaluate an expression. No backport needed, because there is no bug.	2025-09-05 15:56:42 +02:00
Christopher Faulet	52866349a1	OPTIM: backend: Don't set SNI for non-ssl connections There is no reason to set the SNI for non-ssl connections. It is not really an issue because ssl_sock_set_servername() function will do nothing. But there is no reason to uselessly evaluate an expression. No backport needed, because there is no bug.	2025-09-05 15:56:42 +02:00
Christopher Faulet	a97bd0f505	BUG/MINOR: server: Update healthcheck when server settings are changed via CLI not all changes are concerned. But when the SSL is enabled or disabled for a server, the healthcheck xprt must be eventually be updated too. This happens when the healthcheck relies on the server settings. In the same spirit, when the healthcheck address and port are updated, we must fallback on the raw xprt if the SSL is not explicitly enabled for the healthcheck with a "check-ssl" parameter. This patch should be backported to all stable versions.	2025-09-05 15:56:42 +02:00
Christopher Faulet	f8f94ffc9c	BUG/MEDIUM: server: Use sni as pool connection name for SSL server only By default, for a given server, when no pool-conn-name is specified, the configured sni is used. However, this must only be done when SSL is in-use for the server. Of course, it is uncommon to have a sni expression for now-ssl server. But this may happen. In addition, the SSL may be disabled via the CLI. In that case, the pool-conn-name must be discarded if it was copied from the sni. And, we must of course take care to set it if the ssl is enabled. Finally, when the attac-srv action is checked, we now checked the pool-conn-name expression. This patch should be backported as far as 3.0. It relies on "MINOR: server: Parse sni and pool-conn-name expressions in a dedicated function" which should be backported too.	2025-09-05 15:56:08 +02:00
Christopher Faulet	086a248645	MINOR: server: Parse sni and pool-conn-name expressions in a dedicated function This change is mandatory to fix an issue. The parsing of sni and pool-conn-name expressions (from string to expression) is now handled in a dedicated function. This will avoid to duplicate the same code at different places.	2025-09-05 11:32:21 +02:00
Christopher Faulet	bb407ba8e3	BUG/MINOR: acl: Fix error message about several '-m' parameters There is a typo in the commit * c51ddd5c3 ("MINOR: acl: Only allow one '-m' matching method") . '*m' was reported in the error message instead of '-m'. In addition, it is now mentionned that only the last one should be keep if an old config triggers the error. No backport needed, except if the commit above is backported.	2025-09-05 11:32:20 +02:00
Willy Tarreau	b167d545cf	[RELEASE] Released version 3.3-dev8 Released version 3.3-dev8 with the following main changes : - BUG/MEDIUM: mux-h2: fix crash on idle-ping due to unwanted ABORT_NOW - BUG/MINOR: quic-be: missing Initial packet number space discarding - BUG/MEDIUM: quic-be: crash after backend CID allocation failures - BUG/MEDIUM: ssl: apply ssl-f-use on every "ssl" bind - BUG/MAJOR: stream: Remove READ/WRITE events on channels after analysers eval - MINOR: dns: dns_connect_nameserver: fix fd leak at error path - BUG/MEDIUM: quic: reset padding when building GSO datagrams - BUG/MINOR: quic: do not emit probe data if CONNECTION_CLOSE requested - BUG/MAJOR: quic: fix INITIAL padding with probing packet only - BUG/MINOR: quic: don't coalesce probing and ACK packet of same type - MINOR: quic: centralize padding for HP sampling on packet building - MINOR: http_ana: fix typo in http_res_get_intercept_rule - BUG/MEDIUM: http_ana: handle yield for "stats http-request" evaluation - MINOR: applet: Rely on applet flag to detect the new api - MINOR: applet: Add function to test applet flags from the appctx - MINOR: applet: Add a flag to know an applet is using HTX buffers - MINOR: applet: Make some applet functions HTX aware - MEDIUM: applet: Set .rcv_buf and .snd_buf functions on default ones if not set - BUG/MEDIUM: mux-spop: Reject connection attempts from a non-spop frontend - REGTESTS: jwt: create dynamically "cert.ecdsa.pem" - BUG/MEDIUM: spoe: Improve error detection in SPOE applet on client abort - MINOR: haproxy: abort config parsing on fatal errors for post parsing hooks - MEDIUM: server: split srv_init() in srv_preinit() + srv_postinit() - MINOR: proxy: handle shared listener counters preparation from proxy_postcheck() - DOC: configuration: reword 'generate-certificates' - BUG/MEDIUM: quic-be: avoid crashes when releasing Initial pktns - BUG/MINOR: quic: reorder fragmented RX CRYPTO frames by their offsets - MINOR: ssl: diagnostic warning when both 'default-crt' and 'strict-sni' are used - MEDIUM: ssl: convert diag to warning for strict-sni + default-crt - DOC: configuration: clarify 'default-crt' and implicit default certificates - MINOR: quic: remove ->offset qf_crypto struct field - BUG/MINOR: mux-quic: trace with non initialized qcc - BUG/MINOR: acl: set arg_list->kw to aclkw->kw string literal if aclkw is found - BUG/MEDIUM: mworker: fix startup and reload on macOS - BUG/MINOR: connection: rearrange union list members - BUG/MINOR: connection: remove extra session_unown_conn() on reverse - MINOR: cli: display failure reason on wait command - BUG/MINOR: server: decrement session idle_conns on del server - BUG/MINOR: mux-quic: do not access conn after idle list insert - MINOR: session: document explicitely that session_add_conn() is safe - MINOR: session: uninline functions related to BE conns management - MINOR: session: refactor alloc/lookup of sess_conns elements - MEDIUM: session: protect sess conns list by idle_conns_lock - MINOR: server: shard by thread sess_conns member - MEDIUM: server: close new idle conns if server in maintenance - MEDIUM: session: close new idle conns if server in maintenance - MINOR: server: cleanup idle conns for server in maint already stopped - MINOR: muxes: enforce thread-safety for private idle conns - MEDIUM: conn/muxes/ssl: reinsert BE priv conn into sess on IO completion - MEDIUM: conn/muxes/ssl: remove BE priv idle conn from sess on IO - MEDIUM: mux-quic: enforce thread-safety of backend idle conns - MAJOR: server: implement purging of private idle connections - MEDIUM: session: account on server idle conns attached to session - MAJOR: server: do not remove idle conns in del server - BUILD: mworker: fix ignoring return value of ‘read’ - DOC: unreliable sockpair@ on macOS - MINOR: muxes: adjust takeover with buf_wait interaction - OPTIM: backend: set release on takeover for strict maxconn - DOC: configuration: confuse "strict-mode" with "zero-warning" - MINOR: doc: add missing statistics column - MINOR: doc: add missing statistics column - MINOR: stats: display new curr_sess_idle_conns server counter - MINOR: proxy: extend "show servers conn" output - MEDIUM: proxy: Reject some header names for 'http-send-name-header' directive - BUG/BUILD: stats: fix build due to missing stat enum definition - DOC: proxy-protocol: Make example for PP2_SUBTYPE_SSL_SIG_ALG accurate - CLEANUP: quic: remove a useless CRYPTO frame variable assignment - BUG/MEDIUM: quic: CRYPTO frame freeing without eb_delete() - BUG/MAJOR: mux-quic: fix crash on reload during emission - MINOR: conn/muxes/ssl: add ASSUME_NONNULL() prior to _srv_add_idle - REG-TESTS: map_redirect: Don't use hdr_dom in ACLs with "-m end" matching method - MINOR: acl: Only allow one '-m' matching method - MINOR: acl; Warn when matching method based on a suffix is overwritten - BUG/MEDIUM: server: Duplicate healthcheck's alpn inherited from default server - BUG/MINOR: server: Duplicate healthcheck's sni inherited from default server - BUG/MINOR: acl: Properly detect overwritten matching method - BUG/MINOR: halog: Add OOM checks for calloc() in filter_count_srv_status() and filter_count_url() - BUG/MINOR: log: Add OOM checks for calloc() and malloc() in logformat parser and dup_logger() - BUG/MINOR: acl: Add OOM check for calloc() in smp_fetch_acl_parse() - BUG/MINOR: cfgparse: Add OOM check for calloc() in cfg_parse_listen() - BUG/MINOR: compression: Add OOM check for calloc() in parse_compression_options() - BUG/MINOR: tools: Add OOM check for malloc() in indent_msg() - BUG/MINOR: quic: ignore AGAIN ncbuf err when parsing CRYPTO frames - MINOR: quic/flags: complete missing flags - BUG/MINOR: quic: fix room check if padding requested - BUG/MINOR: quic: fix padding issue on INITIAL retransmit - BUG/MINOR: quic: pad Initial pkt with CONNECTION_CLOSE on client - MEDIUM: quic: strengthen BUG_ON() for unpad Initial packet on client - DOC: configuration: rework the jwt_verify keyword documentation - BUG/MINOR: haproxy: be sure not to quit too early on soft stop - BUILD: acl: silence a possible null deref warning in parse_acl_expr() - MINOR: quic: Add more information about RX packets - CI: fix syntax of Quic Interop pipelines - MEDIUM: cfgparse: warn when using user/group when built statically - BUG/MEDIUM: stick-tables: don't leave the expire loop with elements deleted - BUG/MINOR: stick-tables: never leave used entries without expiration - BUG/MEDIUM: peers: don't fail twice to grab the update lock - MINOR: stick-tables: limit the number of visited nodes during expiration - OPTIM: stick-tables: exit expiry faster when the update lock is held - MINOR: counters: retrieve detailed errmsg upon failure with counters_{fe,be}_shared_prepare() - MINOR: stats-file: introduce shm-stats-file directive - MEDIUM: stats-file: processes share the same clock source from shm-stats-file - MINOR: stats-file: add process slot management for shm stats file - MEDIUM: stats-file/counters: store and preload stats counters as shm file objects - DOC: config: document "shm-stats-file" directive - OPTIM: stats-file: don't unnecessarily die hard on shm_stats_file_reuse_object() - MINOR: compiler: add ALWAYS_PAD() macro - BUILD: stats-file: fix aligment issues - MINOR: stats-file: reserve some bytes in exported structs - MEDIUM: stats-file: add some BUG_ON() guards to ensure exported structs are not changed by accident - BUG/MINOR: check: ensure check-reuse is compatible with SSL - BUG/MINOR: check: fix dst address when reusing a connection - REGTESTS: explicitly use "balance roundrobin" where RR is needed - MAJOR: backend: switch the default balancing algo to "random" - BUG/MEDIUM: conn: fix UAF on connection after reversal on edge - BUG/MINOR: connection: streamline conn detach from lists - BUG/MEDIUM: quic-be: too early SSL_SESSION initialization - BUG/MINOR: log: fix potential memory leak upon error in add_to_logformat_list() - MEDIUM: init: always warn when running as root without being asked to - MINOR: sample: Add base2 converter - MINOR: version: add -vq, -vqb, and -vqs flags for concise version output - BUILD: trace: silence a bogus build warning at -Og - MINOR: trace: accept trace spec right after "-dt" on the command line - BUILD: makefile: bump the default minimum linux version to 4.17 v3.3-dev8	2025-09-05 09:54:34 +02:00
Willy Tarreau	85ac6a6f7b	BUILD: makefile: bump the default minimum linux version to 4.17 As explained during the 3.3-dev7 announcement below: https://www.mail-archive.com/haproxy@formilux.org/msg46073.html no regularly maintained distro supports a kernel older than 4.18 anymore, and KTLS is supported since 4.17. So it's about the right moment to bump the default minimum kernel version supported by glibc and musl to automatically cover new features. The linux-glibc-legacy target still supports 2.6.28 and above.	2025-09-05 09:44:56 +02:00
Willy Tarreau	670dc299d3	MINOR: trace: accept trace spec right after "-dt" on the command line I continue to mistakenly set the traces using "-dtXXX" and to have to refer to the doc to figure that it requires a separate argument and differs from some other options. Worse, "-dthelp" doesn't say anything and silently ignores the argument. Let's make the parser take whatever follows "-dt" as the argument if present, otherwise take the next one (as it currently does). Doing this even allows to simplify the code, and is easier to figure the syntax since "-dthelp" now works.	2025-09-05 09:33:28 +02:00
Willy Tarreau	abfd6f3b93	BUILD: trace: silence a bogus build warning at -Og gcc-13.3 at -Og emits an incorrect build warning in trace.c about a possibly initialized variable: In file included from include/haproxy/api.h:35, from src/trace.c:22: src/trace.c: In function 'trace_parse_cmd': include/haproxy/bug.h:431:17: warning: 'arg' may be used uninitialized [-Wmaybe-uninitialized] 431 \| free(__x); \ \| ^~~~~~~~~~ src/trace.c:1136:9: note: in expansion of macro 'ha_free' 1136 \| ha_free(&oarg); \| ^~~~~~~ src/trace.c:1008:15: note: 'arg' was declared here 1008 \| char arg, *oarg; \| ^~~ The warning is obviously wrong since the field is initialized in one of the two branches of an "if" whose complementary one returns. But the compiler doesn't seem to see this because the if is in fact two ifs each with an opposite condition: "if (arg_src)" then "if (!arg_src)". Let's just move upwards the default one that returns and eliminate the other one. Reading the diff with "git diff -b" better shows the tiny change. It could be backported to 3.0.	2025-09-05 09:19:24 +02:00
Nikita Kurashkin	ef73fe2584	MINOR: version: add -vq, -vqb, and -vqs flags for concise version output This patch introduces three new command line flags to display HAProxy version info more flexibly: - `-vqs` outputs the short version string without commit info (e.g., "3.3.1"). - `-vqb` outputs only the branch (major.minor) part of the version (e.g., "3.3"). - `-vq` outputs the full version string with suffixes (e.g., "3.3.1-dev5-1bb975-71"). This allows easier parsing of version info in automation while keeping existing -v and -vv behaviors. The command line argument parsing now calls `display_version_plain()` with a display_mode parameter to select the desired output format. The function handles stripping of commit or patch info as needed, depending on the mode. Signed-off-by: Nikita Kurashkin <nkurashkin@stsoft.ru>	2025-09-05 08:57:57 +02:00
Maximilian Moehl	5d9abc68b4	MINOR: sample: Add base2 converter This commit adds the base2 converter to turn binary input into it's string representation. Each input byte is converted into a series of eight characters which are either 0s and 1s by bit-wise comparison.	2025-09-05 08:51:51 +02:00
Willy Tarreau	a6986e1cd6	MEDIUM: init: always warn when running as root without being asked to Like many exposed network deamons, haproxy does normally not need to run as root and strongly recommends against this, unless strictly necessary. On some operating systems, capabilities even totally alleviate this need. Lately, maybe due to a raise of containerization or automated config generation or a bit of both, we've observed a resurgence of this bad practice, possibly due to the fact that users are just not aware of the conditions they're using their daemon. Let's add a warning at boot when starting as root without having requested it using "uid" or "user". And take this opportunity for warning the user about the existence of capabilities when supported, and encouraging the use of a chroot. This is achieved by leaving global.uid set to -1 by default, allowing us to detect if it was explicitly set or not.	2025-09-05 08:51:07 +02:00
Aurelien DARRAGON	c97ced3f93	BUG/MINOR: log: fix potential memory leak upon error in add_to_logformat_list() As reported on GH #3099, upon memory error add_to_logformat_list() will return and error but it fails to properly memory which was allocated within the function, which could result in memory leak. Let's free all relevant variables allocated by the function before returning. No backport needed unless 22ac1f5ee ("("BUG/MINOR: log: Add OOM checks for calloc() and malloc() in logformat parser and dup_logger()") is.	2025-09-04 23:07:22 +02:00
Frederic Lecaille	842f32f3f1	BUG/MEDIUM: quic-be: too early SSL_SESSION initialization When an SNI is set on a QUIC server line, ssl_sock_set_servername() is called from connect_server() (backend.c). This leads some BUG_ON() to be triggered because the CO_FL_WAIT_L6_CONN \| CO_FL_SSL_WAIT_HS were not set. This must be done into the ->init() xprt callback. This patch move the flags settings from ->start() to ->init() callback. Indeed, connect_server() calls these functions in this order: ->init(), ssl_sock_set_servername() # => crash if CO_FL_WAIT_L6_CONN \| CO_FL_SSL_WAIT_HS not set ->start() Furthermore ssl_sock_set_servername() has a side effect to reset the SSL_SESSION object (attached to SSL object) calling SSL_set_session(), leading to crashes as follows: [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Core was generated by `./haproxy -f quic_srv.cfg'. Program terminated with signal SIGSEGV, Segmentation fault. #0 tls_process_server_hello (s=0x560c259733b0, pkt=0x7fffac239f20) at ssl/statem/statem_clnt.c:1624 1624 if (s->session->session_id_length > 0) { [Current thread is 1 (Thread 0x7fc364e53dc0 (LWP 35514))] (gdb) bt #0 tls_process_server_hello (s=0x560c259733b0, pkt=0x7fffac239f20) at ssl/statem/statem_clnt.c:1624 #1 0x00007fc36540fba4 in ossl_statem_client_process_message (s=0x560c259733b0, pkt=0x7fffac239f20) at ssl/statem/statem_clnt.c:1042 #2 0x00007fc36540d028 in read_state_machine (s=0x560c259733b0) at ssl/statem/statem.c:646 #3 0x00007fc36540ca70 in state_machine (s=0x560c259733b0, server=0) at ssl/statem/statem.c:439 #4 0x00007fc36540c576 in ossl_statem_connect (s=0x560c259733b0) at ssl/statem/statem.c:250 #5 0x00007fc3653f1698 in SSL_do_handshake (s=0x560c259733b0) at ssl/ssl_lib.c:3835 #6 0x0000560c22620327 in qc_ssl_do_hanshake (qc=qc@entry=0x560c25961f60, ctx=ctx@entry=0x560c25963020) at src/quic_ssl.c:863 #7 0x0000560c226210be in qc_ssl_provide_quic_data (len=90, data=<optimized out>, ctx=0x560c25963020, level=ssl_encryption_initial, ncbuf=0x560c2588bb18) at src/quic_ssl.c:1071 #8 qc_ssl_provide_all_quic_data (qc=qc@entry=0x560c25961f60, ctx=0x560c25963020) at src/quic_ssl.c:1123 #9 0x0000560c2260ca5f in quic_conn_io_cb (t=0x560c25962f80, context=0x560c25961f60, state=<optimized out>) at src/quic_conn.c:791 #10 0x0000560c228255ed in run_tasks_from_lists (budgets=<optimized out>) at src/task.c:648 #11 0x0000560c22825f7a in process_runnable_tasks () at src/task.c:889 #12 0x0000560c22793dc7 in run_poll_loop () at src/haproxy.c:2836 #13 0x0000560c22794481 in run_thread_poll_loop (data=<optimized out>) at src/haproxy.c:3056 #14 0x0000560c2259082d in main (argc=<optimized out>, argv=<optimized out>) at src/haproxy.c:3667 <s> is the SSL object, and <s->session> is the SSL_SESSION object. For the client, this is the first call do SSL_do_handshake() which initializes this SSL_SESSION object from ->init() xpt callback. Then it is reset by ssl_sock_set_servername(), then tls_process_server_hello() TLS stack is called with NULL value for s->session when receiving the ServerHello TLS message. To fix this, simply move the first call to SSL_do_handshake to ->start xprt call back (qc_xprt_start()). No need to backport.	2025-09-04 20:49:06 +02:00
Amaury Denoyelle	687df405fe	BUG/MINOR: connection: streamline conn detach from lists Over their lifetime, connections are attached to different list. These lists depends on whether connection is on frontend or backend side. Attach point members are stored via a union in struct connection. The next commit reorganizes them so that a proper frontend/backend separation is performed : commit a96f1286a75246fef6db3e615fabdef1de927d83 BUG/MINOR: connection: rearrange union list members On conn_free(), connection instance must be removed from these lists to ensure there is no use-after-free case. However code was still shaky there, despite no real issue. Indeed, <toremove_list> was detached for all connections, despite being only used on backend side only. This patch streamlines the freeing of connection. Now, <toremove_list> detach is performed in conn_backend_deinit(). Moreover, a new helper conn_frontend_deinit() is defined. It ensures that <stopping_list> detach is done. Prior it was performed individually by muxes. Note that a similar procedure is performed when the connection is reversed. Hence, conn_frontend_deinit() is now used here as well, rendering reversal from FE to BE or vice versa symmetrical. As mentionned above, no crash occured prior to this patch, but the code was fragile, in particular access to <toremove_list> for frontend connections. Thus this patch is considered as a bug fix worthy of a backport along with above mentionned patch, currently up to 3.0.	2025-09-04 18:31:20 +02:00
Amaury Denoyelle	27ff7ff296	BUG/MEDIUM: conn: fix UAF on connection after reversal on edge When a connection is reversed, some elements must be resetted prior to reusing it. Most notably, connection must be removed from lists specific on frontend/backend sides. When reverse was performed for frontend to backend side, connection was not removed via its <stopping_list> attach point. On previous releases, this did not cause any issue. However, crashes start to occur recently, probably due to the recent reorganization of connection list attach points from the following patch. commit a96f1286a75246fef6db3e615fabdef1de927d83 BUG/MINOR: connection: rearrange union list members To fix this, simply ensure that <stopping_list> detach is performed via conn_reverse(). This patch must be backported up to 3.0 release.	2025-09-04 18:13:35 +02:00
Willy Tarreau	93cc18ac42	MAJOR: backend: switch the default balancing algo to "random" For many years, an unset load balancing algorithm would use "roundrobin". It was shown several times that "random" with at least 2 draws (the default) generally provides better performance and fairness in that it will automatically adapt to the server's load and capacity. This was further described with numbers in this discussion: https://www.mail-archive.com/haproxy@formilux.org/msg46011.html https://github.com/orgs/haproxy/discussions/3042 BTW there were no objection and only support for the change. The goal of this patch is to change the default algo when none is specified, from "roundrobin" to "random". This way, users who don't care and don't set the load balancing algorithm will benefit from a better one in most cases, while those who have good reasons to prefer roundrobin (for session affinity or for reproducible sequences like used in regtests) can continue to specify it. The vast majority of users should not notice a difference.	2025-09-04 08:30:35 +02:00
Willy Tarreau	60931ceae9	REGTESTS: explicitly use "balance roundrobin" where RR is needed A few tests explicitly rely on the server ordering granted by "balance roundrobin", but didn't specify the balance algorithm. As it will change soon, let's explicit it.	2025-09-04 08:18:53 +02:00
Amaury Denoyelle	9410b2ab97	BUG/MINOR: check: fix dst address when reusing a connection The keyword check-reuse-pool allows to reuse an idle connection to perform a health check instead of opening a new one. It is implemented similarly to HTTP transfer reuse : a hash is calculated with a subset of properties to lookup a connection with the same characteristics. One of these properties is the destination address. Initially it was always set to NULL prior to reuse check, as this is necessary to match connections on a reverse-HTTP server. However, this prevents reuse on other servers with a proper address configured. Indeed, in this case destination address is always used as key for connections inserted in idle pool. This patch fixes this by properly setting destination address for check reuse. By default, it reuses the address from the server. The only exception is if the server is using reverse-HTTP, in which case address remains NULL. A new test is also performed prior to try check reuse to ensure this is not performed on a transparent server. Indeed, in this case server address would be unset. Anyway, check cannot reuse a connection in this case so this is OK. Note that this does not prevent to continue check with a newly connection with a NULL address : this should be handled more properly in another patch. This must be backported up to 3.2.	2025-09-03 16:58:14 +02:00
Amaury Denoyelle	6d3c3c7871	BUG/MINOR: check: ensure check-reuse is compatible with SSL SSL may be activated implicitely if a server relies on SSL, even without check-ssl keyword. This is performed by init_srv_check() function. The main operation is to change xprt layer for check to SSL. Prior to this patch, <use_ssl> check member was also set, despite not strictly necessary. This has a negative side-effect of rendering check-reuse-pool ineffective. Indeed, reuse on check is only performed if no specific check configuration has been specified (see tcpcheck_use_nondefault_connect()). This patch fixes check reuse with SSL : <use_ssl> is not set in case SSL is inherited implicitely from server configuration. Thus, <use_ssl> is now only set if an explicit check-ssl keyword is set, which disables connection reuse for check. This must be backported up to 3.2.	2025-09-03 16:54:48 +02:00
Aurelien DARRAGON	f32bc8f0a4	MEDIUM: stats-file: add some BUG_ON() guards to ensure exported structs are not changed by accident Add two BUG_ON() in shm_stats_file_prepare() which will trigger if exported structures (shm_stats_file_hdr and shm_stats_file_object) change in size, because it means that they will become incompatible with older versions and thus precautions should be taken by the developer to ensure compatibility with olders versions, or at least detect incompatible versions by changing the version number to prevent bugs resulting from inconsistent mapping between versions. The BUG_ON() may be safely adjusted then. Please note that it doesn't protect against accidental struct member re-ordering if the resulting struct size is equal..	2025-09-03 16:29:55 +02:00
Aurelien DARRAGON	1a1362ea0b	MINOR: stats-file: reserve some bytes in exported structs We may need additional struct members in shm_stats_file_object and shm_stats_file_hdr, yet since these structs are exported they should not change in size nor ordering else it would require a version change to break compability on purpose since mapping would differ. Here we reserve 64 additional bytes in shm_stats_file_object, and 128 bytes in shm_stats_file_hdr for future usage.	2025-09-03 16:29:48 +02:00
Aurelien DARRAGON	21d97ccfae	BUILD: stats-file: fix aligment issues Document some byte holes and fix some potential aligment issues between 32 and 64 bits architectures to ensure the shm_stats_file memory mapping is consistent between operating systems.	2025-09-03 16:28:46 +02:00
Aurelien DARRAGON	46a5948ed2	MINOR: compiler: add ALWAYS_PAD() macro same as THREAD_PAD() but doesn't depend on haproxy being compiled with thread support. It may be useful for memory (or files) that may be shared between multiple processed.	2025-09-03 16:28:46 +02:00
Aurelien DARRAGON	cf2562cddf	OPTIM: stats-file: don't unnecessarily die hard on shm_stats_file_reuse_object() shm_stats_file_reuse_object() has a non negligible cost, especially if the shm file contains a lot of objects because the functions scans the whole shm file to find available slots. During startup, if no existing objects could be mapped in the shm file shm_stats_file_add_object() for each object (server, fe, be or listener) with a GUID set. On large config it means shm_stats_file_add_object() could be called a lot of times in a row. With current implementation, each shm_stats_file_add_object() call leverages shm_stats_file_reuse_object(), so the more objects are defined in the config, the slower the startup will be. To try to optimize startup time a bit with large configs, we don't sytematically call shm_stats_file_reuse_object(), especially when we know that the previous attempt to reuse objects failed. In this case we add a small tempo between failed attempts to reuse objects because we assume the new attempt will probably fail anyway. (For slots to become available, either an old process has to clean its entries, or they have to time out which implies that the clock needs to be updated)	2025-09-03 16:28:41 +02:00
Aurelien DARRAGON	16abfb6e06	DOC: config: document "shm-stats-file" directive Add some documentation for "shm-stats-file" and "shm-stats-file-max-objects" experimental directives related to the use of shared memory for storing stats counters (see previous commits for implementation details)	2025-09-03 15:59:42 +02:00
Aurelien DARRAGON	585ece4c92	MEDIUM: stats-file/counters: store and preload stats counters as shm file objects This is the last patch of the shm stats file series, in this patch we implement the logic to store and fetch shm stats objects and associate them to existing shared counters on the current process. Shm objects are stored in the same memory location as the shm stats file header. In fact they are stored right after it. All objects (struct shm_stats_file_object) have the same size (no matter their type), which allows for easy object traversal without having to check the object's type, and could permit the use of external tools to scan the SHM in the future. Each object stores a guid (of GUID_MAX_LEN+1 size) and tgid which allows to match corresponding shared counters indexes. Also, as stated before, each object stores the list of users making use of it. Objects are never released (the map can only grow), but unused objects (when no more users or active users are found in objects->users), the object is automatically recycled. Also, each object stores its type which defines how the object generic data member should be handled. Upon startup (or reload), haproxy first tries to scan existing shm to find objects that could be associated to frontends, backends, listeners or servers in the current config based on GUID. For associations that couldn't be made, haproxy will automatically create missing objects in the SHM during late startup. When haproxy matches with an existing object, it means the counter from an older process is preserved in the new process, so multiple processes temporarily share the same counter for as long as required for older processes to eventually exit.	2025-09-03 15:59:37 +02:00
Aurelien DARRAGON	ee17d20245	MINOR: stats-file: add process slot management for shm stats file Now that all processes tied to the same shm stats file now share a common clock source, we introduce the process slot notion in this patch. Each living process registers itself in a map at a free index: each slot stores information about the process' PID and heartbeat. Each process is responsible for updating its heartbeat, a slot is considered as "free" if the heartbeat was never set or if the heartbeat is expired (60 seconds of inactivity). The total number of slots is set to 64, this is on purpose because it allows to easily store the "users" of a given shm object using a 64 bits bitmask. Given that when haproxy is reloaded olders processes are supposed to die eventually, it should be large enough (64 simultaneous processes) to be safe. If we manage to reach this limit someday, more slots could be added by splitting "users" bitmask on multiple 64bits variable.	2025-09-03 15:59:33 +02:00
Aurelien DARRAGON	443e657fd6	MEDIUM: stats-file: processes share the same clock source from shm-stats-file The use of the "shm-stats-file" directive now implies that all processes using the same file now share a common clock source, this is required for consistency regarding time-related operations. The clock source is stored in the shm stats file header. When the directive is set, all processes share the same clock (global_now_ms and global_now_ns both point to variables in the map), this is required for time-based counters such as freq counters to work consistently. Since all processes manipulate global clock with atomic operations exclusively during runtime, and don't systematically relies on it (thanks to local now_ms and now_ns), it is pretty much transparent.	2025-09-03 15:59:27 +02:00
Aurelien DARRAGON	c91d93ed1c	MINOR: stats-file: introduce shm-stats-file directive add initial support for the "shm-stats-file" directive and associated "shm-stats-file-max-objects" directive. For now they are flagged as experimental directives. The shared memory file is automatically created by the first process. The file is created using open() so it is up to the user to provide relevant path (either on regular filesystem or ramfs for performance reasons). The directive takes only one argument which is path of the shared memory file. It is passed as-is to open(). The maximum number of objects per thread-group (hard limit) that can be stored in the shm is defined by "shm-stats-file-max-objects" directive, Upon initial creation, the main shm stats file header is provisioned with the version which must remains the same to be compatible between processes and defaults to 2k. which means approximately 1mb max per thread group and should cover most setups. When the limit is reached (during startup) an error is reported by haproxy which invites the user to increase the "shm-stats-file-max-objects" if desired, but this means more memory will be allocated. Actual memory usage is low at start, because only the mmap (mapping) is provisionned with the maximum number of objects to avoid relocating the memory area during runtime, but the actual shared memory file is dynamically resized when objects are added (resized by following half power of 2 curve when new objects are added, see upcoming commits) For now only the file is created, further logic will be implemented in upcoming commits.	2025-09-03 15:59:22 +02:00
Aurelien DARRAGON	cb08bcb9d6	MINOR: counters: retrieve detailed errmsg upon failure with counters_{fe,be}_shared_prepare() counters_{fe,be}_shared_prepare now take an extra <errmsg> parameter that contains additional hints about the error in case of failure. It must be freed accordingly since it is allocated using memprintf	2025-09-03 15:59:17 +02:00
Willy Tarreau	46463d6850	OPTIM: stick-tables: exit expiry faster when the update lock is held It helps keep the contention level low: when we hold the update lock that we know other parts may be relying on (peers, track-sc etc), we decrease the remaining visit counters 4 times as fast to further reduce the contention. At this point no more warnings are seen during intense synchronization (2x64 cores, 1.5M req/s with a track-sc each, 5M entries in use).	2025-09-03 15:51:13 +02:00
Willy Tarreau	696793205b	MINOR: stick-tables: limit the number of visited nodes during expiration As reported by Felipe in GH issue #3084, on large systems it's not sufficient to leave the expiration process after a certain number of expired entries, because if they accumulate too fast, it's possible to still spend some time visiting many (e.g. those still in use), which takes time. Thus here we're taking a stricter approach consisting in counting the number of visited entries, which allows to leave early if we can't do the expected work in a reasonable amount of time. In order to avoid always stopping on first shards and never visiting last ones, we're always starting from a random shard number and looping from that one. This way even if we always leave early, all shards will be handled equally. This should be backported to 3.2.	2025-09-03 15:51:13 +02:00

... 2 3 4 5 6 ...

25449 Commits