haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-10-27 06:31:23 +01:00

Author	SHA1	Message	Date
Aurelien DARRAGON	c97ced3f93	BUG/MINOR: log: fix potential memory leak upon error in add_to_logformat_list() As reported on GH #3099, upon memory error add_to_logformat_list() will return and error but it fails to properly memory which was allocated within the function, which could result in memory leak. Let's free all relevant variables allocated by the function before returning. No backport needed unless 22ac1f5ee ("("BUG/MINOR: log: Add OOM checks for calloc() and malloc() in logformat parser and dup_logger()") is.	2025-09-04 23:07:22 +02:00
Frederic Lecaille	842f32f3f1	BUG/MEDIUM: quic-be: too early SSL_SESSION initialization When an SNI is set on a QUIC server line, ssl_sock_set_servername() is called from connect_server() (backend.c). This leads some BUG_ON() to be triggered because the CO_FL_WAIT_L6_CONN \| CO_FL_SSL_WAIT_HS were not set. This must be done into the ->init() xprt callback. This patch move the flags settings from ->start() to ->init() callback. Indeed, connect_server() calls these functions in this order: ->init(), ssl_sock_set_servername() # => crash if CO_FL_WAIT_L6_CONN \| CO_FL_SSL_WAIT_HS not set ->start() Furthermore ssl_sock_set_servername() has a side effect to reset the SSL_SESSION object (attached to SSL object) calling SSL_set_session(), leading to crashes as follows: [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Core was generated by `./haproxy -f quic_srv.cfg'. Program terminated with signal SIGSEGV, Segmentation fault. #0 tls_process_server_hello (s=0x560c259733b0, pkt=0x7fffac239f20) at ssl/statem/statem_clnt.c:1624 1624 if (s->session->session_id_length > 0) { [Current thread is 1 (Thread 0x7fc364e53dc0 (LWP 35514))] (gdb) bt #0 tls_process_server_hello (s=0x560c259733b0, pkt=0x7fffac239f20) at ssl/statem/statem_clnt.c:1624 #1 0x00007fc36540fba4 in ossl_statem_client_process_message (s=0x560c259733b0, pkt=0x7fffac239f20) at ssl/statem/statem_clnt.c:1042 #2 0x00007fc36540d028 in read_state_machine (s=0x560c259733b0) at ssl/statem/statem.c:646 #3 0x00007fc36540ca70 in state_machine (s=0x560c259733b0, server=0) at ssl/statem/statem.c:439 #4 0x00007fc36540c576 in ossl_statem_connect (s=0x560c259733b0) at ssl/statem/statem.c:250 #5 0x00007fc3653f1698 in SSL_do_handshake (s=0x560c259733b0) at ssl/ssl_lib.c:3835 #6 0x0000560c22620327 in qc_ssl_do_hanshake (qc=qc@entry=0x560c25961f60, ctx=ctx@entry=0x560c25963020) at src/quic_ssl.c:863 #7 0x0000560c226210be in qc_ssl_provide_quic_data (len=90, data=<optimized out>, ctx=0x560c25963020, level=ssl_encryption_initial, ncbuf=0x560c2588bb18) at src/quic_ssl.c:1071 #8 qc_ssl_provide_all_quic_data (qc=qc@entry=0x560c25961f60, ctx=0x560c25963020) at src/quic_ssl.c:1123 #9 0x0000560c2260ca5f in quic_conn_io_cb (t=0x560c25962f80, context=0x560c25961f60, state=<optimized out>) at src/quic_conn.c:791 #10 0x0000560c228255ed in run_tasks_from_lists (budgets=<optimized out>) at src/task.c:648 #11 0x0000560c22825f7a in process_runnable_tasks () at src/task.c:889 #12 0x0000560c22793dc7 in run_poll_loop () at src/haproxy.c:2836 #13 0x0000560c22794481 in run_thread_poll_loop (data=<optimized out>) at src/haproxy.c:3056 #14 0x0000560c2259082d in main (argc=<optimized out>, argv=<optimized out>) at src/haproxy.c:3667 <s> is the SSL object, and <s->session> is the SSL_SESSION object. For the client, this is the first call do SSL_do_handshake() which initializes this SSL_SESSION object from ->init() xpt callback. Then it is reset by ssl_sock_set_servername(), then tls_process_server_hello() TLS stack is called with NULL value for s->session when receiving the ServerHello TLS message. To fix this, simply move the first call to SSL_do_handshake to ->start xprt call back (qc_xprt_start()). No need to backport.	2025-09-04 20:49:06 +02:00
Amaury Denoyelle	687df405fe	BUG/MINOR: connection: streamline conn detach from lists Over their lifetime, connections are attached to different list. These lists depends on whether connection is on frontend or backend side. Attach point members are stored via a union in struct connection. The next commit reorganizes them so that a proper frontend/backend separation is performed : commit a96f1286a75246fef6db3e615fabdef1de927d83 BUG/MINOR: connection: rearrange union list members On conn_free(), connection instance must be removed from these lists to ensure there is no use-after-free case. However code was still shaky there, despite no real issue. Indeed, <toremove_list> was detached for all connections, despite being only used on backend side only. This patch streamlines the freeing of connection. Now, <toremove_list> detach is performed in conn_backend_deinit(). Moreover, a new helper conn_frontend_deinit() is defined. It ensures that <stopping_list> detach is done. Prior it was performed individually by muxes. Note that a similar procedure is performed when the connection is reversed. Hence, conn_frontend_deinit() is now used here as well, rendering reversal from FE to BE or vice versa symmetrical. As mentionned above, no crash occured prior to this patch, but the code was fragile, in particular access to <toremove_list> for frontend connections. Thus this patch is considered as a bug fix worthy of a backport along with above mentionned patch, currently up to 3.0.	2025-09-04 18:31:20 +02:00
Amaury Denoyelle	27ff7ff296	BUG/MEDIUM: conn: fix UAF on connection after reversal on edge When a connection is reversed, some elements must be resetted prior to reusing it. Most notably, connection must be removed from lists specific on frontend/backend sides. When reverse was performed for frontend to backend side, connection was not removed via its <stopping_list> attach point. On previous releases, this did not cause any issue. However, crashes start to occur recently, probably due to the recent reorganization of connection list attach points from the following patch. commit a96f1286a75246fef6db3e615fabdef1de927d83 BUG/MINOR: connection: rearrange union list members To fix this, simply ensure that <stopping_list> detach is performed via conn_reverse(). This patch must be backported up to 3.0 release.	2025-09-04 18:13:35 +02:00
Willy Tarreau	93cc18ac42	MAJOR: backend: switch the default balancing algo to "random" For many years, an unset load balancing algorithm would use "roundrobin". It was shown several times that "random" with at least 2 draws (the default) generally provides better performance and fairness in that it will automatically adapt to the server's load and capacity. This was further described with numbers in this discussion: https://www.mail-archive.com/haproxy@formilux.org/msg46011.html https://github.com/orgs/haproxy/discussions/3042 BTW there were no objection and only support for the change. The goal of this patch is to change the default algo when none is specified, from "roundrobin" to "random". This way, users who don't care and don't set the load balancing algorithm will benefit from a better one in most cases, while those who have good reasons to prefer roundrobin (for session affinity or for reproducible sequences like used in regtests) can continue to specify it. The vast majority of users should not notice a difference.	2025-09-04 08:30:35 +02:00
Amaury Denoyelle	9410b2ab97	BUG/MINOR: check: fix dst address when reusing a connection The keyword check-reuse-pool allows to reuse an idle connection to perform a health check instead of opening a new one. It is implemented similarly to HTTP transfer reuse : a hash is calculated with a subset of properties to lookup a connection with the same characteristics. One of these properties is the destination address. Initially it was always set to NULL prior to reuse check, as this is necessary to match connections on a reverse-HTTP server. However, this prevents reuse on other servers with a proper address configured. Indeed, in this case destination address is always used as key for connections inserted in idle pool. This patch fixes this by properly setting destination address for check reuse. By default, it reuses the address from the server. The only exception is if the server is using reverse-HTTP, in which case address remains NULL. A new test is also performed prior to try check reuse to ensure this is not performed on a transparent server. Indeed, in this case server address would be unset. Anyway, check cannot reuse a connection in this case so this is OK. Note that this does not prevent to continue check with a newly connection with a NULL address : this should be handled more properly in another patch. This must be backported up to 3.2.	2025-09-03 16:58:14 +02:00
Amaury Denoyelle	6d3c3c7871	BUG/MINOR: check: ensure check-reuse is compatible with SSL SSL may be activated implicitely if a server relies on SSL, even without check-ssl keyword. This is performed by init_srv_check() function. The main operation is to change xprt layer for check to SSL. Prior to this patch, <use_ssl> check member was also set, despite not strictly necessary. This has a negative side-effect of rendering check-reuse-pool ineffective. Indeed, reuse on check is only performed if no specific check configuration has been specified (see tcpcheck_use_nondefault_connect()). This patch fixes check reuse with SSL : <use_ssl> is not set in case SSL is inherited implicitely from server configuration. Thus, <use_ssl> is now only set if an explicit check-ssl keyword is set, which disables connection reuse for check. This must be backported up to 3.2.	2025-09-03 16:54:48 +02:00
Aurelien DARRAGON	f32bc8f0a4	MEDIUM: stats-file: add some BUG_ON() guards to ensure exported structs are not changed by accident Add two BUG_ON() in shm_stats_file_prepare() which will trigger if exported structures (shm_stats_file_hdr and shm_stats_file_object) change in size, because it means that they will become incompatible with older versions and thus precautions should be taken by the developer to ensure compatibility with olders versions, or at least detect incompatible versions by changing the version number to prevent bugs resulting from inconsistent mapping between versions. The BUG_ON() may be safely adjusted then. Please note that it doesn't protect against accidental struct member re-ordering if the resulting struct size is equal..	2025-09-03 16:29:55 +02:00
Aurelien DARRAGON	cf2562cddf	OPTIM: stats-file: don't unnecessarily die hard on shm_stats_file_reuse_object() shm_stats_file_reuse_object() has a non negligible cost, especially if the shm file contains a lot of objects because the functions scans the whole shm file to find available slots. During startup, if no existing objects could be mapped in the shm file shm_stats_file_add_object() for each object (server, fe, be or listener) with a GUID set. On large config it means shm_stats_file_add_object() could be called a lot of times in a row. With current implementation, each shm_stats_file_add_object() call leverages shm_stats_file_reuse_object(), so the more objects are defined in the config, the slower the startup will be. To try to optimize startup time a bit with large configs, we don't sytematically call shm_stats_file_reuse_object(), especially when we know that the previous attempt to reuse objects failed. In this case we add a small tempo between failed attempts to reuse objects because we assume the new attempt will probably fail anyway. (For slots to become available, either an old process has to clean its entries, or they have to time out which implies that the clock needs to be updated)	2025-09-03 16:28:41 +02:00
Aurelien DARRAGON	585ece4c92	MEDIUM: stats-file/counters: store and preload stats counters as shm file objects This is the last patch of the shm stats file series, in this patch we implement the logic to store and fetch shm stats objects and associate them to existing shared counters on the current process. Shm objects are stored in the same memory location as the shm stats file header. In fact they are stored right after it. All objects (struct shm_stats_file_object) have the same size (no matter their type), which allows for easy object traversal without having to check the object's type, and could permit the use of external tools to scan the SHM in the future. Each object stores a guid (of GUID_MAX_LEN+1 size) and tgid which allows to match corresponding shared counters indexes. Also, as stated before, each object stores the list of users making use of it. Objects are never released (the map can only grow), but unused objects (when no more users or active users are found in objects->users), the object is automatically recycled. Also, each object stores its type which defines how the object generic data member should be handled. Upon startup (or reload), haproxy first tries to scan existing shm to find objects that could be associated to frontends, backends, listeners or servers in the current config based on GUID. For associations that couldn't be made, haproxy will automatically create missing objects in the SHM during late startup. When haproxy matches with an existing object, it means the counter from an older process is preserved in the new process, so multiple processes temporarily share the same counter for as long as required for older processes to eventually exit.	2025-09-03 15:59:37 +02:00
Aurelien DARRAGON	ee17d20245	MINOR: stats-file: add process slot management for shm stats file Now that all processes tied to the same shm stats file now share a common clock source, we introduce the process slot notion in this patch. Each living process registers itself in a map at a free index: each slot stores information about the process' PID and heartbeat. Each process is responsible for updating its heartbeat, a slot is considered as "free" if the heartbeat was never set or if the heartbeat is expired (60 seconds of inactivity). The total number of slots is set to 64, this is on purpose because it allows to easily store the "users" of a given shm object using a 64 bits bitmask. Given that when haproxy is reloaded olders processes are supposed to die eventually, it should be large enough (64 simultaneous processes) to be safe. If we manage to reach this limit someday, more slots could be added by splitting "users" bitmask on multiple 64bits variable.	2025-09-03 15:59:33 +02:00
Aurelien DARRAGON	443e657fd6	MEDIUM: stats-file: processes share the same clock source from shm-stats-file The use of the "shm-stats-file" directive now implies that all processes using the same file now share a common clock source, this is required for consistency regarding time-related operations. The clock source is stored in the shm stats file header. When the directive is set, all processes share the same clock (global_now_ms and global_now_ns both point to variables in the map), this is required for time-based counters such as freq counters to work consistently. Since all processes manipulate global clock with atomic operations exclusively during runtime, and don't systematically relies on it (thanks to local now_ms and now_ns), it is pretty much transparent.	2025-09-03 15:59:27 +02:00
Aurelien DARRAGON	c91d93ed1c	MINOR: stats-file: introduce shm-stats-file directive add initial support for the "shm-stats-file" directive and associated "shm-stats-file-max-objects" directive. For now they are flagged as experimental directives. The shared memory file is automatically created by the first process. The file is created using open() so it is up to the user to provide relevant path (either on regular filesystem or ramfs for performance reasons). The directive takes only one argument which is path of the shared memory file. It is passed as-is to open(). The maximum number of objects per thread-group (hard limit) that can be stored in the shm is defined by "shm-stats-file-max-objects" directive, Upon initial creation, the main shm stats file header is provisioned with the version which must remains the same to be compatible between processes and defaults to 2k. which means approximately 1mb max per thread group and should cover most setups. When the limit is reached (during startup) an error is reported by haproxy which invites the user to increase the "shm-stats-file-max-objects" if desired, but this means more memory will be allocated. Actual memory usage is low at start, because only the mmap (mapping) is provisionned with the maximum number of objects to avoid relocating the memory area during runtime, but the actual shared memory file is dynamically resized when objects are added (resized by following half power of 2 curve when new objects are added, see upcoming commits) For now only the file is created, further logic will be implemented in upcoming commits.	2025-09-03 15:59:22 +02:00
Aurelien DARRAGON	cb08bcb9d6	MINOR: counters: retrieve detailed errmsg upon failure with counters_{fe,be}_shared_prepare() counters_{fe,be}_shared_prepare now take an extra <errmsg> parameter that contains additional hints about the error in case of failure. It must be freed accordingly since it is allocated using memprintf	2025-09-03 15:59:17 +02:00
Willy Tarreau	46463d6850	OPTIM: stick-tables: exit expiry faster when the update lock is held It helps keep the contention level low: when we hold the update lock that we know other parts may be relying on (peers, track-sc etc), we decrease the remaining visit counters 4 times as fast to further reduce the contention. At this point no more warnings are seen during intense synchronization (2x64 cores, 1.5M req/s with a track-sc each, 5M entries in use).	2025-09-03 15:51:13 +02:00
Willy Tarreau	696793205b	MINOR: stick-tables: limit the number of visited nodes during expiration As reported by Felipe in GH issue #3084, on large systems it's not sufficient to leave the expiration process after a certain number of expired entries, because if they accumulate too fast, it's possible to still spend some time visiting many (e.g. those still in use), which takes time. Thus here we're taking a stricter approach consisting in counting the number of visited entries, which allows to leave early if we can't do the expected work in a reasonable amount of time. In order to avoid always stopping on first shards and never visiting last ones, we're always starting from a random shard number and looping from that one. This way even if we always leave early, all shards will be handled equally. This should be backported to 3.2.	2025-09-03 15:51:13 +02:00
Willy Tarreau	2421c3769a	BUG/MEDIUM: peers: don't fail twice to grab the update lock When the expire task is running fast (i.e. running almost alone), it's super hard to grab the update lock and peers can easily trigger the watchdog because the time it takes to grab this lock is multiplied by the number of updates to perform. This is easier to trigger at the end of an injection session where the expire task is omni-present. Let's just record that we failed once and don't fail a second time in the loop. This should be backported to 3.2, but probably not further given that this area changed significantly in 3.2.	2025-09-03 15:51:13 +02:00
Willy Tarreau	324f0a60ab	BUG/MINOR: stick-tables: never leave used entries without expiration When trying to kill/expire entries, if a ref-counted entry is found, let's requeue it with its expiration timer instead of leaving it out, because other ref-counters (e.g. peers) will not purge it otherwise, leaving it orphan. This one seems trickier to trigger, though it seems to happen sometimes when peers are late and a long resync is active and competing with intense calls to process_table_expire() (i.e. when no other acitvity is there). This must be backported to 3.2. It's likely that older versions are affected as well, but possibly differently since the expiration mechanism changed between 3.1 and 3.2, so better not take unneeded risks there.	2025-09-03 15:51:13 +02:00
Willy Tarreau	8da6ed6b6a	BUG/MEDIUM: stick-tables: don't leave the expire loop with elements deleted In 3.2, the table expiration latency was improved by commit 994cc58576 ("MEDIUM: stick-tables: Limit the number of entries we expire"), however it introduced an issue by which it's possible to leave the loop after a certain number of elements were expired, without requeuing the deleted elements. The issue it causes is that other places with a non-null ref_cnt will not necessarily delete it themselves, resulting in orphan elements in the table. These ones will then pollute it and force recycling old ones more often which in turn results in an increase of the contention. Let's check for the expiration counter before deleting the element so that it can be found upon next visit. This fix must be backported to 3.2. It is directly related to GH issue #3084. Thanks to Felipe and Ricardo for sharing precious info and testing a candidate fix.	2025-09-03 15:51:13 +02:00
William Lallemand	554a15562f	MEDIUM: cfgparse: warn when using user/group when built statically In issue #3013, an user observed a crash at startup of haproxy when building statically and using the "user" global section. This is a known problem of the glibc and the linker even warn about this: > warning: Using 'getgrnam' in statically linked applications requires at runtime the shared libraries from the glibc version used for linking > warning: Using 'getpwnam' in statically linked applications requires at runtime the shared libraries from the glibc version used for linking Let's emit a warning when using user/group in this case.	2025-09-03 14:45:00 +02:00
Frederic Lecaille	58b153b882	MINOR: quic: Add more information about RX packets This patch is very useful to debug issues at RX packet processing level. Should be easily backported as far as 2.6 (for debug purposes).	2025-09-03 09:41:38 +02:00
Willy Tarreau	4902195313	BUILD: acl: silence a possible null deref warning in parse_acl_expr() The fix in commit 441cd614f9 ("BUG/MINOR: acl: set arg_list->kw to aclkw->kw string literal if aclkw is found") involves an unchecked access to "al" after that one is tested for possibly being NULL. This rightfully upsets Coverity (GH #3095) and might also trigger warnings depending on the compilers. However, no known caller to date passes a NULL arg list here so there's no way to trigger this theoretical bug. This should be backported along with the fix above to avoid emitting warnings, possibly as far as 2.6 since that fix was tagged as such.	2025-09-02 17:41:51 +02:00
Willy Tarreau	c128887b8e	BUG/MINOR: haproxy: be sure not to quit too early on soft stop The fix in 4a9e3e102e ("BUG/MINOR: haproxy: only tid 0 must not sleep if got signal") had the nasty side effect of breaking the graceful reload operations: threads whose id is non-zero could quit too early and not process incoming traffic, which is visible with broken connections during reloads. They just need to ignore the the stopping condition until the signal queue is empty. In any case, it's the thread in charge of the signal queue which will notify them once it receives the signal. It was verified that connections are no longer broken with this fix, and that the issue that required it (#2537, looping threads on reload) does not re-appear with the reproducer, while it still did without the fix above. Since the fix above was backported to every stable version, this one will also have to.	2025-09-02 11:33:14 +02:00
Amaury Denoyelle	36d28bfca3	MEDIUM: quic: strengthen BUG_ON() for unpad Initial packet on client To avoid anti-amplification limit, it is required that Initial packet are padded to be at least 1.200 bytes long. On server side, this only applies to ack-eliciting packets. However, for client side, this is mandatory for every packets. This patch adjusts qc_txb_store() BUG_ON statement used to catch too small Initial packets. On QUIC client side, ack-eliciting flag is now ignored, thus every packets are checked. This is labelled as MEDIUM as this BUG_ON() is known to be easily triggered, as QUIC datagrams encoding function are complex. However, it's important that a QUIC endpoint respects it, else the peer will drop the invalid packet and could immediately close the connection.	2025-09-02 10:41:49 +02:00
Amaury Denoyelle	209a54d539	BUG/MINOR: quic: pad Initial pkt with CONNECTION_CLOSE on client Currently, when connection is closing, only CONNECTION_CLOSE frame is emitted via qc_prep_pkts()/qc_do_build_pkt(). Also, only the first registered encryption level is considered while the others are dismissed. This results in a single packet datagram. This can cause issues for QUIC client support, as padding is required for every Initial packet, contrary to server side where only ack-eliciting packets are eligible. Thus a client must add padding to a CONNECTION_CLOSE frame on Initial level. This patch adjusts qc_prep_pkts() to ensure such packet will be correctly padded on client side. It sets <final_packet> variable which instructs that if padding is necessary it must be apply immediately on the current encryption level instead of the last one. It could appear as unnecessary to pad a CONNECTION_CLOSE packet, as the peer will enter in draining state when processing it. However, RFC mandates that a client Initial packet too small must be dropped by the server, so there is a risk that the CONNECTION_CLOSE is simply discarded prior to its processing if stored in a too small datagram. No need to backport as this is a QUIC backend issue only.	2025-09-02 10:34:12 +02:00
Amaury Denoyelle	e9b78e3fb1	BUG/MINOR: quic: fix padding issue on INITIAL retransmit On loss detection timer expiration, qc_dgrams_retransmit() is used to reemit lost packets. Different code paths are present depending on the active encryption level. If Initial level is still initialized, retransmit is performed both for Initial and Handshake spaces, by first retrieving the list of lost frames for each of them. Prior to this patch, Handshake level was always registered for emission after Initial, even if it dit not have any frame to reemit. In this case, most of the time it would result in a datagram containing Initial reemitted frames packet coalesced with a Handshake packet consisting only of a PADDING frame. This is because padding is only added for the last registered QEL. For QUIC backend support, this may cause issues. This is because contrary to QUIC server side, Initial and Handshake levels keys are not derived simultaneously for a QUIC client. Thus, if the latter keys are unavailable, Handshake packet cannot be encoded in sending, leaving a single Initial packet. However, this is now too late to add PADDING. Thus the resulting datagram is invalid : this triggers the BUG_ON() assert failure located on qc_txb_store(). This patch fixes this by amending qc_dgrams_retransmit(). Now, Handshake level is only registered for emission if there is frame to retransmit, which implies that Handshake keys are already available. Thus, PADDING will now either be added at Initial or Handshake level as expected. Note that this issue should not be present on QUIC frontend, due to Initial and Handshake keys derivation almost simultaneously. However, this should still be backported up to 3.0.	2025-09-02 10:31:32 +02:00
Amaury Denoyelle	34d5bfd23c	BUG/MINOR: quic: fix room check if padding requested qc_prep_pkts() activates padding when building an Initial packet. This ensures that resulting datagram will always be at least 1.200 bytes, which is mandatory to prevent deadlock over anti-amplication. Prior to padding activation, a check is performed to ensure that output buffer is big enough for a padded datagram. However, this did not take into account previously built packets which would be coalesced in the same datagram. Thus this patch fixes this comparison check. In theory, prior to this patch, in some cases Initial packets could not be built despite a datagram of the proper size. Currently, this probably never happens as Initial packet is always the first encoded in a datagram, thus there is no coalesced packet prior to it. However, there is no hard requirement on this, so it's better to reflect this in the code. This should be backported up to 2.6.	2025-09-02 10:29:11 +02:00
Frederic Lecaille	fba80c7fe8	BUG/MINOR: quic: ignore AGAIN ncbuf err when parsing CRYPTO frames This fix follows this previous one: BUG/MINOR: quic: reorder fragmented RX CRYPTO frames by their offsets which is not sufficient when a client fragments and mixes its CRYPTO frames AND leaveswith holes by packets. ngtcp2 (and perhaps chrome) splits theire CRYPTO frames but without hole by packet. In such a case, the CRYPTO parsing leads to QUIC_RX_RET_FRM_AGAIN errors which cannot be fixed when the peer resends its packets. Indeed, even if the peer resends its frames in a different order, this does not help because since the previous commit, the CRYPTO frames are ordered on haproxy side. This issue was detected thanks to the interopt tests with quic-go as client. This client fragments its CRYPTO frames, mixes them, and generate holes, and most of the times with the retry test. To fix this, when a QUIC_RX_RET_FRM_AGAIN error is encountered, the CRYPTO frames parsing is not stop. This leaves chances to the next CRYPTO frames to be parsed. Must be backported as far as 2.6 as the commit mentioned above.	2025-09-02 08:13:58 +02:00
Alexander Stephan	26776c7b8f	BUG/MINOR: tools: Add OOM check for malloc() in indent_msg() This patch adds a missing out-of-memory (OOM) check after the call to `malloc()` in `indent_msg()`. If memory allocation fails, the function returns NULL to prevent undefined behavior. Co-authored-by: Christian Norbert Menges <christian.norbert.menges@sap.com>	2025-09-02 07:29:54 +02:00
Alexander Stephan	aa20905ac9	BUG/MINOR: compression: Add OOM check for calloc() in parse_compression_options() This patch adds a missing out-of-memory (OOM) check after the call to `calloc()` in `parse_compression_options()`. If memory allocation fails, an error message is set, the function returns -1, and parsing is aborted to ensure safe handling of low-memory conditions. Co-authored-by: Christian Norbert Menges <christian.norbert.menges@sap.com>	2025-09-02 07:29:54 +02:00
Alexander Stephan	73f9a75894	BUG/MINOR: cfgparse: Add OOM check for calloc() in cfg_parse_listen() This commit adds a missing out-of-memory (OOM) check after the call to `calloc()` in `cfg_parse_listen()`. If memory allocation fails, an alert is logged, error codes are set, and parsing is aborted to prevent undefined behavior. Co-authored-by: Christian Norbert Menges <christian.norbert.menges@sap.com>	2025-09-02 07:29:54 +02:00
Alexander Stephan	c3e69cf065	BUG/MINOR: acl: Add OOM check for calloc() in smp_fetch_acl_parse() This patch adds a missing out-of-memory (OOM) check after the call to `calloc()` in `smp_fetch_acl_parse()`. If memory allocation fails, an error message is set and the function returns 0, improving robustness in low-memory situations. Co-authored-by: Christian Norbert Menges <christian.norbert.menges@sap.com>	2025-09-02 07:29:54 +02:00
Alexander Stephan	22ac1f5ee9	BUG/MINOR: log: Add OOM checks for calloc() and malloc() in logformat parser and dup_logger() This patch adds missing out-of-memory (OOM) checks after calls to `calloc()` and `malloc()` in the logformat parser and the `dup_logger()` function. If memory allocation fails, an error is reported or NULL is returned, preventing undefined behavior in low-memory conditions. Co-authored-by: Christian Norbert Menges <christian.norbert.menges@sap.com>	2025-09-02 07:29:54 +02:00
Christopher Faulet	8c555a4a4e	BUG/MINOR: acl: Properly detect overwritten matching method A bug was introduced by the commit 6ea50ba46 ("MINOR: acl; Warn when matching method based on a suffix is overwritten"). The test on the match function, when defined was not correct. It is now fixed. No backport needed, except if the commit above is backported.	2025-09-01 21:36:25 +02:00
Christopher Faulet	f8b7299ee7	BUG/MINOR: server: Duplicate healthcheck's sni inherited from default server It is not really an issue, but the "check-sni" value inerited from a default server is not duplicated while the paramter value is duplicated during the parsing. So here there is a small leak if several "check-sni" parameters are used on the same server line. The previous value is never released. But to fix this issue, the value inherited from the default server must also be duplicated. At the end it is safer this way and consistant with the parsing of the "sni" parameter. It is harmless so there is no reason to backport this patch.	2025-09-01 15:45:05 +02:00
Christopher Faulet	f7a04b428a	BUG/MEDIUM: server: Duplicate healthcheck's alpn inherited from default server When "check-alpn" parameter is inherited from the default server, the value is not duplicated, the pointer of the default server is used. However, when this parameter is overridden, the old value is released. So the "check-alpn" value of the default server is released. So it is possible to have a UAF if if another server inherit from the same the default server. To fix the issue, the "check-alpn" parameter must be handled the same way the "alpn" is. The default value is duplicated. So it could be safely released if it is forced on the server line. This patch should fix the issue #3096. It must be backported to all stable versions.	2025-09-01 15:45:05 +02:00
Christopher Faulet	6ea50ba462	MINOR: acl; Warn when matching method based on a suffix is overwritten From time to time, issues are reported about string matching based on suffix (for instance path_beg). Each time, it appears these ACLs are used in conjunction with a converter or followed by an explicit matching method (-m). Unfortunatly, it is not an issue but an expected behavior, while it is not obvious. matching suffixes can be consider as aliases on the corresponding '-m' matching method. Thus "path_beg" is equivalent to "path -m beg". When a converter is used the original matching (string) is used and the suffix is lost. When followed by an explicit matching method, it overwrites the matching method based on the suffix. It is expected but confusing. Thus now a warning is emitted because it is a configuration issue for sure. Following sample fetch functions are concerned: * base * path * req.cook * req.hdr * res.hdr * url * urlp The configuration manual was modified to make it less ambiguous.	2025-09-01 15:45:05 +02:00
Christopher Faulet	c51ddd5c38	MINOR: acl: Only allow one '-m' matching method Several '-m' explicit matching method was allowed, but only the last one was really used. There is no reason to specify several matching method and it is most probably an error or a lack of understanding of how matchings are performed. So now, an error is triggered during the configuration parsing to avoid any bad usage.	2025-09-01 15:45:05 +02:00
Amaury Denoyelle	1868ca9a95	MINOR: conn/muxes/ssl: add ASSUME_NONNULL() prior to _srv_add_idle When manipulating idle backend connections for input/output processing, special care is taken to ensure the connection cannot be accessed by another thread, for example via a takeover. When processing is over, connection is reinserted in its original list. A connection can either be attached to a session (private ones) or a server idle tree. In the latter case, <srv> is guaranteed to be non null prior to _srv_add_idle() thanks to CO_FL_LIST_MASK comparison with conn flags. This patch adds an ASSUME_NONNULL() to better reflect this. This should fix coverity reports from github issue #3095.	2025-09-01 15:35:22 +02:00
Amaury Denoyelle	dcf2261612	BUG/MAJOR: mux-quic: fix crash on reload during emission MUX QUIC restricts buffer allocation per connection based on the underlying congestion window. If a QCS instance cannot allocate a new buffer, it is put in a buf_wait list. Typically, this will cause stream upper layer to subscribe for sending. A BUG_ON() was present on snd_buf and nego_ff callback prologue to ensure that these functions were not called if QCS is already in buf_wait list. The objective was to guarantee that there is no wake up on a stream if it cannot allocate a buffer. However, this BUG_ON() is not correct, as it can be fired legitimely. Indeed, stream layer can retry emission even if no wake up occured. This case can happen on reload. Thus, BUG_ON() will cause an unexpected crash. Fix this by removing these BUG_ON(). Instead, snd_buf/nego_ff callbacks ensure that QCS is not subscribed in buf_wait list. If this is the case, a nul value will be returned, which is sufficient for the stream layer to pause emission and subscribe if necessary. Occurences for this crash have been reported on the mailing list. It is also the subject of github issue #3080, which should be fixed with this patch. This must be backported up to 3.0.	2025-09-01 15:35:22 +02:00
Frederic Lecaille	800ba73a9c	BUG/MEDIUM: quic: CRYPTO frame freeing without eb_delete() Since this commit: BUG/MINOR: quic: reorder fragmented RX CRYPTO frames by their offsets when they are parsed, the CRYPTO frames are ordered by their offsets into an ebtree. Then their data are provided to the ncbufs. But in case of error, when qc_handle_crypto_frm() returns QUIC_RX_RET_FRM_FATAL or QUIC_RX_RET_FRM_AGAIN), they remain attached to their tree. Then from <err> label, they are deteleted and deleted (with a while(node) { eb_delete(); qc_frm_free();} loop). But before this loop, these statements directly free the frame without deleting it from its tree, if this is a CRYPTO frame, leading to a use after free when running the loop: if (frm) qc_frm_free(qc, &frm); This issue was detected by the interop tests, with quic-go as client. Weirdly, this client sends CRYPTO frames by packet with holes. Must be backported as far as 2.6 as the commit mentioned above.	2025-09-01 10:39:00 +02:00
Frederic Lecaille	90126ec9b7	CLEANUP: quic: remove a useless CRYPTO frame variable assignment This modification should have arrived with this commit: MINOR: quic: remove ->offset qf_crypto struct field Since this commit, the CRYPTO offset node key assignment is done at parsing time when calling qc_parse_frm() from qc_parse_pkt_frms(). This useless assigment has been reported in GH #3095 by coverity. This patch should be easily backported as far as 2.6 as the one mentioned above to ease any further backport to come.	2025-09-01 09:31:04 +02:00
Christopher Faulet	8f3b537547	MEDIUM: proxy: Reject some header names for 'http-send-name-header' directive From time to time, we saw the 'http-send-name-header' directive used to overwrite the Host header to workaround limitations of a buggy application. Most of time, this led to troubles. This was never officially supported and each time we strongly discouraged anyone to do so. We already thought to deprecate this directive, but it seems to be still used by few people. So for now, we decided to strengthen the tests performed on it. The header name is now checked during the configuration parsing to forbid some risky names. 'Host', 'Content-Length', 'Transfer-Encoding' and 'Connection' header names are now rejected. But more headers could be added in future.	2025-08-29 09:27:01 +02:00
Amaury Denoyelle	2afcba1eb7	MINOR: proxy: extend "show servers conn" output CLI command "show servers conn" is used as a debugging tool to monitor the number of connections per server. This patch extends its output by adding the content of two server counters. <served> is the first added column. It represents the number of active streams on a server. <curr_sess_idle_conns> is the second added column. This is a recently added value which account private idle connections referencing a server.	2025-08-28 18:58:11 +02:00
Amaury Denoyelle	fac1de935a	MINOR: stats: display new curr_sess_idle_conns server counter Add a new stats column in proxy stats to display server counter for private idle connections. This counter has been introduced recently. The value is displayed on CSV output on the last column before modules. It is also displayed on HTLM page alongside other idle server counters.	2025-08-28 18:58:11 +02:00
Amaury Denoyelle	21f7974e05	OPTIM: backend: set release on takeover for strict maxconn When strict maxconn is enforced on a server, it may be necessary to kill an idle connection to never exceed the limit. To be able to delete a connection from any thread, takeover is first used to migrate it on the current thread prior to its deletion. As takeover is performed to delete a connection instead of reusing it, <release> argument can be set to true. This removes unnecessary allocations of resources prior to connection deletion. As such, this patch is a small optimization for strict maxconn implementation. Note that this patch depends on the previous one which removes any assumption in takeover implementation that thread isolation is active if <release> is true.	2025-08-28 16:11:32 +02:00
Amaury Denoyelle	d971d3fed8	MINOR: muxes: adjust takeover with buf_wait interaction Takeover operation defines an argument <release>. It's a boolean which if set indicate that freed connection resources during the takeover does not have to be reallocated on the new thread. Typically, it is set to false when takever is performed to reuse a connection. However, when used to be able to delete a connection from a different thread, <release> should be set to true. Previously, <release> was only set in conjunction with "del server" handler. This operation was performed under thread isolation, which guarantee that not thread-safe operation such as removal from buf_wait list could be performed on takeover if <release> was true. In the contrary case, takeover operation would fail. Recently, "del server" handler has been adjusted to remove idle connection cleanup with takeover. As such, <release> is never set to true in remaining takeover usage. However, takeover is also used to enforce strict-maxconn on a server. This is performed to delete a connection from any thread, which is the primary reason of <release> to true. But for the moment as takeover implementers considers that thread isolation is active if <release> is set, this is not yet applicable for strict-maxconn usage. Thus, the purpose of this patch is to adjust takeover implementation. Remove assumption between <release> and thread-isolation mode. It's not possible to remove a connection from a buf_wait list, an error will be return in any case.	2025-08-28 16:09:48 +02:00
William Lallemand	ffdccb6e04	BUILD: mworker: fix ignoring return value of ‘read’ Fix read return value unused result. src/haproxy.c: In function ‘main’: src/haproxy.c:3630:17: error: ignoring return value of ‘read’ declared with attribute ‘warn_unused_result’ [-Werror=unused-result] 3630 \| read(sock_pair[1], &c, 1); \| ^~~~~~~~~~~~~~~~~~~~~~~~~ Must be backported where d7f6819 is backported.	2025-08-28 15:13:01 +02:00
Amaury Denoyelle	7232677385	MAJOR: server: do not remove idle conns in del server Do not remove anymore idle and purgeable connections directly under the "del server" handler. The main objective of this patch is to reduce the amount of work performed under thread isolation. This should improve "del server" scheduling with other haproxy tasks. Another objective is to be able to properly support dynamic servers with QUIC. Indeed, takeover is not yet implemented for this protocol, hence it is not possible to rely on cleanup of idle connections performed by a single thread under "del server" handler. With this change it is not possible anymore to remove a server if there is still idle connections referencing it. To ensure this cannot be performed, srv_check_for_deletion() has been extended to check server counters for idle and idle private connections. Server deletion should still remain a viable procedure, as first it is mandatory to put the targetted server into maintenance. This step forces the cleanup of its existing idle connections. Thanks to a recent change, all finishing connections are also removed immediately instead of becoming idle. In short, this patch transforms idle connections removal from a synchronous to an asynchronous procedure. However, this should remain a steadfast and quick method achievable in less than a second. This patch is considered major as some users may notice this change when removing a server. In particular with the following CLI commands pipeline: "disable server <X>; shutdown sessions server <X>; del server <X>" Server deletion will now probably fail, as idle connections purge cannot be completed immediately. Thus, it is now highly advise to always use a small delay "wait srv-removable" before "del server" to ensure that idle connections purge is executed prior. Along with this change, documentation for "del server" and related "shutdown sessions server" has been refined, in particular to better highlight under what conditions a server can be removed.	2025-08-28 15:08:35 +02:00
Amaury Denoyelle	dbe31e3f65	MEDIUM: session: account on server idle conns attached to session This patch adds a new member <curr_sess_idle_conns> on the server. It serves as a counter of idle connections attached on a session instead of regular idle/safe trees. This is used only for private connections. The objective is to provide a method to detect if there is idle connections still referencing a server. This will be particularly useful to ensure that a server is removable. Currently, this is not yet necessary as idle connections are directly freed via "del server" handler under thread isolation. However, this procedure will be replaced by an asynchronous mechanism outside of thread isolation. Careful: connections attached to a session but not idle will not be accounted by this counter. These connections can still be detected via srv_has_streams() so "del server" will be safe. This counter is maintain during the whole lifetime of a private connection. This is mandatory to guarantee "del server" safety and is conform with other idle server counters. What this means it that decrement is performed only when the connection transitions from idle to in use, or just prior to its deletion. For the first case, this is covered by session_get_conn(). The second case is trickier. It cannot be done via session_unown_conn() as a private connection may still live a little longer after its removal from session, most notably when scheduled for idle purging. Thus, conn_free() has been adjusted to handle the final decrement. Now, conn_backend_deinit() is also called for private connections if CO_FL_SESS_IDLE flag is present. This results in a call to srv_release_conn() which is responsible to decrement server idle counters.	2025-08-28 15:08:35 +02:00

... 4 5 6 7 8 ...

20112 Commits