haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-24 12:20:59 +01:00

Author	SHA1	Message	Date
Olivier Houchard	c36aae2af1	MINOR: pollers: Add a fixup_tgid_takeover() method. Add a fixup_tgid_takeover() method to pollers for which it makes sense (epoll, kqueue and evport). That method can be called after a takeover of a fd from a different thread group, to make sure the poller's internal structure reflects the new state.	2025-02-26 13:00:18 +01:00
Olivier Houchard	752c5cba5d	MEDIUM: epoll: Make sure we can add a new event Check that the call to epoll_ctl() succeeds, and if it does not, if we're adding a new event and it fails with EEXIST, then delete and re-add the event. There are a few cases where we may already have events for a fd. If epoll_ctl() fails for any reason, use BUG_ON to make sure we immediately crash, as this should not happen.	2025-02-26 13:00:18 +01:00
Olivier Houchard	c5cc09c00d	MINOR: fd: Add fd_lock_tgid_cur(). Add fd_lock_tgid_cur(), a function that will lock the tgid, without modifying its value.	2025-02-26 13:00:18 +01:00
Olivier Houchard	52b97ff8dd	MEDIUM: fd: Wait if locked in fd_grab_tgid() and fd_take_tgid(). Wait while the tgid is locked in fd_grab_tgid() and fd_take_tgid(). As that lock is barely used, it should have no impact.	2025-02-26 13:00:18 +01:00
Ilia Shipitsin	814b5dfe30	BUILD: add possibility to use different QuicTLS variants Initially QuicTLS started as a patchset on top of OpenSSL; the project has since started its own journey as QuicTLS. Somehow we need both. ML: https://www.mail-archive.com/haproxy@formilux.org/msg45574.html GH: https://github.com/quictls/quictls/issues/244	2025-02-25 10:29:46 +01:00
Willy Tarreau	a826250659	OPTIM: connection: don't try to kill other threads' connection when !shared Users may have good reasons for using "tune.idle-pool.shared off", one of them being the cost of moving cache lines between cores, or the kernel-side locking associated with moving FDs. For this reason, when getting close to the file descriptors limits, we must not try to kill adjacent threads' FDs when the sharing of pools is disabled. This is extremely expensive and kills the performance. We must limit ourselves to our local FDs only. In such cases, it's up to the users to configure a large enough maxconn for their usages. Before this patch, perf top reported 9% CPU usage in connect_server() on the trylock used to kill connections when running at 4800 conns for a global maxconn of 6400 on a 128-thread server. Now it doesn't spend its time there anymore, and performance has increased by 12%. Note, it was verified that disabling the locks in such a case has no effect at all, so better keep them and stay safe.	2025-02-25 09:23:46 +01:00
Willy Tarreau	2e0bac90da	BUG/MEDIUM: stream: don't use localtime in dumps from a signal handler In issue #2861, Jarosław Rzeszótko reported another issue with "show threads", this time in relation with the conversion of a stream's accept date to local time. Indeed, if the libc was interrupted in this same function, it could have been interrupted with a lock held, then it's no longer possible to dump the date, and we face a deadlock. This is easy to reproduce with logging enabled. Let's detect we come from a signal handler and do not try to resolve the time to localtime in this case.	2025-02-24 13:40:42 +01:00
Willy Tarreau	fb7874c286	MINOR: tinfo: split the signal handler report flags into 3 While signals are not recursive, one signal (e.g. wdt) may interrupt another one (e.g. debug). The problem this causes is that when leaving the inner handler, it removes the outer's flag, hence the protection that comes with it. Let's just have 3 distinct flags for regular signals, debug signal and watchdog signal. We add a 4th definition which is an aggregate of the 3 to ease testing.	2025-02-24 13:37:52 +01:00
Willy Tarreau	bbf824933f	BUG/MINOR: h2: always trim leading and trailing LWS in header values Annika Wickert reported some occasional disconnections between haproxy and varnish when communicating over HTTP/2, with varnish complaining about protocol errors while captures looked apparently normal. Nils Goroll managed to reproduce this on varnish by injecting the capture of the outgoing haproxy traffic and noticed that haproxy was forwarding a header value containing a trailing space, which is now explicitly forbidden since RFC9113. It turns out that the only way for such a header to pass through haproxy is to arrive in h2 and not be edited, in which case it will arrive in HTX with its undesired spaces. Since the code dealing with HTX headers always trims spaces around them, these are not observable in dumps, but only when started in debug mode (-d). Conversions to/from h1 also drop the spaces. With this patch we trim LWS both on input and on output. This way we always present clean headers in the whole stack, and even if some are manually crafted by the configuration or Lua, they will be trimmed on the output. This must be backported to all stable versions. Thanks to Annika for the helpful capture and Nils for the help with the analysis on the varnish side!	2025-02-24 09:39:57 +01:00
Vincent Dechenaux	9011b3621b	MINOR: compression: Introduce minimum size This is the introduction of "minsize-req" and "minsize-res". These two options allow you to set the minimum payload size required for compression to be applied. This helps save CPU on both server and client sides when the payload does not need to be compressed.	2025-02-22 11:32:40 +01:00
Willy Tarreau	e7510d6230	CLEANUP: task: move the barrier after clearing th_ctx->current There's a barrier after releasing the current task in the scheduler. However it's improperly placed, it's done after pool_free() while in fact it must be done immediately after resetting the current pointer. Indeed, the purpose is to make sure that nobody sees the task as valid when it's in the process of being released. This is something that could theoretically happen if interrupted by a signal in the inlined code of pool_free() if the compiler decided to postpone the write to ->current. In practice since nothing fancy is done in the inlined part of the function, there's currently no risk of reordering. But it could happen if the underlying __pool_free() were to be inlined for example, and in this case we could possibly observe th_ctx->current pointing to something currently being destroyed. With the barrier between the two, there's no risk anymore.	2025-02-21 18:31:46 +01:00
Willy Tarreau	eb41d768f9	MINOR: tools: use only opportunistic symbols resolution As seen in issue #2861, dladdr_and_size() can be quite expensive and will often hold a mutex in the underlying library. It becomes a real problem when issuing lots of "show threads" or wdt warnings in parallel because threads will queue up waiting for each other to finish, adding to their existing latency that possibly caused the warning in the first place. Here we're taking a different approach. If the thread is not isolated and not panicking, it's doing unimportant stuff like showing threads or warnings. In this case we try to grab a lock, and if we fail because another thread is already there, we just pretend we cannot resolve the symbol. This is not critical because then we fall back to the already used case which consists in writing "main+<offset>". In practice this will almost never happen except in bad situations which could have otherwise degenerated.	2025-02-21 18:26:29 +01:00
Willy Tarreau	3c22fa315b	BUG/MEDIUM: stream: use non-blocking freq_ctr calls from the stream dumper The stream dump function is called from signal handlers (warning, show threads, panic). It makes use of read_freq_ctr() which might possibly block if it tries to access a locked freq_ctr in the process of being updated, e.g. by the current thread. Here we're relying on the non-blocking API instead. It may return incorrect values (typically smaller ones after resetting the curr counter) but at least it will not block. This needs to be backported to stable versions along with the previous commit below: MINOR: freq_ctr: provide non-blocking read functions At least 3.1 is concerned as the warnings tend to increase the risk of this situation appearing.	2025-02-21 18:26:29 +01:00
Willy Tarreau	29e246a84c	MINOR: freq_ctr: provide non-blocking read functions Some code called by the debug handlers in the context of a signal handler accesses some freq_ctr and occasionally ends up on a locked one from the same thread that is dumping it. Let's introduce a non-blocking version that at least returns even if the value is in the process of being updated, which is less problematic than hanging.	2025-02-21 18:26:29 +01:00
Willy Tarreau	84d4c948fc	BUG/MEDIUM: stream: never allocate connection addresses from signal handler In __strm_dump_to_buffer(), we call conn_get_src()/conn_get_dst() to try to retrieve the connection's IP addresses. But this function may be called from a signal handler to dump a currently running stream, and if the addresses were not allocated yet, a pool_alloc() will be performed while we might possibly already be running pools code, resulting in pool list corruption. Let's just make sure we don't call these sensitive functions there when called from a signal handler. This must be backported at least to 3.1 and ideally all other versions, along with this previous commit: MINOR: tinfo: add a new thread flag to indicate a call from a sig handler	2025-02-21 17:41:38 +01:00
Willy Tarreau	ddd173355c	MINOR: tinfo: add a new thread flag to indicate a call from a sig handler Signal handlers must absolutely not change anything, but some long and complex call chains may look innocuous at first glance, yet result in some subtle write accesses (e.g. pools) that can conflict with a running thread being interrupted. Let's add a new thread flag TH_FL_IN_SIG_HANDLER that is only set when entering a signal handler and cleared when leaving them. Note, we're speaking about real signal handlers (synchronous ones), not deferred ones. This will allow some sensitive call places to act differently when detecting such a condition, and possibly even to place a few new BUG_ON().	2025-02-21 17:41:38 +01:00
Willy Tarreau	a56dfbdcb4	BUG/MINOR: mux-h1: always make sure h1s->sd exists in h1_dump_h1s_info() This function may be called from a signal handler during a warning, a panic or a show thread. We need to be more cautious about what may or may not be dereferenced since an h1s is not necessarily fully initialized. Loops of "show threads" sometimes manage to crash when dereferencing a null h1s->sd, so let's guard it and add a comment reminding about the unusual call place. This can be backported to the relevant versions.	2025-02-21 17:41:38 +01:00
Willy Tarreau	9d5bd47634	BUG/MINOR: stream: do not call co_data() from __strm_dump_to_buffer() co_data() was instrumented to detect cases where c->output > data and emits a warning if that's not correct. The problem is that it happens quite a bit during "show threads" if it interrupts traffic anywhere, and that in some environments building with -DDEBUG_STRICT_ACTION=3, it will kill the process. Let's just open-code the channel functions that make access to co_data(), there are not that many and the operations remain very simple. This can be backported to 3.1. It didn't trigger in earlier versions because they didn't have this CHECK_IF_HOT() test.	2025-02-21 17:18:00 +01:00
Ilia Shipitsin	0bdf414fa5	CI: QUIC Interop: clean old docker images currently temporary docker images are kept forever. let's delete outdated ones	2025-02-21 11:34:43 +01:00
Aurelien DARRAGON	97a19517ff	MINOR: clock: always use atomic ops for global_now_ms global_now_ms is shared between threads so we must give a hint to the compiler that read/write operations should be performed atomically. Everywhere global_now_ms was used, atomic ops were used, except in clock_update_global_date() where a read was performed without using an atomic op. In practice it is not an issue because on most systems such reads should be atomic already, but to prevent any confusion or potential bug on exotic systems, let's use an explicit _HA_ATOMIC_LOAD there. This may be backported up to 2.8	2025-02-21 11:22:35 +01:00
Aurelien DARRAGON	9561b9fb69	BUG/MINOR: sink: add tempo between 2 connection attempts for sft servers When the connection for sink_forward_{oc}_applet fails or a previous one is destroyed, the sft->appctx is instantly released. However process_sink_forward_task(), which may run at any time, iterates over all known sfts and tries to create sessions for orphan ones. It means that instantly after sft->appctx is destroyed, a new one will be created, thus a new connection attempt will be made. It can be an issue with tcp log-servers or sink servers, because if the server is unavailable, process_sink_forward() will keep looping without any delay until the applet survives (i.e. the connection succeeds), which results in unexpected CPU usage on the threads responsible for that task. Instead, we add a tempo logic so that a delay of 1 second is applied between two retries. Of course the initial attempt is not delayed. This could be backported to all stable versions.	2025-02-21 11:22:35 +01:00
Aurelien DARRAGON	c9d4192726	BUG/MINOR: log: fix outgoing abns address family While reviewing the code in an attempt to fix GH #2875, I stumbled on another case similar to aac570c ("BUG/MEDIUM: uxst: fix outgoing abns address family in connect()") that caused abns(z) addresses to fail when used as log targets. The underlying cause is the same as aac570c, which is the rework of the unix socket families in order to support custom addresses for different addressing schemes, where a real_family() was overlooked before passing a haproxy-internal address struct to a socket-oriented syscall. To fix the issue, we first copy the target's addr, and then leverage real_family() to set the proper low-level address family that is passed to the sendmsg() syscall. It should be backported in 3.1	2025-02-21 11:22:28 +01:00
Aurelien DARRAGON	26d97ec148	REGTESTS: fix reg-tests/server/abnsz.vtc It was proved in GH #2875 that the regtest was broken, at least for the server-side abnsz, as the connect() was not performed using the proper family, which results in kernel refusing to perform the call, while the reg-test actually succeeds. Indeed, in the test we used vtest client to connect to haproxy, which then routed the request to another haproxy instance listening on an abnsz socket, and this last haproxy was the one to answer the http request. As we only used "rxresp" in vtest client, the test succeeded with empty responses, which was the case due to the server connection failing on the first haproxy process.	2025-02-21 08:22:25 +01:00
Willy Tarreau	aac570cd03	BUG/MEDIUM: uxst: fix outgoing abns address family in connect() Since we reworked the unix socket families in order to support custom addresses for different addressing schemes, we've been using extra values for the ss_family field in sockaddr_storage. These ones have to be adjusted before calling bind() or connect(). It turns out that after the abns/abnsz updates in 3.1, the connect() code was not adjusted to take care of the change, resulting in AF_CUST_ABNS or AF_CUST_ABNSZ to be placed in the address that was passed to connect(). The right approach is to locally copy the address, get its length, fixup the family and use the fixed value and length for connect(). This must be backported to 3.1. Many thanks for @Mewp for reporting this issue in github issue #2875.	2025-02-21 07:59:08 +01:00
Valentine Krasnobaeva	390df282c1	BUG/MINOR: cfgparse: fix NULL ptr dereference in cfg_parse_peers When the "peers" keyword is followed by more than one argument and it's the first "peers" section in the config, cfg_parse_peers() detects it and exits with the "ERR_ALERT|ERR_FATAL" err_code. So, the upper layer parser, parse_cfg(), continues and parses the next keyword "peer" and then it tries to check the global cfg_peers, which should contain "my_cluster". The global cfg_peers is still NULL, because after alerting the user in alertif_too_many_args, cfg_parse_peers() exited. peers my_cluster __some_wrong_data__ peer haproxy1 1.1.1.1 1000 In order to fix this, let's add ERR_ABORT if the "peers" keyword is followed by more than one argument. This way parse_cfg() stops immediately and terminates haproxy with the "too many args for peers my_cluster..." alert message. It's more reliable than adding "if (cfg_peers != NULL)" checks in the "peer" subparser, as we may have many "peers" sections. peers my_another_cluster peer haproxy1 1.1.1.2 1000 peers my_cluster __some_wrong_data__ peer haproxy1 1.1.1.1 1000 In addition, for the example above, parse_cfg() will parse the whole configuration until the end and only then terminate haproxy with the alert "too many args...". Peer haproxy1 will be wrongly associated with my_another_cluster. This fixes the issue #2872. This should be backported in all stable versions.	2025-02-20 17:10:26 +01:00
Christopher Faulet	851e52b551	BUG/MEDIUM: spoe/mux-spop: Introduce an NOOP action to deal with empty ACK In the SPOP protocol, ACK frames with an empty payload are allowed. However, in that case, because only the payload is transferred, there is no data to return to the SPOE applet. Only the end of input is reported. Thus the applet is never woken up. It means that the SPOE filter will be blocked during the processing timeout and will finally return an error. To work around this issue, a NOOP action is introduced with the value 0. It is only an internal action for now. It does not exist in the SPOP protocol. When an ACK frame with an empty payload is received, this noop action is transferred to the SPOE applet, instead of nothing. Thanks to this trick, the applet is properly notified. This works because unknown actions are ignored by the SPOE filter. This patch must be backported to 3.1.	2025-02-20 11:56:27 +01:00
Christopher Faulet	efc46de294	BUG/MEDIUM: applet: Don't handle EOI/EOS/ERROR if applet is waiting for room The commit 7214dcd52 ("BUG/MEDIUM: applet: Don't pretend to have more data to handle EOI/EOS/ERROR") introduced a regression. Because of this patch, it was possible to handle EOI/EOS/ERROR applet flags too early while the applet was waiting for more room to transfer the last output data. This bug can be encountered with any applet using its own buffers (cache and stats for instance). And depending on the configuration and the timing, the data may be truncated or the stream may be blocked, infinitely or not. Streams blocked infinitely were observed with the cache applet and the HTTP compression enabled. For the record, it is important to detect EOI/EOS/ERROR applet flags to be able to report the corresponding event on the SE and by transitivity on the SC. Most of the time, this happens when some data should be transferred to the stream. The .rcv_buf callback function is called and these flags are properly handled. However, some applets may also report them spontaneously, outside of any data transfer. In that case, the .rcv_buf callback is not called. The purpose of this patch (and the one above) is to be able to detect pending EOI/EOS/ERROR applet flags. However, we must be sure to not handle them too early at this place. When these flags are set, it means no more data will be produced by the applet. So we must only wait to have transferred everything to the stream. And this happens when the applet is no longer waiting for more room. This patch must be backported to 3.1 with the one above.	2025-02-20 10:00:32 +01:00
Willy Tarreau	4ef6be4a1f	[RELEASE] Released version 3.2-dev6 Released version 3.2-dev6 with the following main changes : - BUG/MEDIUM: debug: close a possible race between thread dump and panic() - DEBUG: thread: report the spin lock counters as seek locks - DEBUG: thread: make lock time computation more consistent - DEBUG: thread: report the wait time buckets for lock classes - DEBUG: thread: don't keep the redundant _locked counter - DEBUG: thread: make lock_stat per operation instead of for all operations - DEBUG: thread: reduce the struct lock_stat to store only 30 buckets - MINOR: lbprm: add a new callback ->server_requeue to the lbprm - MEDIUM: server: allocate a tasklet for asyncronous requeuing - MAJOR: leastconn: postpone the server's repositioning under contention - BUG/MINOR: quic: reserve length field for long header encoding - BUG/MINOR: quic: fix CRYPTO payload size calcul for encoding - MINOR: quic: simplify length calculation for STREAM/CRYPTO frames - BUG/MINOR: mworker: section ignored in discovery after a post_section_parser - BUG/MINOR: mworker: post_section_parser for the last section in discovery - CLEANUP: mworker: "program" section does not have a post_section_parser anymore - MEDIUM: initcall: allow to register mutiple post_section_parser per section - CI: cirrus-ci: bump FreeBSD image to 14-2 - DOC: initcall: name correctly REGISTER_CONFIG_POST_SECTION() - REGTESTS: stop using truncated.vtc on freebsd - MINOR: quic: refactor STREAM encoding and splitting - MINOR: quic: refactor CRYPTO encoding and splitting - BUG/MEDIUM: fd: mark FD transferred to another process as FD_CLONED - BUG/MINOR: ssl/cli: "show ssl crt-list" lacks client-sigals - BUG/MINOR: ssl/cli: "show ssl crt-list" lacks sigals - MINOR: ssl/cli: display more filenames in 'show ssl cert' - DOC: watchdog: document the sequence of the watchdog and panic - MINOR: ssl: store the filenames resulting from a lookup in ckch_conf - MINOR: startup: allow hap_register_feature() to enable a feature in the list - MINOR: quic: support frame type as a varint - BUG/MINOR: startup: leave at first post_section_parser which fails - BUG/MINOR: startup: hap_register_feature() fix for partial feature name - BUG/MEDIUM: cli: Be sure to drop all input data in END state - BUG/MINOR: cli: Wait for the last ACK when FDs are xferred from the old worker - BUG/MEDIUM: filters: Handle filters registered on data with no payload callback - BUG/MINOR: fcgi: Don't set the status to 302 if it is already set - MINOR: ssl/crtlist: split the ckch_conf loading from the crtlist line parsing - MINOR: ssl/crtlist: handle crt_path == cc->crt in crtlist_load_crt() - MINOR: ssl/ckch: return from ckch_conf_clean() when conf is NULL - MEDIUM: ssl/crtlist: "crt" keyword in frontend - DOC: configuration: document the "crt" frontend keyword - DEV: h2: add a Lua-based HTTP/2 connection tracer - BUG/MINOR: quic: prevent crash on conn access after MUX init failure - BUG/MINOR: mux-quic: prevent crash after MUX init failure - DEV: h2: fix flags for the continuation frame - REGTESTS: Fix truncated.vtc to send 0-CRLF - BUG/MINOR: mux-h2: Properly handle full or truncated HTX messages on shut - Revert "REGTESTS: stop using truncated.vtc on freebsd" - MINOR: mux-quic: define a QCC application state member - MINOR: mux-quic/h3: emit SETTINGS via MUX tasklet handler - MINOR: mux-quic/h3: support temporary blocking on control stream sending v3.2-dev6	2025-02-19 18:39:51 +01:00
Amaury Denoyelle	a7645d7cd5	MINOR: mux-quic/h3: support temporary blocking on control stream sending When the HTTP/3 layer is initialized via QUIC MUX, it first emits a SETTINGS frame on a unidirectional control stream. However, this could be prevented if the client did not provide initial flow control. Previously, QUIC MUX was unable to deal with such a situation. Thus, the connection was closed immediately and no transfer could occur. Improve this by extending the QUIC MUX application layer API: initialization may now return a transient error. This allows MUX to continue to use the connection normally. Initialization will be retried periodically later until it can succeed. This new API makes it possible to deal with the flow control issue described above. Note that this patch is not considered as a bug fix. Indeed, clients are strongly advised to provide enough flow control for a SETTINGS frame exchange.	2025-02-19 11:08:02 +01:00
Amaury Denoyelle	06e7674399	MINOR: mux-quic/h3: emit SETTINGS via MUX tasklet handler Previously, the QUIC MUX application layer was installed and initialized via MUX init. However, the latter stage involves I/O operations, for example when using HTTP/3 with the emission of a SETTINGS frame. Change this to prevent any I/O operations during MUX init. As such, the finalize app_ops callback is now called during the first invocation of qcc_io_send(), in the context of the MUX tasklet. To implement this, a new application state value is added, to detect the transition from NULL to INIT stage.	2025-02-19 11:03:40 +01:00
Amaury Denoyelle	188fc45b95	MINOR: mux-quic: define a QCC application state member Introduce a new QCC field to track the current application layer state. For the moment, only INIT and SHUT state are defined. This allows to replace the older flag QC_CF_APP_SHUT. This commit does not bring major changes. It is only necessary to permit future evolutions on QUIC MUX. The only noticeable change is that QMUX traces can now display this new field.	2025-02-19 10:59:53 +01:00
Christopher Faulet	4a99f15f0c	Revert "REGTESTS: stop using truncated.vtc on freebsd" This reverts commit 0b9a75e8781593c250f6366a64a019018ade688e. Thanks to the previous fixes ("REGTESTS: Fix truncated.vtc to send 0-CRLF" and "BUG/MINOR: mux-h2: Properly handle full or truncated HTX messages on shut"), this script can be reenabled for FreeBSD.	2025-02-18 17:35:00 +01:00
Christopher Faulet	b70921f2c1	BUG/MINOR: mux-h2: Properly handle full or truncated HTX messages on shut On shut, truncated HTX messages were not properly handled by the H2 multiplexer. Depending on how data were emitted, a chunked HTX message without the 0-CRLF could be considered as full and an empty DATA frame with the ES flag set could be emitted instead of a RST_STREAM(CANCEL) frame. In the H2 multiplexer, when a shut is performed, an HTX message is considered as truncated if more HTX data are still expected. It is based on the presence or not of the H2_SF_MORE_HTX_DATA flag on the H2 stream. However, this flag is set or unset depending on the HTX extra field value. This field is used to state how much data must still be transferred, based on the announced data length. For a message with a content-length, this assumption is valid. But for a chunked message, it is not true. Only the length of the current chunk is announced. So we cannot rely on this field in that case to know if a message is full or not. Instead, we must rely on the HTX start-line flags to know if more HTX data are expected or not. If the xfer length is known (the HTX_SL_F_XFER_LEN flag is set on the HTX start-line), it means that more data are always expected, until the end of message is reached (the HTX_FL_EOM flag is set on the HTX message). This is true for bodyless messages because the end of message is reported with the end of headers. This is also true for tunneled messages because the end of message is received before switching the H2 stream in tunnel mode. This patch must be backported as far as 2.8.	2025-02-18 17:34:59 +01:00
Christopher Faulet	b93e419750	REGTESTS: Fix truncated.vtc to send 0-CRLF When a chunked message is sent, the 0-CRLF must be explicitly sent. It has been missing since the beginning. Just add it.	2025-02-18 17:34:59 +01:00
Willy Tarreau	af5c07eee9	DEV: h2: fix flags for the continuation frame It's flag 2 (end of headers) that's defined there, not 3 (padded).	2025-02-18 14:17:17 +01:00
Amaury Denoyelle	2715dbe9d0	BUG/MINOR: mux-quic: prevent crash after MUX init failure qmux_init() may fail for several reasons. In this case, connection resources are freed and a CONNECTION_CLOSE will be emitted via its quic_conn instance. In case of qmux_init() failure, qcc_release() is used to clean up resources, but the QCC <conn> member is first reset to NULL, as the connection release must be delayed. Some cleanup operations are thus skipped, one of them being the reset of the <ctx> connection member to NULL. This may cause a crash as <ctx> is a dangling pointer after QCC release. One possible reproducer is to activate QMUX traces, which will cause a segfault on the qmux_init() error leave trace. To fix this, simply reset <ctx> to NULL manually on qmux_init() failure. This must be backported up to 3.0.	2025-02-18 11:02:46 +01:00
Amaury Denoyelle	2cdc4695cb	BUG/MINOR: quic: prevent crash on conn access after MUX init failure Initially, QUIC-MUX was responsible for resetting the quic_conn <conn> member to NULL when MUX was released. This was performed via qcc_release(). However, qcc_release() is also used on qmux_init() failure. In this case, the connection must be freed via its session, so the QCC <conn> member is reset to NULL prior to qcc_release(), which prevents the quic_conn <conn> member from also being reset. As the connection is freed soon after, quic_conn <conn> is a dangling pointer, which may cause crashes. This bug should be very rare as first it implies that QUIC-MUX initialization has failed (for example due to a memory alloc error). Also, the <conn> member is rarely used by the quic_conn instance. In fact, the only reproducible crash was done with QUIC traces activated, as in this case the connection is accessed via quic_conn under the __trace_enabled() function. To fix this, detach the connection from quic_conn via the XPRT layer instead of the MUX. More precisely, this is performed via quic_close(). This should ensure that it will always be conducted, either on normal connection closure, but also after special conditions such as MUX init failure. This should be backported up to 2.6.	2025-02-18 10:43:56 +01:00
Willy Tarreau	607aa57b2e	DEV: h2: add a Lua-based HTTP/2 connection tracer The following config is sufficient to trace H2 exchanges between a client and a server: global lua-load "dev/h2/h2-tracer.lua" listen h2_sniffer mode tcp bind :8002 filter lua.h2-tracer #hex server s1 127.0.0.1:8003 The commented "hex" argument will also display full frames in hex (not recommended). The connections are prefixed with a 3-hex digit number in order to also support a bit of multiplexing without impacting the reading too much. The screen is split in two, with the request on the left and the response on the right. Here's an example of what it does between an haproxy backend and an haproxy frontend both in H2, when submitted a curl request for /?s=30k handled by httpterm: [001] ### req start [001] [PREFACE len=24] [001] [SETTINGS sid=0 len=24 (bytes=24)] [001] | ### res start [001] | [SETTINGS sid=0 len=18 (bytes=27)] [001] | [SETTINGS ACK sid=0 len=0 (bytes=0)] [001] [SETTINGS ACK sid=0 len=0 (bytes=56)] [001] [HEADERS EH+ES sid=1 len=47 (bytes=47)] [001] | [HEADERS EH sid=1 len=101 (bytes=15351)] [001] | [DATA sid=1 len=15126 (bytes=15241)] [001] | [DATA sid=1 len=1258 (bytes=106)] [001] | ... -106 = 1152 [001] | ... -1152 = 0 [001] [WINDOW_UPDATE sid=1 len=4 (bytes=43)] [001] [WINDOW_UPDATE sid=0 len=4 (bytes=30)] [001] [WINDOW_UPDATE sid=1 len=4 (bytes=17)] [001] [WINDOW_UPDATE sid=0 len=4 (bytes=4)] [001] | [DATA ES sid=1 len=14336 (bytes=14336)] [001] [WINDOW_UPDATE sid=0 len=4 (bytes=4)] [001] ### req end: 31080 bytes total [001] | [GOAWAY sid=0 len=8 (bytes=8)] [001] | ### res end: 31097 bytes total It deserves some improvements. For instance at the moment it does not verify the preface, any 24 bytes will work. It does not perform any protocol validation either. Detecting some issues such as out-of-sequence frames could be helpful. But it already helps as-is.	2025-02-18 09:26:15 +01:00
William Lallemand	764f6910ed	DOC: configuration: document the "crt" frontend keyword Document the "crt" keyword of frontend and listen section.	2025-02-17 18:26:37 +01:00
William Lallemand	cd6a02ace9	MEDIUM: ssl/crtlist: "crt" keyword in frontend This patch implements the "crt" keywords in frontend, declaring an implicit crt-list named after the frontend. The patch is split in two steps: The first step is the crt keyword parser, which parses crt lines and fill a "cfg_crt_node" struct containing a ssl_bind_conf and a ckch_conf which are put in a list to be used later. After parsing the frontend section, as a 2nd step, a post_section_parser is called, it will create a crt-list named after the frontend and will fill it with certificates from the list of cfg_crt_node. Once created this crt-list will be loaded in every "ssl" bind lines that didn't declare any crt or crt-list. Example: listen https bind :443 ssl crt foobar.pem crt test1.net.crt key test1.net.key Implements part of #2854	2025-02-17 18:26:37 +01:00
William Lallemand	82f927817e	MINOR: ssl/ckch: return from ckch_conf_clean() when conf is NULL ckch_conf_clean() mustn't be executed when the argument is NULL; this keeps the API consistent with any free() function.	2025-02-17 18:26:37 +01:00
William Lallemand	0330011acf	MINOR: ssl/crtlist: handle crt_path == cc->crt in crtlist_load_crt() Handle the case where crt_path == cc->crt, so the pointer doesn't get free'd before getting strdup'ed in crtlist_load_crt().	2025-02-17 18:26:37 +01:00
William Lallemand	69163cd63e	MINOR: ssl/crtlist: split the ckch_conf loading from the crtlist line parsing ckch_conf loading is not that simple, as it requires checking: - if the cert already exists in the ckchs_tree - if the ckch_conf is compatible with an existing cert in ckchs_tree - if the cert is a bundle which needs to load multiple ckch_store This logic could be reused elsewhere, so this commit introduces the new crtlist_load_crt() function which does that.	2025-02-17 18:26:37 +01:00
Christopher Faulet	ca79ed5eef	BUG/MINOR: fcgi: Don't set the status to 302 if it is already set When a "Location" header was found in a FCGI response, the status code was forced to 302. But this should only be done if no status code was set first. So now, we take care not to override an already defined status code when the "Location" header is found. This patch should fix the issue #2865. It must be backported to all stable versions.	2025-02-17 16:37:53 +01:00
Christopher Faulet	34542d5ec2	BUG/MEDIUM: filters: Handle filters registered on data with no payload callback An HTTP filter with no http_payload callback function may be registered on data. In that case, this filter is obviously not called when some data are received, but it remains important to update its internal state to keep it synchronized with the stream, especially its offset value. Otherwise, a wrong calculation of the global offset may be performed in flt_http_end(), leading to an integer overflow when data are moved from input to output. This overflow triggers a BUG_ON() in c_adv(). The same is true for TCP filters with no tcp_payload callback function. This patch must be backported to all stable versions.	2025-02-17 16:16:29 +01:00
Christopher Faulet	49b7bcf583	BUG/MINOR: cli: Wait for the last ACK when FDs are xferred from the old worker On reload, the new worker requests bound FDs from the old one. The old worker sends them in messages of at most 252 FDs. Each message is acknowledged by the new worker. All messages sent or received by the old worker are handled manually via sendmsg/recv syscalls. So the old worker must be sure to consume all the ACK replies. However, the last one was never consumed, so it was considered as a command by the CLI applet. This issue was hidden until recently, but it was the root cause of the issue #2862. Note this last ACK is also the first one when there are fewer than 252 FDs to transfer. This patch must be backported to all stable versions.	2025-02-17 15:31:07 +01:00
Christopher Faulet	972ce87676	BUG/MEDIUM: cli: Be sure to drop all input data in END state Commit 7214dcd ("BUG/MEDIUM: applet: Don't pretend to have more data to handle EOI/EOS/ERROR") revealed a bug with the CLI applet. Pending input data when the applet is in CLI_ST_END state were never consumed or dropped, leading to a wakeup loop. The CLI applet implements its own snd_buf callback function. It is important that it consumes all pending input data. Otherwise, the applet is woken up in a loop until it empties the request buffer. Another way to fix the issue would be to report an error. But in that case, it seems reasonable to drop these data. The issue can be observed on reload, in master/worker mode, because of the issue with the last ACK message, which was never consumed by the _getsocks() command. This patch should fix the issue #2862. It must be backported to 3.1 with the commit above.	2025-02-17 15:31:07 +01:00
William Lallemand	ab2fa95bdd	BUG/MINOR: startup: hap_register_feature() fix for partial feature name In patch 2fe4cbd8e ("MINOR: startup: allow hap_register_feature() to enable a feature in the list"), the ability to overwrite a '-' in the feature list was added. However, the code was not tokenizing the string correctly, and a partial feature name match could result in having the same feature name multiple times. This patch rewrites the lookup of the string by tokenizing it correctly.	2025-02-17 14:56:09 +01:00
William Lallemand	7268e9c249	BUG/MINOR: startup: leave at first post_section_parser which fails Since we are now iterating on post_section_parser() for the same keyword, we need to exit at the first ERR_ABORT. The post_section_parser() is called when parsing a new section, but also at the end of the file, for the last section. The changes in 4de86bb ("MEDIUM: initcall: allow to register mutiple post_section_parser per section") should have added tests on the ERR_ABORT value. Also pcs->post_section_parser() must be called instead of cs->post_section_parser() because we could have a NULL ptr. This bug does not affect anything since we don't use REGISTER_CONFIG_POST_SECTION() yet.	2025-02-17 11:21:20 +01:00
Amaury Denoyelle	32691e7c25	MINOR: quic: support frame type as a varint QUIC frame type is encoded as a variable-length integer. Thus, a 64-bit integer should be used for it. Until now, this was not the case, as the type was represented as a 1-byte char inside the quic_frame structure. This does not cause any issue with QUIC from RFC9000, as all frame types fit in this range. Furthermore, a QUIC implementation is required to use the smallest size varint when encoding a frame type. However, the current code is unable to accept QUIC extensions with bigger frame types. This is notably the case for the quic-on-streams draft. Thus, this commit readjusts the quic_frame architecture to be able to support higher frame type values. First, the type field of quic_frame is changed to a 64-bit variable. Both encoding and decoding frame functions use variable-length integer helpers to manipulate the frame type field. Secondly, the quic_frame builders/parsers infrastructure is still preserved. However, it may be impossible to use a new large frame type as an index into the quic_frame_builders / quic_frame_parsers arrays. Thus, wrapper functions are now provided to access the builders and parsers. Both qf_builder() and qf_parser() wrappers can then be extended to return custom builder/parser instances for larger frame types. Finally, unknown frame type detection also uses the new wrapper quic_frame_is_known(). As with builders/parsers, for large frame types, this function must be manually completed to support a new type value.	2025-02-14 09:00:05 +01:00
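
The last entry above relies on QUIC variable-length integers for the frame type. As a rough illustration of that encoding (RFC 9000, section 16), here is a minimal sketch in C. The helper names varint_len(), varint_encode() and varint_decode() are hypothetical and chosen for this example only; they are not HAProxy's actual helpers, and the real code may be organized differently.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helpers for illustration only; not HAProxy's API. */

/* Number of bytes of the shortest encoding of v (1, 2, 4 or 8),
 * or 0 if v does not fit in the 62 bits a QUIC varint can carry. */
static size_t varint_len(uint64_t v)
{
    if (v < (1ULL << 6))
        return 1;
    if (v < (1ULL << 14))
        return 2;
    if (v < (1ULL << 30))
        return 4;
    if (v < (1ULL << 62))
        return 8;
    return 0;
}

/* Encode v into buf using the shortest form.
 * Returns the number of bytes written, or 0 on error. */
static size_t varint_encode(uint8_t *buf, size_t size, uint64_t v)
{
    size_t len = varint_len(v);
    size_t i;
    uint8_t prefix;

    if (!len || len > size)
        return 0;

    /* Value is stored in network byte order (big-endian). */
    for (i = 0; i < len; i++)
        buf[len - 1 - i] = (uint8_t)(v >> (8 * i));

    /* The two most significant bits of the first byte hold log2(len). */
    prefix = (len == 1) ? 0 : (len == 2) ? 1 : (len == 4) ? 2 : 3;
    buf[0] |= (uint8_t)(prefix << 6);
    return len;
}

/* Decode one varint from buf into *out.
 * Returns the number of bytes consumed, or 0 if buf is too short. */
static size_t varint_decode(const uint8_t *buf, size_t size, uint64_t *out)
{
    size_t len, i;
    uint64_t v;

    assert(out != NULL);
    if (!size)
        return 0;

    len = (size_t)1 << (buf[0] >> 6);   /* 1, 2, 4 or 8 bytes */
    if (len > size)
        return 0;

    v = buf[0] & 0x3f;                  /* drop the length prefix */
    for (i = 1; i < len; i++)
        v = (v << 8) | buf[i];

    *out = v;
    return len;
}
```

With these helpers, the RFC 9000 sample value 37 encodes to the single byte 0x25, and 15293 encodes to the two bytes 0x7b 0xbd. A type value of 64 or more no longer fits in one byte, which is why a 1-byte type field cannot represent the larger frame types mentioned in the commit message.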

23987 Commits