haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
William Lallemand	593c895eed	MINOR: ssl: allow to change the client-sigalgs on server lines This patch introduces the "client-sigalgs" keyword for the server line, which allows to configure the list of server signature algorithms negociated during the handshake. Also available as "ssl-default-server-client-sigalgs" in the global section.	2023-06-29 14:11:46 +02:00
William Lallemand	717f0ad995	MINOR: ssl: allow to change the server signature algorithm on server lines This patch introduces the "sigalgs" keyword for the server line, which allows to configure the list of server signature algorithms negociated during the handshake. Also available as "ssl-default-server-sigalgs" in the global section.	2023-06-29 13:40:18 +02:00
Emeric Brun	f473eb7206	BUG/MEDIUM: quic: error checking buffer large enought to receive the retry tag Building a retry message, the offset of the tag was checked instead of the remaining length into the buffer. Must be backported as far as 2.6.	2023-06-27 18:54:10 +02:00
Willy Tarreau	e12e202f6a	BUILD: mux-h1: silence a harmless fallthrough warning This warning happened in 2.9-dev with commit `723c73f8a` ("MEDIUM: mux-h1: Split h1_process_mux() to make code more readable"). It's the usual gcc crap that relies on comments to disable the warning but which drops these comments between the preprocessor and the compiler, so using any split build system (distcc, ccache etc) reintroduces the warning. Use the more reliable and portable __fallthrough instead. No backport needed.	2023-06-27 16:08:13 +02:00
William Lallemand	3388b23465	BUG/MINOR: ssl: SSL_ERROR_ZERO_RETURN returns CO_ER_SSL_EMPTY Return a more acurate error than the previous patch, CO_ER_SSL_EMPTY is the code for "Connection closed during SSL handshake" which is more precise than CO_ER_SSL_ABORT ("Connection error during SSL handshake"). No backport needed.	2023-06-26 19:10:24 +02:00
William Lallemand	e8e5762389	MEDIUM: ssl: handle the SSL_ERROR_ZERO_RETURN during the handshake During a SSL_do_handshake(), SSL_ERROR_ZERO_RETURN can be returned in case the remote peer sent a close_notify alert. Previously this would set the connection error to CO_ER_SSL_HANDSHAKE, this patch sets it to CO_ER_SSL_ABORT to have a more acurate error.	2023-06-26 18:52:53 +02:00
Frédéric Lécaille	1231810963	BUG/MINOR: quic: Prevent deadlock with CID tree lock This bug was introduced by this commit which was not sufficient: BUG/MINOR: quic: Possible endless loop in quic_lstnr_dghdlr() It was revealed by the blackhole interop runner test with neqo as client. qc_conn_release() could be called after having locke the CID tree when two different threads was creating the same connection at the same time. Indeed in this case the last thread which tried to create a new connection for the same an already existing CID could not manage to insert an already inserted CID in the connection CID tree. This was expected. It had to destroy the newly created for nothing connection calling qc_conn_release(). But this function also locks this tree calling free_quic_conn_cids() leading to a deadlock. A solution would have been to delete the new CID created from its tree before calling qc_conn_release(). A better solution is to stop inserting the first CID from qc_new_conn(), and to insert it into the CID tree only if there was not an already created connection. This is whas is implemented by this patch. Must be backported as far as 2.7.	2023-06-26 14:09:58 +02:00
William Lallemand	117b03ff4a	BUG/MINOR: mworker: leak of a socketpair during startup failure Aurelien Darragon found a case of leak when working on ticket #2184. When a reexec_on_failure() happens BEFORE protocol_bind_all(), the worker is not fork and the mworker_proc struct is still there with its 2 socketpairs. The socketpair that is supposed to be in the master is already closed in mworker_cleanup_proc(), the one for the worker was suppposed to be cleaned up in mworker_cleanlisteners(). However, since the fd is not bound during this failure, the fd is never closed. This patch fixes the problem by setting the fd to -1 in the mworker_proc after the fork, so we ensure that this it won't be close if everything was done right, and then we try to close it in mworker_cleanup_proc() when it's not set to -1. This could be triggered with the script in ticket #2184 and a `ulimit -H -n 300`. This will fail before the protocol_bind_all() when trying to increase the nofile setrlimit. In recent version of haproxy, there is a BUG_ON() in fd_insert() that could be triggered by this bug because of the global.maxsock check. Must be backported as far as 2.6. The problem could exist in previous version but the code is different and this won't be triggered easily without other consequences in the master.	2023-06-21 09:44:18 +02:00
Aurelien DARRAGON	d35cee972b	BUG/MINOR: http_ext: fix if-none regression in forwardfor option A regression was introduced in `730b983` ("MINOR: proxy: move 'forwardfor' option to http_ext") Indeed, when the forwardfor if-none option is specified on the frontend but forwardfor is not specified at all on the backend: if-none from the frontend is ignored. But this behavior conflicts with the historical one, if-none should only be ignored if forwardfor is also enabled on the backend and if-none is not set there. It should fix GH #2187. This should be backported in 2.8 with `730b983` ("MINOR: proxy: move 'forwardfor' option to http_ext")	2023-06-20 15:32:56 +02:00
Christopher Faulet	a150cfcfec	CLEANUP: mux-h1: Remove useless __maybe_unused statement h1_append_chunk_size() and h1_prepend_chunk_crlf() functions were marked as possibly unused to be able to add them in standalone commits. Now these functions are used, the __maybe_unused statement can be removed.	2023-06-20 13:59:24 +02:00
Christopher Faulet	c6ca6db034	MEDIIM: mux-h1: Add splicing support for chunked messages When the HTX was introduced, we have lost the support for the kernel splicing for chunked messages. Thanks to this patch set, it is possible again. Of course, we still need to keep the H1 parser synchronized. Thus only the chunk content can be spliced. We still need to read the chunk envelope using a buffer. There is no reason to backport this feature. But, just in case, this patch depends on following patches: * "MEDIUM: filters/htx: Don't rely on HTX extra field if payload is filtered" * "MINOR: mux-h1: Add function to prepend the chunk crlf to the output buffer" * "MINOR: mux-h1: Add function to append the chunk size to the output buffer" * "REORG: mux-h1: Rename functions to emit chunk size/crlf in the output buffer" * "MEDIUM: mux-h1: Split h1_process_mux() to make code more readable"	2023-06-20 13:34:49 +02:00
Christopher Faulet	8bd835b2d2	MEDIUM: filters/htx: Don't rely on HTX extra field if payload is filtered If an HTTP data filter is registered on a channel, we must not rely on the HTX extra field because the payload may be changed and we cannot predict if this value will change or not. It is too errorprone to let filters deal with this reponsibility. So we set it to 0 when payload filtering is performed, but only if the payload length can be determined. It is important because this field may be used when data are forwarded. In fact, it will be used by the H1 multiplexer to be able to splice chunk-encoded payload.	2023-06-20 13:34:46 +02:00
Christopher Faulet	05fe76b540	MINOR: mux-h1: Add function to prepend the chunk crlf to the output buffer h1_prepend_chunk_crlf() function does the opposite of h1_append_chunk_crlf(). It emit the chunk size in front of the output buffer.	2023-06-20 13:33:59 +02:00
Christopher Faulet	a07c85c5df	MINOR: mux-h1: Add function to append the chunk size to the output buffer h1_append_chunk_size() function does the opposite of h1_prepend_chunk_size(). It emit the chunk size at the end of the output buffer.	2023-06-20 13:33:53 +02:00
Christopher Faulet	e081efd448	REORG: mux-h1: Rename functions to emit chunk size/crlf in the output buffer h1_emit_chunk_size() and h1_emit_chunk_crlf() functions were renamed, respectively, h1_prepend_chunk_size() and h1_append_chunk_crlf().	2023-06-20 13:33:23 +02:00
Christopher Faulet	723c73f8a7	MEDIUM: mux-h1: Split h1_process_mux() to make code more readable h1_process_mux() function was pretty huge a quite hard to debug. So, the funcion is split in sub-functions. Each sub-function is responsible to a part of the message (start-line, headers, payload, trailers...). We are still relying on a HTTP parser to format the message to be sure to detect errors. Functionnaly speaking, there is no change. But the code is now more readable.	2023-06-20 13:33:01 +02:00
Frédéric Lécaille	a55acf993a	BUG/MINOR: quic: ticks comparison without ticks API use Replace a "less than" comparison between two tick variable by a call to tick_is_lt() in quic_loss_pktns(). This bug could lead to a wrong packet loss detection when the loss time computed values could wrap. This is the case 20 seconds after haproxy has started. Must be backported as far as 2.6.	2023-06-19 19:05:45 +02:00
William Lallemand	e6051a04ef	BUG/MEDIUM: mworker: increase maxsock with each new worker In ticket #2184, HAProxy is crashing in a BUG_ON() after a lot of reload when the previous processes did not exit. Each worker has a socketpair which is a FD in the master, when reloading this FD still exists until the process leaves. But the global.maxconn value is not incremented for each of these FD. So when there is too much workers and the number of FD reaches maxsock, the next FD inserted in the poller will crash the process. This patch fixes the issue by increasing the maxsock for each remaining worker. Must be backported in every maintained version.	2023-06-19 17:32:32 +02:00
Frédéric Lécaille	98b55d1260	BUG/MINOR: quic: Missing transport parameters initializations This bug was introduced by this commit: MINOR: quic: Remove pool_zalloc() from qc_new_conn() The transport parameters was not initialized. This leaded to a crash when dumping the received ones from TRACE()s. Also reset the lengths of the CIDs attached to a quic_conn struct to 0 value to prevent them from being dumped from traces when not already initialized. No backport needed.	2023-06-19 08:49:04 +02:00
Frédéric Lécaille	30254d5e75	MINOR: quic: Remove pool_zalloc() from quic_dgram_parse() Replace a call to pool_zalloc() by a call to pool_malloc() into quic_dgram_parse to allocate quic_rx_packet struct objects. Initialize almost all the members of quic_rx_packet struct. ->saddr is initialized by quic_rx_pkt_retrieve_conn(). ->pnl and ->pn are initialized by qc_do_rm_hp(). ->dcid and ->scid are initialized by quic_rx_pkt_parse() which calls quic_packet_read_long_header() for a long packet. For a short packet, only ->dcid will be initialized.	2023-06-16 16:56:08 +02:00
Frédéric Lécaille	9f9bd5f84d	MINOR: quic: Remove pool_zalloc() from qc_conn_alloc_ssl_ctx() pool_zalloc() is replaced by pool_alloc() into qc_conn_alloc_ssl_ctx() to allocate a ssl_sock_ctx struct. ssl_sock_ctx struct member are all initiliazed to null values excepted ->ssl which is initialized by the next statement: a call to qc_ssl_sess_init().	2023-06-16 16:56:08 +02:00
Frédéric Lécaille	ddc616933c	MINOR: quic: Remove pool_zalloc() from qc_new_conn() qc_new_conn() is ued to initialize QUIC connections with quic_conn struct objects. This function calls quic_conn_release() when it fails to initialize a connection. quic_conn_release() is also called to release the memory allocated by a QUIC connection. Replace pool_zalloc() by pool_alloc() in this function and initialize all quic_conn struct members which are referenced by quic_conn_release() to prevent use of non initialized variables in this fonction. The ebtrees, the lists attached to quic_conn struct must be initialized. The tasks must be reset to their NULL default values to be safely destroyed by task_destroy(). This is all the case for all the TLS cipher contexts of the encryption levels (struct quic_enc_level) and those for the keyupdate. The packet number spaces (struct quic_pktns) must also be initialized. ->prx_counters pointer must be initialized to prevent quic_conn_prx_cntrs_update() from dereferencing this pointer. ->latest_rtt member of quic_loss struct must also be initialized. This is done by quic_loss_init() called by quic_path_init().	2023-06-16 16:55:58 +02:00
Frédéric Lécaille	4ae29be18c	BUG/MINOR: quic: Possible endless loop in quic_lstnr_dghdlr() This may happen when the initilization of a new QUIC conn fails with qc_new_conn() when receiving an Initial paquet. This is done after having allocated a CID with new_quic_cid() called by quic_rx_pkt_retrieve_conn() which stays in the listener connections tree without a QUIC connection attached to. Then when the listener receives another Initial packet for the same CID, quic_rx_pkt_retrieve_conn() returns NULL again (no QUIC connection) but with an thread ID already bound to the connection, leading the datagram to be requeued in the same datagram handler thread queue. And so on. To fix this, the connection is created after having created the connection ID. If this fails, the connection is deallocated. During the race condition, when two different threads handle two datagrams for the same connection, in addition to releasing the newer created connection ID, the newer QUIC connection must also be released. Must be backported as far as 2.7.	2023-06-16 16:10:58 +02:00
Frédéric Lécaille	c02d898cd1	BUG/MINOR: quic: Possible crash in quic_conn_prx_cntrs_update() quic_conn_prx_cntrs_update() may be called from quic_conn_release() with NULL as value for ->prx_counters member. This is the case when qc_new_conn() fails when allocating <buf_area>. In this case quic_conn_prx_cntrs_update() BUG_ON(). Must be backported as far as 2.7.	2023-06-14 18:09:54 +02:00
Aurelien DARRAGON	4e17a3f5b1	BUG/MINOR: namespace: missing free in netns_sig_stop() On soft-stop, netns_sig_stop() function is called to purge the shared namespace tree by iterating over each entries to free them. However, once an entry is cleaned up and removed from the tree, the entry itself isn't freed and this results into a minor leak when soft-stopping because entry was allocated using calloc() in netns_store_insert() when parsing the configuration. This could be backported in every stable versions.	2023-06-14 11:27:29 +02:00
Aurelien DARRAGON	19b5a7c7a5	BUG/MINOR: server: inherit from netns in srv_settings_cpy() When support for 'namespace' keyword was added for the 'default-server' directive in `22f41a2` ("MINOR: server: Make 'default-server' support 'namespace' keyword."), we forgot to copy the attribute from the parent to the newly created server. This resulted in the 'namespace' keyword being parsed without errors when used from a 'default-server' directive, but in practise the option was simply ignored. There's no need to duplicate the netns struct because it is stored in a shared list, so copying the pointer does the job. This patch partially fixes GH #2038 and should be backported to all stable versions.	2023-06-14 11:27:29 +02:00
Frédéric Lécaille	4d56b725fb	BUG/MINOR: quic: Address inversion in "show quic full" The local address was dumped as "from" address by dump_quic_full() and the peer address as "to" address. This patch fixes this issue. Furthermore, to support the server side (QUIC client) to come, it is preferable to stop using "from" and "to" labels to dump the local and peer addresses which is confusing for a QUIC client which uses its local address as "from" address. To mimic netstat, this is "Local Address" and "Foreign Address" which will be displayed by "show quic" CLI command and "local_addr" and "foreign_addr" for "show quic full" command to mention the local addresses and the peer addresses. Must be backported as far as 2.7.	2023-06-14 09:33:28 +02:00
Frédéric Lécaille	9b1f91fde8	BUG/MINOR: quic: Wrong encryption level flags checking This bug arrived with this commit which was supposed to fix another one: BUG/MINOR: quic: Wrong Application encryption level selection when probing The aim of this patch was to prevent the Application encryption to be selected when probing leading to ACK only packets to be sent if the ack delay timer had fired in the meantime, leading to crashes when no 01-RTT had been sent because the ack range tree is empty in this case. This statement is not correct (qc->pktns->flags & QUIC_FL_PKTNS_PROBE_NEEDED) because qc->pktns is an array of packet number space. But it is equivalent to (qc->pktns[QUIC_TLS_PKTNS_INITIAL].flags & QUIC_FL_PKTNS_PROBE_NEEDED). That said, the patch mentionned above is not more useful since this following which disable the ack time during the handshakes: BUG/MINOR: quic: Do not use ack delay during the handshakes This commit revert the first patch mentionned above. Must be backported as far as 2.6.	2023-06-14 08:54:51 +02:00
William Lallemand	0c9ff0cde4	BUG/MINOR: ssl: log message non thread safe in SSL Hanshake failure It was reported in issue #2181, strange behavior during the new SSL hanshake failure logs. Errors were logged with the code 0, which is unknown to OpenSSL. This patch mades 2 changes: - It stops using ERR_error_string() when the SSL error code is 0 - It uses ERR_error_string_n() to be thread-safe Must be backported to 2.8.	2023-06-12 16:35:57 +02:00
Christopher Faulet	28d17e26b8	BUG/MEDIUM: hlua: Use front SC to detect EOI in HTTP applets' receive functions When an HTTP applet tries to get request data, we must take care to properly detect the end of the message. It an empty HTX message with the SC_FL_EOI flag set on the front SC. However, an issue was introduced during the SC refactoring performed in the 2.8. The backend SC is tested instead of the frontend one. Because of this bug, the receive functions hang because the test on SC_FL_EOI flag never succeeds. Of course, by checking the frontend SC (the opposite SC to the one attached to the appctx), it works. This patch should fix the issue #2180. It must be backported to the 2.8.	2023-06-12 09:16:29 +02:00
Aurelien DARRAGON	b7f8af3ca9	BUG/MINOR: proxy/server: free default-server on deinit proxy default-server is a specific type of server that is not allocated using new_server(): it is directly stored within the parent proxy structure. However, since it may contain some default config options that may be inherited by regular servers, it is also subject to dynamic members (strings, structures..) that needs to be deallocated when the parent proxy is cleaned up. Unfortunately, srv_drop() may not be used directly from p->defsrv since this function is meant to be used on regular servers only (those created using new_server()). To circumvent this, we're splitting srv_drop() to make a new function called srv_free_params() that takes care of the member cleaning which originally takes place in srv_drop(). This function is exposed through server.h, so it may be called from outside server.c. Thanks to this, calling srv_free_params(&p->defsrv) from free_proxy() prevents any memory leaks due to dynamic parameters allocated when parsing a default-server line from a proxy section. This partially fixes GH #2173 and may be backported to 2.8. [While it could also be relevant for other stable versions, the patch won't apply due to architectural changes / name changes between 2.4 => 2.6 and then 2.6 => 2.8. Considering this is a minor fix that only makes memory analyzers happy during deinit paths (at least for <= 2.8), it might not be worth the trouble to backport them any further?]	2023-06-06 15:15:17 +02:00
Aurelien DARRAGON	9be9225ef2	BUG/MINOR: proxy: add missing interface bind free in free_proxy bind->settings.interface hint is allocated when "interface" keyword is specified on a bind line, but the string isn't explicitly freed in proxy_free, resulting in minor memory leak on deinit paths when the keyword is being used. It partially fixes GH #2173 and may be backported to all stable versions. [in 2.2 free_proxy did not exist so the patch must be applied directly in deinit() function from haproxy.c]	2023-06-06 15:15:17 +02:00
Aurelien DARRAGON	c49224a29d	BUG/MINOR: cfgparse-tcp: leak when re-declaring interface from bind line When interface keyword is used multiple times within the same bind line, the previous value isn't checked and is rewritten as-is, resulting in a small memory leak. Ensuring the interface name is first freed before assigning it to a new value. This may be backported to every stable versions. [Note for 2.2, the fix must be performed in bind_parse_interface() from proto_tcp.c, directly within the listener's loop, also ha_free() was not available so free() must be used instead]	2023-06-06 15:15:17 +02:00
Christopher Faulet	2c29d1f524	BUG/MINOR: peers: Improve detection of config errors in peers sections There are several misuses in peers sections that are not detected during the configuration parsing and that could lead to undefined behaviors or crashes. First, only one listener is expected for a peers section. If several bind lines or local peer definitions are used, an error is triggered. However, if multiple addresses are set on the same bind line, there is no error while only the last listener is properly configured. On the 2.8, there is no crash but side effects are hardly predictable. On older version, HAProxy crashes if an unconfigured listener is used. Then, there is no check on remote peers name. It is unexpected to have same name for several remote peers. There is now a test, performed during the post-parsing, to verify all remote peer names are unique. Finally, server parsing options for the peers sections are changed to be sure a port is always defined, and not a port range or a port offset. This patch fixes the issue #2066. It could be backported to all stable versions.	2023-06-05 08:24:34 +02:00
Christopher Faulet	bc9fb64623	BUG/MINOR: spoe: Only skip sending new frame after a receive attempt When a SPOE appctx is processing frames in sync mode, we must only skip sending a new frame if it is still waiting for a ACK frame after a receive attempt. It was performed before the receive attempt. As a consequence, if the ACK frame was received, the SPOE appctx did not try to process queued messages immediately. This could increase the queue time and thus slow down the processing time of the stream. Thanks to Daniel Epperson for his help to diagnose the bug. This patch must be backported to every stable versions.	2023-06-05 08:24:34 +02:00
Frédéric Lécaille	29a1d3679b	BUG/MINOR: quic: Possible crash when SSL session init fails This is due to the fact that qc->conn is never initialized before calling qc_ssl_sess_init(). Must be backported as far as 2.6.	2023-06-02 18:12:48 +02:00
Willy Tarreau	4ad1c9635a	BUG/MINOR: stream: do not use client-fin/server-fin with HTX Historically the client-fin and server-fin timeouts were made to allow a connection closure to be effective quickly if the last data were sent down a socket and the client didn't close, something that can happen when the peer's FIN is lost and retransmits are blocked by a firewall for example. This made complete sense in 1.5 for TCP and HTTP in close mode. But nowadays with muxes, it's not done at the right layer anymore and even the description doesn't match what is being done, because what happens is that the stream will abort the whole transfer after it's done sending to the mux and this timeout expires. We've seen in GH issue 2095 that this can happen with very short timeout values, and while this didn't trigger often before, now that the muxes (h2 & quic) properly report an end of stream before even the first sc_conn_sync_recv(), it seems that it can happen more often, and have two undesirable effects: - logging a timeout when that's not the case - aborting the request channel, hence the server-side conn, possibly before it had a chance to be put back to the idle list, causing this connection to be closed and not reusable. Unfortunately for TCP (mux_pt) this remains necessary because the mux doesn't have a timeout task. So here we're adding tests to only do this through an HTX mux. But to be really clean we should in fact completely drop all of this and implement these timeouts in the mux itself. This needs to be backported to 2.8 where the issue was discovered, and maybe carefully to older versions, though that is not sure at all. In any case, using a higher timeout or removing client-fin in HTTP proxies is sufficient to make the issue disappear.	2023-06-02 16:33:40 +02:00
Tim Duesterhus	33a4461fa9	BUG/MINOR: stats: Fix Lua's `get_stats` function Lua's `get_stats` function stopped working in `4cfb0019e6`, due to the addition a new field ST_F_PROTO without a corresponding entry in `stat_fields`. Fix the issue by adding the entry, like `a46b142e88` did previously for a different field. This patch fixes GitHub Issue #2174, it should be backported to 2.8.	2023-06-02 08:29:25 +02:00
Willy Tarreau	6ccc8625b4	MINOR: quic/cli: clarify the "show quic" help message Make it clear what is expected in the "<format>" field on the help line. This should be backported to 2.7.	2023-05-31 16:15:24 +02:00
Fr�d�ric L�caille	a73563bfa7	MINOR: quic: Add QUIC connection statistical counters values to "show quic" Add the total number of sent packets for each QUIC connection dumped by "show quic". Also add the remaining counter values only if not null. Must be backported to 2.7.	2023-05-31 15:56:19 +02:00
Willy Tarreau	f279a2f148	BUG/MINOR: mux-h2: refresh the idle_timer when the mux is empty There's a rare case where on long fat pipes, we can see the keep-alive timeout trigger before the end of the transfer of the last large object, and the connection closed a bit quickly after the end of the transfer because a GOAWAY is queued. The data are not destroyed, except that the WINDOW_UPDATES from the client arriving late while the last data are being drained by the socket buffers may at some point trigger a reset, and some clients might choke a bit too early on these. Let's make sure we only arm the idle_start timestamp once the output buffer is empty. Of course it will still not cover for the data pending in the socket buffers but it will at least let those in the buffer leave in peace. More elaborate options can be used to protect the data in the kernel buffers, such as the one described in GH issue #5. It's very likely that this old issue was emphasized by the following commit in 2.6: `15a4733d5` ("BUG/MEDIUM: mux-h2: make use of http-request and keep-alive timeouts") and the behavior probably changed again with this one in 2.8, which was backported to 2.7 and scheduled for 2.6: `d38d8c6cc` ("BUG/MEDIUM: mux-h2: make sure control frames do not refresh the idle timeout") As such this patch should be backported to 2.6 after some observation period.	2023-05-31 10:45:30 +02:00
Amaury Denoyelle	d68f8b5a4a	CLEANUP: mux-quic: rename internal functions This patch is similar to the previous one but for QUIC mux functions used inside the mux code itself or application layer. Replace all occurences of qc_* prefix by qcc_* or qcs_*. This should help to better differentiate code between quic_conn and MUX. This should be backported up to 2.7.	2023-05-30 15:45:55 +02:00
Amaury Denoyelle	0f61e4f6d3	CLEANUP: mux-quic: rename functions for mux_ops Rename all QUIC mux function exposed through mux_ops structure. Use the prefix qmux_* or qmux_strm_. The objective is to remove qc_ prefix which should only be used in quic_conn layer. This should be backported up to 2.7.	2023-05-30 15:44:53 +02:00
Willy Tarreau	8fc7073906	BUG/MEDIUM: threads: fix a tiny race in thread_isolate() Aur�lien found a tiny race in thread_isolate() that can allow a thread that was running under isolation to continue running while another one enters isolation. The reason is that the check for harmless is only done before winning the CAS, but since the previously isolated thread doesn't wait for !rdv_request in thread_release(), it can effectively continue its activities while the next one believes it's isolated. A proper solution consists in looping once again in thread_isolate() to recheck (and wait) for all threads to be isolated once the CAS is won. The issue was introduced in 2.7 by commit `598cf3f22` ("MAJOR: threads: change thread_isolate to support inter-group synchronization") so the fix needs to be backported there.	2023-05-27 13:53:46 +02:00
Amaury Denoyelle	bfddb42c05	BUG/MEDIUM: mux-quic: only set EOI on FIN Recently stconn flags were reviewed for QUIC mux to be conform with other HTTP muxes. However, a mistake was made when dealing with a proper stream FIN with both EOI and EOS set. This was done as RESET_STREAM received after a FIN are ignored by QUIC mux and thus there is no difference between EOI or EOI+EOS. However, analyzers may interpret EOS as an interrupted request which result in a 400 HTTP error code. To fix this, only set EOI on proper stream FIN. EOS is set when input is interrupted (RESET_STREAM before FIN) or a STOP_SENDING is received which prevent transfer to complete. In this last case, EOS must be manually set too if FIN has been received before STOP_SENDING to go directly from ERR_PENDING to final ERROR state. This must be backported up to 2.7.	2023-05-26 17:17:25 +02:00
Amaury Denoyelle	6d6ee0dc0b	MINOR: quic: fix stats naming for flow control BLOCKED frames There was a misnaming in stats counter for *_BLOCKED frames in regard to QUIC rfc convention. This patch fixes it to prevent future ambiguity : - STREAMS_BLOCKED -> STREAM_DATA_BLOCKED - STREAMS_DATA_BLOCKED_BIDI -> STREAMS_BLOCKED_BIDI - STREAMS_DATA_BLOCKED_UNI -> STREAMS_BLOCKED_UNI This should be backported up to 2.7.	2023-05-26 17:17:00 +02:00
Amaury Denoyelle	087c5f041b	MINOR: mux-quic: remove nb_streams from qcc Remove nb_streams field from qcc. It was not used outside of a BUG_ON() statement to ensure we never have a negative count of streams. However this is already checked with other fields. This should be backported up to 2.7.	2023-05-26 17:17:00 +02:00
Amaury Denoyelle	7b41dfd834	CLEANUP: mux-quic: remove unneeded fields in qcc Remove fields from qcc structure which are unused. This should be backported up to 2.7.	2023-05-26 17:17:00 +02:00
Aurelien DARRAGON	33bbeecde3	BUILD: init: print rlim_cur as regular integer haproxy does not compile anymore on macOS+clang since `425d7ad` ("MINOR: init: pre-allocate kernel data structures on init"). This is due to rlim_cur being printed uncasted using %lu format specifier, with rlim_cur being stored as a rlim_t which is a typedef so its size may vary depending on the system's architecture. This is not the first time we need to dump rlim_cur in case of errors, there are already multiple occurences in the init code. Everywhere this happens, rlim is casted as a regular int and printed using the '%d' format specifier, so we do the same here as well to fix the build issue. No backport needed unless `425d7ad` gets backported.	2023-05-26 14:29:52 +02:00
eaglegai	ef667b1ad8	BUG/MINOR: thread: add a check for pthread_create preload_libgcc_s() use pthread_create to create a thread and then call pthread_join to use it, but it doesn't check if the option is successful. So add a check to aviod potential crash.	2023-05-26 12:08:23 +02:00
eaglegai	15c3d20e31	BUG/MINOR: ssl_sock: add check for ha_meth in __ssl_sock_init, BIO_meth_new may failed and return NULL if OPENSSL_zalloc failed. in this case, ha_meth will be NULL, and then crash happens in BIO_meth_set_write. So, we add a check for ha_meth.	2023-05-26 12:07:43 +02:00
Patrick Hemmer	425d7ad89d	MINOR: init: pre-allocate kernel data structures on init The Linux kernel maintains data structures to track a processes' open file descriptors, and it expands these structures as necessary when FD usage grows (at every FD=2^X starting at 64). However when threading is in use, during expansion the kernel will pause (observed up to 47ms) while it waits for thread synchronization (see https://bugzilla.kernel.org/show_bug.cgi?id=217366). This change addresses the issue and avoids the random pauses by opening the maximum file descriptor during initialization, so that expansion will not occur while processing traffic.	2023-05-26 09:28:18 +02:00
Christopher Faulet	535dd920df	MINOR: compression: Improve the way Vary header is added When a message is compressed, A "Vary" header is added with "accept-encoding" value. However, a new header is always added, regardless there is already a Vary header or not. In addition, if there is already a Vary header, there is no check on values to be sure "accept-encoding" value is not already there. So it is possible to have it twice. To improve this part, we now test Vary header values and "accept-encoding" is only added if it was not found. In addition, "accept-encoding" value is appended to the last Vary header found, if any. Otherwise, a new header is added.	2023-05-25 11:25:31 +02:00
Aurelien DARRAGON	1c07da4b48	BUG/MINOR: hlua: unsafe hlua_lua2smp() usage Fixing hlua_lua2smp() usage in hlua's code since it was assumed that hlua_lua2smp() makes a standalone smp out of lua data, but it is not the case. This is especially true when dealing with lua strings (string is extracted using lua_tolstring() which returns a pointer to lua string memory location that may be reclaimed by lua at any time when no longer used from lua's point of view). Thus, smp generated by hlua_lua2smp() may only be used from the lua context where the call was initially made, else it should be explicitly duplicated before exporting it out of lua's context to ensure safe (standalone) usage. This should be backported to all stable versions.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	a3624cb528	DOC: hlua: document hlua_lua2smp() function Add some developer notes to hlua_lua2smp() function description since it lacks some important infos, including a critical usage restriction.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	0aaf6c45ca	DOC: hlua: document hlua_lua2arg() function Add some developer notes to hlua_lua2arg() function description since it lacks some important infos, including an usage restriction.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	e5c048a72d	MINOR: hlua: hlua_arg2lua() may LJMP Add LJMP hint to hlua_arg2lua() prototype since it relies on functions (e.g.: lua_pushlstring()) which may raise lua memory errors.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	4121772c50	MINOR: hlua: hlua_smp2lua() may LJMP Add LJMP hint to hlua_smp2lua() prototype since it relies on functions (e.g.: lua_pushstring()) which may raise lua memory errors.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	742b1a8797	MINOR: hlua: hlua_smp2lua_str() may LJMP Add LJMP hint to hlua_smp2lua_str() prototype since it relies on functions (e.g.: lua_pushstring()) which may raise lua memory errors.	2023-05-24 16:48:17 +02:00
Fr�d�ric L�caille	12a815ad19	MINOR: quic: Add a counter for sent packets Add ->sent_pkt counter to quic_conn struct to count the packet at QUIC connection level. Then, when the connection is released, the ->sent_pkt counter value is added to the one for the listener. Must be backported to 2.7.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	bdd64fd71d	MINOR: quic: Add some counters at QUIC connection level Add some statistical counters to quic_conn struct from quic_counters struct which are used at listener level to handle them at QUIC connection level. This avoid calling atomic functions. Furthermore this will be useful soon when a counter will be added for the total number of packets which have been sent which will be very often incremented. Some counters were not added, espcially those which count the number of QUIC errors by QUIC error types. Indeed such counters would be incremented most of the time only one time at QUIC connection level. Implement quic_conn_prx_cntrs_update() which accumulates the QUIC connection level statistical counters to the listener level statistical counters. Must be backported to 2.7.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	464281af46	CLEANUP: quic: Useless tests in qc_rx_pkt_handle() There is no reason to test <qc> nullity at the end of this function because it is clearly not null, furthermore the trace handle the case where <qc> is null. Must be backported to 2.7.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	ab3aa0ff22	CLEANUP: quic: Indentation fix quic_rx_pkt_retrieve_conn() Add missing spaces. Must be backported to 2.7.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	5fa633e22f	MINOR: quic: Align "show quic" command help information Align the "show quic" help information with all the others command help information. Furthermore, makes this information match the management documentation. Must be backported to 2.7.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	35b63964a0	BUG/MINOR: quic: Missing Retry token length on receipt quic_retry_token_check() must decipher the token sent to and received back from clients. This token is made of the token format byte, the ODCID prefixed by its one byte length, the timestamp of its creation, and terminated by an AEAD TAG followed by the salt used to derive the secret to cipher the token. So, the length of these data must be between 2 + QUIC_ODCID_MINLEN + sizeof(uint32_t) + QUIC_TLS_TAG_LEN + QUIC_RETRY_TOKEN_SALTLEN and 2 + QUIC_CID_MAXLEN + sizeof(uint32_t) + QUIC_TLS_TAG_LEN + QUIC_RETRY_TOKEN_SALTLEN. Must be backported to 2.7 and 2.6.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	6d6ddb2ce5	BUG/MINOR: quic: Wrong token length check (quic_generate_retry_token()) This bug would never occur because the buffer supplied to quic_generate_retry_token() to build a Retry token is large enough to embed such a token. Anyway, this patch fixes quic_generate_retry_token() implementation. There were two errors: this is the ODCID which is added to the token. Furthermore the timestamp was not taken into an account. Must be backported to 2.6 and 2.7.	2023-05-24 16:30:11 +02:00
Fr�d�ric L�caille	aaf32f0c83	MINOR: quic: Add low level traces (addresses, DCID) Add source and destination addresses to QUIC_EV_CONN_RCV trace event. This is used by datagram/socket level functions (quic_sock.c). Must be backported to 2.7.	2023-05-24 16:30:11 +02:00
Christopher Faulet	c2f1d0ee5e	BUG/MEDIUM: mux-h2: Propagate termination flags when frontend SC is created We must evaluate if EOS/EOI/ERR_PENDING/ERROR flags must be set on the SE when the frontend SC is created because the rxbuf is transferred to the steeam at this stage. It means the call to h2_rcv_buf() may be skipped on some circumstances. And indeed, it happens when HAproxy quickly replies, for instance because of a deny rule. In this case, depending on the scheduling, the abort may block the receive attempt from the SC. In this case if SE flags were not properly set earlier, there is no way to terminate the request and the session may be freezed. For now, I can't explain why there is no timeout when this happens but it remains an issue because here we should not rely on timeouts to close the stream. This patch relies on following commits: * MINOR: mux-h2: Add a function to propagate termination flags from h2s to SE * MINOR: mux-h2: Set H2_SF_ES_RCVD flag when decoding the HEADERS frame The issue was encountered on the 2.8 but it seems the bug exists since the 2.4. But it is probably a good idea to only backport the series to 2.7 only and wait for a bug report on earlier versions. This patch should solve the issue #2147.	2023-05-24 16:06:11 +02:00
Christopher Faulet	531dd050ff	MINOR: mux-h2: Add a function to propagate termination flags from h2s to SE The function h2s_propagate_term_flags() was added to check the H2S state and evaluate when EOI/EOS/ERR_PENDING/ERROR flags must be set on the SE. It is not the only place where those flags are set. But it centralizes the synchro between the H2 stream and the SC. For now, this function is only used at the end of h2_rcv_buf(). But it will be used to fix a bug.	2023-05-24 16:06:11 +02:00
Christopher Faulet	1a60a66306	MINOR: mux-h2: Set H2_SF_ES_RCVD flag when decoding the HEADERS frame The flag H2_SF_ES_RCVD is set on the H2 stream when the ES flag is found in a frame. On HEADERS frame, it was set in function processing the frame. It is moved in the function decoding the frame. Fundamentally, this changes nothing. But it will be useful to have this information earlier when a client H2 stream is created.	2023-05-24 16:06:11 +02:00
Christopher Faulet	78b1eb2b04	BUG/MINOR: mux-h2: Check H2_SF_BODY_TUNNEL on H2S flags and not demux frame ones In h2c_frt_stream_new(), H2_SF_BODY_TUNNEL flags was tested on demux frame flags (h2c->dff) instead of the h2s flags. By chance, it is a noop test becasue H2_SF_BODY_TUNNEL value, once converted to an int8_t, is 0. It is a 2.8-specific issue. No backport needed.	2023-05-24 16:06:11 +02:00
Amaury Denoyelle	152beeec34	MINOR: mux-quic: report error on stream-endpoint earlier A RESET_STREAM is emitted in several occasions : - protocol error during HTTP/3.0 parsing - STOP_SENDING reception In both cases, if a stream-endpoint is attached we must set its ERR flag. This was correctly done but after some delay as it was only when the RESET_STREAM was emitted. Change this to set the ERR flag as soon as one of the upper cases has been encountered. This should help to release faster streams in error. This should be backported up to 2.7.	2023-05-24 14:46:52 +02:00
Amaury Denoyelle	37d78997ae	MINOR: mux-quic: only set EOS on RESET_STREAM recv A recent review was done to rationalize ERR/EOS/EOI flags on stream endpoint. A common definition for both H1/H2/QUIC mux have been written in the following documentation : ./doc/internals/stconn-close.txt In QUIC it is possible to close each channels of a stream independently with RESET_STREAM and STOP_SENDING frames. When a RESET_STREAM is received, it indicates that the peer has ended its transmission in an abnormal way. However, it is still ready to receive. Previously, on RESET_STREAM reception, QUIC MUX set the ERR flag on stream-endpoint. However, according to the QUIC mechanism, it should be instead EOS but this was impossible due to a BUG_ON() which prevents EOS without EOI or ERR. This BUG_ON was only present because this case was never used before the introduction of QUIC. It was removed in a recent commit which allows us to now properly set EOS alone on RESET_STREAM reception. In practice, this change allows to continue to send data even after RESET_STREAM reception. However, currently browsers always emit it with a STOP_SENDING as this is used to abort the whole H3 streams. In the end this will result in a stream-endpoint with EOS and ERR_PENDING/ERR flags. This should be backported up to 2.7.	2023-05-24 14:39:17 +02:00
Amaury Denoyelle	8de35925f7	MINOR: mux-quic: set both EOI EOS for stream fin A recent review was done to rationalize ERR/EOS/EOI flags on stream endpoint. A common definition for both H1/H2/QUIC mux have been written in the following documentation : ./doc/internals/stconn-close.txt Always set EOS with EOI flag to conform to this specification. EOI is set whenever the proper stream end has been encountered : with QUIC it corresponds to a STREAM frame with FIN bit. At this step, RESET_STREAM frames are ignored by QUIC MUX as allowed by RFC 9000. This means we can always set EOS at the same time with EOI. This should be backported up to 2.7.	2023-05-24 14:23:22 +02:00
Christopher Faulet	2437377445	MEDIUM: stconn/applet: Allow SF_SL_EOS flag alone During the refactoring on SC/SE flags, it was stated that SE_FL_EOS flag should not be set without on of SE_FL_EOI or SE_FL_ERROR flags. In fact, it is a problem for the QUIC/H3 multiplexer. When a RST_STREAM frame is received, it means no more data will be received from the peer. And this happens before the end of the message (RST_STREAM frame received after the end of the message are ignored). At this stage, it is a problem to report an error because from the QUIC point of view, it is valid. Data may still be sent to the peer. If an error is reported, this will stop the data sending too. In the same idea, the H1 mulitplexer reports an error when the message is truncated because of a read0. But only an EOS flag should be reported in this case, not an error. Fundamentally, it is important to distinguish errors from shuts for reads because some cases are valid. For instance a H1 client can choose to stop uploading data if it received the server response. So, relax tests on SE flags by removing BUG_ON_HOT() on SE_FL_EOS flag. For now, the abort will be handled in the HTTP analyzers.	2023-05-23 15:52:35 +02:00
Amaury Denoyelle	aa39cc9f42	MINOR: quic: fix alignment of oneline show quic Output of 'show quic' CLI in oneline mode was not correctly done. This was caused both due to differing qc pointer size and ports length. Force proper alignment by using maximum sizes as expected and complete with blanks if needed. This should be backported up to 2.7.	2023-05-22 14:18:02 +02:00
Amaury Denoyelle	7385ff3f0c	BUG/MINOR: quic: handle Tx packet allocation failure properly qc_prep_app_pkts() is responsible to built several new packets for sending. It can fail due to memory allocation error. Before this patch, the Tx buffer was released on error even if some packets were properly generated. With this patch, if an error happens on qc_prep_app_pkts(), we still try to send already built packets if Tx buffer is not empty. The sending loop is then interrupted and the Tx buffer is released with data cleared. This should be backported up to 2.7.	2023-05-22 14:18:02 +02:00
Amaury Denoyelle	f8fbb0b94e	MINOR: quic: use WARN_ON for encrypt failures It is expected that quic_packet_encrypt() and quic_apply_header_protection() never fails as encryption is done in place. This allows to remove their return value. This is useful to simplify error handling on sending path. An error can only be encountered on the first steps when allocating a new packet or copying its frame content. After a clear packet is successfully built, no error is expected on encryption. However, it's still unclear if our assumption that in-place encryption function never fail. As such, a WARN_ON() statement is used if an error is detected at this stage. Currently, it's impossible to properly manage this without data loss as this will leave partially unencrypted data in the send buffer. If warning are reported a solution will have to be implemented. This should be backported up to 2.7.	2023-05-22 11:20:44 +02:00
Amaury Denoyelle	5eadc27623	MINOR: quic: remove return val of quic_aead_iv_build() quic_aead_iv_build() should never fail unless we call it with buffers of different size. This never happens in the code as every input buffers are of size QUIC_TLS_IV_LEN. Remove the return value and add a BUG_ON() to prevent future misusage. This is especially useful to remove one error handling on the sending patch via quic_packet_encrypt(). This should be backported up to 2.7.	2023-05-22 11:17:18 +02:00
Amaury Denoyelle	8d6d246dbc	CLEANUP: mux-quic/h3: complete BUG_ON with comments Complete each useful BUG_ON statements with a comment to explain its purpose. Also convert BUG_ON_HOT to BUG_ON as they should not have a big impact. This should be backported up to 2.7.	2023-05-22 11:17:18 +02:00
Aurelien DARRAGON	b6a24a52a2	BUG/MINOR: debug: fix pointer check in debug_parse_cli_task() Task pointer check in debug_parse_cli_task() computes the theoric end address of provided task pointer to check if it is valid or not thanks to may_access() helper function. However, relative ending address is calculated by adding task size to 't' pointer (which is a struct task pointer), thus it will result to incorrect address since the compiler automatically translates 't + x' to 't + x * sizeof(t)' internally (with sizeof(t) != 1 here). Solving the issue by using 'ptr' (which is the void * raw address) as starting address to prevent automatic address scaling. This was revealed by coverity, see GH #2157. No backport is needed, unless `9867987` ("DEBUG: cli: add "debug dev task" to show/wake/expire/kill tasks and tasklets") gets backported.	2023-05-17 16:49:17 +02:00
Aurelien DARRAGON	7428adaf0d	BUG/MINOR: hlua: SET_SAFE_LJMP misuse in hlua_event_runner() When hlua_event_runner() pauses the subscription (ie: if the consumer can't keep up the pace), hlua_traceback() is used to get the current lua trace (running context) to provide some info to the user. However, as hlua_traceback() may raise an error (__LJMP) is set, it is used within a SET_SAFE_LJMP() / RESET_SAFE_LJMP() combination to ensure lua errors are properly handled and don't result in unexpected behavior. But the current usage of SET_SAFE_LJMP() within the function is wrong since hlua_traceback() will run a second time (unprotected) if the first (protected) attempt fails. This is undefined behavior and could even lead to crashes. Hopefully it is very hard to trigger this code path, thus we can consider this as a minor bug. Also using this as an opportunity to enhance the message report to make it more meaningful to the user. This should fix GH #2159. It is a 2.8 specific bug, no backport needed unless `c84899c636` ("MEDIUM: hlua/event_hdl: initial support for event handlers") gets backported.	2023-05-17 16:48:40 +02:00
Christopher Faulet	06e9c81bd0	MEDIUM: resolvers: Stop scheduling resolution during stopping stage When the process is stopping, the server resolutions are suspended. However the task is still periodically woken up for nothing. If there is a huge number of resolution, it may lead to a noticeable CPU consumption for no reason. To avoid this extra CPU cost, we stop to schedule the the resolution tasks during the stopping stage. Of course, it is only true for server resolutinos. Dynamic ones, via do-resolve actions, are not concerned. These ones must still be triggered during stopping stage. Concretly, during the stopping stage, the resolvers task is no longer scheduled if there is no running resolutions. In this case, if a do-resolve action is evaluated, the task is woken up. This patch should partially solve the issue #2145.	2023-05-17 16:48:33 +02:00
Christopher Faulet	8bca3cc8c7	MEDIUM: checks: Stop scheduling healthchecks during stopping stage When the process is stopping, the health-checks are suspended. However the task is still periodically woken up for nothing. If there is a huge number of health-checks and if they are woken up in same time, it may lead to a noticeable CPU consumption for no reason. To avoid this extra CPU cost, we stop to schedule the health-check tasks when the proxy is disabled or stopped. This patch should partially solve the issue #2145.	2023-05-17 14:57:10 +02:00
Christopher Faulet	c8a7bb16b7	CLEANUP: fcgi-app; Remove useless assignment to NULL When the fcgi configuration is checked and fcgi rules are created, a useless assignment to NULL is reported by Covertiy. Let's remove it. This patch should fix the coverity report #2161.	2023-05-17 09:42:37 +02:00
Willy Tarreau	c7b9308f20	BUG/MINOR: clock: automatically adjust the internal clock with the boot time This is a better and more general solution to the problem described in this commit: BUG/MINOR: checks: postpone the startup of health checks by the boot time Now we're updating the now_offset that is used to compute now_ms at the few points where we update the ready date during boot. This ensures that now_ms while being stable during all the boot process will be correct and will start with the boot value right after the boot is finished. As such the patch above is rolled back (we don't want to count the boot time twice). This must not be backported because it relies on the more flexible clock architecture in 2.8.	2023-05-17 09:33:54 +02:00
Willy Tarreau	5345490b8e	MINOR: clock: provide a function to automatically adjust now_offset Right now there's no way to enforce a specific value of now_ms upon startup in order to compensate for the time it takes to load a config, specifically when dealing with the health check startup. For this we'd need to force the now_offset value to compensate for the last known value of the current date. This patch exposes a function to do exactly this.	2023-05-17 09:33:54 +02:00
Willy Tarreau	8e978a094d	BUG/MINOR: checks: postpone the startup of health checks by the boot time When health checks are started at boot, now_ms could be off by the boot time. In general it's not even noticeable, but with very large configs taking up to one or even a few seconds to start, this can result in a part of the servers' checks being scheduled slightly in the past. As such all of them will start groupped, partially defeating the purpose of the spread-checks setting. For example, this can cause a burst of connections for the network, or an excess of CPU usage during SSL handshakes, possibly even causing some timeouts to expire early. Here in order to compensate for this, we simply add the known boot time to the computed delay when scheduling the startup of checks. That's very simple and particularly efficient. For example, a config with 5k servers in 800 backends checked every 5 seconds, that was taking 3.8 seconds to start used to show this distribution of health checks previously despite the spread-checks 50: 3690 08:59:25 417 08:59:26 213 08:59:27 71 08:59:28 428 08:59:29 860 08:59:30 918 08:59:31 938 08:59:32 1124 08:59:33 904 08:59:34 647 08:59:35 890 08:59:36 973 08:59:37 856 08:59:38 893 08:59:39 154 08:59:40 Now with the fix it shows this: 470 08:59:59 929 09:00:00 896 09:00:01 937 09:00:02 854 09:00:03 827 09:00:04 906 09:00:05 863 09:00:06 913 09:00:07 873 09:00:08 162 09:00:09 This should be backported to all supported versions. It depends on this commit: MINOR: clock: measure the total boot time For 2.8 where the internal clock is now totally independent on the human one, an more generic fix will consist in simply updating now_ms to reflect the startup time.	2023-05-17 09:33:54 +02:00
Willy Tarreau	5723b382ed	MINOR: stats: report the boot time in "show info" Just like we have the uptime in "show info", let's add the boot time. It's trivial to collect as it's just the difference between the ready date and the start date, and will allow users to monitor this element in order to take action before it starts becoming problematic. Here the boot time is reported in milliseconds, so this allows to even observe sub-second anomalies in startup delays.	2023-05-17 09:33:54 +02:00
Willy Tarreau	da4aa6905c	MINOR: clock: measure the total boot time Some huge configs take a significant amount of time to start and this can cause some trouble (e.g. health checks getting delayed and grouped, process not responding to the CLI etc). For example, some configs might start fast in certain environments and slowly in other ones just due to the use of a wrong DNS server that delays all libc's resolutions. Let's first start by measuring it by keeping a copy of the most recently known ready date, once before calling check_config_validity() and then refine it when leaving this function. A last call is finally performed just before deciding to split between master and worker processes, and it covers the whole boot. It's trivial to collect and even allows to get rid of a call to clock_update_date() in function check_config_validity() that was used in hope to better schedule future events.	2023-05-17 09:33:54 +02:00
Willy Tarreau	52fd879953	CLEANUP: stats: update the trash chunk where it's used When integrating the number of warnings in "show info" in 2.8 with commit `3c4a297d2` ("MINOR: stats: report the total number of warnings issued"), the update of the trash buffer used by the Tainted flag got displaced lower. There's no harm for now util someone adds a new metric requiring a call to chunk_newstr() and gets both values merged. Let's move the call to its location now.	2023-05-17 09:33:54 +02:00
Christopher Faulet	cb76030356	CLEANUP: check; Remove some useless assignments to NULL In process_chk_conn(), some assignments to NULL are useless and are reported by Coverity as unused value. while it is harmless, these assignments can be removed. This patch should fix the coverity report #2158.	2023-05-17 09:28:23 +02:00
Aurelien DARRAGON	0d2f1acee6	BUG/MINOR: server: memory leak in _srv_update_status_op() on server DOWN When server is transitionning from UP to DOWN, a log message is generated. e.g.: "Server backend_name/server_name is DOWN") However since `f71e064` ("MEDIUM: server: split srv_update_status() in two functions"), the allocated buffer tmptrash which is used to prepare the log message is not freed after it has been used, resulting in a small memory leak each time a server goes DOWN because of an operational change. This is a 2.8 specific bug, no backport needed unless the above commit gets backported.	2023-05-17 09:21:01 +02:00
Aurelien DARRAGON	22d584a993	CLEANUP: server: remove useless tmptrash assigments in srv_update_status() Within srv_update_status subfunctions _op() and _adm(), each time tmptrash is freed, we assign it to NULL to ensure it will not be reused. However, within those functions it is not very useful given that tmptrash is never checked against NULL except upon allocation through alloc_trash_chunk(), which happens everytime a new log message is generated, sent, and then freed right away, so there are no code paths that could lead to tmptrash being checked for reuse (tmptrash is systematically overwritten since all log messages are independant from each other). This was raised by coverity, see GH #2162.	2023-05-17 09:21:01 +02:00
Christopher Faulet	2d5a5665fe	BUG/MINOR: tcp-rules: Don't shortened the inspect-delay when EOI is set A regression was introduced with the commit `cb59e0bc3` ("BUG/MINOR: tcp-rules: Stop content rules eval on read error and end-of-input"). We should not shorten the inspect-delay when the EOI flag is set on the SC. Idea of the inspect-delay is to wait a TCP rule is matching. It is only interrupted if an error occurs, on abort or if the peer shuts down. It is also interrupted if the buffer is full. This last case is a bit ambiguous and discutable. It could be good to add ACLS, like "wait_complete" and "wait_full" to do so. But for now, we only remove the test on SC_FL_EOI flag. This patch must be backported to all stable versions.	2023-05-17 09:21:01 +02:00
Willy Tarreau	b93758cec9	MINOR: checks: make sure spread-checks is used also at boot time This makes use of spread-checks also for the startup of the check tasks. This provides a smoother load on startup for uneven configurations which tend to enable only some servers. Below is the connection distribution per second of the SSL checks of a config with 5k servers spread over 800 backends, with a check inter of 5 seconds: - default: 682 08:00:50 826 08:00:51 773 08:00:52 1016 08:00:53 885 08:00:54 889 08:00:55 825 08:00:56 773 08:00:57 1016 08:00:58 884 08:00:59 888 08:01:00 491 08:01:01 - with spread-checks 50: 437 08:01:19 866 08:01:20 777 08:01:21 1023 08:01:22 1118 08:01:23 923 08:01:24 641 08:01:25 859 08:01:26 962 08:01:27 860 08:01:28 929 08:01:29 909 08:01:30 866 08:01:31 849 08:01:32 114 08:01:33 - with spread-checks 50 + this patch: 680 08:01:55 922 08:01:56 962 08:01:57 899 08:01:58 819 08:01:59 843 08:02:00 916 08:02:01 896 08:02:02 886 08:02:03 846 08:02:04 903 08:02:05 894 08:02:06 178 08:02:07 The load is much smoother from the start, this can help initial health checks succeed when many target the same overloaded server for example. This could be backported as it should make border-line configs more reliable across reloads.	2023-05-17 08:10:40 +02:00
Amaury Denoyelle	bf86d89ea6	BUG/MEDIUM: mux-quic: fix EOI for request without payload When a full message is received for a stream, MUX is responsible to set EOI flag. This was done through rcv_buf stream callback by checking if QCS HTX buffer contained the EOM flag. This is not correct for HTTP without body. In this case, QCS HTX buffer is never used. Only a local HTX buffer is used to transfer headers just as stream endpoint is created. As such, EOI is never transmitted to the upper layer. If the transfer occur without any issue, this does not seem to cause any problem. However, in case the transfer is aborted, the stream is never released which cause a memory leak and prevent the process soft-stop. To fix this, also check if EOM is put by application layer during headers conversion. If true, this is transferred through a new argument to qc_attach_sc() MUX function which is responsible to set the EOI flag. This issue was reproduced using h2load with hundred of connections. h2load is interrupted with a SIGINT which causes streams to never be closed on haproxy side. This should be backported up to 2.6.	2023-05-16 17:53:45 +02:00
Amaury Denoyelle	1a2faef92f	MINOR: mux-quic: uninline qc_attach_sc() Uninline and move qc_attach_sc() function to implementation source file. This will be useful for next commit to add traces in it. This should be backported up to 2.7.	2023-05-16 17:53:45 +02:00
Amaury Denoyelle	3cb78140cf	MINOR: mux-quic: properly report end-of-stream on recv MUX is responsible to put EOS on stream when read channel is closed. This happens if underlying connection is closed or a RESET_STREAM is received. FIN STREAM is ignored in this case. For connection closure, simply check for CO_FL_SOCK_RD_SH. For RESET_STREAM reception, a new flag QC_CF_RECV_RESET has been introduced. It is set when RESET_STREAM is received, unless we already received all data. This is conform to QUIC RFC which allows to ignore a RESET_STREAM in this case. During RESET_STREAM processing, input buffer is emptied so EOS can be reported right away on recv_buf operation. This should be backported up to 2.7.	2023-05-16 17:53:45 +02:00
Amaury Denoyelle	1649469be1	MINOR: mux-quic: add trace to stream rcv_buf operation Add traces to render each stream transition more explicit. Also, move ERR_PENDING to ERROR transition after other stream flags are set, as with the MUX H2 implementation. This is purely a cosmetic change and it should have no functional impact. This should be backported up to 2.7.	2023-05-16 17:53:45 +02:00
Amaury Denoyelle	6133aba889	BUG/MINOR: h3: missing goto on buf alloc failure The following patch introduced proper error management on buffer allocation failure : `0abde9dee6` BUG/MINOR: mux-quic: properly handle buf alloc failure However, when decoding an empty STREAM frame with just FIN bit set, this was not done correctly. Indeed, there is a missing goto statement in case of a NULL buffer check. This was reported thanks to coverity analysis. This should fix github issue #2163. This must be backported up to 2.6.	2023-05-15 14:57:56 +02:00
Amaury Denoyelle	1611a7659b	BUG/MINOR: mux-quic: handle properly Tx buf exhaustion Since the following patch commit `6c501ed23b` BUG/MINOR: mux-quic: differentiate failure on qc_stream_desc alloc it is not possible to check if Tx buf allocation failed due to a configured limit exhaustion or a simple memory failure. This patch fixes it as the condition was inverted. Indeed, if buf_avail is null, this means that the limit has been reached. On the contrary case, this is a real memory alloc failure. This caused the flag QC_CF_CONN_FULL to not be properly used and may have caused disruption on transfer with several streams or large data. This was detected due to an abnormal error QUIC MUX traces. Also change in consequence trace for limit exhaustion to be more explicit. This must be backported up to 2.6.	2023-05-15 14:06:21 +02:00
William Lallemand	6e0c39d7ac	BUILD: ssl: ssl_c_r_dn fetches uses functiosn only available since 1.1.1 Fix the openssl build with older openssl version by disabling the new ssl_c_r_dn fetch. This also disable the ssl_client_samples.vtc file for OpenSSL version older than 1.1.1	2023-05-15 12:07:52 +02:00
Willy Tarreau	d38d8c6ccb	BUG/MEDIUM: mux-h2: make sure control frames do not refresh the idle timeout Christopher found as part of the analysis of Tim's issue #1891 that commit `15a4733d5` ("BUG/MEDIUM: mux-h2: make use of http-request and keep-alive timeouts") introduced in 2.6 incompletely addressed a timeout issue in the H2 mux. The problem was that the http-keepalive and http-request timeouts were not applied before it. With that commit they are now considered, but if a GOAWAY is sent (or even attempted to be sent), then they are not used anymore again, because the way the code is arranged consists in applying the client-fin timeout (if set) to the current date, and falling back to the client timeout, without considering the idle_start period. This means that a config having a "timeout http-keepalive" would still not close the connection quickly when facing a client that periodically sends PING, PRIORITY or whatever other frame types. In addition, after the GOAWAY was attempted to be sent, there was no check for pending data in the output buffer, meaning that it would be possible to truncate some responses in configs involving a very short client-fin timeout. Finally the spreading of the closures during the soft-stop brought in 2.6 by commit `b5d968d9b` ("MEDIUM: global: Add a "close-spread-time" option to spread soft-stop on time window") didn't consider the particular case of an idle "pre-connect" connection, which would also live long if a browser failed to deliver a valid request for a long time. All of this indicates that the conditions must be reworked so as not to have that level of exclusion between conditions, but rather stick to the rules from the doc that are already enforced on other muxes: - timeout client always applies if there are data pending, and is relative to each new I/O ; - timeout http-request applies before the first complete request and is relative to the entry in idle state ; - timeout http-keepalive applies between idle and the next complete request and is relative to the entry in idle state ; - timeout client-fin applies when in idle after a shut was sent (here the shut is the GOAWAY). The shut may only be considered as sent if the buffer is empty and the flags indicate that it was successfully sent (or failed) but not if it's still waiting for some room in the output buffer for example. This implies that this timeout may then lower the http-keepalive/http-request ones. This is what this patch implements. Of course the client timeout still applies as a fallback when all the ones above are not set or when their conditions are not met. It would seem reasoanble to backport this to 2.7 first, then only after one or two releases to 2.6.	2023-05-15 12:01:20 +02:00
Abhijeet Rastogi	df97f472fa	MINOR: ssl: add new sample ssl_c_r_dn This patch addresses #1514, adds the ability to fetch DN of the root ca that was in the chain when client certificate was verified during SSL handshake.	2023-05-15 10:48:05 +02:00
William Lallemand	7f95469163	MEDIUM: proxy: stop emitting logs for internal proxies when stopping The HTTPCLIENT and the OCSP-UPDATE proxies are internal proxies, we don't need to display logs of them stopping during the stopping of the process. This patch checks if a proxy has the flag PR_CAP_INT so it doesn't display annoying messages.	2023-05-15 10:38:09 +02:00
Christopher Faulet	6eb53b138d	MINOR: stconn: Remove useless test on sedesc on detach to release the xref When the SC is detached from the endpoint, the xref between the endpoints is removed. At this stage, the sedesc cannot be undefined. So we can remove the test on it. This issue should fix the issue #2156. No backport needed.	2023-05-15 09:53:30 +02:00
William Lallemand	1601eebcd1	MEDIUM: mworker/cli: does not disconnect the master CLI upon error In the proxy CLI analyzer, when pcli_parse_request returns -1, the client was shut to prevent any problem with the master CLI. This behavior is a little bit excessive and not handy at all in prompt mode. For example one could have activated multiples mode, then have an error which disconnect the CLI, and they would have to reconnect and enter all the modes again. This patch introduces the pcli_error() function, which only output an error and flush the input buffer, instead of closing everything. When encountering a parsing error, this function is used, and the prompt is written again, without any disconnection.	2023-05-14 18:42:31 +02:00
William Lallemand	4adb4b9903	MEDIUM: session/ssl: return the SSL error string during a SSL handshake error SSL hanshake error were unable to dump the OpenSSL error string by default, to do so it was mandatory to configure a error-log-format with the ssl_fc_err fetch. This patch implements the session_build_err_string() function which creates the error log to send during session_kill_embryonic(), a special case is made with CO_ER_SSL_HANDSHAKE which is able to dump the error string with ERR_error_string(). Before: <134>May 12 17:14:04 haproxy[183151]: 127.0.0.1:49346 [12/May/2023:17:14:04.571] frt2/1: SSL handshake failure After: <134>May 12 17:14:04 haproxy[183151]: 127.0.0.1:49346 [12/May/2023:17:14:04.571] frt2/1: SSL handshake failure (error:0A000418:SSL routines::tlsv1 alert unknown ca)	2023-05-12 17:43:58 +02:00
Amaury Denoyelle	ee65efbfae	BUG/MINOR: mux-quic: free task on qc_init() app ops failure qc_init() is used to initialize a QUIC MUX instance. On failure, each resources are released via a series of goto statements. There is one issue if the app_ops.init callback fails. In this case, MUX task is not freed. This can cause a crash as the task is already scheduled. When the handler will run, it will crash when trying to access qcc instance. To fix this, properly destroy qcc task on fail_install_app_ops label. The impact of this bug is minor as app_ops.init callback succeeds most of the time. However, it may fail on allocation failure due to memory exhaustion. This may fix github issue #2154. This must be backported up to 2.7.	2023-05-12 16:37:27 +02:00
Amaury Denoyelle	6c501ed23b	BUG/MINOR: mux-quic: differentiate failure on qc_stream_desc alloc qc_stream_buf_alloc() can fail for two reasons : * limit of Tx buffer per connection reached * allocation failure The first case is properly treated. A flag QC_CF_CONN_FULL is set on the connection to interrupt emission. It is cleared when a buffer became available after in order ACK reception and the MUX tasklet is woken up. The allocation failure was handled with the same mechanism which in this case is not appropriate and could lead to a connection transfer freeze. Instead, prefer to close the connection with a QUIC internal error code. To differentiate the two causes, qc_stream_buf_alloc() API was changed to return the number of available buffers to the caller. This must be backported up to 2.6.	2023-05-12 16:26:20 +02:00
Amaury Denoyelle	50fe00650f	BUG/MINOR: quic: do not alloc buf count on alloc failure The total number of buffer per connection for sending is limited by a configuration value. To ensure this, <stream_buf_count> quic_conn field is incremented on qc_stream_buf_alloc(). qc_stream_buf_alloc() may fail if the buffer cannot be allocated. In this case, <stream_buf_count> should not be incremented. To fix this, simply move increment operation after buffer allocation. The impact of this bug is low. However, if a connection suffers from several buffer allocation failure, it may cause the <stream_buf_count> to be incremented over the limit without being able to go back down. This must be backported up to 2.6.	2023-05-12 15:55:41 +02:00
Amaury Denoyelle	d00b3093c9	BUG/MINOR: mux-quic: handle properly recv ncbuf alloc failure The function qc_get_ncbuf() is used to allocate a ncbuf content. Allocation failure was handled using a plain BUG_ON. Fix this by a proper error management. This buffer is only used for STREAM frame reception to support out-of-order offsets. When an allocation failed, close the connection with a QUIC internal error code. This should be backported up to 2.6.	2023-05-12 15:52:19 +02:00
Amaury Denoyelle	0abde9dee6	BUG/MINOR: mux-quic: properly handle buf alloc failure A convenience function qc_get_buf() is implemented to centralize buffer allocation on MUX and H3 layers. However, allocation failure was not handled properly with a BUG_ON() statement. Replace this by proper error management. On emission, streams is temporarily skip over until the next qc_send() invocation. On reception, H3 uses this function for HTX conversion; on alloc failure the connection will be closed with QUIC internal error code. This must be backported up to 2.6.	2023-05-12 15:51:15 +02:00
Amaury Denoyelle	93dd23cab4	MINOR: mux-quic: remove dedicated function to handle standalone FIN Remove QUIC MUX function qcs_http_handle_standalone_fin(). The purpose of this function was only used when receiving an empty STREAM frame with FIN bit. Besides, it was called by each application protocol which could have different approach and render the function purpose unclear. Invocation of qcs_http_handle_standalone_fin() have been replaced by explicit code in both H3 and HTTP/0.9 module. In the process, use htx_set_eom() to reliably put EOM on the HTX message. This should be backported up to 2.7, along with the previous patch which introduced htx_set_eom().	2023-05-12 15:50:30 +02:00
Amaury Denoyelle	25cf19d5c8	MINOR: htx: add function to set EOM reliably Implement a new HTX utility function htx_set_eom(). If the HTX message is empty, it will first add a dummy EOT block. This is a small trick needed to ensure readers will detect the HTX buffer as not empty and retrieve the EOM flag. Replace the H2 code related by a htx_set_eom() invocation. QUIC also has the same code which will be replaced in the next commit. This should be backported up to 2.7 before the related QUIC patch.	2023-05-12 15:29:28 +02:00
Frédéric Lécaille	76d502588d	BUG/MINOR: quic: Wrong redispatch for external data on connection socket It is possible to receive datagram from other connection on a dedicated quic-conn socket. This is due to a race condition between bind() and connect() system calls. To handle this, an explicit check is done on each datagram. If the DCID is not associated to the connection which owns the socket, the datagram is redispatch as if it arrived on the listener socket. This redispatch step was not properly done because the source address specified for the redispatch function was incorrect. Instead of using the datagram source address, we used the address of the socket quic-conn which received the datagram due to the above race condition. Fix this simply by using the address from the recvmsg() system call. The impact of this bug is minor as redispatch on connection socket should be really rare. However, when it happens it can lead to several kinds of problems, like for example a connection initialized with an incorrect peer address. It can also break the Retry token check as this relies on the peer address. In fact, Retry token check failure was the reason this bug was found. When using h2load with thousands of clients, the counter of Retry token failure was unusually high. With this patch, no failure is reported anymore for Retry. Must be backported to 2.7.	2023-05-12 14:48:30 +02:00
Aurelien DARRAGON	256d581fbd	BUG/MINOR: log: fix memory error handling in parse_logsrv() A check was missing in parse_logsrv() to make sure that malloc-dependent variable is checked for non-NULL before using it. If malloc fails, the function raises an error and stops, like it's already done at a few other places within the function. This partially fixes GH #2130. It should be backported to every stable versions.	2023-05-12 09:45:30 +02:00
Aurelien DARRAGON	d4dba38ab1	BUG/MINOR: errors: handle malloc failure in usermsgs_put() usermsgs_buf.size is set without first checking if previous malloc attempt succeeded. This could fool the buffer API into assuming that the buffer is initialized, resulting in unsafe read/writes. Guarding usermsgs_buf.size assignment with the malloc attempt result to make the buffer initialization safe against malloc failures. This partially fixes GH #2130. It should be backported up to 2.6.	2023-05-12 09:45:30 +02:00
Aurelien DARRAGON	ceb13b5ed3	MINOR: ncbuf: missing malloc checks in standalone code Some malloc resulsts were not checked in standalone ncbuf code. As this is debug/test code, we don't need to explicitly handle memory errors, we just add some BUG_ON() to ensure that memory is properly allocated and prevent unexpected results. This partially fixes issue GH #2130. No backport needed.	2023-05-12 09:45:30 +02:00
Willy Tarreau	94df1b57ee	BUILD: debug: fix build issue on 32-bit platforms in "debug dev task" Commit `986798718` ("DEBUG: cli: add "debug dev task" to show/wake/expire/kill tasks and tasklets") caused a build failure on 32-bit platforms when parsing the task's pointer. Let's use strtoul() and not strtoll(). No backport is needed, unless the commit above gets backported.	2023-05-12 04:40:06 +02:00
William Lallemand	e279f595ad	MINOR: httpclient: allow to disable the DNS resolvers of the httpclient httpclient.resolvers.disabled allow to disable completely the resolvers of the httpclient, prevents the creation of the "default" resolvers section, and does not insert the http do-resolve rule in the proxies.	2023-05-11 21:25:37 +02:00
Willy Tarreau	fe0ba0e9f9	MINOR: cli: make "show fd" identify QUIC connections and listeners Now we can detect the listener associated with a QUIC listener and report a bit more info (e.g. listening port and frontend name), and provide a bit more info about connections as well, and filter on both front connections and listeners using the "l" and "f" flags.	2023-05-11 17:20:39 +02:00
Willy Tarreau	ea07715ccf	MINOR: master/cli: also implement the timed prompt on the master CLI This provides more consistency between the master and the worker. When "prompt timed" is passed on the master, the timed mode is toggled. When enabled, for a master it will show the master process' uptime, and for a worker it will show this worker's uptime. Example: master> prompt timed [0:00:00:50] master> show proc #<PID> <type> <reloads> <uptime> <version> 11940 master 1 [failed: 0] 0d00h02m10s 2.8-dev11-474c14-21 # workers 11955 worker 0 0d00h00m59s 2.8-dev11-474c14-21 # old workers 11942 worker 1 0d00h02m10s 2.8-dev11-474c14-21 # programs [0:00:00:58] master> @!11955 [0:00:01:03] 11955> @!11942 [0:00:02:17] 11942> @ [0:00:01:10] master>	2023-05-11 16:38:52 +02:00
Willy Tarreau	225555711f	MINOR: cli: add an option to display the uptime in the CLI's prompt Entering "prompt timed" toggles reporting of the process' uptime in the prompt, which will report days, hours, minutes and seconds since it was started. As discussed with Tim in issue #2145, this can be convenient to roughly estimate the time between two outputs, as well as detecting that a process failed to be reloaded for example.	2023-05-11 16:38:52 +02:00
Willy Tarreau	21d7125c92	BUG/MINOR: cli: don't complain about empty command on empty lines There's something very irritating on the CLI, when just pressing ENTER, it complains "Unknown command: ''..." and dumps all the help. This action is often done to add a bit of clearance after a dump to visually find delimitors later, but this stupid error makes it unusable. This patch addresses this by just returning on empty command instead of trying to look up a matching keyword. It will result in an empty line to mark the end of the empty command and a prompt again. It's probably not worth backporting this given that nobody seems to have complained about it yet.	2023-05-11 16:38:52 +02:00
Aurelien DARRAGON	31b23aef38	CLEANUP: acl: discard prune_acl_cond() function Thanks to previous commit, we have no more use for prune_acl_cond(), let's remove it to prevent code duplication.	2023-05-11 15:37:04 +02:00
Aurelien DARRAGON	c610095258	MINOR: tree-wide: use free_acl_cond() where relevant Now that we have free_acl_cond(cond) function that does cond prune then frees cond, replace all occurences of this pattern: \| prune_acl_cond(cond) \| free(cond) with: \| free_acl_cond(cond)	2023-05-11 15:37:04 +02:00
Aurelien DARRAGON	cd9aff1321	CLEANUP: http_act: use http_free_redirect_rule() to clean redirect act Since redirect rules now have a dedicated cleanup function, better use it to prevent code duplication.	2023-05-11 15:37:04 +02:00
Aurelien DARRAGON	5313570605	BUG/MINOR: http_rules: fix errors paths in http_parse_redirect_rule() http_parse_redirect_rule() doesn't perform enough checks around NULL returning allocating functions. Moreover, existing error paths don't perform cleanups. This could lead to memory leaks. Adding a few checks and a cleanup path to ensure memory errors are properly handled and that no memory leaks occurs within the function (already allocated structures are freed on error path). It should partially fix GH #2130. This patch depends on ("MINOR: proxy: add http_free_redirect_rule() function") This could be backported up to 2.4. The patch is also relevant for 2.2 but "MINOR: proxy: add http_free_redirect_rule() function" would need to be adapted first. == Backport notes: -> For 2.2 only: Replace: (strcmp(args[cur_arg], "drop-query") == 0) with: (!strcmp(args[cur_arg],"drop-query")) -> For 2.2 and 2.4: Replace: "expects 'code', 'prefix', 'location', 'scheme', 'set-cookie', 'clear-cookie', 'drop-query', 'ignore-empty' or 'append-slash' (was '%s')", with: "expects 'code', 'prefix', 'location', 'scheme', 'set-cookie', 'clear-cookie', 'drop-query' or 'append-slash' (was '%s')",	2023-05-11 15:37:04 +02:00
Aurelien DARRAGON	7abc9224a6	MINOR: proxy: add http_free_redirect_rule() function Adding http_free_redirect_rule() function to free a single redirect rule since it may be required to free rules outside of free_proxy() function. This patch is required for an upcoming bugfix. [for 2.2, free_proxy function did not exist (first seen in 2.4), thus http_free_redirect_rule() needs to be deducted from haproxy.c deinit() function if the patch is required]	2023-05-11 15:37:04 +02:00
Aurelien DARRAGON	8dfc2491d2	BUG/MINOR: proxy: missing free in free_proxy for redirect rules cookie_str from struct redirect, which may be allocated through http_parse_redirect_rule() function is not properly freed on proxy cleanup within free_proxy(). This could be backported to all stable versions. [for 2.2, free_proxy() did not exist so the fix needs to be performed directly in deinit() function from haproxy.c]	2023-05-11 15:37:04 +02:00
Christopher Faulet	7542fb43d6	MINOR: stconn: Add a cross-reference between SE descriptor A xref is added between the endpoint descriptors. It is created when the server endpoint is attached to the SC and it is destroyed when an endpoint is detached. This xref is not used for now. But it will be useful to retrieve info about an endpoint for the opposite side. It is also the warranty there is still a endpoint attached on the other side.	2023-05-11 15:37:04 +02:00
Christopher Faulet	efebff35bb	BUG/MEDIUM: mux-fcgi: Don't request more room if mux is waiting for more data A mux must never report it is waiting for room in the channel buffer if this buffer is empty. Because there is nothing the application layer can do to unblock the situation. Indeed, when this happens, it means the mux is waiting for data to progress. It typically happens when all headers are not received. In the FCGI mux, if some data remain in the RX buffer but the channel buffer is empty, it does no longer report it is waiting for room. This patch should fix the issue #2150. It must be backported as far as 2.6.	2023-05-11 15:37:04 +02:00
Christopher Faulet	a272c39330	BUG/MEDIUM: mux-fcgi: Never set SE_FL_EOS without SE_FL_EOI or SE_FL_ERROR When end-of-stream is reported by a FCGI stream, we must take care to also report an error if end-of-input was not reported. Indeed, it is now mandatory to set SE_FL_EOI or SE_FL_ERROR flags when SE_FL_EOS is set. It is a 2.8-specific issue. No backport needed.	2023-05-11 15:37:04 +02:00
Willy Tarreau	4cfb0019e6	MINOR: stats: report the listener's protocol along with the address in stats When "optioon socket-stats" is used in a frontend, its listeners have their own stats and will appear in the stats page. And when the stats page has "stats show-legends", then a tooltip appears on each such socket with ip:port and ID. The problem is that since QUIC arrived, it was not possible to distinguish the TCP listeners from the QUIC ones because no protocol indication was mentioned. Now we add a "proto" legend there with the protocol name, so we can see "tcp4" or "quic6" and figure how the socket is bound.	2023-05-11 14:52:56 +02:00
Amaury Denoyelle	5f67b17a59	MEDIUM: mux-quic: adjust transport layer error handling Following previous patch, error notification from quic_conn has been adjusted to rely on standard connection flags. Most notably, CO_FL_ERROR on the connection instance when a fatal error is detected. Check for CO_FL_ERROR is implemented by qc_send(). If set the new flag QC_CF_ERR_CONN will be set for the MUX instance. This flag is similar to the local error flag and will abort most of the futur processing. To ensure stream upper layer is also notified, qc_wake_some_streams() called by qc_process() will put the stream on error if this new flag is set. This should be backported up to 2.7.	2023-05-11 14:12:48 +02:00
Amaury Denoyelle	b2e31d33f5	MEDIUM: quic: streamline error notification When an error is detected at quic-conn layer, the upper MUX must be notified. Previously, this was done relying on quic_conn flag QUIC_FL_CONN_NOTIFY_CLOSE set and the MUX wake callback called on connection closure. Adjust this mechanism to use an approach more similar to other transport layers in haproxy. On error, connection flags are updated with CO_FL_ERROR, CO_FL_SOCK_RD_SH and CO_FL_SOCK_WR_SH. The MUX is then notified when the error happened instead of just before the closing. To reflect this change, qc_notify_close() has been renamed qc_notify_err(). This function must now be explicitely called every time a new error condition arises on the quic_conn layer. To ensure MUX send is disabled on error, qc_send_mux() now checks CO_FL_SOCK_WR_SH. If set, the function returns an error. This should prevent the MUX from sending data on closing or draining state. To complete this patch, MUX layer must now check for CO_FL_ERROR explicitely. This will be the subject of the following commit. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	2ad41b8629	MINOR: mux-quic: simplify return path of qc_send() Remove the unnecessary err label for qc_send(). Anyway, this label cannot be used once some frames are sent because there is no cleanup part for it. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	b35e32e43b	MINOR: mux-quic: factorize send subscribing Factorize code for send subscribing on the lower layer in a dedicated function qcc_subscribe_send(). This allows to call the lower layer only if not already subscribed and print a trace in this case. This should help to understand when subscribing is really performed. In the future, this function may be extended to avoid subscribing under new conditions, such as connection already on error. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	04b2208aa0	MINOR: mux-quic: do not send STREAM frames if already subscribe Do not built STREAM frames if MUX is already subscribed for sending on lower layer. Indeed, this means that either socket currently encountered a transient error or congestion window is full. This change is an optimization which prevents to allocate and release a series of STREAM frames for nothing under congestion. Note that nothing is done for other frames (flow-control, RESET_STREAM and STOP_SENDING). Indeed, these frames are not restricted by flow control. However, this means that they will be allocated for nothing if send is blocked on a transient error. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	2d5c3f5cd1	MINOR: mux-quic: add traces for stream wake Add traces for when an upper layer stream is woken up by the MUX. This should help to diagnose frozen stream issues. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	69670e88bd	BUG/MINOR: mux-quic: no need to subscribe for detach streams When detach is conducted by stream endpoint layer, a stream is either freed or just flagged as detached if the transfer is not yet finished. In the latter case, the stream will be finally freed via qc_purge_streams() which is called periodically. A subscribe was done on quic-conn layer if a stream cannot be freed via qc_purge_streams() as this means FIN STREAM has not yet been sent. However, this is unnecessary as either HTX EOM was not yet received and we are waiting for the upper layer, or FIN stream is still in the buffer but was not yet transmitted due to an incomplete transfer, in which case a subscribe should have already been done. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	131f2d93e1	BUG/MINOR: mux-quic: do not free frame already released by quic-conn MUX uses qc_send_mux() function to send frames list over a QUIC connection. On network congestion, the lower layer will reject some frames and it is the MUX responsibility to free them. There is another category of error which are when the sendto() fails. In this case, the lower layer will free the packet and its attached frames and the MUX should not touch them. This model was violated by MUX layer for RESET_STREAM and STOP_SENDING emission. In this case, frames were freed every time by the MUX on error. This causes a double free error which lead to a crash. Fix this by always ensuring if frames were rejected by the lower layer before freeing them on the MUX. This is done simply by checking if frame list is not empty, as RESET_STREAM and STOP_SENDING are sent individually. This bug was never reproduced in production. Thus, it is labelled as MINOR. This must be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Amaury Denoyelle	3fd40935d9	BUG/MINOR: mux-quic: do not prevent shutw on error Since recent modification of MUX error processing, shutw operation was skipped for a connection reported as on error. However, this can caused the stream layer to not be notified about error. The impact of this bug is unknown but it may lead to stream never closed. To fix this, simply skip over send operations when connection is on error while keep notifying the stream layer. This should be backported up to 2.7.	2023-05-11 14:04:51 +02:00
Willy Tarreau	9615102b01	MINOR: stats: report the number of times the global maxconn was reached As discussed a few times over the years, it's quite difficult to know how often we stop accepting connections because the global maxconn was reached. This is not easy to know because when we reach the limit we stop accepting but we don't know if incoming connections are pending, so it's not possible to know how many were delayed just because of this. However, an interesting equivalent metric consist in counting the number of times an accepted incoming connection resulted in the limit being reached. I.e. "we've accepted the last one for now". That doesn't imply any other one got delayed but it's a factual indicator that something might have been delayed. And by counting the number of such events, it becomes easier to know whether some limits need to be adjusted because they're reached often, or if it's exceptionally rare. The metric is reported as a counter in show info and on the stats page in the info section right next to "maxconn".	2023-05-11 13:51:31 +02:00
Willy Tarreau	3c4a297d2b	MINOR: stats: report the total number of warnings issued Now in "show info" we have a TotalWarnings field that reports the total number of warnings issued since the process started. It's also reported in the the stats page next to the uptime.	2023-05-11 12:02:21 +02:00
Frédéric Lécaille	0dd4fa58e6	BUG/MINOR: quic: Buggy acknowlegments of acknowlegments function qc_treat_ack_of_ack() must remove ranges of acknowlegments from an ebtree which have been acknowledged. This is done keeping track of the largest acknowledged packet number which has been acknowledged and sent with an ack-eliciting packet. But due to the data structure of the acknowledgement ranges used to build an ACK frame, one must leave at least one range in such an ebtree which must at least contain a unique one-element range with the largest acknowledged packet number as element. This issue was revealed by @Tristan971 in GH #2140. Must be backported in 2.7 and 2.6.	2023-05-11 10:33:23 +02:00
Aurelien DARRAGON	d7d507aa8a	CLEANUP: hlua_fcn/queue: make queue:push() easier to read Adding some spaces and code comments in queue:push() function to make it easier to read.	2023-05-11 09:23:14 +02:00
Aurelien DARRAGON	c0af7cdba2	BUG/MINOR: hlua_fcn/queue: fix reference leak When pushing a lua object through lua Queue class, a new reference is created from the object so that it can be safely restored when needed. Likewise, when popping an object from lua Queue class, the object is restored at the top of the stack via its reference id. However, once the object is restored the related queue entry is removed, thus the object reference must be dropped to prevent reference leak.	2023-05-11 09:23:14 +02:00

1 2 3 4 5 ...

16085 Commits