Move quic_cid and quic_connection_id from quic_conn-t.h to a new
quic_cid-t.h header. Move the definitions of quic_stateless_reset_token_init(),
quic_derive_cid(), new_quic_cid(), quic_get_cid_tid() and
retrieve_qc_conn_from_cid() to a new C file, quic_cid.c.
When an H2 stream is blocked during data fast-forwarding, we must take care
to remove the H2_SF_NOTIFIED flag. This was only performed when data
fast-forwarding was actually attempted; if the H2 stream was blocked for
another reason, the flag was not removed. During our tests, we found it was
possible to block a connection indefinitely because one of its streams
remained in the send_list with the flag set. In this case, the stream was no
longer woken up to resume the sends, blocking all other streams.
No backport needed.
CONNECTION_CLOSE_APP encoding is broken, which prevents any packet carrying
such a frame from being sent. This bug has always been present in haproxy's
QUIC implementation. However, it was somewhat hidden by the previous code,
which always initialized all frame members to zero; this was sufficient to
keep CONNECTION_CLOSE_APP encoding correct. The commit below changed this
behavior by removing that costly initialization step.
4cf784f38e
MINOR: quic: Avoid zeroing frame structures
Now, frame members must always be initialized individually according to the
type of frame in use. However, for CONNECTION_CLOSE_APP this was not done, as
qc_cc_build_frm() accessed the wrong union member, referring to a
CONNECTION_CLOSE instead.
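As an illustration of the kind of fix involved (member names follow the
description above and may not match the exact haproxy definitions):

    struct quic_frame frm;

    frm.type = QUIC_FT_CONNECTION_CLOSE_APP;
    /* buggy: fills the CONNECTION_CLOSE member, leaving the APP one
     * (and thus <reason_phrase_len>) uninitialized */
    frm.connection_close.reason_phrase_len = 0;
    /* fixed: initialize the member matching the frame type */
    frm.connection_close_app.reason_phrase_len = 0;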
This bug was detected when trying to generate an HTTP/3 error. The
CONNECTION_CLOSE_APP frame encoding failed due to an uninitialized
<reason_phrase_len> which was too big. This was reported by the
following trace:
"frame building error : qc@0x5555561b86c0 idle_timer_task@0x5555561e5050 flags=0x86038058 CONNECTION_CLOSE_APP"
This must be backported up to 2.6. This is necessary even if the above
commit is not, as the previous code was also buggy, albeit with a different
behavior.
It's the exact same as commit 0a7ab7067 ("OPTIM: mux-h2: don't allocate
more buffers per connections than streams"), but for the zero-copy case
this time. Previously it was only done on the regular snd_buf() path, but
this one is needed as well. A transfer on 16 parallel streams now consumes
half of the memory, and a single stream consumes much less.
An alternate approach would be worth investigating in the future, based
on the same principle as the CF_STREAMER_FAST at the higher level: in
short, by monitoring how many mux buffers we write at once before refilling
them, we would get an idea of the maximum amount worth keeping in buffers,
given that anything beyond would just waste memory. Some tests show that
a single buffer already seems almost as good, except for single-stream
transfers, which is why it's worth spending more time on this.
Add an optional argument for "-dt". This argument is interpreted as a
list of trace statements separated by commas. For each statement, a
specific trace name can be specified, or none to act on all sources.
Using a double-colon separator, it is possible to add specifications on
the wanted level and verbosity.
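For example, a possible invocation following this syntax (illustrative
only) could be:

    $ haproxy -f haproxy.cfg -dt h2:developer,quic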
Extract the conversion of the level string argument to its integer value
into a dedicated internal function, trace_parse_level(). This function is
used for CLI trace parsing and will also be useful for the "-dt" process
argument.
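As a rough sketch, such a helper could look like this (illustrative, not
the exact haproxy implementation; it assumes the TRACE_LEVEL_* constants
from trace-t.h):

    /* returns the TRACE_LEVEL_* value matching <str>, or -1 if unknown */
    static int trace_parse_level(const char *str)
    {
        if (strcmp(str, "error") == 0)     return TRACE_LEVEL_ERROR;
        if (strcmp(str, "user") == 0)      return TRACE_LEVEL_USER;
        if (strcmp(str, "proto") == 0)     return TRACE_LEVEL_PROTO;
        if (strcmp(str, "state") == 0)     return TRACE_LEVEL_STATE;
        if (strcmp(str, "data") == 0)      return TRACE_LEVEL_DATA;
        if (strcmp(str, "developer") == 0) return TRACE_LEVEL_DEVELOPER;
        return -1;
    }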
Add the '-dt' haproxy process argument. It automatically activates all
trace sources on stderr at the error level. This can be useful to
troubleshoot issues such as protocol violations.
The <pattern> field pointer of the pat_ref_elt structure has been replaced
by a zero-length array. As such, there is no longer any need to check for a
NULL address before printing it.
This type conversion was done in the following commit:
3ac9912837
OPTIM: pattern: save memory and time using ebst instead of ebis
The current patch is mandatory to fix the following GCC warning:
CC src/map.o
src/map.c: In function ‘cli_io_handler_map_lookup’:
src/map.c:549:54: error: the comparison will always evaluate as ‘true’ for the address of ‘pattern’ will never be NULL [-Werror=address]
549 | if (pat->ref && pat->ref->pattern)
|
No need to backport it unless the above commit is.
In the pat_ref_elt struct, the pattern string is stored outside of the
node element, using a pointer to an strdup()'d copy. Not only does this
needlessly waste at least 16-24 bytes per entry (8 for the pointer, 8-16
for the allocator), it also makes the tree descent less efficient since both
the node and the string have to be visited for each layer (hence at least
two cache lines). Let's use ebmb storage and place the pattern right
at the end of the pat_ref_elt, making it a variable-sized element instead.
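A rough sketch of the resulting layout (simplified, the real struct has
more members):

    struct pat_ref_elt {
        /* ... other members (list, generation id, line number, ...) ... */
        struct ebmb_node node;   /* indexes the element by its pattern */
        char pattern[0];         /* the pattern stored inline, variable size */
    };

    /* a single allocation now covers both the element and its string */
    elt = malloc(sizeof(*elt) + strlen(str) + 1);
    memcpy(elt->pattern, str, strlen(str) + 1);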
The set-map test below jumps from 173 to 182 kreq/s/core, and the memory
usage drops from 356 MB to 324 MB:
http-request set-map(/dev/null) %[rand(1000000)] 1
This is even more visible with large maps: after loading 16M IP addresses
into a map, the process uses this amount of memory:
- 3.15 GB with haproxy-2.8
- 4.21 GB with haproxy-2.9-dev11
- 3.68 GB with this patch
So that's a net saving of 32 bytes per entry here, which cuts in half the
extra cost of the tree, and loading a large map takes about 20% less time.
It is not possible in H1, but in H2 (and probably H3) it is possible to have
trailers at the end of a message while a Content-Length was announced.
However, depending on whether the trailers are received with the last HTX
DATA block and whether zero-copy forwarding is used, a processing error may
be triggered, leading to a 500 internal error.
To fix the issue, when a content-length is announced and all the payload has
been processed, we switch the message to the H1_MSG_DONE state only if the
end of message was also reported (HTX_FL_EOM flag set). Otherwise, it is
switched to the H1_MSG_TRAILERS state so that the trailers, if any, can be
properly skipped.
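In short, the decision looks like this (illustrative, not the exact
haproxy code):

    /* all the announced payload has been processed */
    if (htx->flags & HTX_FL_EOM)
        h1m->state = H1_MSG_DONE;
    else
        h1m->state = H1_MSG_TRAILERS;  /* trailers may follow; skip them */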
The patch must be backported as far as 2.4. Be careful, this part was highly
refactored. The patch will have to be adapted to be backported.
The mworker mode never had a proper 'hard-stop' (-st) for the reload; this
was commonly used with the daemon mode, but it was never implemented in
mworker mode.
This patch fixes the problem by implementing a "hard-reload" command
over the master CLI. It does the same as the "reload" command, but
instead of waiting for the connections to stop in the previous process,
it immediately quits the previous process after binding.
This patch removes the code which selects the SSL certificate in the
OpenSSL Client Hello callback and uses the ssl_sock_chose_sni_ctx()
function, which does the same thing, instead.
The biggest part of the function which remains is the extraction of the
servername, ciphers and sigalgs, since this is done manually by parsing
the TLS extensions.
This is not supposed to change anything functionally.
The certificate selection used in the WolfSSL cert_cb and in the OpenSSL
clienthello callback is the same; the code was duplicated to achieve the
same result.
This patch moves the selection code to a common function called
ssl_sock_chose_sni_ctx().
The servername string is still lowercased in the callback; however, the
search for the first dot in the string (wildp) is done in
ssl_sock_chose_sni_ctx().
The function uses the same certificate selection algorithm as before; it
needs to know whether RSA or ECDSA is required, the bind_conf used for the
lookup, and the servername string.
This patch moves the code for WolfSSL only.
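A prototype consistent with this description could look like the following
(illustrative, the exact haproxy signature may differ):

    /* look up the sni_ctx matching <servername> in <s>, depending on
     * whether the client supports RSA and/or ECDSA signatures */
    struct sni_ctx *ssl_sock_chose_sni_ctx(struct bind_conf *s,
                                           const char *servername,
                                           int have_rsa_sig, int have_ecdsa_sig);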
PR https://github.com/wolfSSL/wolfssl/pull/6963 implements primitives to
extract ciphers and signature algorithms.
It allows choosing a certificate (RSA or ECDSA) depending on the sigalgs
and ciphers presented by the client.
Since WolfSSL does not implement the clienthello callback, the patch
uses the certificate callback (SSL_CTX_set_cert_cb()).
The callback is inspired by our clienthello callback; however, the
extraction of the client ciphers and sigalgs is simpler:
wolfSSL_get_sigalg_info() and wolfSSL_get_ciphersuite_info() are used.
This is not enabled by default yet, as the PR has not been merged.
Maintain a proper px->lbprm.tot_weight for log backends. A server's weight
is considered to be 1 as long as the server is usable.
This will allow the stats page to correctly display the proxy status, since
the check currently relies on the proxy's lbprm.tot_weight variable.
server_rules declared using the "use-server" keyword within a proxy are not
supported inside a log backend (with "mode log" set), so we report a
warning to the user and reset the setting.
Take the px->server_rules freeing part out of free_proxy() and make it
a dedicated helper function so that it becomes possible to use it from
anywhere.
There are multiple places inside free_proxy() where we need to perform
the exact same operation: freeing a logformat list, which includes freeing
every member.
To prevent code duplication, we add the free_logformat_list() function,
which takes such a list as parameter and does all the freeing work on its
own.
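A sketch of such a helper (the logformat_node member names are indicative):

    static void free_logformat_list(struct list *fmt)
    {
        struct logformat_node *lf, *lfb;

        list_for_each_entry_safe(lf, lfb, fmt, list) {
            LIST_DELETE(&lf->list);
            release_sample_expr(lf->expr);
            free(lf->arg);
            free(lf);
        }
    }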
This reverts commit 5884e46ec8c8231e73c68e1bdd345c75c9af97a0 since we
cannot perform the test during parsing as the effective proxy mode is
not yet known.
This is a leftover from 1e0093a317 ("MINOR: backend/balance: "balance"
requires TCP or HTTP mode").
Indeed, we cannot perform the test during parsing as the effective proxy
type is not yet known. Moreover, thanks to b61147fd ("MEDIUM: log/balance:
merge tcp/http algo with log ones") we could potentially benefit from
this setting even in log mode, but for now it is ignored by all
log-compatible load-balancing algorithms.
In this patch we fix the prototypes of the ipcmp() and ipcpy() functions so
that input pointers which are used exclusively for reads are declared as
const pointers. This way, the compiler can safely assume that those
variables won't be altered by the functions.
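The kind of change involved (illustrative prototypes, possibly differing
from the exact haproxy ones):

    /* before */
    int ipcmp(struct sockaddr_storage *ss1, struct sockaddr_storage *ss2);

    /* after: read-only inputs are now const */
    int ipcmp(const struct sockaddr_storage *ss1,
              const struct sockaddr_storage *ss2);
    struct sockaddr_storage *ipcpy(const struct sockaddr_storage *source,
                                   struct sockaddr_storage *dest);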
In this patch we add support for a new SERVER event in the event_hdl API.
SERVER_INETADDR is implemented as an advanced server event.
It is published each time the server's IP address or port is about to
change (e.g. from the CLI, DNS, Lua...).
SERVER_INETADDR data is an event_hdl_cb_data_server_inetaddr struct
that provides additional info related to the server inet addr change,
but it can be cast to a regular event_hdl_cb_data_server struct if the
additional info is not needed.
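For instance, a handler only interested in the generic part could simply do
this (illustrative snippet, <data> being a pointer to the published event
data):

    struct event_hdl_cb_data_server_inetaddr *inetaddr = data;
    /* the same pointer is also valid as the generic server event data */
    struct event_hdl_cb_data_server *generic = data;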
When possible, we try to send DATA frames without copying data. To do so, we
swap the input buffer with the QCS tx buffer. This is only possible if:
* There is only one HTX block of data at the beginning of the message
* The amount of data to send is equal to the size of the HTX data block
* The QCS tx buffer is empty
In this case, both buffers are swapped. The frame metadata is written at
the beginning of the buffer, before the data, in the area where the HTX
structure is stored.
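A sketch of the eligibility test described above (illustrative names, not
the exact haproxy code):

    struct htx_blk *head = htx_get_head_blk(htx);

    if (htx_nbblks(htx) == 1 &&
        htx_get_blk_type(head) == HTX_BLK_DATA &&
        htx_get_blksz(head) == count &&
        !b_data(qcs_tx_buf)) {
        /* swap the input buffer with the QCS tx buffer instead of copying,
         * then write the frame metadata at the head of the buffer */
    }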
After giving it some thought, it could pretty well happen that other
protocols benefit from the sticky algorithm that some used to emulate
using a "stick-on int(0)" or things like this previously. So better to
rename it to "sticky" right now instead of having to keep that "log-"
prefix forever. It's still limited to logs, of course; only the algo
is renamed in the config.
Thanks to the previous commit, a reverse HTTP listener is able to distribute
actively opened connections across its threads. To be able to exploit
this, allow the "thread" keyword for such a listener.
An extra check is added to explicitly forbid a reverse bind from spanning
multiple thread groups. Without this, multiple listener instances would
be created, each with its own "nbconn" value. This may surprise users,
so for now it's better to deactivate this possibility.
Implement support for active reverse HTTP task migration across listener
threads. This operation is done each time a new reversible connection is
about to be instantiated. Instead of directly allocating the connection, a
lookup is done among all the listener threads.
A comparison is done to select the thread with the smallest number of
current reverse connections. If the selected thread is different from the
current one, the connection allocation is delayed and the task is
rescheduled on the chosen thread. The connection will then be created
and pinned on the new thread. This mechanism allows reverse HTTP
connections to be balanced across different threads.
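A simplified sketch of the selection loop (illustrative; the real code
restricts itself to the listener's threads), using the per-thread
<nb_rhttp_conns> counter described below:

    int sel = tid;
    uint min = HA_ATOMIC_LOAD(&th_ctx->nb_rhttp_conns);

    for (int t = 0; t < global.nbthread; t++) {
        uint v = HA_ATOMIC_LOAD(&ha_thread_ctx[t].nb_rhttp_conns);
        if (v < min) {
            min = v;
            sel = t;
        }
    }
    /* if <sel> differs from the current thread, reschedule the task there
     * and create the connection once running on <sel> */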
Note that rhttp_set_affinity is still defined to disable thread
migration on accept. This is necessary as it's unsafe to move an
existing connection to another thread. However, active reverse task
migration should be sufficient to distribute connections across several
threads. Better still, this design makes it possible to differentiate
standard frontend connections from reversible ones. The latter are designed
to be long-lived, so it's useful to have their distribution based solely on
the other reversed connections.
Add a new member <nb_rhttp_conns> to the thread_ctx structure. Its purpose
is to count the current number of opened reverse HTTP connections,
regardless of which listener they belong to.
This patch will be useful to support multi-threading for active reverse
HTTP, in order to select the least loaded thread.
Note that although <nb_rhttp_conns> is only accessed by the current
thread, atomic operations are used. This is because once multi-thread
support is added, threads will also read the values of other threads.
The previous commit renamed the 'proto_reverse_connect' module to
'proto_rhttp'. This commit follows up by replacing various custom prefixes
with 'rhttp_' to make the code uniform.
Note that the 'reverse_' prefix was kept in the connection module. This is
because if a new reversible protocol not based on HTTP is implemented,
it may be necessary to reuse the same connection functions, which are
protocol agnostic.
In the mux-to-mux fast-forwarding, when end-of-input is reached on the producer
side, the consumer side must not set the CO_SFL_MSG_MORE flag on send. It means
the H1C_F_CO_MSG_MORE flag must be removed from the H1 connection.
No backport needed.
In Github issue #2128, @jvincze84 explained the complexity of using
external checks in some advanced setups due to the systematic purge of
environment variables, and expressed the desire to preserve the
existing environment. During the discussion an agreement was found
around having an option to "external-check" to do that and that
solution was tested and confirmed to work by user @nyxi.
This patch just cleans this up, implements the option as
"preserve-env" and documents it. The default behavior does not change,
the environment is still purged, unless "preserve-env" is passed. The
choice of not using "import-env" instead was made so that we could
later use it to name specific variables that have to be imported
instead of keeping the whole environment.
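An illustrative configuration snippet using the new option:

    backend app
        option external-check
        external-check preserve-env
        external-check command /usr/local/bin/check.sh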
The patch is simple enough that it could be backported if needed (and
was in fact tested on 2.6 first).
Here the idea is to collect components' versions and build options. The
main component is haproxy, but the API is made so that any sub-system
can easily add a component there (for example the detailed version of a
device detection lib, or some info about a lib loaded from Lua).
The elements are stored as a pointer to an array of structs and its count
so that it's sufficient to issue this in gdb to list them all at once:
print *post_mortem.components@post_mortem.nb_components
For now we collect name, version, toolchain, toolchain options, build
options and path. Maybe more could be useful in the future.
Having the libs and their addresses listed in the post_mortem struct
is also helpful. Sometimes it helps notice that one version is not the
expected one, e.g. due to some LD_LIBRARY_PATH. We don't emit it on
"show dev" however since that's already available via "show libs".
The last starting thread now copies the pthread ID and stack top of
each thread into post_mortem. That way it's as easy as issuing
"p post_mortem" in gdb to see all thread IDs and stack frames and more
easily map them to the threads met in a core.
Here we collect the original uid/gid/rlimits for FDs and RAM since these
do affect behavior and are sometimes different from what is expected in
containers or when starting as a service.
When the x86 CPU flags show the "hypervisor" flag, we know we're running
inside QEMU, VMware or possibly other flavors of hypervisors. In this
case we'll report either "qemu", "vmware" or "yes" for other ones in
the "virt_techno" field, based on the DMI hardware vendor name,
otherwise "no" when the flag is not found.
The CPU model and type have a significant impact on certain bugs, such
as contention issues caused by CPUs having split L3 caches, or stricter
memory models that exhibit some barrier issues. It's complicated though
because the info about the model depends on the arch. For example, x86
reports an SKU name while ARM rather reports the CPU core types, families
and versions for each CPU core. There, the SoC will sometimes be reported
in the device tree or DMI info instead. But we don't really care, it's
essentially useful to know if the code is running on an armv8.0 such as
A53, a 8.2 such as A55/A76/Neoverse etc. For MIPS the model appears to
generally be there, and in addition the SoC is often present in the
"system type" field before the first CPU, and the type of machine in the
"machine" field, to replace the missing DMI and DT, so they are also
collected. Note that only the first CPU is checked and reported, that's
expected to be vastly sufficient, since we're just trying to spot known
incompatibilities or issues.
If we detect we're running inside a container on Linux, let's check if
it seems to be docker. Docker usually creates a /.dockerenv file, which
is easy to check. It's uncertain whether it's always the case, but on the
few tested instances that was true, and we don't really care anyway: what
matters is to place helpful debugging info for developers. When this file is
detected, we report "docker" instead of "yes" in the container techno.
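A minimal sketch of this check (illustrative, not the exact haproxy code):

    /* report "docker" when the tell-tale file exists, otherwise keep "yes" */
    const char *cont_techno = "yes";

    if (access("/.dockerenv", F_OK) == 0)
        cont_techno = "docker";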
Containers often cause significant trouble depending on how they're
set up, and they're not always trivial for their users to extract info
from. Here we're trying to detect if we're running inside a container
on Linux. There are plenty of approaches and none is perfectly clean
nor reliable, which makes sense since the goal is to remain transparent
enough.
One interesting approach is to rely on the observation that containers
generally do not expose most kernel threads, and that the very first
of them are extremely stable across all kernel versions: pid 2 was
called "keventd" in kernel 2.4, became "kthreadd" in kernel 2.6, and
has since not changed. This is true on all architectures tested, even
with highly stripped down kernels such as those found on 15 year-old
OpenWRT images. And this one doesn't appear inside containers. Thus
here we check if we find such a thread via /proc and whether it's
called keventd or kthreadd, to detect a container, and we set the
"cont_techno" variable to "yes" or "no" depending on what is found.
Let's extract some info about the system (board model, vendor, etc.); this
will indicate some hypervisors, some cloud instances or some
uncommon embedded boards. Typically, vmware, qemu and raspberry-pi
are visible here and can help during the troubleshooting session.