haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-15 03:26:59 +02:00

Author	SHA1	Message	Date
Willy Tarreau	7ddcdff33f	BUG/MEDIUM: debug: close a possible race between thread dump and panic() The rework of the thread dumping mechanism in 2.8 with commit `9a6ecbd590` ("MEDIUM: debug: simplify the thread dump mechanism") opened a small race, which is that a thread in the process of dumping other ones may block the other one from panicing while it's looping at the end of ha_thread_dump_fill(), or any other sequence involving the currently dumped one. This was emphasized in 3.1 with commit `148eb5875f` ("DEBUG: wdt: better detect apparently locked up threads and warn about them") that allowed to emit warnings about long-stuck threads, because in this case, what happens is that sometimes a thread starts to emit a warning (or a set of warnings), and while the warning is being awaited for, a panic finally happens and interrupts either the dumping thread, which never finishes and waits for the target's pointer to become NULL which will never happen since it was supposed to do it itself, or the currently dumped thread which could wait for the dumping thread to become ready while this one has not released the former. In order to address this, first we now make sure never to dump a thread that is already in the process of dumping another one. We're adding a new thread flag to know this situation, that is set in ha_thread_dump_fill() and cleared in ha_thread_dump_done(). And similarly, we don't trigger the watchdog on a thread waiting for another one to finish its dump, as it's likely a case of warning (and maybe even a panic) that makes them wait for each other and we don't want such cases to be reentrant. Finally, we check in the main polling loop that the flag never accidentally leaked (e.g. wrong flag manipulation) as this would be difficult to spot with bad consequences. This should be backported at least to 2.8, and should resolve github issue #2860. Thanks to Chris Staite for the very informative backtrace that exhibited the problem.	2025-02-10 18:34:26 +01:00
Willy Tarreau	37e84676c7	[RELEASE] Released version 3.2-dev5 Released version 3.2-dev5 with the following main changes : - BUG/MINOR: ssl: put ssl_sock_load_ca under SSL_NO_GENERATE_CERTIFICATES - CLEANUP: ssl: rename ssl_sock_load_ca to ssl_sock_gencert_load_ca - CLEANUP: ssl: move ssl_sock_gencert_load_ca declaration in ssl_gencert.h - CLEANUP: tree-wide: define and use acl_match_cond() helper - MINOR: epoll: permit to mask certain specific events - MINOR: proxies: Add a per-thread group field to struct proxy. - MINOR: Add fields to the per-thread group field in struct server. - MINOR: proxies/servers: Calculate queueslength and use it. - MEDIUM: servers/proxies: Switch to using per-tgroup queues. - BUG/MINOR: stream: Properly handle "on-marked-up shutdown-backup-sessions" - MEDIUM: stream: Map task wake up reasons to dedicated stream events - MEDIUM: stream: No longer use TASK_F_UEVT* to shut a stream down - BUILD: tools: fix build on BSD by dropping the ETIME check - MINOR: queues: use __ha_cpu_relax() on failed CAS. - BUILD: queues: Use unsigned int when needed - BUILD: ssl: allow to build without the renegotiation API of WolfSSL - BUILD: ssl: more cleaner approach to WolfSSL without renegotiation - BUG/MEDIUM: chunk: make sure to flush the trash pool before resizing - MINOR: quic: remove references to burst in quic-cc-algo parsing - MINOR: quic: allow BBR testing without pacing - MINOR: quic: transform pacing settings into a global option - MAJOR: quic: mark pacing as stable and enable it by default - MINOR: quic: mark BBR as stable - MINOR: quic: define quic_tune - BUILD: quic: fix overflow in global tune - DEBUG: fd: add a counter of takeovers of an FD since it was last opened - MINOR: fd: add a generation number to file descriptors - DEBUG: epoll: store and compare the FD's generation count with reported event - MEDIUM: epoll: skip reports of stale file descriptors - MINOR: mux-h1: Add masks to group H1S DEMUX and MUX errors - BUG/MINOR: mux-h1: Only report a SE error on demux error - MINOR: tevt: Add the termination events log's fundations - MINOR: tevt/stconn: Add a termination events log in the SE descriptor - MINOR: tevt/mux-h1: Report termination events for the H1C and H1S - MINOR: tevt/mux-h2: Report termination events for the H2C - MINOR: tevt/stream/stconn: Report termination events for stream and sc - MINOR: tevt/conn: Report intercepted event for L4 rules - MINOR: tevt/mux-h1/mux-h2: Add termination events log when dumping mux info - MINOR: tevt/muxes: Add CTL and SCTL command to get the termination event logs - MINOR: tevt/mux-pt: Add support for termination event logs - MINOR: tevt/connection: Add dedicated termination events for lower locations - MEDIUM: tevt/muxes: Add dedicated termination events for muxc/se locations - MINOR: tevt/stconn: Be more accurate to report shutw events - MEDIUM: tevt/stconn/stream: Add dedicated termination events for stream location - MINOR: tevt: Don't duplicate termination event during reporting - MINOR: tevt/applet: Add limited support for termination event logs for applets - MINOR: tevt: Add a sample to get termination events for all locations - MINOR: tevt: Improve function to convert a termination events log to string - REORG: tevt/connection: Move enums at the end of the header file - MINOR: tevt/dev: Add term_events tool - MINOR: tevt/connection: Add support for POLL_HUP/POLL_ERR events - MINOR: tevt/dev: Parse tuple of termination events - BUG/MEDIUM: htx: wrong count computation in htx_xfer_blks() - DOC: htx: clarify <mark> parameter for htx_xfer_blks() - BUILD: quic: remove GCC undefined error in qc_release_lost_pkts() - MEDIUM: htx: prevent <mark> to copy incomplete headers in htx_xfer_blks() - BUG/MEDIUM: mux-fcgi: Properly handle read0 on partial records - BUG/MINOR: tevt/http-ana: Remove badly placed event reports - DEBUG: http-ana: Remove debug counters from HTTP analyzers - DEBUG: mux-h1: Remove some debug counters - BUG/MINOR: tcp-rules: Don't forward close during tcp-response content rules eval - MEDIUM: stream: interrupt costly rulesets after too many evaluations - BUG/MINOR: http-check: Don't pretend a C-L heeader is set before adding it - BUILD: ssl: remove a boringssl definition defined by recent boringssl libs - BUG/MINOR: tevt/mux-h2: Set truncated receive/eos events at SE level on error - BUG/MEDIUM: flt-spoe: Set/test applet flags instead of SE flags from I/O handler - BUG/MEDIUM: applet: Don't pretend to have more data to handle EOI/EOS/ERROR - BUG/MEDIUM: flt-spoe: Properly handle end of stream from the SPOE applet - MINOR: flt-spoe: Report end of input immediately after applet init - MINOR: mux-spop: Report EOI on the SE when a ACK is received for a stream - MINOR: mux-spop: Set SPOP_CF_ERROR flag on connection error only - MINOR: tevt/mux-spop: Report termination events for the SPOP connect/stream - CLEANUP: mux-spop: Remove useless comments - MINOR: mux-spop: Dump info about connections and streams in dedicated functions - MINOR: mux-spop: Implement .show_sd callback function - MEDIUM: mux-fcgi: Add a function to propagate termination flags from fstrm to SE - BUG/MEDIUM: mux-fcgi: Propagate flags to SE in fcgi_strm_wake_one_stream - MINOR: tevt/mux-fcgi: Report termination events for the FCGI connect/stream - MINOR: mux-fcgi: Dump info about connections and streams in dedicated functions - MINOR: mux-spop/mux-fcgi: Add support of the debug string for logs - BUG/MINOR: cli: Don't set SE flags from the cli applet - BUG/MINOR: cli: Fix memory leak on error for _getsocks command - BUG/MINOR: cli: Fix a possible infinite loop in _getsocks() - BUG/MINOR: config/userlist: Support one 'users' option for 'group' directive - BUG/MINOR: auth: Fix a leak on error path when parsing user's groups - BUG/MINOR: flt-trace: Support only one name option - MINOR: filters: Improve errors formating during filters parsing - BUG/MINOR: stats-json: Define JSON_INT_MAX as a signed integer - DOC: option redispatch should mention persist options - BUG/MINOR: debug: make "debug dev sched" accept a negative TID - BUG/MINOR: debug: make sure the "debug dev sched" tasks don't block stopping - IMPORT: plock: export the uninlined version of the lock wait function - IMPORT: plock: give higher precedence to W than S - IMPORT: plock: lower the slope of the exponential back-off - IMPORT: plock: use cpu_relax() for a shorter time in EBO - Revert "IMPORT: plock: export the uninlined version of the lock wait function" - BUG/MEDIUM: ssl: chosing correct certificate using RSA-PSS with TLSv1.3	2025-02-08 05:53:40 +01:00
William Lallemand	3912780b1e	BUG/MEDIUM: ssl: chosing correct certificate using RSA-PSS with TLSv1.3 The clienthello callback was written when TLSv1.3 was not yet out, and signatures algorithm changed since then. With TLSv1.2, the least significant byte was used to determine the SignatureAlgorithm, which could be rsa(1), dsa(2), ecdsa(3). https://datatracker.ietf.org/doc/html/rfc5246#section-7.4.1.4.1 This was used to chose which type of certificate to push to the client. But TLSv1.3 changed that, and introduced new RSA-PSS algorithms that does not have the least sinificant byte to 1. https://datatracker.ietf.org/doc/html/rfc8446#section-4.2.3 This would result in chosing the wrong certificate when an RSA an ECDSA ones are in the configuration for the same SNI or default entry. This patch fixes the issue by parsing bothe hash and signature field to check the RSA-PSS signature scheme. This must fix issue #2852. This must be backported in every stable versions. The code was moved from ssl_sock.c to ssl_clienthello in recent versions.	2025-02-07 20:56:42 +01:00
Willy Tarreau	ae540e3d9c	Revert "IMPORT: plock: export the uninlined version of the lock wait function" This reverts commit `5496d06b2b`. It breaks the build on Windows which apparently doesn't support the weak attribute well on functions. It's not big deal anyway, playing with build options while debugging still works though it's less easy to use.	2025-02-07 19:51:15 +01:00
Willy Tarreau	b957e2f3ef	IMPORT: plock: use cpu_relax() for a shorter time in EBO Tests have shown that on modern CPUs it's interesting to wait a bit less in cpu_relax(). Till now we were looping down to 60 iterations and then switching to just barriers. Increasing the threshold to 90 iterations left before getting out of the loop improved the average and max time to grab a write lock by a few percent (e.g. 10% at 1us, 20% at 256ns or lower). Higher values tend to progressively lose that gain so let's stick to this one. This was measured on an EPYC 74F3 like previous measurements that initially led to this value, and the value might possibly depend on the mask applied to the loop counter. This is plock commit 74ca0a7307fa6aec3139f27d3b7e534e1bdb748e.	2025-02-07 18:04:29 +01:00
Willy Tarreau	253fba01a7	IMPORT: plock: lower the slope of the exponential back-off Along many tests involving both haproxy's scheduler and forwarded traffic, various exponents and algorithms were attempted for the EBO and their effects were measured. It was found that a growth in 1.25^N limited to 128k cycles consistently gives a better latency than 1.5^N limited to 256k cycles, without degrading general performance. The measures of the time to grab a write lock on a 48-thread EPYC show that the number of occurrences of low times was roughly multiplied by 2-3 while the number of occurrences of times above 64us was reduced by similar factors, to even reach 300 at 64us and limiting the maximum time by a factor of 4. The other variants that were experimented with are: m = ((m + (m >> 1)) + 2) & 0x3ffff; // original m = ((m + (m >> 1) + (m >> 3)) + 2) & 0x3ffff; m = ((m + (m >> 1) + (m >> 4)) + 2) & 0x3ffff; m = ((m + (m >> 1) + (m >> 4)) + 2) & 0x1ffff; m = ((m + (m >> 1) + (m >> 4)) + 1) & 0x1ffff; m = ((m + (m >> 2) + (m >> 4)) + 1) & 0x1ffff; // lowest CPU on pl_wr test + good perf m = ((m + (m >> 2)) + 1) & 0x1ffff; // even lower cpu usage, lowest max m = ((m + (m >> 1) + (m >> 2)) + 1) & 0x1ffff; // correct but slightly higher maxes m = ((m + (m >> 1) + (m >> 3)) + 1) & 0x1ffff; // less good than m+m>>2 m = ((m + (m >> 2) + (m >> 3)) + 1) & 0x1ffff; // better but not as good as m+m>>2 m = ((m + (m >> 3) + (m >> 4)) + 1) & 0x1ffff; // less good, lower rates on small coounts. m = ((m + (m >> 2) + (m >> 3) + (m >> 4)) + 1) & 0x1ffff; // less good as well m = ((m & 0x7fff) + (m >> 1) + (m >> 4)) + 2; m = ((m & 0xffff) + (m >> 1) + (m >> 4)) + 2; This is plock commit dddd9ee01c522da33c353e2e4d4fd743d8336ec3.	2025-02-07 18:04:29 +01:00
Willy Tarreau	9dd56da730	IMPORT: plock: give higher precedence to W than S It was noticed in haproxy that in certain extreme cases, a write lock subject to EBO may fail for a very long time in front of a large set of readers constantly trying to upgrade to the S state. The reason is that among many readers, one will succeed in its upgrade, and this situation can last for a very long time with many readers upgrading in turn, while the writer waits longer and longer before trying again. Here we're taking a reasonable approach which is that the write lock should have a higher precedence in its attempt to grab the lock. What is done is that instead of fully rolling back in case of conflict with a pure S lock, the writer will only release its read part in order to let the S upgrade to W if needed, and finish its operations. This guarantees no other seek/read/write can enter. Once the conflict is resolved, the writer grabs the read part again and waits for readers to be gone (in practice it could even return without waiting since we know that any possible wanderers would leave or even not be there at all, but it avoids a complicated loop code that wouldn't improve the practical situation but inflate the code). Thanks to this change, the maximum write lock latency on a 48 threads AMD with aheavily loaded scheduler went down from 256 to 64 ms, and the number of occurrences of 32ms or more was divided by 300, while all occurrences of 1ms or less were multiplied by up to 3 (3 for the 4-16ns cases). This is plock commit b6a28366d156812f59c91346edc2eab6374a5ebd.	2025-02-07 18:04:29 +01:00
Willy Tarreau	5496d06b2b	IMPORT: plock: export the uninlined version of the lock wait function The inlining of the lock waiting function was made more easily configurable with commit 7505c2e ("plock: always expose the inline version of the lock wait function"). However, the standard one remained static, but in order to resolve the symbols in "perf top", it's much better to export it, so let's move "static" with "inline" and leave it exported when PLOCK_INLINE_EBO is not set. This is plock commit 3bea7812ec705b9339bbb0ed482a2cd8aa6c185c.	2025-02-07 18:04:29 +01:00
Willy Tarreau	8d63dc50ab	BUG/MINOR: debug: make sure the "debug dev sched" tasks don't block stopping When "debug dev sched" is used to pop up background tasks, these tasks are never stopped, so we must be careful to stop them when the stopping flag is set, otherwise they can prevent the process from stopping when sufficiently numerous (tests went as far as 100 million tasks, leading the run queue never being completely purged in one poll round). No backport is needed since this is only used when debugging and tuning the scheduler.	2025-02-07 18:04:29 +01:00
Willy Tarreau	6765a32eb4	BUG/MINOR: debug: make "debug dev sched" accept a negative TID The TID passed to "debug dev sched" is used to pin the task to a given thread. A negative value normally means the task is unpinned and goes to the shared wait queue and run queue. However due to the type of the variable, negative values were mapped as highly positive values and were set to the current thread. Let's add the proper cast to fix this. No backport is needed since this is only used to experiment with the scheduler and measure its performance.	2025-02-07 18:04:29 +01:00
Lukas Tribus	5926fb7823	DOC: option redispatch should mention persist options "option redispatch" remains vague in which cases a session would persist; let's mention "option persist" and "force-persist" as an example so folks don't draw the conclusion that this may be default. Should be backported to stable branches.	2025-02-06 17:49:13 +01:00
Christopher Faulet	d48b5add88	BUG/MINOR: stats-json: Define JSON_INT_MAX as a signed integer A JSON integer is defined in the range [-(253)+1, (253)-1]. Macro are used to define the minimum and the maximum value, The minimum one is defined using the maximum one. So JSON_INT_MAX must be defined as a signed integer value to avoid wrong cast of JSON_INT_MIN. It was reported by Coverity in #2841: CID 1587769. This patch could be backported to all stable versions.	2025-02-06 17:19:49 +01:00
Christopher Faulet	bc487afc85	MINOR: filters: Improve errors formating during filters parsing The error message reported by a filter during parsing are displayed between quotes. It is not really user friendly. So let's remove the quotes here.	2025-02-06 17:03:40 +01:00
Christopher Faulet	b20e2c96cf	BUG/MINOR: flt-trace: Support only one name option When a trace filter is defined, only one 'name' option is expected. But it was not tested. Thus it was possible to set several names leading to a memory leak. It is now tested, and it is not allowed to redefine the trace filter name. It was reported by Coverity in #2841: CID 1587768. This patch could be backported to all stable versions.	2025-02-06 17:01:15 +01:00
Christopher Faulet	a7f513af91	BUG/MINOR: auth: Fix a leak on error path when parsing user's groups In a userlist section, when a user is parsed, if a specified group is not found, an error is reported. In this case we must take care to release the alredy built groups list. It was reported by Coverity in #2841: CID 1587770. This patch could be backported to all stable versions.	2025-02-06 16:55:37 +01:00
Christopher Faulet	a1e14d2a82	BUG/MINOR: config/userlist: Support one 'users' option for 'group' directive When a group is defined in a userlist section, only one 'users' option is expected. But it was not tested. Thus it was possible to set several options leading to a memory leak. It is now tested, and it is not allowed to redefine the users option. It was reported by Coverity in #2841: CID 1587771. This patch could be backported to all stable versions.	2025-02-06 16:55:29 +01:00
Christopher Faulet	75e8c8ed33	BUG/MINOR: cli: Fix a possible infinite loop in _getsocks() In _getsocks() functuoin, when we failed to set the unix socket in non-blocking mode, a goto to "out" label led to loop infinitly. To fix the issue, we must only let the function exit. This patch should be backported to all stable versions.	2025-02-06 15:44:21 +01:00
Christopher Faulet	372cc696d4	BUG/MINOR: cli: Fix memory leak on error for _getsocks command Some errors in parse function of _getsocks commands were not properly handled and immediately returned, leading to a memory leak on cmsgbuf and tmpbuf buffers. To fix the issue, instead of immediately return with -1, we jump to "out" label. Returning 1 intead of -1 in that case is valid. This was reported by Coverity in #2841: CIDs 1587773 and 1587772. This patch should be backported as far as 2.4.	2025-02-06 15:43:04 +01:00
Christopher Faulet	7e927243b9	BUG/MINOR: cli: Don't set SE flags from the cli applet Since the CLI was updated to use the new applet API, it should no longer set directly the SE flags. Instead, the corresponding applet flags must be set, using the applet API (appet_set_*). It is true for the CLI I/O handler but also for the commands parse function and I/O callback function. This patch should be backported as far as 3.0.	2025-02-06 15:23:20 +01:00
Christopher Faulet	0aa69e7865	MINOR: mux-spop/mux-fcgi: Add support of the debug string for logs Now it is possible to have debug info about FCGI and SPOP multiplexers. To do so, the support for the MUX_SCTL_DBG_STR command was implemented for these muxes. The have this log message, the log-format must be set to: log-format "$HAPROXY_HTTP_LOG_FMT bs=<%[bs.debug_str]>"	2025-02-06 11:19:32 +01:00
Christopher Faulet	456cfa450a	MINOR: mux-fcgi: Dump info about connections and streams in dedicated functions fcgi_show_fd() function was splitted to dump the info about the FCGI connections and the FCGI streams in dedicated functions, duplicating this way what is performed in other muxes. In addition, the FCGI multiplexer now implements the .show_sd callback function called by "show sess" CLI command.	2025-02-06 11:19:32 +01:00
Christopher Faulet	bbc8c98a54	MINOR: tevt/mux-fcgi: Report termination events for the FCGI connect/stream Termination events are now reported for the FCGI connections and the FCGI streams. In addition, all available termination events logs are reported in the "show-fd" callback function. The .ctl and .sctl callback functions were also update to support, respectively, MUX_CTL_TEVTS and MUX_SCTL_TEVTS commands.	2025-02-06 11:19:32 +01:00
Christopher Faulet	5b1c2277ae	BUG/MEDIUM: mux-fcgi: Propagate flags to SE in fcgi_strm_wake_one_stream The commit is flagged as a bug because the same fix on the H2 multiplexer was reported as a bug. But no issue was reported. When a stream is explicitly woken up by the FCGI conneciton, if an error condition is detected, the corresponding error flag is set on the SE. So SE_FL_ERROR or SE_FL_ERR_PENDING, depending if the end of stream was reported or not. However, there is no attempt to propagate other termination flags. We must be sure to properly set SE_FL_EOI and SE_FL_EOS when appropriate to be able to switch a pending error to a fatal error. Because of this bug, the SE could remain with a pending error and no end of stream, preventing the applicative stream to trully abort it. It means on some abort scenario, it seems to be possible to block a stream infinitely. This patche depends on: * MEDIUM: mux-fcgi: Add a function to propagate termination flags from fstrm to SE * BUG/MEDIUM: mux-fcgi: Properly handle read0 on partial records This patch could be backported at least as far as 2.8 after a period of observation. However no bug was reportedn so there is no rush.	2025-02-06 11:19:32 +01:00
Christopher Faulet	ccdca4bb77	MEDIUM: mux-fcgi: Add a function to propagate termination flags from fstrm to SE The function fcgi_strm_propagate_term_flags() was added to check the FSTRM state and evaluate when EOI/EOS/ERR_PENDING/ERROR flags must be set on the SE. It is not the only place where those flags are set. But it centralizes the synchro between the FCGI stream and the SC. For now, this function is only used at the end of fcgi_rcv_buf(). But it will be used to fix a potential bug.	2025-02-06 11:19:32 +01:00
Christopher Faulet	7b638eb1a6	MINOR: mux-spop: Implement .show_sd callback function The SPOP multiplexer now implements the .show_sd callback function called by "show sess" CLI command.	2025-02-06 11:19:32 +01:00
Christopher Faulet	5aeb678762	MINOR: mux-spop: Dump info about connections and streams in dedicated functions spop_show_fd() function was splitted to dump the info about the SPOP connections and the SPOP streams in dedicated functions, duplicating this way what is performed in other muxes.	2025-02-06 11:19:32 +01:00
Christopher Faulet	eb4e517489	CLEANUP: mux-spop: Remove useless comments Just a small cleanup to remove some comments added during the development of the mux.	2025-02-06 11:19:32 +01:00
Christopher Faulet	4f8ae5b1f6	MINOR: tevt/mux-spop: Report termination events for the SPOP connect/stream Termination events are now reported for the SPOP connections and the SPOP streams. In addition, all available termination events logs are reported in the "show-fd" callback function. The .ctl and .sctl callback functions were also update to support, respectively, MUX_CTL_TEVTS and MUX_SCTL_TEVTS commands.	2025-02-06 11:19:32 +01:00
Christopher Faulet	514a912a4d	MINOR: mux-spop: Set SPOP_CF_ERROR flag on connection error only The SPOP_CF_ERROR flag is now set on connection error only. It was also set on some demux failures. But it is not mandatory because the connection is closed anyway. And it is handy to have a flag dedicated to tcp connection error. It was the original purpose of this flag. This patch could be backported to 3.1 to ease future backports.	2025-02-06 11:19:32 +01:00
Christopher Faulet	d16c534511	MINOR: mux-spop: Report EOI on the SE when a ACK is received for a stream The spop stream now reports the end of input when the ACK is transferred to the SPOE applet. To do so, the flag SPOP_SF_ACK_RCVD was added. It is set on the SPOP stream when its ACK is received by the SPOP connection. In addition when SPOP stream flags are propagated to the SE, the error is now reported if end of input was not reached instead of testing the connection error code. It is more accurate. This patch should be backported to 3.1.	2025-02-06 11:19:32 +01:00
Christopher Faulet	f7e5718596	MINOR: flt-spoe: Report end of input immediately after applet init The SPOE applet forwards the message that must be sent to agent during its init stage. So just after it is created. When it is performed, the end of input must be reported because no more data will be forwarded. However, it was performed after receiving the ACK response. It is harmless, but there is no reason to delay the EOI. It is now fixed. This patch must be backported to 3.1.	2025-02-06 11:19:32 +01:00
Christopher Faulet	38aac2c7bc	BUG/MEDIUM: flt-spoe: Properly handle end of stream from the SPOE applet The previous fix ("BUG/MEDIUM: applet: Don't pretend to have more data to handle EOI/EOS/ERROR") revealed an issue with the way the SPOE applet was reporting the end of stream, leading to never shut the applet down. In fact, there is two bug in one. The first one is about the applet shutdown. Since the fix above, the applet is no longer closed. Before, it was closed because it was reported in error. But now, it is just delayed because the applet and the SPOP stream are declared to support half close connections. So the applet is only closed when the SPOP connection is closed. To fix this bug, both side are now stating that half close connections are not supported. The second bug is about the way the end of stream is reported. It is reported when the ACK response is received. But it is too early, because the parent stream must process the response first. So now, we take care to have processed the ACK from the parent applet before reporting an end of stream. This patch must be backported with the commit above to 3.1.	2025-02-06 11:19:32 +01:00
Christopher Faulet	7214dcd52d	BUG/MEDIUM: applet: Don't pretend to have more data to handle EOI/EOS/ERROR The way appctx EOI/EOS/ERROR flags were reported for applets using the new API were to state the applet had more data to deliver. But it was not correct and for APPCTX_FL_EOS, this led to report an error on the SE because it is not expected. More data to deliver and an end of stream is an impossible situation. This was added as a fix by commit `b8ca114031` ("BUG/MEDIUM: applet: State appctx have more data if its EOI/EOS/ERROR flag is set"), mainly to make the SPOE applet work. When an applet set one of these flags, it really means it has no more data to deliver. So we must not try to trigger a new receive to handle these flags. Instead we must handle them directly in task_process_applet() function and only if the corresponding SE flags were not already set. This patch must be backported to 3.1.	2025-02-06 11:19:32 +01:00
Christopher Faulet	db504fbdbe	BUG/MEDIUM: flt-spoe: Set/test applet flags instead of SE flags from I/O handler The SPOE applet is using the new applet API. Thus end of input, end of stream and errors must be reported using the applet flags, not the SE flags. This was not the case. So let's fix it. It seems this bug is harmless for now. This patch must be backported to 3.1.	2025-02-06 11:19:32 +01:00
Christopher Faulet	54a09dfe0f	BUG/MINOR: tevt/mux-h2: Set truncated receive/eos events at SE level on error When receive or EOS termination events are reported at the SE level, a truncation was erroneously reported when no error was detected. Of course, it must be the opposite. No backport needed.	2025-02-06 11:19:32 +01:00
Frederic Lecaille	85cb1cc7f4	BUILD: ssl: remove a boringssl definition defined by recent boringssl libs This is the case for AWS-LC which derives from boringssl, where X509_OBJECT_get0_X509_CRL() is already defined. There is definitively no more need to define this function to build haproxy against TLS libs derived from boringssl.	2025-02-06 10:48:25 +01:00
Christopher Faulet	fad68cb16d	BUG/MINOR: http-check: Don't pretend a C-L heeader is set before adding it When a GET/HEAD/OPTIONS/DELETE healthcheck request was formatted, we claimed there was a "content-length" header set even when there was no payload, leading to actually send a "content-length: 0" header to the server. It was unexpected and could be rejected by servers. When a healthcheck request is sent we must take care to state there is a "content-length" header when it is explicitly added. This patch should fix the issue #2851. It must be backported as far as 2.9.	2025-02-03 18:46:41 +01:00
Aurelien DARRAGON	0846638f7f	MEDIUM: stream: interrupt costly rulesets after too many evaluations It is not rare to see configurations with a large number of "tcp-request content" or "http-request" rules for instance. A large number of rules combined with cpu-demanding actions (e.g.: actions that work on content) may create thread contention as all the rules from a given ruleset are evaluated under the same polling loop if the evaluation is not interrupted Thus, in this patch we add extra logic around "tcp-request content", "tcp-response content", "http-request" and "http-response" rulesets, so that when a certain number of rules are evaluated under the single polling loop, we force the evaluating function to yield. As such, the rule which was about to be evaluated is saved, and the function starts evaluating rules from the save pointer when it returns (in the next polling loop). We use task_wakeup(task, TASK_WOKEN_MSG) to explicitly wake the task so that no time is wasted and the processing is resumed ASAP. TASK_WOKEN_MSG is mandatory here because process_stream() expects TASK_WOKEN_MSG for explicit analyzers re-evaluation. rules_bcount stream's attribute was added to count how manu rules were evaluated since last interruption (yield). Also, SF_RULE_FYIELD flag was added to know that the s->current_rule was assigned due to forced yield and not regular yield. By default haproxy will enforce a yield every 50 rules, this behavior can be configured using the "tune.max-rules-at-once" global keyword. There is a limitation though: for now, if the ACT_OPT_FINAL flag is set on act_opts, we consider it is not safe to yield (as it is already the case for automatic yield). In this case instead of yielding an taking the risk of not being called back, we skip the yield and hope it will not create contention. This is something we should ideally try to improve in order to yield in all conditions.	2025-02-03 17:09:48 +01:00
Christopher Faulet	04bbfa4354	BUG/MINOR: tcp-rules: Don't forward close during tcp-response content rules eval When the tcp-response content ruleset evaluation is delayed because of an ACL condition, the close forwarding on the client side is not explicitly blocked. So it is possible to close the client side before the end of the response evaluation. To fix the issue, this is now done in all cases where some data are missing. Concretely, channel_dont_close() is called in "missing_data" goto label. Note it is only a theorical bug (or pending bug). It is not possible to trigger it for now because an ACL cannot wait for more data when a close was received. But the code remains a bit weak. It is safer this way. It is especially mandatory for the "force yield" option that should be added soon. This patch could be backported to all stable versions.	2025-02-03 15:31:59 +01:00
Christopher Faulet	431c5533b7	DEBUG: mux-h1: Remove some debug counters Several debug counters were added to debug a strange issue about early aborts. Most of them are now useless, especially because it is now possible to rely on the termination events logs. So, it is better to remove them. Note that these counters are still there in 3.1.	2025-02-03 08:48:31 +01:00
Christopher Faulet	1c6512f8fc	DEBUG: http-ana: Remove debug counters from HTTP analyzers Several debug counters were added in HTTP analyzers to help debugging a strange issue about early aborts. But these counters are a bit overkill now. Especially because it is now possible to rely on the termination event log. So just remove them. Note that these counters are still there in 3.1.	2025-02-03 08:28:45 +01:00
Christopher Faulet	274c9d21a6	BUG/MINOR: tevt/http-ana: Remove badly placed event reports When specific events for the stream location were added, some reports about message interception were not removed. These reports are now removed. No need to backport.	2025-02-03 08:20:41 +01:00
Christopher Faulet	5f927f603a	BUG/MEDIUM: mux-fcgi: Properly handle read0 on partial records A Read0 event could be ignored by the FCGI multiplexer if it is blocked on a partial record. Instead of handling the event, it remained blocked, waiting for the end of the record. To fix the issue, the same solution than the H2 multiplexer is used. Two flags are introduced. The first one, FCGI_CF_END_REACHED, is used to acknowledge a read0. This flag is set when a read0 was received AND the FCGI multiplexer must handle it. The second one, FCGI_CF_DEM_SHORT_READ, is set when the demux is interrupted on a partial record. A short read and a read0 lead to set the FCGI_CF_END_REACHED flag. With these changes, the FCGI mux should be able to properly handle read0 on partial records. This patch should be backported to all stable versions after a period of observation.	2025-02-03 07:49:50 +01:00
William Lallemand	0a28b1ea0c	MEDIUM: htx: prevent <mark> to copy incomplete headers in htx_xfer_blks() Prevent a partial copy of trailers or headers when using the <mark> parameter. When using htx_xfer_blks(), transfering partial headers or trailers are prevented when restricted by the <count> parameter. However using the <mark> parameter will still allow to do it. This patch changes the behavior by checking the <mark> type only after checking the headers/trailers type, so we can still rollback on partial transfer. No impact on the current code, which does not try to do that yet.	2025-01-31 15:51:51 +01:00
Amaury Denoyelle	4ad2accfee	BUILD: quic: remove GCC undefined error in qc_release_lost_pkts() Every once in a while, GCC reports issues with qc_release_lost_pkts() function. It seems that its static analysis is foiled by the code structuring. The latest warning reports the following issue : CC src/quic_loss.o src/quic_loss.c: In function ‘qc_release_lost_pkts’: src/quic_loss.c:313:58: error: potential null pointer dereference [-Werror=null-dereference] 313 \| unsigned int period = newest_lost->time_sent_ms - oldest_lost->time_sent_ms; \| ~~~~~~~~~~~^~~~~~~~~~~~~~ To fix definitely this, change slightly the code. <oldest_lost> and <newest_lost> are now initialized on the first list entry outside of the loop. This is enough to guarantee to GCC that they cannot be NULL for the remainder of the function.	2025-01-31 15:34:30 +01:00
William Lallemand	c17e029232	DOC: htx: clarify <mark> parameter for htx_xfer_blks() Clarify the fact that the first <mark> block is transferred before stopping when using htx_xfer_blks()	2025-01-31 15:23:47 +01:00
William Lallemand	c6390cdf9c	BUG/MEDIUM: htx: wrong count computation in htx_xfer_blks() When transfering blocks from an src to another dst htx representation, htx_xfer_blks() decreases the size of each block removed from the <count> value passed in parameter, so it can't transfer more than <count>. The size must also contains the metadata, represented by a simple sizeof(struct htk_blk). However, the code was doing a sizeof(dstblk) instead of a sizeof(*dstblk) which as the consequence of removing only a size_t from count. Fortunately htx_blk size is 64bits, so that does not provoke any problem in 64bits. But on 32bits architecture, the count value is not decreased correctly and the function could try to transfer more blocks than allowed by the count parameter. Must be backported in every stable release.	2025-01-31 15:02:58 +01:00
Christopher Faulet	956cb5d554	MINOR: tevt/dev: Parse tuple of termination events term_events tool is now able to parse tuple of termination events, as returned by "term_events" sample fetch function.	2025-01-31 10:46:08 +01:00
Christopher Faulet	71320fc9c1	MINOR: tevt/connection: Add support for POLL_HUP/POLL_ERR events Connection errors can be detected via connect/recv/send syscall, but also because it was reported by the poller. So dedicated events, at the FD level, are introduced to make the difference. term_events tool was updated accordingly.	2025-01-31 10:41:50 +01:00
Christopher Faulet	c7457427ab	MINOR: tevt/dev: Add term_events tool This development tool can be used to convert a string representing a termination event logs to its human redable representation. Several string may be converting at a time. To do so, several arguments can be specified on the commeand line or they can be provided on STDIN, using "-" argument. Here is an exemple: > term_events f2x2f4x4 m2m4m1 e2e1 s2s1S1 E1 M1 F1 ### f2x2f4x4 : fd:shutr > xprt:shutr > fd:snd_err > xprt:snd_err ### m2m4m1 : muxc:shutr > muxc:snd_err > muxc:shutw ### e2e1 : se:eos > se:shutw ### s2s1S1 : strm:eos > strm:shutw > STRM:shutw ### E1 : SE:shutw ### M1 : MUXC:shutw ### F1 : FD:shutw The make target "dev/term_events/term_events" must be used to compile it.	2025-01-31 10:41:50 +01:00

... 9 10 11 12 13 ...

24409 Commits