When uri_auth admin rules were implemented in 474be415
("[MEDIUM] stats: add an admin level") no attempt was made to free the
list of allocated rules, which makes valgrind unhappy upon deinit when
"stats admin" is used in the config.
To fix the issue, let's cleanup the admin rules list upon deinit where
uri_auth freeing is already handled.
While this could be backported to every stable version, given how minor
this is and that it has no impact on the dying process, it is probably not
worth the effort.
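For illustration, a minimal sketch of such a deinit cleanup using HAProxy's
list API; the struct and member names (stats_admin_rule, admin_rules) are
assumptions, not the verbatim code:

    struct stats_admin_rule *rule, *back;

    /* hypothetical names: walk the admin rules list and release each entry */
    list_for_each_entry_safe(rule, back, &uap->admin_rules, list) {
            LIST_DELETE(&rule->list);
            free_acl_cond(rule->cond);
            free(rule);
    }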
The h2c_report_glitch() function is now replaced with a macro to support
enumerating counters for each individual glitch line. For now this adds
43 such counters. The macro supports an optional description, though it
is not used for now. It gives output like this (note that the last
one was purposely instrumented to pass a description):
> debug dev counters glt all
0 GLT mux_h2.c:5976 h2c_dec_hdrs()
0 GLT mux_h2.c:5960 h2c_dec_hdrs()
(...)
0 GLT mux_h2.c:2207 h2c_frt_recv_preface()
0 GLT mux_h2.c:1954 h2c_frt_stream_new(): new stream too early
As a reminder, this requires to build with -DDEBUG_GLITCHES=1.
COUNT_GLITCH() will implement an unconditional counter on its declaration
line when DEBUG_GLITCHES is set, and do nothing otherwise. The output will
be reported as "GLT" and can be filtered as "glt" on the CLI. The purpose
is to help figure out what's happening if some glitch counters start going
through the roof. The macro supports an optional string argument to
describe the cause of the glitch (e.g. "truncated header"), which is then
reported in the dump.
For now this is conditioned by DEBUG_GLITCHES but if it turns out to be
light enough, maybe we'll keep it enabled full time. In this case it
might have to be moved away from debug dev, or at least documented (or
done as debug counters maybe so that dev can remain undocumented and
updatable within a branch?).
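To give an idea of the mechanism, here is a hedged sketch of how such a
macro can be implemented; the struct name, field list and section name are
assumptions, not the actual code:

    #if defined(DEBUG_GLITCHES)
    /* hypothetical sketch: one static counter per expansion site, placed
     * in a dedicated section so the CLI can enumerate them all
     */
    #define COUNT_GLITCH(...) do {                                         \
            static struct dbg_cnt __attribute__((used,section("dbg_cnt"))) \
                    _ = { .type = "GLT", .file = __FILE__,                 \
                          .line = __LINE__, .func = __func__,              \
                          .desc = "" __VA_ARGS__ };                        \
            _HA_ATOMIC_INC(&_.count);                                      \
    } while (0)
    #else
    #define COUNT_GLITCH(...) do { } while (0)
    #endif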
In order to count new event types, we'll need to support empty conditions
so that we don't have to fake an "if (1)" that would pollute the output.
This change checks if #cond is an empty string before concatenating it with
the optional var args, and avoids dumping the colon in the dump if the
whole description is empty.
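A tiny standalone illustration of the trick (demo only, not the actual
macro): stringifying an empty argument yields "", so sizeof(#cond) == 1
detects it and the "cond: " prefix with its colon can be dropped:

    #include <stdio.h>

    /* pick "cond: desc" or just "desc" depending on whether the
     * condition argument is empty
     */
    #define MAKE_DESC(cond, desc) \
            (sizeof(#cond) > 1 ? #cond ": " desc : "" desc)

    int main(void)
    {
            printf("[%s]\n", MAKE_DESC(len > 16, "truncated header"));
            printf("[%s]\n", MAKE_DESC(, "truncated header"));
            return 0;
    }

This prints "[len > 16: truncated header]" then "[truncated header]".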
When a resolver is woken up to process DNS resolutions, it is possible to
trigger an infinite loop on the resolver's wait list because delayed
resolutions are always reinserted at the end of this list. This causes the
watchdog to kill the process. Re-inserting them at the front of the list
fixes the bug.
When a resolver tries to send the queries for the resolutions in its wait
list, it may be unable to proceed for a resolution. This may happen because
the resolution must be skipped (no hostname to resolve, a resolution already
in progress) or because an error occurred. In that case, the resolution is
re-inserted in the resolver's wait list to be retried later, on a next wakeup.
However, the resolution is inserted at the end of the wait list. So it is
immediately reevaluated, in the same execution loop, instead of being
delayed. Most of the time, it is not an issue because the resolution is
considered as not expired on the second run. But it is a problem when the
internal time wraps and is equal to 0. In that case, the resolution
expiration date is badly computed and it is always considered as expired. If
two or more resolutions are in that state, the resolver loops forever on
its wait list, until the process is killed by the watchdog.
So we can argue that the way the resolution expiration date is computed must
be fixed. And it would be true in a perfect world. However, the resolvers
code is so crappy that it is hard to be sure not to introduce regressions. It
is far easier to re-insert delayed resolutions at the front of the wait
list. This fixes the issue and, at worst, these resolutions will be evaluated
one time too many on the next wakeup, and only if now_ms was equal to 0 on
the prior wakeup.
This patch should be backported to all stable versions. On 2.2, LIST_ADD()
must be used instead of LIST_INSERT().
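Illustrated as a hedged sketch (the variable names approximate the
resolvers code, they are not verbatim):

    /* delayed resolution: re-insert at the head of the wait list so the
     * current walk of the list cannot pick it up again in the same wakeup
     */
    LIST_DELETE(&res->list);
    LIST_INSERT(&resolvers->resolutions.wait, &res->list);
    /* on 2.2, LIST_ADD() is the head-insert primitive instead */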
In connecting state, a shutdown must not be forwarded or scheduled because
otherwise this will prevent any connection retries. Indeed, if an EOS is
reported by the mux during the connection establishment, this should be
handled by the stream to eventually retry. If the write side is closed
first, this will not be possible because the stconn will be switched to DIS
state. If the shut is scheduled because pending data are blocked, the same
may happen, depending on the abort-on-close option.
This patch should slowly be backported as far as 2.4. But an observation
period is mandatory. On 2.4, the patch must be adapted to use the
stream-interface API.
Before this fix, HAPROXY_CLI and HAPROXY_MASTER_CLI contained, along with
the CLI socket addresses, internal sockpairs which are only used for the
master CLI (the reload sockpair and the sockpair shared with a worker
process). These internal sockpairs always need to be hidden.
At the moment there is no client that uses sockpair addresses for the
stats listener or to connect to the master CLI. So, let's simply not copy
these internal sockpair addresses of MASTER and GLOBAL proxy listeners.
As listeners with sockpairs are skipped and they can appear in the
listeners list in any order, let's add the semicolon separator between
addresses only when some string is already saved in the trash and we are
sure that we are appending a new address to it. Otherwise, we could get
weird output like this:
HAPROXY_MASTER_CLI=unix@/tmp/mcli.sock;;
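A hedged sketch of the separator logic; chunk_appendf() and the trash
buffer are HAProxy's, the surrounding variable is illustrative:

    /* only emit the ';' separator once an address is already present */
    if (trash.data)
            chunk_appendf(&trash, ";");
    chunk_appendf(&trash, "%s", addr_str);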
This fix needs to be backported to all stable versions.
load_cfg() is called only once, before the first reading of the configuration
(we only parse the global section here). Then, before reading the rest of the
sections (second call of read_cfg()), we call clean_env(). As
HAPROXY_CFGFILES is set in load_cfg(), which is called only once, clean_env()
erases it. Thus, it's no longer shown in "show env" output.
To fix this, let's set HAPROXY_CFGFILES in read_cfg(). This way, in
master-worker mode, it is set for both the master and the worker processes,
as it was before the refactoring.
This fix doesn't need to be backported, as it is related to the latest
master-worker architecture change.
After the master-worker refactoring, the master performs a re-exec only
once, upon receiving the "reload" command or the USR2 signal. There is no
longer a second master re-exec to free unused memory. Thus, there is no
longer any need to export the environment variable HAPROXY_LOAD_SUCCESS
with the worker process load status. This status can simply be saved in a
global variable, load_status.
In order for the function flt_ot_vars_scope_dump() to work, it is
necessary to take into account the changes made by commits 47ec7c681
("OPTIM: vars: use a cebtree instead of a list for variable names") and
5d350d1e5 ("OPTIM: vars: use multiple name heads in the vars struct").
The function is only used if the OT_DEBUG=1 option is set when compiling
HAProxy.
The program section is unreliable and should not be used; more reliable
alternatives exist outside HAProxy. Let's deprecate the section so we
can remove it completely in 3.3.
Released version 3.1-dev12 with the following main changes :
- MINOR: startup: tune.renice.{startup,runtime} allow to change priorities
- BUG/MEDIUM: promex: Fix dump of extra counters
- BUILD: import/mt_list: support building with TCC
- BUILD: compiler: define __builtin_prefetch() for tcc
- CLEANUP: quic: Remove the useless directive "tune.quic.backend.max-idle-timeou"
- DOC: config: document connection error 44 (reverse connect failure)
- CLEANUP: connection: properly name the CO_ER_SSL_FATAL enum entry
- DEBUG: cli: support closing "hard" using close() in addition to fd_delete()
- MINOR: connection: add more connection error codes to cover common errno
- MINOR: rawsock: set connection error codes when returning from recv/send/splice
- MINOR: connection: add new sample fetch functions fc_err_name and bc_err_name
- MINOR: quic: Help diagnosing malformed probing packets
- BUG/MINOR: quic: fix malformed probing packet building
- MINOR: listener: Remove useless checks on the receiver protocol existence
- MINOR: http-conv: Remove unreachable goto statement in sample_conv_q_preferred
- MINOR: http: don't %-encode the payload when not relevant
- MINOR: quic: simplify qc_parse_pkt_frms() return path
- MINOR: quic: use dynamically allocated frame on parsing
- MINOR: quic: extend return value of CRYPTO parsing
- BUG/MINOR: quic: repeat packet parsing to deal with fragmented CRYPTO
- BUG/MINOR: mworker: do 'program' postparser checks in read_cfg_in_discovery_mode
- EXAMPLES: add "traces.cfg" with traces examples
- BUG/MEDIUM: quic: do not consider ACK on released stream as error
- CLEANUP: stats: fix misleading comment on top of stat_idx_info
- MINOR: wdt: move the local timers to a struct
- MINOR: debug: add a function to dump a stuck thread
- DEBUG: wdt: better detect apparently locked up threads and warn about them
- DEBUG: cli: make it possible for "debug dev loop" to trigger warnings
- DEBUG: wdt: make the blocked traffic warning delay configurable
- DEBUG: wdt: add a stats counter "BlockedTrafficWarnings" in show info
- DEBUG: wdt: set the default blocked task delay to 100 ms
- MINOR: debug: move the "recover now" warn message after the optional notes
- MINOR: event_hdl: add event_hdl_sub_list_empty() helper func
- MINOR: pattern: add _pat_ref_new() helper func
- OPTIM: pattern: use malloc() to initialize new pat_ref struct
- MINOR: pattern: add pat_ref_free() helper func
- CLEANUP: guid: remove global tree export
- BUG/MINOR: guid/server: ensure thread-safety on GUID insert/delete
- DOC: management: explain the change of behavior of the program section
- BUG/MEDIUM: mux-h2: try to wait for the peer to read the GOAWAY
- BUG/MEDIUM: quic: prevent crash due to CRYPTO parsing error
A packet which contains several split and out-of-order CRYPTO frames
may be parsed multiple times to ensure it can be handled via ncbuf. Only
3 iterations can be performed to prevent excessive CPU usage.
There is a risk of crash if packet parsing is interrupted after the
maximum number of iterations is reached, or when no progress can be made
on the ncbuf. This is because <frm> may be dangling after
list_for_each_entry_safe().
The crash occurs on qc_frm_free() invocation, on the error path of
qc_parse_pkt_frms(). To fix it, always reset frm to NULL after
list_for_each_entry_safe() to ensure it is not dangling.
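A hedged illustration of the fix; qc_parse_frm() and the surrounding shape
are assumptions, not the exact mux code:

    struct quic_frame *frm = NULL, *frm_tmp;
    int ret = 0;

    list_for_each_entry_safe(frm, frm_tmp, &pkt->frms, list) {
            if (!qc_parse_frm(qc, pkt, frm))
                    goto err;       /* <frm> is valid here */
    }
    frm = NULL; /* the iterator may be dangling past the loop: reset it */

    if (!ncb_progress)
            goto err;               /* would crash without the reset above */

    ret = 1;
     err:
            if (frm)
                    qc_frm_free(qc, &frm);
            return ret;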
This should fix the new report on github issue #2776. This regression has
been triggered by the following patch :
1767196d5b
BUG/MINOR: quic: repeat packet parsing to deal with fragmented CRYPTO
As such, it must be backported up to 2.6, after the above patch.
When timeout http-keep-alive is very short (e.g. 10ms), it's possible
sometimes for a client to face truncated responses due to an early
close that happens while the system is still pushing the last data,
colliding with the client's WINDOW_UPDATEs that trigger RSTs.
Here we're trying to do better: first we send a GOAWAY on timeout, then
we wait up to clientfin/client timeout for the peer to react so that we
don't immediately close. This is sufficient to avoid truncation as soon
as the timeout is more than a few hundred ms.
It's not certain it should be backported, because it's a bit sensitive
and might possibly fall into certain edge cases.
Since 3.0, it is possible to assign a GUID to proxies, listeners and
servers. These objects are stored in a global tree guid_tree.
Proxies and listeners are static. However, servers may be added or
deleted at runtime, which implies that guid_tree must be protected. Fix
this by declaring a read-write lock to protect tree access.
For now, only guid_insert() and guid_remove() are protected using a
write lock. Outside of these, GUID tree is not accessed at runtime. If
server CLI commands are extended to support GUID as server identifier,
lookup operation should be extended with a read lock protection.
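As a hedged sketch of the pattern, using HAProxy's rwlock API; the lock
label, node type and insert/delete calls are illustrative, not the exact
code:

    __decl_thread(static HA_RWLOCK_T guid_lock);

    /* guid_insert(): take the lock for writing around the tree insert */
    HA_RWLOCK_WRLOCK(OTHER_LOCK, &guid_lock);
    ebis_insert(&guid_tree, &guid->node);
    HA_RWLOCK_WRUNLOCK(OTHER_LOCK, &guid_lock);

    /* guid_remove(): same protection around the delete */
    HA_RWLOCK_WRLOCK(OTHER_LOCK, &guid_lock);
    ebpt_delete(&guid->node);
    HA_RWLOCK_WRUNLOCK(OTHER_LOCK, &guid_lock);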
Note that during stat-file preloading, GUID tree is accessed for lookup.
However, as it is performed on startup which is single threaded, there
is no need for lock here. A BUG_ON() has been added to ensure this
precondition remains true.
This bug could cause a segfault when using dynamic servers with GUID.
However, it has never been reproduced so far.
This must be backported up to 3.0. To avoid a conflict issue, the
previous cleanup patch can be merged before it.
For now, pat_ref structs are never freed, except during init in case of
error. The freeing is done directly in the init functions because we
don't have a helper for that.
Not having a helper func to properly free a pat_ref struct doesn't encourage
us to free unused pat_ref structs, plus it is error-prone if new dynamic
members are added to the pat_ref struct in the future.
To fix that, let's add a pat_ref_free() helper func and use it where
relevant (which means only in the pat_ref init functions for now).
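A minimal sketch of such a helper; this is simplified, the real struct has
more dynamic members to release:

    void pat_ref_free(struct pat_ref *ref)
    {
            if (!ref)
                    return;
            free(ref->reference);
            free(ref->display);
            free(ref);
    }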
As mentioned in the previous commit, in _pat_ref_new(), it was not
strictly needed to explicitly assign all struct members to 0 since
the struct was allocated with calloc() which does the zeroing for us.
However, it was verified that we already initialize all fields explicitly,
thus there is no reason to keep using calloc() instead of malloc(). In
fact, using malloc() is less expensive, so let's use that instead now.
pat_ref_newid() and pat_ref_new() are two functions to create and
initialize a pat_ref struct based on input parameters.
Both functions perform the same generic allocation and initialization
for pat_ref struct, thus there is quite a lot of code redundancy.
This is error-prone if the pat_ref init sequence has to be updated at
some point.
To reduce maintenance costs, let's add a _pat_ref_new() helper func that
takes care of the generic allocation and base initialization for pat_ref
struct.
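A hedged sketch of the factored-out helper; the field names are simplified
from the real struct pat_ref:

    static struct pat_ref *_pat_ref_new(const char *display, unsigned int flags)
    {
            struct pat_ref *ref;

            ref = malloc(sizeof(*ref));
            if (!ref)
                    return NULL;

            /* base initialization shared by pat_ref_new() and
             * pat_ref_newid(); every field is set explicitly since
             * malloc() does not zero the struct
             */
            ref->flags = flags;
            if (display) {
                    ref->display = strdup(display);
                    if (!ref->display) {
                            free(ref);
                            return NULL;
                    }
            }
            else
                    ref->display = NULL;
            LIST_INIT(&ref->head);
            LIST_INIT(&ref->pat);
            return ref;
    }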
event_hdl_sub_list_empty() may be used to know if the subscription list
passed as argument is empty or not (ie: whether there currently are any
subscribers). It can be useful to know that a subscription list is empty
in order to avoid unnecessary preparation work and skip event publishing,
to save CPU time if we already know that no one is interested in tracking
the changes for a given subscription list.
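For example, a hedged usage sketch modeled on the server events code; the
preparation helper and data struct here stand in for whatever work the
publisher would otherwise always perform:

    struct event_hdl_cb_data_server cb_data;

    /* skip the (potentially expensive) event data preparation and the
     * publish entirely when nobody subscribed to this list
     */
    if (!event_hdl_sub_list_empty(&srv->e_subs)) {
            _srv_event_hdl_prepare(&cb_data, srv, 0);
            event_hdl_publish(&srv->e_subs, EVENT_HDL_SUB_SERVER_DOWN,
                              EVENT_HDL_CB_DATA(&cb_data));
    }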
At the end of the too long processing warning added by commit 0950778b3a
("MINOR: debug: add a function to dump a stuck thread"), there can be some
optional notes about lua and memory trimming. However it's a bit awkward
that they appear after the "trying to recover now" message. Let's just move
that message after the notes.
The warn-blocked-traffic-after delay can be significantly lowered. In any
case, in order to be usable it must be well below the limit to have a
chance to emit exploitable traces before the watchdog finally fires.
Even configured at 1ms it looks very difficult to trigger it on a
laptop doing SSL and compression, so applying a 100-fold factor to
cover for large configs and small machines sounds sane for 3.1. In any
case, even at 100ms, the service degradation becomes quite visible.
In order to help users detect when threads are behaving abnormally, let's
try to emit a warning when one is no longer making any progress. This will
allow to catch faulty situations more accurately, instead of occasionally
triggering just after the long task. It will also let users know that there
is something wrong with their configuration, and inspect the call trace to
figure whether they're using excessively long rules or Lua for example (the
usual warnings about lua-load vs lua-load-per-thread are still reported).
The warning will only be emitted for threads not yet marked as stuck so
as not to interfere with panic dumps and avoid sending a warning just
before a panic. A tainted flag is set when this happens however (0x2000).
There's currently no way to just emit a warning informing that a thread
is stuck without crashing. This is a problem because sometimes users
would benefit from this info to clean up their configuration (e.g. abuse
of map_regm, lua-load etc).
This commit adds a new function ha_stuck_warning() that will emit a
warning indicating that the designated thread has been stuck for XX
milliseconds, with a number of streams blocked, and will make that
thread dump its own state. The warning will then be sent to stderr,
along with some reminders about the impacts of such situations to
encourage users to fix their configuration.
In order not to disrupt operations, a local 4kB buffer is allocated
on the stack. This should be quite sufficient.
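A hedged sketch of the function's shape; the real one in src/debug.c
differs in details:

    void ha_stuck_warning(int thr)
    {
            char msg[4096];  /* local stack buffer: no allocation needed */
            struct buffer buf = b_make(msg, sizeof(msg), 0, 0);

            chunk_appendf(&buf, "WARNING! thread %d has apparently been "
                          "stuck for some time; dumping its state below.\n",
                          thr);
            /* ... ask thread <thr> to append its own dump to <buf> ... */
            DISGUISE(write(2, buf.area, buf.data));
    }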
For now the function is not used.
The comment asks to update the "metrics_info" array, which does not
exist; it's actually called stat_cols_info[] and is in stats.c. Let's
mention all that to save time searching for the needed info.
While no version seems to have ever known that "metrics_info", there is
no need to backport this as it's only a comment.
When an ACK is received by haproxy, a lookup is performed to retrieve
the related emitted frames. For STREAM type frames, a lookup is
performed in the quic_conn stream_desc tree. Indeed, the corresponding
stream instance could already have been released if multiple ACKs were
received referring to the same stream offset, which can happen notably if
retransmission occurred.
qc_handle_newly_acked_frm() implements this logic. If the case of an
already released stream is encountered, an error is returned. In the end,
this error is propagated via qc_parse_pkt_frms() into
qc_treat_rx_pkts(), despite being in fact a perfectly valid case. Fix
this by adjusting the ACK handling function to return a success value for
the particular case of a released stream instead.
The impact of this bug is unknown, but it can have several consequences.
* if the packet with the ACK contains other frames after it, their
content will be skipped
* the packet won't be acknowledged by haproxy, even if it contains other
frames and is ack-eliciting. This may cause unneeded retransmission by
the client.
* RTT sampling information related to this ACK is ignored by haproxy
Finally, it also caused the increment of the quic_conn counter
dropped_parsing (droppars in "show quic" output), which should be
reserved only for real error cases.
This regression is present since the following patch :
e7578084b0
MINOR: quic: implement dedicated type for out-of-order stream ACK
Before, the qc_handle_newly_acked_frm() return value was always ignored. As
such, no backport is needed.
cfg_program_postparser() contains 2 parts:
- check the combination of MODE_MWORKER and the "program" section: if
the "program" section was parsed, MODE_MWORKER is mandatory;
- check "command" keyword, which is mandatory for this section as
well.
It is more appropriate now, after the master-worker refactoring, to do the
first part in read_cfg_in_discovery_mode(), where we already check the
combination of MODE_MWORKER and the -S option.
We need to do the second part just below, in read_cfg_in_discovery_mode() as
well, because only the master process now parses the program section, and
programs are forked before running the postparser functions in step_init_2.
Otherwise, mworker_ext_launch_all() would emit a log message saying that the
program is started while actually nothing has been launched, if the
'command' keyword is absent.
This does not need to be backported, as it is related to the master-worker
refactoring.
A ClientHello may be split across several different CRYPTO frames,
then mixed in a single QUIC packet. This is used notably by clients such
as Chrome to render the first Initial packet opaque to middleboxes.
Each packet frame is handled sequentially. Out-of-order CRYPTO frames
are buffered in a ncbuf, until gaps are filled and data is transferred
to the SSL stack. If CRYPTO frames are heavily split into small
fragments, buffering may fail as ncbuf does not support small gaps. This
causes the whole packet to be rejected and unacknowledged. It could be
solved if the client reemitted its ClientHello after remixing its CRYPTO
frames.
This patch is written to improve CRYPTO frame parsing. Each CRYPTO
frame which cannot be buffered due to the ncbuf limitation is now stored
in a temporary list. Packet parsing is completed until all frames have
been handled. If the temporary list is not empty, reparsing is done on the
stored frames. With the newly buffered CRYPTO frames, the ncbuf insert
operation may this time succeed if the frame now covers a whole gap.
Reparsing will loop until either no progress can be made or it has been
done at least 3 times, to prevent excessive CPU utilization.
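A hedged sketch of the reparsing loop described above; the return type
values, list names and exact arguments are illustrative, not the actual
quic_rx code:

    struct quic_frame *frm, *back;
    int iter = 0, progress;

    do {
            progress = 0;
            list_for_each_entry_safe(frm, back, &retry_frms, list) {
                    switch (qc_handle_crypto_frm(qc, &frm->crypto, pkt)) {
                    case QUIC_RX_RET_FRM_DONE:
                            LIST_DELETE(&frm->list);
                            qc_frm_free(qc, &frm);
                            progress = 1;
                            break;
                    case QUIC_RX_RET_FRM_AGAIN:
                            break; /* still unbufferable, keep for next pass */
                    case QUIC_RX_RET_FRM_FATAL:
                            goto err;
                    }
            }
    } while (!LIST_ISEMPTY(&retry_frms) && progress && ++iter < 3);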
This patch should fix github issue #2776.
This should be backported up to 2.6, after a period of observation. Note
that it relies on the following refactor patches :
MINOR: quic: extend return value of CRYPTO parsing
MINOR: quic: use dynamically allocated frame on parsing
MINOR: quic: simplify qc_parse_pkt_frms() return path
qc_handle_crypto_frm() is the function used to handle a newly received
CRYPTO frame. Change its API to use a newly dedicated return type. This
allows reporting whether the frame was properly handled, ignored if
already parsed previously, or rejected after a fatal error.
This commit does not have any functional changes. However, it allows
simplifying the qc_handle_crypto_frm() API by removing <fast_retrans> as
an output parameter. Also, this patch will be necessary to support
multiple iterations of packet parsing for CRYPTO frames.
qc_parse_pkt_frms() is the function responsible for parsing a received QUIC
packet. The payload is decoded and split into individual frames which are
then handled individually. Previously, the frame was allocated locally on
the stack. Change this to work on a dynamically allocated frame.
This commit does not bring any functional changes. However, it will be
useful to extend packet parsing. In particular, it will be necessary to
save some frames during parsing to reparse them after the others.
Change the qc_parse_pkt_frms() return path for the normal and error cases.
Most notably, it allows removing the local variable ret, as the return
value is now hardcoded at the normal and err labels. This also allows
defining a different trace for the error exit path.
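The resulting shape looks roughly like this; the trace event name follows
the QUIC connection traces, the rest is illustrative:

     leave:
            TRACE_LEAVE(QUIC_EV_CONN_PRSHPKT, qc);
            return 1;

     err:
            TRACE_DEVEL("leaving on error", QUIC_EV_CONN_PRSHPKT, qc);
            return 0;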
As reported by Pierre Maoui in GH #2477, it's not possible to render
control chars from variables or expressions verbatim in the payload part
of http-return statements. That's a problem because this part should not
need to be encoded at all (we could even imagine building favicons on
the fly for example).
In fact it is the LOG_OPT_HTTP option, when passed as default options to
parse_logformat_string(), which tells the log encoder that the payload
should be http-encoded using lf_chunk() instead of being printed using the
per-type encoder.
This option was set when parsing logformat expressions for the lf-string
expression under http-return statements, as well as logformat expressions
for the set-map action. While it is true that those actions may only be
used in an HTTP context, the LOG_OPT_HTTP logformat option is not relevant
there, because the payload is expected to be used without being encoded.
So let's simply get rid of this option when parsing logformat expressions
for the set-map action key/value and the lf-string from the http-request
return action, and add a note next to the LOG_OPT_HTTP option to indicate
that it is used to tell the log encoder that the payload should be
HTTP-encoded.
Thanks to Pierre for having reported the issue and Willy for the
analysis and patch proposal.
This was reported by Coverity. In the sample_conv_q_preferred() function, a
goto statement after a "while (1)" loop is unreachable. Instead of just
removing it, the same goto statement in the loop is replaced by a break. It
is safer this way, in case the loop changes in the future.
This patch should fix the issue #2683.
The receiver protocol is always set when a listener is created or cloned, at
least for now. And there is no check on it in most places, except in the
listener_accept() function. So, let's remove the remaining useless checks.
That will avoid false Coverity reports in the future.
This patch should fix the issue #2631.
This bug arrived with this commit:
cdfceb10a MINOR: quic: refactor qc_prep_pkts() loop
which prevents haproxy from sending PING only packets/datagrams (some
packets/datagrams with only PING frame as ack-eliciting frames inside).
Such packets/datagrams are useful in rare cases during retransmissions
when one wants to probe the peer without exceeding the anti-amplification
limit.
Modify the condition passed to qc_build_pkt() to add padding to the current
datagram. One does not want to do that when probing the peer without ack-eliciting
frames passed as <frms> parameter. Indeed qc_build_pkt() calls qc_do_build_pkt()
which supports this case: if <probe> is true (probing required), qc_do_build_pkt()
handles the case where some padding must be added to a PING only packet/datagram.
This is the case when probing with an empty <frms> frame list of ack-eliciting
frames without exceeding the anti-amplification limit from qc_dgrams_retransmit().
Add some comments to qc_build_pkt() and qc_do_build_pkt() to clarify this
as this code is easy to break!
Thanks to @Tristan971 for having reported this issue in GH #2709.
Must be backported to 3.0.
Add a BUG_ON() to detect some malformed packets which are supposed to probe the
peer without being ack-eliciting: the peer would not acknowledge such packets.
These functions return a symbolic error code such as ECONNRESET to keep
logs compact while making them human-readable. It's a good alternative
to the numeric code in that it's more expressive, and a good one to the
full message since it's shorter and more precise (some codes even match
errno names).
The doc was updated so that the symbolic names appear in the table. It
could be useful to backport this feature to help with troubleshooting
some issues, though backporting the doc might possibly be more annoying
in case users have local patches already, so maybe the table update does
not need to be backported in this case.
For a long time the errno values returned by recv/send/splice() were not
translated to connection error codes. There are not that many eligible ones,
and having them would help a lot when debugging some complex issues where
logs disagree with network traces. Let's add them now.
While we get reports of connection setup errors in fc_err/bc_err, we
don't have the equivalent for the recv/send/splice syscalls. Let's
add provisions for new codes that cover the common errno values that
recv/send/splice can return, i.e. ECONNREFUSED, ENOMEM, EBADF, EFAULT,
EINVAL, ENOTCONN, ENOTSOCK, ENOBUFS, EPIPE. We also add a special case
for when the poller reported the error itself. It's worth noting that
EBADF/EFAULT/EINVAL will generally indicate serious bugs in the code
and should not be reported.
The only thing is that it's quite hard to forcefully (and reliably)
trigger these errors in automated tests as the timing is critical.
Using iptables to manually reset established connections in the
middle of large transfers at least permits to see some ECONNRESET
and/or EPIPE, but the other ones are harder to trigger.
"debug dev close <fd>" currently closes that FD using fd_delete() after
checking that it's known from the fdtab. Sometimes we also want to just
perform a pure close() of FDs not in the fdtab (pollers, etc) in order
to provoke certain error cases. The optional "hard" argument to the
command will make it use a plain close() instead of fd_delete() and skip
the fd owner check. The main visible effect when closing a traffic socket
with it is that instead of dying from a double fd_delete() by seeing that
fd.owner is already 0, it will die during the next fd_insert() seeing that
fd.owner was not 0.
It was the only one prefixed with "CO_ERR_", making it harder to batch
process and to look up. It was added in 2.5 by commit 61944f7a73 ("MINOR:
ssl: Set connection error code in case of SSL read or write fatal failure")
so it can be backported as far as 2.6 if needed to help integrate other
patches.
First there is a typo in the directive name, then it is not documented and
finally, it is not used at all. The directive is only removed from the
keyword list. Its parsing function is not updated.
This patch should fix the issue #2601.
We're using a few occurrences of __builtin_prefetch() but tcc doesn't
know about it so let's give it a dummy definition. Now the code builds
and works again with tcc without thread support.
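A hedged sketch of such a fallback definition; the exact guard used in
compiler.h may differ:

    /* tcc lacks __builtin_prefetch(): make it a no-op that still
     * evaluates its address argument
     */
    #if defined(__TINYC__)
    #define __builtin_prefetch(addr, ...) do { (void)(addr); } while (0)
    #endif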