haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-06 15:17:01 +02:00

Author	SHA1	Message	Date
William Manley	366b722f7e	MINOR: rhttp: Don't require SSL when attach-srv name parsing An attach-srv config line usually looks like this: tcp-request session attach-srv be/srv name ssl_c_s_dn(CN) while a rhttp server line usually looks like this: server srv rhttp@ sni req.hdr(host) The server sni argument is used as a key for looking up connection in the connection pool. The attach-srv name argument is used as a key for inserting connections into the pool. For it to work correctly they must match. There was a check that either both the attach-srv and server provide that key or neither does. It also checked that SSL and SNI was activated on the server. However, thanks to current connect_server() implementation, it appears that SNI is usable even without SSL to identify a connection in the pool. Thus, it can be diverted from its original intent in reverse HTTP case to serve even without SSL activated. For example, this could be useful to use `fc_pp_unique_id` as a name expression (DISCLAIMER: note that for now PROXY protocol is not compatible with rhttp). Error is still reported if either SNI or name is used without the other. This patch adjust the message to a more helpful one. Arguably it would be easier to understand if instead of using `name` and `sni` for `attach-srv` and `server` rules it used the same term in both places - like "conn-pool-key" or something. That would make it clear that the two must match.	2024-05-14 16:39:07 +02:00
Aurelien DARRAGON	32f0cd3242	BUG/MINOR: log: smp_rgs array issues with inherited global log directives When a log directive is defined in the global section, each time we use "log global" in a proxy section, the global log directives are duplicated for the current proxy. This works by creating a new proxy logger struct and duplicating every members for each global one. However, smp_rgs logger member is a special pointer member that is allocated when "range" is used on a log directive. Currently, we simply copy the array pointer (from the global one), instead of creating our own copy. Because of that, range log sampling may not work properly in some situations prior to `3f1284560` ("MINOR: log: remove the unused curr_idx in struct smp_log_range") when used in global log directives, for instance: global log 127.0.0.1:5114 format raw sample 1-2,3:4 local0 info # should receive 75% of all proxy logs log 127.0.0.1:5115 format raw sample 4:4 local0 info # should receive 25% of all proxy logs listen proxy1 log global listen proxy2 log global May not work as expected, because curr_idx was stored within smp_rgs array member prior to `3f1284560`, and due to this bug, it happens to be shared between every log directive inherited from a "global" one. The result is that curr_idx counter will not behave properly because the index will be increased globally instead of per-log directive, and it could even suffer from concurrent thread accesses under load since we don't own the global log directive's lock when manipulating it. Another issue that was revealed because of this bug is that the smp_rgs array allocated during config parsing is never freed in free_logger(), resulting in small memory leak during clean exit. To fix these issues all at once, let's properly duplicate smp_rgs logger struct member in dup_logger() like we already do for other special members so that every log directive have its own sms_rgs copy, and then systematically free it in free_logger(). While this bug affects all stable versions (including 2.4), it's probably best to not backport this beyond 2.6 because of `211ea252d` ("BUG/MINOR: logs: fix logsrv leaks on clean exit") prerequisite that first appears in 2.6. [ada: for versions prior to 2.9, `969e212` ("MINOR: log: add dup_logsrv() helper function") and `76acde91` ("BUG/MINOR: log: keep the ref in dup_logger()") must be backported first. Note: Some ctx adjustments should be performed because 'logger' struct used to be named 'logsrv' in the past and 2.9 introduced logger target struct member. Thus it's probably easier to manually apply `76acde91` and the current bugfix by hand directly on top of `969e212`. ]	2024-05-14 12:00:23 +02:00
Aurelien DARRAGON	9d4a44e713	BUG/MINOR: log: fix leak in add_sample_to_logformat_list() error path If add_sample_to_logformat_list() fails to allocate new logformat_node, then we directly jump to error_free label to cleanup the node using free_logformat_node() before returning an error. However if the node failed to allocate, then the sample expression that was allocated just before (not yet assigned) isn't released (free_logformat_node() is a no-op when NULL is provided). Thus if expr wasn't assigned to the node during early failure, then it must be manually released. This bug was introduced by `2462e5bcc` ("BUG/MINOR: log: fix potential lf->name memory leak") which wasn't marked for backports. It only affects 3.0.	2024-05-13 16:44:27 +02:00
Willy Tarreau	0ce51dc93b	MEDIUM: dynbuf: implement emergency buffers The buffer reserve set by tune.buffers.reserve has long been unused, and in order to deal gracefully with failed memory allocations we'll need to resort to a few emergency buffers that are pre-allocated per thread. These buffers are only for emergency use, so every time their count is below the configured number a b_free() will refill them. For this reason their count can remain pretty low. We changed the default number from 2 to 4 per thread, and the minimum value is now zero (e.g. for low-memory systems). The tune.buffers.limit setting has always been a problem when trying to deal with the reserve but now we could simplify it by simply pushing the limit (if set) to match the reserve. That was already done in the past with a static value, but now with threads it was a bit trickier, which is why the per-thread allocators increment the limit on the fly before allocating their own buffers. This also means that the configured limit is saner and now corresponds to the regular buffers that can be allocated on top of emergency buffers. At the moment these emergency buffers are not used upon allocation failure. The only reason is to ease bisecting later if needed, since this commit only has to deal with resource management.	2024-05-10 17:18:13 +02:00
Willy Tarreau	47665be083	MEDIUM: mux-h1: allocate without queuing when retrying Now when trying to allocate a buffer, we can check if we've been notified of availability via the callback, in which case we should not consult the queue, or if we're doing a first allocation and check the queue. At this point it still doesn't change much since the stream still doesn't make use of it but some progress is expected.	2024-05-10 17:18:13 +02:00
Willy Tarreau	b5714b45e8	MEDIUM: stream: allocate without queuing when retrying Now when trying to allocate the work buffer, we can check if we've been notified of availability via the buf_wait callback, in which case we should not consult the queue, or if we're doing a first allocation and check the queue.	2024-05-10 17:18:13 +02:00
Willy Tarreau	f552f79ba5	MINOR: mux-h1: report that a buffer allocation succeeded When the buffer allocation callback is notified of a buffer availability, it will now set a MAYALLOC flag in addition to clearing the ALLOC one, for each of the 3 levels where we may fail an allocation. The flag will be cleared upon a successful allocation. This will soon be used to decide to re-allocate without waiting again in the queue. For now it has no effect. There's just a trick, we need to clear the various *_ALLOC flags before testing h1_recv_allowed() otherwise it will return false!	2024-05-10 17:18:13 +02:00
Willy Tarreau	cb2d758043	MINOR: applet: report about buffer allocation success When appctx_buf_available() is called, it now sets APPCTX_FL_IN_MAYALLOC or APPCTX_FL_OUT_MAYALLOC depending on the reportedly permitted buffer allocation, and these flags are cleared when the said buffers are allocated. For now they're not used for anything else.	2024-05-10 17:18:13 +02:00
Willy Tarreau	17d8916bb1	MINOR: stream: report that a buffer allocation succeeded When the buffer allocation callback is notified of a buffer availability, it will now set a MAYALLOC flag on the stream so that the stream knows it is allowed to bypass the queue checks. For now this is not used.	2024-05-10 17:18:13 +02:00
Willy Tarreau	a160b3c50c	MEDIUM: dynbuf/mux-h1: do not allocate the buffers in the callback One of the problematic designs with the buffer_wait mechanism is that the callbacks pre-allocate the buffers and stay in the run queue for a while, resulting in all of the few buffers being assigned to waiting tasks instead of being all available to one task that needs them all at once. Here we simply stop doing this, the callback clears the waiting flags and wakes the task up so that it has a chance of still finding some buffers.	2024-05-10 17:18:13 +02:00
Willy Tarreau	c510e81a3f	MINOR: dynbuf/mux-h1: use different criticalities for buffer allocations While it could certainly still be improved, this first approach consists in assigning buffers like this in the H1 mux: - h1c->obuf : DB_MUX_TX - h1c->ibuf : DB_MUX_RX - h1s->rxbuf: DB_SE_RX That's done via 3 distinct functions for better code clarity, and it also allowed to move the missing buffer flags assignment there. Among possible improvements would be to take into consideration the state of the parser (i.e. no data yet vs data, or headers vs payload) so that even server beginning of response or pure payload can be lowered in priority.	2024-05-10 17:18:13 +02:00
Willy Tarreau	4ffb3b5ebe	MINOR: applet: set the blocking flag in the buffer allocation function Instead of having each caller of appctx_get_buf() think about setting the blocking flag, better have the function do it, since it's already handling the queue anyway. This way we're sure that both are consistent.	2024-05-10 17:18:13 +02:00
Willy Tarreau	ee0d56ac85	MEDIUM: applet: make appctx_buf_available() only wake the applet up, not allocate Now we don't want bufwait handlers to preallocate the resources they were expecting since it contributes to the shortage. Let's just wake the applet up and that's all.	2024-05-10 17:18:13 +02:00
Willy Tarreau	9a27d7aa6f	MEDIUM: dynbuf/stream: do not allocate the buffers in the callback One of the problematic designs with the buffer_wait mechanism is that the callbacks pre-allocate the buffers and stay in the run queue for a while, resulting in all of the few buffers being assigned to waiting tasks instead of being all available to one task that needs them all at once. Here we simply stop doing this, the callback clears the waiting flags and wakes the task up so that it has a chance of still finding some buffers.	2024-05-10 17:18:13 +02:00
Willy Tarreau	db21062881	MEDIUM: dynbuf/stream: re-enable queueing upon failed buffer allocation The errors were not working fine anyway since we know that upon low memory condition everything freezes. However we have a chance to do better now, so let's start by re-enabling queueing when allocations fail.	2024-05-10 17:18:13 +02:00
Willy Tarreau	f5566afec6	MEDIUM: dynbuf: generalize the use of b_dequeue() to detach buffer_wait Now thanks to this the bufq_map field is expected to remain accurate.	2024-05-10 17:18:13 +02:00
Willy Tarreau	a5d6a79986	MEDIUM: dynbuf: make the buffer_wq an array of list heads Let's turn the buffer_wq into an array of 4 list heads. These are chosen by criticality. The DB_CRIT_TO_QUEUE() macro maps each criticality level into one of these 4 queues. The goal here clearly is to make it possible to wake up the most critical queues in priority in order to let some tasks finish their job and release buffers that others can use. In order to avoid having to look up all queues, a bit map indicates which queues are in use, which also allows to avoid looping in the most common case where queues are empty..	2024-05-10 17:18:13 +02:00
Willy Tarreau	a214197ce7	MINOR: dynbuf: use the b_queue()/b_requeue() functions everywhere The code places that were used to manipulate the buffer_wq manually now just call b_queue() or b_requeue(). This will simplify the multiple list management later.	2024-05-10 17:18:13 +02:00
Willy Tarreau	72d0dcda8e	MINOR: dynbuf: pass a criticality argument to b_alloc() The goal is to indicate how critical the allocation is, between the least one (growing an existing buffer ring) and the topmost one (boot time allocation for the life of the process). The 3 tcp-based muxes (h1, h2, fcgi) use a common allocation function to try to allocate otherwise subscribe. There's currently no distinction of direction nor part that tries to allocate, and this should be revisited to improve this situation, particularly when we consider that mux-h2 can reduce its Tx allocations if needed. For now, 4 main levels are planned, to translate how the data travels inside haproxy from a producer to a consumer: - MUX_RX: buffer used to receive data from the OS - SE_RX: buffer used to place a transformation of the RX data for a mux, or to produce a response for an applet - CHANNEL: the channel buffer for sync recv - MUX_TX: buffer used to transfer data from the channel to the outside, generally a mux but there can be a few specificities (e.g. http client's response buffer passed to the application, which also gets a transformation of the channel data). The other levels are a bit different in that they don't strictly need to allocate for the first two ones, or they're permanent for the last one (used by compression).	2024-05-10 17:18:13 +02:00
Amaury Denoyelle	cc9827bb09	BUG/MEDIUM: mux-quic: fix crash on STOP_SENDING received without SD Abort reason code received on STOP_SENDING is notified to upper layer since the following commit : `367ce1ebf3` MINOR: mux-quic: Set tha SE abort reason when a STOP_SENDING frame is received However, this causes a crash when a STOP_SENDING is received on a QCS instance without any stream instantiated. Fix this by checking first if qcs->sd is not NULL before setting abort code. This bug can easily be reproduced by emitting a STOP_SENDING as first frame of a stream. This should fix github issue #2563. This does not need to be backported.	2024-05-10 11:01:05 +02:00
Aurelien DARRAGON	fbbc2925d4	BUG/MEDIUM: log/ring: broken syslog octet counting As reported by Tristan in GH #2561, syslog messages sent over rings are malformed since commit `01aa0a05` ("MEDIUM: ring: change the ring reader to use the new vector-based API now"). Indeed, take a look at the following log message produced prior to `01aa0a05`: 181 <134>1 2024-05-07T09:45:21.543263+02:00 - haproxy 113700 - - 127.0.0.1:56136 [07/May/2024:09:45:21.491] front front/s1 0/0/21/30/51 404 369 - - ---- 1/1/0/0/0 0/0 "GET / HTTP/1.1" Starting with `01aa0a05`, here's the equivalent log message: <134>1 2024-05-07T09:45:21.543263+02:00 - haproxy 112729 - - 127.0.0.1:56136 [07/May/2024:09:45:21.491] front front/s1 0/0/66/39/105 404 369 - - ---- 1/1/0/0/0 0/0 "GET / HTTP/1.1"-fwr -> Message is missing octet counting header, and garbage bytes are found at the end of the payload. This bug is caused by a small mistake in syslog_applet_append_event(): when the function was refactored to use vector API instead of buffer API, we used 'trash.area' as starting pointer to write the event instead of 'trash.area + trash.data', causing existing octet counting prefix (already written in trash) to be overwritten and trash.data to be wrongly incremented. No backport needed (`01aa0a05` was introduced during 3.0 development)	2024-05-07 19:23:01 +02:00
Christopher Faulet	bd47e344b8	MINOR: connection: Add samples to retrieve info on streams for a connection Thanks to the previous fix, it is now possible to get the number of opened streams for a connection and the negociated limit. Here, corresponding sample feches are added, in fc_ and bc_ scopes. On frontend side, the limit of streams is imposed by HAProxy. But on the backend side, the limit is defined by the server. it may be useful for debugging purpose because it may explain slow-downs on some processing.	2024-05-06 22:00:01 +02:00
Christopher Faulet	eca9831ec8	MINOR: muxes: Add ctl commands to get info on streams for a connection There are 2 new ctl commands that may be used to retrieve the current number of streams openned for a connection and its limit (the maximum number of streams a mux connection supports). For the PT and H1 muxes, the limit is always 1 and the current number of streams is 0 for idle connections, otherwise 1 is returned. For the H2 and the FCGI muxes, info are already available in the mux connection. For the QUIC mux, the limit is also directly available. It is the maximum initial sub-ID of bidirectional stream allowed for the connection. For the current number of streams, it is the number of SC attached on the connection and the number of not already attached streams present in the "opening_list" list.	2024-05-06 22:00:00 +02:00
Christopher Faulet	12fb6d73cd	MINOR: mux-quic: Add .ctl callback function to get info about a mux connection Other muxes implement this callback function. It was not implemented for the QUIC mux because it was useless. It will be used to retrieve the current/max number of stream for a quic connection. So let's added it, adding the default support for MUX_CTL_EXIT_STATUS command.	2024-05-06 22:00:00 +02:00
Christopher Faulet	068ce2d5d2	MINOR: stconn: Add samples to retrieve about stream aborts It is now possible to retrieve some info about the abort received for a server or a client stream, if any. * fs.aborted and bs.aborted can be used to know if an abort was received on frontend or backend side. A boolean is returned. * fs.rst_code and bs.rst_code return the code of the received RESET_STREAM frame for a H2 stream or the code of the received STOP_SENDING frame for a QUIC stream. In both cases, the error code attached to the frame is returned. The sample fetch fails if no such frame was received or if the stream is not an H2/QUIC stream.	2024-05-06 22:00:00 +02:00
Christopher Faulet	367ce1ebf3	MINOR: mux-quic: Set tha SE abort reason when a STOP_SENDING frame is received When STOP_SENDING frame is received for a quic stream, the error code is now saved in the SE abort reason. To do so, we use the QUIC source (SE_ABRT_SRC_MUX_QUIC). For now, this code is only set but not used on the opposite side.	2024-05-06 22:00:00 +02:00
Christopher Faulet	20b156ee15	MEDIUM: mux-h2: Forward h2 client cancellations to h2 servers When a H2 client sends a RST_STREAM(CANCEL) frame to abort a request, the abort reason is now used on server side, in the H2 mux, to set the RST_STREAM code. The main use case is to forward client cancellations to gRPC applications. This patch should fix the issue #172.	2024-05-06 22:00:00 +02:00
Christopher Faulet	dea79f3fe1	MINOR: mux-h2: Set the SE abort reason when a RST_STREAM frame is received When RST_STREAM frame is received, the error code is now saved in the SE abort reason. To do so, we use the H2 source (SE_ABRT_SRC_MUX_H2). For now, this code is only set but not used on the opposite side.	2024-05-06 22:00:00 +02:00
Christopher Faulet	96f8b7ad08	MEDIUM: stconn/muxes: Add an abort reason for SE shutdowns on muxes A reason is now passed as parameter to muxes shutdowns to pass additional info about the abort, if any. No info means no abort or only generic one. For now, the reason is composed of 2 32-bits integer. The first on represents the abort code and the other one represents the info about the code (for instance the source). The code should be interpreted according to the associated info. One info is the source, encoding on 5 bits. Other bits are reserverd for now. For now, the muxes are the only supported source. But we can imagine to extend it to applets, streams, health-checks... The current design is quite simple and will most probably evolved.. But the idea is to let the opposite side forward some errors and let's a mux know why its stream was aborted. At first glance, a abort reason must only be evaluated if SE_SHW_SILENT flag is set. The main goal at short term, is to forward some H2 RST_STREAM codes because it is mandatory for gRPC applications, mainly to forward gRPC cancellation from an H2 client to an H2 server. But we can imagine to alter this reason at the applicative level to enrich it. It would also be used to report more accurate errors in logs.	2024-05-06 22:00:00 +02:00
Patrick Hemmer	28489021b3	BUG/MINOR: cfgparse: use curproxy global var from config post validation Previously check_config_validity() had its own curproxy variable. This resulted in the acl() sample fetch being unable to determine which proxy was in use when used from within log-format statements. This change addresses the issue by having the check_config_validity() function use the global variable instead.	2024-05-06 18:45:47 +02:00
Patrick Hemmer	93d4e99714	BUG/MINOR: acl: support built-in ACLs with acl() sample Built-in ACLs were not being searched by the acl() sample fetch. This fixes that so they are searched if no other match is found.	2024-05-06 18:42:54 +02:00
Valentine Krasnobaeva	4a9e3e102e	BUG/MINOR: haproxy: only tid 0 must not sleep if got signal This patch fixes the commit `eea152ee68` ("BUG/MINOR: signals/poller: ensure wakeup from signals"). There is some probability that run_poll_loop() becomes inifinite, if TH_FL_SLEEPING is withdrawn from all threads in the second signal_queue_len check, when a signal has received just after the first one. In such particular case, the 'wake' variable, which is used to terminate thread's poll loop is never reset to 0. So, we never enter to the "stopping" part of the run_poll_loop() and threads, except the one with id 0 (tid 0 handles signals), will continue to call _do_poll() eternally and will never sleep, as its TH_FL_SLEEPING flag was unset. This flag needs to be removed only for the tid 0, as it was done in the first signal_queue_len check. This fixes an issue #2537 "infinite loop when shutting down". This fix must be backported in every stable version.	2024-05-06 18:39:08 +02:00
Aurelien DARRAGON	03ca16f38b	OPTIM: log: resolve logformat options during postparsing In lf_buildctx_prepare(), we perform costly bitwise operations for every nodes to resolve node options and check for incompatibilities with global options. In fact, all this logic may safely be performed during postparsing. This is what we're doing in this commit. Doing so saves us from unnecessary runtime checks and could help speedup sess_build_logline(). Since checks are not as costly as before (due to them being performed during postparsing and not on log building path anymore), an complementary check for OPT_HTTP vs OPT_ENCODE incompatibity was added: encoding is ignored if HTTP option is set, unless HTTP option wasn't set globally and encoding was set globally, which means encoding takes the precedence Thanks to this patch, lf_buildctx_prepare() now only takes care of assigning proper typecast and options settings depending if it's used from global or per-node context, and prepares CBOR-specific structure members when CBOR encode option is set.	2024-05-06 11:13:46 +02:00
Ilia Shipitsin	a7cf2454dd	BUILD: clock: improve check for pthread_getcpuclockid() if _POSIX_THREAD_CPUTIME is greater than 0, pthread_getcpuclockid() is implemented. This should fix the build on Solaris 11. Reference: https://docs.oracle.com/cd/E88353_01/html/E37842/unistd-3head.html ML: https://www.mail-archive.com/haproxy@formilux.org/msg44915.html	2024-05-06 08:25:17 +02:00
Aurelien DARRAGON	d26a160133	OPTIM: log: speedup date printing in sess_build_logline() when no encoding is used In sess_build_logline(), we have multiple fieds such as '%t' that build a fixed-length string out of a date struct and then print it using lf_rawtext(). In fact, printing it using lf_rawtext() is only mandatory to deal with encoding options, but when no encoding is used we can output the result to tmplog directly. Since most dates generate between 25 and 30 chars, doing so spares us from writing them twice and could help make sess_build_logline() a bit faster when no encoding is used. (to match with pre-encoding patch series performance).	2024-05-04 10:13:05 +02:00
Aurelien DARRAGON	bf3b4001ce	OPTIM: log: use lf_buildctx's buffer instead of temporary stack buffers Now that lf_buildctx isn't pushed on the stack anymore, let's take this opportunity to store a small buffer of 256 bytes within it, and then use this buffer as general purpose buffer to build fixed-length strings that are then printed using lf_{raw}text() function. By doing so we stop relying on temporary stack buffers.	2024-05-04 10:13:05 +02:00
Aurelien DARRAGON	ccc4341258	OPTIM: log: use thread local lf_buildctx to stop pushing it on the stack Following previous commit's logic, let's move lf_buildctx ctx away from sess_build_logline() to stop abusing from the stack to push large structure each time sess_build_logline() is called. Also, don't memset the structure for each invokation, but only reset members explicitly when required. For that we now declare one static lf_buildctx per thread (using THREAD_LOCAL) and make sess_build_logline() refer to it using a pointer.	2024-05-04 10:13:05 +02:00
Aurelien DARRAGON	728b5aa835	OPTIM: log: declare empty buffer as global variable 'empty' buffer used in sess_build_logline() inside a loop, and since it is only being read from and not modified, until recently it ended up being cached most of the time and didn't cause overhead due to systematic push on the stack. However, due recent encoding work and new added variables on the stack, we're starting to reach a stack limit and declaring 'empty' buffer within the loop seems to cause non-negligible CPU overhead. Since the variable isn't modified during log generation, let's declare 'empty' buffer as a global variable outside from sess_build_logline() to prevent pushing it on the stack for each node evaluation.	2024-05-04 10:13:05 +02:00
Aurelien DARRAGON	cc2e94a948	BUG/MINOR: log: prevent double spaces emission in sess_build_logline() Christian reported in GH #2556 that since 3.0-dev double spaces may be found in log messages on some cases where it was not the case before. As we were able to easily reproduce, a quick bisect led us to `c6a7138` ("MINOR: log: simplify last_isspace in sess_build_logline()"). While it is true that all switch cases set the last_isspace variable to 0, there was a subtelty for some fields such as '%hr', '%hrl', '%hs' or '%hsl' and I overlooked it. Indeed, for '%hr', last_isspace was only set to 0 if data was emitted, else the assignment didn't occur. But with `c6a7138`, last_isspace is always set to 0 as long as the current node type is not a separator. Because of that, if no data is emitted for the current node value, and a space was already emitted prior to the current node, then an extra space could be emitted after the node, resulting in two spaces being emitted. Note that while `c6a7138` introduces a slight behavior regression regarding last_isspace logic with the specific fields mentionned above, this behavior could already be triggered with a failing or empty logformat node sample expression. Consider this logformat expression: log-format "%{-M}o \| %[str()] \|" str() will not print anything, and since we disabled mandatory option with '-M', nothing gets printed for the node sample expression. As a result, we have the following output: "\| \|" Instead of (when mandatory option is enabled): "\| - \|" Thus in order to stick to the historical behavior, systematically set last_isspace to 0 for EXPR nodes, and only set last_isspace to 0 when data was written for TAG nodes. This way, '%hr', '%hrl', '%hs' or '%hsl' should behave as before. No backport needed.	2024-05-03 16:48:21 +02:00
Aurelien DARRAGON	48e0efb00b	MEDIUM: log: optimizing tmp->type handling in sess_build_logline() Instead of chaining 2 switchcases and performing encoding checks for all nodes let's actually split the logic in 2: first handle simple node types (text/separator), and then handle dynamic node types (tag, expr). Encoding options are only evaluated for dynamic node types. Also, last_isspace is always set to 0 after next_fmt label, since next_fmt label is only used for dynamic nodes, thus != LOG_FMT_SEPARATOR. Since LF_NODE_WITH_OPT() macro (which was introduced recently) is now unused, let's get rid of it. No functional change should be expected. (Use diff -w to check patch changes since reindentation makes the patch look heavy, but in fact it remains fairly small)	2024-05-03 16:48:21 +02:00
Ilia Shipitsin	a65c6d3574	CLEANUP: assorted typo fixes in the code and comments This is 42nd iteration of typo fixes	2024-05-03 09:01:36 +02:00
Amaury Denoyelle	53782b9ea5	MINOR: stats: extract proxy clear-counter in a dedicated function Split code related to proxies list looping in cli_parse_clear_counters() to a new dedicated function. This function is placed in the new module stats-proxy.	2024-05-02 16:43:26 +02:00
Amaury Denoyelle	f0644d1bd7	REORG: stats: define stats-proxy source module Create a new module stats-proxy. Move stats functions related to proxies list looping in it. This allows to reduce stats source file dividing its size by half.	2024-05-02 16:42:36 +02:00
William Lallemand	271def959c	MINOR: ssl: rename ocsp_update.http_proxy into ocsp-update.httpproxy Rename to the option to have a more consistent name.	2024-05-02 16:32:06 +02:00
William Lallemand	964f093504	CLEANUP: ssl: rename new_ckch_store_load_files_path() to ckch_store_new_load_files_path() Rename the new_ckch_store_load_files_path() function to ckch_store_new_load_files_path(), in order to be more consistent.	2024-05-02 16:03:20 +02:00
Amaury Denoyelle	10ab56831e	MINOR: stats: convert age as generic column for proxy stat Convert FN_AGE in stat_cols_px[] as generic columns. These values will be automatically used for dump/preload of a stats-file. Remove srv_lastsession() / be_lastsession() function which are now useless as last_sess is calculated via me_generate_field().	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	e92ae8f0ba	MINOR: stats: support age in stats-file Extend generic stat column support to be able to fully support age stats type. Several changes were required. On output, me_generate_field() has been updated to report the difference between the current tick with the stored value for FN_AGE type. Also, if an age stats is hidden in show stats, -1 is returned instead of an empty metric, which is the value to mark an age as unset. On counters preload, load_ctr() was updated to handled FN_AGE. A similar substraction is performed to the current tick value.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	634cc2a5d8	MINOR: counters: move last_change into counters struct last_change was a member present in both proxy and server struct. It is used as an age statistics to report the last update of the object. Move last_change into fe_counters/be_counters. This is necessary to be able to manipulate it through generic stat column and report it into stats-file. Note that there is a change for proxy structure with now 2 different last_change values, on frontend and backend side. Special care was taken to ensure that the value is initialized only on the proxy side. The other value is set to 0 unless a listen proxy is instantiated. For the moment, only backend counter is reported in stats. However, with now two distinct values, stats could be extended to report it on both side.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	9b35e1f30c	MINOR: stats: convert rate as generic column for proxy stats Convert every FN_RATE in stat_cols_px[] to generic column. Thanks to prior patch, this allows to automatically dump their value into stats-file and preload corresponding freq-ctr on process startup.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	fec2ae9b76	MINOR: stats: support rate in stats-file Implement support for FN_RATE stat column into stat-file. For the output part, only minimal change is required. Reuse the function read_freq_ctr() to print the same value in both stats output and stats-file dump. For counter preloading, define a new utility function preload_freq_ctr(). This can be used to initialize a freq-ctr type by preloading previous period value. Reuse this function in load_ctr() during stats-file parsing. At the moment, no rate column is defined as generic. Thus, this commit does not have functional change. This will be changed as soon as FN_RATE are converted to generic columns.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	639e73f8f2	MINOR: counters: move freq-ctr from proxy/server into counters struct Move freq-ctr defined in proxy or server structures into their dedicated fe_counters/be_counters struct. Functionnaly no change here. This commit will allow to convert rate stats column to generic one, which is mandatory to manipulate them in the stats-file.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	4e9e841878	MINOR: stats: prepare stats-file support for values other than FN_COUNTER Currently, only FN_COUNTER are dumped and preloaded via a stats-file. Thus in several places we relied on the assumption that only FN_COUNTER are valid in stats-file context. New stats types will soon be implemented as they are also eligilible to statistics reloading on process startup. Thus, prepare stats-file functions to remove any FN_COUNTER restriction. As one of this change, generate_stat_tree() now uses stcol_is_generic() for stats name tree indexing before stats-file parsing. Also related to stats-file parsing, individual counter preloading step as been extracted from line parsing in a dedicated new function load_ctr(). This will allow to extend it to support multiple mechanism of counter preloading depending on the stats type.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	933b4ae27d	MINOR: stats: convert req_tot as generic column req_tot counter is a special case as it is not managed identically between frontend and backend side. For the backend side, this metric is available directly into be_counters, which allows to use a generic stat column definition. On the frontend side however, the metric value is an aggredate of multiple fe_counters value. This is the case since the splitting between HTTP version introduced in the following patch : `9969adbcdc` MINOR: stats: add by HTTP version cumulated number of sessions and requests This difference cannot be handled automatically by me_generate_field(). Add a special case in the function to produce it on frontend side reusing the aggregated value. This not done however for stats-file as there is no counter to preload.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	56e6c57aa1	MINOR: stats: fix visual alignment for stat_cols_px definition Simply adjust visual alignment in definition of proxy stats columns definition for ST_I_PX_HANAFAIL column.	2024-05-02 10:55:25 +02:00
William Lallemand	3a19698b81	CLEANUP: ssl: move the global ocsp-update options parsing to ssl_ocsp.c Move the global tunel.ssl.ocsp-update option parsing to ssl_ocsp.c.	2024-05-02 10:48:05 +02:00
William Lallemand	622c635815	CLEANUP: ssl: clean the includes in ssl_ocsp.c Clean the includes in ssl_ocsp.c which were copied from ssl_sock.c and are not relevant anymore. Also move the include in the right order.	2024-05-02 10:35:27 +02:00
Valentine Krasnobaeva	5cbb278fae	MINOR: capabilities: add cap_sys_admin support If 'namespace' keyword is used in the backend server settings or/and in the bind string, it means that haproxy process will call setns() to change its default namespace to the configured one and then, it will create a socket in this new namespace. setns() syscall requires CAP_SYS_ADMIN capability in the process Effective set (see man 2 setns). Otherwise, the process must be run as root. To avoid to run haproxy as root, let's add cap_sys_admin capability in the same way as we already added the support for some other network capabilities. As CAP_SYS_ADMIN belongs to CAP_SYS_* capabilities type, let's add a separate flag LSTCHK_SYSADM for it. This flag is set, if the 'namespace' keyword was found during configuration parsing. The flag may be unset only in prepare_caps_for_setuid() or in prepare_caps_from_permitted_set(), which inspect process EUID/RUID and Effective and Permitted capabilities sets. If system doesn't support Linux capabilities or 'cap_sys_admin' was not set in 'setcap', but 'namespace' keyword is presented in the configuration, we keep the previous strict behaviour. Process, that has changed uid to the non-priviledged user, will terminate with alert. This alert invites the user to recheck its configuration. In the case, when haproxy will start and run under a non-root user and 'cap_sys_admin' is not set, but 'namespace' keyword is presented, this patch does not change previous behaviour as well. We'll still let the user to try its configuration, but we inform via warning, that unexpected things, like socket creation errors, may occur.	2024-04-30 21:40:17 +02:00
Valentine Krasnobaeva	13ef552488	MINOR: sock: add EPERM case in sock_handle_system_err setns() may return EPERM if thread, that tries to move into different namespace, do not have CAP_SYS_ADMIN capability in its Effective set. So, extending sock_handle_system_err() with this error allows to send appropriate log message and set SF_ERR_PRXCOND (SC termination flag in log) as stream termination error code. This error code can be simply checked with SF_ERR_MASK at protocol layer.	2024-04-30 21:39:32 +02:00
Valentine Krasnobaeva	d3fc982cd7	MEDIUM: proto: make common fd checks in sock_create_server_socket quic_connect_server(), tcp_connect_server(), uxst_connect_server() duplicate same code to check different ERRNOs, that socket() and setns() may return. They also duplicate some runtime condition checks, applied to the obtained server socket fd. So, in order to remove these duplications and to improve code readability, let's encapsulate socket() and setns() ERRNOs handling in sock_handle_system_err(). It must be called just before fd's runtime condition checks, which we also move in sock_create_server_socket by the same reason.	2024-04-30 21:39:24 +02:00
Valentine Krasnobaeva	772d070ab5	MINOR: sock_set_mark: take sock family in account SO_MARK, SO_USER_COOKIE, SO_RTABLE socket options (used to set the special mark/ID on socket, in order to perform mark-based routing) are only supported by AF_INET sockets. So, let's check socket address family, when we enter into this function.	2024-04-30 21:38:29 +02:00
Valentine Krasnobaeva	d602d568e0	MEIDUM: unix sock: use my_socketat to create bind socket As UNIX Domain sockets could be attached to Linux namespaces (see more details about it from the Linux kernel patch set below: https://lore.kernel.org/netdev/m1hbl7hxo3.fsf@fess.ebiederm.org), it is better to use my_socket_at() in order to create UNIX listener's socket. my_socket_at() takes in account a network namespace, that may be configured for a frontend in the bind line: frontend fe_foo ... bind uxst@frontend.sock user haproxy group haproxy mode 660 namespace frontend Like this, namespace aware applications as netstat for example, will see this listening socket in its 'frontend' namespace and not in the root namespace as it was before. It is important to mention, that fixes in Linux kernel referenced above allow to connect to this listener's socket from the root and from any other namespace. UNIX Domain socket is protected by its permission set, which must be set with caution on its inode.	2024-04-30 21:38:24 +02:00
Valentine Krasnobaeva	84babc93ce	MEDIUM: proto_uxst: take in account server namespace As UNIX Domain sockets could be attached to Linux namespaces (see more details about it from the Linux kernel patch set below: https://lore.kernel.org/netdev/m1hbl7hxo3.fsf@fess.ebiederm.org), it is better to use sock_create_server_socket() in UNIX stream protocol implementation, as this function calls my_socket_at() and the latter takes in account server network namespace, which could be configured as in example below: backend be_bar ... server rpicam0 /run/ustreamer.sock namespace foonet So, for UNIX Domain socket, used as an address of some backend server, this patch makes possible to perform connect() to this backend server from the same network namespace, where the server is running, or where its listening socket was created. Using sock_create_server_socket() in UNIX stream protocol implementation also makes the code of uxst_connect_server() more uniform with tcp_connect_server() and quic_connect_server().	2024-04-30 21:38:18 +02:00
Valentine Krasnobaeva	a0b5324cff	MINOR: sock: rename sock to sock_fd in sock_create_server_socket Renaming sock to sock_fd makes it more clear, that sock_create_server_socket returns the fd of newly created server socket and then we check this fd. As we heavily use "fd" variable name in all protocol implementations, let's prefix this one with the name of its object file: sock.o.	2024-04-30 21:38:12 +02:00
Willy Tarreau	072686dafd	BUG/MINOR: stconn: don't wake up an applet waiting on buffer allocation Since the extension of the buffers API to applets in 3.0-dev, an applet may find itself unable to allocate a buffer, and will block respectively on APPCTX_FL_OUTBLK_ALLOC or APPCTX_FL_INBLK_ALLOC depending on the direction. However the code in sc_applet_process() doesn't consider this situation when deciding to wake up an applet, so when the condition arises, the applet keeps ringing and is killed by the loop detector. The fix is trivial and simply consists in checking for the flags above. No backport is needed since this is new in 3.0.	2024-04-30 21:36:47 +02:00
Aurelien DARRAGON	12d08cf912	BUG/MEDIUM: log: don't ignore disabled node's options In `3f2e8d0ed` ("MEDIUM: log: lf_* build helpers now take a ctx argument") I made a mistake, because starting with this commit it is no longer possible from a node to disable global logformat options. The result is that when an option is set globally, it cannot be disabled anymore. For instance, it is not possible to do this anymore: log-format "%{+X}o %{-X}Ts" The original intent was to prevent encoding options from being disabled once enabled globally, because when encoding is enabled globally we start the object enumeration right away (ie: in CBOR and JSON we announce dynamic map, and for each node we announce the key..), thus it doesn't make sense to mix encoding types there, unless encoding is only used per-node, in which case only the value gets encoded, thus it remains possible to print a value in JSON/CBOR-compatible format while the next one shouldn't be printed as-is. Thus, to restore the original behavior, slightly change the logic in lf_buildctx_prepare() so that only global encoding options take the precedence over node's options (instead of all options). No backport needed.	2024-04-30 18:45:07 +02:00
Aurelien DARRAGON	41d7e82e0f	MINOR: log/cbor: _lf_cbor_encode_byte() explicitly requires non-NULL ctx (again) The BUG_ON() statement that was added in `9bdea51` ("MINOR: log/cbor: _lf_cbor_encode_byte() explicitly requires non-NULL ctx") isn't sufficient as Coverity still thinks the lf_buildctx itself may be NULL as shown in GH #2554. In fact the original reports complains about the lf_buildctx itself and I didn't understand it properly, let's add another check in the BUG_ON() to ensure both cbor_ctx and cbor_ctx->ctx are not NULL since it is not expected if used properly.	2024-04-30 10:10:35 +02:00
Aurelien DARRAGON	9931a62c3f	BUG/MINOR: log: fix global lf_expr node options behavior (2nd try) In `98b44e8` ("BUG/MINOR: log: fix global lf_expr node options behavior"), I properly restored global node options behavior for when encoding is not used, however the fix is not optimal when encoding is involved: Indeed, encoding logic in sess_build_logline() relies on global node options to know if encoding must be handled expression-wide or individually. However, because of the above fix, if an expression is made of 1 or multiple nodes that all set an encoding option manually (without '%o'), we consider that the option was set globally, but that's probably not what the user intended. Instead we should only evaluate global options from '%o', so that it remains possible to skip global encoding when needed. No backport needed.	2024-04-30 10:10:35 +02:00
Aurelien DARRAGON	97240d01b3	BUG/MINOR: log/encode: fix potential NULL-dereference in LOGCHAR() When CBOR encoding was added in `c614fd3b9` ("MINOR: log: add +cbor encoding option"), in LOGCHAR(), we forgot to check that we don't assign the NULL value to tmplog (as we assume that tmplog cannot be NULL at the end of sess_build_logline()) No backport needed.	2024-04-30 10:10:35 +02:00
Aurelien DARRAGON	949ac95aa6	BUG/MINOR: log/encode: consider global options for key encoding In sess_build_logline(), contrary to what's stated in the comment "only consider global ctx for key encoding", we check for LOG_OPT_ENCODE flag on the current ctx options instead of global ones. Because of this, we could end up doing the wrong thing if the previous node had encoding enabled but it isn't set globally for instance. To fix the issue, let's simply check the presence of the flag on g_options before entering the "key encoding" block. This bug was introduced with `3f7c8387` ("MINOR: log: add +json encoding option"), no backport needed.	2024-04-30 10:10:35 +02:00
William Lallemand	6b634c4779	MINOR: ssl: introduce ocsp_update.http_proxy for ocsp-update keyword The ocsp_update.http_proxy global option allows to set an HTTP proxy address which will be used to send the OCSP update request with an absolute form URI.	2024-04-29 17:23:02 +02:00
William Lallemand	95949e6868	MINOR: httpclient: allow to use absolute URI with new flag HC_F_HTTPROXY The new HC_F_HTTPPROXY flag allows to use an absolute URI within a request that won't be modified in order to use an http proxy.	2024-04-29 17:10:47 +02:00
Aurelien DARRAGON	9bdce67585	CLEANUP: log: add a macro to know if a lf_node is configurable LF_NODE_WITH_OPT(node) returns true if the node's option may be set and thus should be considered. Logic is based on logformat node's type: for now only TAG and FMT nodes can be configured.	2024-04-29 14:47:37 +02:00
Aurelien DARRAGON	98b44e8edb	BUG/MINOR: log: fix global lf_expr node options behavior In `507223d5` ("MINOR: log: global lf_expr node options"), a mistake was made because it was assumed that only the last occurence of %o (LOG_FMT_GLOBAL) should be kept as global node options. However, although not documented, it is possible to have multiple %o within a single logformat expression to change the global settings on the fly. For instance, consider this example: log-format "%{+X}o test1=%ms %{-X}o test2=%ms %{+X}o test3=%ms" Prior to `3f2e8d0ed` ("MEDIUM: log: lf_* build helpers now take a ctx argument"), this would output something like this: test1=18B test2=395 test3=18B This is because global options is properly updated as the lf_expr string is parsed. But now due to `507223d5` and `3f2e8d0ed`, only the last %o occurence is considered. With the above example, this gives: test1=18B test2=18B test3=18B To restore historical behavior, let's partially revert `507223d5`: to compute global node options, we now start with all options enabled and then for each configurable node in lf_expr_postcheck(), we keep options common to the current node and previous nodes using AND masking, this way we really end up with options common to all nodes. No backport needed.	2024-04-29 14:47:37 +02:00
Aurelien DARRAGON	9bdea51d7e	MINOR: log/cbor: _lf_cbor_encode_byte() explicitly requires non-NULL ctx As shown in GH #2550, Coverity is tempted to think that NULL-dereference can occur in _lf_cbor_encode_byte() due to user-ctx being dereferenced from cbor_ctx, while coverity thinks that cbor_ctx may be NULL. In practise this cannot happen, because _lf_cbor_encode_byte() is only leveraged through a function pointer that is set in conjunction with the function pointer ctx (which ain't NULL). All this logic is done inside lf_buildctx_prepare() when LOG_OPT_ENCODE_CBOR is set. Since coverity doesn't seem to understand the logic properly, then it might as well confuse humans, so let's make it clear in _lf_cbor_encode_byte() that we expect non-NULL ctx by adding a BUG_ON()	2024-04-29 14:47:37 +02:00
Aurelien DARRAGON	0e2aea8224	CLEANUP: tools/cbor: rename cbor_encode_ctx struct members Rename e_byte_fct to e_fct_byte and e_fct_byte_ctx to e_fct_ctx, and adjust some comments to make it clear that e_fct_ctx is here to provide additional user-ctx to the custom cbor encode function pointers. For now, only e_fct_byte function may be provided, but we could imagine having e_fct_int{16,32,64}() one day to speed up the encoding when we know we can encode multiple bytes at a time, but for now it's not worth the hassle.	2024-04-29 14:47:37 +02:00
Amaury Denoyelle	20bc42e697	BUG/MINOR: stats: replace objt_* by __objt_* macros Update parse_stat_line() used during stats-file parsing. For each line, GUID is extracted first to access to the object instance. obj_type() is then invoked to retrieve the correct object type. Replace objt_* by __objt_* macros to mark its result as safe and non NULL. This should fix coverity report from github issue #2550. No need to backport.	2024-04-29 14:21:10 +02:00
Remi Tricot-Le Breton	0610f52bcd	BUG/MEDIUM: cache: Vary not working properly on anything other than accept-encoding If a response varies on anything other than accept-encoding (origin or referer) but still contains an 'Encoding' header, the cached responses were never sent back. This is because of the 'set_secondary_key_encoding' call that always filled the accept-encoding part of the secondary signature with the response's actual encoding, regardless of whether the response varies on this or not. This meant that the accept-encoding part of the signature could be non-null in the cached entry which made the 'get_secondary_entry' calls in 'http_action_req_cache_use' always fail because in those cases the request's secondary signature always had a null accept-encoding part. This patch can be backported up to branch 2.4.	2024-04-29 10:41:46 +02:00
Willy Tarreau	b957e741b0	MINOR: cli/wait: rename the condition "srv-unused" to "srv-removable" As previously discussed, "srv-unused" is sufficiently ambiguous to cause some trouble over the long term. Better use "srv-removable" to indicate that the server is removable, and if the conditions to delete a server change over time, the wait condition will be adjusted without renaming it.	2024-04-27 09:36:36 +02:00
Willy Tarreau	bc236ad133	CLEANUP: dynbuf: move the reserve and limit parsers to dynbuf.c I just added a new setting to set the number of reserved buffer, to discover we already had one... Let's move the parsing of this keyword (tune.buffers.reserve) and tune.buffers.limit to dynbuf.c where they should be.	2024-04-27 09:36:36 +02:00
Aurelien DARRAGON	c33b857df9	MINOR: log: support true cbor binary encoding CBOR in hex format as implemented in previous commit is convenient because the produced output is portable and can easily be embedded in regular syslog payloads. However, one of the goal of CBOR implementation is to be able to produce "Concise Binary" object representation. Here is an excerpt from cbor.io website: "Some applications also benefit from CBOR itself being encoded in binary. This saves bulk and allows faster processing." Currently we don't offer that with '+cbor', quite the opposite actually since a text string encoded with '+cbor' option will be larger than a text string encoded with '+json' or without encoding at all, because for each CBOR binary byte, 2 characters will be emitted. Hopefully, the sink/log API allows for binary data to be passed as parameter, this is because all relevant functions in the chain don't rely on the terminating NULL byte and take a string pointer + string length as parameter. We can actually rely on this property to support the '+bin' option when combined with '+cbor' to produce RAW binary CBOR output. Be careful though, as this is only intended for use with set-var-fmt or to send binary data to capable UDP/ring endpoints. Example: log-format "%{+cbor,+bin}o %(test)[bin(00AABB)]" Will produce: bf64746573745f4300aabbffff (output was piped to `hexdump -ve '1/1 "%.2x"'` to dump raw bytes as HEX characters) With cbor.me pretty printer, it gives us: BF # map() 64 # text(4) 74657374 # "test" 5F # bytes() 43 # bytes(3) 00AABB # "\u0000\xAA\xBB" FF # primitive() FF # primitive()	2024-04-26 18:39:32 +02:00
Aurelien DARRAGON	c614fd3b9f	MINOR: log: add +cbor encoding option In this patch, we make use of the CBOR (RFC8949) encode helper functions from the previous commit to implement '+cbor' encoding option for log- formats. The logic behind it is pretty similar to '+json' encoding option, except that the produced output is a CBOR payload written in HEX format so that it remains compatible to use this with regular syslog endpoints. Example: log-format "%{+cbor}o %[int(4)] test %(named_field)[str(ok)]" Will produce: BF6B6E616D65645F6669656C64626F6BFF Detailed view (from cbor.me): BF # map() 6B # text(11) 6E616D65645F6669656C64 # "named_field" 62 # text(2) 6F6B # "ok" FF # primitive() If the option isn't set globally, but on a specific node instead, then only the value will be encoded according to CBOR specification. Example: log-format "test cbor bool: %{+cbor}[bool(true)]" Will produce: test cbor bool: F5	2024-04-26 18:39:32 +02:00
Aurelien DARRAGON	810303e3e6	MINOR: tools: add cbor encode helpers Add cbor helpers to encode strings (bytes/text) and integers according to RFC8949, also add cbor_encode_ctx struct to pass encoding options such as how to encode a single byte.	2024-04-26 18:39:32 +02:00
Aurelien DARRAGON	3f7c8387c0	MINOR: log: add +json encoding option In this patch, we add the "+json" log format option that can be set globally or per log format node. What it does, it that it sets the LOG_OPT_ENCODE_JSON flag for the current context which is provided to all lf_* log building function. This way, all lf_* are now aware of this option and try to comply with JSON specification when the option is set. If the option is set globally, then sess_build_logline() will produce a map-like object with key=val pairs for named logformat nodes. (logformat nodes that don't have a name are simply ignored). Example: log-format "%{+json}o %[int(4)] test %(named_field)[str(ok)]" Will produce: {"named_field": "ok"} If the option isn't set globally, but on a specific node instead, then only the value will be encoded according to JSON specification. Example: log-format "{ \"manual_key\": %(named_field){+json}[bool(true)] }" Will produce: {"manual_key": true} When the option is set, +E option will be ignored, and partial numerical values (ie: because of logasap) will be encoded as-is.	2024-04-26 18:39:32 +02:00
Aurelien DARRAGON	b7c3d8c87c	MINOR: log: add +bin logformat node option Support '+bin' option argument on logformat nodes to try to preserve binary output type with binary sample expressions. For this, we rely on the log/sink API which is capable of conveying binary data since all related functions don't search for a terminating NULL byte in provided log payload as they take a string pointer and a string length as argument. Example: log-format "%{+bin}o %[bin(00AABB)]" Will produce: 00aabb (output was piped to `hexdump -ve '1/1 "%.2x"'` to dump raw bytes as HEX characters) This should be used carefully, because many syslog endpoints don't expect binary data (especially NULL bytes). This is mainly intended for use with set-var-fmt actions or with ring/udp log endpoints that know how to deal with such binary payloads. Also, this option is only supported globally (for use with '%o'), it will not have any effect when set on an individual node. (it makes no sense to have binary data in the middle of log payload that was started without binary data option)	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	162e311a0e	MINOR: log: add no_escape_map to bypass escape with _lf_encode_bytes() Providing no_escape_map as <map> argument to _lf_encode_bytes() function will make the function skip escaping since the map is empty. This is for convenience, as it might be useful to call lf_encode_chunk() to encoding binary data without escaping it.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	fb8b47fed8	MINOR: log: postpone conversion for sample expressions in sess_build_logline() In sess_build_logline(), for sample expression nodes, instead of directly calling sample_fetch_as_type(... SMP_T_STR), let's first process the sample using sample_process(), and then proceed with the conversion to str if required. Doing so will allow us to implement type casting and preserving logic.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	84963fb743	MINOR: log: expose node typecast in lf_buildctx struct Store node->typecast setting inside lf_buildctx struct so that encoding functions may benefit from it.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	3f2e8d0ed2	MEDIUM: log: lf_* build helpers now take a ctx argument Add internal lf_buildctx struct that is only used inside sess_build_logline() scope and is passed to lf_* log building helpers to expose current building context. For now, node options and the in_text counter are stored in the ctx struct. Thanks to this change, lf_* building functions don't depend on a logformat_node struct pointer, and may be used in a standalone manner as long as a build context is provided. Also, global options are now handled explictly in sess_build_logline() to make sure that global options are always considered even if they were not duplicated on every nodes. No functional change should be expected.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	f7cb384f1a	MINOR: log: merge lf_encode_string() and lf_encode_chunk() logic lf_encode_string() and lf_encode_chunk() function are pretty similar. The only difference is the stopping behavior, encode_chunk stops at a given position while encode_string stops when encountering '\0'. Moreover, both functions leverage tools.c encode helpers, but because of the LOG_OPT_ESC option, they reimplement those helpers with added logic. Instead of having to deal with code duplication which makes both functions harder to maintain, let's define a _lf_encode_bytes() helper function which satisfies lf_encode_string() and lf_encode_chunk() needs while keeping the function as simple as possible. _lf_encode_bytes() itself is made of multiple static inline helper functions, in the attempt to keep checks outside of core loop for better performance.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	a1583ec7c7	MINOR: log: make all lf_* sess build helper static There is no need to expose such functions since they are only involved in the log building process that occurs inside sess_build_logline(). Making functions static and removing their public prototype to ease code maintenance.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	3b9096bd36	MINOR: log: use LOG_VARTEXT_{START,END} to enclose text strings Rename LOGQUOTE_{START,END} macros to more generic LOG_VARTEXT_{START,END} in order to prepare for new encoding types that rely on specific treatment for variable-length texts. No functional change should be expected.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	278d6c3379	MINOR: log: explicitly handle %ts and %tsc as text strings Build fixed-length strings for %ts and %tsc to be able to print them using lf_rawtext_len(), this way it will be easier to encode them when new encoding options will be added. No functional change should be expected.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	2e4cc517bf	MEDIUM: log: use lf_rawtext for lf_ip() and lf_port() hex strings Same as the previous commit, but for ip and port oriented values when +X option is provided. No functional change should be expected. Because of this patch, we add a little overhead because we first generate the text into a temporary variable and then use lf_rawtext() to print it. Thus we have a double-copy, and this could have some performance implications that were not yet evaluated. Due to the small number of bytes that can end up being copied twice, we could be lucky and have no visible performance impact, but if we happen to see a significant impact, it could be useful to add a passthrough mechanism (to keep historical behavior) when no encoding is involved.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	3a3bdf1c76	MEDIUM: log: write raw strings using lf_rawtext() Make use of the previous commit to print strings that should not be modified. For instance, when +X option is provided, we have to print numerical values in ASCII HEX form. For that, we used snprintf() to output the result to the log output buffer directly, but now we build the string in a temporary buffer of fixed-size and then print it using lf_rawtext() which will take care of encoding options. Because of this patch, we add a little overhead because we first generate the text into a temporary variable and then use lf_rawtext() to print it. Thus we have a double-copy, and this could have some performance implications that were not yet evaluated. Due to the small number of bytes that can end up being copied twice, we could be lucky and have no visible performance impact, but if we happen to see a significant impact, it could be useful to add a passthrough mechanism (to keep historical behavior) when no encoding is involved.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	0d1e99c086	MEDIUM: log: pass date strings to lf_rawtext() Don't directly call functions that take date as argument and output the string representation to the log output buffer under sess_build_logline(), and instead build the strings in temporary buffers of fixed size (hopefully such functions, such as date2str_log() and gmt2str_log() procuce strings of known size), and then print the result using lf_rawtext() helper function. This way, we will be able to encode them automatically as regular string/text when new encoding methods are added. Because of this patch, we add a little overhead because we first generate the text into a temporary variable and then use lf_rawtext() to print it. Thus we have a double-copy, and this could have some performance implications that were not yet evaluated. Due to the small number of bytes that can end up being copied twice (< 30), we could be lucky and have no visible performance impact, but if we happen to see a significant impact, it could be useful to add a passthrough mechanism (to keep historical behavior) when no encoding is involved.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	fcb7e4beaa	MINOR: log: add lf_rawtext{_len}() functions similar to lf_text_{len}, except that quoting and mandatory options are ignored. Use this to print the input string without any modification ( except for encoding logic).	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	1fa2da18cd	MINOR: log: add lf_int() wrapper to print integers Wrap ltoa(), lltoa(), ultoa() and utoa_pad() functions that are used by sess_build_logline() to print numerical values by implementing a dedicated helper named lf_int() that takes <dft_hld> as argument to know how to write the integer by default (when no encoding is specified). LF_INT_UTOA_PAD_4 is used to emulate utoa_pad(x, 4) since it's found only once under sess_build_logline(), thus there is no need to pass an extra parameter to lf_int() function.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	d3c92a3a83	MINOR: log: skip custom logformat_node name if empty Reminder: Since 3.0-dev4, we can optionally give a name to logformat nodes: log-format "%(custom_name1)B %(custom_name2)[str(value)]" But we may also optionally set the expected node type by appending ':type' after the name, type being either sint,str or bool, like this: log-format "%(string_as_int:sint)[str(14)]" However, it is currently not possible to provide a type without providing a name that is a least 1 char long. But it could be useful to provide a type without setting a name, like this, for typecasting purposes only: log-format "%(:sint)[bool(true)]" Thus in order to allow this usage, don't set node->name if node name is not at least 1 character long. By doing so, node->name will remain NULL and will not be considered, but the typecast setting will.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	c584600083	CLEANUP: log: simplify complex values usages in sess_build_logline() make sess_build_logline() switch case more readable by performing some simplifications: complex values are first extracted in a temporary variable so that it's easier to refer to them and at a single place.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	507223d527	MINOR: log: global lf_expr node options Add options to lf_expr->nodes to store global options (those that are common to all node) for easier access. No functional change should be expected.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	7ff4f09e23	MINOR: log: store lf_expr nodes inside substruct Add another struct level inside lf_expr struct to allow new information to be stored alongside lf_expr nodes.	2024-04-26 18:39:31 +02:00
Aurelien DARRAGON	f8e1357a05	CLEANUP: log: remove unused checks for encode_{chunk,string} Thanks to `8226e92eb` ("BUG/MINOR: tools/log: invalid encode_{chunk,string} usage"), we only need to check for NULL return value from encode_{chunk,string}() and escape_string() to know if the call failed.	2024-04-26 18:39:31 +02:00
William Lallemand	2ab42dddc4	BUG/MINOR: mworker: reintroduce way to disable seamless reload with -x /dev/null Since the introduction of the automatic seamless reload using the internal socketpair, there is no way of disabling the seamless reload. Previously we just needed to remove -x from the startup command line, and remove any "expose-fd" keyword on stats socket lines. This was introduced in `2be557f7c` ("MEDIUM: mworker: seamless reload use the internal sockpairs"). The patch copy /dev/null again and pass it to the next exec so we never try to get socket from the -x. Must be backported as far as 2.6.	2024-04-26 15:25:49 +02:00
Amaury Denoyelle	e4a29447ce	MEDIUM: stats: define stats-file keyword This commit is the final to implement preloading of haproxy internal counters via stats-file parsing. Define a global keyword "stats-file". It allows to specify the path to the stats-file which will be parsed on process startup.	2024-04-26 14:18:15 +02:00
Amaury Denoyelle	782be288ca	MINOR: stats: parse values from stats-file This patch implement parsing of counter values line from stats-file. It reuses domain context previously set by the last header line. Each value is separated by ',' character, relative to the list of column names describe by the header line. This is implemented via static function parse_stat_line(). It first extract a GUID and retrieve the object instance. Then each numerical value is parsed and object counters updated. For the moment, only U64 counters metrics is supported. parse_stat_line() is called on each line until a new header line is found.	2024-04-26 11:34:02 +02:00
Amaury Denoyelle	374dc08611	MINOR: stats: parse header lines from stats-file This patch implements parsing of headers line from stats-file. A header line is defined as starting with '#' character. It is directly followed by a domain name. For the moment, either 'fe' or 'be' is allowed. The following lines will contain counters values relatives to the domain context until the next header line. This is implemented via static function parse_header_line(). It first sets the domain context used during apply_stats_file(). A stats column array is generated to contains the order on which column are stored. This will be reused to parse following lines values. If an invalid line is found and no header was parsed, considered the stats-file as ill formatted and stop parsing. This allows to immediately interrupt parsing if a garbage file was used without emitting a ton of warnings to the user.	2024-04-26 11:34:02 +02:00
Amaury Denoyelle	34ae7755b3	MINOR: stats: apply stats-file on process startup This commit is the first one of a serie to implement preloading of haproxy counters via stats-file parsing. This patch defines a basic apply_stats_file() function. It implements reading line by line of a stats-file without any parsing for the moment. It is called automatically on process startup via init().	2024-04-26 11:29:25 +02:00
Amaury Denoyelle	83731c8048	MINOR: guid: define guid_is_valid_fmt() Extract GUID format validation in a dedicated function named guid_is_valid_fmt(). For the moment, it is only used on guid_insert(). This will be reused when parsing stats-file, to ensure GUID has a valid format before tree lookup.	2024-04-26 11:29:25 +02:00
Amaury Denoyelle	e74148fb7c	MEDIUM: stats: implement dump stats-file CLI Define a new CLI command "dump stats-file" with its handler cli_parse_dump_stat_file(). It will loop twice on proxies_list to dump first frontend and then backend side. It reuses the common function stats_dump_stat_to_buffer(), using STAT_F_BOUND to restrict on the correct side. A new module stats-file.c is added to regroup function specifics to stats-file. It defines two main functions : * stats_dump_file_header() to generate the list of column list prefixed by the line context, either "#fe" or "#be" * stats_dump_fields_file() to generate each stat lines. Object without GUID are skipped. Each stat entry is separated by a comma. For the moment, stats-file does not support statistics modules. As such, stats_dump_*_line() functions are updated to prevent looping over stats module on stats-file output.	2024-04-26 10:20:57 +02:00
Amaury Denoyelle	83281303f6	MINOR: stats: define stats-file output format support Prepare stats function to handle a new format labelled "stats-file". Its purpose is to generate a statistics dump with a format closed from the CSV output. Such output will be then used to preload haproxy internal counters on process startup. stats-file output differs from a standard CSV on several points. First, only an excerpt of all statistics is outputted. All values that does not make sense to preload are excluded. For the moment, stats-file only list stats fully defined via "struct stat_col" method. Contrary to a CSV, sll columns of a stats-file will be filled. As such, empty field value is used to mark stats which should not be outputted. Some adaptation specifics to stats-file are necessary into me_generate_field(). First, stats-file will output separatedly values from frontend and backend sides with their own respective set of columns. As such, an empty field value is returned if stat is not defined for either frontend/listener, or backend/server when outputting the other side. Also, as stats-file does not support empty column, stcol_hide() is not used for it. A minor adjustement was necessary for stats_fill_fe_line() to pass context flags. This is necessary to detect stat output format. All other listener/server/backend corresponding functions already have it.	2024-04-26 10:20:57 +02:00
Amaury Denoyelle	6615252656	MEDIUM: stats: convert counters to new column definition Convert most of proxy counters statistics to new "struct stat_col" definition. Remove their corresponding switch..case entries in stats_fill_*_line() functions. Their value are automatically calculate via me_generate_field() invocation. Along with this, also complete stcol_hide() when some stats should be hidden. Only a few counters where not converted. This is because they rely on values stored outside of fe/be_counters structure, which me_generate_field() cannot use for now.	2024-04-26 10:20:57 +02:00
Amaury Denoyelle	168301411d	MINOR: stats: hide some columns in output Metric style stats can be automatically calculate since the introduction of metric_generate() when using "struct stat_col" as input. This would allow to centralize statistics generation. However, some stats are not outputted under specific condition. For example, health check failures on a server are only reported if checks are active. To support this, define a new function metric_hide(). It is called by metric_generate(). If true, it will skip metric calcuation and return an empty field value instead. This allows to define "stat_col" metrics and calculate them with metric_generate() but hiding them under certain circumstances.	2024-04-26 10:20:57 +02:00
Amaury Denoyelle	a7810b7be6	MINOR: stats: implement automatic metric generation from stat_col This commit is a direct follow-up of the previous one which define a new type "struct stat_col" to fully define a statistic entry. Define a new function metric_generate(). For metrics statistics, it is able to automatically calculate a stat value field for "offsets" from "struct stat_col". Use it in stats_fill_*_stats() functions. Maintain a fallback to previously used switch-case for old-style statistics. This commit does not introduce functional change as currently no statistic is defined as "struct stat_col". This will be the subject of a future commit.	2024-04-26 10:20:57 +02:00
Amaury Denoyelle	65624876f2	MINOR: stats: introduce a more expressive stat definition method Previously, statistics were simply defined as a list of name_desc, as for example "stat_cols_px" for proxy stats. No notion of type was fixed for each stat definition. This correspondance was done individually inside stats_fill_*_line() functions. This renders the process to define new statistics tedious. Implement a more expressive stat definition method via a new API. A new type "struct stat_col" for stat column to replace name_desc usage is defined. It contains a field to store the stat nature and format. A <cap> field is also defined to be able to define a proxy stat only for certain type of objects. This new type is also further extended to include counter offsets. This allows to define a method to automatically generate a stat value field from a "struct stat_col". This will be the subject of a future commit. New type "struct stat_col" is fully compatible full name_desc. This allows to gradually convert stats definition. The focus will be first for proxies counters to implement statistics preservation on reload.	2024-04-26 10:20:57 +02:00
Amaury Denoyelle	861370a6d4	MINOR: stats: update ambiguous "metrics" naming to "stat_cols" The name "metrics" was chosen to represent the various list of haproxy exposed statistics. However, it is deemed as ambiguous as some stats are indeed metric in the true sense, but some are not, as highlighted by various "enum field_origin" values. Replace it by the new name "stat_cols" for statistic columns. Along with the already existing notion of stat lines it should better reflect its purpose.	2024-04-26 10:20:57 +02:00
Christopher Faulet	4b1a7ea66c	BUG/MINOR: peers: Don't wait for a remote resync if there no remote peer When a resync is needed, a local resync is first tried and if it does not work, a remote resync is tried. It happens when the worker is started for instance. There is a timeout to wait for the local resync, except for the first start. And if the local resync fails or times out, the same timeout is applied to the remote resync. This one is always applied, even if there is no remote peer. On the other hand, on reload, if the old worker has never performed its resync, it does not try to resync the new worker. And here there is an issue. On the first reload, when there is no remote peer, we must wait for the resync timeout expiration to have a chance to resync the new worker. If the reload happens too early, there is no resync at all. Concretly, after a fresh start, if a reload happens in the first 5 seconds, there is no resync with the new worker. The issue only concerns the first reload and affects the second worker. To fix the issue, we must only skip the remote resync if there is no remote peer. This way, on a fresh start, the worker is immediately considered as resync. The local reynsc is skipped because it is the first worker and the remote resync is skipped because there is no remote peer. This patch must be backported to all stable versions.	2024-04-25 21:47:02 +02:00
Christopher Faulet	0243691de1	REORG: peers: Rename all occurrences to 'ps' variable In loops on the peer list in the code, the 'ps' variable was used as a shortcut for the peer session. However, if mays be confusing with the peers section too. So, all occurrences to 'ps' variable were renamed to 'peer'.	2024-04-25 18:29:58 +02:00
Christopher Faulet	fff5f63e10	BUG/MEDIUM: peers: Use atomic operations on peers flags when necessary Peers flags are mainly used from the sync task. At least, it is only updated by the sync task. However, there is one place where a peer may read these flags, when the message marking the end of a synchro is sent. So to be sure the value retrieved at this place is consistent, we must use an atomic operation to read it. And of course, from the sync task, atomic operations must be used to update peers flags. However, from the sync task, there is no reason to use atomic operations to read flags because they cannot be update from somewhere eles.	2024-04-25 18:29:58 +02:00
Christopher Faulet	608e23c495	MINOR: peers: Use a static variable to wait a resync on reload When a process is reloaded, the old process must performed a synchronisation with the new process. To do so, the sync task notify the local peer to proceed and waits. Internally, the sync task used PEERS_F_DONOTSTOP flag to know it should wait. However, this flag was only set/unset in a single function. There is no real reason to set a flag to do so. A static variable set to 1 when the resync starts and to 0 when it is finished is enough.	2024-04-25 18:29:58 +02:00
Christopher Faulet	bdcfacdb78	MINOR: peers: Add comment on processing functions of the sync task Just add a comment on __process_running_peer_sync() and __process_stopping_peer_sync() functions.	2024-04-25 18:29:58 +02:00
Christopher Faulet	697bd69efc	REORG: peers: Move peer and peers flags in the corresponding header file PEER_F_* and PEERS_F_ * flags were moved to <peer-t.h> header file. It is mandatory to decode them from "flags" dev tool.	2024-04-25 18:29:58 +02:00
Christopher Faulet	31f544209d	MINOR: peers: Reorder and rename PEERS flags Peers flags were renamed and reordered, mainly to move flags used for debugging purpose at the end. PEERS_F_RESYNC_LOCAL and PEERS_F_RESYNC_REMOTE were also renamed to PEERS_F_RESYNC_LOCAL_FINISHED and PEERS_F_RESYNC_REMOTE_FINISHED to be clear on the fact the operation is finished when the flag is set.	2024-04-25 18:29:58 +02:00
Christopher Faulet	17c4030aaa	MINOR: peers: Reorder and slightly rename PEER flags There are too many holes in peer flags. So let's reorder them. In addition, PEER_F_RESYNC_REQUESTED flag was renamed to PEER_F_DBG_RESYNC_REQUESTED to clearly state it is a flag set for debugging purpose. Finally, PEER_TEACH_RESET was replaced by PEER_TEACH_FLAGS and the bitwise complement operator is now used on lines updating the peer flags. It is a far more common way to do (in HAProxy code at least) and less surprising.	2024-04-25 18:29:58 +02:00
Christopher Faulet	9934eebc19	MINOR: peers: Rename PEERS_F_TEACH_COMPLETE to PEERS_F_LOCAL_TEACH_COMPLETE PEERS_F_TEACH_COMPLETE flag is only used for the old local peer to let the sync task know it can stop waiting during a soft-stop. So it is less confusing to rename this flag to clearly state it concerns local peer only.	2024-04-25 18:29:57 +02:00
Christopher Faulet	45f4698725	MINOR: peers: Start learning for local peer before receiving messages A local peer assigned for leaning can immediately start to learn, without sending any request. So we can do that first, before receiving messages. This way, only PEER_LR_ST_PROCESSING state is evaluating when received messages are processed. In addition, when the resync request is sent, we are sure it is for a remote peer.	2024-04-25 18:29:57 +02:00
Christopher Faulet	c904f7b440	MEDIUM: peers: Use true states for the learn state of a peer Some flags were used to define the learn state of a peer. It was a bit confusing, especially because the learn state of a peer is manipulated from the peer applet but also from the sync task. It is harder to understand the transitions if it is based on flags than if it is based a dedicated state based on an enum. It is the purpose of this patch. Now, we can define the following rules regarding this learn state: * A peer is assigned to learn by the sync task * The learn state is then changed by the peer itself to notify the learning is in progress and when it is finished. * Finally, when the peer finished to learn, the sync task must acknowledge it by unassigning the peer.	2024-04-25 18:29:57 +02:00
Christopher Faulet	ea9bd6d075	MEDIUM: peers: Use true states for the peer applets as seen from outside This patch is a cleanup of the recent change about the relation between a peer and the applet used to deal with I/O. Three flags was introduced to reflect the peer applet state as seen from outside (from the sync task in fact). Using flags instead of true states was in fact a bad idea. This work but it is confusing. Especially because it was mixed with LEARN and TEACH peer flags. So, now, to make it clearer, we are now using a dedicated state for this purpose. From the outside, the peer may be in one of the following state with respects of its applet: * the peer has no applet, it is stopped (PEER_APP_ST_STOPPED). * the peer applet was created with a validated connection from the protocol perspective. But the sync task must synchronized it with the peers section. It is in starting state (PEER_APP_ST_STARTING). * The starting starting was acknowledged by the sync task, the peer applet can start to process messages. It is in running state (PEER_APP_ST_RUNNING). * The last peer applet was released and the associated connection closed. But the sync task must synchronized it with the peers section. It is in stopping state (PEER_APP_ST_STOPPING). Functionnaly speaking, there is no true change here. But it should be easier to understand now. In addition to these changes, __process_peer_state() function was renamed sync_peer_app_state().	2024-04-25 18:29:57 +02:00
Christopher Faulet	229755d8f5	MEDIUM: peers: Simplify the peer flags dealing with the connection state Recently, some peer flags were added to deal with the connection state (PEER_F_ST_). 3 states were added: RELEASED: Set when we forced to shutdown the peer session and no new session was created yet. * CONNECTED: Set when the peer has established connection and validated it from the peer protocol point of view * ACCEPTED: Set when the peer has accepted a connection and validated it from the peer protocol point of view However, management of these pseudo states is a bit confusing. And it appears there is no reason to have 2 flags to express there is a validated peer session. CONNECTED state was used for a peer session on the frontend side while ACCEPTED state was used for a peer session on the backend side. So, there is now only one "connected" state and we test if the applet was created on the frontend or the backend side to decide what to do, in addition to the fact the peer is local or remote. It is a transitionnal patch. True states will be created to deal with all this stuff and corresponding flags will be removed. This patch depends on the commit "MINOR: applet: Add a function to know the sidde where an applet was created".	2024-04-25 18:29:57 +02:00
Christopher Faulet	0c1ea46fe0	MINOR: peers: Remove unused PEERS_F_RESYNC_PROCESS flag This flag is now set or unset but never tested. So we can safely remove it.	2024-04-25 18:29:57 +02:00
Christopher Faulet	e35293b2d3	BUG/MEDIUM: peers: Wait for sync task ack when a resynchro is finished When a learning process is finished, partially or not, the event must be processed by the sync task. It is important for the peer applet to wait in this case, especially if the same peer is teaching to another peer, to be sure to send the right resync finished message (full or partial). Thanks to the previous patch, we can set PEER_F_WAIT_SYNCTASK_ACK flag on the peer when a PEER_MSG_CTRL_RESYNCPARTIAL or PEER_MSG_CTRL_RESYNCFINISHED message is received to be sure to stop the processing. Of course, we must also take care to wake the peer up after having acknowledged the learn status from the sync task. This patch depends on the commit "BUG/MEDIUM: peers: Wait for sync task ack when a resynchro is finished". Both must be backported if commit `9425aeaffb` ("BUG/MAJOR: peers: Update peers section state from a thread-safe manner") is backported.	2024-04-25 18:29:57 +02:00
Christopher Faulet	12014587fa	MINOR: peers: Use a peer flag to block the applet waiting ack of the sync task Since recent fixes on peers, some changes on a peer must be acknowledged by the sync task before letting the peer applet processing messages. Blocking conditions was based on a combination of flags. It was errorprone. So, this patch introduces PEER_F_WAIT_SYNCTASK_ACK peer flag for this purpose. This flag is set by the peer when it must wait for an ack from the sync task. This sync task, on its side, must remove it and wake the peer up.	2024-04-25 18:29:57 +02:00
Christopher Faulet	f80f1635ec	MINOR: peers: Don't set TEACH flags on a peer from the sync task The TEACH flags only concerns the peer applet. There is no reason to set it from the sync task. It is confusing. And at the end, after some refactoring/fixes, setting these flags directly from the peer applet will allow us to immediatly performing the corresponding teach processing, while for now we must wait the sync task acknowledges the changes.	2024-04-25 18:29:57 +02:00
Christopher Faulet	6380fd5eb9	MINOR: peers: Remove unused PEERS_F_RESYNC_REQUESTED flag This flag was used for debugging purpose to know a resync was requested at least once in the process life. Since the last bunch of fixes about the peers locking mechanism, this info is now set per-peer. There is no reason to still have it on peers too. So, just remove it.	2024-04-25 18:29:57 +02:00
Christopher Faulet	2a902e3188	BUG/MEDIUM: peers: Reprocess peer state after all session shutdowns When a session is shut down, the peer is switched in released state (PEER_F_ST_RELEASED) and the sync task must process it to eventually perform some clean up, in case the peer was assigned to learn. However, this was only true when the session was shut down from the peer applet itself. This was not performed when it was shut down from the sync task. It is now fixed.	2024-04-25 18:29:57 +02:00
Christopher Faulet	3541c54481	BUG/MEDIUM: peers: Automatically start to learn on local peer The previous fix (`c0b2015aae` "BUG/MEDIUM: peers: Don't set PEERS_F_RESYNC_PROCESS flag on a peer") was made due to lack of knowledge on the peers. A local peer, when assigned to learn, must start to learn immediately without sending any request. This happens on reload. Thus, in this case, the PEER_F_LEARN_PROCESS flag must be set with PEER_F_LEARN_ASSIGN flag from the sync task. This patch must only be backported if the above commit is backported.	2024-04-25 18:29:57 +02:00
Willy Tarreau	e158b7efb7	CLEANUP: h1: make use of the multi-byte matching functions Instead of leaving the hard-coded non-trivial operations in the H1 parsing code, let's just rely on the new intops functions that do the same and that are less prone to being accidentally touched. It was verified that the resulting code is exactly the same.	2024-04-24 16:05:38 +02:00
Willy Tarreau	b9bf16b382	BUG/MINOR: h1: fix detection of upper bytes in the URI In 1.7 with commit `5f10ea30f4` ("OPTIM: http: improve parsing performance of long URIs") we improved the URI parser's performance on platforms supporting unaligned accesses by reading 4 chars at a time in a 32-bit word. However, as reported in GH issue #2545, there's a bug in the way the top bytes are checked, as the parser will stop when all 4 of them are above 7e instead of when one of them is, so certain patterns can be accepted through if the last ones are all valid. The fix requires to negate the value but on the other hand it allows to parallelize some of the tests and fuse the masks, which could even end up slightly faster. This needs to be backported to all stable versions, but be careful, this code moved a lot over time, from proto_http.c to h1.c, to http_msg.c, to h1.c again. Better just grep for "24242424" or "21212121" in each version to find it. Big kudos to Martijn van Oosterhout (@kleptog) for spotting this problem while analyzing that piece of code, and reporting it.	2024-04-24 11:50:36 +02:00
David Carlier	98d22f212a	MEDIUM: shctx: Naming shared memory context From Linux 5.17, anonymous regions can be name via prctl/PR_SET_VMA so caches can be identified when looking at HAProxy process memory mapping. The most possible error is lack of kernel support, as a result we ignore it, if the naming fails the mapping of memory context ought to still occur.	2024-04-24 10:25:38 +02:00
Tim Duesterhus	3ef60012ae	MINOR: Add support for UUIDv7 to the `uuid` sample fetch This adds support for UUIDv7 to the existing `uuid` sample fetch that was added in `8a694b859c`.	2024-04-24 08:23:56 +02:00
Tim Duesterhus	aab6477b67	MINOR: Add `ha_generate_uuid_v7` This function generates a version 7 UUID as per draft-ietf-uuidrev-rfc4122bis-14.	2024-04-24 08:23:56 +02:00
Tim Duesterhus	c6cea750a9	MINOR: tools: Rename `ha_generate_uuid` to `ha_generate_uuid_v4` This is in preparation of adding support for other UUID versions.	2024-04-24 08:23:56 +02:00
Willy Tarreau	19f8762a98	BUILD: stick-tables: silence build warnings when threads are disabled Since 3.0-dev7 with commit `1a088da7c2` ("MAJOR: stktable: split the keys across multiple shards to reduce contention"), building without threads yields a warning about the shard not being used. This is because the locks API does nothing of its arguments, which is the only place where the shard is being used. We cannot modify the lock API to pretend to consume its argument because quite often it's not even instantiated. Let's just pretend we consume shard using an explict ALREADY_CHECKED() statement instead. While we're at it, let's make sure that XXH32() is not called when there is a single bucket! No backport is needed.	2024-04-24 08:23:56 +02:00
Christopher Faulet	589fb12904	BUG/MEDIUM: applet: Let's applets decide if they have more data to deliver Unlike the muxes, the applets have the responsibility to notify the SC if they have more data to deliver to the stream. The same is done to notify the SC that applets must be woken up ASAP to continue some processing. When an applet is woken up, we pretend it has no more data to deliver by setting SE_FL_HAVE_NO_DATA flag. If the applet removes this flag, we must take care to not set it again just after. Otherwise, the applet may remain blocked if there is no other condition to wake it up. It is an issue for the applets using their own buffers because SE_FL_HAVE_NO_DATA is erroneously set in sc_applet_recv() function, after the applet execution. For instance, it happens for the cli applet when a huge map is cleared. No data are delivered to the stream but we pretend it is the case to clear the map per batches. This patch should fix the issue #2543. No Backported needed.	2024-04-23 07:33:10 +02:00
Amaury Denoyelle	341bf913d4	MINOR: stats: use STAT_F_* prefix for flags Some flags are defined during statistics generation and output. They use the prefix STAT_* which is also used for other purposes. Rename them with the new prefix STAT_F_* to differentiate them from the other usages.	2024-04-22 16:25:18 +02:00
Amaury Denoyelle	e97375dcab	MINOR: stats: use stricter naming stats/field/line Several unique names were used for different purposes under statistics implementation. This caused the code to be difficult to understand. * stat/stats name is removed when a more specific name could be used * restrict field usage to purely refer to <struct field> which represents a raw stat value. * use "line" naming to represent an array of <struct field>	2024-04-22 16:25:18 +02:00
Amaury Denoyelle	8dbb74542f	MINOR: stats: rename info stats Info are used to expose haproxy global metrics. It is similar to proxy statistics and any other module. As such, rename info indexes using SI_I_INF_* prefix. Also info variable is renamed stat_line_info. Thanks to this, naming is now consistent between info and other statistics. It will help to integrate it as a "global" statistics module.	2024-04-22 16:25:18 +02:00
Amaury Denoyelle	02e0dd6d30	MINOR: stats: rename ambiguous stat_l and stat_count Statistics were extended with the introduction of stats module. This mechanism allows to expose various metrics for several haproxy components. As a consequence of this, some static variables were transformed to dynamic ones to be able to regroup all statistics definition. Rename these variables with more explicit naming : * stat_lines can be used to generate one line of statistics for any module using struct field as value * metrics and metrics_len are used to stored description of metrics indexed by module Note that info is not integrated in the statistics module mechanism. However, it could be done in the future to better reflect its purpose.	2024-04-22 16:25:18 +02:00
Amaury Denoyelle	8fc0b18087	MINOR: stats: rename proxy stats This commit is the first one of a serie which adjust naming convention for stats module. The objective is to remove ambiguity and better reflect how stats are implemented, especially since the introduction of stats module. This patch renames elements related to proxies statistics. One of the main change is to rename ST_F_* statistics indexes prefix with the new name ST_I_PX_*. This remove the reference to field which represents another concept in the stats module. In the same vein, global stat_fields variable is renamed metrics_px.	2024-04-22 16:25:18 +02:00
Amaury Denoyelle	282a8e9f52	BUG/MINOR: stats: fix stot metric for listeners This commit is part of a series to align counters usage between frontends/listeners on one side and backends/servers on the other. On frontend side, "stot" is the total count of sessions for both proxies and listeners. For proxies, fe_counters <cum_sess> is correctely used. The bug is on listeners where <cum_conn> value is returned, which instead indicates a number of connection. This commit fixes this by returning <cum_sess> counter value for "stot" metric. Along this fixes, use the opportunity to report "conn_tot" for listeners using <cum_conn> value, as for frontend proxies. This commit fixes a bug but must not be backported as stats output is changed.	2024-04-22 10:35:18 +02:00
Amaury Denoyelle	c02ec9a9db	BUG/MINOR: backend: use cum_sess counters instead of cum_conn This commit is part of a serie to align counters usage between frontends/listeners on one side and backends/servers on the other. "stot" metric refers to the total number of sessions. On backend side, it is interpreted as a number of streams. Previously, this was accounted using <cum_sess> be_counters field for servers, but <cum_conn> instead for backend proxies. Adjust this by using <cum_sess> for both proxies and servers. As such, <cum_conn> field can be removed from be_counters. Note that several diagnostic messages which reports total frontend and backend connections were adjusted to use <cum_sess>. However, this is an outdated and misleading information as it does reports streams count on backend side. These messages should be fixed in a separate commit. This should be backported to all stable releases.	2024-04-22 10:35:18 +02:00

1 2 3 4 5 ...

17603 Commits