haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-12-03 00:31:00 +01:00

Author	SHA1	Message	Date
Willy Tarreau	8f3edf2ac6	MEDIUM: log/sink: make the log forwarder code use ring_dispatch_messages() This code becomes even simpler and almost does not need any knowledge of the structure of the ring anymore. It even highlighted that an old race had not been fixed due to code duplication, but that's now done.	2024-03-25 17:34:19 +00:00
Willy Tarreau	c262442b1a	MEDIUM: sink: move the generic ring forwarder code use ring_dispatch_messages() Now the code is much simpler than the ring forwarding function almost does not need any knowledge of the structure of the ring anymore.	2024-03-25 17:34:19 +00:00
Willy Tarreau	8022ae326c	MEDIUM: ring/sink: use applet_append_line()/syslog_applet_append_event() for readers The rink reader code was duplicated as-is in 2.2 for the ring forwarding code in commits 494c505703 ("MEDIUM: ring: add server statement to forward messages from a ring") and 975564784f ("MEDIUM: ring: add new srv statement to support octet counting forward") (which only differs by using a prefix instead of a suffix to delimit messages). Unfortunately, that makes it almost impossible to rework the core ring code because all these parts rely on it. This first commit aims at restoring a common structure for the core loop by just calling a distinct function based on the use case. The functions are either applet_append_line() when a whole line is to be emitted followed by an LF character, or syslog_applet_appent_event() when trying to send a TCP syslog line prepended with its size in decimal. There is no functional change beyond this.	2024-03-25 17:34:19 +00:00
Willy Tarreau	758cb450a2	OPTIM: sink: drop the sink lock used to count drops The sink lock was made to prevent event producers from passing while there were other threads trying to print a "dropped" message, in order to guarantee the absence of reordering. It has a serious impact however, which is that all threads need to take the read lock when producing a regular trace even when there's no reader. This patch takes a different approach. The drop counter is shifted left by one so that the lowest bit is used to indicate that one thread is already taking care of trying to dump the counter. Threads only read this value normally, and will only try to change it if it's non-null, in which case they'll first check if they are the first ones trying to dump it, otherwise will simply count another drop and leave. This has a large benefit. First, it will avoid the locking that causes stalls as soon as a slow reader is present. Second, it avoids any write on the fast path as long as there's no drop. And it remains very lightweight since we just need to add +2 or subtract 2*dropped in operations, while offering the guarantee that the sink_write() has succeeded before unlocking the counter. While a reader was previously limiting the traffic to 11k RPS under 4C/8T, now we reach 36k RPS vs 14k with no reader, so readers will no longer slow the traffic down and will instead even speed it up due to avoiding the contention down the chain in the ring. The locking cost dropped from ~75% to ~60% now (it's in ring_write now).	2024-03-09 11:23:52 +01:00
Willy Tarreau	eb7b2ec83a	OPTIM: sink: try to merge "dropped" messages faster When a reader doesn't read fast enough and causes drops, subsequent threads try to produce a "dropped" message. But it takes time to produce and emit this message, in part due to the use of chunk_printf() that relies on vfprintf() which has to parse the printf format, and during this time other threads may continue to increment the counter. This is the reason why this is currently performed in a loop. When reading what is received, it's common to see a large count followed by one or two single-digit counts, indicating that we could possibly have improved that by writing faster. Let's improve the situation a little bit. First we're now using a static message prefixed with enough space to write the digits, and a call to ultoa_r() fills these digits from right to left so that we don't have to process a format string nor perform a copy of the message. Second, we now re-check the counter immediately after having prepared the message so that we still get an opportunity for updating it. In order to avoid too long loops, this is limited to 10 iterations. Tests show that the number of single-digit "dropped" counters on output now dropped roughly by 15-30%. Also, it was observed that with 8 threads, there's almost never more than one retry.	2024-03-09 11:23:52 +01:00
Willy Tarreau	962c129dc1	BUG/MINOR: sink: fix a race condition in the TCP log forwarding code That's exactly the same as commit 53bfab080c ("BUG/MINOR: sink: fix a race condition between the writer and the reader") that went into 2.7 and was backported as far as 2.4, except that since the code was duplicated, the second instance was not noticed, leaving the race present. The race has a limited impact, if a forwarder reaches the end of the logs and a new message arrives before it leaves, the forwarder will only wake up after yet another new message will be sent. In practice it remains unnoticeable because for the race to trigger, one needs to have a steady flow of logs, which means the wakeup will happen anyway. This should be backported, but no need to insist on it if it resists.	2024-03-05 11:48:44 +01:00
Christopher Faulet	dcd917d972	MINOR: applet: Remove uselelss test on SE_FL_SHR/SHW flags These both flags are set after releasing the applet, in appctx_shut(). Concretly, it means the applet is shutdown for reads and writes. Once set, the applet's I/O handler was no longer called. Tests on these flags are useless. There is no chance to match them.	2024-02-14 14:22:36 +01:00
Ilya Shipitsin	80813cdd2a	CLEANUP: assorted typo fixes in the code and comments This is 37th iteration of typo fixes	2023-11-23 16:23:14 +01:00
Aurelien DARRAGON	078ebde870	CLEANUP: sink: useless leftover in sink_add_srv() Removing a useless leftover which has been introduced with 31e8a003a5 ("MINOR: sink: function to add new sink servers")	2023-11-10 17:49:57 +01:00
Aurelien DARRAGON	2694621151	CLEANUP: sink: bad indent in sink_new_from_logger() Fixing bad indent in sink_new_from_logger() which was recently introduced	2023-11-10 17:49:57 +01:00
Aurelien DARRAGON	d710dfbacc	BUG/MINOR: sink: don't learn srv port from srv addr Since 04276f3d ("MEDIUM: server: split the address and the port into two different fields") we should not use srv->addr to store server's port and rely on srv->svc_port instead. For sink servers, we correctly set >svc_port upon server creation but we didn't use it when initializing address for the connection. As a result, FQDN resolution will not work properly with sink servers. Hopefully, this used to work by accident because sink servers were resolved using the PA_O_RESOLVE flag in str2sa_range(), which made the srv->addr contain the port in addition to the address. But this will fail to work when FQDN resolution is postponed because only ->svc_port will contain the proper server port upon resolution. For instance, FQDN resolution with servers from log backends (which are resolved as regular servers, that is, without the PA_O_RESOLVE) will fail to work because of this. This may be backported as far as 2.2 even though the bug didn't have noticeable effects for versions below 2.9 [In 2.2, sink_forward_session_init() didn't exist it should be applied in sink_forward_session_create()]	2023-11-10 17:49:57 +01:00
Aurelien DARRAGON	94d0f77deb	MINOR: server: introduce "log-bufsize" kw "log-bufsize" may now be used for a log server (in a log backend) to configure the bufsize of implicit ring associated to the server (which defaults to BUFSIZE).	2023-10-13 10:05:07 +02:00
Aurelien DARRAGON	e58a9b4baf	MINOR: sink: add sink_new_from_srv() function This helper function can be used to create a new sink from an existing server struct (and thus existing proxy as well), in order to spare some resources when possible.	2023-10-13 10:05:06 +02:00
Aurelien DARRAGON	5c0d1c1a74	MEDIUM: sink: inherit from caller fmt in ring_write() when rings didn't set one implicit rings were automatically forced to the parent logger format, but this was done upon ring creation. This is quite restrictive because we might want to choose the desired format right before generating the log header (ie: when producing the log message), depending on the logger (log directive) that is responsible for the log message, and with current logic this is not possible. (To this day, we still have dedicated implicit ring per log directive, but this might change) In ring_write(), we check if the sink->fmt is specified: - defined: we use it since it is the most precise format (ie: for named rings) - undefined: then we fallback to the format from the logger With this change, implicit rings' format is now set to UNSPEC upon creation. This is safe because the log header building function automatically enforces the "raw" format when UNSPEC is set. And since logger->format also defaults to "raw", no change of default behavior should be expected.	2023-10-13 10:05:06 +02:00
Aurelien DARRAGON	6dad0549a5	MEDIUM: log/sink: simplify log header handling Introduce log_header struct to easily pass log header data between functions and use that to simplify the logic around log header handling. While at it, some outdated comments were updated as well. No change in behavior should be expected.	2023-10-13 10:05:06 +02:00
Aurelien DARRAGON	cc3dfe89ed	MEDIUM: sink/log: stop relying on AF_UNSPEC for rings Since a5b325f92 ("MINOR: protocol: add a real family for existing FDs"), we don't rely anymore on AF_UNSPEC for buffer rings in do_send_log. But we kept it as a parsing hint to differentiate between implicit and named rings during ring buffer postparsing. However it is still a bit confusing and forces us to systematically rely on target->addr, even for named buffer rings where it doesn't make much sense anymore. Now that target->addr was made a pointer in a recent commit, we can choose not to initialize it when not needed (i.e.: named rings) and use this as a hint to distinguish implicit rings during init since they rely on the addr struct to temporarily store the ring's address until the ring is actually created during postparsing step.	2023-10-13 10:05:06 +02:00
Aurelien DARRAGON	a9b185f34e	MEDIUM: log: introduce log target log targets were immediately embedded in logger struct (previously named logsrv) and could not be used outside of this context. In this patch, we're introducing log_target type with the associated helper functions so that it becomes possible to declare and use log targets outside of loggers scope.	2023-10-13 10:05:06 +02:00
Aurelien DARRAGON	18da35c123	MEDIUM: tree-wide: logsrv struct becomes logger When 'log' directive was implemented, the internal representation was named 'struct logsrv', because the 'log' directive would directly point to the log target, which used to be a (UDP) log server exclusively at that time, hence the name. But things have become more complex, since today 'log' directive can point to ring targets (implicit, or named) for example. Indeed, a 'log' directive does no longer reference the "final" server to which the log will be sent, but instead it describes which log API and parameters to use for transporting the log messages to the proper log destination. So now the term 'logsrv' is rather confusing and prevents us from introducing a new level of abstraction because they would be mixed with logsrv. So in order to better designate this 'log' directive, and make it more generic, we chose the word 'logger' which now replaces logsrv everywhere it was used in the code (including related comments). This is internal rewording, so no functional change should be expected on user-side.	2023-10-13 10:05:06 +02:00
Aurelien DARRAGON	31e8a003a5	MINOR: sink: function to add new sink servers Move the sft creation part out of sink_finalize() function so that it becomes possible to register sink's servers without forward_px being set.	2023-10-06 15:34:31 +02:00
Aurelien DARRAGON	205d480d9f	MINOR: sink: refine forward_px usage now forward_px only serves as a hint to know if a proxy was created specifically for the sink, in which case the sink is responsible for it. Everywhere forward_px was used in appctx context: get the parent proxy from the sft->srv instead. This permits to finally get rid of the double link dependency between sink and proxy.	2023-10-06 15:34:31 +02:00
Aurelien DARRAGON	405567c125	MINOR: sink: don't rely on forward_px to init sink forwarding Instead, we check if at least one sft has been registered into the sink, if it is the case, then we need to init the forwarding for the sink.	2023-10-06 15:34:31 +02:00
Aurelien DARRAGON	3c53f6cb76	MINOR: sink: don't rely on p->parent in sink appctx Removing unnecessary dependency on proxy->parent pointer in sink appctx functions by directly using the sink sft from the applet->svcctx to get back to sink related structs. Thanks to this, proxy used for a ringbuf does not have to be exclusive to a single sink anymore.	2023-10-06 15:34:31 +02:00
Aurelien DARRAGON	ec770b7924	MINOR: sink: remove useless check after sink creation It's useless to check if sink has been created with BUF type after calling sink_new_buf() since the goal of the function is to create a new sink of BUF type.	2023-10-06 15:34:31 +02:00
Aurelien DARRAGON	cb01da8d12	MINOR: sink/log: fix some typos around postparsing logic Fixing some typos that have been overlooked during the recent log/sink API improvements. Using this patch to make sink_new_from_logsrv() static since it is not used outside of sink.c	2023-10-06 15:34:31 +02:00
Willy Tarreau	cbbee15462	CLEANUP: ring: rename the ring lock "RING_LOCK" instead of "LOGSRV_LOCK" The ring lock was initially mostly used for the logs and used to inherit its name in lock stats. Now that it's exclusively used by rings, let's rename it accordingly.	2023-09-20 21:38:33 +02:00
Aurelien DARRAGON	8e6339aa29	MEDIUM: sink: add sink_finalize() function To further clean the code and remove duplication, some sink postparsing and sink->sft finalization is now performed in a dedicated function named sink_finalize().	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	b2879e3502	MEDIUM: sink/ring: introduce high level ring creation helper function ease code maintenance.	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	5a8755681d	MINOR: sink: add helper function to deallocate sink struct In this patch we move sink freeing logic outside of sink_deinit() function in order to create the sink_free() helper function that could be used on error paths for example.	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	d9b81e5b49	MEDIUM: log/sink: make logsrv postparsing more generic We previously had postparsing logic but only for logsrv sinks, but now we need to make this operation on logsrv directly instead of sinks to prepare for additional postparsing logic that is not sink-specific. To do this, we migrated post_sink_resolve() and sink_postresolve_logsrvs() to their postresolve_logsrvs() and postresolve_logsrv_list() equivalents. Then, we split postresolve_logsrv_list() so that the sink-only logic stays in sink.c (sink_resolve_logsrv_buffer() function), and the "generic" target part stays in log.c as resolve_logsrv(). Error messages formatting was preserved as far as possible but some slight variations are to be expected. As for the functional aspect, no change should be expected.	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	32f1db6d0d	MEDIUM: sink: don't perform implicit truncations when maxlen is not set maxlen now defaults ~0 (instead of BUFSIZE) to make sure no implicit truncation will be performed when the option is not specified, since the doc doesn't mention any default value for maxlen. As such, if the payload is too big, it will be dropped (this is the default expected behavior).	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	fdf82d058b	MINOR: sink: inform the user when logs will be implicitly truncated Consider the following example: \|log ring@test-ring len 2000 local0 \| \|ring test-ring \| maxlen 1000 This would result in emitted logs being silently truncated to 1000 because test-ring maxlen is smaller than the log directive maxlen. In this patch we're adding an extra check in post_sink_resolve() to detect this kind of confusing setups and warn the user about the implicit truncation when DIAG mode is on. This commit depends on: - "MINOR: sink: simplify post_sink_resolve function"	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	ceaa1ddb06	MINOR: log/sink: detect when log maxlen exceeds sink size To prevent logs from being silently (and unexpectly droppped) at runtime, we check that the maxlen parameter from the log directives are strictly inferior to the targeted ring size. \|global \| tune.bufsize 16384 \| log tcp@127.0.0.1:514 len 32768 \| log myring@127.0.0.1:514 len 32768 \|ring myring \| # no explicit size On such configs, a diag warning will be reported. This commit depends on: - "MINOR: sink: simplify post_sink_resolve function" - "MINOR: ring: add a function to compute max ring payload"	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	d499485aa9	MINOR: sink: simplify post_sink_resolve function Simplify post_sink_resolve() function to reduce code duplication and make it easier to maintain.	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	ddd8671b19	BUG/MEDIUM: ring: adjust maxlen consistency check When user specifies a maxlen parameter that is greater than the size of a given ring section, a warning is emitted to inform that the max length exceeds size, and then the maxlen is forced to size. The logic is good, but imprecise, because it doesn't take into account the slight overhead from storing payloads into the ring. In practise, we cannot store a single message which is exactly the same length than size. Doing so will result in the message being dropped at runtime. Thanks to the ring_max_payload() function introduced in "MINOR: ring: add a function to compute max ring payload", we can now deduce the maximum value for the maxlen parameter before it could result in messages being dropped. When maxlen value is set to an improper value, the warning will be emitted and maxlen will be forced to the maximum "single" payload len that could fit in the ring buffer, preventing messages from being dropped unexpectedly. This commit depends on: - "MINOR: ring: add a function to compute max ring payload" This may be backported as far as 2.2	2023-09-06 16:06:39 +02:00
Aurelien DARRAGON	c38cf3cf98	BUG/MINOR: sink/log: properly deinit srv in sink_new_from_logsrv() When errors are encountered in sink_new_from_logsrv() function, incompetely allocated ressources are freed to prevent memory leaks. For instance: logsrv implicit server is manually cleaned up on error prior to returning from the function. However, since 198e92a8e5 ("MINOR: server: add a global list of all known servers") every server created using new_server() is registered to the global list, but unfortunately the manual srv cleanup in sink_new_from_logsrv() doesn't remove the srv from the global list, so the freed server will still be referenced there, which can result in invalid reads later. Moreover, server API has evolved since, and now the srv_drop() function is available for that purpose, so let's use it, but make sure that srv is freed before the proxy because on older versions srv_drop() expects the srv to be linked to a valid proxy pointer. This must be backported up to 2.4. [For 2.4 version, free_server() must be used instead of srv_drop()]	2023-07-11 10:26:09 +02:00
Aurelien DARRAGON	9859e00981	BUG/MINOR: sink: fix errors handling in cfg_post_parse_ring() Multiple error paths (memory,IO related) in cfg_post_parse_ring() were not implemented correcly and could result in memory leak or undefined behavior. Fixing them all at once. This can be backported in 2.4	2023-07-10 18:28:08 +02:00
Aurelien DARRAGON	a26b736300	BUG/MINOR: sink: invalid sft free in sink_deinit() sft freeing attempt made in a575421 ("BUG/MINOR: sink: missing sft free in sink_deinit()") is incomplete, because sink->sft is meant to be used as a list and not a single sft entry. Because of that, the previous fix only frees the first sft entry, which fixes memory leaks for single-server forwarders (this is the case for implicit rings), but could still result in memory leaks when multiple servers are configured in a explicit ring sections. What this patch does: instead of directly freeing sink->sft, it iterates over every list members to free them. It must be backported up to 2.4 with a575421.	2023-07-10 18:28:08 +02:00
Aurelien DARRAGON	b6e2d62fb3	MINOR: sink/api: pass explicit maxlen parameter to sink_write() sink_write() currently relies on sink->maxlen to know when to stop writing a given payload. But it could be useful to pass a smaller, explicit value to sink_write() to stop before the ring maxlen, for instance if the ring is shared between multiple feeders. sink_write() now takes an optional maxlen parameter: if maxlen is > 0, then sink_write will stop writing at maxlen if maxlen is smaller than ring->maxlen, else only ring->maxlen will be considered. [for haproxy <= 2.7, patch must be applied by hand: that is: __sink_write() and sink_write() should be patched to take maxlen into account and function calls to sink_write() should use 0 as second argument to keep original behavior]	2023-07-10 18:28:08 +02:00
Aurelien DARRAGON	c103379847	BUG/MINOR: ring: maxlen warning reported as alert When maxlen parameter exceeds sink size, a warning is generated and maxlen is enforced to sink size. But the err_code is incorrectly set to ERR_ALERT Indeed, being a "warning", ERR_WARN should be used here. This may be backported as far as 2.2	2023-07-10 18:28:08 +02:00
Aurelien DARRAGON	30ff33bd9b	BUG/MINOR: ring: size warning incorrectly reported as fatal error When a ring section defines its size using the "size" directive with a smaller size than the default one or smaller than the previous one, a warning is generated to inform the user that the new size will be ignored. However the err_code is returned as FATAL, so this cause haproxy to incorrectly abort the init sequence. Changing the err_code to ERR_WARN so that this warning doesn't refrain from successfully starting the process. This should be backported as far as 2.4	2023-07-10 18:28:08 +02:00
Aurelien DARRAGON	a5754219e7	BUG/MINOR: sink: missing sft free in sink_deinit() Adding missing free for sft (string_forward_target) in sink_deinit(), which resulted in minor leak for each declared ring target at deinit(). (either explicit and implicit rings are affected) This may be backported up to 2.4.	2023-07-06 15:41:17 +02:00
Aurelien DARRAGON	999699a277	BUG/MEDIUM: sink: invalid server list in sink_new_from_logsrv() forward proxy server list created from sink_new_from_logsrv() is invalid Indeed, srv->next is literally assigned to itself. This did not cause issues during syslog handling because the sft was properly set, but it will cause the free_proxy(sink->forward_px) at deinit to go wild since free_proxy() will try to iterate through the proxy srv list to free ressources, but because of the improper list initialization, double-free and infinite-loop will occur. This bug was revealed by 9b1d15f53a ("BUG/MINOR: sink: free forward_px on deinit()") It must be backported as far as 2.4.	2023-07-06 15:41:17 +02:00
Willy Tarreau	69530f59ae	MEDIUM: clock: replace timeval "now" with integer "now_ns" This puts an end to the occasional confusion between the "now" date that is internal, monotonic and not synchronized with the system's date, and "date" which is the system's date and not necessarily monotonic. Variable "now" was removed and replaced with a 64-bit integer "now_ns" which is a counter of nanoseconds. It wraps every 585 years, so if all goes well (i.e. if humanity does not need haproxy anymore in 500 years), it will just never wrap. This implies that now_ns is never nul and that the zero value can reliably be used as "not set yet" for a timestamp if needed. This will also simplify date checks where it becomes possible again to do "date1<date2". All occurrences of "tv_to_ns(&now)" were simply replaced by "now_ns". Due to the intricacies between now, global_now and now_offset, all 3 had to be turned to nanoseconds at once. It's not a problem since all of them were solely used in 3 functions in clock.c, but they make the patch look bigger than it really is. The clock_update_local_date() and clock_update_global_date() functions are now much simpler as there's no need anymore to perform conversions nor to round the timeval up or down. The wrapping continues to happen by presetting the internal offset in the short future so that the 32-bit now_ms continues to wrap 20 seconds after boot. The start_time used to calculate uptime can still be turned to nanoseconds now. One interrogation concerns global_now_ms which is used only for the freq counters. It's unclear whether there's more value in using two variables that need to be synchronized sequentially like today or to just use global_now_ns divided by 1 million. Both approaches will work equally well on modern systems, the difference might come from smaller ones. Better not change anyhting for now. One benefit of the new approach is that we now have an internal date with a resolution of the nanosecond and the precision of the microsecond, which can be useful to extend some measurements given that timestamps also have this resolution.	2023-04-28 16:08:08 +02:00
Willy Tarreau	eed5da1037	MINOR: clock: do not use now.tv_sec anymore Instead we're using ns_to_sec(tv_to_ns(&now)) which allows the tv_sec part to disappear. At this point, "now" is only used as a timeval in clock.c where it is updated.	2023-04-28 16:08:08 +02:00
Tim Duesterhus	1307cd42d2	CLEANUP: Stop checking the pointer before calling `ring_free()` Changes performed with this Coccinelle patch: @@ expression e; @@ - if (e != NULL) { ring_free(e); - } @@ expression e; @@ - if (e) { ring_free(e); - } @@ expression e; @@ - if (e) ring_free(e); @@ expression e; @@ - if (e != NULL) ring_free(e);	2023-04-23 00:28:25 +02:00
Aurelien DARRAGON	9b1d15f53a	BUG/MINOR: sink: free forward_px on deinit() When a ring section is configured, a new sink is created and forward_px proxy may be allocated and assigned to the sink. Such sink-related proxies are added to the sink_proxies_list and thus don't belong to the main proxy list which is cleaned up in haproxy deinit() function. We don't have to manually clean up sink_proxies_list in the main deinit() func: sink API already provides the sink_deinit() function so we just add the missing free_proxy(sink->forward_px) there. This could be backported up to 2.4. [in 2.4, commit b0281a49 ("MINOR: proxy: check if p is NULL in free_proxy()") must be backported first]	2023-04-05 08:58:16 +02:00
Christopher Faulet	a739dc22c5	MEDIUM: sink: Use the sedesc to report and detect end of processing Just like for other applets, we now use the SE descriptor instead of the channel to report error and end-of-stream.	2023-04-05 08:57:06 +02:00
Christopher Faulet	4b866959d8	MINOR: sink: Remove the tests on the opposite SC state to process messages The state of the opposite SC is already tested to wait the connection is established before sending messages. So, there is no reason to test it again before looping on the ring buffer.	2023-04-05 08:57:06 +02:00
Christopher Faulet	9a790f63ed	MINOR: stconn/channel: Move CF_READ_DONTWAIT into the SC and rename it The channel flag CF_READ_DONTWAIT is renamed to SC_FL_RCV_ONCE and moved into the stream-connector.	2023-04-05 08:57:05 +02:00
Aurelien DARRAGON	2c98867187	BUG/MEDIUM: sink/forwarder: ensure ring offset is properly readjusted to head Since d9c7188 ("MEDIUM: ring: make the offset relative to the head/tail instead of absolute"), ring offset calculation has changed: we don't rely on ring->ofs absolute offset anymore. But with the above patch, relative offset is not properly calculated in sink_forward_oc_io_handler() and sink_forward_io_handler(). The issue here is the same as 737d10f ("BUG/MEDIUM: dns: ensure ring offset is properly reajusted to head") since dns and sink_forward share the same ring logic: When the ring is becoming full, ring_write() will try to regain some space to insert new data by calling b_del() on older messages. Here b_del() moves buffer's head under the hood, and since ring->ofs cannot be used to "correct" the relative offset, both sink_forward_oc_io_handler() and sink_forward_io_handler() start to get invalid offset. At this point, we will suffer from ring data corruption resulting in unexpected behavior or process crashes. This can be easily demonstrated with the following test: \|log-forward syslog \| dgram-bind 127.0.0.1:5114 \| log ring@logbuffer local0 \| \|ring logbuffer \| format rfc5424 \| size 16384 \| server logserver 127.0.0.1:5114 Haproxy will forward incoming logs on udp@127.0.0.1:5114 to tcp@127.0.0.1:5114 Then use the following tcp server: nc -l -p 5114 With the following udp log sender: \|while [ 1 ] \|do \| logger --udp --server 127.0.0.1 -P 5114 -p user.warn "Test 7" \|done Once the ring buffer is full (it takes less that a second to fill the 16k buffer) haproxy starts to misbehave and the log forwarding stops. We apply the same fix as in 737d10f ("BUG/MEDIUM: dns: ensure ring offset is properly reajusted to head"). Please note the ~0 case that is handled slightly differently in this patch: this is required to properly start reading from a non-empty ring. This case will be fixed in dns related code in the following patch. This does not need to be backported as d9c7188 was not marked for backports.	2023-03-08 08:54:43 +01:00

1 2 3 4 5

231 Commits