haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-18 17:31:30 +01:00

Author	SHA1	Message	Date
Christopher Faulet	ece002af1d	BUG/MEDIUM: applet: Add a flag to state an applet is using zero-copy forwarding An issue was introduced when zero-copy forwarding was added to the stats and cache applets. There is no test to be sure the upper layer is ready to use the zero-copy forwarding. So these applets refuse to deliver the response into the applet's output buffer if the zero-copy forwarding is supported by the opposite endpoint. It is especially an issue when a filter, like the compression, is in-use on the response channel. Because of this bug, the response is not delivered and the applet is woken up in loop to produce data. To fix the issue, an appctx flag was added, APPCTX_FL_FASTFWD, to know when the zero-copy forwarding is in-use. We rely on this flag to not fill the outbuf in the applet's I/O handler. No backport needed.	2024-02-14 14:22:36 +01:00
Christopher Faulet	1465eb570b	MINOR: stats: Use a dedicated function to check if output is almost full This simplifies a bit the stats applet. Because the CLI part was not refactored yet to use the applet's buffers, there are 3 ways to produce data: * the HTX message for the HTTP stats when zero-copy forwarding is not used * raw data in the opposite endpoint buffer for the HTTP stats when zero-copy forwarding is used * the channel buffer when the CLI "show stat" command is evaluated There is already a dedicated function that takes care of copying data to the right place. There is now also a dedicated function to check if the output buffer is almost full.	2024-02-14 14:22:36 +01:00
Christopher Faulet	a9301c96f1	MINOR: applet: Use an option to disable zero-copy forwarding for all applets At the beginning of the 3.0-dev cycle, the zero-copy forwarding support was added only for the cache applet with an option to disable it. This was a hack, waiting for a better integration with applets. It is now possible to implement the zero-copy forwarding for any applets. So the specific option for the cache applet was renamed to be used for all applets. And this option is now also checked for the stats applet. Concretely, 'tune.cache.zero-copy-forwarding' was renamed to 'tune.applet.zero-copy-forwarding'.	2024-02-07 15:05:01 +01:00
Christopher Faulet	ee53d8421f	MEDIUM: applet: Simplify a bit API to exchange data with applets Default .rcv_buf and .snd_buf functions that applets can use are now specialized to manipulate raw buffers or HTX buffers. Thus a TCP applet should use appctx_raw_rcv_buf() and appctx_raw_snd_buf() while HTTP applet should use appctx_htx_rcv_buf() and appctx_htx_snd_buf(). Note that the appctx is now directly passed to these functions instead of the SC.	2024-02-07 15:04:52 +01:00
Christopher Faulet	868205943c	MAJOR: stats: Send stats dump over HTTP using zero-copy forwarding Just like for the cache applet, it is now possible to send response to the opposite side using the zero-copy forwarding. Internal functions were slightly updated but there is nothing special to say. Except the requested size during the nego stage is not exact.	2024-02-07 15:04:48 +01:00
Christopher Faulet	18845a0624	MAJOR: stats: Update HTTP stats applet to handle its own buffers The HTTP stats applet and all its internal functions were adapted to use the applet's own buffers instead of the channels' ones. The CLI part was not refactored yet, thus there are still some accesses to channels in the file. But for the HTTP part, we no longer use the channels at all. To do so, the HTTP stats applet now uses the default .rcv_buf and .snd_buf callback functions. In addition, it sets appctx flags instead of SE ones.	2024-02-07 15:04:13 +01:00
Christopher Faulet	a4dcd3e54b	MEDIUM: stats: Don't interrupt processing on partial post We no longer test the opposite stream-connector to detect aborted partial post. Applets must not try to access info outside their scope. This makes the code more sensitive to changes and it is a common source of bugs. Tests on the sedesc flags at the beginning of the I/O handler should be enough.	2024-02-07 15:04:09 +01:00
Christopher Faulet	3246f863d6	MEDIUM: stats: Be able to access a specific field into a stats module It is now possible to selectively retrieve extra counters from stats modules. H1, H2, QUIC and H3 fill_stats() callback functions are updated to return a specific counter.	2024-02-01 12:00:53 +01:00
Christopher Faulet	fd366a106b	MINOR: stats: Be able to access to registered stats modules from anywhere The list of modules registered on the stats to expose extra counters is now public. It is required to export these counters into the Prometheus exporter.	2024-02-01 12:00:53 +01:00
Emeric Brun	ef02dba7bc	BUG/MEDIUM: cli: some err/warn msg dumps add LR into CSV output on stat's CLI The initial purpose of CSV stats through CLI was to make it easily parsable by scripts. But in some specific cases some error or warning message strings containing LF were dumped into cells of this CSV. This caused parsing failures in several tools. In addition, if a warning or message contains two successive LFs, they will be dumped directly but double LFs tag the end of the response on CLI and the client may consider the response truncated. This patch extends the 'csv_enc_append' and 'csv_enc' functions used to format quoted string content according to RFC with an additional parameter to convert multi-line strings to one line: CRs are skipped, and LFs are replaced with spaces. In addition and optionally, it is also possible to remove resulting trailing spaces. The call of this function to fill strings into stat's CSV output is updated to force this conversion. This patch should be backported on all supported branches (issue was already present in v2.0)	2024-01-24 08:38:59 +01:00
Aurelien DARRAGON	ef9d692544	MINOR: stats: store the parent proxy in stats ctx (http) Some HTTP related stats functions need to know the parent proxy, mainly to get a pointer on the related uri_auth set by the proxy or to check scope settings. The current design (probably historical as only the http context existed by then) took the other approach: it propagates the uri pointer from the http context deep down the calling stack up to the relevant functions. For non-http contexts (cli), the pointer is set to NULL. Doing so is not very pretty and not easy to maintain. Moreover, there were still some places in the code where the uri pointer was learned directly from the stream proxy because it was not available as an argument in those functions. This is error-prone, because if one day we decide to change the source proxy in the parent function, we might still have some functions down the stack that ignore the top most argument and still do on their own, and we'll probably end up with inconsistencies. So in this patch, we take a safer approach: the caller responsible for creating the stats applet should set the http_px pointer so that any stats function running under the applet that needs to know if it's running in http context or needs to access parent proxy info may do so thanks to the dedicated ctx->http_px pointer.	2023-12-21 14:20:03 +01:00
Christopher Faulet	322d660d08	MINOR: tree-wide: Only rely on co_data() to check channel emptyness Because channel_is_empty() function does now only check the channel's buffer, we can remove it and rely on co_data() instead. Of course, all tests must be inverted. channel_is_empty() is thus removed.	2023-10-17 18:51:13 +02:00
Christopher Faulet	d0b04920d1	BUG/MINOR: htpp-ana/stats: Specify that HTX redirect messages have a C-L header Redirect responses sent during the HTTP analysis have no payload. However there is still a "Content-Length" header. It is important to set the corresponding flag on the HTX start-line to be sure to preserve this header when the response is sent to the client. The same is true with the stats applet, when it returns a redirect response. It is especially important because we now ignore on-the-fly modifications of "Content-Length" or "Transfer-Encoding" headers without updating the HTX start-line flags. This patch may be backported to all stable versions but it is probably useless because only the 2.9-dev is affected by the bug.	2023-10-17 18:11:04 +02:00
Willy Tarreau	28ff1a5d56	MINOR: tasks/stats: report the number of niced tasks in "show info" We currently know the number of tasks in the run queue that are niced, and we don't expose it. It's too bad because it can give a hint about what share of the load is relevant. For example if one runs a Lua script that was purposely reniced, or if a stats page or the CLI is hammered with slow operations, seeing them appear there can help identify what part of the load is not caused by the traffic, and improve monitoring systems or autoscalers.	2023-09-06 17:44:44 +02:00
Tim Duesterhus	33a4461fa9	BUG/MINOR: stats: Fix Lua's `get_stats` function Lua's `get_stats` function stopped working in 4cfb0019e65bce79953164eddf54c1bbb61add62, due to the addition of a new field ST_F_PROTO without a corresponding entry in `stat_fields`. Fix the issue by adding the entry, like a46b142e8807ea640e041d3a29e3fd427844d559 did previously for a different field. This patch fixes GitHub Issue #2174, it should be backported to 2.8.	2023-06-02 08:29:25 +02:00
Willy Tarreau	5723b382ed	MINOR: stats: report the boot time in "show info" Just like we have the uptime in "show info", let's add the boot time. It's trivial to collect as it's just the difference between the ready date and the start date, and will allow users to monitor this element in order to take action before it starts becoming problematic. Here the boot time is reported in milliseconds, so this allows to even observe sub-second anomalies in startup delays.	2023-05-17 09:33:54 +02:00
Willy Tarreau	52fd879953	CLEANUP: stats: update the trash chunk where it's used When integrating the number of warnings in "show info" in 2.8 with commit 3c4a297d2 ("MINOR: stats: report the total number of warnings issued"), the update of the trash buffer used by the Tainted flag got displaced lower. There's no harm for now until someone adds a new metric requiring a call to chunk_newstr() and gets both values merged. Let's move the call to its location now.	2023-05-17 09:33:54 +02:00
Willy Tarreau	4cfb0019e6	MINOR: stats: report the listener's protocol along with the address in stats When "option socket-stats" is used in a frontend, its listeners have their own stats and will appear in the stats page. And when the stats page has "stats show-legends", then a tooltip appears on each such socket with ip:port and ID. The problem is that since QUIC arrived, it was not possible to distinguish the TCP listeners from the QUIC ones because no protocol indication was mentioned. Now we add a "proto" legend there with the protocol name, so we can see "tcp4" or "quic6" and figure how the socket is bound.	2023-05-11 14:52:56 +02:00
Willy Tarreau	9615102b01	MINOR: stats: report the number of times the global maxconn was reached As discussed a few times over the years, it's quite difficult to know how often we stop accepting connections because the global maxconn was reached. This is not easy to know because when we reach the limit we stop accepting but we don't know if incoming connections are pending, so it's not possible to know how many were delayed just because of this. However, an interesting equivalent metric consists in counting the number of times an accepted incoming connection resulted in the limit being reached. I.e. "we've accepted the last one for now". That doesn't imply any other one got delayed but it's a factual indicator that something might have been delayed. And by counting the number of such events, it becomes easier to know whether some limits need to be adjusted because they're reached often, or if it's exceptionally rare. The metric is reported as a counter in show info and on the stats page in the info section right next to "maxconn".	2023-05-11 13:51:31 +02:00
Willy Tarreau	3c4a297d2b	MINOR: stats: report the total number of warnings issued Now in "show info" we have a TotalWarnings field that reports the total number of warnings issued since the process started. It's also reported in the stats page next to the uptime.	2023-05-11 12:02:21 +02:00
Christopher Faulet	a236c58223	BUG/MEDIUM: stats: Require more room if buffer is almost full This was lost with commit f4258bdf3 ("MINOR: stats: Use the applet API to write data"). When the buffer is almost full, the stats applet gives up. When this happens, the applet must require more room. Otherwise, data in the channel buffer are sent to the client but the applet is not woken up in return. It is a 2.8-specific bug, no backport needed.	2023-05-09 16:36:45 +02:00
Christopher Faulet	7b3d38a633	MEDIUM: tree-wide: Change sc API to specify required free space to progress sc_need_room() now takes the required free space to receive more data as parameter. All calls to this function are updated accordingly. For now, this value is set but not used. When we are waiting for a buffer, 0 is used. So we expect to be unblocked ASAP. However this must be reviewed because SC_FL_NEED_BUF is probably enough in this case and this flag is already set if the input buffer allocation fails.	2023-05-05 15:44:23 +02:00
Christopher Faulet	f4258bdf3b	MINOR: stats: Use the applet API to write data stats_putchk() is updated to use the applet API instead of the channel API to write data. To do so, the appctx is passed as parameter instead of the channel. This way, the applet does not need to take care to request more room if it fails to put data into the channel's buffer.	2023-05-05 15:41:29 +02:00
Tim Duesterhus	0ababda701	BUG/MINOR: stats: fix typo in `TotalSplicedBytesOut` field name An additional `d` slipped in there. This likely should not be backported, because scripts might rely on the typoed name. Public discussion on this topic here: https://www.mail-archive.com/haproxy@formilux.org/msg43359.html	2023-05-02 11:15:49 +02:00
Willy Tarreau	c05d30e9d8	MINOR: clock: replace the timeval start_time with start_time_ns Now that "now" is no more a timeval, there's no point keeping a copy of it as a timeval, let's also switch start_time to nanoseconds, it simplifies operations.	2023-04-28 16:08:08 +02:00
Willy Tarreau	69530f59ae	MEDIUM: clock: replace timeval "now" with integer "now_ns" This puts an end to the occasional confusion between the "now" date that is internal, monotonic and not synchronized with the system's date, and "date" which is the system's date and not necessarily monotonic. Variable "now" was removed and replaced with a 64-bit integer "now_ns" which is a counter of nanoseconds. It wraps every 585 years, so if all goes well (i.e. if humanity does not need haproxy anymore in 500 years), it will just never wrap. This implies that now_ns is never zero and that the zero value can reliably be used as "not set yet" for a timestamp if needed. This will also simplify date checks where it becomes possible again to do "date1<date2". All occurrences of "tv_to_ns(&now)" were simply replaced by "now_ns". Due to the intricacies between now, global_now and now_offset, all 3 had to be turned to nanoseconds at once. It's not a problem since all of them were solely used in 3 functions in clock.c, but they make the patch look bigger than it really is. The clock_update_local_date() and clock_update_global_date() functions are now much simpler as there's no need anymore to perform conversions nor to round the timeval up or down. The wrapping continues to happen by presetting the internal offset in the short future so that the 32-bit now_ms continues to wrap 20 seconds after boot. The start_time used to calculate uptime can still be turned to nanoseconds now. One open question concerns global_now_ms which is used only for the freq counters. It's unclear whether there's more value in using two variables that need to be synchronized sequentially like today or to just use global_now_ns divided by 1 million. Both approaches will work equally well on modern systems, the difference might come from smaller ones. Better not change anything for now. One benefit of the new approach is that we now have an internal date with a resolution of the nanosecond and the precision of the microsecond, which can be useful to extend some measurements given that timestamps also have this resolution.	2023-04-28 16:08:08 +02:00
Willy Tarreau	eed5da1037	MINOR: clock: do not use now.tv_sec anymore Instead we're using ns_to_sec(tv_to_ns(&now)) which allows the tv_sec part to disappear. At this point, "now" is only used as a timeval in clock.c where it is updated.	2023-04-28 16:08:08 +02:00
Willy Tarreau	563efe62e9	MINOR: stats: use nanoseconds, not timeval to compute uptime Now that we have the required functions, let's get rid of the timeval in intermediary calculations.	2023-04-28 16:08:08 +02:00
Willy Tarreau	7222db7b84	BUG/MINOR: stats: report the correct start date in "show info" The "show info" help for "Start_time_sec" says "Start time in seconds" so it's definitely the start date in human format, not the internal one that is solely used to compute uptime. Since commit 28360dc ("MEDIUM: clock: force internal time to wrap early after boot"), both are split apart since the start time takes into account the offset needed to cause the early wraparound, so we must only use start_date here. No backport is needed.	2023-04-28 16:08:08 +02:00
Aurelien DARRAGON	1746b56e68	MINOR: server: change srv_op_st_chg_cause storage type This one is greatly inspired by "MINOR: server: change adm_st_chg_cause storage type". While looking at current srv_op_st_chg_cause usage, it was clear that the struct needed some cleanup since some leftovers from asynchronous server state change updates were left behind and resulted in some useless code duplication, and making the whole thing harder to maintain. Two observations were made: - by tracking down srv_set_{running, stopped, stopping} usage, we can see that the <reason> argument is always a fixed statically allocated string. - check-related state change context (duration, status, code...) is not used anymore since srv_append_status() directly extracts the values from the server->check. This is pure legacy from when the state changes were applied asynchronously. To prevent code duplication, useless string copies and make the reason/cause more exportable, we store it as an enum now, and we provide srv_op_st_chg_cause() function to fetch the related description string. HEALTH and AGENT causes (check related) are now explicitly identified to make consumers like srv_append_op_chg_cause() able to fetch checks info from the server itself if they need to.	2023-04-21 14:36:45 +02:00
Aurelien DARRAGON	9b1ccd7325	MINOR: server: change adm_st_chg_cause storage type Even though it doesn't look like it at first glance, this is more like a cleanup than an actual code improvement: Given that srv->adm_st_chg_cause has been used to exclusively store static strings ever since it was implemented, we make the choice to store it as an enum instead of a fixed-size string within server struct. This will allow to save some space in server struct, and will make it more easily exportable (ie: event handlers) because of the reduced memory footprint during handling and the ability to later get the corresponding human-readable message when it's explicitly needed.	2023-04-21 14:36:45 +02:00
Christopher Faulet	ca5309a9a3	MINOR: stconn: Add a flag to report EOS at the stream-connector level SC_FL_EOS flag is added to report the end-of-stream at the SC level. It will be used to distinguish end of stream reported by the endpoint, via the SE_FL_EOS flag, and the abort triggered by the stream, via the SC_FL_ABRT_DONE flag. In this patch, the flag is defined and is systematically tested everywhere SC_FL_ABRT_DONE is tested. It should be safe because it is never set.	2023-04-17 17:41:28 +02:00
Christopher Faulet	214f1b5c16	MINOR: tree-wide: Replace several chn_prod() by the corresponding SC At many places, call to chn_prod() can be easily replaced by the corresponding SC. It is a bit easier to understand which side is manipulated.	2023-04-14 15:06:04 +02:00
Christopher Faulet	0c370eee6d	MINOR: stconn: Rename SC_FL_SHUTR in SC_FL_ABRT_DONE Here again, it is just a flag renaming. In SC flags, there is no longer shutdown for reads but aborts. For now this flag is set when a read0 is detected. It is of course not accurate. This will be changed later.	2023-04-14 14:51:22 +02:00
Christopher Faulet	9837bd86dc	BUG/MEDIUM: stats: Eat output data when waiting for appctx shutdown When the stats applet is executed while a shut is pending, the remaining output data must always be consumed. Otherwise, this can prevent the stream to exit, leading to a spinning loop on the applet. It is 2.8-specific. No backport needed.	2023-04-11 07:43:26 +02:00
Willy Tarreau	fc458ec8aa	CLEANUP: tree-wide: remove strpcy() from constant strings These ones are generally harmless on modern compilers because the compiler checks them. While gcc optimizes them away without even referencing strcpy(), clang prefers to call strcpy(). Nevertheless they prevent enabling stricter checks so better remove them altogether. They were all replaced by strlcpy2() and the size of the destination which is always known there.	2023-04-07 18:14:28 +02:00
Olivier Houchard	dea25f51b6	MINOR: compression: Count separately request and response compression Duplicate the compression counters, so that we have separate counters for request and response compression.	2023-04-07 00:47:04 +02:00
Aurelien DARRAGON	99a8d0f5d8	BUG/MINOR: stats: properly handle server stats dumping resumption In stats_dump_proxy_to_buffer() function, special care was taken when dealing with servers dump. Indeed, stats_dump_proxy_to_buffer() can be interrupted and resumed if buffer space is not big enough to complete dump. Thus, a reference is taken on the server being dumped in the hope that the server will still be valid when the function resumes. (to prevent the server from being freed in the meantime) While this is now true thanks to: - "BUG/MINOR: server/del: fix legacy srv->next pointer consistency" We still have an issue: when resuming, saved server reference is not dropped. This prevents the server from being freed when we no longer use it. Moreover, as the saved server might now be deleted (SRV_F_DELETED flag set), the current deleted server may still be dumped in the stats and while this is not a bug, this could be misleading for the user. Let's add a px_st variable to detect if the stats_dump_proxy_to_buffer() is being resumed at the STAT_PX_ST_SV stage: perform some housekeeping to skip deleted servers and properly drop the reference on the saved server. This commit depends on: - "MINOR: server: add SRV_F_DELETED flag" - "BUG/MINOR: server/del: fix legacy srv->next pointer consistency" This should be backported up to 2.6	2023-04-05 08:58:16 +02:00
Christopher Faulet	87633c3a11	MEDIUM: tree-wide: Move flags about shut from the channel to the SC The purpose of this patch is only a one-to-one replacement, as far as possible. CF_SHUTR(_NOW) and CF_SHUTW(_NOW) flags are now carried by the stream-connector. CF_ prefix is replaced by SC_FL_ one. Of course, it is not so simple because at many places, we were testing if a channel was shut for reads and writes at the same time. To do the same, shut for reads must be tested on one side on the SC and shut for writes on the other side on the opposite SC. Special care was taken with process_stream(). Flags of SCs must be saved to be able to detect changes, just like for the channels.	2023-04-05 08:57:06 +02:00
Christopher Faulet	df15a5d1f3	MEDIUM: stats: Use the sedesc to report and detect end of processing Just like for other applets, we now use the SE descriptor instead of the channel to report error and end-of-stream.	2023-04-05 08:57:06 +02:00
Christopher Faulet	92297749e1	MINOR: applet: No longer set EOI on the SC Thanks to the previous patch, it is now possible for applets to not set the CF_EOI flag on the channels. On this point, the applets get closer to the muxes.	2023-04-05 08:57:05 +02:00
Christopher Faulet	41a454da0a	BUG/MINOR: stats: Don't replace sc_shutr() by SE_FL_EOS flag yet In commit c2c043ed4 ("BUG/MEDIUM: stats: Consume the request except when parsing the POST payload"), a change about applet was pushed too early. The applet must still call cf_shutr() when the response is fully sent. It is planned to rely on SE_FL_EOS flag, just like connections. But it is not possible for now. However, at first glance, this bug has no visible effect. It is 2.8-specific. No backport needed.	2023-03-28 14:36:05 +02:00
Christopher Faulet	c2c043ed43	BUG/MEDIUM: stats: Consume the request except when parsing the POST payload The stats applet is designed to consume the request at the end, when it finishes sending the response. And during the response forwarding, because the request is not consumed, the applet states it will not consume data. This avoids waking the applet up in a loop. When it finishes sending the response, the request is consumed. For POST requests, there is no issue because the response is small enough. It is sent in one time and must be processed by HTTP analyzers. Thus the forwarding is not performed by the applet itself. The applet is always able to consume the request, regardless of the payload length. But for other requests, it may be an issue. If the response is too big to be sent in one time and if the request is not fully received when the response headers are sent, the applet may be blocked infinitely, not consuming the request. Indeed, in the case the applet will be switched in infinite forward mode, the request will not be consumed immediately. At the end, the request buffer is flushed. But if some data must still be received, the applet is not woken up because it is still in a "not-consuming" mode. So, to fix the issue, we must take care to re-enable data consuming when the end of the response is reached. This patch must be backported as far as 2.6.	2023-03-24 09:24:27 +01:00
Christopher Faulet	b08c5259eb	MINOR: stconn: Always report READ/WRITE event on shutr/shutw It was done by hand by callers when a shutdown for read or write was performed. It is now always handled by the functions performing the shutdown. This way the callers don't take care of it. This will avoid some bugs.	2023-02-22 15:59:16 +01:00
Willy Tarreau	b685ad0774	BUG/MINOR: clock/stats: also use start_time not start_date in HTML info For an unknown reason the change of uptime calculation for the HTML page didn't make it to commit 6093ba47c ("BUG/MINOR: clock: do not mix wall-clock and monotonic time in uptime calculation"). Let's address it as well otherwise the stats page will display an incorrect uptime. No backport needed unless the patch above is backported.	2023-02-10 16:53:35 +01:00
Willy Tarreau	6093ba47c0	BUG/MINOR: clock: do not mix wall-clock and monotonic time in uptime calculation We've had a start date even before the internal monotonic clock existed, but once the monotonic clock was added, the start date was not updated to distinguish the wall clock time units and the internal monotonic time units. The distinction is important because both clocks do not necessarily progress at the same speed. The very rare occurrences of the wall-clock date are essentially for human consumption and communication with third parties (e.g. report the start date in "show info" for monitoring purposes). However currently this one is also used to measure the distance to "now" as being the process' uptime. This is actually not correct. It only works because for now the two dates are initialized at the exact same instant at boot but could still be wrong if the system's date shows a big jump backwards during startup for example. In addition the current situation prevents us from enforcing an arbitrary offset at boot to reveal some heisenbugs. This patch adds a new "start_time" at boot that is set from "now" and is used in uptime calculations. "start_date" instead is now set from "date" and will always reflect the system date for human consumption (e.g. in "show info"). This way we're now sure that any drift of the internal clock relative to the system date will not impact the reported uptime. This could possibly be backported though it's unlikely that anyone has ever noticed the problem.	2023-02-08 11:06:55 +01:00
Frédéric Lécaille	d97d1d7c7c	BUG/MINOR: stats: Prevent HTTP "other sessions" counter underflows Due to multithreading concurrency, it is difficult at this time to figure out how this counter may become negative. This simple patch only checks this will never be the case. This issue arrives with this commit: "9969adbcdc MINOR: stats: add by HTTP version cumulated number of sessions and requests" So, this patch should be backported when the latter has been backported.	2023-02-06 14:04:27 +01:00
Aurelien DARRAGON	90304dcdd8	BUG/MINOR: stats: fix STAT_STARTED behavior with full htx When stats_putchk() fails to perform the dump because available data space in htx is less than the number of bytes pending in the dump buffer, we wait for more room in the htx (ie: sc_need_room()) to retry the dump attempt on the next applet invocation. To provide consistent output, we have to make sure that the stat ctx is not updated (or at least correctly reverted) in case stats_putchk() fails so that the new dumping attempt behaves just like the previous (failed) one. STAT_STARTED is not following this logic, the flag is set in stats_dump_fields_json() as soon as some data is written to the output buffer. It's done too early: we need to delay this step after the stats_putchk() has successfully returned if we want to correctly handle the retry attempts. Because of this, JSON output could suffer from extraneous ',' characters which could make json parsers unhappy. For example, this is the kind of errors you could get when using `python -m json.tool` on such badly formatted outputs: "Expecting value: line 1 column 2 (char 1)" Unfortunately, fixing this means that the flag needs to be enabled at multiple places, which is what we're doing in this patch. (in stats_dump_proxy_to_buffer() where stats_dump_one_line() is involved by underlying stats_dump_{fe,li,sv,be} functions) Thereby, this raises the need for a cleanup to reduce code duplication around stats_dump_proxy_to_buffer() function and simplify things a bit. It could be backported to 2.6 and 2.7	2023-02-06 07:53:03 +01:00
Aurelien DARRAGON	28a23617ce	BUG/MINOR: stats: fix show stats field ctx for servers In ("MINOR: stats: introduce stats field ctx"), we forgot to apply the patch to servers. This prevents "BUG/MINOR: stats: fix show stat json buffer limitation" from working with servers dump. We're adding the missing part related to servers dump. This commit should be backported with the aforementioned commits.	2023-02-06 07:53:03 +01:00
Aurelien DARRAGON	9b07d4fecd	BUG/MINOR: stats: fix ctx->field update in stats_dump_proxy_to_buffer() When ctx->field was introduced with ("MINOR: stats: introduce stats field ctx") a mistake was made for the STAT_PX_ST_LI state in stats_dump_proxy_to_buffer(): current_field reset is placed after the for loop, ie: after multiple lines are dumped. Instead it should be placed right after each li line is dumped. This could cause some output inconsistencies (missing fields), especially when http dump is used with JSON output and "socket-stats" option is enabled on the proxy, because when htx is full we restore the ctx->field with current_field (which contains outdated value in this case). This should be backported with ("MINOR: stats: introduce stats field ctx")	2023-02-06 07:53:03 +01:00
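
Note on commit ef02dba7bc above: the conversion it describes for CSV cells (CRs skipped, LFs replaced with spaces, trailing spaces optionally removed, then quoting) can be sketched roughly as follows. This is only an illustration in Python, not HAProxy's C code; the function name `csv_enc_one_line` and its `strip_trailing` parameter are hypothetical, and the sketch assumes the cell is always wrapped in double quotes with embedded quotes doubled, as in RFC 4180.

```python
def csv_enc_one_line(text: str, strip_trailing: bool = False) -> str:
    """Quote `text` as a single-line CSV cell.

    Hypothetical helper for illustration; it is not HAProxy's csv_enc().
    """
    # Skip CRs and replace LFs with spaces so the cell never spans lines
    # (and never produces the double LF that ends a CLI response).
    flat = text.replace("\r", "").replace("\n", " ")
    if strip_trailing:
        # Optionally drop the spaces left behind by trailing line breaks.
        flat = flat.rstrip(" ")
    # Assumed RFC 4180 style quoting: double any embedded quote, then wrap.
    return '"' + flat.replace('"', '""') + '"'


if __name__ == "__main__":
    print(csv_enc_one_line("parse error\r\nline 2\n\n", strip_trailing=True))
```

With this approach a warning that ends in two LFs can no longer be mistaken for the end of the CLI response, since no LF survives inside the cell.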

417 Commits