haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 17:17:06 +02:00

Author	SHA1	Message	Date
Willy Tarreau	c5977728b3	MINOR: stats: make "show info" able to report rates as floats when asked Now "show info float" will also report SSL rates, connection rates and key reuse ratios as floats. This can be convenient at very low rates. Note that the SSL reuse ratio which used to commonly oscillate between 0 and 1 under load is now more often above zero with small values. It indicates that for better stability we shouldn't be comparing a key rate with a connection rate but instead we should measure the reuse rate at its source.	2021-05-08 10:52:12 +02:00
Willy Tarreau	e8abc3293f	MINOR: stats: report uptime and start time as floats with subsecond resolution When "show info float" is used, the uptime and start time will be reported with subsecond resolution (microsecond actually since timeval is used).	2021-05-08 10:52:12 +02:00
Willy Tarreau	d37e26eaa6	MINOR: stats: use tv_remain() to precisely compute the uptime We'll have to support reporting sub-second uptimes, so let's use the appropriate function which will automatically adjust the tv_usec field. In addition to this, it will also report a more accurate uptime thanks to considering the sub-second part in the result.	2021-05-08 10:52:12 +02:00
Willy Tarreau	2745620240	MINOR: stats: support an optional "float" option to "show info" This will allow some fields to be produced with a higher accuracy when the requester indicates being able to parse floats. Rates and times are among the elements which can make sense.	2021-05-08 10:52:12 +02:00
Willy Tarreau	0b26b3866c	MINOR: stats: pass the appctx flags to stats_fill_info() Currently the stats filling function knows nothing about the caller's needs, so let's pass the STAT_* flags so that it can adapt to the requester's constraints.	2021-05-08 10:52:12 +02:00
Willy Tarreau	6004fb7681	MINOR: stats: add the HTML conversion for float types For the prometheus exporter, a new float type was added for the fields and its conversion was added everywhere except for the HTML output. Now that we have F2H() we can implement it for consistency.	2021-05-08 10:48:17 +02:00
Willy Tarreau	065ba3186e	MINOR: stats: avoid excessive padding of float values with trailing zeroes When emitting stats, we don't need to have 6 zeroes after the decimal point for each value, so let's trim floating point numbers to the longest needed only.	2021-05-08 10:48:17 +02:00
Willy Tarreau	ae03d26eea	MINOR: tools: add a float-to-ascii conversion function We already had ultoa_r() and friends but nothing to emit inline floats. This is now done with ftoa_r() and F2A/F2H. Note that the latter both use the itoa_str[] as temporary storage and that the HTML format currently is the exact same as the ASCII one. The trailing zeroes are always timmed so these outputs are usable in user-visible output.	2021-05-08 10:48:17 +02:00
Willy Tarreau	56d1d8dab0	MINOR: tools: implement trimming of floating point numbers When using "%f" to print a float, it automatically gets 6 digits after the decimal point and there's no way to automatically adjust to the required ones by dropping trailing zeroes. This function does exactly this and automatically drops the decimal point if all digits after it were zeroes. This will make numbers more friendly in stats and makes outputs shorter (e.g. JSON where everything is just a "number"). The function is designed to be easy to use with snprint() and chunks: snprintf: flt_trim(buf, 0, snprintf(buf, sizeof(buf), "%f", x)); chunk_printf: out->data = flt_trim(out->area, 0, chunk_printf(out, "%f", x)); chunk_appendf: size_t prev_data = out->data; out->data = flt_trim(out->area, prev_data, chunk_appendf(out, "%f", x));	2021-05-08 10:42:11 +02:00
Willy Tarreau	a1169b6231	MINOR: sample: improve error reporting on missing arg to strcmp() converter Calling the strcmp() converter with no argument yields this strange error: [ALERT] (31439) : parsing [test.cfg:3] : error detected in frontend 'f' while parsing 'http-request redirect' rule : failed to parse sample expression <src,strcmp]> : invalid args in converter 'strcmp' : failed to register variable name ''. This is because the vars name check tries to see if it can create such a variable having an empty name. Let's at least make a special case of the missing argument. Now we can read a more explicit: [ALERT] (31655) : parsing [test.cfg:3] : error detected in frontend 'f' while parsing 'http-request redirect' rule : failed to parse sample expression <src,strcmp]> : invalid args in converter 'strcmp' : missing variable name. This was done for secure_strcmp() as well.	2021-05-08 06:55:25 +02:00
Amaury Denoyelle	24abb0cdc1	BUG/MINOR: server: do not report diag for peer servers with null weight Only check servers attached to a proxy with PR_CAP_LB. This does not need to be backported as the diag message was added in the current 2.4-dev branch.	2021-05-07 15:20:54 +02:00
Amaury Denoyelle	b979f59871	MINOR: proxy: define PR_CAP_LB Add a new proxy capability for proxy with load-balancing capabilities. This help to differentiate listen/frontend/backend with special proxies such as peer proxies.	2021-05-07 15:12:20 +02:00
Amaury Denoyelle	86c1d0fddb	BUILD: fix usage of ha_alert without format string The compilation is failing due to no format string used in ha_alert. This does not need to be backported.	2021-05-07 15:07:21 +02:00
Amaury Denoyelle	a9e639afe2	MINOR: http_act: mark normalize-uri as experimental normalize-uri http rule is marked as experimental, so it cannot be activated without the global 'expose-experimental-directives'. The associated vtc is updated to be able to use it.	2021-05-07 14:35:02 +02:00
Amaury Denoyelle	5dfdf3e5b0	MINOR: stats: report tainted on show info Add a new info field ST_F_TAINTED to dump tainted status at the end of the 'show info' output.	2021-05-07 14:35:02 +02:00
Amaury Denoyelle	f492992065	MINOR: cli: set tainted when using CLI expert/experimental mode Mark the process as tainted as soon as a command command only accessible in expert or experimental mode is executed.	2021-05-07 14:35:02 +02:00
Amaury Denoyelle	0351773534	MINOR: action: implement experimental actions Support experimental actions. It is mandatory to use 'expose-experimental-directives' before to be able to use them. If such action is present in the config file, the tainted status of the process is updated. Another tainted status is set when an experimental action is executed.	2021-05-07 14:35:02 +02:00
Amaury Denoyelle	e4a617c931	MINOR: action: replace match_pfx by a keyword flags field Define a new keyword flag KWF_MATCH_PREFIX. This is used to replace the match_pfx field of action struct. This has the benefit to have more explicit action declaration, and now it is possible to quickly implement experimental actions.	2021-05-07 14:35:01 +02:00
Amaury Denoyelle	d2e53cd47e	MINOR: cfgparse: implement experimental config keywords Add a new flag to mark a keyword as experimental. An experimental keyword cannot be used if the global 'expose-experimental-directives' is not present first. Only keywords parsed through a standard cfg_keywords lists in global/proxies section will be automatically detected if declared experimental. To support a keyword outside of these lists, check_kw_experimental must be called manually during its parsing. If an experimental keyword is present in the config, the tainted flag is updated. For the moment, no keyword is marked as experimental.	2021-05-07 14:34:41 +02:00
Amaury Denoyelle	484454d906	MINOR: global: define tainted flag Add a global flag named 'tainted'. Its purpose is to report various status about experimental features used for the current process lifetime. By default it is initialized to 0. It can be set/retrieve by a couple of new functions mark_tainted()/get_tainted(). Once a flag is set, it cannot be resetted. Currently, no tainted status is implemented, it will be the subject of the following commits.	2021-05-07 14:12:27 +02:00
Christopher Faulet	ea86083718	BUG/MINOR: checks: Reschedule check on observe mode only if fastinter is set On observe mode, if a server is marked as DOWN, the server's health-check is rescheduled using the fastinter timeout if the new expiration date is newer that the current one. But this must only be performed if the fastinter timeout is defined. Internally, tick_is_lt() function only checks the date and does not perform any verification on the provided args. Thus, we must take care of it. However, it is possible to disable the server health-check by setting its task expiration date to TICK_ETERNITY. This patch must be backported as far as 2.2. It is related to	2021-05-07 12:10:30 +02:00
Christopher Faulet	92017a3215	BUG/MINOR: checks: Handle synchronous connect when a tcpcheck is started A connection may be synchronously established. In the tcpcheck context, it may be a problem if several connections come one after another. In this case, there is no event to close the very first connection before starting the next one. The checks is thus blocked and timed out, a L7 timeout error is reported. To fix the bug, when a tcpcheck is started, we immediately evaluate its state. Most of time, nothing is performed and we must wait. But it is thus possible to handle the result of a successfull connection. This patch should fix the issue #1234. It must be backported as far as 2.2.	2021-05-07 12:00:56 +02:00
Christopher Faulet	30aa0da532	BUG/MINOR: stream: Reset stream final state and si error type on L7 retry Thanks to a previous fix, the stream error mask is now cleared on L7 retry. But the stream final state (SF_FINST_*) and the stream-interface error type must also be reset to properly restart a new connection and be sure to not inherit errors from the previous connection attempt. In addition, SF_ADDR_SET flag is not systematically removed. stream_choose_redispatch() already takes care to unset it if necessary. When the connection is not redispatch, the server address can be preserved. This patch must be backported as far as 2.0.	2021-05-07 12:00:56 +02:00
Willy Tarreau	b205bfdab7	CLEANUP: cli/tree-wide: properly re-align the CLI commands' help messages There were 102 CLI commands whose help were zig-zagging all along the dump making them unreadable. This patch realigns all these messages so that the command now uses up to 40 characters before the delimiting colon. About a third of the commands did not correctly list their arguments which were added after the first version, so they were all updated. Some abuses of the term "id" were fixed to use a more explanatory term. The "set ssl ocsp-response" command was not listed because it lacked a help message, this was fixed as well. The deprecated enable/disable commands for agent/health/server were prominently written as deprecated. Whenever possible, clearer explanations were provided.	2021-05-07 11:51:26 +02:00
Willy Tarreau	7190b987ab	MINOR: config: add a new message directive: .diag This one works just like .notice/.warning/.alert except that it prints the message at level "DIAG" only when haproxy runs in diagnostic mode (-dD). This can be convenient for example to pass a few hints to help locate certain config parts or to leave messages about certain temporary workarounds. Example: .diag "WTA/2021-05-07: $.LINE: replace 'redirect' with 'return' after final switch to 2.4" http-request redirect location /goaway if ABUSE	2021-05-07 09:06:40 +02:00
Willy Tarreau	9f903af510	MEDIUM: log: slightly refine the output format of alerts/warnings/etc For about 20 years we've been emitting cryptic messages on warnings and alerts, that nobody knows how to parse: [NOTICE] 126/080118 (3115) : haproxy version is 2.4-dev18-0b7c78-49 [NOTICE] 126/080118 (3115) : path to executable is ./haproxy [WARNING] 126/080119 (3115) : Server default/srv1 is DOWN via static/srv1. 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue. [ALERT] 126/080119 (3115) : backend 'default' has no server available! Hint: the first 3-digit number is the day of year, and the 6 digits after it represent the time of day in format HHMMSS, then the pid in parenthesis. These are not quite user-friendly and such cryptic into are not useful at all. This patch slightly adjusts the output by performing these minimal changes: - removing the date/time, as they were added very early when haproxy was meant to be used in foreground as a debugging tool, and they're provided in more details in logs nowadays ; - better aligning the fields by padding the severity tag to 10 chars. The diag output was renamed to "DIAG" only. Now the output provides this: [NOTICE] (4563) : haproxy version is 2.4-dev18-75a428-51 [NOTICE] (4563) : path to executable is ./haproxy [WARNING] (4563) : Server default/srv1 is DOWN via static/srv1. 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue. [ALERT] (4563) : backend 'default' has no server available! The useless space before the colon was kept so as not to confuse any possible output parser. The few entries in the doc referring to this format were adjusted to reflect the new one. The change was tagged "MEDIUM" as it may have visible consequences on home-grown monitoring tools, though it is extremely unlikely due to the limited extent of these changes.	2021-05-07 08:55:11 +02:00
Willy Tarreau	75a4284bab	BUG/MINOR: stream: properly clear the previous error mask on L7 retries The cleanup of the previous error was incorrect on L7 retries, it would OR two values while they're part of an enum, leaving some bits set. Depending on the errors it was possible to occasionally see an internal error ("I" flag) being logged. This should be backported as far as 2.0, though the do_l7_retry() function in in proto_htx.c in older versions.	2021-05-07 08:22:16 +02:00
Willy Tarreau	2639e2edc2	BUG/MINOR: activity: use the new pointer to calculate the new size in realloc() When memory profiling is enabled, realloc() can occasionally get the area size wrong due to the wrong pointer being used to check the new size. When the old area gets unmapped in the operation, this may even result in a crash. There's no impact without memory profiling though. No backport is needed as this is exclusively 2.4-dev.	2021-05-07 08:01:35 +02:00
Willy Tarreau	0b7c78aa05	MINOR: config: add predicates "version_atleast" and "version_before" to cond blocks These predicates respectively verify that the current version is at least a given version or is before a specific one. The syntax is exactly the one reported by "haproxy -v", though each component is optional, so both "1.5" and "2.4-dev18-88910-48" are supported. Missing components equal zero, and "dev" is below "pre" or "rc", which are both inferior to no such mention (i.e. they are negative). Thus "2.4-dev18" is older than "2.4-rc1" which is older than "2.4".	2021-05-06 17:04:45 +02:00
Willy Tarreau	58ca706e16	MINOR: config: add predicate "feature" to detect certain built-in features The "feature(name)" predicate will return true if <name> corresponds to a name listed after a '+' in the features list, that is it was enabled at build time with USE_<name>=1. Typical use cases will include OPENSSL, LUA and LINUX_SPLICE. But maybe it will also be convenient to use with optional addons such as PROMEX and the device detection modules to help keeping the same configs across various deployments.	2021-05-06 17:02:36 +02:00
Willy Tarreau	6492e87b0e	MINOR: config: add predicates "streq()" and "strneq()" to conditional expressions "streq(str1,str2)" will return true if the two strings match while "strneq(str1,str2)" will return true only if they differ. This is convenient to match an environment variable against a predefined value.	2021-05-06 17:02:36 +02:00
Willy Tarreau	42ed14b529	MINOR: config: add predicate "defined()" to conditional expression blocks "defined(name)" will return true if <name> is a defined environment variable otherwise false, regardless of its contents.	2021-05-06 17:02:36 +02:00
Willy Tarreau	732525fae7	MINOR: config: make cfg_eval_condition() support predicates with arguments Now we can look up a list of known predicates and pre-parse their arguments. For now the list is empty. The code needed to be arranged with a common exit point to release all arguments because there's no default argument freeing function (it likely only used to exist in the deinit code). Since we only support simple arguments for now it's no big deal, only a 2-liner loop.	2021-05-06 17:02:36 +02:00
Willy Tarreau	299bd1c3ae	MINOR: config: improve .if condition error reporting Let's return the position of the first unparsable character on error, so that instead of just saying "unparsable conditional expression blah" we can have: [ALERT] 125/150618 (13995) : parsing [test-conds2.cfg:1]: unparsable conditional expression '12/blah' in '.if' at position 1: .if 12/blah ^ This is important because conditions will be made from environment variables or later from more complex expressions where the error will not always be easy to locate.	2021-05-06 17:02:36 +02:00
Willy Tarreau	a43dfda4e1	MINOR: global: add version comparison functions The new function split_version() converts a parsable haproxy version to an array of integers. The function compare_current_version() compares an arbitrary version to the current one. These two functions were written by Thierry Fournier in 2013, and are still usable as-is. They will be used to write config language predicates.	2021-05-06 17:02:36 +02:00
Willy Tarreau	f0d3b732fb	MINOR: global: export the build features string list Till now it was only presented in the version output but could not be consulted outside of haproxy.c, let's export it as a variable, and set it to an empty string if not defined.	2021-05-06 17:02:36 +02:00
Willy Tarreau	3e293a9135	MINOR: arg: improve the error message on missing closing parenthesis When the closing brace is missing after an argument (acl, ...), the error may report something like "expected ')' before ''". Let's just drop "before ''" when the final word is empty to make the message a bit clearer.	2021-05-06 17:02:36 +02:00
Willy Tarreau	7541056aa0	BUILD: activity: do not include malloc.h It doesn't exist on MacOS and broke the build. We don't need it as it's already included by compat.h when relevant. No backport is needed.	2021-05-06 11:38:41 +02:00
Willy Tarreau	a46f1af2b1	MINOR: config: support some pseudo-variables for file/line/section The new pseudo-variables ".FILE", ".LINE" and ".SECTION" will be resolved on the fly by the config parser and will respectively retrieve the current configuration file name, the current line number and the current section being parsed. This may help emit logs, errors, and debugging information (e.g. which rule matched). The '.' in the first char was reserved for such pseudo-variables and no other variable is permitted. This will allow to add support for new ones in the future if they prove to be useful (e.g. randoms/uuid for secret keying or automatic naming of configuration objects).	2021-05-06 10:36:38 +02:00
Willy Tarreau	5150805a5c	MINOR: config: keep up-to-date current file/line/section in the global struct Let's add a few fields to the global struct to store information about the current file being processed, the current line number and the current section. This will be used to retrieve them using special variables.	2021-05-06 10:35:03 +02:00
Willy Tarreau	6a2110c717	MINOR: config: centralize the ".if"/".elif" condition parser and evaluator Instead of duplicating the condition evaluations, let's have a single function cfg_eval_condition() that returns true/false/error. It takes less code and will ease its extension.	2021-05-06 10:35:03 +02:00
Willy Tarreau	71990e6bec	BUG/MINOR: config: .if/.elif should also accept negative integers The doc about .if/.elif config block conditions says: a non-nul integer (e.g. '1'), always returns "true" So we must accept negative integers as well. The test was made on atoi() > 0. No backport is needed, this is only 2.4.	2021-05-06 10:35:03 +02:00
Willy Tarreau	f67ff02072	BUG/MINOR: config: add a missing "ELIF_TAKE" test for ".elif" condition evaluator This missing state was causing a second elif condition to be evaluated after a first one succeeded after a .if failed. For example in the test below the else would be executed: .if 0 .elif 1 .elif 0 .else .endif No backport is needed, this is 2.4-only.	2021-05-06 10:35:03 +02:00
Willy Tarreau	6e647c94f2	BUG/MINOR: config: fix uninitialized initial state in ".if" block evaluator The condition to skip the block in the ".if" evaluator forgot to check that the level was high enough, resulting in rare cases where a random value matched one of the 5 values that cause the block to be skipped. No backport is needed as it's 2.4-only.	2021-05-06 10:35:03 +02:00
Christopher Faulet	e763c8c99f	BUG/MINOR: stream: Decrement server current session counter on L7 retry When a L7 retry is performed, we must not forget to decrement the current session counter of the assigned server. Of course, it must only be done if the current session is already counted on the server, thus if SF_CURR_SESS flag is set on the stream. This patch is related to the issue #1003. It must be backported as far as 2.0.	2021-05-06 09:21:12 +02:00
Christopher Faulet	10a8670f28	MINOR: mux-h1: Manage processing blocking flags on the H1 stream Because H1C_F_RX_BLK and H1C_F_TX_BLK flags now only concerns data processing, at the H1 stream level, there is no reason to still manage them on the H1 connection. Thus, these flags are now set on the H1 stream.	2021-05-06 09:21:00 +02:00
Christopher Faulet	14ee9b8c8b	CLEANUP: mux-h1: rename WAIT_INPUT/WAIT_OUTPUT flags These flags are used to block, respectively, the output and the input processing. Thus, to be more explicit, H1C_F_WAIT_INPUT is renamed to H1C_F_TX_BLK and H1C_F_WAIT_OUTPUT is renamed to H1C_F_RX_BLK.	2021-05-06 09:21:00 +02:00
Christopher Faulet	02c92c3e6f	MEDIUM: mux-h1: Wake H1 stream when both sides a synchronized Instead of subscribing for reads or sends to restart data processing, when both sides are synchronized, the H1 stream is woken up. This happens when H1C_F_WAIT_INPUT or H1C_F_WAIT_OUTPUT flags are removed, Indeed, these flags block the data processing and not raw data sending or receiving.	2021-05-06 09:21:00 +02:00
Christopher Faulet	94d35108b4	MINOR: mux-h1: Always subscribe for reads when splicing is disabled In h1_rcv_pipe(), when the splicing is not possible or disabled at the end of the fnuction, we make sure to subscribe for reads. It is not a bug but it avoid an extra call to h1_rcv_pipe() to handle the subscription in some cases (end of message, end of chunk or read0). In addition, the condition to detect end of splicing has been simplified. We now only rely on H1C_F_WANT_SPLICE flags.	2021-05-06 09:21:00 +02:00
Christopher Faulet	8454f2dbbc	MINOR: mux-h1: Subscribe for sends if output buffer is not empty in h1_snd_pipe In h1_snd_pipe(), before sending spliced data, we take care to flush the output buffer by subscribing for sends. However, the condition to do so is not accurate. We test data remaining in the pipe. It works but it also unnecessarily subscribes H1C for sends when the output buffer is empty if we are unable to send all spliced data in one time. Instead, H1C is now subscribed for sends if output buffer is not empty.	2021-05-06 09:21:00 +02:00
Christopher Faulet	2b861bf723	MINOR: mux-h1: clean up conditions to enabled and disabled splicing First, there is no reason to announce the splicing support at the conn-stream level when it is created, at least for now. GTUNE_USE_SPLICE option is already handled at the stream level. Second, in h1_rcv_buf(), there is no reason to test the message state to switch the H1C in splicing mode (via H1C_F_WANT_SPLICE flag). h1_process_input() already takes care to set CS_FL_MAY_SPLICE flag on the conn-stream when appropriate. Thus, in h1_rcv_buf(), we can rely on this flag to change the H1C state. Finally, if h1_rcv_pipe() is called, it means the H1C is already in the splicing mode. H1C_F_WANT_SPLICE flag is necessarily already set. Thus no reason to force it.	2021-05-06 09:21:00 +02:00
Christopher Faulet	1baef1523d	BUG/MEDIUM: mux-h1: Properly report client close if abortonclose option is set On client side, if CO_RFL_KEEP_RECV flags is set when h1_rcv_buf() is called, we force subscription for reads to be able to catch read0. This way, the event will be reported to upper layer to let the stream abort the request. This patch fixes the abortonclose option for H1 connections. It depends on following patches : * MEDIUM: mux-h1: Don't block reads when waiting for the other side * MINOR: conn-stream: Force mux to wait for read events if abortonclose is set But to be sure the event is handled by the stream, the following patches are also required : * BUG/MINOR: stream-int: Don't block reads in si_update_rx() if chn may receive * MINOR: channel: Rely on HTX version if appropriate in channel_may_recv() All the series must be backported with caution as far as 2.0, and only after a period of observation to be sure nothing broke.	2021-05-06 09:19:06 +02:00
Christopher Faulet	ec4207cb68	MEDIUM: mux-h1: Don't block reads when waiting for the other side When we are waiting for the other side to read more data, or to read the next request, we must only stop the processing of input data and not the data receipt. This patch don't change anything on the subscribes for reads. So it should not change anything. The only difference is that the H1 connection will try to read data if it is woken up for an I/O event and if it was subscribed for reads. This patch is required to fix abortonclose option for H1 client connections.	2021-05-06 09:19:06 +02:00
Christopher Faulet	d8219b31e7	MINOR: conn-stream: Force mux to wait for read events if abortonclose is set When the abortonclose option is enabled, to be sure to be immediately notified when a shutdown is received from the client, the frontend conn-stream must be sure the mux will wait for read events. To do so, the CO_RFL_KEEP_RECV flag is set when mux->rcv_buf() is called. This new flag instructs the mux to wait for read events, regardless its internal state. This patch is required to fix abortonclose option for H1 client connections.	2021-05-06 09:19:05 +02:00
Christopher Faulet	e0dec4b7b2	BUG/MINOR: stream-int: Don't block reads in si_update_rx() if chn may receive In si_update_rx() function, the reads may be blocked because we explicitly don't want to read or because of a lack of room in the input buffer. The first condition is valid. However the second one only test if the channel is empty or not. It means the reads are blocked if there are still some output data in the input channel, in its buffer or its pipe. This condition is not accurate. The reads must not be blocked if the channel can still receive data. Thus instead of relying on channel_is_empty() function, we now call channel_may_recv(). This patch is especially useful to be able to catch read0 on client side when we are waiting for a connection to the server, when abortonclose option is enabled. Otherwise, the client abort is not detected. This patch depends on "MINOR: channel: Rely on HTX version if appropriate in channel_may_recv()". Both must be backported as far as 2.0 after a period of observation to be sure nothing broke.	2021-05-06 09:19:05 +02:00
Willy Tarreau	ca3afc2456	MINOR: activity: add the profiling.memory global setting This allows to enable/disable memory usage profiling very early, which can be convenient to trace the memory usage in maps, certificates, Lua etc.	2021-05-05 19:09:19 +02:00
Willy Tarreau	993d44d234	MINOR: activity: make "show profiling" also dump the memoery usage Now the memory usage stats are dumped. They are first sorted by total alloc+free so that the first ones are always the most relevant, and that most symmetric alloc/free pairs appear next to each other. This way it becomes convenient to only show a small part of them such as: show profiling memory 20 It's worth noting that the sorting is performed upon each call to the iohandler so it is technically possible that an entry could appear twice or be dropped if the ordering changes between two calls. In practice it is not an issue but it's worth being mentioned.	2021-05-05 19:09:19 +02:00
Willy Tarreau	42712cb6d4	MINOR: activity: make "show profiling" support a few arguments These ones allow to limit the output to only certain sections and/or a number of lines per dump.	2021-05-05 19:09:19 +02:00
Willy Tarreau	637d85a93e	MINOR: activity: clean up the show profiling io_handler a little bit Let's rearrange it to make it more configurable and allow to iterate over multiple parts (header, tasks, memory etc), to restart from a given line number (previously it didn't work, though fortunately it didn't happen), and to support dumping only certain parts and a given number of lines. A few entries from ctx.cli are now used to store a restart point and the current step.	2021-05-05 19:09:19 +02:00
Willy Tarreau	f93c7be87f	MEDIUM: activity: collect memory allocator statistics with USE_MEMORY_PROFILING When built with USE_MEMORY_PROFILING the main memory allocation functions are diverted to collect statistics per caller. It is a bit tricky because the only way to call the original ones is to find their pointer, which requires dlsym(), and which is not available everywhere. Thus all functions are designed to call their fallback function (the original one), which is preset to an initialization function that is supposed to call dlsym() to resolve the missing symbols, and vanish. This saves expensive tests in the critical path. A second problem is that dlsym() calls calloc() to initialize some error messages. After plenty of tests with posix_memalign(), valloc() and friends, it turns out that returning NULL still makes it happy. Thus we currently use a visit counter (in_memprof) to detect if we're reentering, in which case all allocation functions return NULL. In order to convert a return address to an entry in the stats, we perform a cheap hash consisting in multiplying the pointer by a balanced number (as many zeros as ones) and keeping the middle bits. The hash is already pretty good like this, achieving to store up to 638 entries in a 2048-entry table without collision. But in order to further refine this and improve the fill ratio of the table, in case of collision we move up to 16 adjacent entries to find a free place. This remains quite cheap and manages to store all of these inside a 1024-entries hash table with even less risk of collision. Also, free(NULL) does not produce any stats. By doing so we reduce from 638 to 208 the average number of entries needed for a basic config using SSL. free(NULL) not only provides no information as it's a NOP, but keeping it is pure pollution as it happens all the time. When DEBUG_MEM_STATS is enabled, malloc/calloc/realloc are redefined as macros, preventing the code from compiling. Thus, when this option is detected, the macros are undefined as they are pointless there anyway. The functions are optimized to quickly jump to the fallback and as such become almost invisible in terms of processing time, execpt an extra "if" on a read_mostly variable and a jump. Considering that this only happens for pool misses and library routines, this remains acceptable. Performance tests in SSL (the most stressful test) shows less than 1% performance loss when profiling is enabled on 2c4t. The code was written in a way to ease backporting to modern versions (2.2+) if needed, so it keeps the long names for integers and doesn't use the _INC version of the atomic ops.	2021-05-05 19:09:19 +02:00
Willy Tarreau	db87fc7d36	MINOR: activity: declare the storage for memory usage statistics We'll need to store for each call place, the pointer to the caller (the return address to be more exact as with free() it's not uncommon to see tail calls), the number of calls to alloc/free and the total alloc/free bytes. realloc() will be counted either as alloc or free depending on the balance of the size before vs after. We store 1024+1 entries. The first ones are used as hashes and the last one for collisions. When profiling is enabled via the CLI, all the stats are reset.	2021-05-05 18:55:28 +02:00
Willy Tarreau	00dd44f67f	MINOR: activity: add a "memory" entry to "profiling" This adds the necessary flags to permit run-time enabling/disabling of memory profiling. For now this is disabled. A few words were added to the management doc about it and recalling that this is limited to certain OSes.	2021-05-05 18:55:02 +02:00
Willy Tarreau	ef7380f916	CLEANUP: activity: mark the profiling and task_profiling_mask __read_mostly These ones are only read by the scheduler and occasionally written to by the CLI parser, so let's move them to read_mostly so that they do not risk to suffer from cache line pollution.	2021-05-05 18:38:05 +02:00
Willy Tarreau	64192392c4	MINOR: tools: add functions to retrieve the address of a symbol get_sym_curr_addr() will return the address of the first occurrence of the given symbol while get_sym_next_addr() will return the address of the next occurrence of the symbol. These ones return NULL on non-linux, non-ELF, non-USE_DL.	2021-05-05 16:24:52 +02:00
Amaury Denoyelle	d3a88c1c32	MEDIUM: connection: close front idling connection on soft-stop Implement a safe mechanism to close front idling connection which prevents the soft-stop to complete. Every h1/h2 front connection is added in a new per-thread list instance. On shutdown, a new task is waking up which calls wake mux operation on every connection still present in the new list. A new stopping_list attach point has been added in the connection structure. As this member is only used for frontend connections, it shared the same union as the session_list reserved for backend connections.	2021-05-05 14:39:23 +02:00
Amaury Denoyelle	efc6e95642	MEDIUM: mux_h1: release idling frontend conns on soft-stop In h1_process, if the proxy of a frontend connection is disabled, release the connection. This commit is in preparation to properly close idling front connections on soft-stop. h1_process must still be called, this will be done via a dedicated task which monitors the global variable stopping.	2021-05-05 14:35:36 +02:00
Amaury Denoyelle	3109ccfe70	MINOR: srv: close all idle connections on shutdown Implement a function to close all server idle connections. This function is called via a global deinit server handler. The main objective is to prevents from leaving sockets in TIME_WAIT state. To limit the set of operations on shutdown and prevents tasks rescheduling, only the ctrl stack closing is done.	2021-05-05 14:33:51 +02:00
Willy Tarreau	1ab6c0bfd2	MINOR: pools/debug: slightly relax DEBUG_DONT_SHARE_POOLS The purpose of this debugging option was to prevent certain pools from masking other ones when they were shared. For example, task, http_txn, h2s, h1s, h1c, session, fcgi_strm, and connection are all 192 bytes and would normally be mergedi, but not with this option. The problem is that certain pools are declared multiple times with various parameters, which are often very close, and due to the way the option works, they're not shared either. Good examples of this are captures and stick tables. Some configurations have large numbers of stick-tables of pretty similar types and it's very common to end up with the following when the option is enabled: $ socat - /tmp/sock1 <<< "show pools" \| grep stick - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753800=56 - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753880=57 - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753900=58 - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753980=59 - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753a00=60 - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753a80=61 - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753b00=62 - Pool sticktables (224 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x753780=55 In addition to not being convenient, it can have important effects on the memory usage because these pools will not share their entries, so one stick table cannot allocate from another one's pool. This patch solves this by going back to the initial goal which was not to have different pools in the same list. Instead of masking the MAP_F_SHARED flag, it simply adds a test on the pool's name, and disables pool sharing if the names differ. This way pools are not shared unless they're of the same name and size, which doesn't hinder debugging. The same test above now returns this: $ socat - /tmp/sock1 <<< "show pools" \| grep stick - Pool sticktables (160 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 7 users, @0x3fadb30 [SHARED] - Pool sticktables (224 bytes) : 0 allocated (0 bytes), 0 used, needed_avg 0, 0 failures, 1 users, @0x3facaa0 [SHARED] This is much better. This should probably be backported, in order to limit the side effects of DEBUG_DONT_SHARE_POOLS being enabled in production.	2021-05-05 07:47:29 +02:00
Willy Tarreau	48129be18a	MINOR: debug: add a new "debug dev sym" command in expert mode This command attempts to resolve a pointer to a symbol name. This is convenient during development as it's easier to get such pointers live than by issuing a debugger or calling addr2line.	2021-05-05 07:47:29 +02:00
William Lallemand	5ba80d677d	BUG/MINOR: ssl/cli: fix a lock leak when no memory available This bug was introduced in `e5ff4ad` ("BUG/MINOR: ssl: fix a trash buffer leak in some error cases"). When cli_parse_set_cert() returns because alloc_trash_chunk() failed, it does not unlock the spinlock which can lead to a deadlock later. Must be backported as far as 2.1 where `e5ff4ad` was backported.	2021-05-04 16:40:44 +02:00
Willy Tarreau	18b2a9dd87	BUG/MEDIUM: cli: prevent memory leak on write errors Since the introduction of payload support on the CLI in 1.9-dev1 by commit `abbf60710` ("MEDIUM: cli: Add payload support"), a chunk is temporarily allocated for the CLI to support defragmenting a payload passed with a command. However it's only released when passing via the CLI_ST_END state (i.e. on clean shutdown), but not on errors. Something as trivial as: $ while :; do ncat --send-only -U /path/to/cli <<< "show stat"; done with a few hundreds of servers is enough see the number of allocated trash chunks go through the roof in "show pools". This needs to be backported as far as 2.0.	2021-05-04 16:27:45 +02:00
Christopher Faulet	c31b200872	BUG/MINOR: hlua: Don't rely on top of the stack when using Lua buffers When the lua buffers are used, a variable number of stack slots may be used. Thus we cannot assume that we know where the top of the stack is. It was not an issue for lua < 5.4.3 (at least for small buffers). But 'socket:receive()' now fails with lua 5.4.3 because a light userdata is systematically pushed on the top of the stack when a buffer is initialized. To fix the bug, in hlua_socket_receive(), we save the index of the top of the stack before creating the buffer. This way, we can check the number of arguments, regardless anything was pushed on the stack or not. Note that the other buffer usages seem to be safe. This patch should solve the issue #1240. It should be backport to all stable branches.	2021-05-03 10:34:48 +02:00
Willy Tarreau	29202013c1	CLEANUP: map/cli: properly align the map/acl help Due to extra options on some commands, the help started to become a bit of a mess, so let's realign all the commands.	2021-04-30 15:36:31 +02:00
Willy Tarreau	bb51c44d64	MINOR: map/acl: make "add map/acl" support an optional version number By passing a version number to "add map/acl", it becomes possible to atomically replace maps and ACLs. The principle is that a new version number is first retrieved by calling"prepare map/acl", and this version number is used with "add map" and "add acl". Newly added entries then remain invisible to the matching mechanism but are visible in "show map/acl" when the version number is specified, or may be cleard with "clear map/acl". Finally when the insertion is complete, a "commit map/acl" command must be issued, and the version is atomically updated so that there is no intermediate state with incomplete entries.	2021-04-30 15:36:31 +02:00
Willy Tarreau	7a562ca809	MINOR: map/acl: add the "commit map/acl" CLI command The command is used to atomically replace a map/acl with the pending contents of the designated version. The new version must have been allocated by "prepare map/acl" prior to this. At the moment it is not possible to force the version when adding new entries, so this may only be used to atomically clear an ACL/map.	2021-04-30 15:36:31 +02:00
Willy Tarreau	97218ce3a9	MINOR: map/acl: add the "prepare map/acl" CLI command This command allocates a new version for the map/acl, that will be usable later to prepare the addition of new values to atomically replace existing ones. Technically speaking the operation consists in atomically incrementing the next version. There's no "undo" operation here, if a version is not committed, it will automatically be trashed when committing a newer version.	2021-04-30 15:36:31 +02:00
Willy Tarreau	ff3feeb5cf	MINOR: map/acl: add the possibility to specify the version in "clear map/acl" This will ease maintenance of versionned maps by allowing to clear old or failed updates instead of the current version. Nothing was done to allow clearing everyhing, though if there was a need for this, implementing "@all" or something equivalent wouldn't require more than 3 lines of code.	2021-04-30 15:36:31 +02:00
Willy Tarreau	a13afe6535	MINOR: pattern: support purging arbitrary ranges of generations Instead of being able to purge only values older than a specific value, let's support arbitrary ranges and make pat_ref_purge_older() just be one special case of this one.	2021-04-30 15:36:31 +02:00
Willy Tarreau	95f753e403	MINOR: map/acl: add the possibility to specify the version in "show map/acl" The maps and ACLs internally all have two versions, the "current" one, which is the one being matched against, and the "next" one, the one being filled during an atomic replacement. Till now the "show" commands only used to show the current one but it can be convenient to be able to show other ones as well, so let's add the ability to do this with "show map" and "show acl". The method used here consists in passing the version number as "@<ver>" before the map/acl name or ID. It would have been better after it but that could create confusion with keys already using such a format.	2021-04-30 15:36:31 +02:00
Willy Tarreau	e3a42a6c2d	MINOR: map: show the current and next pattern version in "show map" The "show map" command wasn't updated when pattern generations were added for atomic reloads, let's report them in the "show map" command that lists all known maps. It will be useful for users.	2021-04-30 15:36:31 +02:00
Willy Tarreau	4053b03caa	MINOR: map: get rid of map_add_key_value() This function was only used once in cli_parse_add_map(), and half of the work it used to do was already known from the caller or testable outside of the lock. Given that we'll need to modify it soon to pass a generation number, let's remerge it in the caller instead, using pat_ref_load() which is the one we'll need.	2021-04-30 15:36:31 +02:00
Willy Tarreau	f7dd0e8796	CLEANUP: map: slightly reorder the add map function The function uses two distinct code paths for single the key/value pair and multiple pairs inserted as payload, each with a copy-paste of the error handling. Let's modify the loop to factor them out.	2021-04-30 15:36:31 +02:00
Amaury Denoyelle	eafd701dc5	MINOR: server: fix doc/trace on lb algo for dynamic server creation The text mentionned that only backends with consistent hash method were supported for dynamic servers. In fact, it is only required that the lb algorith is dynamic.	2021-04-29 14:59:42 +02:00
Willy Tarreau	7e702d13f4	CLEANUP: hlua: rename hlua_appctx* appctx to luactx There is some serious confusion in the lua interface code related to sockets and services coming from the hlua_appctx structs being called "appctx" everywhere, and where the real appctx is reached using appctx->appctx. This part is a bit of a pain to debug so let's rename all occurrences of this local variable to "luactx".	2021-04-28 17:59:21 +02:00
Willy Tarreau	b4476c6a8c	CLEANUP: freq_ctr: make arguments of freq_ctr_total() const freq_ctr_total() doesn't modify the freq counters, it should take a const argument.	2021-04-28 17:44:37 +02:00
Willy Tarreau	fe16126acc	BUG/MEDIUM: time: fix updating of global_now upon clock drift During commit `7e4a557f6` ("MINOR: time: change the global timeval and the the global tick at once") the approach made sure that the new now_ms was always higher than or equal to global_now_ms, but by forgetting the old value. This can cause the first update to global_now_ms to fail if it's already out of sync, going back into the loop, and the subsequent call would then succeed due to commit `4d01f3dcd` ("MINOR: time: avoid overwriting the same values of global_now"). And if it goes out of sync, it will fail to update forever, as observed by Ashley Penney in github issue #1194, causing incorrect freq counters calculations everywhere. One possible trigger for this issue is one thread spinning for a few milliseconds while the other ones continue to work. The issue really is that old_now_ms ought not to be modified in the loop as it's used for the CAS. But we don't need to structurally guarantee that global_now_ms grows monotonically as it's computed from the new global_now which is already verified for this via the __tv_islt() test. Thus, dropping any corrections on global_now_ms in the loop is the correct way to proceed as long as this one is always updated to follow global_now. No backport is needed, this is only for 2.4-dev.	2021-04-28 17:43:55 +02:00
Emeric Brun	ccdfbae62c	MINOR: peers: add informative flags about resync process for debugging This patch adds miscellenous informative flags raised during the initial full resync process performed during the reload for debugging purpose. 0x00000010: Timeout waiting for a full resync from a local node 0x00000020: Timeout waiting for a full resync from a remote node 0x00000040: Session aborted learning from a local node 0x00000080: Session aborted learning from a remote node 0x00000100: A local node teach us and was fully up to date 0x00000200: A remote node teach us and was fully up to date 0x00000400: A local node teach us but was partially up to date 0x00000800: A remote node teach us but was partially up to date 0x00001000: A local node was assigned for a full resync 0x00002000: A remote node was assigned for a full resync 0x00004000: A resync was explicitly requested This patch could be backported on any supported branch	2021-04-28 14:23:10 +02:00
Emeric Brun	1a6b43e13e	BUG/MEDIUM: peers: reset tables stage flags stages on new conns Flags used as context to know current status of each table pushing a full resync to a peer were correctly reset receiving a new resync request or confirmation message but in case of local peer sync during reload the resync request is implicit and those flags were not correctly reset in this case. This could result to a partial initial resync of some tables after reload if the connection with the old process was broken and retried. This patch reset those flags at the end of the handshake for all new connections to be sure to push a entire full resync if needed. This patch should be backported on all supported branches ( v >= 1.6 )	2021-04-28 14:23:10 +02:00
Emeric Brun	8e7a13ed66	BUG/MEDIUM: peers: re-work updates lookup during the sync on the fly Only entries between the opposite of the last 'local update' rotating counter were considered to be pushed. This processing worked in most cases because updates are continually pushed trying to reach this point but it remains some cases where updates id are more far away in the past and appearing in futur and the push of updates is stuck until the head reach again the tail which could take a very long time. This patch re-work the lookup to consider that all positions on the rotating counter is considered in the past until we reach exactly the 'local update' value. Doing this, the updates push won't be stuck anymore. This patch should be backported on all supported branches ( >= 1.6 )	2021-04-28 14:23:10 +02:00
Emeric Brun	cc9cce9351	BUG/MEDIUM: peers: reset commitupdate value in new conns The commitupdate value of the table is used to check if the update is still pending for a push for all peers. To be sure to not miss a push we reset it just after a handshake success. This patch should be backported on all supported branches ( >= 1.6 )	2021-04-28 14:23:10 +02:00
Emeric Brun	d9729da982	BUG/MEDIUM: peers: reset starting point if peers appears longly disconnected If two peers are disconnected and during this period they continue to process a large amount of local updates, after a reconnection they may take a long time before restarting to push their updates. because the last pushed update would appear internally in futur. This patch fix this resetting the cursor on acked updates at the maximum point considered in the past if it appears in futur but it means we may lost some updates. A clean fix would be to update the protocol to be able to signal a remote peer that is was not updated for a too long period and needs a full resync but this is not yet supported by the protocol. This patch should be backported on all supported branches ( >= 1.6 )	2021-04-28 14:23:10 +02:00
Emeric Brun	b0d60bed36	BUG/MEDIUM: peers: stop considering ack messages teaching a full resync The re-con cursor was updated receiving any ack message even if we are pushing a complete resync to a peer. This cursor is reset at the end of the resync but if the connection is broken during resync, we could re-start at an unwanted point. With this patch, the peer stops to consider ack messages pushing a resync since the resync process has is own acknowlegement and is always restarted from the beginning in case of broken connection. This patch should be backported on all supported branches ( >= 1.6 )	2021-04-28 14:23:10 +02:00
Emeric Brun	437e48ad92	BUG/MEDIUM: peers: register last acked value as origin receiving a resync req Receiving a resync request, the origins to start the full sync and to reset after the full resync are mistakenly computed based on the last update on the table instead of computed based on the the last update acked by the node requesting the resync. It could result in disordered or missing updates pushing to the requester This patch sets correctly those origins. This patch should be backported on all supported branches ( >= 1.6 )	2021-04-28 14:23:10 +02:00
Emeric Brun	2c4ab41816	BUG/MEDIUM: peers: initialize resync timer to get an initial full resync If a reload is performed and there is no incoming connections from the old process to push a full resync, the new process can be stuck waiting indefinitely for this conn and it never tries a fallback requesting a full resync from a remote peer because the resync timer was init to TICK_ETERNITY. This patch forces a reset of the resync timer to default value (5 secs) if we detect value is TICK_ETERNITY. This patch should be backported on all supported branches ( >= 1.6 )	2021-04-28 14:23:10 +02:00
Willy Tarreau	8a022d5049	MINOR: config: add a new "default-path" global directive By default haproxy loads all files designated by a relative path from the location the process is started in. In some circumstances it might be desirable to force all relative paths to start from a different location just as if the process was started from such locations. This is what this directive is made for. Technically it will perform a temporary chdir() to the designated location while processing each configuration file, and will return to the original directory after processing each file. It takes an argument indicating the policy to use when loading files whose path does not start with a slash ('/'). A few options are offered, "current" (the default), "config" (files relative to config file's dir), "parent" (files relative to config file's parent dir), and "origin" with an absolute path. This should address issue #1198.	2021-04-28 11:30:13 +02:00
Willy Tarreau	da543e130c	CLEANUP: cfgparse: de-uglify early file error handling in readcfgfile() In readcfgfile() when malloc() fails to allocate a buffer for the config line, it currently says "parsing[<file>]: out of memory" while the error is unrelated to the config file and may make one think it has to do with the file's size. The second test (fopen() returning error) needs to release the previously allocated line. Both directly return -1 which is not even documented as a valid error code for the function. Let's simply make sure that the few variables freed at the end are properly preset, and jump there upon error, after having displayed a meaningful error message. Now at least we can get this: $ ./haproxy -f /dev/kmem [NOTICE] 116/191904 (23233) : haproxy version is 2.4-dev17-c3808c-13 [NOTICE] 116/191904 (23233) : path to executable is ./haproxy [ALERT] 116/191904 (23233) : Could not open configuration file /dev/kmem : Permission denied	2021-04-28 11:21:32 +02:00
Christopher Faulet	925abdfdac	BUG/MEDIUM: mux-h2: Handle EOM flag when sending a DATA frame with zero-copy When a DATA frame is sent, we must take care to properly detect the EOM flag on the HTX message to set ES flag on the frame when necessary, to finish the stream. But it is only done when data are copied from the HTX message to the mux buffer and not when the frame are sent via a zero-copy. This patch fixes this bug. It is a 2.4-specific bug. No backport is needed.	2021-04-28 11:08:35 +02:00
Christopher Faulet	bd878d2c73	BUG/MINOR: hlua: Don't consume headers when starting an HTTP lua service When an HTTP lua service is started, headers are consumed before calling the script. When it was initialized, the headers were stored in a lua array, thus they can be removed from the HTX message because the lua service will no longer access them. But it is a problem with bodyless messages because the EOM flag is lost. Indeed, once the headers are consumed, the message is empty and the buffer is reset, included the flags. Now, the headers are not immediately consumed. We will skip them if applet:receive() or applet:getline(). This way, the EOM flag is preserved. At the end, when the script is finished, all output data are consumed, thus this remains safe. It is a 2.4-specific bug. No backport is needed.	2021-04-28 11:05:05 +02:00
Christopher Faulet	1eedf9b4cb	BUG/MINOR: applet: Notify the other side if data were consumed by an applet If an applet consumed output data (the amount of output data has changed between before and after the call to the applet), the producer is notified. It means CF_WRITE_PARTIAL and CF_WROTE_DATA are set on the output channel and the opposite stream interface is notified some room was made in its input buffer. This way, it is no longer the applet responsibility to take care of it. However, it doesn't matter if the applet does the same. Said like that, it looks like an improvement not a bug. But it really fixes a bug in the lua, for HTTP applets. Indeed, applet:receive() and applet:getline() are buggy for HTTP applets. Data are consumed but the producer is not notified. It means if the payload is not fully received in one time, the applet may be blocked because the producer remains blocked (it is time dependent). This patch must be backported as far as 2.0 (only for the HTX part).	2021-04-28 10:51:08 +02:00
Christopher Faulet	f506d96839	MEDIUM: http-ana: handle read error on server side if waiting for response A read error on the server side is also reported as a write error on the client side. It means some times, a server side error is handled on the client side. Among others, it is the case when the client side is waiting for the response while the request processing is already finished. In this case, the error is not handled as a server error. It is not accurate. So now, when the request processing is finished but not the response processing and if a read error was encountered on the server side, the error is not immediatly processed on the client side, to let a chance to response analysers to properly catch the error.	2021-04-28 10:51:08 +02:00
Christopher Faulet	3d87558f35	BUG/MINOR: mux-h2: Don't encroach on the reserve when decoding headers Since the input buffer is transferred to the stream when it is created, there is no longer control on the request size to be sure the buffer's reserve is still respected. It was automatically performed in h2_rcv_buf() because the caller took care to provide the correct available space in the buffer. The control is still there but it is no longer applied on the request headers. Now, we should take care of the reserve when the headers are decoded, before the stream creation. The test is performed for the request and the response. It is a 2.4-specific bug. No backport is needed.	2021-04-28 10:51:08 +02:00
Christopher Faulet	2b78f0bfc4	CLEANUP: htx: Remove unsued hdrs_bytes field from the HTX start-line Thanks to the htx_xfer_blks() refactoring, it is now possible to remove hdrs_bytes field from the start-line because no function rely on it anymore.	2021-04-28 10:51:08 +02:00
Christopher Faulet	c92ec0ba71	MEDIUM: htx: Refactor htx_xfer_blks() to not rely on hdrs_bytes field It is the only function using the hdrs_bytes start-line field. Thus the function has been refactored to no longer rely on it. To do so, we first copy HTX blocks to the destination message, without removing them from the source message. If the copy is interrupted on headers or trailers, we roll back. Otherwise, data are drained from the source buffer. Most of time, the copy will succeeds. So the roll back is only performed in the worst but very rare case.	2021-04-28 10:51:08 +02:00
Christopher Faulet	5e9b24f4b4	BUG/MINOR: htx: Preserve HTX flags when draining data from an HTX message When all data of an HTX message are drained, we rely on htx_reset() to reinit the message state. However, the flags must be preserved. It is, among other things, important to preserve processing or parsing errors. This patch must be backported as far as 2.0.	2021-04-27 22:57:46 +02:00
Amaury Denoyelle	8f685c11e0	BUG/MEDIUM: cpuset: fix build on MacOS The compilation fails due to the following commit: `fc6ac53dca` BUG/MAJOR: fix build on musl with cpu_set_t support The new global variable cpu_map conflicted with a local variable of the same name in the code path for the apple platform when setting the process affinity. This does not need to be backported.	2021-04-27 16:49:35 +02:00
Amaury Denoyelle	fc6ac53dca	BUG/MAJOR: fix build on musl with cpu_set_t support Move cpu_map structure outside of the global struct to a global variable defined in cpuset.c compilation unit. This allows to reorganize the includes without having to define _GNU_SOURCE everywhere for the support of the cpu_set_t. This fixes the compilation with musl libc, most notably used for the alpine based docker image. This fixes the github issue #1235. No need to backport as this feature is new in the current 2.4-dev.	2021-04-27 14:11:26 +02:00
Remi Tricot-Le Breton	43899ec83d	BUG/MINOR: ssl: ssl_sock_prepare_ssl_ctx does not return an error code The return value check was wrongly based on error codes when the function actually returns an error number. This bug was introduced by `f3eedfe195` which is a feature not present before branch 2.4. It does not need to be backported.	2021-04-26 15:57:26 +02:00
Ilya Shipitsin	b2be9a1ea9	CLEANUP: assorted typo fixes in the code and comments This is 22nd iteration of typo fixes	2021-04-26 10:42:58 +02:00
Christopher Faulet	df3db630e4	REORG: htx: Inline htx functions to add HTX blocks in a message The HTX functions used to add new HTX blocks in a message have been moved to the header file to inline them in calling functions. These functions are small enough.	2021-04-26 10:24:57 +02:00
Christopher Faulet	fb38c910f8	BUG/MINOR: mux-fcgi: Don't send normalized uri to FCGI application A normalized URI is the internal term used to specify an URI is stored using the absolute format (scheme + authority + path). For now, it is only used for H2 clients. It is the default and recommended format for H2 request. However, it is unusual for H1 servers to receive such URI. So in this case, we only send the path of the absolute URI. It is performed for H1 servers, but not for FCGI applications. This patch fixes the difference. Note that it is not a real bug, because FCGI applications should support abosolute URI. Note also a normalized URI is only detected for H2 clients when a request is received. There is no such test on the H1 side. It means an absolute URI received from an H1 client will be sent without modification to an H1 server or a FCGI application. To make it possible, a dedicated function has been added to get the H1 URI. This function is called by the H1 and the FCGI multiplexer when a request is sent to a server. This patch should fix the issue #1232. It must be backported as far as 2.2.	2021-04-26 10:23:18 +02:00
Tim Duesterhus	2e4a18e04a	MINOR: uri_normalizer: Add a `percent-decode-unreserved` normalizer This normalizer decodes percent encoded characters within the RFC 3986 unreserved set. See GitHub Issue #714.	2021-04-23 19:43:45 +02:00
Willy Tarreau	07bf21cdcb	BUG/MEDIUM: config: fix missing initialization in numa_detect_topology() The error path of the NUMA topology detection introduced in commit `b56a7c89a` ("MEDIUM: cfgparse: detect numa and set affinity if needed") lacks an initialization resulting in possible crashes at boot. No backport is needed since that was introduced in 2.4-dev.	2021-04-23 19:09:16 +02:00
Emeric Brun	2cc201f97e	BUG/MEDIUM: peers: re-work refcnt on table to protect against flush In proxy.c, when process is stopping we try to flush tables content using 'stktable_trash_oldest'. A check on a counter "table->syncing" was made to verify if there is no pending resync in progress. But using multiple threads this counter can be increased by an other thread only after some delay, so the content of some tables can be trashed earlier and won't be pushed to the new process (after reload, some tables appear reset and others don't). This patch re-names the counter "table->syncing" to "table->refcnt" and the counter is increased during configuration parsing (registering a table to a peer section) to protect tables during runtime and until resync of a new process has succeeded or failed. The inc/dec operations are now made using atomic operations because multiple peer sections could refer to the same table in futur. This fix addresses github #1216. This patch should be backported on all branches multi-thread support (v >= 1.8)	2021-04-23 18:03:06 +02:00
Emeric Brun	cbfe5ebc1c	BUG/MEDIUM: peers: re-work connection to new process during reload. The peers task handling the "stopping" could wake up multiple times in stopping state with WOKEN_SIGNAL: the connection to the local peer initiated on the first processing was immediatly shutdown by the next processing of the task and the old process exits considering it is unable to connect. It results on empty stick-tables after a reload. This patch checks the flag 'PEERS_F_DONOTSTOP' to know if the signal is considered and if remote peers connections shutdown is already done or if a connection to the local peer must be established. This patch should be backported on all supported branches (v >= 1.6)	2021-04-23 18:03:06 +02:00
Emeric Brun	1675ada4f4	BUG/MINOR: peers: remove useless table check if initial resync is finished The old process checked each table resync status even if the resync process is finished. This behavior had no known impact except useless processing and was discovered during debugging on an other issue. This patch could be backported in all supported branches (v >= 1.6) but once again, it has no impact except avoid useless processing.	2021-04-23 18:03:06 +02:00
Willy Tarreau	1f9e11e7f0	CLEANUP: time: use __tv_to_ms() in tv_update_date() instead of open-coding Instead of calculating the current date in milliseconds by hand, let's use __tv_to_ms() which was made exactly for this purpose.	2021-04-23 18:03:06 +02:00
Willy Tarreau	4d01f3dcdc	MINOR: time: avoid overwriting the same values of global_now In tv_update_date(), we calculate the new global date based on the local one. It's very likely that other threads will end up with the exact same now_ms date (at 1 million wakeups/s it happens 99.9% of the time), and even the microsecond was measured to remain unchanged ~70% of the time with 16 threads, simply because sometimes another thread already updated a more recent version of it. In such cases, performing a CAS to the global variable requires a cache line flush which brings nothing. By checking if they're changed before writing, we can divide by about 6 the number of writes to the global variables, hence the overall contention. In addition, it's worth noting that all threads will want to update at the same time, so let's place a cpu relax call before trying again, this will spread attempts apart.	2021-04-23 18:03:06 +02:00
Willy Tarreau	481795de13	MINOR: time: avoid unneeded updates to now_offset The time adjustment is very rare, even at high pool rates. Tests show that only 0.2% of tv_update_date() calls require a change of offset. Such concurrent writes to a shared variable have an important impact on future loads, so let's only update the variable if it changed.	2021-04-23 18:03:06 +02:00
Amaury Denoyelle	a6f9c5d2a7	BUG/MINOR: cpuset: fix compilation on platform without cpu affinity The compilation is currently broken on platform without USE_CPU_AFFINITY set. An error has been reported by the cygwin build of the CI. This does not need to be backported. In file included from include/haproxy/global-t.h:27, from include/haproxy/global.h:26, from include/haproxy/fd.h:33, from src/ev_poll.c:22: include/haproxy/cpuset-t.h:32:3: error: #error "No cpuset support implemented on this platform" 32 \| # error "No cpuset support implemented on this platform" \| ^~~~~ include/haproxy/cpuset-t.h:37:2: error: unknown type name ‘CPUSET_REPR’ 37 \| CPUSET_REPR cpuset; \| ^~~~~~~~~~~ make: * [Makefile:944: src/ev_poll.o] Error 1 make: * Waiting for unfinished jobs.... In file included from include/haproxy/global-t.h:27, from include/haproxy/global.h:26, from include/haproxy/fd.h:33, from include/haproxy/connection.h:30, from include/haproxy/ssl_sock.h:27, from src/ssl_sample.c:30: include/haproxy/cpuset-t.h:32:3: error: #error "No cpuset support implemented on this platform" 32 \| # error "No cpuset support implemented on this platform" \| ^~~~~ include/haproxy/cpuset-t.h:37:2: error: unknown type name ‘CPUSET_REPR’ 37 \| CPUSET_REPR cpuset; \| ^~~~~~~~~~~ make: *** [Makefile:944: src/ssl_sample.o] Error 1	2021-04-23 17:04:24 +02:00
Amaury Denoyelle	c5ed1f9d87	BUG/MINOR: haproxy: fix compilation on macOS Fix the warning treated as error on the CI for the macOS compilation : "src/haproxy.c:2939:23: error: unused variable 'set' [-Werror,-Wunused-variable]" This does not need to be backported.	2021-04-23 16:41:22 +02:00
Amaury Denoyelle	0f50cb9c73	MINOR: global: add option to disable numa detection Render numa detection optional with a global configuration statement 'no numa-cpu-mapping'. This can be used if the applied affinity of the algorithm is not optimal. Also complete the documentation with this new keyword.	2021-04-23 16:06:49 +02:00
Amaury Denoyelle	b56a7c89a8	MEDIUM: cfgparse: detect numa and set affinity if needed On process startup, the CPU topology of the machine is inspected. If a multi-socket CPU machine is detected, automatically define the process affinity on the first node with active cpus. This is done to prevent an impact on the overall performance of the process in case the topology of the machine is unknown to the user. This step is not executed in the following condition : - a non-null nbthread statement is present - a restrictive 'cpu-map' statement is present - the process affinity is already restricted, for example via a taskset call For the record, benchmarks were executed on a machine with 2 CPUs Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz. In both clear and ssl scenario, the performance were sub-optimal without the automatic rebinding on a single node.	2021-04-23 16:06:49 +02:00
Amaury Denoyelle	a80823543c	MINOR: cfgparse: support the comma separator on parse_cpu_set Allow to specify multiple cpu ids/ranges in parse_cpu_set separated by a comma. This is optional and must be activated by a parameter. The comma support is disabled for the parsing of the 'cpu-map' config statement. However, it will be useful to parse files in sysfs when inspecting the cpus topology for NUMA automatic process binding.	2021-04-23 16:06:49 +02:00
Amaury Denoyelle	4c9efdecf5	MINOR: thread: implement the detection of forced cpu affinity Create a function thread_cpu_mask_forced. Its purpose is to report if a restrictive cpu mask is active for the current proces, for example due to a taskset invocation. It is only implemented for the linux platform currently.	2021-04-23 16:06:49 +02:00
Amaury Denoyelle	982fb53390	MEDIUM: config: use platform independent type hap_cpuset for cpu-map Use the platform independent type hap_cpuset for the cpu-map statement parsing. This allow to address CPU index greater than LONGBITS. Update the documentation to reflect the removal of this limit except for platforms without cpu_set_t type or equivalent.	2021-04-23 16:06:49 +02:00
Amaury Denoyelle	c90932bc8e	MINOR: cfgparse: use hap_cpuset for parse_cpu_set Replace the unsigned long parameter by a hap_cpuset. This allows to address CPU with index greater than LONGBITS. This function is used to parse the 'cpu-map' statement. However at the moment, the result is casted back to a long to store it in the global structure. The next step is to replace ulong in in cpu_map in the global structure with hap_cpuset.	2021-04-23 16:06:49 +02:00
Amaury Denoyelle	f75c640f7b	MINOR: cpuset: define a platform-independent cpuset type This module can be used to manipulate a cpu sets in a platform agnostic way. Use the type cpu_set_t/cpuset_t if available on the platform, or fallback to unsigned long, which limits de facto the maximum cpu index to LONGBITS.	2021-04-23 16:06:49 +02:00
Christopher Faulet	de9d605aa5	BUG/MEDIUM: mux-h2: Properly handle shutdowns when received with data The H2_CF_RCVD_SHUT flag is used to report a read0 was encountered. It is used by the H2 mux to properly handle shutdowns. However, this flag is only set when no data are received. If it is detected at the socket level when some data are received, it is not handled. And because the event was reported on the connection, any other read attempts are blocked. In this case, we are unable to close the connection and release the mux immediately. We must wait the mux timeout expires. This patch should fix the issue #1231. It must be backported as far as 2.0.	2021-04-23 15:42:39 +02:00
Willy Tarreau	5e65f4276b	CLEANUP: compression: remove calls to SLZ init functions As we now embed the library we don't need to support the older 1.0 API any more, so we can remove the explicit calls to slz_make_crc_table() and slz_prepare_dist_table().	2021-04-22 16:11:19 +02:00
Willy Tarreau	12840be005	BUILD: compression: switch SLZ from out-of-tree to in-tree Now that SLZ is merged, let's update the makefile and compression files to use it. As a result, SLZ_INC and SLZ_LIB are neither defined nor used anymore. USE_SLZ is enabled by default ("USE_SLZ=default") and can be disabled by passing "USE_SLZ=" or by enabling USE_ZLIB=1. The doc was updated to reflect the changes.	2021-04-22 16:08:25 +02:00
Willy Tarreau	ab2b7828e2	IMPORT: slz: import slz into the tree SLZ is rarely packaged by distros and there have been complaints about the CPU and memory usage of ZLIB, leading to some suggestions to better address the issue by simply integrating SLZ into the tree (just 3 files). See discussions below: https://www.mail-archive.com/haproxy@formilux.org/msg38037.html https://www.mail-archive.com/haproxy@formilux.org/msg40079.html https://www.mail-archive.com/haproxy@formilux.org/msg40365.html This patch does just this, after minor adjustments to these files: - tables.h was renamed to slz-tables.h - tables.h had the precomputed tables removed since not used here - slz.c uses includes <import/slz> instead of "slz.h" The slz commit imported here was b06c172 ("slz: avoid a build warning with -Wimplicit-fallthrough"). No other change was performed either to SLZ nor to haproxy at this point so that this operation may be replicated if needed for a future version.	2021-04-22 15:50:41 +02:00
William Lallemand	aba7f8b313	BUG/MINOR: mworker: don't use oldpids[] anymore for reload Since commit `3f12887` ("MINOR: mworker: don't use children variable anymore"), the oldpids array is not used anymore to generate the new -sf parameters. So we don't need to set nb_oldpids to 0 during the first start of the master process. This patch fixes a bug when 2 masters process tries to synchronize their peers, there is a small chances that it won't work because nb_oldpids equals 0. Should be backported as far as 2.0.	2021-04-21 16:55:34 +02:00
William Lallemand	ea6bf83d62	BUG/MINOR: mworker/init: don't reset nb_oldpids in non-mworker cases This bug affects the peers synchronisation code which rely on the nb_oldpids variable to synchronize the peer from the old PID. In the case the process is not started in master-worker mode and tries to synchronize using the peers, there is a small chance that won't work because nb_oldpids equals 0. Fix the bug by setting the variable to 0 only in the case of the master-worker when not reloaded. It could also be a problem when trying to synchronize the peers between 2 masters process which should be fixed in another patch. Bug exists since commit `8a361b5` ("BUG/MEDIUM: mworker: don't reuse PIDs passed to the master"). Sould be backported as far as 1.8.	2021-04-21 16:42:18 +02:00
Amaury Denoyelle	a2944ecf5d	MINOR: config: add a diag for invalid cpu-map statement If a cpu-statement is refering to multiple processes and threads, it is silently ignored. Add a diag message to report it to the user.	2021-04-21 15:18:57 +02:00
Amaury Denoyelle	af02c57406	BUG/MEDIUM: config: fix cpu-map notation with both process and threads The application of a cpu-map statement with both process and threads is broken (P-Q/1 or 1/P-Q notation). For example, before the fix, when using P-Q/1, proc_t1 would be updated. Then it would be AND'ed with thread which is still 0 and thus does nothing. Another problem is when using 1/1[-Q], thread[0] is defined. But if there is multiple processes, every processes will use this define affinity even if it should be applied only to 1st process. The solution to the fix is a little bit too complex for my taste and there is maybe a simpler solution but I did not wish to break the storage of global.cpu_map, as it is quite painful to test all the use-cases. Besides, this code will probably be clean up when multiprocess support removed on the future version. Let's try to explain my logic. * either haproxy runs in multiprocess or multithread mode. If on multiprocess, we should consider proc_t1 (P-Q/1 notation). If on multithread, we should consider thread (1/P-Q notation). However during parsing, the final number of processes or threads is unknown, thus we have to consider the two possibilities. * there is a special case for the first thread / first process which is present in both execution modes. And as a matter of fact cpu-map 1 or 1/1 notation represents the same thing. Thus, thread[0] and proc_t1[0] represents the same thing. To solve this problem, only thread[0] is used for this special case. This fix must be backported up to 2.0.	2021-04-21 15:18:57 +02:00
Maximilian Mader	ff3bb8b609	MINOR: uri_normalizer: Add a `strip-dot` normalizer This normalizer removes "/./" segments from the path component. Usually the dot refers to the current directory which renders those segments redundant. See GitHub Issue #714.	2021-04-21 12:15:14 +02:00
Maximilian Mader	c9c79570d4	CLEANUP: uri_normalizer: Remove trailing whitespace This patch removes a single trailing space.	2021-04-21 12:15:14 +02:00
Maximilian Mader	11f6f85c4b	BUG/MINOR: uri_normalizer: Use delim parameter when building the sorted query in uri_normalizer_query_sort Currently the delimiter is hardcoded as ampersand (&) but the function takes the delimiter as a paramter. This patch replaces the hardcoded ampersand with the given delimiter.	2021-04-21 12:15:14 +02:00
Christopher Faulet	cb1847c772	BUG/MEDIUM: mux-h2: Fix dfl calculation when merging CONTINUATION frames When header are splitted over several frames, payload of HEADERS and CONTINUATION frames are merged to form a unique HEADERS frame before decoding the payload. To do so, info about the current frame are updated (dff, dfl..) with info of the next one. Here there is a bug when the frame length (dfl) is update. We must add the next frame length (hdr.dfl) and not only the amount of data found in the buffer (clen). Because HEADERS frames are decoded in one pass, dfl value is the whole frame length or 0. nothing intermediary. This patch must be backported as far as 2.0.	2021-04-21 12:13:12 +02:00
Christopher Faulet	07f88d7582	BUG/MAJOR: mux-h2: Properly detect too large frames when decoding headers In the function decoding payload of HEADERS frames, an internal error is returned if the frame length is too large. it cannot exceed the buffer size. The same is true when headers are splitted on several frames. The payload of HEADERS and CONTINUATION frames are merged and the overall size must not exceed the buffer size. However, there is a bug when the current frame is big enough to only have the space for a part of the header of the next frame. Because, in this case, we wait for more data, to have the whole frame header. We don't properly detect that the headers are too large to be stored in one buffer. In fact the test to trigger this error is not accurate. When the buffer is full, the error is reported if the frame length exceeds the amount of data in the buffer. But in reality, an error must be reported when we are unable to decode the current frame while the buffer is full. Because, in this case, we know there is no way to change this state. When the bug happens, the H2 connection is woken up in loop, consumming all the CPU. But the traffic is not blocked for all that. This patch must be backported as far as 2.0.	2021-04-21 12:13:12 +02:00
Amaury Denoyelle	d6b4b6da3f	BUG/MINOR: server: fix potential null gcc error in delete server gcc still reports a potential null pointer dereference in delete server function event with a BUG_ON before it. Remove the misleading NULL check in the for loop which should never happen. This does not need to be backported.	2021-04-21 12:02:30 +02:00
Amaury Denoyelle	e558043e13	MINOR: server: implement delete server cli command Implement a new CLI command 'del server'. It can be used to removed a dynamically added server. Only servers in maintenance mode can be removed, and without pending/active/idle connection on it. Add a new reg-test for this feature. The scenario of the reg-test need to first add a dynamic server. It is then deleted and a client is used to ensure that the server is non joinable. The management doc is updated with the new command 'del server'.	2021-04-21 11:00:31 +02:00
Amaury Denoyelle	d38e7fa233	MINOR: server: add log on dynamic server creation Add a notice log to report the creation of a new server. The log is printed at the end of the function.	2021-04-21 11:00:31 +02:00
Amaury Denoyelle	cece918625	BUG/MEDIUM: server: ensure thread-safety of server runtime creation cli_parse_add_server can be executed in parallel by several CLI instances and so must be thread-safe. The critical points of the function are : - server duplicate detection - insertion of the server in the proxy list The mode of operation has been reversed. The server is first instantiated and parsed. The duplicate check has been moved at the end just before the insertion in the proxy list, under the thread isolation. Thus, the thread safety is guaranteed and server allocation is kept outside of locks/thread isolation.	2021-04-21 11:00:30 +02:00
Amaury Denoyelle	d688e01032	BUG/MINOR: logs: free logsrv.conf.file on exit Config information has been added into the logsrv struct. The filename is duplicated and should be freed on exit. Introduced in the current release. This does not need to be backported.	2021-04-21 11:00:29 +02:00
Amaury Denoyelle	fb247946a1	BUG/MINOR: server: free srv.lb_nodes in free_server lb_nodes is allocated for servers using lb_chash (balance random or hash-type consistent). It can be backported up to 1.8.	2021-04-21 11:00:03 +02:00
Willy Tarreau	2b71810cb3	CLEANUP: lists/tree-wide: rename some list operations to avoid some confusion The current "ADD" vs "ADDQ" is confusing because when thinking in terms of appending at the end of a list, "ADD" naturally comes to mind, but here it does the opposite, it inserts. Several times already it's been incorrectly used where ADDQ was expected, the latest of which was a fortunate accident explained in `6fa922562` ("CLEANUP: stream: explain why we queue the stream at the head of the server list"). Let's use more explicit (but slightly longer) names now: LIST_ADD -> LIST_INSERT LIST_ADDQ -> LIST_APPEND LIST_ADDED -> LIST_INLIST LIST_DEL -> LIST_DELETE The same is true for MT_LISTs, including their "TRY" variant. LIST_DEL_INIT keeps its short name to encourage to use it instead of the lazier LIST_DELETE which is often less safe. The change is large (~674 non-comment entries) but is mechanical enough to remain safe. No permutation was performed, so any out-of-tree code can easily map older names to new ones. The list doc was updated.	2021-04-21 09:20:17 +02:00
Tim Duesterhus	3b9cdf1cb7	CLEANUP: sample: Use explicit return for successful `json_query`s Move the `return 1` into each of the cases, instead of relying on the single `return 1` at the bottom of the function.	2021-04-20 20:33:38 +02:00
Tim Duesterhus	8f3bc8ffca	CLEANUP: sample: Explicitly handle all possible enum values from mjson This makes it easier to find bugs, because -Wswitch can help us.	2021-04-20 20:33:34 +02:00
Tim Duesterhus	4809c8c955	CLEANUP: sample: Improve local variables in sample_conv_json_query This improves the use of local variables in sample_conv_json_query: - Use the enum type for the return value of `mjson_find`. - Do not use single letter variables. - Reduce the scope of variables that are only needed in a single branch. - Add missing newlines after variable declaration.	2021-04-20 20:33:31 +02:00
Willy Tarreau	dcb121fd9c	BUG/MINOR: server: make srv_alloc_lb() allocate lb_nodes for consistent hash The test in srv_alloc_lb() to allocate the lb_nodes[] array used in the consistent hash was incorrect, it wouldn't do it for consistent hash and could do it for regular random. No backport is needed as this was added for dynamic servers in 2.4-dev by commit `f99f77a50` ("MEDIUM: server: implement 'add server' cli command").	2021-04-20 11:39:54 +02:00
Willy Tarreau	942b89f7dc	BUILD: pools: fix build with DEBUG_FAIL_ALLOC Amaury noticed that I managed to break the build of DEBUG_FAIL_ALLOC for the second time with `207c09509` ("MINOR: pools: move the fault injector to __pool_alloc()"). The joy of endlessly reworking patch sets... No backport is needed, that was in the just merged cleanup series.	2021-04-19 18:36:48 +02:00
Willy Tarreau	b2a853d5f0	CLEANUP: pools: uninline pool_put_to_cache() This function has become too big (251 bytes) and is now hurting performance a lot, with up to 4% request rate being lost over the last pool changes. Let's move it to pool.c as a regular function. Other attempts were made to cut it in half but it's still inefficient. Doing this results in saving ~90kB of object code, and even 112kB since the pool changes, with code that is even slightly faster! Conversely, pool_get_from_cache(), which remains half of this size, is still faster inlined, likely in part due to the immediate use of the returned pointer afterwards.	2021-04-19 15:24:33 +02:00
Willy Tarreau	fa19d20ac4	MEDIUM: pools: make pool_put_to_cache() always call pool_put_to_local_cache() Till now it used to call it only if there were not too many objects into the local cache otherwise would send the latest one directly into the shared cache. Now it always sends to the local cache and it's up to the local cache to free its oldest objects. From a cache freshness perspective it's better this way since we always evict cold objects instead of hot ones. From an API perspective it's better because it will help make the shared cache invisible to the public API.	2021-04-19 15:24:33 +02:00
Willy Tarreau	87212036a1	MINOR: pools: evict excess objects using pool_evict_from_local_cache() Till now we could only evict oldest objects from all local caches using pool_evict_from_local_caches() until the cache size was satisfying again, but there was no way to evict excess objects from a single cache, which is the reason why pool_put_to_cache() used to refrain from putting into the local cache and would directly write to the shared cache, resulting in massive writes when caches were full. Let's add this new function now. It will stop once the number of objects in the local cache is no higher than 16+total/8 or the cache size is no more than 75% full, just like before. For now the function is not used.	2021-04-19 15:24:33 +02:00
Willy Tarreau	b8498e961a	MEDIUM: pools: make CONFIG_HAP_POOLS control both local and shared pools Continuing the unification of local and shared pools, now the usage of pools is governed by CONFIG_HAP_POOLS without which allocations and releases are performed directly from the OS using pool_alloc_nocache() and pool_free_nocache().	2021-04-19 15:24:33 +02:00
Willy Tarreau	45e4e28161	MINOR: pools: factor the release code into pool_put_to_os() There are two levels of freeing to the OS: - code that wants to keep the pool's usage counters updated uses pool_free_area() and handles the counters itself. That's what pool_put_to_shared_cache() does in the no-global-pools case. - code that does not want to update the counters because they were already updated only calls pool_free_area(). Let's extract these calls to establish the symmetry with pool_get_from_os() and pool_alloc_nocache(), resulting in pool_put_to_os() (which only updates the allocated counter) and pool_free_nocache() (which also updates the used counter). This will later allow to simplify the generic code.	2021-04-19 15:24:33 +02:00
Willy Tarreau	2b5579f6da	MINOR: pools: always use atomic ops to maintain counters A part of the code cannot be factored out because it still uses non-atomic inc/dec for pool->used and pool->allocated as these are located under the pool's lock. While it can make sense in terms of bus cycles, it does not make sense in terms of code normalization. Further, some operations were still performed under a lock that could be totally removed via the use of atomic ops. There is still one occurrence in pool_put_to_shared_cache() in the locked code where pool_free_area() is called under the lock, which must absolutely be fixed.	2021-04-19 15:24:33 +02:00
Willy Tarreau	13843641e5	MINOR: pools: split the OS-based allocator in two Now there's one part dealing with the allocation itself and keeping counters up to date, and another one on top of it to return such an allocated pointer to the user and update the use count and stats. This is in anticipation for being able to group cache-related parts. The release code is still done at once.	2021-04-19 15:24:33 +02:00
Willy Tarreau	207c095098	MINOR: pools: move the fault injector to __pool_alloc() Till now it was limited to objects allocated from the OS which means it had little use as soon as pools were enabled. Let's move it upper in the layers so that any code can benefit from fault injection. In addition this allows to pass a new flag POOL_F_NO_FAIL to disable it if some callers prefer a no-failure approach.	2021-04-19 15:24:33 +02:00
Willy Tarreau	20f88abad5	MINOR: pools: use cheaper randoms for fault injections ha_random() is quite heavy and uses atomic ops or even a lock on some architectures. Here we don't seek good randoms, just statistical ones, so let's use the statistical prng instead.	2021-04-19 15:24:33 +02:00
Willy Tarreau	635cced32f	CLEANUP: pools: rename __pool_free() to pool_put_to_shared_cache() Now the multi-level cache becomes more visible: pool_get_from_local_cache() pool_put_to_local_cache() pool_get_from_shared_cache() pool_put_to_shared_cache()	2021-04-19 15:24:33 +02:00
Willy Tarreau	8c77ee5ae5	CLEANUP: pools: rename pool__{from,to}_cache() to _local_cache() The functions were rightfully called from/to_cache when the thread-local cache was considered as the only cache, but this is getting terribly confusing. Let's call them from/to local_cache to make it clear that it is not related with the shared cache. As a side note, since pool_evict_from_cache() used not to work for a particular pool but for all of them at once, it was renamed to pool_evict_from_local_caches() (plural form).	2021-04-19 15:24:33 +02:00
Willy Tarreau	8fe726f118	CLEANUP: pools: re-merge pool_refill_alloc() and __pool_refill_alloc() They were strictly equivalent, let's remerge them and rename them to pool_alloc_nocache() as it's the call which performs a real allocation which does not check nor update the cache. The only difference in the past was the former taking the lock and not the second but now the lock is not needed anymore at this stage since the pool's list is not touched. In addition, given that the "avail" argument is no longer used by the function nor by its callers, let's drop it.	2021-04-19 15:24:33 +02:00
Willy Tarreau	eb3cc29622	MEDIUM: pools: unify pool_refill_alloc() across all models Now we don't loop anymore trying to refill multiple items at once, and an allocated object is directly returned to the requester instead of being stored into the shared pool. This has multiple benefits. The first one is that no locking is needed anymore on the allocation path and the second one is that the loop will no longer cause latency spikes.	2021-04-19 15:24:33 +02:00
Willy Tarreau	64383b8181	MINOR: pools: make the basic pool_refill_alloc()/pool_free() update needed_avg This is a first step towards unifying all the fallback code. Right now these two functions are the only ones which do not update the needed_avg rate counter since there's currently no shared pool kept when using them. But their code is similar to what could be used everywhere except for this one, so let's make them capable of maintaining usage statistics. As a side effect the needed field in "show pools" will now be populated.	2021-04-19 15:24:33 +02:00
Willy Tarreau	53a7fe49aa	MINOR: pools: enable the fault injector in all allocation modes The mem_should_fail() call enabled by DEBUG_FAIL_ALLOC used to be placed only in the no-cache version of the allocator. Now we can generalize it to all modes and remove the exclusive test on CONFIG_HAP_NO_GLOBAL_POOLS.	2021-04-19 15:24:33 +02:00
Willy Tarreau	2d6f628d34	MINOR: pools: rename CONFIG_HAP_LOCAL_POOLS to CONFIG_HAP_POOLS We're going to make the local pool always present unless pools are completely disabled. This means that pools are always enabled by default, regardless of the use of threads. Let's drop this notion of "local" pools and make it just "pool". The equivalent debug option becomes DEBUG_NO_POOLS instead of DEBUG_NO_LOCAL_POOLS. For now this changes nothing except the option and dropping the dependency on USE_THREAD.	2021-04-19 15:24:33 +02:00
Willy Tarreau	d5140e7c6f	MINOR: pool: remove the size field from pool_cache_head Everywhere we have access to the pool so we don't need to cache a copy of the pool's size into the pool_cache_head. Let's remove it.	2021-04-19 15:24:33 +02:00
Willy Tarreau	9f3129e583	MEDIUM: pools: move the cache into the pool header Initially per-thread pool caches were stored into a fixed-size array. But this was a bit ugly because the last allocated pools were not able to benefit from the cache at all. As a work around to preserve performance, a size of 64 cacheable pools was set by default (there are 51 pools at the moment, excluding any addon and debugging code), so all in-tree pools were covered, at the expense of higher memory usage. In addition an index had to be calculated for each pool, and was used to acces the pool cache head into that array. The pool index was not even stored into the pools so it was required to determine it to access the cache when the pool was already known. This patch changes this by moving the pool cache head into the pool head itself. This way it is certain that each pool will have its own cache. This removes the need for index calculation. The pool cache head is 32 bytes long so it was aligned to 64B to avoid false sharing between threads. The extra cost is not huge (~2kB more per pool than before), and we'll make better use of that space soon. The pool cache head contains the size, which should probably be removed since it's already in the pool's head.	2021-04-19 15:24:33 +02:00
Willy Tarreau	3e970b11eb	MINOR: pools: drop the unused static history of artificially failed allocs When building with DEBUG_FAIL_ALLOC we call a random generator to decide whether the pool alloc should succeed or fail, and there was a preliminary debugging mechanism to keep sort of a history of the previous decisions. But it was never used, enforces a lock during the allocation, and forces to use static variables, all of which are limiting the ability to pursue the pools cleanups with no real benefit. Let's get rid of them now.	2021-04-19 15:24:33 +02:00
Willy Tarreau	a5b229d01d	BUG/MINOR: pools/buffers: make sure to always reserve the required buffers Since recent commit ae07592 ("MEDIUM: pools: add CONFIG_HAP_NO_GLOBAL_POOLS and CONFIG_HAP_GLOBAL_POOLS") the pre-allocation of all desired reserved buffers was not done anymore on systems not using the shared cache. This basically has no practical impact since these ones will quickly be refilled by all the ones used at run time, but it may confuse someone checking if they're allocated in "show pools". That's only 2.4-dev, no backport is needed.	2021-04-19 15:24:33 +02:00
Willy Tarreau	932dd19cc3	BUG/MINOR: pools: maintain consistent ->allocated count on alloc failures When running with CONFIG_HAP_NO_GLOBAL_POOLS, it's theoritically possible to keep an incorrect count of allocated entries in a pool because the allocated counter was used as a cumulated counter of alloc calls instead of a number of currently allocated items (it's possible the meaning has changed over time). The only impact in this mode essentially is that "show pools" will report incorrect values. But this would only happen on limited pools, which is not even certain still exist. This was added by recent commit `0bae07592` ("MEDIUM: pools: add CONFIG_HAP_NO_GLOBAL_POOLS and CONFIG_HAP_GLOBAL_POOLS") so no backport is needed.	2021-04-19 15:24:33 +02:00
Tim Duesterhus	5be6ab269e	MEDIUM: http_act: Rename uri-normalizers This patch renames all existing uri-normalizers into a more consistent naming scheme: 1. The part of the URI that is being touched. 2. The modification being performed as an explicit verb.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	a407193376	MINOR: uri_normalizer: Add a `percent-upper` normalizer This normalizer uppercases the hexadecimal characters used in percent-encoding. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	d7b89be30a	MINOR: uri_normalizer: Add a `sort-query` normalizer This normalizer sorts the `&` delimited query parameters by parameter name. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	560e1a6352	MINOR: uri_normalizer: Add support for supressing leading `../` for dotdot normalizer This adds an option to supress `../` at the start of the resulting path.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	9982fc2bbd	MINOR: uri_normalizer: Add a `dotdot` normalizer to http-request normalize-uri This normalizer merges `../` path segments with the predecing segment, removing both the preceding segment and the `../`. Empty segments do not receive special treatment. The `merge-slashes` normalizer should be executed first. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	d371e99d1c	MINOR: uri_normalizer: Add a `merge-slashes` normalizer to http-request normalize-uri This normalizer merges adjacent slashes into a single slash, thus removing empty path segments. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	d2bedcc4ab	MINOR: uri_normalizer: Add `http-request normalize-uri` This patch adds the `http-request normalize-uri` action that was requested in GitHub issue #714. Normalizers will be added in the next patches.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	dbd25c34de	MINOR: uri_normalizer: Add uri_normalizer module This is in preparation for future patches.	2021-04-19 09:05:57 +02:00
Christopher Faulet	1d26f22e05	BUG/MINOR: logs: Report the true number of retries if there was no connection When the session is aborted before any connection attempt to any server, the number of connection retries reported in the logs is wrong. It happens because when the retries counter is not strictly positive, we consider the max number of retries was reached and the backend retries value is used. It is obviously wrong when no connectioh was performed. In fact, at this stage, the retries counter is initialized to 0. But the backend stream-interface is in the INI state. Once it is set to SI_ST_REQ, the counter is set to the backend value. And it is the only possible state transition from INI state. Thus it is safe to rely on it to fix the bug. This patch must be backported to all stable versions.	2021-04-19 08:52:17 +02:00
Christopher Faulet	a7d6cf24fb	BUG/MINOR: http_htx: Remove BUG_ON() from http_get_stline() function The http_get_stline() was designed to be called from HTTP analyzers. Thus before any data forwarding. To prevent any invalid usage, two BUG_ON() statements were added. However, it is not a good idea because it is pretty hard to be sure no HTTP sample fetch will never be called outside the analyzers context. Especially because there is at least one possible area where it may happens. An HTTP sample fetch may be used inside the unique-id format string. On the normal case, it is generated in AN_REQ_HTTP_INNER analyzer. But if an error is reported too early, the id is generated when the log is emitted. So, it is safer to remove the BUG_ON() statements and consider the normal behavior is to return NULL if the first block is not a start-line. Of course, this means all calling functions must test the return value or be sure the start-line is really there. This patch must be backported as far as 2.0.	2021-04-19 08:51:22 +02:00
Christopher Faulet	003df1cff9	MINOR: tcp_samples: Be able to call bc_src/bc_dst from the health-checks The new L4 sample fetches used to get source and destination info of the backend connection may now be called from an health-check.	2021-04-19 08:31:05 +02:00
Christopher Faulet	7d081f02a4	MINOR: tcp_samples: Add samples to get src/dst info of the backend connection This patch adds 4 new sample fetches to get the source and the destination info (ip address and port) of the backend connection : * bc_dst : Returns the destination address of the backend connection * bc_dst_port : Returns the destination port of the backend connection * bc_src : Returns the source address of the backend connection * bc_src_port : Returns the source port of the backend connection The configuration manual was updated accordingly.	2021-04-19 08:31:05 +02:00
Christopher Faulet	6f97a611c8	BUG/MINOR: http-fetch: Make method smp safe if headers were already forwarded When method sample fetch is called, if an exotic method is found (HTTP_METH_OTHER), when smp_prefetch_htx() is called, we must be sure the start-line is still there. Otherwise, HAproxy may crash because of a NULL pointer dereference, for instance if the method sample fetch is used inside a unique-id format string. Indeed, the unique id may be generated when the log message is emitted. At this stage, the request channel is empty. This patch must be backported as far as 2.0. But the bug exists in all stable versions for the legacy HTTP mode too. Thus it must be adapted to the legacy HTTP mode and backported to all other stable versions.	2021-04-19 08:31:05 +02:00
Christopher Faulet	4bef8d1d46	BUG/MINOR: ssl-samples: Fix ssl_bc_* samples when called from a health-check For all ssl_bc_* sample fetches, the test on the keyword when called from a health-check is inverted. We must be sure the 5th charater is a 'b' to retrieve a connection. This patch must be backported as far as 2.2.	2021-04-19 08:31:05 +02:00
Christopher Faulet	242f8ce060	MINOR: connection: Make bc_http_major compatible with tcp-checks bc_http_major sample fetch now works when it is called from a tcp-check. When it happens, the session origin is a check. The backend connection is retrieved from the conn-stream attached to the check. If required, this path may easily be backported as far as 2.2.	2021-04-19 08:31:05 +02:00
Christopher Faulet	f4dd9ae5c7	BUG/MINOR: connection: Fix fc_http_major and bc_http_major for TCP connections fc_http_major and bc_http_major sample fetches return the major digit of the HTTP version used, respectively, by the frontend and the backend connections, based on the mux. However, in reality, "2" is returned if the H2 mux is detected, otherwise "1" is inconditionally returned, regardless the mux used. Thus, if called for a raw TCP connection, "1" is returned. To fix this bug, we now get the multiplexer flags, if there is one, to be sure MX_FL_HTX is set. I guess it was made this way on purpose when the H2 multiplexer was introduced in the 1.8 and with the legacy HTTP mode there is no other solution at the connection level. Thus this patch should be backported as far as 2.2. For the 2.0, it must be evaluated first because of the legacy HTTP mode.	2021-04-19 08:24:38 +02:00
Christopher Faulet	fd81848c22	MINOR: logs: Add support of checks as session origin to format lf strings When a log-format string is built from an health-check, the session origin is the health-check itself and not a connection. In addition, there is no stream. It means for now some formats are not supported: %s, %sc, %b, %bi, %bp, %si and %sp. Thanks to this patch, the session origin is converted to a check. So it is possible to retrieve the backend and the backend connection. Note this session have no listener, thus %ft format must be guarded. This patch is light and standalone, thus it may be backported as far as 2.2 if required. However, because the error is human, it is probably better to wait a bit to be sure everything is properly protected.	2021-04-19 08:22:15 +02:00
Christopher Faulet	0f1fc23d4e	BUG/MINOR: checks: Set missing id to the dummy checks frontend The dummy frontend used to create the session of the tcp-checks is initialized without identifier. However, it is required because this id may be used without any guard, for instance in log-format string via "%f" or when fe_name sample fetch is called. Thus, an unset id may lead to crashes. This patch must be backported as far as 2.2.	2021-04-17 11:14:58 +02:00
Christopher Faulet	76b44195c9	MINOR: threads: Only consider running threads to end a thread harmeless period When a thread ends its harmeless period, we must only consider running threads when testing threads_want_rdv_mask mask. To do so, we reintroduce all_threads_mask mask in the bitwise operation (It was removed to fix a deadlock). Note that for now it is useless because there is no way to stop threads or to have threads reserved for another task. But it is safer this way to avoid bugs in the future.	2021-04-17 11:14:58 +02:00
Alex	51c8ad45ce	MINOR: sample: converter: Add json_query converter With the json_query can a JSON value be extacted from a header or body of the request and saved to a variable. This converter makes it possible to handle some JSON workload to route requests to different backends.	2021-04-15 17:07:03 +02:00
Alex	41007a6835	MINOR: sample: converter: Add mjson library. This library is required for the subsequent patch which adds the JSON query possibility. It is necessary to change the include statement in "src/mjson.c" because the imported includes in haproxy are in "include/import" orig: #include "mjson.h" new: #include <import/mjson.h>	2021-04-15 17:05:38 +02:00
Moemen MHEDHBI	848216f108	CLEANUP: sample: align samples list in sample.c	2021-04-13 17:28:22 +02:00
Moemen MHEDHBI	92f7d43c5d	MINOR: sample: add ub64dec and ub64enc converters ub64dec and ub64enc are the base64url equivalent of b64dec and base64 converters. base64url encoding is the "URL and Filename Safe Alphabet" variant of base64 encoding. It is also used in in JWT (JSON Web Token) standard. RFC1421 mention in base64.c file is deprecated so it was replaced with RFC4648 to which existing converters, base64/b64dec, still apply. Example: HAProxy: http-request return content-type text/plain lf-string %[req.hdr(Authorization),word(2,.),ub64dec] Client: Token=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJ1c2VyIjoiZm9vIiwia2V5IjoiY2hhZTZBaFhhaTZlIn0.5VsVj7mdxVvo1wP5c0dVHnr-S_khnIdFkThqvwukmdg $ curl -H "Authorization: Bearer ${TOKEN}" http://haproxy.local {"user":"foo","key":"chae6AhXai6e"}	2021-04-13 17:28:13 +02:00
Thayne McCombs	b28430591d	BUG/MEDIUM: sample: Fix adjusting size in field converter Adjust the size of the sample buffer before we change the "area" pointer. The change in size is calculated as the difference between the original pointer and the new start pointer. But since the `smp->data.u.str.area` assignment results in `smp->data.u.str.area` and `start` being the same pointer, we always ended up substracting zero. This changes it to change the size by the actual amount it changed. I'm not entirely sure what the impact of this is, but the previous code seemed wrong. [wt: from what I can see the only harmful case is when the output is converted to a stick-table key, it could result in zeroing past the end of the buffer; other cases do not touch beyond ->data]	2021-04-13 12:12:48 +02:00
Christopher Faulet	b15625a43b	MINOR: cfgparse/proxy: Group alloc error handling during proxy section parsing All allocation errors in cfg_parse_listen() are now handled in a unique place under the "alloc_error" label. This simplify a bit error handling in this function.	2021-04-12 22:04:19 +02:00
Christopher Faulet	b45a7d4b74	BUG/MINOR: cfgparse/proxy: Hande allocation errors during proxy section parsing At several places during the proxy section parsing, memory allocation was performed with no check. Result is now tested and an error is returned if the allocation fails. This patch may be backported to all stable version but it only fixes allocation errors during configuration parsing. Thus, it is not mandatory.	2021-04-12 21:35:12 +02:00
Christopher Faulet	0c6d1dcf7d	BUG/MINOR: listener: Handle allocation error when allocating a new bind_conf Allocation error are now handled in bind_conf_alloc() functions. Thus callers, when not already done, are also updated to catch NULL return value. This patch may be backported (at least partially) to all stable versions. However, it only fix errors durung configuration parsing. Thus it is not mandatory.	2021-04-12 21:33:43 +02:00
Christopher Faulet	2e848a9b75	BUG/MINOR: cfgparse/proxy: Fix some leaks during proxy section parsing Allocated variables are now released when an error occurred during use_backend, use-server, force/ignore-parsing, stick-table, stick and stats directives parsing. For some of these directives, allocation errors have been added. This patch may be backported to all stable version but it only fixes leaks or allocation errors during configuration parsing. Thus, it is not mandatory. It should fix issue #1119.	2021-04-12 21:33:39 +02:00
Christopher Faulet	3a9a12bb2a	BUG/MINOR: hlua: Fix memory leaks on error path when registering a cli keyword When an error occurred in hlua_register_cli(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:05:05 +02:00
Christopher Faulet	5c028d7f9d	BUG/MINOR: hlua: Fix memory leaks on error path when registering a service When an error occurred in hlua_register_service(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	4fc9da01d2	BUG/MINOR: hlua: Fix memory leaks on error path when registering an action When an error occurred in hlua_register_action(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	528526f2cc	BUG/MINOR: hlua: Fix memory leaks on error path when parsing a lua action hen an error occurred in action_register_lua(), the allocated hlua rule and arguments must be released to avoid memory leaks. This patch may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	2567f18382	BUG/MINOR: hlua: Fix memory leaks on error path when registering a fetch When an error occurred in hlua_register_fetches(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions. It should fix #1112.	2021-04-12 19:04:42 +02:00
Christopher Faulet	aa22430bba	BUG/MINOR: hlua: Fix memory leaks on error path when registering a converter When an error occurred in hlua_register_converters(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	5294ec0708	BUG/MINOR: hlua: Fix memory leaks on error path when registering a task When an error occurred in hlua_register_task(), the allocated lua context and task must be released to avoid memory leaks. This patch may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	dda44442d5	MINOR: hlua: Add function to release a lua function release_hlua_function() must be used to release a lua function. Some fixes depends on this function.	2021-04-12 15:46:53 +02:00
Christopher Faulet	147b8c919c	MINOIR: checks/trace: Register a new trace source with its events Add the trace support for the checks. Only tcp-check based health-checks are supported, including the agent-check. In traces, the first argument is always a check object. So it is easy to get all info related to the check. The tcp-check ruleset, the conn-stream and the connection, the server state...	2021-04-12 12:09:36 +02:00
Christopher Faulet	6d80b63e3c	MINOR: trace: Add the checks as a possible trace source To be able to add the trace support for the checks, a new kind of source must be added for this purpose.	2021-04-12 12:09:36 +02:00
Willy Tarreau	44982715ba	MEDIUM: time: make the clock offset global and no per-thread Since 1.8 for simplicity the time offset used to compensate for time drift and jumps had been stored per thread. But with a global time, the complexit has significantly increased. What this patch does in order to address this is to get back to the origins of the pre-thread time drift correction, and keep a single offset between the system's date and the current global date. The thread first verifies from the before_poll date if the time jumped backwards or forward, then either fixes it by computing the new most likely date, or applies the current offset to this latest system date. In the first case, if the date is out of range, the old one is reused with the max_wait offset or not depending on the interrupted flag. Then it compares its date to the global date and updates both so that both remain monotonic and that the local date always reflects the latest known global date. In order to support atomic updates to the offset, it's saved as a ullong which contains both the tv_sec and tv_usec parts in its high and low words. Note that a part of the patch comes from the inlining of the equivalent of tv_add applied to the offset to make sure that signed ints are permitted (otherwise it depends on how timeval is defined). This is significantly more reliable than the previous model as the global time should move in a much smoother way, and not according to what thread last updated it, and the thread-local time should always be very close to the global one. Note that (at least for debugging) a cheap way to measure processing lag would consist in measuring the difference between global_now_ms and now_ms, as long as other threads keep it up-to-date.	2021-04-11 23:59:37 +02:00
Willy Tarreau	7e4a557f64	MINOR: time: change the global timeval and the the global tick at once Instead of using two CAS loops, better compute the two units simultaneously and update them at once. There is no guarantee that the update will be synchronous, but we don't care, what matters is that both are monotonically updated and that global_now_ms always follows the last known value of global_now.	2021-04-11 23:47:54 +02:00
Willy Tarreau	70cb3026a8	MINOR: time: remove useless variable copies in tv_update_date() In the global_now loop, we used to set tmp_adj from adjusted, then set update it from tmp_now, then set adjusted back to tmp_adj, and finally set now from adjusted. This is a long and unneeded set of moves resulting from years of code changes. Let's just set now directly in the loop, stop using adjusted and remove tmp_adj.	2021-04-11 23:47:01 +02:00
Willy Tarreau	c4c80fb4ea	MINOR: time: move the time initialization out of tv_update_date() The time initialization was made a bit complex because we rely on a dummy negative argument to reset all fields, leaving no distinction between process-level initialization and thread-level initialization. This patch changes this by introducing two functions, one for the process and the second one for the threads. This removes ambigous test and makes sure that the relevant fields are always initialized exactly once. This also offers a better solution to the bug fixed in commit `b48e7c001` ("BUG/MEDIUM: time: make sure to always initialize the global tick") as there is no more special values for global_now_ms. It's simple enough to be backported if any other time-related issues are encountered in stable versions in the future.	2021-04-11 23:45:48 +02:00
Willy Tarreau	61c72c366e	CLEANUP: time: remove the now unused ms_left_scaled It was only used by freq_ctr and is not used anymore. In addition the local curr_sec_ms was removed, as well as the equivalent extern definitions which did not exist anymore either.	2021-04-11 14:01:53 +02:00
Willy Tarreau	fc6323ad82	MEDIUM: freq_ctr: replace the per-second counters with the generic ones It remains cumbersome to preserve two versions of the freq counters and two different internal clocks just for this. In addition, the savings from using two different mechanisms are not that important as the only saving is a divide that is replaced by a multiply, but now thanks to the freq_ctr_total() unificaiton the code could also be simplified to optimize it in case of constants. This patch turns all non-period freq_ctr functions to static inlines which call the period-based ones with a period of 1 second. A direct benefit is that a single internal clock is now needed for any counter and that they now all rely on ticks. These 1-second counters are essentially used to report request rates and to enforce a connection rate limitation in listeners. It was verified that these continue to work like before.	2021-04-11 11:12:55 +02:00
Willy Tarreau	fa1258f02c	MINOR: freq_ctr: unify freq_ctr and freq_ctr_period into freq_ctr Both structures are identical except the name of the field starting the period and its description. Let's call them all freq_ctr and the period's start "curr_tick" which is generic. This is only a temporary change and fields are expected to remain the same with no code change (verified).	2021-04-11 11:11:27 +02:00
Willy Tarreau	607be24a85	MEDIUM: freq_ctr: reimplement freq_ctr_remain_period() from freq_ctr_total() Now the function becomes an inline one and only contains a divide and a max. The divide will automatically go away with constant periods.	2021-04-11 11:11:03 +02:00
Willy Tarreau	a7a31b2602	MEDIUM: freq_ctr: make read_freq_ctr_period() use freq_ctr_total() This one is the easiest to implement, it just requires a call and a divide of the result. Anti-flapping correction for low-rates was preserved. Now calls using a constant period will be able to use a reciprocal multiply for the period instead of a divide.	2021-04-11 11:11:03 +02:00
Willy Tarreau	f3a9f8dc5a	MINOR: freq_ctr: add a generic function to report the total value Most of the functions designed to read a counter over a period go through the same complex loop and only differ in the way they use the returned values, so it was worth implementing all this into freq_ctr_total() which returns the total number of events over a period so that the caller can finish its operation using a divide or a remaining time calculation. As a special case, read_freq_ctr_period() doesn't take pending events but requires to enable an anti-flapping correction at very low frequencies. Thus the function implements it when pend<0. Thanks to this function it will be possible to reimplement the other ones as inline and merge the per-second ones with the arbitrary period ones without always adding the cost of a 64 bit divide.	2021-04-11 11:10:57 +02:00
Willy Tarreau	6eb3d37bf4	MINOR: trace: make trace sources read_mostly The trace sources are checked at plenty of places in the code and their contents only change when trace status changes, let's mark them read_mostly.	2021-04-10 19:29:26 +02:00
Willy Tarreau	295a89c029	MINOR: pattern: make the pat_lru_seed read_mostly This seed is created once at boot and is used in every LRU hash when caching results. Let's mark it read_mostly.	2021-04-10 19:27:41 +02:00
Willy Tarreau	ad6722ea3a	MINOR: protocol: move __protocol_by_family to read_mostly This one is used for each outgoing connection and never changes after boot, move it to read_mostly.	2021-04-10 19:27:41 +02:00
Willy Tarreau	14015b8880	MINOR: server: move idle_conn_task to read_mostly This pointer is used when adding connections to the idle list and is never changed, let's move it to the read_mostly section.	2021-04-10 19:27:41 +02:00
Willy Tarreau	56c3b8b4e8	MINOR: threads: mark all_threads_mask as read_mostly This variable almost never changes and is read a lot in time-critical sections. threads_want_rdv_mask is read very often as well in thread_harmless_end() and is almost never changed (only when someone uses thread_isolate()). Let's move both to read_mostly.	2021-04-10 19:27:41 +02:00
Willy Tarreau	ff88270ef9	MINOR: pool: move pool declarations to read_mostly All pool heads are accessed via a pointer and should not be shared with highly written variables. Move them to the read_mostly section.	2021-04-10 19:27:41 +02:00
Willy Tarreau	8209c9aa18	MINOR: kqueue: move kqueue_fd to read_mostly This one only contains the list of per-thread kqueue FDs, and is used a lot during updates. Let's mark it read_mostly to avoid false sharing of FDs placed at the extremities.	2021-04-10 19:27:41 +02:00
Willy Tarreau	26d212c744	MINOR: epoll: move epoll_fd to read_mostly This one only contains the list of per-thread epoll FDs, and is used a lot during updates. Let's mark it read_mostly to avoid false sharing of FDs placed at the extremities.	2021-04-10 19:27:41 +02:00
Willy Tarreau	a1090a5b61	MINOR: fd: move a few read-mostly variables to their own section Some pointer to arrays such as fdtab, fdinfo, polled_mask etc are never written to at run time but are used a lot. fdtab accesses appear a lot in perf top because ha_used_fds is in the same cache line and is modified all the time. This patch moves all these read-mostly variables to the read_mostly section when defined. This way their cache lines will be able to remain in shared state in all CPU caches.	2021-04-10 19:27:41 +02:00
Willy Tarreau	f459640ef6	MINOR: global: declare a read_mostly section Some variables are mostly read (mostly pointers) but they tend to be merged with other ones in the same cache line, slowing their access down in multi-thread setups. This patch declares an empty, aligned variable in a section called "read_mostly". This will force a cache-line alignment on this section so that any variable declared in it will be certain to avoid false sharing with other ones. The section will be eliminated at link time if not used. A __read_mostly attribute was added to compiler.h to ease use of this section.	2021-04-10 19:27:41 +02:00
Willy Tarreau	9057a0026e	CLEANUP: pattern: make all pattern tables read-only Interestingly, all arrays used to declare patterns were read-write while only hard-coded. Let's mark them const so that they move from data to rodata and don't risk to experience false sharing.	2021-04-10 17:49:41 +02:00
Christopher Faulet	e2c65ba344	BUG/MINOR: mux-pt: Fix a possible UAF because of traces in mux_pt_io_cb In mux_pt_io_cb(), if a connection error or a shutdown is detected, the mux is destroyed. Thus we must be careful to not use it in a trace message once destroyed. No backport needed. This patch should fix the issue #1220.	2021-04-10 09:02:36 +02:00
Christopher Faulet	c0ae097b95	MINOIR: mux-pt/trace: Register a new trace source with its events As for the other muxes, traces are now supported in the pt mux. All parts of the multiplexer is covered by these traces. Events are splitted by categories (connection, stream, rx and tx). In traces, the first argument is always a connection. So it is easy to get the mux context (conn->ctx). The second argument is always a conn-stream and mau be NUUL. The third one is a buffer and it may also be NULL. Depending on the context it is the request or the response. In all cases it is owned by a channel. Finally, the fourth argument is an integer value. Its meaning depends on the calling context.	2021-04-09 17:46:58 +02:00
Tim Duesterhus	403fd722ac	CLEANUP: Remove useless malloc() casts This is not C++.	2021-04-08 20:11:58 +02:00
Tim Duesterhus	b8ee894b66	CLEANUP: htx: Make http_get_stline take a `const struct` Nothing is being modified there, so this can be `const`.	2021-04-08 19:40:59 +02:00
Emeric Brun	c8f3e45c6a	MEDIUM: resolvers: add support of tcp address on nameserver line. This patch re-works configuration parsing, it removes the "server" lines from "resolvers" sections introduced in commit `56fc5d9eb`: MEDIUM: resolvers: add supports of TCP nameservers in resolvers. It also extends the nameserver lines to support stream server addresses such as: resolvers nameserver localhost tcp@127.0.0.1:53 Doing so, a part of nameserver's init code was factorized in function 'parse_resolvers' and removed from 'post_parse_resolvers'.	2021-04-08 14:20:40 +02:00
Willy Tarreau	4781b1521a	CLEANUP: atomic/tree-wide: replace single increments/decrements with inc/dec This patch replaces roughly all occurrences of an HA_ATOMIC_ADD(&foo, 1) or HA_ATOMIC_SUB(&foo, 1) with the equivalent HA_ATOMIC_INC(&foo) and HA_ATOMIC_DEC(&foo) respectively. These are 507 changes over 45 files.	2021-04-07 18:18:37 +02:00
Willy Tarreau	185157201c	CLEANUP: atomic: add a fetch-and-xxx variant for common operations The fetch_and_xxx variant is often missing for add/sub/and/or. In fact it was only provided for ADD under the name XADD which corresponds to the x86 instruction name. But for destructive operations like AND and OR it's missing even more as it's not possible to know the value before modifying it. This patch explicitly adds HA_ATOMIC_FETCH_{OR,AND,ADD,SUB} which cover these standard operations, and renames XADD to FETCH_ADD (there were only 6 call places). In the future, backport of fixes involving such operations could simply remap FETCH_ADD(x) to XADD(x), FETCH_SUB(x) to XADD(-x), and for the OR/AND if needed, these could possibly be done using BTS/BTR. It's worth noting that xchg could have been renamed to fetch_and_store() but xchg already has well understood semantics and it wasn't needed to go further.	2021-04-07 18:18:37 +02:00
Willy Tarreau	1db427399c	CLEANUP: atomic: add an explicit _FETCH variant for add/sub/and/or Currently our atomic ops return a value but it's never known whether the fetch is done before or after the operation, which causes some confusion each time the value is desired. Let's create an explicit variant of these operations suffixed with _FETCH to explicitly mention that the fetch occurs after the operation, and make use of it at the few call places.	2021-04-07 18:18:37 +02:00
Willy Tarreau	184b21259b	MINOR: cli/show-fd: slightly reorganize the FD status flags Slightly reorder the status flags to better match their order in the "state" field, and also decode the "shut" state which is particularly useful and already part of this field.	2021-04-07 18:18:37 +02:00
Willy Tarreau	1673c4a883	MINOR: fd: implement an exclusive syscall bit to remove the ugly "log" lock There is a function called fd_write_frag_line() that's essentially used by loggers and that is used to write an atomic message line over a file descriptor using writev(). However a lock is required around the writev() call to prevent messages from multiple threads from being interleaved. Till now a SPIN_TRYLOCK was used on a dedicated lock that was common to all FDs. This is quite not pretty as if there are multiple output pipes to collect logs, there will be quite some contention. Now that there are empty flags left in the FD state and that we can finally use atomic ops on them, let's add a flag to indicate the FD is locked for exclusive access by a syscall. At least the locking will now be on an FD basis and not the whole process, so we can remove the log_lock.	2021-04-07 18:18:37 +02:00
Willy Tarreau	9063a660cc	MINOR: fd: move .exported into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state.	2021-04-07 18:10:36 +02:00
Willy Tarreau	5362bc9044	MINOR: fd: move .et_possible into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state.	2021-04-07 18:09:43 +02:00
Willy Tarreau	0cc612818d	MINOR: fd: move .initialized into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state. The bit was not cleared in fd_insert() because the only user is the function used to create and atomically send a log message to a pipe FD, which never registers the fd. Here we clear it nevertheless for the sake of clarity. Note that with an extra cleaning pass we could have a bit number here and simply use a BTS to test and set it.	2021-04-07 18:09:08 +02:00
Willy Tarreau	030dae13a0	MINOR: fd: move .cloned into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state.	2021-04-07 18:08:29 +02:00
Willy Tarreau	b41a6e9101	MINOR: fd: move .linger_risk into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state. The CLI's output state was extended to 6 digits and the linger/cloned flags moved inside the parenthesis.	2021-04-07 18:07:49 +02:00
Willy Tarreau	f509065191	MEDIUM: fd: merge fdtab[].ev and state for FD_EV_* and FD_POLL_* into state For a long time we've had fdtab[].ev and fdtab[].state which contain two arbitrary sets of information, one is mostly the configuration plus some shutdown reports and the other one is the latest polling status report which also contains some sticky error and shutdown reports. These ones used to be stored into distinct chars, complicating certain operations and not even allowing to clearly see concurrent accesses (e.g. fd_delete_orphan() would set the state to zero while fd_insert() would only set the event to zero). This patch creates a single uint with the two sets in it, still delimited at the byte level for better readability. The original FD_EV_* values remained at the lowest bit levels as they are also known by their bit value. The next step will consist in merging the remaining bits into it. The whole bits are now cleared both in fd_insert() and _fd_delete_orphan() because after a complete check, it is certain that in both cases these functions are the only ones touching these areas. Indeed, for _fd_delete_orphan(), the thread_mask has already been zeroed before a poller can call fd_update_event() which would touch the state, so it is certain that _fd_delete_orphan() is alone. Regarding fd_insert(), only one thread will get an FD at any moment, and it as this FD has already been released by _fd_delete_orphan() by definition it is certain that previous users have definitely stopped touching it. Strictly speaking there's no need for clearing the state again in fd_insert() but it's cheap and will remove some doubts during some troubleshooting sessions.	2021-04-07 18:04:39 +02:00
Willy Tarreau	8d27c203ed	MEDIUM: fd: prepare FD_POLL_* to move to bits 8-15 In preparation of merging FD_POLL* and FD_EV, this only changes the value of FD_POLL_ to use bits 8-15 (the second byte). The size of the field has been temporarily extended to 32 bits already, as well as the temporary variables that carry the new composite value inside fd_update_events(). The resulting fdtab entry becomes temporarily unaligned. All places making access to .ev or FD_POLL_* were carefully inspected to make sure they were safe regarding this change. Only one temporary update was needed for the "show fd" code. The code was only slightly inflated at this step.	2021-04-07 15:08:40 +02:00
Emeric Brun	26754901e9	BUG/MEDIUM: log: fix config parse error logging on stdout/stderr or any raw fd The regression was introduced by commit previous commit `94aab06`: MEDIUM: log: support tcp or stream addresses on log lines. This previous patch tries to retrieve the used protocol parsing the address using the str2sa_range function but forgets that the raw file descriptor adresses don't specify a protocol and str2sa_range probes an error. This patch re-work the str2sa_range function to stop probing error if an authorized RAW_FD address is parsed whereas the caller request also a protocol. It also modify the code of parse_logsrv to switch on stream logservers only if a protocol was detected.	2021-04-07 15:01:00 +02:00

... 3 4 5 6 7 ...

11657 Commits