haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-12 01:56:58 +02:00

Author	SHA1	Message	Date
Willy Tarreau	54496a6a5b	MINOR: vars: make the vars() sample fetch function support a default value It is quite common to see in configurations constructions like the following one: http-request set-var(txn.bodylen) 0 http-request set-var(txn.bodylen) req.hdr(content-length) ... http-request set-header orig-len %[var(txn.bodylen)] The set-var() rules are almost always duplicated when manipulating integers or any other value that is mandatory along operations. This is a problem because it makes the configurations complicated to maintain and slower than needed. And it becomes even more complicated when several conditions may set the same variable because the risk of forgetting to initialize it or to accidentally reset it is high. This patch extends the var() sample fetch function to take an optional argument which contains a default value to be returned if the variable was not set. This way it becomes much simpler to use the variable, just set it where needed, and read it with a fall back to the default value: http-request set-var(txn.bodylen) req.hdr(content-length) ... http-request set-header orig-len %[var(txn.bodylen,0)] The default value is always passed as a string, thus it will experience a cast to the output type. It doesn't seem useful to complicate the configuration to pass an explicit type at this point. The vars.vtc regtest was updated accordingly.	2021-09-03 12:08:54 +02:00
Willy Tarreau	e352b9dac7	MINOR: vars: make vars_get_by_* support an optional default value In preparation for supporting default values when fetching variables, we need to update the internal API to pass an extra argument to functions vars_get_by_{name,desc} to provide an optional default value. This patch does this and always passes NULL in this argument. var_to_smp() was extended to fall back to this value when available.	2021-09-03 12:08:54 +02:00
Willy Tarreau	be7e00d134	CLEANUP: vars: factor out common code from vars_get_by_{desc,name} The two functions vars_get_by_name() and vars_get_by_desc() perform almost the same operations except that they differ in the way the name and scope are retrieved. The second part in common is more complex and involves locking, so better factor this one out into a new function. There is no other change than refactoring.	2021-09-03 11:43:35 +02:00
Willy Tarreau	e93bff4107	MEDIUM: vars: also support format strings in CLI's "set var" command Most often "set var" on the CLI is used to set a string, and using only expressions is not always convenient, particularly when trying to concatenate variables such as host names and paths. Now the "set var" command supports an optional keyword before the value to indicate its type. "expr" takes an expression just like before this patch, and "fmt" a format string, making it work like the "set-var-fmt" actions. The VTC was updated to include a test on the format string.	2021-09-03 11:01:48 +02:00
Willy Tarreau	753d4db5f3	MINOR: vars: add a "set-var-fmt" directive to the global section Just like the set-var-fmt action for tcp/http rules, the set-var-fmt directive in global sections allows to pre-set process-wide variables using a format string instead of a sample expression. This is often more convenient when it is required to concatenate multiple fields, or when emitting just one word.	2021-09-03 11:01:48 +02:00
Willy Tarreau	9c20433aca	CLEANUP: vars: name the temporary proxy "CFG" instead of "CLI" for global vars We're using a dummy temporary proxy when creating global variables in the configuration file, it was copied from the CLI's code and was mistakenly called "CLI", better name it "CFG". It should not appear anywhere except maybe when debugging cores.	2021-09-03 11:01:48 +02:00
Willy Tarreau	c767eebf1f	BUG/MINOR: vars: do not talk about global section in CLI errors for set-var When attempting to set a variable that does not start with the "proc" scope on the CLI, we used to emit "only proc is permitted in the global section" which obviously is a leftover from the initial code. This may be backported to 2.4.	2021-09-03 11:01:12 +02:00
Willy Tarreau	1402fef58a	BUG/MINOR: vars: truncate the variable name in error reports about scope. When a variable starts with the wrong scope, it is named without stripping the extra characters that follow it, which usually are closing parenthesis. Let's make sure we only report what is expected. This may be backported to 2.4.	2021-09-03 11:01:12 +02:00
Willy Tarreau	c77bad2467	BUG/MEDIUM: vars: run over the correct list in release_store_rules() In commit `9a621ae76` ("MEDIUM: vars: add a new "set-var-fmt" action") we introduced the support for format strings in variables with the ability to release them on exit, except that it's the wrong list that was being scanned for the rule (http vs vars), resulting in random crashes during deinit. This was a recent commit in 2.5-dev, no backport is needed.	2021-09-03 11:01:12 +02:00
Willy Tarreau	9a621ae76d	MEDIUM: vars: add a new "set-var-fmt" action The set-var() action is convenient because it preserves the input type but it's a pain to deal with when trying to concatenate values. The most recurring example is when it's needed to build a variable composed of the source address and the source port. Usually it ends up like this: tcp-request session set-var(sess.port) src_port tcp-request session set-var(sess.addr) src,concat(":",sess.port) This is even worse when trying to aggregate multiple fields from stick-table data for example. Due to this a lot of users instead abuse headers from HTTP rules: http-request set-header(x-addr) %[src]:%[src_port] But this requires some careful cleanups to make sure they won't leak, and it's significantly more expensive to deal with. And generally speaking it's not clean. Plus it must be performed for each and every request, which is expensive for this common case of ip+port that doesn't change for the whole session. This patch addresses this limitation by implementing a new "set-var-fmt" action which performs the same work as "set-var" but takes a format string in argument instead of an expression. This way it becomes pretty simple to just write: tcp-request session set-var-fmt(sess.addr) %[src]:%[src_port] It is usable in all rulesets that already support the "set-var" action. It is not yet implemented for the global "set-var" directive (which already takes a string) and the CLI's "set var" command, which would definitely benefit from it but currently uses its own parser and engine, thus it must be reworked. The doc and regtests were updated.	2021-09-02 21:22:22 +02:00
Willy Tarreau	54b96d9955	BUG/MINOR: vars: properly set the argument parsing context in the expression When the expression called in "set-var" uses arguments that require late resolution, the context must be set. At the moment, any unknown argument is misleadingly reported as "ACL": frontend f bind :8080 mode http http-request set-var(proc.a) be_conn(foo) parsing [b1.cfg:4]: unable to find backend 'foo' referenced in arg 1 \ of ACL keyword 'be_conn' in proxy 'f'. Once the context is properly set, it now says the truth: parsing [b1.cfg:8]: unable to find backend 'foo' referenced in arg 1 \ of sample fetch keyword 'be_conn' in http-request expression in proxy 'f'. This may be backported but is not really important. If so, the preceding patches "BUG/MINOR: vars: improve accuracy of the rules used to check expression validity" and "MINOR: sample: add missing ARGC_ entries" must be backported as well.	2021-09-02 20:34:30 +02:00
Willy Tarreau	843096d72a	BUG/MINOR: vars: improve accuracy of the rules used to check expression validity The set-var() expression naturally checks whether expressions are valid in the context of the rule, but it fails to differentiate frontends from backends. As such for tcp-content and http-request rules, it will only accept frontend-compatible sample-fetches, excluding those declared with SMP_USE_BKEND (a few such as be_id, be_name). For the response it accepts the backend-compatible expressions only, though it seems that there is no sample-fetch function that is valid only in the frontend's content, so that should not cause any problem. Note that while allowing valid configs to be used, the fix might also uncover some incorrect configurations where some expressions currently return nothing (e.g. something depending on frontend declared in a backend), and which could be rejected, but there does not seem to be any such keyword. Thus while it should be backported, better not backport it too far (2.4 and possibly 2.3 only).	2021-09-02 19:23:43 +02:00
Willy Tarreau	2819210a83	BUG/MINOR: vars: fix set-var/unset-var exclusivity in the keyword parser The parser checks first for "set-var" then "unset-var" from the updated offset instead of testing it only when the other one fails, so it validates this rule as "unset-var": http-request set-varunset-var(proc.a) This should be backported everywhere relevant, though it's mostly harmless as it's unlikely that some users are purposely writing this in their conf!	2021-09-02 18:46:22 +02:00
Christopher Faulet	c4439f71b0	BUG/MINOR: vars: Be sure to have a session to get checks variables It is now possible to get any variables from the cli. Concretely, only variables in the PROC scope can be retrieved because there is neither stream nor session defined. But, nothing forbids anyone to try to get a variable in any scope. No value will be found, but it is allowed. Thus, we must be sure to not rely on an undefined session or stream in that case. Especially, the session must be tested before retrieving variables in CHECK scope. This patch should fix the issue #1249. It must be backported to 2.4.	2021-06-02 11:55:14 +02:00
Willy Tarreau	89f6dedf48	BUG/MINOR: lua/vars: prevent get_var() from allocating a new name Variable names are stored into a unified list that helps compare them just based on a pointer instead of duplicating their name with every variable. This is convenient for those declared in the configuration but this started to cause issues with Lua when random names would be created upon each access, eating lots of memory and CPU for lookups, hence the work in 2.2 with commit `4e172c93f` ("MEDIUM: lua: Add `ifexist` parameter to `set_var`") to address this. But there remains a corner case with get_var(), which also allocates a new variable. After a bit of thinking and discussion, it never makes sense to allocate a new variable name on get_var(): - if the name exists, it will be returned ; - if it does not exist, then the only way for it to appear will be that some code calls set_var() on it - a call to get_var() after a careful set_var(ifexist) ruins the effort on set_var(). For this reason, this patch addresses this issue by making sure that get_var() will never cause a variable to be allocated. This is done by modifying vars_get_by_name() to always call register_name() with alloc=0, since vars_get_by_name() is exclusively used by Lua and the new CLI's "get/set var" which also benefit from this protection. It probably makes sense to backport this as far as 2.2 after some observation period and feedback from users. For more context and discussions about the issues this was causing, see https://www.mail-archive.com/haproxy@formilux.org/msg40451.html and in issue #664.	2021-05-13 13:44:32 +02:00
Willy Tarreau	67046bfc50	BUILD: vars: include tools.h in vars.c A number of functions from tools.h are used there but the file was not included.	2021-05-08 13:56:31 +02:00
Willy Tarreau	cfc4f24d80	REORG: vars: move the "proc" scope variables out of the global struct The presence of this field causes a long dependency chain because almost everyone includes global-t.h, and vars include sample_data which include some system includes as well as HTTP parts. There is absolutely no reason for having the process-wide variables in the global struct, let's just move them into vars.c and vars.h. This reduces from ~190k to ~170k the preprocessed output of version.c.	2021-05-08 12:11:29 +02:00
Amaury Denoyelle	e4a617c931	MINOR: action: replace match_pfx by a keyword flags field Define a new keyword flag KWF_MATCH_PREFIX. This is used to replace the match_pfx field of action struct. This has the benefit to have more explicit action declaration, and now it is possible to quickly implement experimental actions.	2021-05-07 14:35:01 +02:00
Willy Tarreau	b205bfdab7	CLEANUP: cli/tree-wide: properly re-align the CLI commands' help messages There were 102 CLI commands whose help were zig-zagging all along the dump making them unreadable. This patch realigns all these messages so that the command now uses up to 40 characters before the delimiting colon. About a third of the commands did not correctly list their arguments which were added after the first version, so they were all updated. Some abuses of the term "id" were fixed to use a more explanatory term. The "set ssl ocsp-response" command was not listed because it lacked a help message, this was fixed as well. The deprecated enable/disable commands for agent/health/server were prominently written as deprecated. Whenever possible, clearer explanations were provided.	2021-05-07 11:51:26 +02:00
Willy Tarreau	2b71810cb3	CLEANUP: lists/tree-wide: rename some list operations to avoid some confusion The current "ADD" vs "ADDQ" is confusing because when thinking in terms of appending at the end of a list, "ADD" naturally comes to mind, but here it does the opposite, it inserts. Several times already it's been incorrectly used where ADDQ was expected, the latest of which was a fortunate accident explained in `6fa922562` ("CLEANUP: stream: explain why we queue the stream at the head of the server list"). Let's use more explicit (but slightly longer) names now: LIST_ADD -> LIST_INSERT LIST_ADDQ -> LIST_APPEND LIST_ADDED -> LIST_INLIST LIST_DEL -> LIST_DELETE The same is true for MT_LISTs, including their "TRY" variant. LIST_DEL_INIT keeps its short name to encourage to use it instead of the lazier LIST_DELETE which is often less safe. The change is large (~674 non-comment entries) but is mechanical enough to remain safe. No permutation was performed, so any out-of-tree code can easily map older names to new ones. The list doc was updated.	2021-04-21 09:20:17 +02:00
Willy Tarreau	374edc70ba	CLEANUP: vars: always pre-initialize smp in vars_parse_cli_get_var() In issue #1200 Coverity believes we may use an uninitialized field smp.sess here while it's not possible because the returned variable necessarily matches SCOPE_PROC hence smp.sess is not used. But it cannot see this and it could be confusing if the code later evolved into something more complex. That's not a critical path so let's first reset the sample.	2021-04-01 17:04:17 +02:00
Willy Tarreau	b8bd1ee893	MEDIUM: cli: add a new experimental "set var" command set var <name> <expression> Allows to set or overwrite the process-wide variable 'name' with the result of expression <expression>. Only process-wide variables may be used, so the name must begin with 'proc.' otherwise no variable will be set. The <expression> may only involve "internal" sample fetch keywords and converters even though the most likely useful ones will be str('something') or int(). Note that the command line parser doesn't know about quotes, so any space in the expression must be preceded by a backslash. This command requires levels "operator" or "admin". This command is only supported on a CLI connection running in experimental mode (see "experimental-mode on"). Just like for "set-var" in the global section, the command uses a temporary dummy proxy to create a temporary "set-var(name)" rule to assign the value. The reg test was updated to verify that an updated global variable is properly reflected in subsequent HTTP responses.	2021-03-26 16:57:43 +01:00
Willy Tarreau	c35eb38f1d	MINOR: vars/cli: add a "get var" CLI command to retrieve global variables Process-wide variables can now be displayed from the CLI using "get var" followed by the variable name. They must all start with "proc." otherwise they will not be found. The output is very similar to the one of the debug converter, with a type and value being reported for the embedded sample. This command is limited to clients with the level "operator" or higher, since it can possibly expose traffic-related data.	2021-03-26 16:52:13 +01:00
Willy Tarreau	2f836de100	MINOR: action: add a new ACT_F_CLI_PARSER origin designation In order to process samples from the command line interface we'll need rules as well, and these rules will have to be marked as coming from the CLI parser. This new origin is used for this.	2021-03-26 16:34:53 +01:00
Willy Tarreau	13d2ba2a82	MEDIUM: vars: add support for a "set-var" global directive While we do support process-wide variables ("proc.<name>"), there was no way to preset them from the configuration. This was particularly limiting their usefulness since configs involving them always had to first check if the variable was set prior to performing an operation. This patch adds a new "set-var" directive in the global section that supports setting the proc.<name> variables from an expression, like other set-var actions do. The syntax however follows what is already being done for setenv, which consists in having one argument for the variable name and another one for the expression. Only "constant" expressions are allowed here, such as "int", "str" etc, combined with arithmetic or string converters, and variable lookups. A few extra sample fetch keywords like "date", "rand" and "uuid" are also part of the constant expressions and may make sense to allow to create a random key or differentiate processes. The way it was done consists in parsing a dummy rule and executing the expression in the CFG_PARSE context, then releasing the expression. This is safe because the sample that variables store does not hold a back pointer to the expression that created them.	2021-03-26 16:34:53 +01:00
Willy Tarreau	01d580ae86	MINOR: action: add a new ACT_F_CFG_PARSER origin designation In order to process samples from the config file we'll need rules as well, and these rules will have to be marked as coming from the config parser. This new origin is used for this.	2021-03-26 16:23:45 +01:00
Willy Tarreau	0209c97038	MINOR: sample: mark the truly constant sample fetch keywords as such A number of keywords are really constant and safe to use at config time. This is the case for str(), int() etc but also env(), hostname(), nbproc() etc. By extension a few other ones which can be useful to preset values in a configuration were enabled as well, like data(), rand() or uuid(). At the moment this doesn't change anything as they are still only usable from runtime rules. The "var()" keyword was also marked as const as it can definitely return stable stuff at boot time.	2021-03-26 16:23:45 +01:00
Willy Tarreau	f26db14dfb	MINOR: vars: make the var() sample fetch keyword depend on nothing This sample fetch doesn't require any L4 client session in practice, as get_var() now checks for the session. This is important to remove this dependency in order to support accessing variables in scope "proc" from anywhere.	2021-03-26 16:23:45 +01:00
Willy Tarreau	a07d61be4c	MINOR: vars: make get_vars() allow the session to be null In order to support manipulating variables from outside a session, let's make get_vars() not assume that the session is always set.	2021-03-26 16:23:45 +01:00
Willy Tarreau	3d1d178933	CLEANUP: vars: make the error message clearer on missing arguments for set-var The error message after "http-response set-var" isn't very clear: [ALERT] 070/115043 (30526) : parsing [/dev/stdin:2] : error detected in proxy 'f' while parsing 'http-response set-var' rule : invalid variable 'set-var'. Expects 'set-var(<var-name>)' or 'unset-var(<var-name>)'. Let's change it to this instead: [ALERT] 070/115608 (30799) : parsing [/dev/stdin:2] : error detected in proxy 'f' while parsing 'http-response set-var' rule : invalid or incomplete action 'set-var'. Expects 'set-var(<var-name>)' or 'unset-var(<var-name>)'. With a wrong action name, it also works better (it's handled as a prefix due to the opening parenthesis): [ALERT] 070/115608 (30799) : parsing [/dev/stdin:2] : error detected in proxy 'f' while parsing 'http-response set-varxxx' rule : invalid or incomplete action 'set-varxxx'. Expects 'set-var(<var-name>)' or 'unset-var(<var-name>)'.	2021-03-12 14:13:20 +01:00
Willy Tarreau	018251667e	CLEANUP: config: make the cfg_keyword parsers take a const for the defproxy The default proxy was passed as a variable to all parsers instead of a const, which is not without risk, especially when some timeout parsers used to make some int pointers point to the default values for comparisons. We want to be certain that none of these parsers will modify the defaults sections by accident, so it's important to mark this proxy as const. This patch touches all occurrences found (89).	2021-03-09 10:09:43 +01:00
Willy Tarreau	5b52b00393	CLEANUP: vars: always zero the pointers after a free() In sample_store(), depending on the new sample types, the area pointer was not always zeroed after being freed. Let's make sure it's always the case to avoid the risk of dangling pointers being misused.	2021-02-26 21:21:21 +01:00
Dragan Dosen	ec0a604f27	CLEANUP: vars: make smp_fetch_var() to reuse vars_get_by_desc() They both do the same thing, so let's remove unneeded code duplication.	2021-02-23 17:23:53 +01:00
Dragan Dosen	14518f2305	BUG/MEDIUM: vars: make functions vars_get_by_{name,desc} thread-safe This patch adds a lock to functions vars_get_by_name() and vars_get_by_desc() to protect accesses to the list of variables. After the variable is fetched, a sample data is duplicated by using smp_dup() because the variable may be modified by another thread. This should be backported to all versions supporting vars along with "BUG/MINOR: sample: secure convs that accept base64 string and var name as args" which this patch depends on.	2021-02-23 17:22:46 +01:00
Miroslav Zagorac	6deab79d59	MINOR: vars: replace static functions with global ones The OpenTracing filter uses several internal HAProxy functions to work with variables and therefore requires two static local HAProxy functions, var_accounting_diff() and var_clear(), to be declared global. In fact, the var_clear() function was not originally defined as static, but it lacked a declaration.	2020-12-16 14:20:08 +01:00
Tim Duesterhus	bbdd5b8ca9	CLEANUP: Add static void vars_deinit() vars_deinit() frees all var_names during deinit().	2020-07-07 16:52:35 +02:00
Ilya Shipitsin	46a030cdda	CLEANUP: assorted typo fixes in the code and comments This is the 11th iteration of typo fixes	2020-07-06 14:34:32 +02:00
Tim Duesterhus	01a0ce39e2	BUG/MAJOR: vars: Fix bogus free() during deinit() for http-request rules We cannot simply `release_sample_expr(rule->arg.vars.expr)` for a `struct act_rule`, because `rule->arg` is a union that might not contain valid `vars`. This leads to a crash on a configuration using `http-request redirect` and possibly others: frontend http mode http bind 127.0.0.1:80 http-request redirect scheme https Instead a `struct act_rule` has a `release_ptr` that must be used to properly free any additional storage allocated. This patch fixes a regression in commit `ff78fcdd7f`. It must be backported to wherever that patch is backported. It has been verified that the configuration above no longer crashes. It has also been verified that the configuration in `ff78fcdd7f` does not leak.	2020-06-15 18:51:11 +02:00
Willy Tarreau	b2551057af	CLEANUP: include: tree-wide alphabetical sort of include files This patch fixes all the leftovers from the include cleanup campaign. There were not that many (~400 entries in ~150 files) but it was definitely worth doing it as it revealed a few duplicates.	2020-06-11 10:18:59 +02:00
Willy Tarreau	36979d9ad5	REORG: include: move the error reporting functions from log.h to errors.h Most of the files dealing with error reports have to include log.h in order to access ha_alert(), ha_warning() etc. But while these functions don't depend on anything, log.h depends on a lot of stuff because it deals with log-formats and samples. As a result it's impossible not to embark long dependencies when using ha_warning() or qfprintf(). This patch moves these low-level functions to errors.h, which already defines the error codes used at the same places. About half of the users of log.h could be adjusted, sometimes revealing other issues such as missing tools.h. Interestingly the total preprocessed size shrunk by 4%.	2020-06-11 10:18:59 +02:00
Willy Tarreau	51cd5956ee	REORG: check: move tcpchecks away from check.c Checks.c remains one of the largest files of the project and it contains too many things. The tcpchecks code represents half of this file, and both parts are relatively isolated, so let's move it away into its own file. We now have tcpcheck.c, tcpcheck{,-t}.h. Doing so required to export quite a number of functions because check.c has almost everything made static, which really doesn't help to split!	2020-06-11 10:18:58 +02:00
Willy Tarreau	6be7849f39	REORG: include: move cfgparse.h to haproxy/cfgparse.h There's no point splitting the file in two since only cfgparse uses the types defined there. A few call places were updated and cleaned up. All of them were in C files which register keywords. There is nothing left in common/ now so this directory must not be used anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	dfd3de8826	REORG: include: move stream.h to haproxy/stream{,-t}.h This one was not easy because it was embarking many includes with it, which other files would automatically find. At least global.h, arg.h and tools.h were identified. 93 total locations were identified, 8 additional includes had to be added. In the rare files where it was possible to finalize the sorting of includes by adjusting only one or two extra lines, it was done. But all files would need to be rechecked and cleaned up now. It was the last set of files in types/ and proto/ and these directories must not be reused anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c2b1ff04e5	REORG: include: move http_ana.h to haproxy/http_ana{,-t}.h It was moved without any change, however many callers didn't need it at all. This was a consequence of the split of proto_http.c into several parts that resulted in many locations to still reference it.	2020-06-11 10:18:58 +02:00
Willy Tarreau	4aa573da6f	REORG: include: move checks.h to haproxy/check{,-t}.h All includes that were not absolutely necessary were removed because checks.h happens to very often be part of dependency loops. A warning was added about this in check-t.h. The fields, enums and structs were a bit tidied because it's particularly tedious to find anything there. It would make sense to split this in two or more files (at least extract tcp-checks). The file was renamed to the singular because it was one of the rare exceptions to have an "s" appended to its name compared to the struct name.	2020-06-11 10:18:58 +02:00
Willy Tarreau	8b550afe1e	REORG: include: move tcp_rules.h to haproxy/tcp_rules.h There's no type file on this one which is pretty simple.	2020-06-11 10:18:58 +02:00
Willy Tarreau	a171892501	REORG: include: move vars.h to haproxy/vars{,-t}.h A few includes (sessions.h, stream.h, api-t.h) were added for arguments that were first declared in function prototypes.	2020-06-11 10:18:58 +02:00
Willy Tarreau	e6ce10be85	REORG: include: move sample.h to haproxy/sample{,-t}.h This one is particularly tricky to move because everyone uses it and it depends on a lot of other types. For example it cannot include arg-t.h and must absolutely only rely on forward declarations to avoid dependency loops between vars -> sample_data -> arg. In order to address this one, it would be nice to split the sample_data part out of sample.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c761f843da	REORG: include: move http_rules.h to haproxy/http_rules.h There was no include file. This one still includes types/proxy.h.	2020-06-11 10:18:57 +02:00
Willy Tarreau	aa74c4e1b3	REORG: include: move arg.h to haproxy/arg{,-t}.h Almost no change was needed; chunk.h was replaced with buf-t.h. It depends on types/vars.h and types/protocol_buffers.h.	2020-06-11 10:18:57 +02:00
Willy Tarreau	cd72d8c981	REORG: include: split common/http.h into haproxy/http{,-t}.h So the enums and structs were placed into http-t.h and the functions into http.h. This revealed that several files were depending on http.h but not including it, as it was silently inherited via other files.	2020-06-11 10:18:57 +02:00
Willy Tarreau	853b297c9b	REORG: include: split mini-clist into haproxy/list and list-t.h Half of the users of this include only need the type definitions and not the manipulation macros nor the inline functions. Moving the various types into mini-clist-t.h makes the files cleaner. The other one had all its includes grouped at the top. A few files continued to reference it without using it and were cleaned. In addition it was about time that we'd rename that file, it's not "mini" anymore and contains a bit more than just circular lists.	2020-06-11 10:18:56 +02:00
Willy Tarreau	4c7e4b7738	REORG: include: update all files to use haproxy/api.h or api-t.h if needed All files that were including one of the following include files have been updated to only include haproxy/api.h or haproxy/api-t.h once instead: - common/config.h - common/compat.h - common/compiler.h - common/defaults.h - common/initcall.h - common/tools.h The choice is simple: if the file only requires type definitions, it includes api-t.h, otherwise it includes the full api.h. In addition, in these files, explicit includes for inttypes.h and limits.h were dropped since these are now covered by api.h and api-t.h. No other change was performed, given that this patch is large and affects 201 files. At least one (tools.h) was already freestanding and didn't get the new one added.	2020-06-11 10:18:42 +02:00
Tim Duesterhus	b4fac1eb3c	MINOR: vars: Make vars_(un\|)set_by_name(_ifexist\|) return a success value Change the return type from `void` to `int` and return whether setting the variable was successful.	2020-05-25 08:12:27 +02:00
Tim Duesterhus	7329327333	CLEANUP: vars: Remove void vars_unset_by_name(const char, size_t, struct sample) With "MINOR: lua: Use vars_unset_by_name_ifexist()" the last user was removed and as outlined in that commit there is no good reason for this function to exist. May be backported together with the commit mentioned above.	2020-05-25 08:12:23 +02:00
Christopher Faulet	0fca7edeb5	MINOR: checks: Use the check as origin when a session is created Before, the server was used as origin during session creation. It was only used to get the check associated to the server when a variable is read or set in the check scope or when a check sample fetch was called. So it seems easier to use the check as origin of a session. It is also more logical because the session is created by the health-check itself and not its server.	2020-04-27 09:39:38 +02:00
Gaetan Rivet	0c39eccdd4	MINOR: checks: Add support to set-var and unset-var rules in tcp-checks Evaluate the registered action_ptr associated with each CHK_ACTION_KW rules from a ruleset. Currently only the 'set-var' and 'unset-var' are parsed by the tcp-check parser. Thus it is now possible to set or unset variables. It is possible to use such rules before the first connect of the ruleset.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	707b52f17e	MEDIUM: checks: Parse custom action rules in tcp-checks Register the custom action rules "set-var" and "unset-var", that will call the parse_store() command upon parsing. These rules are thus built and integrated to the tcp-check ruleset, but have no further effect for the moment.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	13a5043a9e	MINOR: checks/vars: Add a check scope for variables Add a dedicated vars scope for checks. This scope is considered as part of the session scope for accounting purposes. The scope can be addressed by a valid session, even embryonic. The stream is not necessary. The scope is initialized after the check session is created. All variables are then pruned before the session is destroyed.	2020-04-27 09:39:37 +02:00
Willy Tarreau	908071171b	BUILD: general: always pass unsigned chars to is* functions The isalnum(), isalpha(), isdigit() etc functions from ctype.h are supposed to take an int in argument which must either reflect an unsigned char or EOF. In practice on some platforms they're implemented as macros referencing an array, and when passed a char, they either cause a warning "array subscript has type 'char'" when lucky, or cause random segfaults when unlucky. It's quite inconvenient by the way since none of them may return true for negative values. The recent introduction of cygwin to the list of regularly tested build platforms revealed a lot of breakage there due to the same issues again. So this patch addresses the problem all over the code at once. It adds unsigned char casts to every valid use case, and also drops the unneeded double cast to int that was sometimes added on top of it. It may be backported by dropping irrelevant changes if that helps better support uncommon platforms. It's unlikely to fix bugs on platforms which would already not emit any warning though.	2020-02-25 08:16:33 +01:00
Willy Tarreau	e3b57bf92f	MINOR: sample: make sample_parse_expr() able to return an end pointer When an end pointer is passed, instead of complaining that a comma is missing after a keyword, sample_parse_expr() will silently return the pointer to the current location into this return pointer so that the caller can continue its parsing. This will be used by more complex expressions which embed sample expressions, and may even permit to embed sample expressions into arguments of other expressions.	2020-02-14 19:02:06 +01:00
Christopher Faulet	6d0c3dfac6	MEDIUM: http: Add a ruleset evaluated on all responses just before forwarding This patch introduces the 'http-after-response' rules. These rules are evaluated at the end of the response analysis, just before the data forwarding, on ALL HTTP responses, the server ones but also all responses generated by HAProxy. Thanks to this ruleset, it is now possible for instance to add some headers to the responses generated by the stats applet. Following actions are supported : * allow * add-header * del-header * replace-header * replace-value * set-header * set-status * set-var * strict-mode * unset-var	2020-02-06 14:55:34 +01:00
Christopher Faulet	fc9cfe4006	REORG: proto_htx: Move HTX analyzers & co to http_ana.{c,h} files The old module proto_http does not exist anymore. All code dedicated to the HTTP analysis is now grouped in the file proto_htx.c. So, to finish the polishing after removing the legacy HTTP code, proto_htx.{c,h} files have been moved in http_ana.{c,h} files. In addition, all HTX analyzers and related functions prefixed with "htx_" have been renamed to start with "http_" instead.	2019-07-19 09:24:12 +02:00
Willy Tarreau	4b7531f48b	BUG/MEDIUM: vars: make the tcp/http unset-var() action support conditions Patrick Hemmer reported that http-request unset-var(foo) if ... fails to parse. The reason is that it reuses the same parser as "set-var(foo)" which makes a special case of the arguments, supposed to be a sample expression for set-var, but which must not exist for unset-var. Unfortunately the parser finds "if" or "unless" and believes it's an expression. Let's simply drop the test so that the outer rule parser deals with potential extraneous keywords. This should be backported to all versions supporting unset-var().	2019-06-04 16:48:15 +02:00
Willy Tarreau	f37b140b06	BUG/MEDIUM: vars: make sure the scope is always valid when accessing vars Patrick Hemmer reported that a simple tcp rule involving a variable like this is enough to crash haproxy : frontend foo bind :8001 tcp-request session set-var(txn.foo) src The test on the variable scopes is not strict enough, it needs to always verify if the stream is valid when accessing a req/res/txn variable. This patch does this by adding a new get_vars() function which does the job instead of open-coding all the lookups everywhere. It must be backported to all versions supporting set-var and "tcp-request session" so at least 1.9 and 1.8.	2019-06-04 16:27:36 +02:00
Tim Duesterhus	a6cc7e872a	BUG/MINOR: vars: Fix memory leak in vars_check_arg vars_check_arg previously leaked the string containing the variable name: Consider this config: frontend fe1 mode http bind :8080 http-request set-header X %[var(txn.host)] Starting HAProxy and immediately stopping it by sending a SIGINT makes Valgrind report this leak: ==7795== 9 bytes in 1 blocks are definitely lost in loss record 15 of 71 ==7795== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==7795== by 0x4AA2AD: my_strndup (standard.c:2227) ==7795== by 0x51FCC5: make_arg_list (arg.c:146) ==7795== by 0x4CF095: sample_parse_expr (sample.c:897) ==7795== by 0x4BA7D7: add_sample_to_logformat_list (log.c:495) ==7795== by 0x4BBB62: parse_logformat_string (log.c:688) ==7795== by 0x4E70A9: parse_http_req_cond (http_rules.c:239) ==7795== by 0x41CD7B: cfg_parse_listen (cfgparse-listen.c:1466) ==7795== by 0x480383: readcfgfile (cfgparse.c:2089) ==7795== by 0x47A081: init (haproxy.c:1581) ==7795== by 0x4049F2: main (haproxy.c:2591) This leak can be detected even in HAProxy 1.6, this patch thus should be backported to all supported branches [Cf: This fix was reverted because the chunk's area was unconditionally released, making haproxy crash when spoe was enabled. Now the chunk is released by calling chunk_destroy(). This function takes care of the chunk's size to release it or not. It is the responsibility of callers to set or not the chunk's size.]	2019-05-13 11:09:12 +02:00
Willy Tarreau	f7b0523425	Revert "BUG/MINOR: vars: Fix memory leak in vars_check_arg" This reverts commit `6ea00195c4`. As found by Christopher, this fix is not correct due to the way args are built at various places. For example some config or runtime parsers will place a substring pointer there, and calling free() on it will immediately crash the program. A quick audit of the code shows that there are not that many users, but the way it's done requires to properly set the string as a regular chunk (size=0 if free not desired, then call chunk_destroy() at release time), and given that the size is currently set to len+1 in all parsers, a deeper audit needs to be done to figure the impacts of not setting it anymore. Thus for now better leave this harmless leak which impacts only the config parsing time. This fix must be backported to all branches containing the fix above.	2019-05-13 10:10:01 +02:00
Tim Duesterhus	6ea00195c4	BUG/MINOR: vars: Fix memory leak in vars_check_arg vars_check_arg previously leaked the string containing the variable name: Consider this config: frontend fe1 mode http bind :8080 http-request set-header X %[var(txn.host)] Starting HAProxy and immediately stopping it by sending a SIGINT makes Valgrind report this leak: ==7795== 9 bytes in 1 blocks are definitely lost in loss record 15 of 71 ==7795== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==7795== by 0x4AA2AD: my_strndup (standard.c:2227) ==7795== by 0x51FCC5: make_arg_list (arg.c:146) ==7795== by 0x4CF095: sample_parse_expr (sample.c:897) ==7795== by 0x4BA7D7: add_sample_to_logformat_list (log.c:495) ==7795== by 0x4BBB62: parse_logformat_string (log.c:688) ==7795== by 0x4E70A9: parse_http_req_cond (http_rules.c:239) ==7795== by 0x41CD7B: cfg_parse_listen (cfgparse-listen.c:1466) ==7795== by 0x480383: readcfgfile (cfgparse.c:2089) ==7795== by 0x47A081: init (haproxy.c:1581) ==7795== by 0x4049F2: main (haproxy.c:2591) This leak can be detected even in HAProxy 1.6, this patch thus should be backported to all supported branches.	2019-05-11 06:00:50 +02:00
Olivier Houchard	25ad13f9a0	MEDIUM: vars: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creations that are made in constructors with a pool registration. Not only does this simplify the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	86abe44e42	MEDIUM: init: use self-initializing spinlocks and rwlocks This patch replaces a number of __decl_hathread() followed by HA_SPIN_INIT or HA_RWLOCK_INIT by the new __decl_spinlock() or __decl_rwlock() which automatically registers the lock for initialization during the STG_LOCK init stage. A few static modifiers were lost in the process, but since they were not essential at all it was not worth extending the API to provide such a variant.	2018-11-26 19:50:32 +01:00
Willy Tarreau	0108d90c6c	MEDIUM: init: convert all trivial registration calls to initcalls This switches explicit calls to various trivial registration methods for keywords, muxes or protocols from constructors to INITCALL1 at stage STG_REGISTER. All these calls have in common to consume a single pointer and return void. Doing this removes 26 constructors. The following calls were addressed : - acl_register_keywords - bind_register_keywords - cfg_register_keywords - cli_register_kw - flt_register_keywords - http_req_keywords_register - http_res_keywords_register - protocol_register - register_mux_proto - sample_register_convs - sample_register_fetches - srv_register_keywords - tcp_req_conn_keywords_register - tcp_req_cont_keywords_register - tcp_req_sess_keywords_register - tcp_res_cont_keywords_register - flt_register_keywords	2018-11-26 19:50:32 +01:00
Joseph Herlant	0767689e93	CLEANUP: fix typos in the comments of the vars subsystem Those are mostly misspells of the words available and variable.	2018-11-18 22:23:15 +01:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions' prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Christopher Faulet	eb3e276d39	BUG/MEDIUM: threads/vars: Fix deadlock in register_name In register_name, before locking the var_names array, we check the variable name validity. So if we try to register an invalid or empty name, we need to return without unlocking it (because it was never locked). This patch must be backported in 1.8.	2017-12-08 10:37:24 +01:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Christopher Faulet	9dcf9b6f03	MINOR: threads: Use __decl_hathreads to declare locks This macro should be used to declare variables or struct members depending on the USE_THREAD compile option. It avoids the encapsulation of such declarations between #ifdef/#endif. It is used to declare all lock variables.	2017-11-13 11:38:17 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This removes any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Christopher Faulet	e95f2c3ef5	MEDIUM: thread/vars: Make vars thread-safe A RW lock has been added to the vars structure to protect each list of variables. And a global RW lock is used to protect registered names. When a variable is fetched, we duplicate sample data because the variable could be modified by another thread.	2017-10-31 13:58:32 +01:00
Christopher Faulet	d02210cd30	MINOR: samples: Don't allocate memory for SMP_T_METH sample when method is known For known methods (GET,POST...), in samples, an enum is used instead of a chunk to reference the method. So there is no need to allocate memory when a variable is stored with this kind of sample.	2017-07-24 17:16:11 +02:00
Thierry FOURNIER / OZON.IO	d2f6f47597	BUG/MEDIUM: variables: some variable names can hide other ones The variables are compared only by text; the final '\0' (or the string length) is not checked. So the variable name "txn.internal" matches another one called "txn.int". This patch fixes this behavior. It must be backported to 1.6 and 1.7.	2016-12-12 14:34:56 +01:00
Willy Tarreau	397131093f	REORG: tcp-rules: move tcp rules processing to their own file There's no more reason to keep tcp rules processing inside proto_tcp.c given that there is nothing in common there except these 3 letters : tcp. The tcp rules are in fact connection, session and content processing rules. Let's move them to "tcp-rules" and let them live their life there.	2016-11-25 15:57:38 +01:00
Willy Tarreau	e365815007	BUILD: vars: remove a build warning on vars.c gcc 3.4.6 noticed a possibly uninitialized variable in vars.c, and while it cannot happen the way the function is used, it's surprising that newer versions did not report it. This fix may be backported to 1.6.	2016-11-24 21:25:43 +01:00
Christopher Faulet	85d79c94a9	MINOR: vars: Add 'unset-var' action/converter It does the opposite of 'set-var' action/converter. It is really useful for per-process variables. But, it can be used for any scope. The lua function 'unset_var' has also been added.	2016-11-09 22:57:01 +01:00
Christopher Faulet	ff2613ed7a	MEDIUM: vars: Add a per-process scope for variables Now it is possible to use variables attached to a process. The scope name is 'proc'. These variables are released only when HAProxy is stopped. 'tune.vars.proc-max-size' directive has been added to configure the maximum amount of memory used by "proc" variables. And because memory accounting is hierarchical for variables, memory for "proc" vars includes memory for "sess" vars.	2016-11-09 22:57:00 +01:00
Christopher Faulet	09c9df286b	MINOR: vars: Add vars_set_by_name_ifexist function This function, unsurprisingly, sets a variable value only if it already exists. In other words, this function will succeed only if the variable was found somewhere in the configuration during HAProxy startup. It will be used by SPOE filter. So an agent will be able to set a value only for existing variables. This prevents an agent from creating a very large number of unused variables to flood HAProxy and exhaust the memory reserved for variables.	2016-11-09 22:57:00 +01:00
Christopher Faulet	b71557a98b	MINOR: vars: Allow '.' in variable names This is required to have implicit prefix or scope. SPOE filter will use it to keep variables set by an agent in its own namespace.	2016-11-09 22:57:00 +01:00
Christopher Faulet	0099a8ca9d	BUG: vars: Fix 'set-var' converter because of a typo The 'set-var' converter uses function smp_conv_store (vars.c). In this function, we should use the first argument (index 0) to retrieve the variable name and its scope. But because of a typo, we get the scope of the second argument (index 1). In this case, there is no second argument. So the scope used was always 0 (SCOPE_SESS), always setting the variable in the session scope. So, due to this bug, this rule tcp-request content accept if { src,set-var(txn.foo) -m found } always set the variable 'sess.foo' instead of 'txn.foo'.	2016-11-09 22:50:54 +01:00
Willy Tarreau	620408f406	MEDIUM: tcp: add registration and processing of TCP L5 rules This commit introduces "tcp-request session" rules. These are very much like "tcp-request connection" rules except that they're processed after the handshake, so it is possible to consider SSL information and addresses rewritten by the proxy protocol header in actions. This is particularly useful to track proxied sources as this was not possible before, given that tcp-request content rules are processed after each HTTP request. Similarly it is possible to assign the proxied source address or the client's cert to a variable.	2016-10-21 18:19:24 +02:00
Willy Tarreau	87846e42a4	BUG/MINOR: vars: smp_fetch_var() doesn't depend on HTTP but on the session Thus the SMP_USE_HTTP_ANY dependency is incorrect, we have to depend on SMP_USE_L5_CLI (the session). It's particularly important for session-wide variables which are kept across HTTP requests. For now there is no impact but it will make a difference with tcp-request session rules.	2016-10-21 17:53:46 +02:00
Willy Tarreau	7513d001c8	BUG/MINOR: vars: make smp_fetch_var() more robust against misuses smp_fetch_var() may be called from everywhere since it just reads a variable. It must ensure that the stream exists before trying to return a stream-dependent variable. For now there is no impact but it will cause trouble with tcp-request session rules.	2016-10-21 17:53:46 +02:00
Willy Tarreau	108a8fd8be	BUG/MINOR: vars: use sess and not s->sess in action_store() This causes the stream to be dereferenced when not needed. It will cause trouble when variables are used outside of a stream.	2016-10-21 17:53:46 +02:00
Hubert Verstraete	831962e3b3	CLEANUP: fixed some usages of realloc leading to memory leak Changed all the cases where the pointer passed to realloc is overwritten by the pointer returned by realloc. The new function my_realloc2 has been used except in function register_name. If register_name fails to add a new variable because of an "out of memory" error, all the existing variables remain valid. If we had used my_realloc2, the array of variables would have been freed.	2016-06-29 10:45:18 +02:00
Willy Tarreau	6204cd9f27	BUG/MAJOR: vars: always retrieve the stream and session from the sample This is the continuation of previous patch called "BUG/MAJOR: samples: check smp->strm before using it". It happens that variables may have a session-wide scope, and that their session is retrieved by dereferencing the stream. But nothing prevents them from being used from a streamless context such as tcp-request connection, thus crashing the process. Example : tcp-request connection accept if { src,set-var(sess.foo) -m found } In order to fix this, we have to always ensure that variable manipulation only happens via the sample, which contains the correct owner and context, and that we never use one from a different source. This results in quite a large change since a lot of functions are indirectly involved in the call chain, but the change is easy to follow. This fix must be backported to 1.6, and requires the last two patches.	2016-03-10 17:28:04 +01:00
Willy Tarreau	658b85b68d	MEDIUM: actions: pass a new "flags" argument to custom actions Since commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions"), some actions may yield and be called back when new information is available. Unfortunately some of them may continue to yield because they simply don't know that it's the last call from the rule set. For this reason we'll need to pass a flag to the custom action to pass such information and possibly other at the same time.	2015-09-27 11:04:06 +02:00
Thierry FOURNIER	42148735bc	MEDIUM: actions: remove ACTION_STOP Before this patch, two types of custom actions existed: ACT_ACTION_CONT and ACT_ACTION_STOP. ACT_ACTION_CONT is a non-terminal action and ACT_ACTION_STOP is a terminal action. Note that ACT_ACTION_STOP is not used in HAProxy. This patch removes this behavior. Only one type of custom action exists, and it is called ACT_CUSTOM. Now the custom action can return a code indicating the required behavior: ACT_RET_CONT asks HAProxy to continue the current rule list evaluation, and ACT_RET_STOP asks HAProxy to stop the current rule list evaluation.	2015-09-02 18:36:38 +02:00
Thierry FOURNIER	4dc15d1a2d	MINOR: actions: Remove wrappers Now the prototypes of the actions are the same in every section, and a discriminant has been added to determine which section the action is called from. So, this patch removes the wrappers for the action functions called from more than one section. This patch removes 132 lines of useless code.	2015-08-20 17:13:47 +02:00
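
The name-hiding bug described in commit d2f6f47597 ("txn.internal" matching "txn.int") can be illustrated with a minimal C sketch. Both helper names below are hypothetical and are not taken from vars.c; they only contrast a prefix-only comparison with one that also checks the length.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not HAProxy code): compares only the first <len>
 * bytes, so a longer registered name is wrongly treated as a match.
 */
int name_matches_prefix_only(const char *registered, const char *name, size_t len)
{
	return strncmp(registered, name, len) == 0;
}

/* Hypothetical helper (not HAProxy code): additionally requires the
 * registered name to be exactly <len> bytes long before comparing.
 */
int name_matches_exact(const char *registered, const char *name, size_t len)
{
	return strlen(registered) == len && memcmp(registered, name, len) == 0;
}
```

With a registered name "txn.internal" and a lookup of "txn.int" (length 7), the first helper reports a match while the second does not.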

171 Commits