haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 09:07:02 +02:00

Author	SHA1	Message	Date
Thierry FOURNIER	b9b08460a2	MEDIUM: pattern: add indexation function. Before this patch, the indexation function check the declared patttern matching function and index the data according with this function. This is not useful to add some indexation mode. This commit adds dedicated indexation function. Each struct pattern is associated with one indexation function. This function permit to index data according with the type of pattern and with the type of match.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	3ead5b93c6	MINOR: pattern: separe list element from the data part. This commit separes the "struct list" used for the chain the "struct pattern" which contain the pattern data. Later, this change will permit to manipulate lists ans trees with the same "struct pattern".	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	972028fa67	MEDIUM: pattern: Change the prototype of the function pattern_register(). Each pattern parser take only one string. This change is reported to the function prototype of the function "pattern_register()". Now, it is called with just one string and no need to browse the array of args.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	511e9475f2	MEDIUM: acl/pattern: standardisation "of pat_parse_int()" and "pat_parse_dotted_ver()" The goal of these patch is to simplify the prototype of "pat_pattern_()" functions. I want to replace the argument "char args" by a simple "char arg" and remove the "opaque" argument. "pat_parse_int()" and "pat_parse_dotted_ver()" are the unique pattern parser using the "opaque" argument and using more than one string argument of the char **args. These specificities are only used with ACL. Other systems using this pattern parser (MAP and CLI) just use one string for describing a range. This two functions can read a range, but the min and the max must y specified. This patch extends the syntax to describe a range with implicit min and max. This is used for operators like "lt", "le", "gt", and "ge". the syntax is the following: ":x" -> no min to "x" "x:" -> "x" to no max This patch moves the parsing of the comparison operator from the functions "pat_parse_int()" and "pat_parse_dotted_ver()" to the acl parser. The acl parser read the operator and the values and build a volatile string readable by the functions "pat_parse_int()" and "pat_parse_dotted_ver()". The transformation is done with these rules: If the parser is "pat_parse_int()": "eq x" -> "x" "le x" -> ":x" "lt x" -> ":y" (with y = x - 1) "ge x" -> "x:" "gt x" -> "y:" (with y = x + 1) If the parser is "pat_parse_dotted_ver()": "eq x.y" -> "x.y" "le x.y" -> ":x.y" "lt x.y" -> ":w.z" (with w.z = x.y - 1) "ge x.y" -> "x.y:" "gt x.y" -> "w.z:" (with w.z = x.y + 1) Note that, if "y" is not present, assume that is "0". Now "pat_parse_int()" and "pat_parse_dotted_ver()" accept only one pattern and the variable "opaque" is no longer used. The prototype of the pattern parsers can be changed.	2014-03-17 18:06:06 +01:00
Thierry FOURNIER	9eec0a646b	MAJOR: auth: Change the internal authentication system. This patch remove the limit of 32 groups. It also permit to use standard "pat_parse_str()" function in place of "pat_parse_strcat()". The "pat_parse_strcat()" is no longer used and its removed. Before this patch, the groups are stored in a bitfield, now they are stored in a list of strings. The matching is slower, but the number of groups is low and generally the list of allowed groups is short. The fetch function "smp_fetch_http_auth_grp()" used with the name "http_auth_group" return valid username. It can be used as string for displaying the username or with the acl "http_auth_group" for checking the group of the user. Maybe the names of the ACL and fetch methods are no longer suitable, but I keep the current names for conserving the compatibility with existing configurations. The function "userlist_postinit()" is created from verification code stored in the big function "check_config_validity()". The code is adapted to the new authentication storage system and it is moved in the "src/auth.c" file. This function is used to check the validity of the users declared in groups and to check the validity of groups declared on the "user" entries. This resolve function is executed before the check of all proxy because many acl needs solved users and groups.	2014-03-17 18:06:06 +01:00
Willy Tarreau	2049092f26	CLEANUP: acl: remove obsolete test in parse_acl_expr() The ACL keyword returned by find_acl_kw() is checked for having a valid ->parse() function. This dates back 2007 when ACLs were reworked in order to differenciate old and new keywords. This check is inappropriate and confusing since all keywords have a parser now.	2014-03-17 18:04:27 +01:00
Willy Tarreau	4bfa4228dc	BUG/MINOR: acl: parser must also stop at comma on ACL-only keywords Igor at owind reported that "url_reg,lower" does not parse because find_acl_kw() looks for the parenthesis but not for the comma.	2013-12-16 22:01:06 +01:00
Willy Tarreau	adaddc21ee	CLEANUP: acl: remove useless blind copy-paste from sample converters The second error pointer is not needed, we use memprintf() to report errors, it's a leftover from a blind copy-paste of the original code.	2013-12-13 01:35:08 +01:00
Willy Tarreau	c37a3c770b	MEDIUM: acl: fix the initialization order of the ACL expression The ACL expression parser recently became a huge mess like a spaghetti plate. The keyword is looked up at the beginning, then sample fetches are processed, then an expression is initialized, then arguments and converters are parsed but only if the keyword was an ACL one, etc... Lots of "if" and redundant variables everywhere making it hard to read and follow. Let's move the args/conv parsing just after the keyword lookup. At least now it's consistent that when we leave this if/else statement, we have a sample expression initialized and full parsed wherever the elements came from.	2013-12-13 01:35:08 +01:00
Willy Tarreau	131b466f98	MEDIUM: acl: fix the argument parser to let the lower layer report detailed errors Just like for the last commit, we need to fix the ACL argument parser so that it lets the lower layer do the job of referencing unresolved arguments and correctly report the type of missing arguments.	2013-12-13 01:35:08 +01:00
Willy Tarreau	975c1784c8	MINOR: sample: make sample_parse_expr() use memprintf() to report parse errors Doing so ensures that we're consistent between all the functions in the whole chain. This is important so that we can extract the argument parsing from this function.	2013-12-12 23:16:54 +01:00
Thierry FOURNIER	7609064fc3	MINOR: pattern: make the pattern matching function return a pointer to the matched element This feature will be used by the CLI to look up keys.	2013-12-12 15:44:05 +01:00
Thierry FOURNIER	7148ce6ef4	MEDIUM: pattern: Extract the index process from the pat_parse_() functions Now, the pat_parse_() functions parses the incoming data. The input "pattern" struct can be preallocated. If the parser needs to add some buffers, it allocates memory. The function pattern_register() runs the call to the parser, process the key indexation and associate the "sample_storage" used by maps.	2013-12-12 15:42:11 +01:00
Willy Tarreau	f3489d2ccd	MINOR: pattern: do not assign SMP_TYPES by default to patterns This is never used since we exit on parse failure, and it's confusing.	2013-12-12 15:42:11 +01:00
Thierry FOURNIER	e3ded59706	MEDIUM: acl: Last patch change the output type This patch remove the compatibility check from the input type and the match method. Now, it checks if a casts from the input type to output type exists and the pattern_exec_match() function apply casts before each pattern matching.	2013-12-12 15:42:11 +01:00
Willy Tarreau	3d536ac378	BUG/MINOR: acl: fix sample expression error reporting ACL parse errors are not easy to understand since recent commit `348971e` (MEDIUM: acl: use the fetch syntax 'fetch(args),conv(),conv()' into the ACL keyword) : [ALERT] 339/154717 (26437) : parsing [check-bug.cfg:10] : error detected while parsing a 'stats admin' rule : unknown ACL or sample keyword 'env(a,b,c)': invalid arg 2 in fetch method 'env' : end of arguments expected at position 2, but got ',b,c'.. This error is only relevant to sample fetch keywords, so the new form is a bit easier to understand : [ALERT] 339/160011 (26626) : parsing [check-bug.cfg:12] : error detected while parsing a 'stats admin' rule : invalid arg 2 in fetch method 'env' : end of arguments expected at position 2, but got ',b,c' in sample expression 'env(a,b,c),upper'. No backport is needed.	2013-12-06 16:02:46 +01:00
Thierry FOURNIER	ab92cf3a09	BUG/MINOR: acl: acl parser does not recognize empty converter list Commit `348971e` (MEDIUM: acl: use the fetch syntax 'fetch(args),conv(),conv()' into the ACL keyword) introduced a regression in the ACL parser. The second argument of an ACL keyword is now mistakenly confused with a converter. This bug is post-dev19 and does not require any backport.	2013-12-06 11:32:19 +01:00
Willy Tarreau	6f8fe310cf	MINOR: pattern: import acl_find_match_name() into pattern.h It's only dedicated to pattern match lookups, so it was renamed pat_find_match_name().	2013-12-02 23:31:33 +01:00
Willy Tarreau	0cba607400	MINOR: acl/pattern: use types different from int to clarify who does what. We now have the following enums and all related functions return them and consume them : enum pat_match_res { PAT_NOMATCH = 0, /* sample didn't match any pattern / PAT_MATCH = 3, / sample matched at least one pattern / }; enum acl_test_res { ACL_TEST_FAIL = 0, / test failed / ACL_TEST_MISS = 1, / test may pass with more info / ACL_TEST_PASS = 3, / test passed / }; enum acl_cond_pol { ACL_COND_NONE, / no polarity set yet / ACL_COND_IF, / positive condition (after 'if') / ACL_COND_UNLESS, / negative condition (after 'unless') */ }; It's just in order to avoid doubts when reading some code.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	a65b343eee	MEDIUM: pattern: rename "acl" prefix to "pat" This patch just renames functions, types and enums. No code was changed. A significant number of files were touched, especially the ACL arrays, so it is likely that some external patches will not apply anymore. One important thing is that we had to split ACL_PAT_* into two groups : - ACL_TEST_{PASS\|MISS\|FAIL} - PAT_{MATCH\|UNMATCH} A future patch will enforce enums on all these places to avoid confusion.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	d163e1ce30	MEDIUM: pattern: create pattern expression This new structure contains the data needed for pattern matching. It's the first step to the complete independance of the pattern matching.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	ed66c297c2	REORG: acl/pattern: extract pattern matching from the acl file and create pattern.c This patch just moves code without any change. The ACL are just the association between sample and pattern. The pattern contains the match method and the parse method. These two things are different. This patch cleans the code by splitting it.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	dd69a04666	MEDIUM: acl: associate "struct sample_storage" to each "struct acl_pattern" This will be used later with maps. Each map will associate an entry with a sample_storage value. This patch changes the "parse" prototype and all the parsing methods. The goal is to associate "struct sample_storage" to each entry of "struct acl_pattern". Only the "parse" function can add the sample value into the "struct acl_pattern".	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	29d47b87c4	MINOR: acl: Extract the pattern matching function The map feature will need to match acl patterns. This patch extracts the matching function from the global ACL function "acl_exec_cond". The code was only moved to its own function, no functional changes were made.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	3a103c5a6b	MINOR: acl: Extract the pattern parsing and indexation from the "acl_read_patterns_from_file()" function With this split, the pattern indexation can apply to any source. The map feature needs this functionality because the map cannot be loaded with the same file format as the ones supported by acl_read_patterns_from_file(). The code was only moved to its own function, no functional changes were made.	2013-12-02 23:31:33 +01:00
Thierry FOURNIER	319e495a96	MINOR: acl: export acl arrays The map feature needs to use the acl parser and converters.	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	9c1d67ecbd	MINOR: sample: provide the original sample_conv descriptor struct to the argument checker function. Note that this argument checker is still unused but will be used by maps.	2013-12-02 23:31:32 +01:00
Thierry FOURNIER	348971ea28	MEDIUM: acl: use the fetch syntax 'fetch(args),conv(),conv()' into the ACL keyword If the acl keyword is a "fetch", the dedicated parsing function "sample_parse_expr()" is used. Otherwise, the acl parsing function "parse_acl_expr()" is extended to understand the syntax of a series of converters placed after the "fetch" keyword. Before this patch, each acl uses a "struct sample_fetch" and executes it with the "<fetch>->process()" function. Now, the dedicated function "sample_process()" is called. These syntax are now avalaible: acl bad req.hdr(host),lower -m str www http-request redirect prefix /go-away if bad acl bad hdr_beg(host),lower www http-request redirect prefix /go-away if bad	2013-12-02 23:31:32 +01:00
Willy Tarreau	60eccc1841	BUG/MINOR: acl: remove patterns from the tree before freeing them A call to free_pattern_tree() upon exit() is made to free all ACL patterns allocated in a tree (strings or IP addresses). Unfortunately it happens that this function has been bogus from the beginning, it walks over the whole tree, frees the nodes but forgets to remove them from the tree prior to freeing them. So after visiting a leaf, the next eb_next() call will require to revisit some of the upper nodes that were just freed. This can remain unnoticed for a long time because free() often just marks the area as free. But in cases of aggressive memory freeing, the location will not be mapped anymore and the process segfaults. Note that the bug has no impact other than polluting kernel logs and frightening sysadmins, since it happens just before exit(). Simply adding the debug code below makes it easier to reproduce the same bug : while (node) { next = eb_next(node); + node->node_p = (void *)-1; free(node); node = next; } Many thanks to the StackExchange team for their very detailed bug report that permitted to quickly understand this non-obvious bug! This fix should be backported to 1.4 which introduced the bug.	2013-11-14 16:37:50 +01:00
Willy Tarreau	3c3dfd5c61	MINOR: acl: add a warning when an ACL keyword is used without any value It's quite common to write directives like the following : tcp-request reject if WAIT_END { sc0_inc_gpc0 } This one will never reject, because sc0_inc_gpc0 is provided no value to compare against. The proper form should have been something like this : tcp-request reject if WAIT_END { sc0_inc_gpc0 gt 0 } or : tcp-request reject if WAIT_END { sc0_inc_gpc0 -m found } Now we detect the absence of any argument on the command line and emit a warning suggesting alternatives or the use of "--" to really avoid matching anything (might be used when debugging).	2013-11-04 18:12:20 +01:00
Willy Tarreau	79c412b673	BUG/MEDIUM: acl: do not evaluate next terms after a miss When a condition does something like : action if A B C \|\| D E F If B returns a miss (can't tell true or false), C must not be evaluated. This is important when C has a side effect (eg: sc*_inc_gpc0). However the second part after the \|\| can still be evaluated.	2013-10-30 19:30:32 +01:00
Thierry FOURNIER	ef37a66628	CLEANUP: The function "regex_exec" needs the string length but in many case they expect null terminated char. If haproxy is compiled with the USE_PCRE_JIT option, the length of the string is used. If it is compiled without this option the function doesn't use the length and expects a null terminated string. The prototype of the function is ambiguous, and depends on the compilation option. The developer can think that the length is always used, and many bugs can be created. This patch makes sure that the length is used. The regex_exec function adds the final '\0' if it is needed.	2013-10-23 12:19:51 +02:00
Willy Tarreau	9ca6936c80	BUG/MINOR: acl: implicit arguments of ACL keywords were not properly resolved William Lallemand reported a bug which happens when an ACL keyword using an implicit argument (eg: a proxy name) is used : the keyword is not properly set in the arglist field, resulting in an error about the previous keyword being returned, or "(null)" if the faulty ACL appears first. The bug only affects error reporting and is 1.5-specific, so no backport is nedeed.	2013-10-22 19:31:04 +02:00
Thierry FOURNIER	ed5a4aefae	CLEANUP: regex: Create regex_comp function that compiles regex using compilation options The current file "regex.h" define an abstraction for the regex. It provides the same struct name and the same "regexec" function for the 3 regex types supported: standard libc, basic pcre and jit pcre. The regex compilation function is not provided by this file. If the developper wants to use regex, he must write regex compilation code containing "#define JIT". This patch provides a unique regex compilation function according to the compilation options. In addition, the "regex.h" file checks the presence of the "#define PCRE_CONFIG_JIT" when "USE_PCRE_JIT" is enabled. If this flag is not present, the pcre lib doesn't support JIT and "#error" is emitted.	2013-10-14 14:42:50 +02:00
Willy Tarreau	8cc16530c4	BUG/MINOR: acl: fix improper string size assignment in proxy argument This minor bug was found using the coccinelle script "da.cocci". The len was initialized twice instead of setting the size. It's harmless since no operations are performed on this empty string but needs to be fixed anyway.	2013-09-29 12:54:58 +02:00
Willy Tarreau	ef38c39287	MEDIUM: sample: systematically pass the keyword pointer to the keyword We're having a lot of duplicate code just because of minor variants between fetch functions that could be dealt with if the functions had the pointer to the original keyword, so let's pass it as the last argument. An earlier version used to pass a pointer to the sample_fetch element, but this is not the best solution for two reasons : - fetch functions will solely rely on the keyword string - some other smp_fetch_* users do not have the pointer to the original keyword and were forced to pass NULL. So finally we're passing a pointer to the keyword as a const char *, which perfectly fits the original purpose.	2013-08-01 21:17:13 +02:00
Willy Tarreau	5b8ad22228	CLEANUP: acl: move the 3 remaining sample fetches to samples.c There is no more reason for having "always_true", "always_false" and "env" in acl.c while they're the most basic sample fetch keywords, so let's move them to sample.c where it's easier to find them.	2013-07-25 15:00:37 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Willy Tarreau	24b2c766f4	BUG/MINOR: acl: correctly consider boolean fetches when doing casts Commit `5adeda1` (acl: add option -m to change the pattern matching method) was not completely correct with regards to boolean fetches. It only used the sample type to determine if the test had to be performed as a boolean instead of relying on the match function. Due to this, a test such as the following would not correctly match as the pattern would be ignored : acl srv_down srv_is_up(s2) -m int 0 No backport is needed as this was merged first in 1.5-dev18.	2013-06-12 22:26:05 +02:00
Willy Tarreau	595ec54d38	MINOR: acl: add the new "env()" fetch method to retrieve an environment variable This is useful in order to take different actions across restarts without touching the configuration (eg: soft-stop), or to pass some information such as the local host name to the next hop.	2013-06-12 22:26:05 +02:00
Willy Tarreau	6d4e4e8dd2	MEDIUM: acl: remove a lot of useless ACLs that are equivalent to their fetches The following 116 ACLs were removed because they're redundant with their fetch function since last commit which allows the fetch function to be used instead for types BOOL, INT and IP. Most places are now left with an empty ACL keyword list that was not removed so that it's easier to add other ACLs later. always_false, always_true, avg_queue, be_conn, be_id, be_sess_rate, connslots, nbsrv, queue, srv_conn, srv_id, srv_is_up, srv_sess_rate, res.comp, fe_conn, fe_id, fe_sess_rate, dst_conn, so_id, wait_end, http_auth, http_first_req, status, dst, dst_port, src, src_port, sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_gpc0_rate, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_tracked, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_gpc0_rate, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_tracked, sc2_trackers, sc3_bytes_in_rate, sc3_bytes_out_rate, sc3_clr_gpc0, sc3_conn_cnt, sc3_conn_cur, sc3_conn_rate, sc3_get_gpc0, sc3_gpc0_rate, sc3_http_err_cnt, sc3_http_err_rate, sc3_http_req_cnt, sc3_http_req_rate, sc3_inc_gpc0, sc3_kbytes_in, sc3_kbytes_out, sc3_sess_cnt, sc3_sess_rate, sc3_tracked, sc3_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_gpc0_rate, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, ssl_c_ca_err, ssl_c_ca_err_depth, ssl_c_err, ssl_c_used, ssl_c_verify, ssl_c_version, ssl_f_version, ssl_fc, ssl_fc_alg_keysize, ssl_fc_has_crt, ssl_fc_has_sni, ssl_fc_use_keysize,	2013-06-11 21:22:58 +02:00
Willy Tarreau	9987ea9967	MEDIUM: acl: automatically detect the type of certain fetches Commit `bef91e71` added the possibility to automatically use some fetch functions instead of ACL functions, but for the fetch output type was never used and setting the match method using -m was always mandatory. Some fetch types are non-ambiguous and can intuitively be associated with some ACL types : SMP_T_BOOL -> bool SMP_T_UINT/SINT -> int SMP_T_IPV4/IPV6 -> ip So let's have the ACL expression parser detect these ones automatically. Other types are more ambiguous, especially everything related to strings, as there are many string matching methods available and none of them is the obvious standard matching method for any string. These ones will still have to be specified using -m.	2013-06-11 21:09:06 +02:00
Hiroaki Nakamura	e3cf2229ad	BUG: regex: fix pcre compile error when using JIT According to "man pcreapi", pcre_compile() does not accept being passed a NULL pointer in errptr or erroffset. It immediately returns NULL, causing any expression to fail. So let's pass real variables and make use of them to improve error reporting.	2013-04-11 08:17:37 +02:00
Willy Tarreau	f75d008c45	BUG/MAJOR: acl: add implicit arguments to the resolve list When an ACL keyword needs a mandatory argument and this argument is of type proxy or table, it is allowed not to specify it so that current proxy is used by default. In order to achieve this, the ACL expression parser builds a dummy argument from scratch and marks it unresolved. However, since recent changes on the ACL and samples, an unresolved argument needs to be added to the unresolved list. This specific code did not do it, resulting in random data being used as a proxy pointer if no argument was passed for a proxy name, possibly even causing a crash. A quick workaround consists explicitly naming proxies in ACLs.	2013-04-07 21:20:44 +02:00
Willy Tarreau	a4312fa28e	MAJOR: sample: maintain a per-proxy list of the fetch args to resolve While ACL args were resolved after all the config was parsed, it was not the case with sample fetch args because they're almost everywhere now. The issue is that ACLs now solely rely on sample fetches, so their args resolving doesn't work anymore. And many fetches involving a server, a proxy or a userlist don't work at all. The real issue is that at the bottom layers we have no information about proxies, line numbers, even ACLs in order to report understandable errors, and that at the top layers we have no visibility over the locations where fetches are referenced (think log node). After failing multiple unsatisfying solutions attempts, we now have a new concept of args list. The principle is that every proxy has a list head which contains a number of indications such as the config keyword, the context where it's used, the file and line number, etc... and a list of arguments. This list head is of the same type as the elements, so it serves as a template for adding new elements. This way, it is filled from top to bottom by the callers with the information they have (eg: line numbers, ACL name, ...) and the lower layers just have to duplicate it and add an element when they face an argument they cannot resolve yet. Then at the end of the configuration parsing, a loop passes over each proxy's list and resolves all the args in sequence. And this way there is all necessary information to report verbose errors. The first immediate benefit is that for the first time we got very precise location of issues (arg number in a keyword in its context, ...). Second, in order to do this we had to parse log-format and unique-id-format a bit earlier, so that was a great opportunity for doing so when the directives are encountered (unless it's a default section). This way, the recorded line numbers for these args are the ones of the place where the log format is declared, not the end of the file. Userlists report slightly more information now. They're the only remaining ones in the ACL resolving function.	2013-04-03 02:13:02 +02:00
Willy Tarreau	bef91e7144	MEDIUM: acl: support using sample fetches directly in ACLs Now it becomes possible to directly use sample fetches as the ACL fetch methods. In this case, the matching method is mandatory. This allows to form more ACL combinations from existing fetches and will limit the need for new ACLs when everything is available to form them from sample fetches and matches.	2013-04-03 02:13:02 +02:00
Willy Tarreau	93fddf1dbc	MEDIUM: acl: have a pointer to the keyword name in acl_expr The acl_expr struct used to hold a pointer to the ACL keyword. But since we now have all the relevant pointers, we don't need that anymore, we just need the pointer to the keyword as a string in order to return warnings and error messages. So let's change this in order to remove the dependency on the acl_keyword struct from acl_expr. During this change, acl_cond_kw_conflicts() used to return a pointer to an ACL keyword but had to be changed to return a const char* for the same reason.	2013-04-03 02:13:01 +02:00
Willy Tarreau	acca90d8e5	MINOR: acl: remove the use_count in acl keywords use_cnt is not used at all anymore, let's get rid of it.	2013-04-03 02:13:01 +02:00
Willy Tarreau	5adeda1f63	MAJOR: acl: add option -m to change the pattern matching method ACL expressions now support "-m" in addition to "-i" and "-f". This new option is followed by the name of the pattern matching method to be used on the extracted pattern. This makes it possible to reuse existing sample fetch methods with other matching methods (eg: regex). A "found" matching method ignores any pattern and only verifies that the required sample was found (useful for cookies).	2013-04-03 02:13:01 +02:00
Willy Tarreau	d76a98a5fc	MEDIUM: acl: move the ->parse, ->match and ->smp fields to acl_expr We'll need each ACL expression to be able to support its own parse and match methods, so we're moving these fields to the ACL expression.	2013-04-03 02:13:01 +02:00
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	a91d0a583c	MAJOR: acl: convert all ACL requires to SMP use+val instead of ->requires The ACLs now use the fetch's ->use and ->val to decide upon compatibility between the place where they are used and where the information are fetched. The code is capable of reporting warnings about very fine incompatibilities between certain fetches and an exact usage location, so it is expected that some new warnings will be emitted on some existing configurations. Two degrees of detection are provided : - detecting ACLs that never match - detecting keywords that are ignored All tests show that this seems to work well, though bugs are still possible.	2013-04-03 02:13:00 +02:00
Willy Tarreau	25320b2906	MEDIUM: proxy: remove acl_requires and just keep a flag "http_needed" Proxy's acl_requires was a copy of all bits taken from ACLs, but we'll get rid of ACL flags and only rely on sample fetches soon. The proxy's acl_requires was only used to allocate an HTTP context when needed, and was even forced in HTTP mode. So better have a flag which exactly says what it's supposed to be used for.	2013-04-03 02:13:00 +02:00
Willy Tarreau	4a96bf5a5d	CLEANUP: acl: remove ACL hooks which were never used These hooks, which established the relation between ACL_USE_* and the location where the ACL were used, were never used because they were superseded with the sample capabilities. Remove them now.	2013-04-03 02:12:59 +02:00
Willy Tarreau	9baae63d8d	MAJOR: acl: remove fetch argument validation from the ACL struct ACL fetch being inherited from the sample fetch keyword, we don't need anymore to specify what function to use to validate the fetch arguments. Note that the job is still done in the ACL parsing code based on elements from the sample fetch structs.	2013-04-03 02:12:59 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	d4c33c8889	MEDIUM: samples: move payload-based fetches and ACLs to their own file The file acl.c is a real mess, it both contains functions to parse and process ACLs, and some sample extraction functions which act on buffers. Some other payload analysers were arbitrarily dispatched to proto_tcp.c. So now we're moving all payload-based fetches and ACLs to payload.c which is capable of extracting data from buffers and rely on everything that is protocol-independant. That way we can safely inflate this file and only use the other ones when some fetches are really specific (eg: HTTP, SSL, ...). As a result of this cleanup, the following new sample fetches became available even if they're not really useful : always_false, always_true, rep_ssl_hello_type, rdp_cookie_cnt, req_len, req_ssl_hello_type, req_ssl_sni, req_ssl_ver, wait_end The function 'acl_fetch_nothing' was wrong and never used anywhere so it was removed. The "rdp_cookie" sample fetch used to have a mandatory argument while it was optional in ACLs, which are supposed to iterate over RDP cookies. So we're making it optional as a fetch too, and it will return the first one.	2013-04-03 02:12:57 +02:00
Willy Tarreau	e0db1e8946	MEDIUM: acl: remove flag ACL_MAY_LOOKUP which is improperly used This flag is used on ACL matches that support being looking up patterns in trees. At the moment, only strings and IPs support tree-based lookups, but the flag is randomly set also on integers and binary data, and is not even always set on strings nor IPs. Better get rid of this mess by only relying on the matching function to decide whether or not it supports tree-based lookups, this is safer and easier to maintain.	2013-04-03 02:12:56 +02:00
Hiroaki Nakamura	7035132349	MEDIUM: regex: Use PCRE JIT in acl This is a patch for using PCRE JIT in acl. I notice regex are used in other places, but they are more complicated to modify to use PCRE APIs. So I focused to acl in the first try. BTW, I made a simple benchmark program for PCRE JIT beforehand. https://github.com/hnakamur/pcre-jit-benchmark I read the manual for PCRE JIT http://www.manpagez.com/man/3/pcrejit/ and wrote my benchmark program. https://github.com/hnakamur/pcre-jit-benchmark/blob/master/test-pcre.c	2013-04-02 00:02:54 +02:00
Willy Tarreau	f678b7f322	BUG/MINOR: config: fix improper check for failed memory alloc in ACL parser The wrong variable is checked after a calloc() so a memory shortage would result in a segfault while loading the config instead of a clean error. This fix may be backported to 1.4 and 1.3 which are both affected. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:18 +01:00
Willy Tarreau	4a6e5c6d69	BUG/MEDIUM: acl: make prue_acl_expr() correctly free ACL expressions upon exit When leaving, during the deinit() process, prune_acl_expr() is called to delete all ACL expressions. A bug was introduced with commit `34db1084` that caused every other expression argument to be skipped, and more annoyingly, it introduced the risk of scanning past the arg list and crashing or freezing the old process during a reload. Credits for finding this issue go to Dmitry Sivachenko who first reported it, and second did a lot of research to narrow it down to a minimal configuration.	2012-11-24 00:02:14 +01:00
Willy Tarreau	7d1df41171	BUG/MEDIUM: acl: correctly resolve all args, not just the first one Since 1.5-dev9, ACLs support multiple args. The changes performed in acl_find_targets() were bogus as they were not always applied to the current argument being processed, but sometimes to the first one only. Fortunately till now, all ACLs which support resolvable arguments have it in the first place only, so there was no impact.	2012-11-23 23:47:36 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
Cyril Bont�	4c01beb64b	BUG/MEDIUM: acls using IPv6 subnets patterns incorrectly match IPs Some tests revealed that IPs not in the range of IPv6 subnets incorrectly matched (for example "acl BUG src 2804::/16" applied to a src IP "127.0.0.1"). This is caused by the acl_match_ip() function applies a mask in host byte order, whereas it should be in network byte order.	2012-10-24 01:00:53 +02:00
Emeric Brun	07ca496ea9	MINOR: acl: add parse and match primitives to use binary type on ACLs Binary ACL match patterns can now be entered as hex digit strings.	2012-10-22 18:54:24 +02:00
Willy Tarreau	2e845be249	MEDIUM: sample: pass an empty list instead of a null for fetch args ACL and sample fetches use args list and it is really not convenient to check for null args everywhere. Now for empty args we pass a constant list of end of lists. It will allow us to remove many useless checks.	2012-10-19 19:49:09 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	f332af7715	CLEANUP: acl: use 'chn' instead of 'b' to name channel pointers As with previous patches, this naming is confusing.	2012-10-12 23:58:13 +02:00
Willy Tarreau	eb6cead1de	MINOR: standard: make memprintf() support a NULL destination Doing so removes many checks that were systematically made because the callees don't know if the caller passed a valid pointer.	2012-09-24 10:53:16 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	e3a461118c	BUG/MINOR: ACL implicit arguments must be created with unresolved flag Commit 496aa0 fixed a design issue by adding an "unresolved" flag to the ACL arguments. Unfortunately this unresolved flag was not set when building the fake argument some ACL need when using an implicit argument pointing to the local proxy. Special thanks to Michael Kearey who reported the issue with a reproducer and the commit introducing the bug.	2012-06-15 08:02:34 +02:00
Willy Tarreau	496aa0111e	BUG/MEDIUM: ensure that unresolved arguments are freed exactly once When passing arguments to ACLs and samples, some types are stored as strings then resolved later after config parsing is done. Upon exit, the arguments need to be freed only if the string was not resolved yet. At the moment we can encounter double free during deinit() because some arguments (eg: userlists) are freed once as their own type and once as a string. The solution consists in adding an "unresolved" flag to the args to say whether the value is still held in the <str> part or is final. This could be debugged thanks to a useful bug report from Sander Klein.	2012-06-01 10:40:52 +02:00
David du Colombier	7af4605ef7	BUG/MAJOR: trash must always be the size of a buffer Before it was possible to resize the buffers using global.tune.bufsize, the trash has always been the size of a buffer by design. Unfortunately, the recent buffer sizing at runtime forgot to adjust the trash, resulting in it being too short for content rewriting if buffers were enlarged from the default value. The bug was encountered in 1.4 so the fix must be backported there.	2012-05-16 14:21:55 +02:00
Willy Tarreau	46b39d0dc6	BUG/MEDIUM: config: don't crash at config load time on invalid userlist names Cyril Bont� reported that passing an invalid userlist name to http_auth_group() caused haproxy to crash at load. This was due to an attempt to use the unresolved userlist pointer later to resolve auth groups since we report many errors before leaving now. This issue does not exist in earlier versions since they immediately abort on the first error, so no backport is needed.	2012-05-10 23:42:22 +02:00
Willy Tarreau	ceb4ac9c34	MEDIUM: acl: support IPv6 address matching Make use of the new IPv6 pattern type so that acl_match_ip() knows how to compare pattern and sample. IPv6 may be entered in their usual form, with or without a netmask appended. Only bit counts are accepted for IPv6 netmasks. In order to avoid any risk of trouble with randomly resolved IP addresses, host names are never allowed in IPv6 patterns. HAProxy is also able to match IPv4 addresses with IPv6 addresses in the following situations : - tested address is IPv4, pattern address is IPv4, the match applies in IPv4 using the supplied mask if any. - tested address is IPv6, pattern address is IPv6, the match applies in IPv6 using the supplied mask if any. - tested address is IPv6, pattern address is IPv4, the match applies in IPv4 using the pattern's mask if the IPv6 address matches with 2002:IPV4::, ::IPV4 or ::ffff:IPV4, otherwise it fails. - tested address is IPv4, pattern address is IPv6, the IPv4 address is first converted to IPv6 by prefixing ::ffff: in front of it, then the match is applied in IPv6 using the supplied IPv6 mask.	2012-05-08 21:28:14 +02:00
Willy Tarreau	c92ddbc37d	MINOR: acl: add types to ACL patterns We cannot currently match IPv6 addresses in ACL simply because we don't support types on the patterns. Let's introduce this notion. For now, we rely on the SMP_TYPES though it doesn't seem like it will last forever given that some types are not present there (eg: regex, meth). Still it should be enough to support mixed matchings for most types. We use the special impossible value SMP_TYPES for types that don't exist in the SMP_T_* space.	2012-05-08 20:57:21 +02:00
Willy Tarreau	7dcb6480db	MEDIUM: acl: extend the pattern parsers to report meaningful errors By passing the error pointer to all ACL parsers, we can make them report useful errors and not simply fail.	2012-05-08 20:57:20 +02:00
Willy Tarreau	08ad0b38c4	MINOR: acl: report errors encountered when loading patterns from files This happens in acl_read_patterns_from_file(). Errors are still incomplete, parsing functions must be improved to report parsing errors.	2012-05-08 20:57:20 +02:00
Willy Tarreau	b7451bb660	MEDIUM: acl: report parsing errors to the caller All parsing errors were known but impossible to return. Now by making use of memprintf(), we're able to build meaningful error messages that the caller can display.	2012-05-08 20:57:20 +02:00
Willy Tarreau	ae52f06da3	MINOR: acl: add a val_args field to keywords This will make it possible to delegate argument validating to functions shared with smp_fetch_*.	2012-05-08 20:57:19 +02:00
Willy Tarreau	7a777edbdf	MINOR: acl: set SMP_OPT_ITERATE on fetch functions This way, fetch functions will be able to tell if they're called for a single request or as part of a loop. This is important for instance when we use hdr(foo), because in an ACL this means that all hdr(foo) occurrences must be checked while in a pattern it means only one of them (eg: last one).	2012-05-08 20:57:18 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	32389b7d04	MEDIUM: acl/pattern: switch rdp_cookie functions stack up-down Previously, both pattern, backend and persist_rdp_cookie would build fake ACL expressions to fetch an RDP cookie by calling acl_fetch_rdp_cookie(). Now we switch roles. The RDP cookie fetch function is provided as a sample fetch function that all others rely on, including ACL. The code is exactly the same, only the args handling moved from expr->args to args. The code was moved to proto_tcp.c, but probably that a dedicated file would be more suited to content handling.	2012-05-08 20:57:16 +02:00
Willy Tarreau	21e5b0e3cb	MEDIUM: get rid of SMP_F_READ_ONLY and SMP_F_MUST_FREE These ones were either unused or improperly used. Some integers were marked read-only, which does not make much sense. Buffers are not read-only, they're "constant" in that they must be kept intact after any possible change.	2012-05-08 20:57:15 +02:00
Willy Tarreau	197e10aaae	MEDIUM: acl: get rid of the SET_RES flags We now simply rely on a boolean result from a fetch to declare a match. Booleans are not compared against patterns, they fix the result.	2012-05-08 20:57:15 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	8f7406e9b4	MEDIUM: acl: remove the ACL_TEST_F_NULL_MATCH flag This flag was used to force a boolean match even if there was no pattern to match. It was used only by http_auth() and designed only for this one. It's easier and cleaner to make the fetch function perform the test and report the boolean result as a few other functions already do. It simplifies the acl_exec_cond() logic and will help merging ACLs and patterns.	2012-05-08 20:57:13 +02:00
Willy Tarreau	fc2c1fd449	MAJOR: acl: ensure that implicit table and proxies are valid A large number of ACLs make use of frontend, backend or table names in their arguments, and fall back to the current proxy when no argument is passed. If the expected capability is not available, the ACL silently fails at runtime. Now we make all those names mandatory in the parser and we rely on acl_find_targets() to replace the missing names with the holding proxy, then to perform the appropriate tests, and to reject errors at parsing time. It is possible that some faulty configurations will get rejected from now on, while they used to silently fail till now. This is the reason why this change is marked as MAJOR.	2012-05-08 20:57:12 +02:00
Willy Tarreau	d28c353fc5	MAJOR: acl: make acl_find_targets also resolve proxy names at config time Proxy names are now resolved when the config is parsed and not at runtime. This means that errors will be caught for real instead of having an ACL silently never match. Another benefit is that the fetch will be much faster since the lookup will not have to be performed anymore, eg for all ACLs based on explicitly named stick-tables. However some buggy configurations which used to silently fail in the past will now refuse to load, hence the MAJOR tag.	2012-05-08 20:57:11 +02:00
Willy Tarreau	63364eed75	MEDIUM: acl: acl_find_target() now resolves arguments based on their types This function does not rely on the keyword anymore but just on its type. It's much cleaner and much safer. It should be extended to do the same for all PRX type arguments.	2012-05-08 20:57:11 +02:00
Willy Tarreau	61612d49a7	MAJOR: acl: store the ACL argument types in the ACL keyword declaration The types and minimal number of ACL keyword arguments are now stored in their declaration. This will allow many more fantasies if some ACL use several arguments or types. Doing so required to rework all ACL keyword declarations to add two parameters. So this was a good opportunity for a general cleanup and to sort all entries in alphabetical order. We still have two pending issues : - parse_acl_expr() checks for errors but has no way to report them to the user ; - the types of some arguments are still not resolved and kept as strings (eg: ARGT_FE/BE/TAB) for compatibility reasons, which must be resolved in acl_find_targets()	2012-05-08 20:57:11 +02:00
Willy Tarreau	34db108423	MAJOR: acl: make use of the new argument parsing framework The ACL parser now uses the argument parser to build a typed argument list. Right now arguments are all strings and only one argument is supported since this is what ACLs currently support.	2012-05-08 20:57:11 +02:00
Willy Tarreau	89fa706d39	MAJOR: buffers: replace buf->w with buf->p - buf->o This change introduces the buffer's base pointer, which is the limit between incoming and outgoing data. It's the point where the parsing should start from. A number of computations have already been greatly simplified, but more simplifications are expected to come from the removal of buf->r. The changes appear good and have revealed occasional improper use of some pointers. It is possible that this patch has introduced bugs or revealed some, although preliminary testings tend to indicate that everything still works as it should.	2012-05-08 12:28:10 +02:00

1 2 3 4 5

214 Commits