haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-14 11:06:56 +02:00

Author	SHA1	Message	Date
Willy Tarreau	a4312fa28e	MAJOR: sample: maintain a per-proxy list of the fetch args to resolve While ACL args were resolved after all the config was parsed, it was not the case with sample fetch args because they're almost everywhere now. The issue is that ACLs now solely rely on sample fetches, so their args resolving doesn't work anymore. And many fetches involving a server, a proxy or a userlist don't work at all. The real issue is that at the bottom layers we have no information about proxies, line numbers, even ACLs in order to report understandable errors, and that at the top layers we have no visibility over the locations where fetches are referenced (think log node). After failing multiple unsatisfying solutions attempts, we now have a new concept of args list. The principle is that every proxy has a list head which contains a number of indications such as the config keyword, the context where it's used, the file and line number, etc... and a list of arguments. This list head is of the same type as the elements, so it serves as a template for adding new elements. This way, it is filled from top to bottom by the callers with the information they have (eg: line numbers, ACL name, ...) and the lower layers just have to duplicate it and add an element when they face an argument they cannot resolve yet. Then at the end of the configuration parsing, a loop passes over each proxy's list and resolves all the args in sequence. And this way there is all necessary information to report verbose errors. The first immediate benefit is that for the first time we got very precise location of issues (arg number in a keyword in its context, ...). Second, in order to do this we had to parse log-format and unique-id-format a bit earlier, so that was a great opportunity for doing so when the directives are encountered (unless it's a default section). This way, the recorded line numbers for these args are the ones of the place where the log format is declared, not the end of the file. Userlists report slightly more information now. They're the only remaining ones in the ACL resolving function.	2013-04-03 02:13:02 +02:00
Willy Tarreau	bef91e7144	MEDIUM: acl: support using sample fetches directly in ACLs Now it becomes possible to directly use sample fetches as the ACL fetch methods. In this case, the matching method is mandatory. This allows to form more ACL combinations from existing fetches and will limit the need for new ACLs when everything is available to form them from sample fetches and matches.	2013-04-03 02:13:02 +02:00
Willy Tarreau	93fddf1dbc	MEDIUM: acl: have a pointer to the keyword name in acl_expr The acl_expr struct used to hold a pointer to the ACL keyword. But since we now have all the relevant pointers, we don't need that anymore, we just need the pointer to the keyword as a string in order to return warnings and error messages. So let's change this in order to remove the dependency on the acl_keyword struct from acl_expr. During this change, acl_cond_kw_conflicts() used to return a pointer to an ACL keyword but had to be changed to return a const char* for the same reason.	2013-04-03 02:13:01 +02:00
Willy Tarreau	acca90d8e5	MINOR: acl: remove the use_count in acl keywords use_cnt is not used at all anymore, let's get rid of it.	2013-04-03 02:13:01 +02:00
Willy Tarreau	5adeda1f63	MAJOR: acl: add option -m to change the pattern matching method ACL expressions now support "-m" in addition to "-i" and "-f". This new option is followed by the name of the pattern matching method to be used on the extracted pattern. This makes it possible to reuse existing sample fetch methods with other matching methods (eg: regex). A "found" matching method ignores any pattern and only verifies that the required sample was found (useful for cookies).	2013-04-03 02:13:01 +02:00
Willy Tarreau	d76a98a5fc	MEDIUM: acl: move the ->parse, ->match and ->smp fields to acl_expr We'll need each ACL expression to be able to support its own parse and match methods, so we're moving these fields to the ACL expression.	2013-04-03 02:13:01 +02:00
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	a91d0a583c	MAJOR: acl: convert all ACL requires to SMP use+val instead of ->requires The ACLs now use the fetch's ->use and ->val to decide upon compatibility between the place where they are used and where the information are fetched. The code is capable of reporting warnings about very fine incompatibilities between certain fetches and an exact usage location, so it is expected that some new warnings will be emitted on some existing configurations. Two degrees of detection are provided : - detecting ACLs that never match - detecting keywords that are ignored All tests show that this seems to work well, though bugs are still possible.	2013-04-03 02:13:00 +02:00
Willy Tarreau	25320b2906	MEDIUM: proxy: remove acl_requires and just keep a flag "http_needed" Proxy's acl_requires was a copy of all bits taken from ACLs, but we'll get rid of ACL flags and only rely on sample fetches soon. The proxy's acl_requires was only used to allocate an HTTP context when needed, and was even forced in HTTP mode. So better have a flag which exactly says what it's supposed to be used for.	2013-04-03 02:13:00 +02:00
Willy Tarreau	4a96bf5a5d	CLEANUP: acl: remove ACL hooks which were never used These hooks, which established the relation between ACL_USE_* and the location where the ACL were used, were never used because they were superseded with the sample capabilities. Remove them now.	2013-04-03 02:12:59 +02:00
Willy Tarreau	9baae63d8d	MAJOR: acl: remove fetch argument validation from the ACL struct ACL fetch being inherited from the sample fetch keyword, we don't need anymore to specify what function to use to validate the fetch arguments. Note that the job is still done in the ACL parsing code based on elements from the sample fetch structs.	2013-04-03 02:12:59 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	d4c33c8889	MEDIUM: samples: move payload-based fetches and ACLs to their own file The file acl.c is a real mess, it both contains functions to parse and process ACLs, and some sample extraction functions which act on buffers. Some other payload analysers were arbitrarily dispatched to proto_tcp.c. So now we're moving all payload-based fetches and ACLs to payload.c which is capable of extracting data from buffers and rely on everything that is protocol-independant. That way we can safely inflate this file and only use the other ones when some fetches are really specific (eg: HTTP, SSL, ...). As a result of this cleanup, the following new sample fetches became available even if they're not really useful : always_false, always_true, rep_ssl_hello_type, rdp_cookie_cnt, req_len, req_ssl_hello_type, req_ssl_sni, req_ssl_ver, wait_end The function 'acl_fetch_nothing' was wrong and never used anywhere so it was removed. The "rdp_cookie" sample fetch used to have a mandatory argument while it was optional in ACLs, which are supposed to iterate over RDP cookies. So we're making it optional as a fetch too, and it will return the first one.	2013-04-03 02:12:57 +02:00
Willy Tarreau	e0db1e8946	MEDIUM: acl: remove flag ACL_MAY_LOOKUP which is improperly used This flag is used on ACL matches that support being looking up patterns in trees. At the moment, only strings and IPs support tree-based lookups, but the flag is randomly set also on integers and binary data, and is not even always set on strings nor IPs. Better get rid of this mess by only relying on the matching function to decide whether or not it supports tree-based lookups, this is safer and easier to maintain.	2013-04-03 02:12:56 +02:00
Hiroaki Nakamura	7035132349	MEDIUM: regex: Use PCRE JIT in acl This is a patch for using PCRE JIT in acl. I notice regex are used in other places, but they are more complicated to modify to use PCRE APIs. So I focused to acl in the first try. BTW, I made a simple benchmark program for PCRE JIT beforehand. https://github.com/hnakamur/pcre-jit-benchmark I read the manual for PCRE JIT http://www.manpagez.com/man/3/pcrejit/ and wrote my benchmark program. https://github.com/hnakamur/pcre-jit-benchmark/blob/master/test-pcre.c	2013-04-02 00:02:54 +02:00
Willy Tarreau	f678b7f322	BUG/MINOR: config: fix improper check for failed memory alloc in ACL parser The wrong variable is checked after a calloc() so a memory shortage would result in a segfault while loading the config instead of a clean error. This fix may be backported to 1.4 and 1.3 which are both affected. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:18 +01:00
Willy Tarreau	4a6e5c6d69	BUG/MEDIUM: acl: make prue_acl_expr() correctly free ACL expressions upon exit When leaving, during the deinit() process, prune_acl_expr() is called to delete all ACL expressions. A bug was introduced with commit `34db1084` that caused every other expression argument to be skipped, and more annoyingly, it introduced the risk of scanning past the arg list and crashing or freezing the old process during a reload. Credits for finding this issue go to Dmitry Sivachenko who first reported it, and second did a lot of research to narrow it down to a minimal configuration.	2012-11-24 00:02:14 +01:00
Willy Tarreau	7d1df41171	BUG/MEDIUM: acl: correctly resolve all args, not just the first one Since 1.5-dev9, ACLs support multiple args. The changes performed in acl_find_targets() were bogus as they were not always applied to the current argument being processed, but sometimes to the first one only. Fortunately till now, all ACLs which support resolvable arguments have it in the first place only, so there was no impact.	2012-11-23 23:47:36 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
Cyril Bont�	4c01beb64b	BUG/MEDIUM: acls using IPv6 subnets patterns incorrectly match IPs Some tests revealed that IPs not in the range of IPv6 subnets incorrectly matched (for example "acl BUG src 2804::/16" applied to a src IP "127.0.0.1"). This is caused by the acl_match_ip() function applies a mask in host byte order, whereas it should be in network byte order.	2012-10-24 01:00:53 +02:00
Emeric Brun	07ca496ea9	MINOR: acl: add parse and match primitives to use binary type on ACLs Binary ACL match patterns can now be entered as hex digit strings.	2012-10-22 18:54:24 +02:00
Willy Tarreau	2e845be249	MEDIUM: sample: pass an empty list instead of a null for fetch args ACL and sample fetches use args list and it is really not convenient to check for null args everywhere. Now for empty args we pass a constant list of end of lists. It will allow us to remove many useless checks.	2012-10-19 19:49:09 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	f332af7715	CLEANUP: acl: use 'chn' instead of 'b' to name channel pointers As with previous patches, this naming is confusing.	2012-10-12 23:58:13 +02:00
Willy Tarreau	eb6cead1de	MINOR: standard: make memprintf() support a NULL destination Doing so removes many checks that were systematically made because the callees don't know if the caller passed a valid pointer.	2012-09-24 10:53:16 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	e3a461118c	BUG/MINOR: ACL implicit arguments must be created with unresolved flag Commit 496aa0 fixed a design issue by adding an "unresolved" flag to the ACL arguments. Unfortunately this unresolved flag was not set when building the fake argument some ACL need when using an implicit argument pointing to the local proxy. Special thanks to Michael Kearey who reported the issue with a reproducer and the commit introducing the bug.	2012-06-15 08:02:34 +02:00
Willy Tarreau	496aa0111e	BUG/MEDIUM: ensure that unresolved arguments are freed exactly once When passing arguments to ACLs and samples, some types are stored as strings then resolved later after config parsing is done. Upon exit, the arguments need to be freed only if the string was not resolved yet. At the moment we can encounter double free during deinit() because some arguments (eg: userlists) are freed once as their own type and once as a string. The solution consists in adding an "unresolved" flag to the args to say whether the value is still held in the <str> part or is final. This could be debugged thanks to a useful bug report from Sander Klein.	2012-06-01 10:40:52 +02:00
David du Colombier	7af4605ef7	BUG/MAJOR: trash must always be the size of a buffer Before it was possible to resize the buffers using global.tune.bufsize, the trash has always been the size of a buffer by design. Unfortunately, the recent buffer sizing at runtime forgot to adjust the trash, resulting in it being too short for content rewriting if buffers were enlarged from the default value. The bug was encountered in 1.4 so the fix must be backported there.	2012-05-16 14:21:55 +02:00
Willy Tarreau	46b39d0dc6	BUG/MEDIUM: config: don't crash at config load time on invalid userlist names Cyril Bont� reported that passing an invalid userlist name to http_auth_group() caused haproxy to crash at load. This was due to an attempt to use the unresolved userlist pointer later to resolve auth groups since we report many errors before leaving now. This issue does not exist in earlier versions since they immediately abort on the first error, so no backport is needed.	2012-05-10 23:42:22 +02:00
Willy Tarreau	ceb4ac9c34	MEDIUM: acl: support IPv6 address matching Make use of the new IPv6 pattern type so that acl_match_ip() knows how to compare pattern and sample. IPv6 may be entered in their usual form, with or without a netmask appended. Only bit counts are accepted for IPv6 netmasks. In order to avoid any risk of trouble with randomly resolved IP addresses, host names are never allowed in IPv6 patterns. HAProxy is also able to match IPv4 addresses with IPv6 addresses in the following situations : - tested address is IPv4, pattern address is IPv4, the match applies in IPv4 using the supplied mask if any. - tested address is IPv6, pattern address is IPv6, the match applies in IPv6 using the supplied mask if any. - tested address is IPv6, pattern address is IPv4, the match applies in IPv4 using the pattern's mask if the IPv6 address matches with 2002:IPV4::, ::IPV4 or ::ffff:IPV4, otherwise it fails. - tested address is IPv4, pattern address is IPv6, the IPv4 address is first converted to IPv6 by prefixing ::ffff: in front of it, then the match is applied in IPv6 using the supplied IPv6 mask.	2012-05-08 21:28:14 +02:00
Willy Tarreau	c92ddbc37d	MINOR: acl: add types to ACL patterns We cannot currently match IPv6 addresses in ACL simply because we don't support types on the patterns. Let's introduce this notion. For now, we rely on the SMP_TYPES though it doesn't seem like it will last forever given that some types are not present there (eg: regex, meth). Still it should be enough to support mixed matchings for most types. We use the special impossible value SMP_TYPES for types that don't exist in the SMP_T_* space.	2012-05-08 20:57:21 +02:00
Willy Tarreau	7dcb6480db	MEDIUM: acl: extend the pattern parsers to report meaningful errors By passing the error pointer to all ACL parsers, we can make them report useful errors and not simply fail.	2012-05-08 20:57:20 +02:00
Willy Tarreau	08ad0b38c4	MINOR: acl: report errors encountered when loading patterns from files This happens in acl_read_patterns_from_file(). Errors are still incomplete, parsing functions must be improved to report parsing errors.	2012-05-08 20:57:20 +02:00
Willy Tarreau	b7451bb660	MEDIUM: acl: report parsing errors to the caller All parsing errors were known but impossible to return. Now by making use of memprintf(), we're able to build meaningful error messages that the caller can display.	2012-05-08 20:57:20 +02:00
Willy Tarreau	ae52f06da3	MINOR: acl: add a val_args field to keywords This will make it possible to delegate argument validating to functions shared with smp_fetch_*.	2012-05-08 20:57:19 +02:00
Willy Tarreau	7a777edbdf	MINOR: acl: set SMP_OPT_ITERATE on fetch functions This way, fetch functions will be able to tell if they're called for a single request or as part of a loop. This is important for instance when we use hdr(foo), because in an ACL this means that all hdr(foo) occurrences must be checked while in a pattern it means only one of them (eg: last one).	2012-05-08 20:57:18 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	32389b7d04	MEDIUM: acl/pattern: switch rdp_cookie functions stack up-down Previously, both pattern, backend and persist_rdp_cookie would build fake ACL expressions to fetch an RDP cookie by calling acl_fetch_rdp_cookie(). Now we switch roles. The RDP cookie fetch function is provided as a sample fetch function that all others rely on, including ACL. The code is exactly the same, only the args handling moved from expr->args to args. The code was moved to proto_tcp.c, but probably that a dedicated file would be more suited to content handling.	2012-05-08 20:57:16 +02:00
Willy Tarreau	21e5b0e3cb	MEDIUM: get rid of SMP_F_READ_ONLY and SMP_F_MUST_FREE These ones were either unused or improperly used. Some integers were marked read-only, which does not make much sense. Buffers are not read-only, they're "constant" in that they must be kept intact after any possible change.	2012-05-08 20:57:15 +02:00
Willy Tarreau	197e10aaae	MEDIUM: acl: get rid of the SET_RES flags We now simply rely on a boolean result from a fetch to declare a match. Booleans are not compared against patterns, they fix the result.	2012-05-08 20:57:15 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	8f7406e9b4	MEDIUM: acl: remove the ACL_TEST_F_NULL_MATCH flag This flag was used to force a boolean match even if there was no pattern to match. It was used only by http_auth() and designed only for this one. It's easier and cleaner to make the fetch function perform the test and report the boolean result as a few other functions already do. It simplifies the acl_exec_cond() logic and will help merging ACLs and patterns.	2012-05-08 20:57:13 +02:00

1 2 3

120 Commits