haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 08:37:04 +02:00

Author	SHA1	Message	Date
Tim Duesterhus	b584b4475b	BUG/MINOR: pattern: Do not pass len = 0 to calloc() The behavior of calloc() when being passed `0` as `nelem` is implementation defined. It may return a NULL pointer. Avoid this issue by checking before allocating. While doing so adjust the local integer variables that are used to refer to memory offsets to `size_t`. This issue was introced in commit `f91ac19299`. This patch should be backported together with that commit.	2020-03-18 05:17:28 +01:00
Willy Tarreau	52bf839394	BUG/MEDIUM: random: implement a thread-safe and process-safe PRNG This is the replacement of failed attempt to add thread safety and per-process sequences of random numbers initally tried with commit `1c306aa84d` ("BUG/MEDIUM: random: implement per-thread and per-process random sequences"). This new version takes a completely different approach and doesn't try to work around the horrible OS-specific and non-portable random API anymore. Instead it implements "xoroshiro128*", a reputedly high quality random number generator, which is one of the many variants of xorshift, which passes all quality tests and which is described here: http://prng.di.unimi.it/ While not cryptographically secure, it is fast and features a 2^128-1 period. It supports fast jumps allowing to cut the period into smaller non-overlapping sequences, which we use here to support up to 2^32 processes each having their own, non-overlapping sequence of 2^96 numbers (~710^28). This is enough to provide 1 billion randoms per second and per process for 2200 billion years. The implementation was made thread-safe either by using a double 64-bit CAS on platforms supporting it (x86_64, aarch64) or by using a local lock for the time needed to perform the shift operations. This ensures that all threads pick numbers from the same pool so that it is not needed to assign per-thread ranges. For processes we use the fast jump method to advance the sequence by 2^96 for each process. Before this patch, the following config: global nbproc 8 frontend f bind :4445 mode http log stdout format raw daemon log-format "%[uuid] %pid" redirect location / Would produce this output: a4d0ad64-2645-4b74-b894-48acce0669af 12987 a4d0ad64-2645-4b74-b894-48acce0669af 12992 a4d0ad64-2645-4b74-b894-48acce0669af 12986 a4d0ad64-2645-4b74-b894-48acce0669af 12988 a4d0ad64-2645-4b74-b894-48acce0669af 12991 a4d0ad64-2645-4b74-b894-48acce0669af 12989 a4d0ad64-2645-4b74-b894-48acce0669af 12990 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12987 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12992 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12986 (...) And now produces: f94b29b3-da74-4e03-a0c5-a532c635bad9 13011 47470c02-4862-4c33-80e7-a952899570e5 13014 86332123-539a-47bf-853f-8c8ea8b2a2b5 13013 8f9efa99-3143-47b2-83cf-d618c8dea711 13012 3cc0f5c7-d790-496b-8d39-bec77647af5b 13015 3ec64915-8f95-4374-9e66-e777dc8791e0 13009 0f9bf894-dcde-408c-b094-6e0bb3255452 13011 49c7bfde-3ffb-40e9-9a8d-8084d650ed8f 13014 e23f6f2e-35c5-4433-a294-b790ab902653 13012 There are multiple benefits to using this method. First, it doesn't depend anymore on a non-portable API. Second it's thread safe. Third it is fast and more proven than any hack we could attempt to try to work around the deficiencies of the various implementations around. This commit depends on previous patches "MINOR: tools: add 64-bit rotate operators" and "BUG/MEDIUM: random: initialize the random pool a bit better", all of which will need to be backported at least as far as version 2.0. It doesn't require to backport the build fixes for circular include files dependecy anymore.	2020-03-08 10:09:02 +01:00
Willy Tarreau	0fbf28a05b	Revert "BUG/MEDIUM: random: implement per-thread and per-process random sequences" This reverts commit `1c306aa84d`. It breaks the build on all non-glibc platforms. I got confused by the man page (which possibly is the most confusing man page I've ever read about a standard libc function) and mistakenly understood that random_r was portable, especially since it appears in latest freebsd source as well but not in released versions, and with a slightly different API :-/ We need to find a different solution with a fallback. Among the possibilities, we may reintroduce this one with a fallback relying on locking around the standard functions, keeping fingers crossed for no other library function to call them in parallel, or we may also provide our own PRNG, which is not necessarily more difficult than working around the totally broken up design of the portable API.	2020-03-07 11:24:39 +01:00
Willy Tarreau	1c306aa84d	BUG/MEDIUM: random: implement per-thread and per-process random sequences As mentioned in previous patch, the random number generator was never made thread-safe, which used not to be a problem for health checks spreading, until the uuid sample fetch function appeared. Currently it is possible for two threads or processes to produce exactly the same UUID. In fact it's extremely likely that this will happen for processes, as can be seen with this config: global nbproc 8 frontend f bind :4445 mode http log stdout daemon format raw log-format "%[uuid] %pid" redirect location / It typically produces this log: 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30645 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30641 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30644 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30639 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30646 07764439-c24d-4e6f-a5a6-0138be59e7a8 30645 07764439-c24d-4e6f-a5a6-0138be59e7a8 30639 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30643 07764439-c24d-4e6f-a5a6-0138be59e7a8 30646 b6773fdd-678f-4d04-96f2-4fb11ad15d6b 30646 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30642 07764439-c24d-4e6f-a5a6-0138be59e7a8 30642 What this patch does is to use a distinct per-thread and per-process seed to make sure the same sequences will not appear, and will then extend these seeds by "burning" a number of randoms that depends on the global random seed, the thread ID and the process ID. This adds roughly 20 extra bits of randomness, resulting in 52 bits total per thread and per process. It only takes a few milliseconds to burn these randoms and given that threads start with a different seed, we know they will not catch each other. So these random extra bits are essentially added to ensure randomness between boots and cluster instances. This replaces all uses of random() with ha_random() which uses the thread-local state. This must be backported as far as 2.0 or any version having the UUID sample-fetch function since it's the main victim here. It's important to note that this patch, in addition to depending on the previous one "BUG/MEDIUM: init: initialize the random pool a bit better", also depends on the preceeding build fixes to address a circular dependency issue in the include files that prevented it from building. Part or all of these patches may need to be backported or adapted as well.	2020-03-07 06:11:15 +01:00
Carl Henrik Lunde	f91ac19299	OPTIM: startup: fast unique_id allocation for acl. pattern_finalize_config() uses an inefficient algorithm which is a problem with very large configuration files. This affects startup, and therefore reload time. When haproxy is deployed as a router in a Kubernetes cluster the generated configuration file may be large and reloads are frequently occuring, which makes this a significant issue. The old algorithm is O(n^2) * allocate missing uids - O(n^2) * sort linked list - O(n^2) The new algorithm is O(n log n): * find the user allocated uids - O(n) * store them for efficient lookup - O(n log n) * allocate missing uids - n times O(log n) * sort all uids - O(n log n) * convert back to linked list - O(n) Performance examples, startup time in seconds: pat_refs old new 1000 0.02 0.01 10000 2.1 0.04 20000 12.3 0.07 30000 27.9 0.10 40000 52.5 0.14 50000 77.5 0.17 Please backport to 1.8, 2.0 and 2.1.	2020-03-06 08:11:58 +01:00
Willy Tarreau	296cfd17ef	MINOR: pattern: fix all remaining strict aliasing issues There were still a number of struct casts from various sizes. All of them were now replaced with read_u32(), read_u16(), read_u64() or memcpy().	2020-02-25 10:24:51 +01:00
Jerome Magnin	b8bd6d7efd	BUILD: pattern: include errno.h Commit `3c79d4bdc` introduced the use of errno in pattern.c without including errno.h. If we build haproxy without any option errno is not defined and the build fails.	2020-01-17 18:30:06 +01:00
Jerome Magnin	3c79d4bdc4	BUG/MINOR: pattern: handle errors from fgets when trying to load patterns We need to do some error handling after we call fgets to make sure everything went fine. If we don't users can be fooled into thinking they can load pattens from directory because cfgparse doesn't flinch. This applies to acl patterns map files. This should be backported to all supported versions.	2020-01-17 17:09:50 +01:00
Willy Tarreau	403bfbb130	BUG/MEDIUM: pattern: make the pattern LRU cache thread-local and lockless As reported in issue #335, a lot of contention happens on the PATLRU lock when performing expensive regex lookups. This is absurd since the purpose of the LRU cache was to have a fast cache for expressions, thus the cache must not be shared between threads and must remain lockless. This commit makes the LRU cache thread-local and gets rid of the PATLRU lock. A test with 7 threads on 4 cores climbed from 67kH/s to 369kH/s, or a scalability factor of 5.5. Given the huge performance difference and the regression caused to users migrating from processes to threads, this should be backported at least to 2.0. Thanks to Brian Diekelman for his detailed report about this regression.	2019-10-23 07:27:25 +02:00
Willy Tarreau	33ccf1cce0	BUILD: pattern: work around an internal compiler bug in gcc-3.4 gcc-3.4 fails to compile pattern.c : src/pattern.c: In function `pat_match_ip': src/pattern.c:1092: error: unrecognizable insn: (insn 186 185 187 9 src/pattern.c:970 (set (reg/f:SI 179) (high:SI (const:SI (plus:SI (symbol_ref:SI ("static_pattern") [flags 0x22] <var_decl fe5bae80 static_pattern>) (const_int 8 [0x8]))))) -1 (nil) (nil)) src/pattern.c:1092: internal compiler error: in extract_insn, at recog.c:2083 This happens when performing the memcpy() on the union, and in this case the workaround is trivial (and even cleaner) using a cast instead.	2019-06-16 18:40:33 +02:00
Dragan Dosen	2674303912	MEDIUM: regex: modify regex_comp() to atomically allocate/free the my_regex struct Now we atomically allocate the my_regex struct within function regex_comp() and compile the regex or free both in case of failure. The pointer to the allocated my_regex struct is returned directly. The my_regex* argument to regex_comp() is removed. Function regex_free() was modified so that it systematically frees the my_regex entry. The function does nothing when called with a NULL as argument (like free()). It will avoid existing risk of not properly freeing the initialized area. Other structures are also updated in order to be compatible (the ones related to Lua and action rules).	2019-05-07 06:58:15 +02:00
Dragan Dosen	e99af978c8	BUG/MEDIUM: pattern: fix memory leak in regex pattern functions The allocated regex is not freed properly and can cause a memory leak, eg. when patterns are updated via CLI socket. This patch should be backported to all supported versions.	2019-05-02 10:05:11 +02:00
Willy Tarreau	49ee3b2f9a	BUG/MAJOR: map/acl: real fix segfault during show map/acl on CLI A previous commit `8d85aa44d` ("BUG/MAJOR: map: fix segfault during 'show map/acl' on cli.") was provided to address a concurrency issue between "show acl" and "clear acl" on the CLI. Sadly the code placed there was copy-pasted without changing the element type (which was struct stream in the original code) and not tested since the crash is still present. The reproducer is simple : load a large ACL file (e.g. geolocation addresses), issue "show acl #0" in loops in one window and issue a "clear acl #0" in the other one, haproxy crashes. This fix was also tested with threads enabled and looks good since the locking seems to work correctly in these areas though. It will have to be backported as far as 1.6 since the commit above went that far as well...	2019-04-30 11:50:59 +02:00
Willy Tarreau	86abe44e42	MEDIUM: init: use self-initializing spinlocks and rwlocks This patch replaces a number of __decl_hathread() followed by HA_SPIN_INIT or HA_RWLOCK_INIT by the new __decl_spinlock() or __decl_rwlock() which automatically registers the lock for initialization in during the STG_LOCK init stage. A few static modifiers were lost in the process, but since they were not essential at all it was not worth extending the API to provide such a variant.	2018-11-26 19:50:32 +01:00
Joseph Herlant	4189d671b7	CLEANUP: Fix typos in the pattern subsystem Fixes typos in the code comments of the pattern subsystem.	2018-11-18 22:23:15 +01:00
Dragan Dosen	f147479bd5	BUG/MEDIUM: patterns: fix possible double free when reloading a pattern list A null pointer assignment was missing after free() in function pat_ref_reload() which can lead to segfault. This bug was introduced in commit `b5997f7` ("MAJOR: threads/map: Make acls/maps thread safe"). Must be backported to 1.8.	2018-09-19 06:46:51 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
William Lallemand	579fb25b62	BUG/MAJOR: map: fix a segfault when using http-request set-map The bug happens with an existing entry, when you try to overwrite the value with wrong data, for example, a string when the type is INT. The code path was not secure and tried to set err and merr while err = merr = NULL when performing an http action. Must be backported in 1.6, 1.7, 1.8.	2018-06-11 11:02:06 +02:00
Aur�lien Nephtali	564d15a71e	BUG/MINOR: pattern: Add a missing HA_SPIN_INIT() in pat_ref_newid() pat_ref_newid() is lacking a spinlock init. It was probably forgotten in `b5997f740b` ("MAJOR: threads/map: Make acls/maps thread safe"). Signed-off-by: Aur�lien Nephtali <aurelien.nephtali@corp.ovh.com>	2018-04-19 17:49:48 +02:00
Christopher Faulet	9dcf9b6f03	MINOR: threads: Use __decl_hathreads to declare locks This macro should be used to declare variables or struct members depending on the USE_THREAD compile option. It avoids the encapsulation of such declarations between #ifdef/#endif. It is used to declare all lock variables.	2017-11-13 11:38:17 +01:00
Christopher Faulet	09fdf4b112	BUG/MINOR: pattern: Rely on the sample type to copy it in pattern_exec_match To be thread safe, the function pattern_exec_match copy data (the pattern and the inner sample) in thread-local variables. But when the sample is duplicated, we must check its type and not the pattern one. This is specific to threads, no backport is needed.	2017-11-09 17:19:20 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Emeric Brun	b5997f740b	MAJOR: threads/map: Make acls/maps thread safe locks have been added in pat_ref and pattern_expr structures to protect all accesses to an instance of on of them. Moreover, a global lock has been added to protect the LRU cache used for pattern matching. Patterns are now duplicated after a successfull matching, to avoid modification by other threads when the result is used. Finally, the function reloading a pattern list has been modified to be thread-safe.	2017-10-31 13:58:32 +01:00
Emeric Brun	7d27f3c12d	BUG/MEDIUM: map/acl: fix unwanted flags inheritance. The bug: Maps/ACLs using the same file/id can mistakenly inherit their flags from the last declared one. i.e. $ cat haproxy.conf listen mylistener mode http bind 0.0.0.0:8080 acl myacl1 url -i -f mine.acl acl myacl2 url -f mine.acl acl myacl3 url -i -f mine.acl redirect location / if myacl2 $ cat mine.acl foobar Shows an unexpected redirect for request 'GET /FOObAR HTTP/1.0\n\n'. This fix should be backported on mainline branches v1.6 and v1.7.	2017-07-04 10:45:53 +02:00
Emeric Brun	8d85aa44da	BUG/MAJOR: map: fix segfault during 'show map/acl' on cli. The reference of the current map/acl element to dump could be destroyed if map is updated from an 'http-request del-map' configuration rule or throught a 'del map/acl' on CLI. We use a 'back_refs' chaining element to fix this. As it is done to dump sessions. This patch needs also fix: 'BUG/MAJOR: cli: fix custom io_release was crushed by NULL.' To clean the back_ref and avoid a crash on a further del/clear map operation. Those fixes should be backported on mainline branches 1.7 and 1.6. This patch wont directly apply on 1.6.	2017-06-30 06:49:42 +02:00
Andreas Seltenreich	e6e22e8e90	BUG/MINOR: pattern: Avoid memory leak on out-of-memory condition pattern_new_expr() failed to free the allocated list element when an out-of-memory error occurs during initialization of the element. As this only happens when loading the configuration file or evaluating commands via the CLI, it is unlikely for this leak to be relevant unless the user makes automated, heavy use of the CLI. Found in HAProxy 1.5.14.	2016-03-13 07:47:25 +01:00
Andreas Seltenreich	f0653192e3	CLEANUP: pattern: Ignore unknown samples in pat_match_ip(). Ignore samples that are neither SMP_T_IPV4 nor SMP_T_IPV6 instead of matching with an uninitialized value in this case. This situation should not occur in the current codebase but triggers warnings in static code analysis tools. Found in haproxy 1.5.	2016-03-08 12:55:06 +01:00
Thierry Fournier	8feaa661b6	MINOR: map: Add regex matching replacement This patch declares a new map which provides a string based on a string with back references replaced by the content matched by the regex.	2016-02-10 23:38:34 +01:00
peter cai	aede6ddd1f	BUG/MEDIUM: pattern: fixup use_after_free in the pat_ref_delete_by_id I found there is use_after_free bug in the pat_ref_delete_by_id. [wt: it seems this fix must be backported to 1.5 as well]	2015-10-13 18:31:49 +02:00
Thierry FOURNIER	136f9d34a9	MINOR: samples: rename union from "data" to "u" The union name "data" is a little bit heavy while we read the source code because we can read "data.data.sint". The rename from "data" to "u" makes the read easiest like "data.u.sint".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8c542cac07	MEDIUM: samples: Use the "struct sample_data" in the "struct sample" This patch remove the struct information stored both in the struct sample_data and in the striuct sample. Now, only thestruct sample_data contains data, and the struct sample use the struct sample_data for storing his own data.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	503bb09873	MINOR: samples: rename some struct member from "smp" to "data" This members contains data and not sample.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	12ba0c29d6	MINOR: samples: rename a struct from sample_storage to sample_data This a first step of sample reorganization.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	07ee64ef4d	MAJOR: sample: converts uint and sint in 64 bits signed integer This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed. This makes easy the usage of integer and clarify signed and unsigned use. With the previous version, signed and unsigned are used ones in place of others, and sometimes the converter loose the sign. For example, divisions are processed with "unsigned", if one entry is negative, the result is wrong. Note that the integer pattern matching and dotted version pattern matching are already working with signed 64 bits integer values. There is one user-visible change : the "uint()" and "sint()" sample fetch functions which used to return a constant integer have been replaced with a new more natural, unified "int()" function. These functions were only introduced in the latest 1.6-dev2 so there's no impact on regular deployments.	2015-07-22 00:48:23 +02:00
Christopher Faulet	f90ac55d9e	MINOR: lru: Add the possibility to free data when an item is removed Now, When a item is committed in an LRU tree, you can define a function to free data owned by this item. This function will be called when the item is removed from the LRU tree or when the tree is destroyed..	2015-06-12 18:06:59 +02:00
Willy Tarreau	aee9314e71	BUILD: pattern: fix build warnings introduced in the LRU cache They're caused by the cast to long long from ptr in 32-bit. src/pattern.c: In function 'pat_match_str': src/pattern.c:479:44: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast]	2015-05-04 17:18:42 +02:00
Willy Tarreau	f3045d2a06	MAJOR: pattern: add LRU-based cache on pattern matching The principle of this cache is to have a global cache for all pattern matching operations which rely on lists (reg, sub, dir, dom, ...). The input data, the expression and a random seed are used as a hashing key. The cached entries contains a pointer to the expression and a revision number for that expression so that we don't accidently used obsolete data after a pattern update or a very unlikely hash collision. Regarding the risk of collisions, 10k entries at 10k req/s mean 1% risk of a collision after 60 years, that's already much less than the memory's reliability in most machines and more durable than most admin's life expectancy. A collision will result in a valid result to be returned for a different entry from the same list. If this is not acceptable, the cache can be disabled using tune.pattern.cache-size. A test on a file containing 10k small regex showed that the regex matching was limited to 6k/s instead of 70k with regular strings. When enabling the LRU cache, the performance was back to 70k/s.	2015-04-29 19:15:24 +02:00
Willy Tarreau	72f073b6c7	MEDIUM: pattern: add a revision to all pattern expressions This will be used to detect any change on the pattern list between two operations, ultimately making it possible to implement a cache which immediately invalidates obsolete keys after an update. The revision is simply taken from the timestamp counter to ensure that even upon a pointer reuse we cannot accidently come back to the same (expr,revision) tuple.	2015-04-29 19:15:24 +02:00
Thierry FOURNIER	73bc285be1	BUG/MEDIUM: pattern: some entries are not deleted with case insensitive match ACL or map entries are not deleted with the command "del acl" or "del map" if the case insentive flag is set. This is because the the case insensitive string are stored in a list and the default delete function associated with string looks in a tree. I add a check of the case insensitive flag and execute the delete function for lists if it is set. This patch must be backported in 1.5 version.	2015-02-06 18:04:36 +01:00
Thierry FOURNIER	8aa8384e22	BUG/MINOR: pattern: error message missing This patch must be backported in 1.5 version.	2015-02-06 18:04:27 +01:00
Willy Tarreau	4deaf39243	BUG/MEDIUM: patterns: previous fix was incomplete Dmitry Sivachenko <trtrmitya@gmail.com> reported that commit `315ec42` ("BUG/MEDIUM: pattern: don't load more than once a pattern list.") relies on an uninitialised variable in the stack. While it used to work fine during the tests, if the uninitialized variable is non-null, some patterns may be aggregated if loaded multiple times, resulting in slower processing, which was the original issue it tried to address. The fix needs to be backported to 1.5.	2014-11-26 13:17:03 +01:00
Thierry FOURNIER	315ec4217f	BUG/MEDIUM: pattern: don't load more than once a pattern list. A memory optimization can use the same pattern expression for many equal pattern list (same parse method, index method and index_smp method). The pattern expression is returned by "pattern_new_expr", but this function dont indicate if the returned pattern is already in use. So, the caller function reload the list of patterns in addition with the existing patterns. This behavior is not a problem with tree indexed pattern, but it grows the lists indexed patterns. This fix add a "reuse" flag in return of the function "pattern_new_expr". If the flag is set, I suppose that the patterns are already loaded. This fix must be backported into 1.5.	2014-11-24 15:40:16 +01:00
Willy Tarreau	5def8ef786	BUG/MINOR: pattern: remove useless allocation of unused trash in pat_parse_reg() Just like previous patch, this is a remains of an early implementation. Also fix the outdated comments above. The fix may be backported to 1.5 though the bug cannot be triggerred, thus it's just a matter of keeping the code clean.	2014-08-29 15:19:33 +02:00
Thierry FOURNIER	b8f980cc19	MINOR: regex: Create JIT compatible function that return match strings This patchs rename the "regex_exec" to "regex_exec2". It add a new "regex_exec", "regex_exec_match" and "regex_exec_match2" function. This function can match regex and return array containing matching parts. Otherwise, this function use the compiled method (JIT or PCRE or POSIX). JIT require a subject with length. PCREPOSIX and native POSIX regex require a null terminted subject. The regex_exec* function are splited in two version. The first version take a null terminated string, but it execute strlen() on the subject if it is compiled with JIT. The second version (terminated by "2") take the subject and the length. This version adds a null character in the subject if it is compiled with PCREPOSIX or native POSIX functions. The documentation of posix regex and pcreposix says that the function returns 0 if the string matche otherwise it returns REG_NOMATCH. The REG_NOMATCH macro take the value 1 with posix regex and the value 17 with the pcreposix. The documentaion of the native pcre API (used with JIT) returns a negative number if no match, otherwise, it returns 0 or a positive number. This patch fix also the return codes of the regex_exec* functions. Now, these function returns true if the string match, otherwise it returns false.	2014-06-18 15:07:50 +02:00
Dirkjan Bussink	07fcaaa4cd	MINOR: fix a few memory usage errors These are either use after free errors or small leaks where memory is not free'd after some error state is detected.	2014-05-15 08:06:57 +02:00
Willy Tarreau	b1dd9bf308	MEDIUM: pattern: use ebtree's longest match to index/lookup string beginning Being able to map prefixes to values is already used for IPv4/IPv6 but was not yet used with strings. It can be very convenient to map directories to server farms but large lists may be slow. By using ebmb_insert_prefix() and ebmb_lookup_longest(), we can insert strings with their own length as a prefix, and lookup candidate strings and ensure that the longest matching one will be returned, which is the longest string matching the entry.	2014-05-10 08:53:48 +02:00
Thierry FOURNIER	e47e4e2385	BUG/MEDIUM: patterns: last fix was still not enough Last fix did address the issue for inlined patterns, but it was not enough because the flags are lost as well when updating patterns dynamically over the CLI. Also if the same file was used once with -i and another time without -i, their references would have been merged and both would have used the same matching method. It's appear that the patterns have two types of flags. The first ones are relative to the pattern matching, and the second are relative to the pattern storage. The pattern matching flags are the same for all the patterns of one expression. Now they are stored in the expression. The storage flags are information returned by the pattern mathing function. This information is relative to each entry and is stored in the "struct pattern". Now, the expression matching flags are forwarded to the parse and index functions. These flags are stored during the configuration parsing, and they are used during the parse and index actions. This issue was introduced in dev23 with the major pattern rework, and is a continuation of commit `a631fc8` ("BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns"). No backport is needed.	2014-04-28 14:19:17 +02:00
Willy Tarreau	a631fc8de8	BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns These flags are only passed to pattern_read_from_file() which loads the patterns from a file. The functions used to parse the patterns from the current line do not provide the means to pass the pattern flags so they're lost. This issue was introduced in dev23 with the major pattern rework, and was reported by Graham Morley. No backport is needed.	2014-04-27 09:21:08 +02:00
Willy Tarreau	3b78696858	BUG/MEDIUM: pattern: a typo breaks automatic acl/map numbering Dmitry Sivachenko reported that nice warning : src/pattern.c:2243:43: warning: if statement has empty body [-Wempty-body] if (&ref2->list == &pattern_reference); ^ src/pattern.c:2243:43: note: put the semicolon on a separate line to silence this warning It was merged as is with the code from commit `af5a29d` ("MINOR: pattern: Each pattern is identified by unique id"). So it looks like we can reassign an ID which is still in use because of this.	2014-04-26 12:41:32 +02:00

1 2 3

137 Commits