haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 09:07:02 +02:00

Author	SHA1	Message	Date
Willy Tarreau	5060326798	BUG/MINOR: sample: fix the closing bracket and LF in the debug converter The closing bracket was emitted for the "debug" converter even when the opening one was not sent, and the new line was not always emitted. Let's fix this. This is harmless since this converter is not built by default.	2019-12-17 09:04:38 +01:00
Damien Claisse	ae6f125c7b	MINOR: sample: add us/ms support to date/http_date It can be sometimes interesting to have a timestamp with a resolution of less than a second. It is currently painful to obtain this, because concatenation of date and date_us lead to a shorter timestamp during first 100ms of a second, which is not parseable and needs ugly ACLs in configuration to prepend 0s when needed. To improve this, add an optional <unit> parameter to date sample to report an integer with desired unit. Also support this unit in http_date converter to report a date string with sub-second precision.	2019-10-31 08:47:31 +01:00
Tim Duesterhus	4381d26edc	BUG/MINOR: sample: Make the `field` converter compatible with `-m found` Previously an expression like: path,field(2,/) -m found always returned `true`. Bug exists since the `field` converter exists. That is: `f399b0debf` The fix should be backported to 1.6+.	2019-10-21 15:49:42 +02:00
Luca Schimweg	8a694b859c	MINOR: sample: Add UUID-fetch Adds the fetch uuid(int). It returns a UUID following the format of version 4 in the RFC4122 standard. New feature, but could be backported.	2019-09-13 04:43:33 +02:00
Fr�d�ric L�caille	be36793d1d	BUG/MEDIUM: stick-table: Wrong stick-table backends parsing. When parsing references to stick-tables declared as backends, they are added to a list of proxies (they are proxies!) which refer to this stick-tables. Before this patch we added them to these list without checking they were already present, making the silly hypothesis the actions/sample were checked/resolved in the same order the proxies are parsed. This patch implement a simple inline function to in_proxies_list() to test the presence of a proxy in a list of proxies. We use this function when resolving /checking samples/actions. This bug was introduced by `015e4d7` commit. Must be backported to 2.0.	2019-08-07 10:32:31 +02:00
Fr�d�ric L�caille	9417f4534a	BUG/MAJOR: sample: Wrong stick-table name parsing in "if/unless" ACL condition. This bug was introduced by `1b8e68e` commit which supposed the stick-table was always stored in struct arg at parsing time. This is never the case with the usage of "if/unless" conditions in stick-table declared as backends. In this case, this is the name of the proxy which must be considered as the stick-table name. This must be backported to 2.0.	2019-06-21 09:48:28 +02:00
Tim Duesterhus	d437630237	MINOR: sample: Add sha2([<bits>]) converter This adds a converter for the SHA-2 family, supporting SHA-224, SHA-256 SHA-384 and SHA-512. The converter relies on the OpenSSL implementation, thus only being available when HAProxy is compiled with USE_OPENSSL. See GitHub issue #123. The hypothetical `ssl_?_sha256` fetch can then be simulated using `ssl_?_der,sha2(256)`: http-response set-header Server-Cert-FP %[ssl_f_der,sha2(256),hex]	2019-06-17 13:36:42 +02:00
Dragan Dosen	2674303912	MEDIUM: regex: modify regex_comp() to atomically allocate/free the my_regex struct Now we atomically allocate the my_regex struct within function regex_comp() and compile the regex or free both in case of failure. The pointer to the allocated my_regex struct is returned directly. The my_regex* argument to regex_comp() is removed. Function regex_free() was modified so that it systematically frees the my_regex entry. The function does nothing when called with a NULL as argument (like free()). It will avoid existing risk of not properly freeing the initialized area. Other structures are also updated in order to be compatible (the ones related to Lua and action rules).	2019-05-07 06:58:15 +02:00
Fr�d�ric L�caille	015e4d7d93	MINOR: stick-tables: Add peers process binding computing. Add a list of proxies for all the stick-tables (->proxies_list struct stktable member) so that to be able to compute the process bindings of the peers after having parsed the configuration file. The proxies are added to the stick-tables they reference when parsing stick-tables lines in proxy sections, when checking the actions in check_trk_action() and when resolving samples args for stick-tables without checking is they are duplicates. We check only there is no loop. Then, after having parsed everything, we add the proxy bindings to the peers frontend bindings with stick-tables they reference.	2019-05-07 06:54:07 +02:00
Fr�d�ric L�caille	1b8e68e89a	MEDIUM: stick-table: Stop handling stick-tables as proxies. This patch adds the support for the "table" line parsing in "peers" sections to declare stick-table in such sections. This also prevents the user from having to declare dummy backends sections with a unique stick-table inside. Even if still supported, this usage will become deprecated. To do so, the ->table member of proxy struct which is a stktable struct is replaced by a pointer to a stktable struct allocated at parsing time in src/cfgparse-listen.c for the dummy stick-table backends and in src/cfgparse.c for "peers" sections. This has an impact on the code for stick-table sample converters and on the stickiness rules parsers which first store the name of the dummy before resolving the rules. This patch replaces proxy_tbl_by_name() calls by stktable_find_by_name() calls to lookup for stick-tables stored in "stktable_by_name" ebtree at parsing time. There is only one remaining place where proxy_tbl_by_name() is used: src/hlua.c. At several places in the code we relied on the fact that ->size member of stick-table was equal to zero to consider the stick-table was present by not configured, this do not make sense anymore as ->table member of struct proxyis fow now on a pointer. These tests are replaced by a test on ->table value itself. In "peers" section we do not have to temporary store the name of the section the stick-table are attached to because this name is obviously already known just after having entered this "peers" section. About the CLI stick-table I/O handler, the pointer to proxy struct is replaced by a pointer to a stktable struct.	2019-05-07 06:54:06 +02:00
Fr�d�ric L�caille	bfe6138150	MINOR: sample: Add a protocol buffers specific converter. This patch adds "protobuf" protocol buffers specific converter wich may used in combination with "ungrpc" as first converter to extract a protocol buffers field value. It is simply implemented reusing protobuf_field_lookup() which is the protocol buffers specific parser already used by "ungrpc" converter which only parse a gRPC header in addition of parsing protocol buffers message. Update the documentation for this new "protobuf" converter.	2019-03-06 15:36:02 +01:00
Fr�d�ric L�caille	5f33f85ce8	MINOR: sample: Extract some protocol buffers specific code. We move the code responsible of parsing protocol buffers messages inside gRPC messages from sample.c to include/proto/protocol_buffers.h so that to reuse it to cascade "ungrpc" converter.	2019-03-06 15:36:02 +01:00
Fr�d�ric L�caille	756d97f205	MINOR: sample: Rework gRPC converter code. For now on, "ungrpc" may take a second optional argument to provide the protocol buffers types used to encode the field value to be extracted. When absent the field value is extracted as a binary sample which may then followed by others converters like "hex" which takes binary as input sample. When this second argument is a type which does not match the one found by "ungrpc", this field is considered as not found even if present. With this patch we also remove the useless "varint" and "svarint" converters. Update the documentation about "ungrpc" converters.	2019-03-05 11:04:23 +01:00
Fr�d�ric L�caille	7c93e88d0c	MINOR: sample: Code factorization "ungrpc" converter. Parsing protocol buffer fields always consists in skip the field if the field is not found or store the field value if found. So, with this patch we factorize a little bit the code for "ungrpc" converter.	2019-03-05 11:03:53 +01:00
Fr�d�ric L�caille	50290fbb42	MINOR: sample: Replace "req.ungrpc" smp fetch by a "ungrpc" converter. This patch simply extracts the code of smp_fetch_req_ungrpc() for "req.ungrpc" from http_fetch.c to move it to sample.c with very few modifications. Furthermore smp_fetch_body_buf() used to fetch the body contents is no more needed. Update the documentation for gRPC.	2019-03-04 08:28:42 +01:00
Fr�d�ric L�caille	fd95c62f1b	MINOR: sample: Add two sample converters for protocol buffers. Add "varint" to convert all the protocol buffers binary varints excepted the signed ones ("sint32" and "sint64") to an integer. The binary signed varints may be converted to an integer with "svarint" converter implemented by this patch. These two new converters do not take any argument.	2019-02-26 16:27:05 +01:00
Willy Tarreau	1a0fe3becd	BUG/MINOR: config: make sure to count the error on incorrect track-sc/stick rules When commit `151e1ca98` ("BUG/MAJOR: config: verify that targets of track-sc and stick rules are present") added a check for some process inconsistencies between rules and their stick tables, some errors resulted in a "return 0" statement, which is taken as "no error" in some cases. Let's fix this. This must be backported to all versions using the above commit.	2019-02-06 10:25:07 +01:00
Willy Tarreau	151e1ca989	BUG/MAJOR: config: verify that targets of track-sc and stick rules are present Stick and track-sc rules may optionally designate a table in a different proxy. In this case, a number of verifications are made such as validating that this proxy actually exists. However, in multi-process mode, the target table might indeed exist but not be bound to the set of processes the rules will execute on. This will definitely result in a random behaviour especially if these tables do require peer synchronization, because some tasks will be started to try to synchronize form uninitialized areas. The typical issue looks like this : peers my-peers peer foo ... listen proxy bind-process 1 stick on src table ip ... backend ip bind-process 2 stick-table type ip size 1k peers my-peers While it appears obvious that the example above will not work, there are less obvious situations, such as having bind-process in a defaults section and having a larger set of processes for the referencing proxy than the referenced one. The present patch adds checks for such situations by verifying that all processes from the referencing proxy are present on the other one in all track-sc* and stick-* rules, and in sample fetch / converters referencing another table so that sc_inc_gpc0() and similar are safe as well. This fix must be backported to all maintained versions. It may potentially disrupt configurations which already randomly crash. There hardly is any intermediary solution though, such configurations need to be fixed.	2019-02-05 11:54:49 +01:00
Olivier Houchard	4468f1cacb	BUG/MEDIUM: sample: Don't treat SMP_T_METH as SMP_T_STR. In smp_dup(), don't consider a SMP_T_METH with an unknown method the same as SMP_T_STR. The string and string length aren't stored at the same place. This should be backported to 1.8.	2018-12-07 15:31:43 +01:00
Willy Tarreau	0108d90c6c	MEDIUM: init: convert all trivial registration calls to initcalls This switches explicit calls to various trivial registration methods for keywords, muxes or protocols from constructors to INITCALL1 at stage STG_REGISTER. All these calls have in common to consume a single pointer and return void. Doing this removes 26 constructors. The following calls were addressed : - acl_register_keywords - bind_register_keywords - cfg_register_keywords - cli_register_kw - flt_register_keywords - http_req_keywords_register - http_res_keywords_register - protocol_register - register_mux_proto - sample_register_convs - sample_register_fetches - srv_register_keywords - tcp_req_conn_keywords_register - tcp_req_cont_keywords_register - tcp_req_sess_keywords_register - tcp_res_cont_keywords_register - flt_register_keywords	2018-11-26 19:50:32 +01:00
Willy Tarreau	70fe94419c	MINOR: sample: add cpu_calls, cpu_ns_avg, cpu_ns_tot, lat_ns_avg, lat_ns_tot These sample fetch keywords report performance metrics about the task calling them. They are useful to report in logs which requests consume too much CPU time and what negative performane impact it has on other requests. Typically logging cpu_ns_avg and lat_ns_avg will show culprits and victims.	2018-11-22 16:07:39 +01:00
Joseph Herlant	757f5ad73a	CLEANUP: Fix typos in the sample subsystem Fix some typos in the code comment of the sample subsystem.	2018-11-18 22:26:42 +01:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Tim Duesterhus	ca097c16a8	MINOR: sample: Add strcmp sample converter This converter supplements the existing string matching by allowing strings to be converted to a variable. Example usage: http-request set-var(txn.host) hdr(host) # Check whether the client is attempting domain fronting. acl ssl_sni_http_host_match ssl_fc_sni,strcmp(txn.host) eq 0	2018-04-28 07:03:39 +02:00
Willy Tarreau	9eb2a4addf	BUILD: sample: avoid build warning in sample.c Recent commit `9631a28` ("MEDIUM: sample: Extend functionality for field/word converters") introduced this minor build warning that this patch addresses : src/sample.c: In function 'sample_conv_word': src/sample.c:2108:8: warning: suggest explicit braces to avoid ambiguous 'else' [-Wparentheses] src/sample.c:2137:8: warning: suggest explicit braces to avoid ambiguous 'else' [-Wparentheses] No backport is needed.	2018-04-19 10:33:28 +02:00
Marcin Deranek	9631a28275	MEDIUM: sample: Extend functionality for field/word converters Extend functionality of field/word converters, so it's possible to extract field(s)/word(s) counting from the beginning/end and/or extract multiple fields/words (including separators) eg. str(f1_f2_f3__f5),field(2,_,2) # f2_f3 str(f1_f2_f3__f5),field(2,_,0) # f2_f3__f5 str(f1_f2_f3__f5),field(-2,_,3) # f2_f3_ str(f1_f2_f3__f5),field(-3,_,0) # f1_f2_f3 str(w1_w2_w3___w4),word(3,_,2) # w3___w4 str(w1_w2_w3___w4),word(2,_,0) # w2_w3___w4 str(w1_w2_w3___w4),word(-2,_,3) # w1_w2_w3 str(w1_w2_w3___w4),word(-3,_,0) # w1_w2 Change is backward compatible.	2018-04-17 11:27:48 +02:00
Emmanuel Hocdet	50791a7df3	MINOR: samples: add crc32c converter This patch adds the support of CRC32c (rfc4960).	2018-03-21 16:17:00 +01:00
Willy Tarreau	280f42b99e	MINOR: sample: add a new "concat" converter It's always a pain not to be able to combine variables. This commit introduces the "concat" converter, which appends a delimiter, a variable's contents and another delimiter to an existing string. The result is a string. This makes it easier to build composite variables made of other variables.	2018-02-19 15:34:12 +01:00
Tim Duesterhus	1478aa795e	MEDIUM: sample: Add IPv6 support to the ipmask converter Add an optional second parameter to the ipmask converter that specifies the number of bits to mask off IPv6 addresses. If the second parameter is not given IPv6 addresses fail to mask (resulting in an empty string), preserving backwards compatibility: Previously a sample like `src,ipmask(24)` failed to give a result for IPv6 addresses. This feature can be tested like this: defaults log global mode http option httplog option dontlognull timeout connect 5000 timeout client 50000 timeout server 50000 frontend fe bind :::8080 v4v6 # Masked IPv4 for IPv4, empty for IPv6 (with and without this commit) http-response set-header Test %[src,ipmask(24)] # Correctly masked IP addresses for both IPv4 and IPv6 http-response set-header Test2 %[src,ipmask(24,ffff:ffff:ffff:ffff::)] # Correctly masked IP addresses for both IPv4 and IPv6 http-response set-header Test3 %[src,ipmask(24,64)] default_backend be backend be server s example.com:80 Tested-By: Jarno Huuskonen <jarno.huuskonen@uef.fi>	2018-01-25 22:25:40 +01:00
Tim Duesterhus	bf5ce02eff	BUG/MINOR: sample: Fix output type of c_ipv62ip c_ipv62ip failed to set the output type of the cast to SMP_T_IPV4 even for a successful conversion. This bug exists as of commit `cc4d1716a2` which is the first commit adding this function. v1.6-dev4 is the first tag containing this commit, the fix should be backported to haproxy 1.6 and newer.	2018-01-25 22:25:40 +01:00
Tim Duesterhus	ec6b0a2d18	CLEANUP: sample: Fix outdated comment about sample casts functions The cast functions modify their output type as of commit: `b805f71d1b` v1.5-dev20 is the first tag containing this comment, the fix should be backported to haproxy 1.5 and newer.	2018-01-25 22:25:40 +01:00
Tim Duesterhus	c555ee0c45	CLEANUP: sample: Fix comment encoding of sample.c The file contained an 'e' with an gravis accent and thus was not US-ASCII, but ISO-8859-1. Also correct the spelling in the incorrect comment. The incorrect character was introduced in commit: `4d9a1d1a5c` v1.6-dev1 is the first tag containing this comment, the fix should be backported to haproxy 1.6 and newer.	2018-01-25 22:25:40 +01:00
Etienne Carriere	a792a0aa93	MINOR: sample: add date_us sample Add date_us sample that returns the microsecond part of the timeval structure representing the date of the structure. The "second" part of the timeval can already be fetched by the "date" sample	2018-01-21 07:56:42 +01:00
Willy Tarreau	60a2ee7945	MINOR: sample: rename the "len" converter to "length" This converter was recently introduced by commit `ed0d24e` ("MINOR: sample: add len converter"). As found by Cyril, it causes an issue in "http-request capture" statements. The non-obvious problem is that an old syntax for sample expressions and converters used to support a series of words, each representing a converter. This used to be how the "stick" directives were created initially. By having a converter called "len", a statement such as "http-request capture foo len 10" considers "len" as a converter and not as the capture length. This obsolete syntax needs to be changed in 1.9 but it's too late for other versions. It's worth noting that the same problem can happen if converters are registered on the fly using Lua. Other language keywords that currently have to be avoided in converters include "id", "table", "if", "unless".	2017-12-15 07:13:48 +01:00
Etienne Carriere	ed0d24ebed	MINOR: sample: add len converter Add len converter that returns the length of a string	2017-12-14 14:36:10 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Christopher Faulet	34adb2af96	MINOR: sample: Add "thread" sample fetch It returns id of the thread calling the function.	2017-11-23 16:33:13 +01:00
Emeric Brun	e5c918bcef	MINOR: threads/sample: Change temp_smp into a thread local variable	2017-10-31 13:58:31 +01:00
Dragan Dosen	3f957b2f83	MINOR: sample: add the hex2i converter Converts a hex string containing two hex digits per input byte to an integer. If the input value can not be converted, then zero is returned.	2017-10-25 04:46:08 +02:00
Dragan Dosen	6e5a9ca948	MINOR: sample: add the sha1 converter This converter can be used to generate a SHA1 digest from binary type sample. The result is a binary sample with length of 20 bytes.	2017-10-25 04:45:58 +02:00
Christopher Faulet	ec10051349	MINOR: samples: Handle the type SMP_T_METH when we duplicate a sample in smp_dup First, the type SMP_T_METH was not handled by smp_dup function. It was never called with this kind of samples, so it's not really a problem. But, this could be useful in future. For all known HTTP methods (GET, POST...), there is no extra space allocated for a sample of type SMP_T_METH. But for unkown methods, it uses a chunk. So, like for strings, we duplicate data, using a trash chunk.	2017-07-24 17:15:47 +02:00
Holger Just	1bfc24ba03	MINOR: sample: Add b64dec sample converter Add "b64dec" as a new converter which can be used to decode a base64 encoded string into its binary representation. It performs the inverse operation of the "base64" converter.	2017-05-12 15:56:52 +02:00
Nenad Merdanovic	50c8044423	CLEANUP: Remove comment that's no longer valid Code was deleted in `ad63582eb`, but the comment remained. Signed-off-by: Nenad Merdanovic <nmerdan@haproxy.com>	2017-03-13 18:26:05 +01:00
Nenad Merdanovic	807a6e7856	MINOR: Add hostname sample fetch It adds "hostname" as a new sample fetch. It does exactly the same as "%H" in a log format except that it can be used outside of log formats. Signed-off-by: Nenad Merdanovic <nmerdan@haproxy.com>	2017-03-13 18:26:05 +01:00
Thierry FOURNIER	01e0974b5a	MINOR: samples: add xx-hash functions This patch adds the support of xx-hash 32 and 64-bits functions.	2016-12-26 12:45:04 +01:00
Willy Tarreau	97108e08ce	CLEANUP: sample: report "converter" instead of "conv method" in error messages This was inherited from the very early stick-tables code but it's about time to produce understandable error messages :-)	2016-11-25 07:36:22 +01:00
Thierry FOURNIER / OZON.IO	a69c912187	CLEANUP: log-format: useless file and line in json converter The caller must log location information, so this information is provided two times in the log line. The error log is like this: [ALERT] 327/011513 (14291) : parsing [o3.conf:38]: 'http-response set-header': Sample fetch <method,json(rrr)> failed with : invalid args in conv method 'json' : Unexpected input code type at file 'o3.conf', line 38. Allowed value are 'ascii', 'utf8', 'utf8s', 'utf8p' and 'utf8ps'. This patch removes the second location indication, the the same error becomes: [ALERT] 327/011637 (14367) : parsing [o3.conf:38]: 'http-response set-header': Sample fetch <method,json(rrr)> failed with : invalid args in conv method 'json' : Unexpected input code type. Allowed value are 'ascii', 'utf8', 'utf8s', 'utf8p' and 'utf8ps'.	2016-11-24 18:54:25 +01:00
Christopher Faulet	f7e4e7e096	MAJOR: spoe: Add an experimental Stream Processing Offload Engine SPOE makes possible the communication with external components to retrieve some info using an in-house binary protocol, the Stream Processing Offload Protocol (SPOP). In the long term, its aim is to allow any kind of offloading on the streams. This first version, besides being experimental, won't do lot of things. The most important today is to validate the protocol design and lay the foundations of what will, one day, be a full offload engine for the stream processing. So, for now, the SPOE can offload the stream processing before "tcp-request content", "tcp-response content", "http-request" and "http-response" rules. And it only supports variables creation/suppression. But, in spite of these limited features, we can easily imagine to implement a SSO solution, an ip reputation service or an ip geolocation service. Internally, the SPOE is implemented as a filter. So, to use it, you must use following line in a proxy proxy section: frontend my-front ... filter spoe [engine <name>] config <file> ... It uses its own configuration file to keep the HAProxy configuration clean. It is also a easy way to disable it by commenting out the filter line. See "doc/SPOE.txt" for all details about the SPOE configuration.	2016-11-09 22:57:01 +01:00
Christopher Faulet	476e5d0e03	REORG: sample: move code to release a sample expression in sample.c This code has been moved from haproxy.c to sample.c and the function release_sample_expr can now be called from anywhere to release a sample expression. This function will be used by the stream processing offload engine (SPOE).	2016-11-09 22:57:00 +01:00
Willy Tarreau	2235b261b6	OPTIM: http: move all http character classs tables into a single one We used to have 7 different character classes, each was 256 bytes long, resulting in almost 2kB being used in the L1 cache. It's as cheap to test a bit than to check the byte is not null, so let's store a 7-bit composite value and check for the respective bits there instead. The executable is now 4 kB smaller and the performance on small objects increased by about 1% to 222k requests/second with a config involving 4 http-request rules including 1 header lookup, one header replacement, and 2 variable assignments.	2016-11-05 15:58:08 +01:00
Willy Tarreau	f0645dce4f	MINOR: sample: use smp_make_rw() in upper/lower converters There's no point in always duplicating the sample, just ensure it's writable, as was done prior to the smp_dup() change. This should be backported to 1.6 to avoid a performance regression caused by this change (about 30% more time for upper/lower due to the copy).	2016-08-09 14:31:25 +02:00
Willy Tarreau	ad63582eb9	BUG/MEDIUM: samples: make smp_dup() always duplicate the sample Vedran Furac reported a strange problem where the "base" sample fetch would not always work for tracking purposes. In fact, it happens that commit `bc8c404` ("MAJOR: stick-tables: use sample types in place of dedicated types") merged in 1.6 exposed a fundamental bug related to the way samples use chunks as strings. The problem is that chunks convey a base pointer, a length and an optional size, which may be zero when unknown or when the chunk is allocated from a read-only location. The sole purpose of this size is to know whether or not the chunk may be appended new data. This size cause some semantics issue in the sample, which has its own SMP_F_CONST flag to indicate read-only contents. The problem was emphasized by the commit above because it made use of new calls to smp_dup() to convert a sample to a table key. And since smp_dup() would only check the SMP_F_CONST flag, it would happily return read-write samples indicating size=0. So some tests were added upon smp_dup() return to ensure that the actual length is smaller than size, but this in fact made things even worse. For example, the "sni" server directive does some bad stuff on many occasions because it limits len to size-1 and effectively sets it to -1 and writes the zero byte before the beginning of the string! It is therefore obvious that smp_dup() needs to be modified to take this nature of the chunks into account. It's not enough but is needed. The core of the problem comes from the fact that smp_dup() is called for 5 distinct needs which are not always fulfilled : 1) duplicate a sample to keep a copy of it during some operations 2) ensure that the sample is rewritable for a converter like upper() 3) ensure that the sample is terminated with a \0 4) set a correct size on the sample 5) grow the sample in case it was extracted from a partial chunk Case 1 is not used for now, so we can ignore it. Case 2 indicates the wish to modify the sample, so its R/O status must be removed if any, but there's no implied requirement that the chunk becomes larger. Case 3 is used when the sample has to be made compatible with libc's str* functions. There's no need to make it R/W nor to duplicate it if it is already correct. Case 4 can happen when the sample's size is required (eg: before performing some changes that must fit in the buffer). Case 5 is more or less similar but will happen when the sample by be grown but we want to ensure we're not bound by the current small size. So the proposal is to have different functions for various operations. One will ensure a sample is safe for use with str* functions. Another one will ensure it may be rewritten in place. And smp_dup() will have to perform an inconditional duplication to guarantee at least #5 above, and implicitly all other ones. This patch only modifies smp_dup() to make the duplication inconditional. It is enough to fix both the "base" sample fetch and the "sni" server directive, and all use cases in general though not always optimally. More patches will follow to address them more optimally and even better than the current situation (eg: avoid a dup just to add a \0 when possible). The bug comes from an ambiguous design, so its roots are old. 1.6 is affected and a backport is needed. In 1.5, the function already existed but was only used by two converters modifying the data in place, so the bug has no effect there.	2016-08-09 14:03:23 +02:00
Herve COMMOWICK	8dfe863fbf	DOC: fix json converter example and error message	2016-08-07 08:08:18 +02:00
Willy Tarreau	5f6e9054b9	BUILD: fix build on Solaris 11 htonll()/ntohll() already exist on Solaris 11 with a different declaration, causing a build error as reported by Jonathan Fisher. They used to exist on OSX with a #define which allowed us to detect them. It was a bad idea to give these functions a name subject to conflicts like this. Simply rename them my_htonll()/my_ntohll() to definitely get rid of the conflict. This patch must be backported to 1.6.	2016-05-26 07:15:57 +02:00
David Carlier	64a16ab19c	BUG/MEDIUM: sample: initialize the pointer before parse_binary call. parse_binary line 2025 checks the nullity of binstr parameter. Other calls of parse_binary properly zeroify this parameter. [wt: this could result in random failures of the const parser]	2016-04-12 11:08:24 +02:00
Vincent Bernat	02779b6263	CLEANUP: uniformize last argument of malloc/calloc Instead of repeating the type of the LHS argument (sizeof(struct ...)) in calls to malloc/calloc, we directly use the pointer name (sizeof(...)). The following Coccinelle patch was used: @@ type T; T x; @@ x = malloc( - sizeof(T) + sizeof(x) ) @@ type T; T x; @@ x = calloc(1, - sizeof(T) + sizeof(*x) ) When the LHS is not just a variable name, no change is made. Moreover, the following patch was used to ensure that "1" is consistently used as a first argument of calloc, not the last one: @@ @@ calloc( + 1, ... - ,1 )	2016-04-03 14:17:42 +02:00
Willy Tarreau	6204cd9f27	BUG/MAJOR: vars: always retrieve the stream and session from the sample This is the continuation of previous patch called "BUG/MAJOR: samples: check smp->strm before using it". It happens that variables may have a session-wide scope, and that their session is retrieved by dereferencing the stream. But nothing prevents them from being used from a streamless context such as tcp-request connection, thus crashing the process. Example : tcp-request connection accept if { src,set-var(sess.foo) -m found } In order to fix this, we have to always ensure that variable manipulation only happens via the sample, which contains the correct owner and context, and that we never use one from a different source. This results in quite a large change since a lot of functions are inderctly involved in the call chain, but the change is easy to follow. This fix must be backported to 1.6, and requires the last two patches.	2016-03-10 17:28:04 +01:00
Willy Tarreau	7560dd4b6a	MINOR: sample: always set a new sample's owner before evaluating it Some functions like sample_conv_var2smp(), var_get_byname(), and var_set_byname() directly or indirectly need to access the current stream and/or session and must find it in the sample itself and not as a distinct argument. Thus we first need to call smp_set_owner() prior to each such calls.	2016-03-10 16:42:58 +01:00
Willy Tarreau	1777ea63e0	MINOR: sample: add a new helper to initialize the owner of a sample Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. This requires from a lot of call places to initialize 4 fields, and it was even forgotten at a few places. This patch provides a convenient helper to initialize all these fields at once, making it easy to prepare a new sample from a previous one for example. A few call places were cleaned up to make use of it. It will be needed by further fixes. At one place in the Lua code, it was moved earlier because we used to call sample casts with a non completely initialized sample, which is not clean eventhough at the moment there are no consequences.	2016-03-10 16:42:58 +01:00
Dragan Dosen	0b85ecee53	MEDIUM: logs: add a new RFC5424 log-format for the structured-data This patch adds a new RFC5424-specific log-format for the structured-data that is automatically send by __send_log() when the sender is in RFC5424 mode. A new statement "log-format-sd" should be used in order to set log-format for the structured-data part in RFC5424 formatted syslog messages. Example: log-format-sd [exampleSDID@1234\ bytes=\"%B\"\ status=\"%ST\"]	2015-09-28 14:01:27 +02:00
Thierry FOURNIER	136f9d34a9	MINOR: samples: rename union from "data" to "u" The union name "data" is a little bit heavy while we read the source code because we can read "data.data.sint". The rename from "data" to "u" makes the read easiest like "data.u.sint".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8c542cac07	MEDIUM: samples: Use the "struct sample_data" in the "struct sample" This patch remove the struct information stored both in the struct sample_data and in the striuct sample. Now, only thestruct sample_data contains data, and the struct sample use the struct sample_data for storing his own data.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	cc4d1716a2	MINOR: sample: Add ipv6 to ipv4 and sint to ipv6 casts The RFC4291 says that when the IPv6 adress have the followin form: 0000::ffff:a.b.c.d, if can be converted to an IPv4 adress. This patch enable this conversion in casts. As the sint can be casted as ipv4, and ipv4 can be casted as ipv6, we can directly cast sint as ipv6 using the RFC4291.	2015-08-11 14:14:10 +02:00
Thierry FOURNIER	5d86fae234	MEDIUM: vars/sample: operators can use variables as parameter This patch allow the existing operators to take a variable as parameter. This is useful to add the content of two variables. This patch modify the behavior of operators.	2015-07-22 00:48:24 +02:00
Thierry FOURNIER	00c005c726	MEDIUM: sample: switch to saturated arithmetic This patch check calculus for overflow and returns capped values. This permits to protect against integer overflow in certain operations involving ratios, percentages, limits or anything. That can sometimes be critically important with some operations (eg: content-length < X).	2015-07-22 00:48:24 +02:00
Thierry FOURNIER	bf65cd4d77	MAJOR: arg: converts uint and sint in sint This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	07ee64ef4d	MAJOR: sample: converts uint and sint in 64 bits signed integer This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed. This makes easy the usage of integer and clarify signed and unsigned use. With the previous version, signed and unsigned are used ones in place of others, and sometimes the converter loose the sign. For example, divisions are processed with "unsigned", if one entry is negative, the result is wrong. Note that the integer pattern matching and dotted version pattern matching are already working with signed 64 bits integer values. There is one user-visible change : the "uint()" and "sint()" sample fetch functions which used to return a constant integer have been replaced with a new more natural, unified "int()" function. These functions were only introduced in the latest 1.6-dev2 so there's no impact on regular deployments.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	fac9ccfb70	BUG/MINOR: http/sample: gmtime/localtime can fail The man said that gmtime() and localtime() can return a NULL value. This is not tested. It appears that all the values of a 32 bit integer are valid, but it is better to check the return of these functions. However, if the integer move from 32 bits to 64 bits, some 64 values can be unsupported.	2015-07-20 12:21:35 +02:00
Willy Tarreau	28d976d5ee	MINOR: args: add new context for servers We'll have to support fetch expressions and args on server lines for "usesrc", "usedst", "sni", etc...	2015-07-09 11:39:33 +02:00
Adis Nezirovic	79beb248b9	CLEANUP: sample: generalize sample_fetch_string() as sample_fetch_as_type() This modification makes possible to use sample_fetch_string() in more places, where we might need to fetch sample values which are not plain strings. This way we don't need to fetch string, and convert it into another type afterwards. When using aliased types, the caller should explicitly check which exact type was returned (e.g. SMP_T_IPV4 or SMP_T_IPV6 for SMP_T_ADDR). All usages of sample_fetch_string() are converted to use new function.	2015-07-06 16:17:25 +02:00
Dragan Dosen	93b38d9191	MEDIUM: 51Degrees code refactoring and cleanup Moved 51Degrees code from src/haproxy.c, src/sample.c and src/cfgparse.c into a separate files src/51d.c and include/import/51d.h. Added two new functions init_51degrees() and deinit_51degrees(), updated Makefile and other code reorganizations related to 51Degrees.	2015-06-30 10:43:03 +02:00
Thierry FOURNIER	cc103299c7	MINOR: samples: add samples which returns constants This patch adds sample which returns constants values. This is useful for intialising variables.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	9687c77c91	MINOR: debug: add a special converter which display its input sample content. This converter displays its input sample type and content. It is useful for debugging some complex configurations.	2015-06-13 23:01:36 +02:00
Thierry FOURNIER	9c627e84b2	MEDIUM: sample: Add type any This type is used to accept any type of sample as input, and prevent any automatic "cast". It runs like the type "ADDR" which accept the type "IPV4" and "IPV6".	2015-06-13 22:59:14 +02:00
Thierry FOURNIER	0f811440d5	BUG/MINOR: sample: wrong conversion of signed values The signed values are casted as unsigned before conversion. This patch use the good converters according with the sample type. Note: it depends on previous patch to parse signed ints.	2015-06-13 22:59:14 +02:00
Thierry FOURNIER	4c2479e1c4	BUG/MINOR: debug: display (null) in place of "meth" The array which contains names of types, miss the METH entry. [wt: should be backported to 1.5 as well]	2015-06-09 10:58:14 +02:00
Thomas Holmes	4d441a759c	MEDIUM: sample: add trie support to 51Degrees Trie or pattern algorithm is used depending on what 51Degrees source files are provided to MAKE.	2015-06-02 19:30:53 +02:00
Thomas Holmes	951d44d24d	MEDIUM: sample: add fiftyone_degrees converter. It takes up to 5 string arguments that are to be 51Degrees property names. It will then create a chunk with values detected based on the request header supplied (this should be the User-Agent).	2015-06-02 14:00:25 +02:00
Willy Tarreau	f63386ad27	CLEANUP: da: move the converter registration to da.c There's no reason to put it into sample.c, it's better to register it locally in da.c, it removes a number of ifdefs and exports.	2015-06-02 13:42:12 +02:00
David Carlier	4542b10ae1	MEDIUM: sample: add the da-csv converter This diff declares the deviceatlas module and can accept up to 5 property names for the API lookup. [wt: this should probably be moved to its own file using the keyword registration mechanism]	2015-06-02 13:24:50 +02:00
Willy Tarreau	e2dc1fa8ca	MEDIUM: stick-table: remove the now duplicate find_stktable() function Since proxy_tbl_by_name() already does the same job, let's not keep duplicate functions and use this one only.	2015-05-26 12:08:07 +02:00
Willy Tarreau	9e0bb1013e	CLEANUP: proxy: make the proxy lookup functions more user-friendly First, findproxy() was renamed proxy_find_by_name() so that its explicit that a name is required for the lookup. Second, we give this function the ability to search for tables if needed. Third we now provide inline wrappers to pass the appropriate PR_CAP_* flags and to explicitly look up a frontend, backend or table.	2015-05-26 11:24:42 +02:00
Thierry FOURNIER	0786d05a04	MEDIUM: sample: change the prototype of sample-fetches functions This patch removes the "opt" entry from the prototype of the sample-fetches fucntions. This permits to remove some weight in the prototype call.	2015-05-11 20:03:08 +02:00
Thierry FOURNIER	1d33b882d2	MINOR: sample: fill the struct sample with the options. Options are relative to the sample. Each sample fetched is associated with fetch options or fetch flags. This patch adds the 'opt' vaue in the sample struct. This permits to reduce the sample-fetch function prototype. In other way, the converters will have more detail about the origin of the sample.	2015-05-11 20:02:11 +02:00
Thierry FOURNIER	0a9a2b8cec	MEDIUM: sample change the prototype of sample-fetches and converters functions This patch removes the structs "session", "stream" and "proxy" from the sample-fetches and converters function prototypes. This permits to remove some weight in the prototype call.	2015-05-11 20:01:42 +02:00
Thierry FOURNIER	6879ad31a5	MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers Some sample analyzer (sample-fetch or converters) needs to known the proxy, session and stream attached to the sampel. The sample-fetches and the converters function pointers cannot be called without these 3 pointers filled. This patch permits to reduce the sample-fetch and the converters called prototypes, and provides a new mean to add information for this type of functions.	2015-05-11 20:00:03 +02:00
Willy Tarreau	192252e2d8	MAJOR: sample: pass a pointer to the session to each sample fetch function Many such function need a session, and till now they used to dereference the stream. Once we remove the stream from the embryonic session, this will not be possible anymore. So as of now, sample fetch functions will be called with this : - sess = NULL, strm = NULL : never - sess = valid, strm = NULL : tcp-req connection - sess = valid, strm = valid, strm->txn = NULL : tcp-req content - sess = valid, strm = valid, strm->txn = valid : http-req / http-res	2015-04-06 11:37:25 +02:00
Willy Tarreau	15e91e1b36	MAJOR: sample: don't pass l7 anymore to sample fetch functions All of them can now retrieve the HTTP transaction if it exists from the stream and be sure to get NULL there when called with an embryonic session. The patch is a bit large because many locations were touched (all fetch functions had to have their prototype adjusted). The opportunity was taken to also uniformize the call names (the stream is now always "strm" instead of "l4") and to fix indent where it was broken. This way when we later introduce the session here there will be less confusion.	2015-04-06 11:35:53 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved overthere. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection \| applet There will be at most one http_txn per stream, and a same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
Thierry FOURNIER	8fd1376014	MINOR: converters: add function to browse converters This patch adds a fucntion to browse each converter. This is used with Lua for using the converters with a wrapper.	2015-03-11 19:55:10 +01:00
Thierry FOURNIER	4d9a1d1a5c	MINOR: sample: add function for browsing samples. This function is useful with the incoming lua functions.	2015-02-28 23:12:32 +01:00
Thierry FOURNIER	f41a809dc9	MINOR: sample: add private argument to the struct sample_fetch The add of this private argument is to prepare the integration of the lua fetchs.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	68a556e282	MINOR: converters: give the session pointer as converter argument Some usages of the converters need to know the attached session. The Lua needs the session for retrieving his running context. This patch adds the "session" as an argument of the converters prototype.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	1edc971919	MINOR: converters: add a "void *private" argument to converters This permits to store specific configuration pointer. It is useful with future Lua integration.	2015-02-28 23:12:31 +01:00
Willy Tarreau	9770787e70	MEDIUM: samples: provide basic arithmetic and bitwise operators This commit introduces a new category of converters. They are bitwise and arithmetic operators which support performing basic operations on integers. Some bitwise operations are supported (and, or, xor, cpl) and some arithmetic operations are supported (add, sub, mul, div, mod, neg). Some comparators are provided (odd, even, not, bool) which make it possible to report a match without having to write an ACL. The detailed list of new operators as they appear in the doc is : add(<value>) Adds <value> to the input value of type unsigned integer, and returns the result as an unsigned integer. and(<value>) Performs a bitwise "AND" between <value> and the input value of type unsigned integer, and returns the result as an unsigned integer. bool Returns a boolean TRUE if the input value of type unsigned integer is non-null, otherwise returns FALSE. Used in conjunction with and(), it can be used to report true/false for bit testing on input values (eg: verify the presence of a flag). cpl Takes the input value of type unsigned integer, applies a twos-complement (flips all bits) and returns the result as an unsigned integer. div(<value>) Divides the input value of type unsigned integer by <value>, and returns the result as an unsigned integer. If <value> is null, the largest unsigned integer is returned (typically 2^32-1). even Returns a boolean TRUE if the input value of type unsigned integer is even otherwise returns FALSE. It is functionally equivalent to "not,and(1),bool". mod(<value>) Divides the input value of type unsigned integer by <value>, and returns the remainder as an unsigned integer. If <value> is null, then zero is returned. mul(<value>) Multiplies the input value of type unsigned integer by <value>, and returns the product as an unsigned integer. In case of overflow, the higher bits are lost, leading to seemingly strange values. neg Takes the input value of type unsigned integer, computes the opposite value, and returns the remainder as an unsigned integer. 0 is identity. This operator is provided for reversed subtracts : in order to subtract the input from a constant, simply perform a "neg,add(value)". not Returns a boolean FALSE if the input value of type unsigned integer is non-null, otherwise returns TRUE. Used in conjunction with and(), it can be used to report true/false for bit testing on input values (eg: verify the absence of a flag). odd Returns a boolean TRUE if the input value of type unsigned integer is odd otherwise returns FALSE. It is functionally equivalent to "and(1),bool". or(<value>) Performs a bitwise "OR" between <value> and the input value of type unsigned integer, and returns the result as an unsigned integer. sub(<value>) Subtracts <value> from the input value of type unsigned integer, and returns the result as an unsigned integer. Note: in order to subtract the input from a constant, simply perform a "neg,add(value)". xor(<value>) Performs a bitwise "XOR" (exclusive OR) between <value> and the input value of type unsigned integer, and returns the result as an unsigned integer.	2015-01-27 15:41:13 +01:00
Willy Tarreau	d817e468bf	BUG/MINOR: sample: fix case sensitivity for the regsub converter Two commits ago in `7eda849` ("MEDIUM: samples: add a regsub converter to perform regex-based transformations"), I got caught for the second time with the inverted case sensitivity usage of regex_comp(). So by default it is case insensitive and passing the "i" flag makes it case sensitive. I forgot to recheck that case before committing the cleanup. No harm anyway, nobody had the time to use it.	2015-01-23 20:27:41 +01:00
Willy Tarreau	7eda849dce	MEDIUM: samples: add a regsub converter to perform regex-based transformations We can now replace matching regex parts with a string, a la sed. Note that there are at least 3 different behaviours for existing sed implementations when matching 0-length strings. Here is the result of the following operation on each implementationt tested : echo 'xzxyz' \| sed -e 's/xy/A/g' GNU sed 4.2.1 => AzAzA Perl's sed 5.16.1 => AAzAAzA Busybox v1.11.2 sed => AzAz The psed behaviour was adopted because it causes the least exceptions in the code and seems logical from a certain perspective : - "x" matches xy => add "A" and skip "x" - "z" matches xy => add "A" and keep "z", not part of the match - "xy" matches xy => add "A" and skip "xy" - "z" matches xy => add "A" and keep "z", not part of the match - "" matches xy => add "A" and stop here Anyway, given the incompatibilities between implementations, it's unlikely that some processing will rely on this behaviour. There currently is one big limitation : the configuration parser makes it impossible to pass commas or closing parenthesis (or even closing brackets in log formats). But that's still quite usable to replace certain characters or character sequences. It will become more complete once the config parser is reworked.	2015-01-22 14:24:53 +01:00
Willy Tarreau	469477879c	MINOR: args: implement a new arg type for regex : ARGT_REG This one will be used when a regex is expected. It is automatically resolved after the parsing and compiled into a regex. Some optional flags are supported in the type-specific flags that should be set by the optional arg checker. One is used during the regex compilation : ARGF_REG_ICASE to ignore case.	2015-01-22 14:24:53 +01:00

1 2 3 4 5

207 Commits