Commit Graph

1359 Commits

Author SHA1 Message Date
Thierry FOURNIER
136f9d34a9 MINOR: samples: rename union from "data" to "u"
The union name "data" is a little heavy when reading the source
code because we end up reading "data.data.sint". Renaming "data" to "u"
makes it easier to read, as in "data.u.sint".
2015-08-20 17:13:46 +02:00
Thierry FOURNIER
8c542cac07 MEDIUM: samples: Use the "struct sample_data" in the "struct sample"
This patch removes the information duplicated between the struct
sample_data and the struct sample. Now, only the struct sample_data
contains data, and the struct sample uses the struct sample_data to store
its own data.
2015-08-20 17:13:46 +02:00
Thierry FOURNIER
a6b6343cff CLEANUP: http/tcp actions: remove the scope member
The scope member is not used. This patch removes this entry.
2015-08-11 13:44:53 +02:00
Thierry FOURNIER
9b49f589ed CLEANUP: proto_http: remove useless initialisation
This initialisation of the opaque array is useless.
2015-08-11 13:44:51 +02:00
Willy Tarreau
53a09d520e MAJOR: http: remove references to appsession
appsession started to be deprecated with the introduction of stick
tables, which are much more powerful and flexible; in addition they are
replicated between nodes and maintained across reloads. Let's now remove
appsession completely.
2015-08-10 19:16:18 +02:00
Willy Tarreau
449d74a906 MEDIUM: backend: add the "http-reuse aggressive" strategy
This strategy is less extreme than "always", it only dispatches first
requests to validated reused connections, and moves a connection from
the idle list to the safe list once it has seen a second request, thus
proving that it could be reused.
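
A minimal configuration sketch of this strategy (untested; the backend and
server names are illustrative):

    backend app
        http-reuse aggressive
        server srv1 192.168.1.10:80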
2015-08-06 16:29:01 +02:00
Willy Tarreau
8dff998b91 MAJOR: backend: initial work towards connection reuse
In connect_server(), if we don't have a connection attached to the
stream-int, we first look into the server's idle_conns list and pick
the first one there, detaching it from its owner if it had one. If we
already had a connection, we close it.

This mechanism works well but doesn't scale: as the number of servers
increases, the likelihood that the connection attached to the stream
interface doesn't match the server, and thus gets closed, increases.
2015-08-06 11:34:21 +02:00
Willy Tarreau
387ebf84dd MINOR: connection: add a new flag CO_FL_PRIVATE
This flag is set on an outgoing connection when this connection gets
some properties that must not be shared with other connections, such
as dynamic transparent source binding, SNI or a proxy protocol header,
or an authentication challenge from the server. This will be needed
later to implement connection reuse.
2015-08-06 11:14:17 +02:00
Willy Tarreau
4320eaac62 MINOR: stream-int: make si_idle_conn() only accept valid connections
This function is now dedicated to idle connections only, which means
that it must not be used without an endpoint, nor with an endpoint
that is not a connection. The connection remains attached to the
stream interface.
2015-08-06 11:11:10 +02:00
Willy Tarreau
323a2d925c MEDIUM: stream-int: queue idle connections at the server
Now we get a per-server list of all idle connections. That way we'll
be able to reclaim them upon shortage later.
2015-08-06 11:06:25 +02:00
Willy Tarreau
973a54235f MEDIUM: stream-int: simplify si_alloc_conn()
Since we now always call this function with the reuse parameter cleared,
let's simplify the function's logic as it cannot return the existing
connection anymore. The savings on this inline function are appreciable
(240 bytes) :

$ size haproxy.old haproxy.new
   text    data     bss     dec     hex filename
1020383   40816   36928 1098127  10c18f haproxy.old
1020143   40816   36928 1097887  10c09f haproxy.new
2015-08-05 21:51:09 +02:00
Thierry FOURNIER
bf65cd4d77 MAJOR: arg: converts uint and sint in sint
This patch removes the 32-bit unsigned integer and the 32-bit signed
integer types. It replaces them with a single 64-bit signed type.
2015-07-22 00:48:23 +02:00
Thierry FOURNIER
07ee64ef4d MAJOR: sample: converts uint and sint in 64 bits signed integer
This patch removes the 32-bit unsigned integer and the 32-bit signed
integer types. It replaces them with a single 64-bit signed type.

This makes integer usage easier and clarifies signed versus unsigned use.
In the previous version, signed and unsigned values were sometimes used in
place of one another, and converters could lose the sign. For example,
divisions were processed as "unsigned": if one operand was negative, the
result was wrong.

Note that the integer pattern matching and dotted version pattern matching
are already working with signed 64 bits integer values.

There is one user-visible change : the "uint()" and "sint()" sample fetch
functions which used to return a constant integer have been replaced with
a new more natural, unified "int()" function. These functions were only
introduced in the latest 1.6-dev2 so there's no impact on regular
deployments.
2015-07-22 00:48:23 +02:00
Thierry FOURNIER
fac9ccfb70 BUG/MINOR: http/sample: gmtime/localtime can fail
The man page says that gmtime() and localtime() can return a NULL
value. This was not tested. It appears that all the values of a 32-bit
integer are valid, but it is better to check the return value of these
functions.

However, if the integer moves from 32 bits to 64 bits, some 64-bit
values may be unsupported.
2015-07-20 12:21:35 +02:00
Adis Nezirovic
2fbcafc9ce MEDIUM: http: Add new 'set-src' option to http-request
This option enables overriding the source IP address in an HTTP request.
It is useful when we want to set a custom source IP (e.g. a front proxy
rewrites the address but provides the correct one in headers) or when we
want to mask the source IP address for privacy or compliance.

It acts on any expression which produces a valid IP address.
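
A minimal sketch of the new action, assuming a front proxy passes the
original address in X-Forwarded-For:

    frontend www
        bind :80
        http-request set-src hdr(x-forwarded-for)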
2015-07-06 16:17:28 +02:00
Adis Nezirovic
79beb248b9 CLEANUP: sample: generalize sample_fetch_string() as sample_fetch_as_type()
This modification makes it possible to use sample_fetch_string() in more
places, where we might need to fetch sample values which are not plain
strings. This way we don't need to fetch a string and convert it into
another type afterwards.

When using aliased types, the caller should explicitly check which exact
type was returned (e.g. SMP_T_IPV4 or SMP_T_IPV6 for SMP_T_ADDR).

All usages of sample_fetch_string() are converted to use the new function.
2015-07-06 16:17:25 +02:00
Thierry FOURNIER
4834bc773c MEDIUM: vars: adds support of variables
This patch adds support for variables during the processing of each stream.
The variable's scope can be set to 'session', 'transaction', 'request' or
'response'. The variable's type is the type returned by the assignment
expression, and it can change during processing.

The allocated memory can be controlled for each scope and each request, and for
the global process.
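
A minimal sketch of the resulting syntax (the variable name is
illustrative):

    http-request set-var(txn.host) req.hdr(host)
    http-response set-header X-Orig-Host %[var(txn.host)]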
2015-06-13 23:01:37 +02:00
Thierry FOURNIER
0e11863a6f MINOR: tcp/http/conf: extends the keyword registration options
This patch makes it possible to register new keywords for "tcp-request
content", "tcp-request connection", "tcp-response content", "http-request"
and "http-response" which are identified only by matching the start of the
keyword.

For example, we register the keyword "set-var" with the option "match_pfx",
and the configuration keyword "set-var(var_name)" matches this entry.
2015-06-13 23:01:37 +02:00
Willy Tarreau
b8cdf52da0 BUG/MEDIUM: http: fix body processing for the stats applet
Commit 9fbe18e ("MEDIUM: http: add a new option http-buffer-request")
introduced a regression due to a misplaced check causing the admin
mode of the HTTP stats not to work anymore.

This patch tried to ensure that when we need a request body for the
stats applet, and we have already waited for this body, we don't wait
for it again; but the condition was applied too early, disabling the
entire processing of the body, and was based on the wrong HTTP state
(MSG_BODY), resulting in the test never matching.

Thanks to Chad Lavoie for reporting the problem.

This bug is 1.6-only, no backport is needed.
2015-05-29 01:12:38 +02:00
Willy Tarreau
2de8a50918 MEDIUM: http: no need to close the request on redirect if data was parsed
There are two reasons for not keeping the client connection alive upon a
redirect :
  - save the client from uploading all data
  - avoid keeping a connection alive if the redirect goes to another domain

The first case should consider an exception when all the data from the
client have been read already. This specifically happens on response
redirects after a POST to a server. This is an easy situation to detect.

It could later be improved to cover the cases where option
http-buffer-request is used.
2015-05-28 17:45:43 +02:00
Willy Tarreau
51d861a44f MEDIUM: http: implement http-response redirect rules
Sometimes it's problematic not to have "http-response redirect" rules,
for example to perform a browser-based redirect based on certain server
conditions (eg: match of a header).

This patch adds "http-response redirect location <fmt>" which gives
enough flexibility for most imaginable operations. The connection to
the server is closed when this is performed so that we don't risk to
forward any pending data from the server.

Any pending response data are trimmed so that we don't risk to
forward anything pending to the client. It's harmless to also do that
for requests so we don't need to consider the direction.
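
A minimal sketch of the new rule; the condition based on the "status"
sample fetch is illustrative:

    http-response redirect location /maintenance if { status 503 }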
2015-05-28 17:45:43 +02:00
Willy Tarreau
be4653b6d4 MINOR: http: prepare support for parsing redirect actions on responses
In order to support http-response redirect, the parsing needs to be
adapted a little bit to only support the "location" type, and to
adjust the log-format parser so that it knows the direction of the
sample fetch calls.
2015-05-28 17:43:11 +02:00
Willy Tarreau
b329a312e3 CLEANUP: http: explicitly reference request in http_apply_redirect_rules()
This function was made to perform a redirect on requests only; it was
using a message or txn->req in an inconsistent way and did not consider
the possibility that it could be used for the other direction. Let's
clean it up to take both a request and a response message.
2015-05-28 17:42:16 +02:00
Thierry FOURNIER
e80fadaaca MEDIUM: capture: adds http-response capture
This patch adds an HTTP response capture keyword with the same behavior
as the previous patch called "MEDIUM: capture: Allow capture with slot
identifier".
2015-05-28 13:51:00 +02:00
Thierry FOURNIER
82bf70dff4 MEDIUM: capture: Allow capture with slot identifier
This patch modifies the current http-request capture function
and adds a new keyword "id" that permits identifying a capture slot.
If the identifier doesn't exist, the action fails silently.

Note that this patch removes an unused list initialisation, which seems
to be inherited from a copy/paste. It's harmless and does not need to
be backported.

   LIST_INIT((struct list *)&rule->arg.act.p[0]);
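
A minimal usage sketch of the new "id" keyword, assuming a slot was
previously declared with "declare capture":

    declare capture request len 32
    http-request capture req.hdr(user-agent) id 0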
2015-05-28 13:50:29 +02:00
Thierry FOURNIER
35ab27561e MINOR: capture: add two "capture" converters
This patch adds "capture-req" and "capture-res". These two converters
capture their entry in the allocated slot given in argument and pass
the input on the output.
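
A minimal sketch of the "capture-req" converter, assuming request slot 0
was declared; the input is passed through unchanged:

    http-request set-header X-Host %[req.hdr(host),capture-req(0)]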
2015-05-28 13:50:29 +02:00
Willy Tarreau
98d0485a90 MAJOR: config: remove the deprecated reqsetbe / reqisetbe actions
These ones were already obsoleted in 1.4, marked for removal in 1.5,
and not documented anymore. They used to emit warnings, and still
required quite some code to stay in place. Let's remove them now.
2015-05-26 12:18:29 +02:00
Dragan Dosen
26f77e534c BUG/MEDIUM: http: fix the url_param fetch
The "name" and "name_len" arguments in function "smp_fetch_url_param"
could be left uninitialized for subsequent calls.

[wt: no backport needed, this is an 1.6 regression introduced by
 commit 4fdc74c ("MINOR: http: split the url_param in two parts") ]
2015-05-25 19:01:39 +02:00
Thierry FOURNIER
8be451c52a MEDIUM: http: url-encoded parsing function can run throught wrapped buffer
The functions smp_fetch_param(), find_next_url_param() and
find_url_param_pos() can now look for an argument across two chunks
instead of only one.
2015-05-20 16:05:38 +02:00
Thierry FOURNIER
e28c49975a MINOR: http: add body_param fetch
This fetch returns one body parameter or the list of all body parameters.
This first version works only with a single chunk.
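
A minimal sketch, assuming the body was buffered first (the parameter
name and value are illustrative):

    option http-buffer-request
    use_backend api if { req.body_param(format) -m str json }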
2015-05-20 15:56:23 +02:00
Thierry FOURNIER
0948d41a12 CLEANUP: http: bad indentation
Some function arguments used spaces in place of tabs for
indentation.
2015-05-20 15:56:23 +02:00
Thierry FOURNIER
4fdc74c22c MINOR: http: split the url_param in two parts
This patch is part of the body_param fetch work. The goal is to have a
generic url-encoded parser which can be used for parsing both the query
string and the body.
2015-05-20 15:56:23 +02:00
Willy Tarreau
1ede1daab6 MEDIUM: http: make url_param iterate over multiple occurrences
There are some situations where it's desirable to scan multiple occurrences
of a same parameter name in the query string. This change ensures this can
work, even with an empty name which will then iterate over all parameters.
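
A minimal sketch: with this change an ACL can match any of several
occurrences of the same parameter (names are illustrative):

    acl has_dbg url_param(debug) -m found
    http-request deny if has_dbg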
2015-05-19 13:16:07 +02:00
Thierry FOURNIER
0786d05a04 MEDIUM: sample: change the prototype of sample-fetches functions
This patch removes the "opt" entry from the prototype of the
sample-fetches fucntions. This permits to remove some weight
in the prototype call.
2015-05-11 20:03:08 +02:00
Thierry FOURNIER
0a9a2b8cec MEDIUM: sample change the prototype of sample-fetches and converters functions
This patch removes the structs "session", "stream" and "proxy" from
the sample-fetch and converter function prototypes.

This removes some weight from the prototype call.
2015-05-11 20:01:42 +02:00
Willy Tarreau
bbfb6c4085 BUG/MEDIUM: http: don't forward client shutdown without NOLINGER except for tunnels
There's an issue related to shutting down POST transfers or closing the
connection after the end of the upload: the shutdown is forwarded to the
server regardless of the abortonclose option. The problem it causes is that
during a scan, brute force or whatever, it becomes possible that all source
ports are exhausted with all sockets in TIME_WAIT state.

There are multiple issues at once in fact :
  - no action is done for the close, it automatically happens at the lower
    layers thanks to channel_auto_close(), so we cannot act on NOLINGER ;

  - we *do* want to continue to send a clean shutdown in tunnel mode because
    some protocols transported over HTTP may need this, regardless of option
    abortonclose, thus we can't set the option unconditionally ;

  - for all other modes, we do want to close the dirty way because we know
    whether we've sent everything or not, and we don't want to eat all
    source ports.

The solution is a bit complex and applies to DONE/TUNNEL states :

  1) disable automatic close for everything not a tunnel and not just
     keep-alive / server-close. Force-close is now covered, as is HTTP/1.0
     which implicitly works in force-close mode ;

  2) when processing option abortonclose, we know we can disable lingering
     if the client has closed and the connection is not in tunnel mode.

Since the last case above leads to a situation where the client side reports
an error, we know the connection will not be reused, so leaving the flag on
the stream-interface is safe. A client closing in the middle of the data
transmission already aborts the transaction so this case is not a problem.

This fix must be backported to 1.5 where the problem was detected.
2015-05-11 19:05:42 +02:00
Thierry FOURNIER
82ff3c9b05 MINOR: sample: add url_dec converter
This converter decodes a url-encoded string. It takes a string as
input and returns a string as output.
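
A minimal sketch of the converter (the header and parameter names are
illustrative):

    http-request set-header X-Target %[url_param(target),url_dec]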
2015-05-11 11:40:36 +02:00
Willy Tarreau
3986ac1860 BUG/MEDIUM: http: fix the http-request capture parser
Due to the code being mostly inspired from the tcp-request parser, it
does some crap because both don't work the same way. The "len" argument
could be mismatched and then the length could be used uninitialized.
2015-05-08 16:13:42 +02:00
Willy Tarreau
a9083d0722 MEDIUM: http: add new "capture" action for http-request
This is only possible in frontends of course, but it will finally
make it possible to capture arbitrary http parts, including URL
parameters or parts of the message body.

It's worth noting that an ugly (char **) cast had to be done to
call sample_fetch_string(), which is caused by 5 or 6 levels
of inheritance of this type in the API. Here it's harmless since
the function uses it as a const, but this API madness must be
fixed, starting with the one or two rare functions that modify
the args and inflict this on each and every keyword parser.
(cherry picked from commit 484a4f38460593919a1c1d9a047a043198d69f45)
2015-05-08 15:43:54 +02:00
Willy Tarreau
a5910cc6ef MEDIUM: http: provide 3 fetches for the body
Body processing is still fairly limited, but this is a start. It becomes
possible to apply regex to find contents in order to decide where to route
a request for example. Only the first chunk is parsed for now, and the
response is not yet available (the parsing function must be duplicated for
this).

req.body : binary
  This returns the HTTP request's available body as a block of data. It
  requires that the request body has been buffered and made available using
  "option http-buffer-request". In case of a chunked-encoded body, currently
  only the first chunk is analyzed.

req.body_len : integer
  This returns the length of the HTTP request's available body in bytes. It may
  be lower than the advertised length if the body is larger than the buffer. It
  requires that the request body has been buffered and made available using
  "option http-buffer-request".

req.body_size : integer
  This returns the advertised length of the HTTP request's body in bytes. It
  will represent the advertised Content-Length header, or the size of the first
  chunk in case of chunked encoding. In order to parse the chunks, it requires
  that the request body has been buffered and made available using
  "option http-buffer-request".
2015-05-02 00:46:08 +02:00
Willy Tarreau
9fbe18e174 MEDIUM: http: add a new option http-buffer-request
It is sometimes desirable to wait for the body of an HTTP request before
taking a decision. This is what is being done by "balance url_param" for
example. The first use case is to buffer requests from slow clients before
connecting to the server. Another use case consists in taking the routing
decision based on the request body's contents. This option placed in a
frontend or backend forces the HTTP processing to wait until either the whole
body is received, or the request buffer is full, or the first chunk is
complete in case of chunked encoding. It can have undesired side effects with
some applications abusing HTTP by expecting unbuffered transmissions between
the frontend and the backend, so this should definitely not be used by
default.

Note that it would not work for the response because we don't reset the
message state before starting to forward. For the response we need to
1) reset the message state to MSG_100_SENT or BODY, and 2) reset
body_len in case of chunked encoding to avoid counting it twice.
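
A minimal sketch, e.g. to let "balance url_param" see a parameter located
in the body (the parameter name is illustrative):

    backend app
        option http-buffer-request
        balance url_param userid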
2015-05-02 00:10:44 +02:00
Willy Tarreau
e115b49c39 BUG/MEDIUM: http: wait for the exact amount of body bytes in wait_for_request_body
Due to the fact that we were still considering only msg->sov for the
first byte of data after calling http_parse_chunk_size(), we used to
miscompute the input data size and to count the CRLF and the chunk size
as part of the input data. The effect is that it was possible to release
the processing with 3 or 4 missing bytes, especially if they're typed by
hand during debugging sessions. This can cause the stats page to return
some errors in admin mode, and the url_param balance algorithm to fail
to properly hash a body input.

This fix must be backported to 1.5.
2015-05-01 23:24:32 +02:00
Willy Tarreau
0f228a037a MEDIUM: http: add option-ignore-probes to get rid of the floods of 408
Recently some browsers started to implement a "pre-connect" feature
consisting in speculatively connecting to some recently visited web sites
just in case the user would like to visit them. This results in many
connections being established to web sites, which end up in 408 Request
Timeout if the timeout strikes first, or 400 Bad Request when the browser
decides to close them first. These ones pollute the log and feed the error
counters. There was already "option dontlognull" but it's insufficient in
this case. Instead, this option does the following things :
   - prevent any 400/408 message from being sent to the client if nothing
     was received over a connection before it was closed ;
   - prevent any log from being emitted in this situation ;
   - prevent any error counter from being incremented

That way the empty connection is silently ignored. Note that it is better
not to use this unless it is clear that it is needed, because it will hide
real problems. The most common reason for not receiving a request and seeing
a 408 is due to an MTU inconsistency between the client and an intermediary
element such as a VPN, which blocks too large packets. These issues are
generally seen with POST requests as well as GET with large cookies. The logs
are often the only way to detect them.

This patch should be backported to 1.5 since it avoids false alerts and
makes it easier to monitor haproxy's status.
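
A minimal sketch, assuming the final keyword is "option http-ignore-probes"
as documented:

    frontend www
        option http-ignore-probes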
2015-05-01 15:39:23 +02:00
Willy Tarreau
13317669d5 MEDIUM: http: disable support for HTTP/0.9 by default
There's not much reason for continuing to accept HTTP/0.9 requests
nowadays except for manual testing. Now we disable support for these
by default, unless option accept-invalid-http-request is specified,
in which case they continue to be upgraded to 1.0.
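
A minimal sketch re-enabling the old behaviour for manual testing:

    frontend legacy
        option accept-invalid-http-request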
2015-05-01 14:57:54 +02:00
Willy Tarreau
91852eb428 MEDIUM: http: restrict the HTTP version token to 1 digit as per RFC7230
While RFC2616 used to allow an indeterminate number of digits for the
major and minor components of the HTTP version, RFC7230 has reduced
that to a single digit for each.

If a server can't properly parse the version string and falls back to 0.9,
it could then send a head-less response whose payload would be taken for
headers, which could confuse downstream agents.

Since there's no more reason for supporting a version scheme that was
never used, let's upgrade to the updated version of the standard. It is
still possible to enforce support for the old behaviour using options
accept-invalid-http-request and accept-invalid-http-response.

It would be wise to backport this to 1.5 as well just in case.
2015-05-01 14:57:01 +02:00
Willy Tarreau
b4d0c03aee BUG/MEDIUM: http: remove content-length form responses with bad transfer-encoding
The spec mandates that Content-Length must be removed from messages if
Transfer-Encoding is present, not just from valid ones.

This must be backported to 1.5 and 1.4.
2015-05-01 13:56:11 +02:00
Willy Tarreau
34dfc60571 BUG/MEDIUM: http: incorrect transfer-coding in the request is a bad request
The rules related to how to handle a bad transfer-encoding header (one
where "chunked" is not at the final place) have evolved to mandate an
abort when this happens in the request. Previously it was only a close
(which is still valid for the server side).

This must be backported to 1.5 and 1.4.
2015-05-01 13:56:10 +02:00
Willy Tarreau
4979d5c5d1 BUG/MEDIUM: http: do not restrict parsing of transfer-encoding to HTTP/1.1
While Transfer-Encoding is HTTP/1.1, we must still parse it in HTTP/1.0
in case an agent sends it, because it's likely that the other side might
use it as well, causing confusion. This will also result in getting rid
of the Content-Length header in such abnormal situations and in having
a clean connection.

This must be backported to 1.5 and 1.4.
2015-05-01 13:56:10 +02:00
Willy Tarreau
557f199fb7 DOC: http: update the comments about the rules for determining transfer-length
Let's now use the text from RFC7230 which is stricter and more precise.

This must be backported to 1.5 and 1.4.
2015-05-01 13:56:10 +02:00
Willy Tarreau
1c91391df4 BUG/MEDIUM: http: remove content-length from chunked messages
RFC7230 clarified the behaviour to adopt when facing both a
content-length and a transfer-encoding: chunked in a message. While
haproxy already complied with the method for getting the message
length right, and used to detect improper content-length duplicates,
it still did not remove the content-length header when facing a
transfer-encoding: chunked. Usually it is not a problem since other
agents (clients and servers) are required to parse the message
according to the rules that have been in place since RFC2616 in
1999.

However Régis Leroy reported the existence of at least one such
non-compliant agent, so haproxy could be abused to get out of sync
with it on pipelined requests (an HTTP request smuggling attack),
making it consider part of a payload as a subsequent request.

The best thing to do is then to remove the content-length according
to RFC7230. It used to be in the todo list with a fixme in the code
while waiting for the standard to stabilize, let's apply it now that
it's published.

Thanks to Régis for bringing that subject to our attention.

This fix must be backported to 1.5 and 1.4.
2015-05-01 13:56:10 +02:00
Thierry FOURNIER
7f6192c0d3 BUG/MEDIUM: http: functions set-{path,query,method,uri} breaks the HTTP parser
When one of these functions replaces a part of the query string with a
shorter or longer new one, the header parsing is broken. This is
because the start of the first header is not updated.

In the same way, the total length of the request line is not updated.
I don't see any bug caused by this omission, but I guess it is better
to store the correct length.

This bug is only in the development version.
2015-04-27 11:56:52 +02:00
Willy Tarreau
ee335e65dc BUG/MEDIUM: http: properly retrieve the front connection
Commit 350f487 ("CLEANUP: session: simplify references to chn_{prod,cons}(&s->{req,res})")
introduced a regression causing the cli_conn to be picked from the server
side instead of the client side, so the XFF header is not appended anymore
since the connection is NULL.

Thanks to Reinis Rozitis for reporting this bug. No backport is needed
as it's 1.6-specific.
2015-04-21 18:15:13 +02:00
Willy Tarreau
152b81e7b2 BUG/MAJOR: tcp/http: fix current_rule assignment when restarting over a ruleset
Commit bc4c1ac ("MEDIUM: http/tcp: permit to resume http and tcp custom
actions") introduced the ability to interrupt and restart processing in
the middle of a TCP/HTTP ruleset. But it doesn't do it in a consistent
way : it checks current_rule_list, immediately dereferences current_rule,
which is only set in certain cases and never cleared. That broke the
tcp-request content rules when the processing was interrupted due to
missing data, because current_rule was not yet set (segfault) or could
have been inherited from another ruleset if it was used in a backend
(random behaviour).

The proper way to do it is to always set current_rule before dereferencing
it. But we don't want to set it for all rules because we don't want any
action to provide a checkpointing mechanism. So current_rule is set to NULL
before entering the loop, and only used if not NULL and if current_rule_list
matches the current list. This way they both serve as a guard for the other
one. This fix also makes the current rule point to the rule instead of its
list element, as it's much easier to manipulate.

No backport is needed, this is 1.6-specific.
2015-04-20 13:46:20 +02:00
CJ Ess
108b1dd69d MEDIUM: http: configurable http result codes for http-request deny
This patch adds support for error codes 429 and 405 to Haproxy and a
"deny_status XXX" option to "http-request deny" where you can specify which
code is returned, with 403 being the default. We really want to do this the
"haproxy way" and hope to have this patch included in the mainline. We'll
be happy to address any feedback on how this is implemented.
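
A minimal sketch of the new option (the condition is illustrative):

    http-request deny deny_status 405 if { method POST }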
2015-04-11 10:34:54 +02:00
Willy Tarreau
d0d8da989b MINOR: stream: provide a few helpers to retrieve frontend, listener and origin
Expressions are quite long when using strm_sess(strm)->whatever, so let's
provide a few helpers : strm_fe(), strm_li(), strm_orig().
2015-04-06 11:37:29 +02:00
Willy Tarreau
192252e2d8 MAJOR: sample: pass a pointer to the session to each sample fetch function
Many such functions need a session, and till now they used to dereference
the stream. Once we remove the stream from the embryonic session, this
will not be possible anymore.

So as of now, sample fetch functions will be called with this :

   - sess = NULL,  strm = NULL                     : never
   - sess = valid, strm = NULL                     : tcp-req connection
   - sess = valid, strm = valid, strm->txn = NULL  : tcp-req content
   - sess = valid, strm = valid, strm->txn = valid : http-req / http-res
2015-04-06 11:37:25 +02:00
Willy Tarreau
987e3fb868 MEDIUM: http: remove the now useless http_txn from {req/res} rules
The registerable http_req_rules / http_res_rules used to require a
struct http_txn at the end. It's redundant with struct stream and
propagates very deep into some parts (ie: it was the reason for lua
requiring l7). Let's remove it now.
2015-04-06 11:35:53 +02:00
Willy Tarreau
15e91e1b36 MAJOR: sample: don't pass l7 anymore to sample fetch functions
All of them can now retrieve the HTTP transaction *if it exists* from
the stream and be sure to get NULL there when called with an embryonic
session.

The patch is a bit large because many locations were touched (all fetch
functions had to have their prototype adjusted). The opportunity was
taken to also uniformize the call names (the stream is now always "strm"
instead of "l4") and to fix indent where it was broken. This way when
we later introduce the session here there will be less confusion.
2015-04-06 11:35:53 +02:00
Willy Tarreau
eee5b51248 MAJOR: http: move http_txn out of struct stream
Now this one is dynamically allocated. It means that 280 bytes of memory
are saved per TCP stream, but more importantly that it will become
possible to remove the l7 pointer from fetches and converters since
it will be deduced from the stream and will support being null.

A lot of care was taken because it's easy to forget a test somewhere,
and the previous code used to always trust s->txn for being valid, but
all places seem to have been visited.

All HTTP fetch functions check the txn first so we shouldn't have any
issue there even when called from TCP. When branching from a TCP frontend
to an HTTP backend, the txn is properly allocated at the same time as the
hdr_idx.
2015-04-06 11:35:52 +02:00
Willy Tarreau
63986c72c8 MINOR: http: create a dedicated pool for http_txn
This one will not necessarily be allocated for each stream, and we want
to use the fact that it equals null to know it's not present so that we
can always deduce its presence from the stream pointer.

This commit only creates the new pool.
2015-04-06 11:35:52 +02:00
Willy Tarreau
cb7dd015be MEDIUM: http: move header captures from http_txn to struct stream
The header captures are now general purpose captures since tcp rules
can use them to capture various contents. That removes a dependency
on http_txn that appeared in some sample fetch functions and in the
order by which captures and http_txn were allocated.

Interestingly, the reset of the header captures was done in too many
places, as http_init_txn() used to do it while it was previously done
at every call site.
2015-04-06 11:35:52 +02:00
Willy Tarreau
53c9b4db41 CLEANUP: sample: remove useless tests in fetch functions for l4 != NULL
The stream may never be null given that all these functions are called
from sample_process(). Let's remove this now confusing test which
sometimes happens after a dereference was already done.
2015-04-06 11:35:52 +02:00
Willy Tarreau
9ad7bd48d2 MEDIUM: session: use the pointer to the origin instead of s->si[0].end
When s->si[0].end was dereferenced as a connection or anything in
order to retrieve information about the originating session, we'll
now use sess->origin instead so that when we have to chain multiple
streams in HTTP/2, we'll keep accessing the same origin.
2015-04-06 11:34:29 +02:00
Willy Tarreau
e36cbcb3b0 MEDIUM: stream: move the frontend's pointer to the session
Just like for the listener, the frontend is session-wide so let's move
it to the session. There are a lot of places which were changed but the
changes are minimal in fact.
2015-04-06 11:23:58 +02:00
Willy Tarreau
fb0afa77c9 MEDIUM: stream: move the listener's pointer to the session
The listener is session-specific, move it there.
2015-04-06 11:23:57 +02:00
Willy Tarreau
e7dff02dd4 REORG/MEDIUM: stream: rename stream flags from SN_* to SF_*
This is in order to keep things consistent.
2015-04-06 11:23:57 +02:00
Willy Tarreau
87b09668be REORG/MAJOR: session: rename the "session" entity to "stream"
With HTTP/2, we'll have to support multiplexed streams. A stream is in
fact the largest part of what we currently call a session, it has buffers,
logs, etc.

In order to catch any error, this commit removes any reference to the
struct session and tries to rename most "session" occurrences in function
names to "stream" and "sess" to "strm" when that's related to a session.

The files stream.{c,h} were added and session.{c,h} removed.

The session will be reintroduced later and a few parts of the stream
will progressively be moved over there. It will more or less contain
only what we need in an embryonic session.

Sample fetch functions and converters will have to change a bit so
that they'll use an L5 (session) instead of what's currently called
"L4" which is in fact L6 for now.

Once all changes are completed, we should see approximately this :

   L7 - http_txn
   L6 - stream
   L5 - session
   L4 - connection | applet

There will be at most one http_txn per stream, and a same session will
possibly be referenced by multiple streams. A connection will point to
a session and to a stream. The session will hold all the information
we need to keep even when we don't yet have a stream.

Some more cleanup is needed because some code was already far from
being clean. The server queue management still refers to sessions at
many places while comments talk about connections. This will have to
be cleaned up once we have a server-side connection pool manager.
Stream flags "SN_*" still need to be renamed, it doesn't seem like
any of them will need to move to the session.
2015-04-06 11:23:56 +02:00
Willy Tarreau
cb703b0352 BUG/MAJOR: http: null-terminate the http actions keywords list
Commit a0dc23f ("MEDIUM: http: implement http-request set-{method,path,query,uri}")
forgot to null-terminate the list, resulting in crashes when these actions
are used if the platform doesn't pad the struct with nulls.

Thanks to Gunay Arslan for reporting a detailed trace showing the
origin of this bug.

No backport to 1.5 is needed.
2015-04-03 09:58:02 +02:00
Willy Tarreau
601a4d1741 BUG/MEDIUM: http: hdr_cnt would not count any header when called without name
It's documented that these sample fetch functions should count all headers
and/or all values when called with no name, but in practice that's not
what happens: a missing name causes an immediate return and an absence
of result.

This bug is present in 1.5 as well and must be backported.
2015-04-01 19:16:09 +02:00
Willy Tarreau
615105e7e8 MEDIUM: compression: add a distinction between UA- and config- algorithms
Thanks to MSIE/IIS, the "deflate" name is ambiguous. According to the RFC
it's a zlib-wrapped deflate stream, but IIS used to send only a raw deflate
stream, which is the only format MSIE understands for "deflate". The other
widely used browsers do support both formats. For this reason some people
prefer to emit a raw deflate stream on "deflate" to serve more users even
if that means violating the standards. Haproxy only follows the standard,
so these users cannot do this.

This patch makes it possible to have one algorithm name in the configuration
and another one in the protocol. This will make it possible to have a new
configuration token to add a different algorithm so that users can decide if
they want a raw deflate or the standard one.
2015-03-28 16:46:38 +01:00
Willy Tarreau
e7e49a8d0b MINOR: http: check the algo name "identity" instead of the function pointer
The next patch will make all compression functions static, so let's stop
relying on a function pointer comparison and use the algo name instead.
2015-03-28 15:43:17 +01:00
Thierry FOURNIER
7fe75e0dab MINOR: http: export function inet_set_tos()
This is used by Lua.
2015-03-18 11:34:06 +01:00
Thierry FOURNIER
5531f87ace MINOR: http: split http_transform_header() function in two parts.
This function is a callback for HTTP actions. It creates the
replacement string from a build_logline() format and transforms
the header.

This patch splits this function in two parts. With this modification,
the header transformation and the replacement string construction are
separated.

We can now transform the header with a replacement string coming from
another source than a build_logline() format.
2015-03-18 11:34:06 +01:00
Thierry FOURNIER
b77aece24a MINOR: http: split the function http_action_set_req_line() in two parts
The first part is the replacement engine. It takes a replacement action
number and a replacement string and processes the action.

The second part is the function which is called by the 'http-request
action' to replace a request line part. This function builds the
string used as the replacement.

This split permits using the replacement engine in parts of the code
other than the request action. Lua uses it for its own HTTP action.
2015-03-18 11:34:06 +01:00
Thierry FOURNIER
63d692c037 MEDIUM: http: allows 'R' and 'S' in the protocol alphabet
This patch allows 'R' and 'S' in the protocol/version
alphabet. It permits processing RTSP requests like HTTP.
2015-03-17 16:19:52 +01:00
Thierry FOURNIER
5a33ac78ad MEDIUM/CLEANUP: http: rewrite and lighten http_transform_header() prototype
The http_transform_header() function prototype had some parameters
which could be guessed from other parameters. This patch removes
these parameters.
2015-03-17 11:42:43 +01:00
Thierry FOURNIER
191f9efdc5 BUG/MEDIUM: http: the function "(req|res)-replace-value" doesn't respect the HTTP syntax
These functions used an invalid header parser:
 - trailing white-spaces were embedded in the replacement regex,
 - double-quoted (") values containing commas (,) were not respected.

This patch replaces this parser with the "official" parser http_find_header2().
2015-03-17 11:42:43 +01:00
Thierry FOURNIER
534101658d BUG/MAJOR: http: don't read past buffer's end in http_replace_value
The function http_replace_value used the wrong variable to detect the
end of the input string.

Regression introduced by the patch "MEDIUM: regex: Remove null
terminated strings." (c9c2daf2)

This patch needs to be backported into the 1.5 stable branch.

WT: there is no possibility to overwrite existing data as we only read
    past the end of the request buffer, to copy into the trash. The copy
    is bounded by buffer_replace2(), just like the replacement performed
    by exp_replace(). However if a buffer happens to contain non-zero data
    up to the next unmapped page boundary, there's a theoretical risk of
    crashing the process despite this not being reproducible in tests.
    The risk is low because "http-request replace-value" did not work due
    to this bug so that probably means it's not used yet.
2015-03-16 14:20:07 +01:00
Thierry FOURNIER
01c30124ae BUG/MEDIUM: http: the action set-{method|path|query|uri} doesn't run.
This bug was introduced by the commit "MEDIUM: http/tcp: permit to
resume http and tcp custom actions" (bc4c1ac6ad).

Before this patch, the return code of the function was ignored.
After this patch, if the function returns 0, it means a YIELD.

The function http_action_set_req_line() returned 0 in the success case.

This patch changes the return code of this function.
2015-03-14 15:53:31 +01:00
Jesse Hathaway
2468d4e4f7 MEDIUM: http: Compress HTTP responses with status codes 201,202,203 in addition to 200
It is common for REST applications to return status codes other than
200, so compress the other common 200-level responses which might
contain content.
2015-03-11 23:23:41 +01:00
Willy Tarreau
350f487300 CLEANUP: session: simplify references to chn_{prod,cons}(&s->{req,res})
These 4 combinations are needlessly complicated since the session already
has direct access to the associated stream interfaces without having to
check an indirect pointer.
2015-03-11 20:41:47 +01:00
Willy Tarreau
73796535a9 REORG/MEDIUM: channel: only use chn_prod / chn_cons to find stream-interfaces
The purpose of these two macros will be to pass via the session to
find the relevant stream interfaces so that we don't need to store
the ->cons nor ->prod pointers anymore. Currently they're only defined
so that all references could be removed.

Note that many places need a second pass of clean up so that we don't
have any chn_prod(&s->req) anymore and only &s->si[0] instead, and
conversely for the 3 other cases.
2015-03-11 20:41:47 +01:00
Willy Tarreau
a5f5d8dc69 MEDIUM: stream-int: add a flag indicating which side the SI is on
This new flag "SI_FL_ISBACK" is set only on the back SI and is cleared
on the front SI. That way it's possible only by looking at the SI to
know what side it is.
2015-03-11 20:41:46 +01:00
Willy Tarreau
2bb4a96f8f REORG/MEDIUM: stream-int: introduce si_ic/si_oc to access channels
We'll soon remove direct references to the channels from the stream
interface since everything belongs to the same session, so let's
first not dereference si->ib / si->ob anymore and use macros instead.
2015-03-11 20:41:46 +01:00
Willy Tarreau
22ec1eadd0 REORG/MAJOR: move session's req and resp channels back into the session
The channels were pointers to outside structs and this is not needed
anymore since the buffers have moved, but this complicates operations.
Move them back into the session so that both channels and stream interfaces
are always allocated for a session. Some places (some early sample fetch
functions) used to validate that a channel was NULL prior to dereferencing
it. Now instead we check if chn->buf is NULL and we force it to remain NULL
until the channel is initialized.
2015-03-11 20:41:46 +01:00
Willy Tarreau
612adb8459 BUG/MAJOR: http: fix stats regression consecutive to HTTP_RULE_RES_YIELD
Commit bc4c1ac ("MEDIUM: http/tcp: permit to resume http and tcp custom actions")
unfortunately broke the stats applet by moving the clearing of the analyser bit
after processing the applet headers. It used to work only in HTTP/1.1 and not
in HTTP/1.0. This is 1.6-specific, no backport is needed.
2015-03-10 15:33:55 +01:00
Thierry FOURNIER
bc4c1ac6ad MEDIUM: http/tcp: permit to resume http and tcp custom actions
Later, the processing of some actions will need to be interrupted and
resumed. This patch permits resuming the actions. The actions that need
to run in resume mode are not yet available; they will come soon with
the Lua patches, so the code added by this patch is untestable for the
moment.

The "tcp_exec_req_rules" list cannot resume because it is called by the
unresumable function "accept_session".
2015-02-28 23:12:33 +01:00
Thierry FOURNIER
9e2ef999a9 MEDIUM: http: change the code returned by the response processing rule functions
Currently, this function returns a pointer to the rule that stopped
the evaluation of the rule list. Later we will integrate the possibility
of interrupting and resuming the processing of some actions. The current
return mode is not sufficient to convey the "interrupt" information.

The returned pointer is never used, so I change the return type of
this function to an enum. With this enum, the function is ready to
return the "interrupt" value.
2015-02-28 23:12:33 +01:00
Thierry FOURNIER
49f45af9aa MINOR: global: export many symbols.
The functions "val_payload_lv" and "val_hdr" are useful with
lua. The lua automatic binding for sample fetchs needs to
compare check functions.

The "arg_type_names" permit to display error messages.
2015-02-28 23:12:32 +01:00
Thierry FOURNIER
f41a809dc9 MINOR: sample: add private argument to the struct sample_fetch
This private argument is added to prepare the integration
of the Lua fetches.
2015-02-28 23:12:31 +01:00
Thierry FOURNIER
68a556e282 MINOR: converters: give the session pointer as converter argument
Some usages of the converters need to know the attached session. Lua
needs the session to retrieve its running context. This patch adds
the session as an argument to the converter prototype.
2015-02-28 23:12:31 +01:00
Thierry FOURNIER
1edc971919 MINOR: converters: add a "void *private" argument to converters
This permits storing a specific configuration pointer. It is useful
for the future Lua integration.
2015-02-28 23:12:31 +01:00
Willy Tarreau
eb27ec7569 MINOR: http: add the new sample fetches req.hdr_names and res.hdr_names
These new sample fetches retrieve the list of header names as they appear
in the request or response. This can be used for debugging, for statistics
as well as an aid to better detect the presence of proxies or plugins on
some browsers, which alter the request compared to a regular browser by
adding or reordering headers.
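
A minimal debugging sketch using the new fetch (the header name is
illustrative):

    http-request set-header X-Debug-Hdr-Names %[req.hdr_names(,)]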
2015-02-20 14:00:44 +01:00
Willy Tarreau
c90dc23e99 MINOR: http: add a new function to iterate over each header line
New function http_find_next_header() will be used to scan all the input
headers for various processing and for http/1 to http/2 header mapping.
2015-02-20 14:00:44 +01:00
Willy Tarreau
34d4c3c13f BUG/MINOR: http: abort request processing on filter failure
Commit c600204 ("BUG/MEDIUM: regex: fix risk of buffer overrun in
exp_replace()") added a control of failure on the response headers,
but forgot to check for the error during request processing. So if
the filters fail to apply, we could keep the request. It might
cause some headers to silently fail to be added for example. Note
that it's tagged MINOR because a standard configuration cannot make
this case happen.

The fix should be backported to 1.5 and 1.4 though.
2015-01-30 20:58:58 +01:00
Willy Tarreau
aa435e7d7e BUG/MINOR: http: fix incorrect header value offset in replace-hdr/replace-value
The two http-req/http-resp actions "replace-hdr" and "replace-value" were
expecting exactly one space after the colon, which is wrong. It was causing
the first char not to be seen/modified when no space was present, and empty
headers not to be modified either. Instead of using name->len+2, we must use
ctx->val which points to the first character of the value even if there is
no value.

This fix must be backported into 1.5.
2015-01-29 14:01:34 +01:00
Willy Tarreau
a0dc23f093 MEDIUM: http: implement http-request set-{method,path,query,uri}
This commit implements the following new actions :

- "set-method" rewrites the request method with the result of the
  evaluation of format string <fmt>. There should be very few valid reasons
  for having to do so as this is more likely to break something than to fix
  it.

- "set-path" rewrites the request path with the result of the evaluation of
  format string <fmt>. The query string, if any, is left intact. If a
  scheme and authority is found before the path, they are left intact as
  well. If the request doesn't have a path ("*"), this one is replaced with
  the format. This can be used to prepend a directory component in front of
  a path for example. See also "set-query" and "set-uri".

  Example :
      # prepend the host name before the path
      http-request set-path /%[hdr(host)]%[path]

- "set-query" rewrites the request's query string which appears after the
  first question mark ("?") with the result of the evaluation of format
  string <fmt>. The part prior to the question mark is left intact. If the
  request doesn't contain a question mark and the new value is not empty,
  then one is added at the end of the URI, followed by the new value. If
  a question mark was present, it will never be removed even if the value
  is empty. This can be used to add or remove parameters from the query
  string. See also "set-query" and "set-uri".

  Example :
      # replace "%3D" with "=" in the query string
      http-request set-query %[query,regsub(%3D,=,g)]

- "set-uri" rewrites the request URI with the result of the evaluation of
  format string <fmt>. The scheme, authority, path and query string are all
  replaced at once. This can be used to rewrite hosts in front of proxies,
  or to perform complex modifications to the URI such as moving parts
  between the path and the query string. See also "set-path" and
  "set-query".

All of them are handled by the same parser and the same exec function,
which is why they're merged all together. For once, instead of adding
even more entries to the huge switch/case, we used the new facility to
register action keywords. A number of the existing ones should probably
move there as well.
2015-01-23 20:27:41 +01:00
Willy Tarreau
15a53a4384 MEDIUM: regex: add support for passing regex flags to regex_exec_match()
This function (and its sister regex_exec_match2()) abstract the regex
execution but make it impossible to pass flags to the regex engine.
Currently we don't use them but we'll need to support REG_NOTBOL soon
(to indicate that we're not at the beginning of a line). So let's add
support for this flag and update the API accordingly.
2015-01-22 14:24:53 +01:00
Willy Tarreau
8560328211 BUG/MEDIUM: http: make http-request set-header compute the string before removal
The way http-request/response set-header works is stupid. For a naive
reuse of the del-header code, it removes all occurrences of the header
to be set before computing the new format string. This makes it almost
unusable because it is not possible to append values to an existing
header without first copying them to a dummy header, performing the
copy back and removing the dummy header.

Instead, let's share the same code as add-header and perform the optional
removal after the string is computed. That way it becomes possible to
write things like :

   http-request set-header X-Forwarded-For %[hdr(X-Forwarded-For)],%[src]

Note that this change is not expected to have any undesirable impact on
existing configs since if they rely on the bogus behaviour, they don't
work as they always retrieve an empty string.

This fix must be backported to 1.5 to stop the spread of ugly configs.
2015-01-21 20:45:00 +01:00
Willy Tarreau
49ad95cc8e MINOR: http: add a new fetch "query" to extract the request's query string
This fetch extracts the request's query string, which starts after the first
question mark. If no question mark is present, this fetch returns nothing. If
a question mark is present but nothing follows, it returns an empty string.
This means it's possible to easily know whether a query string is present
using the "found" matching method. This fetch is the complement of "path",
which stops before the question mark.
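
A minimal sketch using the "found" matching method mentioned above:

    acl has_qs query -m found
    use_backend search if has_qs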
2015-01-20 19:47:47 +01:00
Willy Tarreau
319f745ba0 MINOR: channel: rename bi_erase() to channel_truncate()
It applies to the channel and it doesn't erase outgoing data, only
pending unread data, which is strictly equivalent to what recv()
does with MSG_TRUNC, so that new name is more accurate and intuitive.
2015-01-14 20:32:59 +01:00
Willy Tarreau
ba0902ede4 CLEANUP: channel: rename channel_reserved -> channel_is_rewritable
channel_reserved is confusingly named. It is used to know whether or
not the rewrite area is left intact for situations where we want to
ensure we can use it before proceeding. Let's rename it to fix this
confusion.
2015-01-14 18:41:33 +01:00
Willy Tarreau
7c1c217426 BUG/MEDIUM: http: fix header removal when previous header ends with pure LF
In 1.4-dev7, a header removal mechanism was introduced with commit 68085d8
("[MINOR] http: add http_remove_header2() to remove a header value."). Due
to a typo in the function, the beginning of the headers gets desynchronized
if the header preceding the deleted one ends with an LF/CRLF combination
different from the one of the removed header. The reason is that while
rewinding the pointer, we go back by a number of bytes taking into account
the LF/CRLF status of the removed header instead of the previous one. The
case where it fails is in http-request del-header/set-header where
multiple occurrences of a header are present and their LF/CRLF ending
differs from the preceding header. The loop then stops because no more
headers are found given that the names and lengths do not match.

Another point to take into consideration is that removing headers using
a loop of http_find_header2() and this function is inefficient since we
remove values one at a time while it could be simpler and faster to
remove full header lines. This is something that should be addressed
separately.

This fix must be backported to 1.5 and 1.4. Note that http-send-name-header
relies on this function as well so it could be possible that some of the
issues encountered with it in 1.4 come from this bug.
2015-01-07 17:23:50 +01:00
Willy Tarreau
f2f7d6b27b MEDIUM: buffer: add a new buf_wanted dummy buffer to report failed allocations
Doing so ensures that even when no memory is available, we leave the
channel in a sane condition. There's a special case in proto_http.c
regarding the compression, we simply pre-allocate the tmpbuf to point
to the dummy buffer. Not reusing &buf_empty for this allows the rest
of the code to differentiate an empty buffer that's not used from an
empty buffer that results from a failed allocation, which has the same
semantics as a full buffer.
2014-12-24 23:47:32 +01:00
Willy Tarreau
e583ea583a MEDIUM: buffer: use b_alloc() to allocate and initialize a buffer
b_alloc() now allocates a buffer and initializes it to the size specified
in the pool minus the size of the struct buffer itself. This ensures that
callers do not need to care about buffer details anymore. Also this never
applies memory poisoning, which is slow and useless on buffers.
2014-12-24 23:47:32 +01:00
Godbach
d972203fbc BUG/MINOR: parse: refer curproxy instead of proxy
Since during the parsing stage curproxy always represents the proxy being
operated on, referring to proxy was a mistake.

Signed-off-by: Godbach <nylzhaowei@gmail.com>
2014-12-18 11:01:51 +01:00
Godbach
1f1fae6202 BUG/MINOR: http: fix typo: "401 Unauthorized" => "407 Unauthorized"
401 Unauthorized => 407 Unauthorized

Signed-off-by: Godbach <nylzhaowei@gmail.com>
2014-12-17 17:05:49 +01:00
Willy Tarreau
5506e3f8b6 BUG/MINOR: stats: correctly set the request/response analysers
When enabling stats, response analysers were set on the request
analyser list, which 1) has no effect, and 2) means we don't have
the response analysers properly set.

In practice these response analysers are set when the connection
to the server or applet is established so we don't need/must not
set them here.

Fortunately this bug had no impact since the flags are distinct,
but it definitely is confusing.

It should be backported to 1.5.
2014-11-21 17:53:08 +01:00
Cyril Bonté
a83a50bd7d BUG/MINOR: log: fix request flags when keep-alive is enabled
Colin Ingarfield reported some unexplainable flags in the logs.
For example, a "LR" termination state was set on a request which was forwarded
to a server, where "LR" means that the request should have been handled
internally by haproxy.

This case happens when at least client side keep-alive is enabled. Next
requests in the connection will inherit the flags from the previous request.

2 fields are impacted : "termination_state" and "Tt" in the timing events,
where a "+" can be added, when a previous request was redispatched.

This is not critical for the service itself but can confuse troubleshooting.

The fix must be backported to 1.5 and 1.4.
2014-10-22 22:37:30 +02:00
Willy Tarreau
7d59e90473 BUG/MEDIUM: http: don't dump debug headers on MSG_ERROR
When the HTTP parser is in state HTTP_MSG_ERROR, we don't know if it was
already initialized or not. If the error happens before HTTP_MSG_RQBEFORE,
random offsets might be present and we don't want to display such random
strings in debug mode.

While it's theoretically possible to randomly crash the process when running
in debug mode here, this bug was not tagged MAJOR because it would not
make sense to run in debug mode in production.

This fix must be backported to 1.5 and 1.4.
2014-10-22 19:25:09 +02:00
Willy Tarreau
e1cfc1f2b4 BUG/MINOR: config: do not accept more track-sc than configured
MAX_SESS_STKCTR allows one to define the number of stick counters that can
be used in parallel in track-sc* rules. The naming of this macro creates
some confusion because the value there is sometimes used as a max instead
of a count, and the config parser accepts values from 0 to MAX_SESS_STKCTR
and the processing ignores anything tracked on the last one. This means
that by default, track-sc3 is allowed and ignored.
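
For illustration, a hedged sketch, assuming the default build where
MAX_SESS_STKCTR is 3 (so only track-sc0..track-sc2 are valid):

  tcp-request content track-sc0 src
  # before this fix, "track-sc3" was accepted by the config parser
  # but silently ignored at run time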

This fix must be backported to 1.5, where the problem only affects
TCP rules.
2014-10-17 11:53:05 +02:00
Willy Tarreau
4e21ff9244 BUG/MEDIUM: http: adjust close mode when switching to backend
Commit 179085c ("MEDIUM: http: move Connection header processing earlier")
introduced a regression : the backend's HTTP mode is not considered anymore
when setting the session's HTTP mode, because wait_for_request() is only
called once, when the frontend receives the request (or when the frontend
is in TCP mode, when the backend receives the request).

The net effect is that in some situations when the frontend and the backend
do not work in the same mode (eg: keep-alive vs close), the backend's mode
is ignored.

This patch moves all that processing to a dedicated function, which is
called from the original place, as well as from session_set_backend()
when switching from an HTTP frontend to an HTTP backend in different
modes.

This fix must be backported to 1.5.
2014-09-30 18:44:22 +02:00
Willy Tarreau
ce730de867 MEDIUM: http: enable header manipulation for 101 responses
Ryan Brock reported that server stickiness did not work for WebSocket
because the cookies and headers are not modified on 1xx responses. He
found that his browser correctly presents the cookies learned on 101
responses, which was not specifically defined in the WebSocket spec,
nor in the cookie spec. 101 is a very special case. Being part of 1xx,
it's an interim response. But within 1xx, it's special because it's
the last HTTP/1 response that transits on the wire, which is different
from 100 or 102 which may appear multiple times. So in that sense, we
can consider it as a final response regarding HTTP/1, and it makes
sense to allow header processing there. Note that we still ensure not
to mangle the Connection header, which is critical for HTTP upgrade to
continue to work smoothly with agents that are a bit picky about what
tokens are found there.

The rspadd rules are now processed for 101 responses as well, but the
cache-control checks are not performed (since no body is delivered).

Ryan confirmed that this patch works for him.

It would make sense to backport it to 1.5 given that it improves end
user experience on WebSocket servers.
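
As an illustration, a hedged configuration sketch (names and addresses
are made up); its cookie and rspadd processing now also applies to the
101 response, preserving stickiness across WebSocket upgrades:

  backend ws
    mode http
    cookie SRV insert indirect
    rspadd X-Served-By:\ haproxy
    server ws1 192.168.0.11:8080 cookie ws1
    server ws2 192.168.0.12:8080 cookie ws2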
2014-09-16 10:40:38 +02:00
Willy Tarreau
9dc1c61c43 BUG/CRITICAL: http: don't update msg->sov once data start to leave the buffer
Commit bb2e669 ("BUG/MAJOR: http: correctly rewind the request body
after start of forwarding") was incorrect/incomplete. It used to rely on
CF_READ_ATTACHED to stop updating msg->sov once data start to leave the
buffer, but this is unreliable because since commit a6eebb3 ("[BUG]
session: clear BF_READ_ATTACHED before next I/O") merged in 1.5-dev1,
this flag is only ephemeral and is cleared once all analysers have
seen it. So we can start updating msg->sov again each time we pass
through this place with new data. With a sufficiently large amount of
data, it is possible to make msg->sov wrap and validate the if()
condition at the top, causing the buffer to advance by about 2GB and
crash the process.

Note that the offset cannot be controlled by the attacker because it is
a sum of millions of small random sizes depending on how many bytes were
read by the server and how many were left in the buffer, only because
of the speed difference between reading and writing. Also, nothing is
written, the invalid pointer resulting from this operation is only read.

Many thanks to James Dempsey for reporting this bug and to Chris Forbes for
narrowing down the faulty area enough to make its root cause analysable.

This fix must be backported to haproxy 1.5.
2014-09-02 16:48:54 +02:00
Willy Tarreau
912c119557 BUG/MEDIUM: http: fix improper parsing of HTTP methods for use with ACLs
pat_parse_meth() had some remains of an early implementation attempt for
the patterns: it initialised a trash buffer but never set the pattern value
there. The result is that a non-standard method cannot be matched anymore. The bug
appeared during the pattern rework in 1.5, so this fix must be backported
there. Thanks to Joe Williams of GitHub for reporting the bug.
2014-08-29 15:15:50 +02:00
Willy Tarreau
4de2a94165 BUG/MEDIUM: http: fix inverted condition in pat_match_meth()
This results in a string-based HTTP method match returning true when
it doesn't match and conversely. This bug was reported by Joe Williams.

The fix must be backported to 1.5, though it still doesn't work because
of at least 3-4 other bugs in the long path which leads to building this
pattern list.
2014-08-28 20:42:57 +02:00
Thierry FOURNIER
7566e30477 BUG/MEDIUM: http: tarpit timeout is reset
Before commit bbba2a8ecc
(1.5-dev24-8), the tarpit section set the timeout and returned; after
this commit, it sets the timeout and then goes to the "done" label,
which resets the timeout.

Thanks Bryan Talbot for the bug report and analysis.

This should be backported in 1.5.
2014-08-22 11:58:02 +02:00
Baptiste Assmann
12cb00b216 BUG: config: error in http-response replace-header number of arguments
A couple of typos fixed in 'http-response replace-header':
- an error when counting the number of arguments
- a typo in the alert message

This should be backported to 1.5.
2014-08-08 17:50:57 +02:00
Willy Tarreau
09448f7d7c MEDIUM: http: add the track-sc* actions to http-request rules
Add support for http-request track-sc, similar to what is done in
tcp-request for backends. A new act_prm field was added to HTTP
request rules to store the track params (table, counter). Just
like for TCP rules, the table is resolved while checking for
config validity. The code was mostly copied from the TCP code
with the exception that here we also count the HTTP request count
and rate by hand. Probably something could be factored out in
the future.
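
For illustration, a hedged configuration sketch (the table parameters
and the threshold below are made up):

  frontend web
    mode http
    stick-table type ip size 100k expire 10m store http_req_rate(10s)
    # track the source address in this frontend's own table
    http-request track-sc0 src
    http-request deny if { sc0_http_req_rate gt 100 }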

It seems like tracking flags should be improved to mark each hook
which tracks a key so that we can have some check points where to
increase counters of the past if not done yet, a bit like is done
for TRACK_BACKEND.
2014-07-16 17:26:40 +02:00
Willy Tarreau
5ad6e1dc09 BUG/MINOR: http: base32+src should use the big endian version of base32
We're using the internal memory representation of base32 here, which is
wrong since these data might be exported to headers for logs or be used
to stick to a server and replicated to other peers. Let's convert base32
to big endian (network representation) when building the binary block.

This mistake is also present in 1.5; it would be better to backport the fix.
2014-07-15 21:36:10 +02:00
Thierry FOURNIER
055b9d5c63 MINOR: http: export the function 'smp_fetch_base32'
It's sometimes useful outside of proto_http.c.
2014-07-15 19:09:36 +02:00
Willy Tarreau
bb2e669f9e BUG/MAJOR: http: correctly rewind the request body after start of forwarding
Daniel Dubovik reported an interesting bug showing that the request body
processing was still not 100% fixed. If a POST request contained short
enough data to be forwarded at once before trying to establish the
connection to the server, we had no way to correctly rewind the body.

The first visible case is that balancing on a header does not always work
on such POST requests since the header cannot be found. But there are even
nastier implications which are that http-send-name-header would apply to
the wrong location and possibly even affect part of the request's body
due to an incorrect rewinding.

There are two options to fix the problem :
  - first one is to force the HTTP_MSG_F_WAIT_CONN flag on all hash-based
    balancing algorithms and http-send-name-header, but there's always a
    risk that any new algorithm forgets to set it ;

  - the second option is to account for the amount of skipped data before
    the connection establishes so that we always know the position of the
    request's body relative to the buffer's origin.

The second option is much more reliable and fits very well in the spirit
of the past changes to fix forwarding. Indeed, at the moment we have
msg->sov which points to the start of the body before headers are forwarded
and which equals zero afterwards (so it still points to the start of the
body before forwarding data). A minor change consists in always making it
point to the start of the body even after data have been forwarded. It means
that it can get a negative value (so we need to change its type to signed).

In order to avoid wrapping, we only do this as long as the other side of
the buffer is not connected yet.

Doing this definitely fixes the issues above for the requests. Since the
response cannot be rewound we don't need to perform any change there.

This bug was introduced/remained unfixed in 1.5-dev23 so the fix must be
backported to 1.5.
2014-07-10 19:29:45 +02:00
Willy Tarreau
506c69a50e BUILD: http: fix isdigit & isspace warnings on Solaris
As usual, when touching any is* function, Solaris complains about the
type of the element being checked. Better backport this to 1.5, since
nobody knows what the emitted code looks like when macros are used
instead of functions.
2014-07-08 01:13:34 +02:00
Willy Tarreau
6c616e0b96 BUG/MAJOR: sample: correctly reinitialize sample fetch context before calling sample_process()
We used to only clear flags when reusing the static sample before calling
sample_process(), but that's not enough because there's a context in samples
that can be used by some fetch functions such as auth, headers and cookies,
and not reinitializing it risks that a pointer of a different type is used
in the wrong context.

An example configuration which triggers the case consists in mixing hdr()
and http_auth_group() which both make use of contexts :

     http-request add-header foo2 %[hdr(host)],%[http_auth_group(foo)]

The solution is simple: initialize the whole sample, not just the flags.
This fix must be backported into 1.5 since it was introduced in 1.5-dev19.
2014-06-25 17:12:08 +02:00
Willy Tarreau
d713bcc326 BUG/MINOR: counters: do not untrack counters before logging
Baptiste Assmann reported a corner case in the releasing of stick-counters:
we release content-aware counters before logging. In the past this was not
a problem, but now that we can log them, it prevents logging their value.
Simply swapping the order of the log production and the release of the
counter fixes the issue.

This should be backported into 1.5.
2014-06-25 15:36:04 +02:00
Willy Tarreau
3caf2afabe BUG/MEDIUM: http: fetch "base" is not compatible with set-header
The sample fetch function "base" makes use of the trash which is also
used by set-header/add-header etc... everything which builds a formated
line. So we end up with some junk in the header if base is in use. Let's
fix this as all other fetches by using a trash chunk instead.

This bug was reported by Baptiste Assmann, and also affects 1.5.
2014-06-24 17:27:02 +02:00
Baptiste Assmann
92df370621 BUG/MINOR: config: http-request replace-header arg typo
http-request replace-header was introduced with a typo which prevented it
from being conditioned by an ACL.
This patch fixes the issue.
2014-06-24 11:13:33 +02:00
Willy Tarreau
6f0a7bac28 BUG/MAJOR: session: revert all the crappy client-side timeout changes
This is the 3rd regression caused by the changes below. The latest to
date was reported by Finn Arne Gangstad. If a server responds with no
content-length and the client's FIN is never received, either we leak
the client-side FD or we spin at 100% CPU if timeout client-fin is set.

Enough is enough. The amount of tricks needed to cover these side-effects
starts to look like used toilet paper stacked over a chocolate cake. I
don't want to eat that cake anymore!

All this to avoid reporting a server-side timeout when a client stops
uploading data and haproxy expires faster than the server... A lot of
"ifs" resulting in a technically valid log that doesn't always please
users, and whose alternative causes that many issues for all others
users.

So let's revert this crap merged since 1.5-dev25 :
  Revert "CLEANUP: http: don't clear CF_READ_NOEXP twice"
    This reverts commit 1592d1e72a.
  Revert "BUG/MEDIUM: http: clear CF_READ_NOEXP when preparing a new transaction"
    This reverts commit 77d29029af.
  Revert "BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called"
    This reverts commit 0943757a21.
  Revert "BUG/MEDIUM: http: disable server-side expiration until client has sent the body"
    This reverts commit 3bed5e9337.
  Revert "BUG/MEDIUM: http: correctly report request body timeouts"
    This reverts commit b9edf8fbec.
  Revert "BUG/MEDIUM: http/session: disable client-side expiration only after body"
    This reverts commit b1982e27aa.

If a cleaner AND SAFER way to do something equivalent is found in 1.6-dev, we *might*
consider backporting it to 1.5, but given the vicious bugs that have surfaced
since, I doubt it will happen any time soon.

Fortunately, that crap never made it into 1.4 so no backport is needed.
2014-06-23 15:47:00 +02:00
Thierry FOURNIER
c9c2daf283 MEDIUM: regex: Remove null terminated strings.
The new regex functions can take a string and a length. HAProxy buffers
are not null-terminated, and using the regex_exec* functions implies
appending this null character. This patch replaces those functions with
the ones taking a string and a length as input.

Only the file "proto_http.c" is changed, because it is executed far more
often than the others. The file "checks.c" has very low usage, so it is
not worth changing; furthermore, the buffers used by "checks.c" are
null-terminated.
2014-06-18 15:12:51 +02:00
Thierry FOURNIER
09af0d6d43 MEDIUM: regex: replace all standard regex function by own functions
This patch removes all references to standard regex in haproxy. The only
remaining references are in the regex.[ch] files.

In the file src/checks.c, the original function used a "pmatch" array
which was in fact unused. This patch removes it as well.
2014-06-18 15:07:57 +02:00
Willy Tarreau
b854392824 BUG/MINOR: http: fix typos in previous patch
When I renamed the modify-header action to replace-value, one of them
was mistakenly set to "replace-val" instead. Additionally, differentiation
of the two actions must be done on args[0][8] and not *args[8]. Thanks
Thierry for spotting...
2014-06-17 19:03:56 +02:00
Sasha Pachev
218f064f55 MEDIUM: http: add actions "replace-header" and "replace-value" in http-req/resp
This patch adds two new actions to http-request and http-response rulesets :
  - replace-header : replace a whole header line, suited for headers
                     which might contain commas
  - replace-value  : replace a single header value, suited for headers
                     defined as lists.

The match is a regex, and the replacement string takes a log-format
and supports back-references.
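
For illustration, a hedged sketch of both actions (the header names
and patterns are made up):

  # rewrite the whole Host header line
  http-request replace-header Host (.*)\.internal\.example \1.example.com
  # rewrite only the matching element of a comma-delimited list
  http-response replace-value Cache-Control max-age=[0-9]+ max-age=60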
2014-06-17 18:34:32 +02:00
Willy Tarreau
4bfc580dd3 MEDIUM: session: maintain per-backend and per-server time statistics
Using the last rate counters, we now compute the queue, connect, response
and total times per server and per backend with a 95% accuracy over the last
1024 samples. The operation is cheap so we don't need to condition it.
2014-06-17 17:15:56 +02:00
Willy Tarreau
54da8db40b MINOR: capture: extend the captures to support non-header keys
This patch adds support for captures with no header name. The purpose
is to allow extra captures to be defined and logged along with the
header captures.
2014-06-13 16:32:48 +02:00
Willy Tarreau
1592d1e72a CLEANUP: http: don't clear CF_READ_NOEXP twice
Last patch cleared the flag twice in the response, which is useless.
Thanks Lukas for spotting it :-)
2014-06-11 16:49:14 +02:00
Willy Tarreau
77d29029af BUG/MEDIUM: http: clear CF_READ_NOEXP when preparing a new transaction
Commit b1982e2 ("BUG/MEDIUM: http/session: disable client-side expiration
only after body") was tricky and caused an issue which was fixed by commit
0943757 ("BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are
not called"). But that's not enough, another issue was introduced and further
emphasized by last fix.

The issue is that the CF_READ_NOEXP flag needs to be cleared when waiting
for a new request over that connection, otherwise we cannot expire anymore
an idle connection waiting for a new request.

This explains the never-ending keep-alives reported by at least 3 different
people since dev24. No backport is needed.
2014-06-11 14:11:44 +02:00
Sasha Pachev
c600204ddf BUG/MEDIUM: regex: fix risk of buffer overrun in exp_replace()
Currently exp_replace() (which is used in reqrep/reqirep) is
vulnerable to a buffer overrun. I have been able to reproduce it using
the attached configuration file and issuing the following command:

  wget -O - -S -q http://localhost:8000/`perl -e 'print "a"x4000'`/cookie.php

str was only being checked in the while (str) loop condition, and it was
possible to read past it when more than one character was being accessed
per loop iteration.

WT:
   Note that this bug is only marked MEDIUM because configurations
   capable of triggering this bug are very unlikely to exist at all due
   to the fact that most rewrites consist of static string additions
   that largely fit into the reserved area (8kB by default).

   This fix should also be backported to 1.4 and possibly even 1.3 since
   it seems to have been present since 1.1 or so.

Config:
-------

  global
        maxconn         500
        stats socket /tmp/haproxy.sock mode 600

  defaults
        timeout client      1000
        timeout connect      5000
        timeout server      5000
        retries         1
        option redispatch

  listen stats
        bind :8080
        mode http
        stats enable
        stats uri /stats
        stats show-legends

  listen  tcp_1
        bind :8000
        mode            http
        maxconn 400
        balance roundrobin
        reqrep ^([^\ :]*)\ /(.*)/(.*)\.php(.*) \1\ /\3.php?arg=\2\2\2\2\2\2\2\2\2\2\2\2\2\4
        server  srv1 127.0.0.1:9000 check port 9000 inter 1000 fall 1
        server  srv2 127.0.0.1:9001 check port 9001 inter 1000 fall 1
2014-05-27 14:36:06 +02:00
Willy Tarreau
892337c8e1 MAJOR: server: use states instead of flags to store the server state
Servers used to have 3 flags to store a state; now they have 4 states
instead. This avoids lots of confusion from the 4 remaining undefined
flag combinations.

The encoding from the previous to the new states can be represented
this way :

  SRV_STF_RUNNING
   |  SRV_STF_GOINGDOWN
   |   |  SRV_STF_WARMINGUP
   |   |   |
   0   x   x     SRV_ST_STOPPED
   1   0   0     SRV_ST_RUNNING
   1   0   1     SRV_ST_STARTING
   1   1   x     SRV_ST_STOPPING

Note that the case where all bits were set used to exist and was randomly
dealt with. For example, the task was not stopped, the throttle value was
still updated and reported in the stats and in the http_server_state header.
It was the same if the server was stopped by the agent or for maintenance.

It's worth noting that the internal function names are still quite confusing.
2014-05-22 11:27:00 +02:00
Willy Tarreau
c93cd16b6c REORG/MEDIUM: server: split server state and flags in two different variables
Till now, the server's state and flags were all saved as a single bit
field. It causes some difficulties because we'd like to have an enum
for the state and separate flags.

This commit starts by splitting them in two distinct fields. The first
one is srv->state (with its counter-part srv->prev_state) which are now
enums, but which still contain bits (SRV_STF_*).

The flags now lie in their own field (srv->flags).

The function srv_is_usable() was updated to use the enum as input, since
it already used to deal only with the state.

Note that currently, the maintenance mode is still in the state for
simplicity, but it must move as well.
2014-05-22 11:27:00 +02:00
Willy Tarreau
3bed5e9337 BUG/MEDIUM: http: disable server-side expiration until client has sent the body
It's the final part of the 2 previous patches. We prevent the server from
timing out if we still have some data to pass to it. That way, even if the
server runs with a short timeout and the client with a large one, the server
side timeout will only start to count once the client sends everything. This
ensures we don't report a 504 before the server gets the whole request.

It is not certain whether the 1.4 state machine is fully compatible with
this change. Since the purpose is only to ensure that we never report a
server error before a client error if some data are missing from the client
and when the server-side timeout is smaller than or equal to the client's,
it's probably not worth attempting the backport.
2014-05-07 15:23:52 +02:00
Willy Tarreau
b9edf8fbec BUG/MEDIUM: http: correctly report request body timeouts
This is the continuation of the previous patch "BUG/MEDIUM: http/session:
disable client-side expiration only after body".

This one takes care of properly reporting the client-side read timeout
when waiting for a body from the client. Since the timeout may happen
before or after the server starts to respond, we have to take care of
the situation in three different ways :
  - if the server does not read our data fast enough, we emit a 504
    if we're waiting for headers, or we simply break the connection
    if headers were already received. We report either sH or sD
    depending on whether we've seen headers or not.

  - if the server has not yet started to respond, but has read all of
    the client's data and we're still waiting for more data from the
    client, we can safely emit a 408 and abort the request ;

  - if the server has already started to respond (thus it's a transfer
    timeout during a bidirectional exchange), then we silently break
    the connection, and only the session flags will indicate in the
    logs that something went wrong with client or server side.

This bug is tagged MEDIUM because it touches very sensitive areas; however,
its impact is very low. It might be worth performing a careful backport
to 1.4 once it has been confirmed that everything is correct and that it
does not introduce any regression.
2014-05-07 15:22:27 +02:00
Willy Tarreau
b1982e27aa BUG/MEDIUM: http/session: disable client-side expiration only after body
For a very long time, back in the v1.3 days, we used to rely on a trick
to avoid expiring the client side while transferring a payload to the
server. The problem was that if a client was able to quickly fill the
buffers, and these buffers took some time to reach the server, the
client should not expire while not sending anything.

In order to cover this situation, the client-side timeout was disabled
once the connection to the server was OK, since it implied that we would
at least expire on the server if required.

But there is a drawback to this : if a client stops uploading data before
the end, its timeout is not enforced and we only expire on the server's
timeout, so the logs report a 504.

Since 1.4, we have message body analysers which ensure that we know whether
all the expected data was received or not (HTTP_MSG_DATA or HTTP_MSG_DONE).
So we can fix this problem by disabling the client-side or server-side
timeout at the end of the transfer for the respective side instead of
having it unconditionally in session.c during all the transfer.

With this, the logs now report the correct side for the timeout. Note that
this patch is not enough, because another issue remains : the HTTP body
forwarders do not abort upon timeout, they simply rely on the generic
handling from session.c. So for now, the session is still aborted when
reaching the server timeout, but the culprit is properly reported. A
subsequent patch will address this specific point.

This bug was tagged MEDIUM because of the changes performed. The issue
it fixes is minor however. After some cooling down, it may be backported
to 1.4.

It was reported by and discussed with Rachel Chavez and Patrick Hemmer
on the mailing list.
2014-05-07 14:21:47 +02:00
William Lallemand
07c8b24edb MINOR: http: export the smp_fetch_cookie function
Remove the static attribute of smp_fetch_cookie, and declare the
function in proto/proto_http.h for future use.
2014-05-02 18:05:15 +02:00
Willy Tarreau
644c101e2d BUG/MAJOR: http: connection setup may stall on balance url_param
On the mailing list, seri0528@naver.com reported an issue when
using balance url_param or balance uri. The request would sometimes
stall forever.

Cyril Bonté managed to reproduce it with the configuration below :

  listen test :80
    mode http
    balance url_param q
    hash-type consistent
    server s demo.1wt.eu:80

and found it appeared with this commit : 80a92c0 ("BUG/MEDIUM: http:
don't start to forward request data before the connect").

The bug is subtle but real. The problem is that the HTTP request
forwarding analyzer refrains from starting to parse the request
body when some LB algorithms might need the body contents, in order
to preserve the data pointer and avoid moving things around during
analysis in case a redispatch is later needed. And in order to detect
that the connection establishes, it watches the response channel's
CF_READ_ATTACHED flag.

The problem is that a request analyzer is not subscribed to a response
channel, so it will only see changes when woken for other (generally
correlated) reasons, such as the fact that part of the request could
be sent. And since the CF_READ_ATTACHED flag is cleared once leaving
process_session(), it is important not to miss it. It simply happens
that sometimes the server starts to respond in a sequence that validates
the connection in the middle of process_session(), that it is detected
after the analysers, and that the newly assigned CF_READ_ATTACHED is
not used to detect that the request analysers need to be called again,
then the flag is lost.

The CF_WAKE_WRITE flag doesn't work either because it's cleared upon
entry into process_session(), ie if we spend more than one call not
connecting.

Thus we need a new flag to tell the connection initiator that we are
specifically interested in being notified about connection establishment.
This new flag is CF_WAKE_CONNECT. It is set by the requester, and is
cleared once the connection succeeds, where CF_WAKE_ONCE is set instead,
causing the request analysers to be scanned again.

For future versions, some better options will have to be considered :
  - let all analysers subscribe to both request and response events ;
  - let analysers subscribe to stream interface events (reduces number
    of useless calls)
  - change CF_WAKE_WRITE's semantics to persist across calls to
    process_session(), but that is different from validating a
    connection establishment (eg: no data sent, or no data to send)

The bug was introduced in 1.5-dev23, no backport is needed.
2014-04-30 20:02:02 +02:00
Willy Tarreau
0b7483385e MEDIUM: http: make http-request rules processing return a verdict instead of a rule
Till now we used to return a pointer to a rule, but that makes it
complicated to later add support for registering new actions which
may fail. For example, the redirect may fail if the response is too
large to fit into the buffer.

So instead let's return a verdict. But we needed the pointer to the
last rule to get the address of a redirect and to get the realm used
by the auth page. So these pieces of code have moved into the function
and they produce a verdict.
2014-04-29 00:46:01 +02:00
Willy Tarreau
ae3c010226 MEDIUM: http: factorize the "auth" action of http-request and stats
Both use exactly the same mechanism, except for the choice of the
default realm to be emitted when none is selected. It can be achieved
by simply comparing the ruleset with the stats' for now. This achieves
a significant code reduction and further, removes the dependence on
the pointer to the final rule in the caller.
2014-04-29 00:46:01 +02:00
Willy Tarreau
f75e5c3d84 MINOR: http: remove the now unused loop over "block" rules
This ruleset is now always empty, simply remove it.
2014-04-28 22:15:00 +02:00
Willy Tarreau
353bc9f43f CLEANUP: proxy: rename "block_cond" to "block_rules"
Next patch will make them real rules, not only conditions. This separate
patch makes the next one more readable.
2014-04-28 22:05:31 +02:00
Willy Tarreau
5bd6759a19 MINOR: http: silently support the "block" action for http-request
This one will be used to convert "block" rules into "http-request block".
2014-04-28 22:00:46 +02:00
Willy Tarreau
5254259609 MEDIUM: http: remove even more of the spaghetti in the request path
Some of the remaining interleaving of request processing after the
http-request rules can now safely be removed, because all remaining
actions are mutually exclusive.

So we can move together all those related to an intercepting rule,
then proceed with stats, then with req*.

We still keep an issue with stats vs reqrep which forces us to
keep the stats split in two (detection and action). Indeed, from the
beginning, stats are detected before rewriting and not after. But a
reqdeny rule would stop stats, so in practice we have to first detect,
then perform the action. Maybe we'll be able to kill this in version
1.6.
2014-04-28 21:35:30 +02:00
Willy Tarreau
179085ccac MEDIUM: http: move Connection header processing earlier
Till now the Connection header was processed in the middle of the http-request
rules and some reqadd rules. It used to force some http-request actions to be
cut in two parts.

Now with keep-alive, not only that doesn't make any sense anymore, but it's
becoming a total mess, especially since we need to know the headers contents
before proceeding with most actions.

The real reason it was not moved earlier is that the "block" or "http-request"
rules can see a different version if some fields are changed there. But that
is already not reliable anymore since the values observed by the frontend
differ from those in the backend.

This patch is the equivalent of commit f118d9f ("REORG: http: move HTTP
Connection response header parsing earlier") but for the request side. It
has been tagged MEDIUM as it could theoretically slightly affect some setups
relying on corner cases or invalid setups, though this does not make real
sense and is highly unlikely.
2014-04-28 21:35:29 +02:00
Willy Tarreau
65410831a1 BUG/MINOR: http: block rules forgot to increment the session's request counter
The session's backend request counters were incremented after the block
rules while these rules could increment the session's error counters,
meaning that we could have more errors than requests reported in a stick
table! Commit 5d5b5d8 ("MEDIUM: proto_tcp: add support for tracking L7
information") is the most responsible for this.

This bug is 1.5-specific and does not need any backport.
2014-04-28 21:34:43 +02:00
Willy Tarreau
5fa7082911 BUG/MINOR: http: block rules forgot to increment the denied_req counter
"block" rules used to build the whole response and forgot to increment
the denied_req counters. By jumping to the general "deny" label created
in previous patch, it's easier to fix this.

The issue was already present in 1.3 and remained unnoticed, in part
because few people use "block" nowadays.
2014-04-28 18:46:40 +02:00
Willy Tarreau
bbba2a8ecc MEDIUM: http: jump to dedicated labels after http-request processing
Continue the cleanup of http-request post-processing to remove some
of the interleaved tests. Here we set up a few labels to deal with
the deny and tarpit actions and avoid interleaved ifs.
2014-04-28 18:46:20 +02:00
Willy Tarreau
5e9edce0f0 MEDIUM: http: move reqadd after execution of http_request redirect
We still have a plate of spaghetti in the request processing rules.
All http-request rules are executed at once, then some responses are
built interlaced with other rules that used to be there in the past.
Here, reqadd is executed after an http-req redirect rule is *decided*,
but before it is *executed*.

So let's match the doc and config checks, to put the redirect actually
before the reqadd completely.
2014-04-28 17:25:40 +02:00
Willy Tarreau
cfe7fdd02d MINOR: http: rely on the message body parser to send 100-continue
There's no point in open-coding the sending of 100-continue in
the stats initialization code, better simply rely on the function
designed to process the message body which already does it.
2014-04-28 17:25:40 +02:00
Willy Tarreau
e6d24163e5 BUG/MINOR: http: log 407 in case of proxy auth
Commit 844a7e7 ("[MEDIUM] http: add support for proxy authentication")
merged in v1.4-rc1 added the ability to emit a status code 407 in auth
responses, but forgot to set the same status in the logs, which still
contain 401.

The bug is harmless, no backport is needed.
2014-04-28 17:24:42 +02:00
Thierry FOURNIER
e47e4e2385 BUG/MEDIUM: patterns: last fix was still not enough
The last fix addressed the issue for inlined patterns, but it was not
enough, because the flags are also lost when updating patterns
dynamically over the CLI.

Also if the same file was used once with -i and another time without
-i, their references would have been merged and both would have used
the same matching method.

It appears that the patterns have two types of flags. The first ones
are relative to the pattern matching, and the second ones are relative
to the pattern storage. The pattern matching flags are the same for all
the patterns of one expression; they are now stored in the expression.
The storage flags are information returned by the pattern matching
function. This information is relative to each entry and is stored in
the "struct pattern".

Now, the expression matching flags are forwarded to the parse
and index functions. These flags are stored during the
configuration parsing, and they are used during the parse and
index actions.

This issue was introduced in dev23 with the major pattern rework,
and is a continuation of commit a631fc8 ("BUG/MAJOR: patterns: -i
and -n are ignored for inlined patterns"). No backport is needed.
2014-04-28 14:19:17 +02:00
Willy Tarreau
a631fc8de8 BUG/MAJOR: patterns: -i and -n are ignored for inlined patterns
These flags are only passed to pattern_read_from_file() which
loads the patterns from a file. The functions used to parse the
patterns from the current line do not provide the means to pass
the pattern flags so they're lost.

This issue was introduced in dev23 with the major pattern rework,
and was reported by Graham Morley. No backport is needed.
2014-04-27 09:21:08 +02:00
Willy Tarreau
6c09c2ceae BUILD: http: remove a warning on strndup
The latest commit about set-map/add-acl/... causes this warning for
me :

src/proto_http.c: In function 'parse_http_req_cond':
src/proto_http.c:8863: warning: implicit declaration of function 'strndup'
src/proto_http.c:8863: warning: incompatible implicit declaration of built-in function 'strndup'
src/proto_http.c:8890: warning: incompatible implicit declaration of built-in function 'strndup'
src/proto_http.c:8917: warning: incompatible implicit declaration of built-in function 'strndup'
src/proto_http.c:8944: warning: incompatible implicit declaration of built-in function 'strndup'

Use my_strndup() instead of strndup() which is not portable. No backport
needed.
2014-04-25 21:39:17 +02:00
William Lallemand
73025dd7e2 MEDIUM: http: register http-request and http-response keywords
The http_(res|req)_keywords_register() functions allow registering
new keywords.

You need to declare a keyword list:

struct http_req_action_kw_list test_kws = {
	.scope = "testscope",
	.kw = {
		{ "test", parse_test },
		{ NULL, NULL },
	}
};

and a parsing function:

int parse_test(const char **args, int *cur_arg, struct proxy *px, struct http_req_rule *rule, char **err)
{
	rule->action = HTTP_REQ_ACT_CUSTOM_STOP;
	rule->action_ptr = action_function;

	return 0;
}

http_req_keywords_register(&test_kws);

The HTTP_REQ_ACT_CUSTOM_STOP action stops the evaluation of rules after
your rule, while HTTP_REQ_ACT_CUSTOM_CONT permits the evaluation of rules
after your rule.
2014-04-25 18:48:35 +02:00
Baptiste Assmann
fabcbe0de6 MEDIUM: http: ACL and MAP updates through http-(request|response) rules
This patch allows manipulation of ACL and MAP content thanks to any
information available in a session: source IP address, HTTP request or
response header, etc...

It's an "on the fly" update of the content of the maps/ACLs. This means
the changes do not survive a reload or restart of HAProxy.
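
For illustration, a hedged sketch (the file path and conditions are
made up):

  frontend fe
    mode http
    acl blocked src -f /etc/haproxy/block.lst
    http-request deny if blocked
    # update the ACL file's in-memory content at runtime
    http-request add-acl(/etc/haproxy/block.lst) %[src] if { path /ban }
    http-request del-acl(/etc/haproxy/block.lst) %[src] if { path /unban }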
2014-04-25 18:48:35 +02:00
Willy Tarreau
6d8bac7ddc BUG/MAJOR: http: fix the 'next' pointer when performing a redirect
Commit bed410e ("MAJOR: http: centralize data forwarding in the request path")
has woken up an issue in redirects, where msg->next is not reset when flushing
the input buffer. The result is an attempt to forward a negative amount of
data, making haproxy crash.

This bug does not seem to affect versions prior to dev23, so no backport is
needed.
2014-04-25 12:21:09 +02:00
Willy Tarreau
3c1b5ec29c MINOR: http: add capture.req.ver and capture.res.ver
These ones report a string as "HTTP/1.0" or "HTTP/1.1" depending on the
version of the request message or the response message, respectively.
The purpose is to be able to emit custom log lines reporting this version
in a persistent way.
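
For illustration, a hedged custom log-format sketch (the surrounding
fields are arbitrary):

  log-format "%ci:%cp [%t] %ft %[capture.req.ver] %ST %B"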
2014-04-24 23:41:57 +02:00
Willy Tarreau
f118d9f507 REORG: http: move HTTP Connection response header parsing earlier
Currently, the parsing of the HTTP Connection header for the response
is performed at the same place as the rule sets, which means that after
parsing the beginning of the response, we still have no information on
whether the response is keep-alive compatible or not. Let's do that
earlier.

Note that this is the same code that was moved in the previous function;
both of them are always called in a row, so no change of behaviour is
expected.

A future change might consist in having a late analyser to perform the
late header changes such as mangling the connection header. It's quite
painful that currently this is mixed with the rest of the processing
such as filters.
2014-04-24 22:34:30 +02:00
Willy Tarreau
70730dddbd MEDIUM: http: enable analysers to have keep-alive on stats
This allows the stats page to work in keep-alive mode and to be
compressed. At compression ratios up to 80%, it's quite interesting
for large pages.

We ensure to skip filters because we don't want to unexpectedly block
a response nor to mangle response headers.
2014-04-24 22:32:12 +02:00
Willy Tarreau
5897567273 CLEANUP: http: remove the useless "if (1)" inherited from version 1.4
This block has been enclosed inside an "if (1)" statement when migrating
1.3 to 1.4 to avoid a massive reindent. Let's get rid of it now.
2014-04-24 21:26:23 +02:00
Willy Tarreau
f1fd9dc8fb CLEANUP: general: get rid of all old occurrences of "session *t"
All the code inherited from version 1.1 still holds a lot of sessions
called "t" because in 1.1 they were tasks. This naming is very annoying
and sometimes even confusing, for example in code involving tables.
Let's get rid of this once and for all, before 1.5-final.

Nothing changed beyond just carefully renaming these variables.
2014-04-24 21:25:50 +02:00
Willy Tarreau
628c40cd96 MEDIUM: http: move skipping of 100-continue earlier
It's useless to process 100-continue in the middle of response filters
because there's no info in the 100 response itself, and it could even
make things worse. So better treat it as what it is, an interim response
waiting for the next response; thus we just have to handle it in
http_wait_for_response(). That way we ensure we have a valid response
in this function.
2014-04-24 20:21:56 +02:00
Willy Tarreau
4d1f128a18 BUG/MEDIUM: http: 100-continue responses must process the next part immediately
Since commit d7ad9f5 ("MAJOR: channel: add a new flag CF_WAKE_WRITE to
notify the task of writes"), we got another bug with 100-continue responses.
If the final response comes in the same packet as the 100, then the rest of
the buffer is not processed since there is no wake-up event.

In fact the change above uncovered the real culprit, which is more
likely session.c: it should detect that an earlier analyser was set
and loop back to it.

A cleaner fix would be better, but setting the flag works fine.
This issue was introduced in 1.5-dev22, no backport is needed.
2014-04-24 20:21:56 +02:00
Willy Tarreau
efdf094df2 BUG/MAJOR: http: fix timeouts during data forwarding
Patches c623c17 ("MEDIUM: http: start to centralize the forwarding code")
and bed410e ("MAJOR: http: centralize data forwarding in the request path")
merged into 1.5-dev23 cause transfers to be silently aborted after the
server timeout due to the fact that the analysers are woken up when the
timeout strikes and they believe they have nothing more to do, so they're
terminating the transfer.

No backport is needed.
2014-04-24 20:21:56 +02:00
Willy Tarreau
af3cf70d7c MEDIUM: stats: reimplement HTTP keep-alive on the stats page
This basically reimplements commit f3221f9 ("MEDIUM: stats: add support
for HTTP keep-alive on the stats page") which was reverted by commit
51437d2 after Igor Chan reported a broken stats page caused by the bug
fixed by the previous commit.
2014-04-24 17:24:56 +02:00
Willy Tarreau
b2c6a786f7 BUG/MINOR: http: don't report server aborts as client aborts
Commit f003d37 ("BUG/MINOR: http: don't report client aborts as server errors")
attempted to fix a longstanding issue by which some client aborts could be
logged as server errors. Unfortunately, one of the tests involved there also
catches truncated server responses, which are reported as client aborts.

Instead, only check that the client has really closed using the abortonclose
option, just as is done in the request path (which means that the close was
propagated to the server).

The faulty fix above was introduced in 1.5-dev15, and was backported into
1.4.23.

Thanks to Patrick Hemmer for reporting this issue with traces showing the
root cause of the problem.
2014-04-23 20:29:01 +02:00
Willy Tarreau
38b3aa5646 BUG/MAJOR: http: fix bug in parse_qvalue() when selecting compression algo
Commit ad90351 ("MINOR: http: Add the "language" converter to for use with accept-language")
introduced a typo in parse_qvalue :

	if (*end)
		*end = qvalue;

while it should be :

	if (end)
		*end = qvalue;

because end is tested for being NULL. This crashes when selecting the
compression algorithm, since end is NULL there. No backport is needed,
this only affects the latest 1.5-dev.
2014-04-22 23:32:05 +02:00
Willy Tarreau
3ce10ff9f0 CLEANUP: http: remove all calls to http_silent_debug()
This macro has long remained unused and calls are unevenly spread over
the code, so it's totally useless and pollutes the code. Remove it now.
2014-04-22 23:15:29 +02:00
Willy Tarreau
d351021860 CLEANUP: http: document the response forwarding states
The forwarding code is never obvious to enter into for newcomers, so
better improve the documentation about how states are chained and what
happens for each of them.
2014-04-22 23:15:29 +02:00
Willy Tarreau
bed410e0e8 MAJOR: http: centralize data forwarding in the request path
It is the same principle as what was just done for the response.
It makes the code cleaner, faster, and more maintainable.
2014-04-22 23:15:29 +02:00
Willy Tarreau
32b5ab2a28 MEDIUM: http: only allocate the temporary compression buffer when needed
Since we know when the buffer is needed, only check for its allocation
at the same place in order to avoid useless tests on the normal path.
2014-04-22 23:15:29 +02:00
Willy Tarreau
d5a6783ac9 MINOR: http: further cleanups of response forwarding function
There is no reason for mixing compressing and non-compressing
code in the DATA state, they don't share anything. Better make
this clearer.
2014-04-22 23:15:28 +02:00
Willy Tarreau
c623c17b13 MEDIUM: http: start to centralize the forwarding code
Doing so avoids calling channel_forward() for each part of the chunk
parsing and lowers the number of calls to channel_forward() to only
one per buffer, resulting in roughly an 11% increase in the forwarding
rate of small chunks.
2014-04-22 23:15:28 +02:00
Willy Tarreau
168ebc5e2b MEDIUM: http: cleanup: centralize a little bit HTTP compression end
The call to flush the compression buffers only needs to be done when
entering the final states or when leaving with missing data. After
that, if trailers are present, they have to be forwarded.
2014-04-22 23:15:28 +02:00
Willy Tarreau
7f2f8d5cc3 MAJOR: http/compression: fix chunked-encoded response processing
Now we have valid buffer offsets, we can use them to safely parse the
input and only forward when needed. Thus we can get rid of the
consumed_data accumulator, and the code now works both for chunked and
content-length, even with a server feeding one byte at a time (which
systematically broke the previous one).

It's worth noting that 0<CRLF> must always be sent after the end of data
(ie: chunk_len==0), and that the trailing CRLF is only sent in
content-length mode, because in chunked mode we'll have to pass trailers.
2014-04-22 23:15:28 +02:00
Willy Tarreau
5fb0abd9a1 MAJOR: http: re-enable compression on chunked encoding
This is basically a revert of commit 667c2a3 ("BUG/MAJOR: http: compression
still has defects on chunked responses").

The latest changes applied to message pointers should have got rid of all
the issues that were making the compression of partial chunks unreliable.
2014-04-22 23:15:28 +02:00
Willy Tarreau
b59c7bfc95 MEDIUM: http: headers must be forwarded even if data was already inspected
Currently, we forward headers only if the incoming message is still before
HTTP_MSG_CHUNK_SIZE, otherwise they'll be considered as data. In practice
this is always true for the response since there's no data inspection, and
for the request there is no compression so there's no problem with forwarding
them as data.

But the principle is incorrect and will make it difficult to later add data
processing features. So better fix it now.

The new principle is simple :
  - if headers were not yet forwarded, forward them now.
  - while doing so, check if we need to update the state
2014-04-22 23:15:28 +02:00
Willy Tarreau
6fef8ae047 BUG/MINOR: http: deinitialize compression after a compression error
If for some reason the compression returns an error, the compression
is not deinitialized, which also means that any pending data are not
flushed and could be lost, especially in the chunked-encoded case.
No backport is needed.
2014-04-22 23:15:28 +02:00
Willy Tarreau
d01f426e62 BUG/MINOR: http: deinitialize compression after a parsing error
When a parsing error was encountered in a chunked response, we failed
to properly deinitialize the compression context. There was no impact
till now since compression of chunked responses was disabled. No backport
is needed.
2014-04-22 23:15:28 +02:00
Willy Tarreau
7ba235466d MEDIUM: http: forward headers again while waiting for connection to complete
Thanks to the last updates on the message pointers, it is now safe again to
enable forwarding of the request headers while waiting for the connection to
complete because we know how to safely rewind this part.

So this patch slightly modifies what was done in commit 80a92c0 ("BUG/MEDIUM:
http: don't start to forward request data before the connect") to let up to
msg->sov bytes be forwarded when waiting for the connection. The resulting
effect is that a POST request may now be sent with the connect's ACK, which
still saves a packet and may even be useful later when TFO is supported.
2014-04-22 23:15:28 +02:00
Willy Tarreau
1234f4a210 MAJOR: http: reset msg->sov after headers are forwarded
In order to avoid abusively relying on buf->o to guess how many bytes to
rewind during a redispatch, we now clear msg->sov. Thus the meaning of this
field is exactly "how many bytes of headers are left to be forwarded". It
is still possible to rewind because msg->eoh + msg->eol equal that value
before scheduling the forwarding, so we can always subtract them.
2014-04-22 23:15:28 +02:00
Willy Tarreau
211cdece79 MEDIUM: http: add a small helper to compute how far to rewind to find headers
http_hdr_rewind() returns the number of bytes to rewind before buf->p to
find the beginning of headers. At the moment it's not exact as it still
relies on buf->o, assuming that no other data from a past message were
pending there, but that's what was done until now.

The purpose is to centralize further ->sov changes aiming at removing
the reliance on buf->o.
2014-04-22 23:15:28 +02:00
Willy Tarreau
c24715e5f7 MAJOR: http: don't update msg->sov anymore while processing the body
We used to have msg->sov updated for every chunk that was parsed. The issue
is that we want to be able to rewind after chunks were parsed in case we need
to redispatch a request and perform a new hash on the request or insert a
different server header name.

Currently, msg->sov and msg->next make parallel progress. We reached a point
where they're always equal because msg->next is initialized from msg->sov,
and has msg->sov's value subtracted from it each time msg->sov bytes are forwarded.
So we can now ensure that msg->sov can always be replaced by msg->next for
every state after HTTP_MSG_BODY where it is used as a position counter.

This allows us to keep msg->sov untouched whatever the number of chunks that
are parsed, as is needed to extract data from POST request (eg: url_param).
However, we still need to know the starting position of the data relative to
the body, which differs by the chunk size length. We use msg->sol for this
since it's now always zero and unused in the body.

So with this patch, we have the following situation :

 - msg->sov = msg->eoh + msg->eol = size of the headers including last CRLF
 - msg->sol = length of the chunk size if any. So msg->sov + msg->sol = DATA.
 - msg->next corresponds to the byte being inspected based on the current
   state and is always >= msg->sov before starting to forward anything.

Since sov and next are updated in case of header rewriting, a rewind will
fix them both when needed. Of course, ->sol has no reason for changing in
such conditions, so it's fine to keep it relative to msg->sov.

In theory, even if a redispatch has to be performed, a transformation
occurring on the request would still work because the data moved would
still appear at the same place relative to buf->p.
2014-04-22 23:15:28 +02:00
Willy Tarreau
0669d7dcf3 MEDIUM: http: http_parse_chunk_crlf() must not advance the buffer pointer
This function is only a parser: it must start to parse at the next character
and only update the outgoing relative pointers, but not expect the buffer to
be aligned with the next byte to be parsed.

It's important to fix this otherwise we cannot use this function to parse
chunks without starting to forward data.
2014-04-22 23:15:28 +02:00
Willy Tarreau
877e78dbef MAJOR: http: do not use msg->sol while processing messages or forwarding data
There are still some pending issues in the gzip compressor, and fixing
them requires a better handling of intermediate parsing states.

Another issue to deal with is the rewinding of a buffer during a redispatch
when a load balancing algorithm involves L7 data because the exact amount of
data to rewind is not clear. At the moment, this is handled by unwinding all
pending data, which cannot work in responses due to pipelining.

Last, having a first analysis which parses the body and another one which
restarts from where the parsing was left is wrong. Right now it only works
because we never both parse and transform in the same direction. But that
is wrong anyway.

In order to address the first issue, we'll have to use msg->eoh + msg->eol
to find the end of headers, and we still need to store the information about
the forwarded header length somewhere (msg->sol might be reused for this).

msg->sov may only be used for the start of data and not for subsequent chunks
if possible. This first implies that we stop sharing it with header length,
and stop using msg->sol there. In fact we don't need it already as it is
always zero when reaching the HTTP_MSG_BODY state. It was only updated to
reflect a copy of msg->sov.

So now, as a first step in that direction, this patch ensures that msg->sol
is never re-assigned after being set to zero and is not used anymore when
we're dealing with HTTP processing and forwarding. We'll later reuse it
differently but for now it's secured.

The patch does nothing magic, it only removes msg->sol everywhere it was
already zero and avoids setting it. In order to keep the sov-sol difference,
it now resets sov after forwarding data. In theory there's no problem here,
but the patch is still tagged major because that code is complex.
2014-04-22 23:15:28 +02:00
Willy Tarreau
0558a02eb1 MINOR: http: make msg->eol carry the last CRLF length
One of the issues we face when we need to either forward headers only
before compressing, or rewind the stream during a redispatch is to know
the proper length of the request headers. msg->eoh always has the total
length up to the last CRLF, and we never know whether the request ended
with a single LF or a standard CRLF. This makes it hard to rewind the
headers without explicitly checking the bytes in the buffer.

Instead of doing so, we now use msg->eol to carry the length of the last
CRLF (either 1 or 2). Since it is not modified at all after HTTP_MSG_BODY,
and was only left in an undefined state, it is safe to use at any moment.

Thus, the complete header length to forward or to rewind now is always
msg->eoh + msg->eol.
2014-04-22 23:15:28 +02:00
Willy Tarreau
890988f122 CLEANUP: http: prepare dedicated processing for chunked encoded message bodies
Content-length encoded message bodies are trivial to deal with, but
chunked-encoded will require improvements, so let's separate the code
flows between the two to ease next steps. The behaviour is not changed
at all, the code is only rearranged.
2014-04-22 23:15:28 +02:00
Willy Tarreau
5a8f947f4f CLEANUP: http: rename http_process_request_body()
This function does not process anything, it just waits for the beginning
of the request body. Let's rename it http_wait_for_request_body().
2014-04-22 23:15:27 +02:00
Willy Tarreau
226071e0a7 MEDIUM: http: wait for the first chunk or message body length in http_process_body
This is the continuation of the previous patch. Now that full buffers are
not rejected anymore, let's wait for at least the advertised chunk or
body length to be present or the buffer to be full. When either
condition is met, the message processing can go forward.

Thus we don't need to use url_param_post_limit anymore, which was passed
in the configuration as an optional <max_wait> parameter after the
"check_post" value. This setting was necessary when the feature was
implemented because there was no support for parsing message bodies.

The argument is now silently ignored if set in the configuration.
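
For illustration, a hedged example (the parameter name is arbitrary);
the trailing <max_wait> value (1000 here) is now silently ignored:

  balance url_param q check_post 1000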
2014-04-22 23:15:27 +02:00
Willy Tarreau
31a19957d6 MEDIUM: http: don't reject anymore message bodies not containing the url param
http_process_request_body() currently expects a request body containing
exactly an expected message body. This was done in order to support load
balancing on a unique POST parameter but the way it's done still suffers
from some limitations. One of them is that there is no guarantee that the
accepted message will contain the appropriate string if it starts with
another parameter. But at the same time it will reject a message when the
buffer is full.

So as a first step, we don't reject anymore message bodies that fill the
buffer.
2014-04-22 23:15:27 +02:00
Thierry FOURNIER
dad3d1d402 MINOR: http: add the function "del-header" to the directives http-request and http-response
This patch makes it possible to remove all HTTP request and response
header fields whose name is specified in <name>.
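
For illustration, a hedged usage sketch (the header names are made up):

  http-request  del-header X-Internal-Debug
  http-response del-header Server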
2014-04-22 19:13:50 +02:00
Thierry FOURNIER
ad9035186e MINOR: http: Add the "language" converter to for use with accept-language
language(<value[;value[;value[;...]]]>[,<default>])

	 Returns the value with the highest q-factor from a list as
	 extracted from the "accept-language" header using "req.fhdr".
	 Values with no q-factor have a q-factor of 1. Values with a
	 q-factor of 0 are dropped. Only values which belong to the
	 list of semi-colon delimited <values> will be considered. If
	 no value matches the given list and a default value is
	 provided, it is returned. Note that language names may have
	 a variant after a dash ('-'). If this variant is present in
	 the list, it will be matched, but if it is not, only the base
	 language is checked. The match is case-sensitive, and the
	 output string is always one of those provided in arguments.
	 The ordering of arguments is meaningless, only the ordering
	 of the values in the request counts, as the first value among
	 multiple sharing the same q-factor is used.

	 Example :

	     # this configuration switches to the backend matching a
	     # given language based on the request :

	     acl de req.fhdr(accept-language),language(de;es;fr;en) de
	     acl es req.fhdr(accept-language),language(de;es;fr;en) es
	     acl fr req.fhdr(accept-language),language(de;es;fr;en) fr
	     acl en req.fhdr(accept-language),language(de;es;fr;en) en
	     use_backend german  if de
	     use_backend spanish if es
	     use_backend french  if fr
	     use_backend english if en
	     default_backend choose_your_language
2014-04-14 18:39:29 +02:00
Willy Tarreau
e9187f8263 BUILD/MEDIUM: http: remove calls to sprintf()
OpenBSD complains about this use of sprintf() :

src/proto_http.o(.text+0xb0e6): In function `http_process_request':
src/proto_http.c:4127: warning: sprintf() is often misused, please use snprintf()

Here there's no risk as the strings are way shorter than the buffer size
but let's fix it anyway.
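As a hedged illustration (names are hypothetical, not the actual
proto_http.c code), the bounded form looks like this:

    #include <stdio.h>

    /* Same output as the sprintf() call, but the write is bounded by
     * the destination size, which silences the OpenBSD warning even
     * though the strings are known to fit. */
    static void build_request_line(char *dst, size_t dstlen,
                                   const char *meth, const char *uri)
    {
        snprintf(dst, dstlen, "%s %s HTTP/1.0\r\n", meth, uri);
    }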
2014-04-14 15:52:48 +02:00
Apollon Oikonomopoulos
25a15227f5 BUG/MINOR: reject malformed HTTP/0.9 requests
RFC 1945 (§4.1) defines an HTTP/0.9 request ("Simple-Request") as:

  Simple-Request  = "GET" SP Request-URI CRLF

HAProxy tries to automatically upgrade HTTP/0.9 requests to HTTP/1.0,
by appending "HTTP/1.0" to the request and setting the
Request-URI to "/" if it was not present. The latter however is
RFC-incompatible, as HTTP/0.9 requests must already have a Request-URI
according to the definition above. Additionally,
http_upgrade_v09_to_v10() does not check whether the request method is
indeed GET (the mandatory method for HTTP/0.9).

As a result, any single- or double-word request line is regarded as a
valid HTTP request. We fix this by failing in http_upgrade_v09_to_v10()
if the request method is not GET or the request URI is not present.
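A minimal sketch of the stricter check (a hypothetical helper, not the
actual http_upgrade_v09_to_v10() code):

    #include <string.h>

    /* Only upgrade a Simple-Request: the method must be GET and a
     * Request-URI must already be present. */
    static int may_upgrade_v09(const char *meth, size_t meth_len,
                               const char *uri)
    {
        if (meth_len != 3 || memcmp(meth, "GET", 3) != 0)
            return 0;    /* HTTP/0.9 only defines GET */
        if (!uri || !*uri)
            return 0;    /* Request-URI is mandatory */
        return 1;
    }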
2014-04-06 07:53:07 +02:00
Thierry FOURNIER
9f95e4084c MINOR: standard: Add ipv6 support in the function url2sa().
The function url2sa() quickly converts a URL like http://<ip>:<port> into a
struct sockaddr_storage. This patch adds:
 - https support
 - the ability to return the parsed length
 - IPv6 support
 - synchronous DNS resolution, only during haproxy startup.

The fast IPv4 conversion path is kept. IPv6 is slower, because the
standard IPv6 parser function is used.
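A rough sketch of the bracketed-IPv6 case (a hypothetical helper; the
real url2sa() also handles IPv4, https, the parsed length and DNS):

    #include <arpa/inet.h>
    #include <netinet/in.h>
    #include <stdlib.h>
    #include <string.h>

    /* Parse "http://[<ipv6>]:<port>" into a sockaddr_storage using
     * the standard (slower) IPv6 parser, inet_pton(). */
    static int url6_to_sa(const char *url, struct sockaddr_storage *ss)
    {
        char host[INET6_ADDRSTRLEN];
        struct sockaddr_in6 *sin6 = (struct sockaddr_in6 *)ss;
        const char *p = url, *q;

        if (strncmp(p, "http://[", 8) != 0)
            return -1;
        p += 8;
        q = strchr(p, ']');
        if (!q || (size_t)(q - p) >= sizeof(host) || q[1] != ':')
            return -1;
        memcpy(host, p, q - p);
        host[q - p] = '\0';

        memset(sin6, 0, sizeof(*sin6));
        sin6->sin6_family = AF_INET6;
        sin6->sin6_port = htons((unsigned short)atoi(q + 2));
        return inet_pton(AF_INET6, host, &sin6->sin6_addr) == 1 ? 0 : -1;
    }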
2014-03-31 09:54:44 +02:00
Willy Tarreau
0e9b1b4d1f MEDIUM: compression: consider the "q=" attribute in Accept-Encoding
Till now we didn't consider "q=". It's problematic because the first
effect is that compression tokens were not even matched if it was
present.

It is important to parse it correctly because we still want to allow
a user-agent to send "q=0" to explicitly disable a compressor, or to
specify its preferences.

Now, q-values are respected in order of precedence, and when several
q-values are equal, the first occurrence is used.
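For illustration only, a minimal q-value parser in that spirit (not the
actual compression code); it returns the weight scaled to 0..1000:

    #include <ctype.h>

    /* Parse the optional ";q=..." attribute of an Accept-Encoding
     * token. A missing q-factor counts as 1.0 (1000); q=0 disables. */
    static int parse_qvalue(const char *s)
    {
        int q = 1000;

        while (*s && *s != ';')
            s++;
        if (*s++ != ';')
            return q;                  /* no attribute: q=1.0 */
        while (*s == ' ')
            s++;
        if (*s != 'q' || s[1] != '=')
            return q;
        s += 2;
        q = (*s == '1') ? 1000 : 0;    /* integer part is 0 or 1 */
        if (*++s == '.') {
            int mult = 100;
            while (isdigit((unsigned char)*++s) && mult) {
                q += (*s - '0') * mult;
                mult /= 10;
            }
        }
        return q > 1000 ? 1000 : q;
    }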
2014-03-19 12:12:01 +01:00
Thierry FOURNIER
c5a4e98639 MEDIUM: acl: Change the acl register struct
This patch replaces a lot of pointers with pattern matching identifiers.
If the declared ACL uses all the predefined pattern matching functions,
the register function gets the functions provided by "pattern.c" and
identified by the PAT_MATCH_* identifiers.

If the ACL uses its own functions, they can be declared, and the ACL
registration doesn't change them.
2014-03-17 18:06:08 +01:00
Thierry FOURNIER
eeaa951726 MINOR: configuration: File and line propagation
This patch makes it possible to pass the configuration file name and
line number to the configuration parser.
2014-03-17 18:06:08 +01:00
Thierry FOURNIER
e369ca2e66 MEDIUM: pattern_find_smp: functions find_smp uses the pat_ref_elt to find the element to be removed
The find_smp functions search for the smp using the value of the
pat_ref_elt pointer.

The pat_find_smp_* functions are no longer used. The function
pattern_find_smp() knows all pattern indexation methods and can find
the element in any of them.
2014-03-17 18:06:08 +01:00
Thierry FOURNIER
7acca4b269 MEDIUM: pattern: delete() function uses the pat_ref_elt to find the element to be removed
All the pattern delete functions can use their reference to the original
"struct pat_ref_elt" to find the element to be removed. The functions
pat_del_list_str() and pat_del_meth() were deleted because, after
applying this modification, they had the same code as pat_del_list_ptr().
2014-03-17 18:06:08 +01:00
Thierry FOURNIER
5d34408785 MEDIUM: pattern: The expected type is stored in the pattern head, and conversion is executed once.
This patch moves the expect_type variable from "struct pattern" to
"struct pattern_head". This variable is set during the declaration of
ACLs and MAPs. With this change, the function "pat_parse_len()" becomes
useless and can be replaced by "pat_parse_int()".

Implicit ACLs by default rely on the fetch's output type, so let's simply do
the same for all other ones. It has been verified that they all match.
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
55d0b10f06 MEDIUM: pattern: add sample lookup function.
Some functions need to change the sample associated with a pattern. This
new lookup function returns a pointer to the sample pointer, so the
caller can use or change the value.
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
6f7203d673 MEDIUM: pattern: add prune function
This patch adds a specific pointer to each expression pointing to its
prune function. Now each pattern expression embeds its own prune function.
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
b113650e54 MEDIUM: pattern: add delete functions
This commit adds a delete function for patterns. It looks up all
instances of the pattern to delete and deletes them all. The fetch
keyword declarations have been extended to point to the appropriate
delete function.
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
5338eea8eb MEDIUM: pattern: The match function browse itself the list or the tree.
The match function knows the format of the pattern. The pattern can be
stored in a list or in a tree, and the pattern matching function itself
uses the right entry point and indexation type.

Each pattern matching function returns the struct pattern that matched.
If the flag "fill" is set, the struct pattern is filled; otherwise the
contents of this struct must not be used.

With this feature, the general pattern matching function no longer needs
exceptions for building the "struct pattern".
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
d437314979 MEDIUM: sample/http_proto: Add new type called method
Methods are currently stored using two types: an integer if the method
is known, and a string if it is not. The fetch is declared as UINT, but
in some cases it can provide STR.

This patch creates a new type called METH. This type contains an integer
for known methods and a string for the other methods. It can be used
with automatic converters.

Pattern matching can expect a method.

During the free or prune functions, the http_meth pattern is freed. This
patch initialises the freed pointer to NULL.
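A sketch of such a dual representation (illustrative; the field names
are assumptions, not the actual struct):

    #include <stddef.h>

    enum meth_kind { METH_KNOWN, METH_OTHER };

    /* A method is either a small integer for well-known verbs, or a
     * length-delimited string for any other verb. */
    struct http_meth {
        enum meth_kind kind;
        union {
            int id;                                /* known method */
            struct { char *ptr; size_t len; } str; /* other method */
        } u;
    };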
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
7654c9ff44 MEDIUM: sample: Remove types SMP_T_CSTR and SMP_T_CBIN, replace it by SMP_F_CONST flags
The operations applied on types SMP_T_CSTR and SMP_T_STR are the same,
but the check code and the declarations are duplicated, because actions
must be declared for both SMP_T_C* and SMP_T_*. The declared actions and
checks are the same, which complicates the code. Only the "conv"
functions can change from "C*" to "*".

Now, if a function needs to modify an input string, it can call the new
function smp_dup(), which duplicates the data into a trash buffer.
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
edc15c3a35 MEDIUM: pattern: The parse functions just return "struct pattern" without memory allocation
The pattern parse functions put the parsed result in a "struct pattern"
without memory allocation. If the pattern must reference the input data
without changes, the pattern points to the parsed string. If buffers are
needed to store translated data, the trash buffer is used. The
indexation function allocates the memory later if it is needed.
2014-03-17 18:06:07 +01:00
Thierry FOURNIER
b9b08460a2 MEDIUM: pattern: add indexation function.
Before this patch, the indexation function checked the declared pattern
matching function and indexed the data according to it. This made it
hard to add new indexation modes.

This commit adds dedicated indexation functions. Each struct pattern is
associated with one indexation function, which indexes the data
according to the type of pattern and the type of match.
2014-03-17 18:06:06 +01:00
Thierry FOURNIER
580c32cb3a MEDIUM: pattern: The pattern parser no more uses <opaque> and just takes one string.
After the previous patches, the "pat_parse_strcat()" function disappears,
and the "pat_parse_int()" and "pat_parse_dotted_ver()" functions no
longer use the "opaque" argument and take only one string as input.

So, after this patch, no pattern parser uses the opaque variable anymore,
and each takes only one string as input. This patch changes the
prototype of the pattern parsing functions.

Now, the "char **args" is replaced by a "char *arg", the "int *opaque"
is removed, and these functions return 1 on success and 0 on failure.
2014-03-17 18:06:06 +01:00
Thierry FOURNIER
9eec0a646b MAJOR: auth: Change the internal authentication system.
This patch removes the limit of 32 groups. It also makes it possible to
use the standard "pat_parse_str()" function in place of
"pat_parse_strcat()", which is no longer used and is removed. Before
this patch, the groups were stored in a bitfield; now they are stored in
a list of strings. The matching is slower, but the number of groups is
low and the list of allowed groups is generally short.

The fetch function "smp_fetch_http_auth_grp()", used with the name
"http_auth_group", returns a valid username. It can be used as a string
for displaying the username, or with the acl "http_auth_group" for
checking the user's group.

Maybe the names of the ACL and fetch methods are no longer suitable, but
the current names are kept to preserve compatibility with existing
configurations.

The function "userlist_postinit()" is created from verification code
previously stored in the big function "check_config_validity()". The
code is adapted to the new authentication storage system and moved into
the "src/auth.c" file. This function is used to check the validity of
the users declared in groups, and to check the validity of the groups
declared on the "user" entries.

This resolution function is executed before the check of all proxies,
because many ACLs need resolved users and groups.
2014-03-17 18:06:06 +01:00
Thierry FOURNIER
d048d8b891 BUG/MINOR: http: fix encoding of samples used in http headers
Binary samples are sometimes copied as-is into HTTP headers. A sample
can contain bytes not allowed by the HTTP RFC for header content, for
example if it was extracted from binary data. The resulting HTTP request
can thus be invalid.

This issue does not yet happen because haproxy currently (mistakenly)
hex-encodes binary data, so it is not really possible to retrieve
invalid HTTP chars.

The solution consists in hex-encoding all non-printable chars prefixed
by a '%' sign.

No backport is needed since existing code is not affected yet.
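A minimal sketch of such an encoder (hypothetical, not the actual
HAProxy function):

    #include <ctype.h>
    #include <stdio.h>

    /* Copy src to dst, replacing every non-printable byte (and '%'
     * itself) with a "%XX" hex sequence. Returns the output length. */
    static size_t encode_hdr_value(char *dst, size_t dstlen,
                                   const unsigned char *src, size_t srclen)
    {
        size_t i, o = 0;

        for (i = 0; i < srclen; i++) {
            if (isprint(src[i]) && src[i] != '%') {
                if (o + 1 >= dstlen)
                    break;
                dst[o++] = src[i];
            } else {
                if (o + 3 >= dstlen)
                    break;
                o += snprintf(dst + o, dstlen - o, "%%%02X", src[i]);
            }
        }
        dst[o] = '\0';
        return o;
    }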
2014-03-17 16:39:03 +01:00
Willy Tarreau
7519560767 MINOR: http: release compression context only in http_end_txn()
Currently there are two places where the compression context is released,
one in session_free() and another one in http_end_txn_clean_session().
Both of them call http_end_txn(), either directly or via http_reset_txn(),
and this function is made for this exact purpose. So let's centralize the
call there instead.
2014-03-14 19:26:20 +01:00
Willy Tarreau
80a92c02f4 BUG/MEDIUM: http: don't start to forward request data before the connect
Currently, "balance url_param check_post" randomly works. If the client
sends chunked data and there's another chunk after the one containing the
data, http_request_forward_body() will advance msg->sov and move the start
of data to the beginning of the last chunk, and get_server_ph_post() will
not find the data.

In order to avoid this, we add an HTTP_MSGF_WAIT_CONN flag whose goal is
to prevent the forwarding code from parsing until the connection is
confirmed, so that we're certain not to fail on a redispatch. Note that
we need to force channel_auto_connect() since the output buffer is empty
and a previous analyser might have stopped auto-connect.

The flag is currently set whenever some L7 POST analysis is needed for a
connect() so that it correctly addresses all corner cases involving a
possible rewind of the buffer, waiting for a better fix.

Note that this has been broken for a very long time. Even all 1.4 versions
seem broken but differently, with ->sov pointing to the end of the arguments.
So the fix should be considered for backporting to all stable releases,
possibly including 1.3 which works differently.
2014-03-14 12:22:56 +01:00
Willy Tarreau
36346247ac BUG/MEDIUM: http: continue to emit 503 on keep-alive to different server
Finn Arne Gangstad reported that commit 6b726adb35 ("MEDIUM: http: do
not report connection errors for second and further requests") breaks
support for serving static files by abusing the errorfile 503 statement.

Indeed, a second request over a connection sent to any server or backend
returning 503 would silently be dropped.

The proper solution consists in adding a flag on the session indicating
that the server connection was reused, and to only avoid the error code
in this case.
2014-02-24 18:26:30 +01:00
Bhaskar Maddala
a20cb85eba MINOR: stats: Enhancement to stats page to provide information of last session time.
Summary:
Track and report last session time on the stats page for each server
in every backend, as well as the backend.

This attempts to address the requirement in the ROADMAP

  - add a last activity date for each server (req/resp) that will be
    displayed in the stats. It will be useful with soft stop.

The stats page reports this as time elapsed since last session. This
change does not adequately address the requirement for long-running
sessions (websocket, RDP, etc.).
2014-02-08 01:19:58 +01:00
William Lallemand
96a7785429 MINOR: http: optimize capture.req.method and capture.req.uri
Useless strncpy calls were made in those two sample fetches; the
"struct chunk" allows us to dump the specified length.

The encode_string() in capture.req.uri was judged inappropriate and was
deleted.

The return type was fixed to SMP_T_CSTR.
2014-02-05 11:26:50 +01:00
William Lallemand
65ad6e12c1 MINOR: http: capture.req.method and capture.req.uri
Add 2 sample fetches allowing extraction of the method and the URI of
an HTTP request.

FIXME: the sample fetches parser can't add the LW_REQ requirement, at
the moment this flag is used automatically when you use sample fetches.

Note: also fixed the alphabetical order of other capture.req.* keywords
in the doc.
2014-02-04 23:41:36 +01:00
Willy Tarreau
416ce618be BUG/MEDIUM: http: fix regression caused by recent switch to keep-alive by default
Yesterday's commit 70dffda ("MAJOR: http: switch to keep-alive mode by default")
broke HTTP/1.0 handling without keep-alive when keep-alive is enabled both in
the frontend and in the backend.

Before this patch, it used to work because tunnel mode was the default one,
so if no mode was present in the frontend and a mode was set in the backend,
the backend was the first one to parse the header. This is what the original
patch tried to do with keep-alive by default, causing the version and the
connection header to be ignored if both the frontend and the backend were
running in keep-alive mode.

The fix consists in always parsing the header in non-tunnel mode, and
processing the rest of the logic at least once, and again if the
backend works in a different mode than the frontend.

This is 1.5-specific, no backport is needed.
2014-01-31 15:51:11 +01:00
Thierry FOURNIER
98d9695518 BUG/MEDIUM: http/auth: Sometimes the authentication credentials can be mix between two requests
The authentication function "get_http_auth()" extracts credentials from
the request and keeps these values in a shared cache. This function sets
a flag in the session indicating that the authentication has already
been parsed and that the values stored in the cache are available. If
this flag is set, the authorization header is not re-parsed and the
shared cache is used.

If two requests are processed simultaneously, the first one checks the
credentials. After this, the second request also checks its credentials
and changes the data stored in the shared cache. When the first request
re-checks its credentials (for many possible reasons), they have
changed. The change can introduce a segfault.

This patch deactivates the cache upon success. When we need
authentication information from one request, it is re-parsed and
re-decoded. However, a failure to retrieve credentials is still
cached to avoid useless lookups.

This fix needs to be backported to 1.4 as well.
2014-01-31 14:42:54 +01:00
Willy Tarreau
70dffdaa10 MAJOR: http: switch to keep-alive mode by default
Since we support HTTP keep-alive, there is no more reason for staying
in tunnel mode by default. It is confusing for new users and creates
more issues than it solves. Option "http-tunnel" is available to force
to use it if really desired.

Switching to KA by default implied changing the value of some
option flags and some transaction flags so that value zero (the default)
matches keep-alive. That explains why more code was changed than
expected. Tests have been run on the 25 combinations of frontend and
backend options, plus a few with option http-pretend-keepalive, and
no anomaly was found.

The relation between frontend and backends remains the same. Options
have been updated to take precedence over http-keep-alive which is now
implicit.

All references in the doc to haproxy not supporting keep-alive have
been fixed, and the doc for config options has been updated.
2014-01-30 03:14:29 +01:00
Willy Tarreau
f8b0e03f49 MEDIUM: http: make keep-alive + httpclose be passive mode
There's no particular reason for having keep-alive + httpclose combine
into forceclose when set in different frontend/backend sections, since
keep-alive does not close anything by default. Let's have this
combination remain httpclose only.
2014-01-30 03:14:29 +01:00
Willy Tarreau
02bce8be01 MAJOR: http: update connection mode configuration
At the very beginning of haproxy, there was "option httpclose" to make
haproxy add a "Connection: close" header in both directions to invite
both sides to agree on closing the connection. It did not work with some
rare products, so "option forceclose" was added to do the same and actively
close the connection. Then client-side keep-alive was supported, so option
http-server-close was introduced. Now we have keep-alive with a fourth
option, not to mention the implicit tunnel mode.

The connection configuration has become a total mess because all the
options above may be combined together, despite almost everyone thinking
they cancel each other, as judging from the common problem reports on the
mailing list. Unfortunately, re-reading the doc shows that it's not clear
at all that options may be combined, and the opposite seems more obvious
since they're compared. The most common issue is options being set in the
defaults section that are not negated in other sections, but are just
combined when the user expects them to be overloaded. The migration to
keep-alive by default will only make things worse.

So let's start to address the first problem. A transaction can only work in
5 modes today :
  - tunnel : haproxy doesn't bother with what follows the first req/resp
  - passive close : option http-close
  - forced close : option forceclose
  - server close : option http-server-close with keep-alive on the client side
  - keep-alive   : option http-keep-alive, end to end

All 16 combination for each section fall into one of these cases. Same for
the 256 combinations resulting from frontend+backend different modes.

With this patch, we're doing something slightly different, which will not
change anything for users with valid configs, and will only change the
behaviour for users with unsafe configs. The principle is that these options
may not combined anymore, and that the latest one always overrides all the
other ones, including those inherited from the defaults section. The "no
option xxx" statement is still supported to cancel one option and fall back
to the default one. It is mainly needed to ignore defaults sections (eg:
force the tunnel mode). The frontend+backend combinations have not changed.

So for example, the following configuration used to put the connection
into forceclose :

    defaults http
        mode http
        option httpclose

    frontend foo
        option http-server-close

  => http-server-close+httpclose = forceclose before this patch! Now
     the frontend's config replaces the defaults config and results in
     the more expected http-server-close.

All 25 combinations of the 5 modes in (frontend,backend) have been
successfully tested.

In order to prepare for upcoming changes, a new "option http-tunnel" was
added. It currently only voids all other options, and has the lowest
precedence when mixed with another option in another frontend/backend.
2014-01-30 03:14:29 +01:00
Willy Tarreau
59ad1a2e75 BUG/MINOR: config: correctly report when log-format headers require HTTP mode
When using some log-format directives in header insertion without HTTP mode,
the config parser used to report a cryptic message about option httplog being
downgraded to tcplog and with "(null):0" as the file name and line number.

This is because the lfs_file and lfs_line were not properly set for some valid
use cases of log-format directives. Now we cover http-request and http-response
as well.
2014-01-29 14:39:58 +01:00
Willy Tarreau
f3338349ec BUG/MEDIUM: counters: flush content counters after each request
One year ago, commit 5d5b5d8 ("MEDIUM: proto_tcp: add support for tracking
L7 information") brought support for tracking L7 information in tcp-request
content rules. Two years earlier, commit 0a4838c ("[MEDIUM] session-counters:
correctly unbind the counters tracked by the backend") used to flush the
backend counters after processing a request.

While that earliest patch was correct at the time, it became wrong after
the second patch was merged. The code does what it says, but the concept
is flawed. "TCP request content" rules are evaluated for each HTTP request
over a single connection. So if such a rule in the frontend decides to
track any L7 information or to track L4 information when an L7 condition
matches, then it is applied to all requests over the same connection even
if they don't match. This means that a rule such as :

     tcp-request content track-sc0 src if { path /index.html }

will count one request for index.html, and another one for each of the
objects present on this page that are fetched over the same connection
which sent the initial matching request.

Worse, it is possible to make the code do stupid things by using multiple
counters:

     tcp-request content track-sc0 src if { path /foo }
     tcp-request content track-sc1 src if { path /bar }

Just sending two requests first, one with /foo, one with /bar, shows
twice the number of requests for all subsequent requests. Just because
both of them persist after the end of the request.

So the decision to flush backend-tracked counters was not the correct
one. In practice, what is important is to flush content-based rules
since they are the ones evaluated for each request.

Doing so requires new flags in the session however, to keep track of
which stick-counter was tracked by what ruleset. A later change might
make this easier to maintain over time.

This bug is 1.5-specific, no backport to stable is needed.
2014-01-28 21:40:28 +01:00
William Lallemand
a43ba4eee0 MINOR: http: smp_fetch_capture_header_* fetch captured headers
Allows you to fetch a captured header content with capture.res.hdr()
and capture.req.hdr().
2014-01-28 18:43:57 +01:00
Willy Tarreau
3c72872da1 CLEANUP: connection: use conn_ctrl_ready() instead of checking the flag
It's easier and safer to rely on conn_ctrl_ready() everywhere than to
check the flag itself. It will also simplify adding extra checks later
if needed. Some useless controls for !ctrl have been removed, as the
CTRL_READY flag itself guarantees ctrl is set.
2014-01-26 00:42:31 +01:00
Willy Tarreau
4afd70aeab BUG/MAJOR: fix freezes during compression
Recent commit d7ad9f5 ("MAJOR: channel: add a new flag CF_WAKE_WRITE to
notify the task of writes") introduced this new CF_WAKE_WRITE flag that
an analyser which requires some free space to write must set if it wants
to be notified.

Unfortunately, some places were missing. More specifically, the
compression engine can rarely be stuck by a lack of output space,
especially when dealing with non-compressible data. It then has to
stop until some pending data are flushed and for this it must set
the CF_WAKE_WRITE flag. But these cases were missed by the commit
above.

Fortunately, this change was introduced very recently and never
released, so the impact was limited.

Huge thanks to Sander Klein who first reported this issue and who kindly
and patiently provided lots of traces and test data that made it possible
to reproduce, analyze, then fix this issue.
2014-01-25 22:28:22 +01:00
Willy Tarreau
1f0da2485e BUG/MEDIUM: unique_id: HTTP request counter is not stable
Patrick Hemmer reported that using unique_id_format and logs did not
report the same unique ID counter since commit 9f09521 ("BUG/MEDIUM:
unique_id: HTTP request counter must be unique!"). This is because
the increment was done while producing the log message, so it was
performed twice.

A better solution consists in fetching a new value once per request
and saving it in the request or session context for all of this
request's life.

It happens that sessions already have a unique ID field which is used
for debugging and reporting errors, and which differs from the one
sent in logs and unique_id header.

So let's change this to reuse this field to have coherent IDs everywhere.
As of now, a session gets a new unique ID once it is instantiated. This
means that TCP sessions will also benefit from a unique ID that can be
logged. And this ID is renewed for each extra HTTP request received on
an existing session. Thus, all TCP sessions and HTTP requests will have
distinct IDs that will be stable along all their life, and coherent
between all places where they're used (logs, unique_id header,
"show sess", "show errors").

This feature is 1.5-specific, no backport to 1.4 is needed.
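Conceptually (with hypothetical names), the fix amounts to sampling the
counter once per request and reusing the stored copy everywhere:

    /* The counter is read once when the request starts; logs, the
     * unique_id header and "show sess" all use the stored copy, so
     * they always report the same value. */
    static unsigned int global_req_count;

    struct session_ctx {
        unsigned int uniq_id;   /* renewed for each HTTP request */
    };

    static void start_request(struct session_ctx *s)
    {
        s->uniq_id = ++global_req_count;
    }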
2014-01-25 11:07:06 +01:00
Willy Tarreau
c920096993 BUG/MINOR: http: don't clear the SI_FL_DONT_WAKE flag between requests
It's a bit hazardous to wipe out all channel flags; this flag should
be left intact as it protects against recursive calls. Fortunately,
we have no possibility to meet this situation with current applets,
but better fix it before it becomes an issue.

This bug has been there for a long time, but it doesn't seem worth
backporting the fix.
2013-12-31 23:03:09 +01:00
Willy Tarreau
d7ad9f5b0d MAJOR: channel: add a new flag CF_WAKE_WRITE to notify the task of writes
Since commit 6b66f3e ([MAJOR] implement autonomous inter-socket forwarding)
introduced in 1.3.16-rc1, we've been relying on a stupid mechanism to wake
up the task after a write, which was an exact copy-paste of the reader side.

The principle was that if we empty a buffer and there's no forwarding
scheduled or if the *producer* is not in a connected state, then we wake
the task up.

That does not make any sense. It happens to wake up too late sometimes (eg,
when the request analyser waits for some room in the buffer to start to
work), and leads to unneeded wakeups in client-side keep-alive, because
the task is woken up when the response is sent, while the analysers are
simply waiting for a new request.

In order to fix this, we introduce a new channel flag : CF_WAKE_WRITE. It
is designed so that an analyser can explicitly request being notified when
some data were written. It is used only when the HTTP request or response
analysers need to wait for more room in the buffers. It is automatically
cleared upon wake up.

The flag is also automatically set by the functions which try to write into
a buffer from an applet when they fail (bi_putblk() etc...).

That allows us to remove the stupid condition above and avoid some wakeups.
In http-server-close and in http-keep-alive modes, this reduces from 4 to 3
the average number of wakeups per request, and increases the overall
performance by about 1.5%.
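A simplified sketch of the flag's contract (types and names are
assumptions, not the real channel code):

    #define CF_WAKE_WRITE 0x00000001u

    struct channel { unsigned int flags; };

    /* An analyser lacking room sets CF_WAKE_WRITE; the write path
     * wakes the task only when requested, and the flag auto-clears. */
    static void notify_after_write(struct channel *chn,
                                   void (*task_wakeup)(void))
    {
        if (chn->flags & CF_WAKE_WRITE) {
            chn->flags &= ~CF_WAKE_WRITE;  /* cleared upon wake up */
            task_wakeup();
        }
    }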
2013-12-31 18:37:36 +01:00
Willy Tarreau
51437d2c59 Revert "MEDIUM: stats: add support for HTTP keep-alive on the stats page"
This reverts commit f3221f99ac.

Igor reported some very strange breakage of his stats page which is
clearly caused by the chunking, though I don't see at first glance
what could be wrong. Better revert it for now.
2013-12-29 00:43:40 +01:00
Willy Tarreau
f3221f99ac MEDIUM: stats: add support for HTTP keep-alive on the stats page
In theory the principle is simple as we just need to send HTTP chunks
if the client is 1.1 compatible. In practice it's harder because we
have to append a CR LF after each block of data and we're never sure
to have the room for this. In order not to have to deal with this, we
instead send the CR LF prior to each chunk size. The only issue is for
the first chunk and for this reason we avoid to send the empty header
line when using chunked encoding.
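A sketch of the trick (hypothetical helper): the CR LF terminating a
chunk's data is emitted just before the next chunk size, so no room is
ever needed after the data itself:

    #include <stdio.h>

    /* Emit a chunk header; the very first chunk has no previous data
     * to terminate, so it omits the leading CR LF. */
    static int emit_chunk_hdr(char *buf, size_t len,
                              size_t chunk_size, int first)
    {
        return snprintf(buf, len, first ? "%zx\r\n" : "\r\n%zx\r\n",
                        chunk_size);
    }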
2013-12-28 21:40:16 +01:00
Willy Tarreau
3988d9342f OPTIM: http: don't stop polling for read on the client side after a request
We used to unconditionally disable client-side polling after the client
has posted its request. The goal was to avoid subscribing the file
descriptor to the poller for nothing.

This is perfect for the HTTP close mode where we know we won't have to
read on the client side anymore. However, when keep-alive is maintained
with the client, this makes the situation worse. Indeed, after the first
response, we'll have to wait for the client to send a next request and
since this is never immediate, we'll certainly poll. So what happens is
that polling is enabled after a response and disabled after a request,
so the polling is constantly alternating, which is very expensive with
epoll_ctl().

The solution implemented in this patch consists in only disabling the
polling if the client-side is not in keep-alive mode. That way we have
the best of both worlds. In close, we really close, and in keep-alive,
we poll only once.

The performance gained by this change is important, with haproxy jumping
from 158kreq/s to 184kreq/s (+16%) in HTTP keep-alive mode on a machine
which at best does 222k/s in raw TCP mode.

With this patch and the previous one, a keep-alive run with a fast
enough server (or enough concurrent connections to cover the connect
time) does no epoll_ctl() anymore during a run of ab -k. The net
measured gain is 19%.
2013-12-27 23:10:40 +01:00
Willy Tarreau
72575509ca BUG/MINOR: http: always disable compression on HTTP/1.0
Compression is normally disabled on HTTP/1.0 since it does not
support chunked encoded responses. But the test was incomplete, and
Bertrand Jacquin reported a case where if the server responded using
1.1 to an 1.0 request, then haproxy still used to compress (and of
course the client could not understand the response).

No backport is needed, this is 1.5-specific.
2013-12-24 14:41:35 +01:00
Willy Tarreau
068621e4ad MINOR: http: try to stick to same server after status 401/407
In HTTP keep-alive mode, if we receive a 401, we still have a chance
of being able to send the visitor again to the same server over the
same connection. This is required by some broken protocols such as
NTLM, and anyway whenever there is an opportunity for sending the
challenge to the proper place, it's better to do it (at least it
helps with debugging).
2013-12-23 15:12:44 +01:00
Willy Tarreau
2737562e43 MEDIUM: stream-int: implement a very simplistic idle connection manager
Idle connections are not monitored right now. So if a server closes after
a response without advertising it, it won't be detected until a next
request wants to use the connection. This is a bit problematic because
it unnecessarily maintains file descriptors and sockets in an idle
state.

This patch implements a very simple idle connection manager for the stream
interface. It presents itself as an I/O callback. The HTTP engine enables
it when it recycles a connection. If a close or an error is detected on the
underlying socket, it tries to drain as much data as possible from the socket,
detects the close and responds with a close as well, then detaches from the
stream interface.
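A rough sketch of such an I/O callback (hypothetical, not the real
stream-interface code):

    #include <sys/socket.h>
    #include <unistd.h>

    /* Drain whatever the server sent on an idle connection; if the
     * server closed, close our side too and stop monitoring. */
    static void idle_conn_io_cb(int fd)
    {
        char buf[4096];
        ssize_t r;

        while ((r = recv(fd, buf, sizeof(buf), MSG_DONTWAIT)) > 0)
            ;               /* drain and discard */
        if (r == 0)
            close(fd);      /* server closed: respond with a close */
        /* r < 0 with EAGAIN: connection still idle, keep waiting */
    }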
2013-12-17 00:00:28 +01:00
Willy Tarreau
b169eba58d BUG/MEDIUM: http: cook_cnt() forgets to set its output type
Since commit b805f71 (MEDIUM: sample: let the cast functions set their
output type), the output type of a fetch function is automatically
considered and passed to the next converter. A bug introduced in
1.5-dev9 with commit f853c46 (MEDIUM: pattern/acl: get rid of
temp_pattern in ACLs) was revealed by this last one : the output type
remained string instead of UINT, causing the cast function to try to
cast the contents and to crash on a NULL deref.

Note: this fix was made after a careful review of all fetch functions.
A few non-trivial ones had their comments amended to clearly indicate
the output type.
2013-12-16 15:21:29 +01:00
Willy Tarreau
e8df1e128d MEDIUM: http: make option http_proxy automatically rewrite the URL
There are very few users of http_proxy, and all of them complain about
the same thing : the request is passed unmodified to the server (in its
proxy form), and it is not possible to fix it using reqrep rules because
http_proxy happens after.

So let's have http_proxy fix the URL it has analysed to get rid of the
scheme and the host part. This will do what users of this feature expect.
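A bare-bones sketch of that rewrite (hypothetical helper): strip the
scheme and authority so the server receives an origin-form URI:

    #include <string.h>

    /* "http://host[:port]/path?q" -> "/path?q". Returns the input
     * unchanged if it is not in absolute (proxy) form. */
    static const char *strip_proxy_form(const char *uri)
    {
        const char *p;

        if (strncmp(uri, "http://", 7) != 0)
            return uri;
        p = strchr(uri + 7, '/');
        return p ? p : "/";   /* no path at all: use "/" */
    }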
2013-12-16 14:30:55 +01:00
Willy Tarreau
6b726adb35 MEDIUM: http: do not report connection errors for second and further requests
In HTTP keep-alive, if we face a connection error to the server while sending
the request, the error should not be reported, and the client-side connection
should simply be closed, so that the client knows it can retry. This can happen if
the server has too short a keep-alive timeout and quits at the same moment the
new request comes in.
2013-12-16 02:23:54 +01:00
Willy Tarreau
4213a11df9 MAJOR: http: add the keep-alive transition on the server side
When a connection to the server is complete, if the transaction
requests keep-alive mode, we don't shut the connection and we just
reinitialize the stream interface in order to be able to reuse the
connection afterwards.

Note that the server connection count is decremented, just like the
backend's, and that we still try to wake up waiters. But that makes
sense considering that we'll eventually be able to immediately pass
idle connections to waiters.
2013-12-16 02:23:54 +01:00
Willy Tarreau
9471b8ced9 MEDIUM: connection: inform si_alloc_conn() whether existing conn is OK or not
When allocating a new connection, only the caller knows whether it's
acceptable to reuse the previous one or not. Let's pass this information
to si_alloc_conn() which will do the cleanup if the connection is not
acceptable.
2013-12-16 02:23:53 +01:00
Willy Tarreau
2e7a165899 OPTIM: http: do not re-enable reading on client side while closing the server side
It's common to observe a recv() call on the client side just after
the connect() has been issued to the server side when running in
server close mode. The reason is that the whole request has been sent
and the shutw() has been queued in the channel, so the request message
switches to the MSG_CLOSED state, which didn't disable reading. Let's
do it now. That way the reading will only be re-enabled after the
response is transferred to the client. However if abortonclose is set,
we still leave it enabled.
2013-12-16 02:23:53 +01:00
Willy Tarreau
3f3997e6c6 OPTIM: http: set CF_READ_DONTWAIT on response message
strace shows a lot of EAGAIN on small response messages. This
is caused by the fact that the READ_DONTWAIT flag is not set
on response message, it's only there when we want to flush
pending data.

For small responses, it's a waste of CPU cycles to call recv()
for nothing since most of the time, everything we'll need will
be in the first response. Also, this will offer more opportunities
for using splice() to transfer data.
2013-12-16 02:23:52 +01:00
Willy Tarreau
89efaed6b6 BUILD: definitely silence some stupid GCC warnings
It's becoming increasingly difficult to ignore unwanted function returns in
debug code with gcc. Now even when you try to work around it, it suggests a
way to write your code differently. For example :

    src/frontend.c:187:65: warning: if statement has empty body [-Wempty-body]
                if (write(1, trash.str, trash.len) < 0) /* shut gcc warning */;
                                                                              ^
    src/frontend.c:187:65: note: put the semicolon on a separate line to silence this warning
    1 warning generated.

This is totally unacceptable; this code already had to be written this way
to shut it up in earlier versions. And now it comments on the form? What's
the purpose of the C language if you can't write the code that does what
you want anymore?

Emeric proposed to just keep a global variable to drain such useless results
so that gcc stops complaining all the time it believes people who write code
are monkeys. The solution is acceptable because the useless assignment is done
only in debug code so it will not impact performance. This patch implements
this, until gcc becomes even "smarter" to detect that we tried to cheat.
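A sketch of that workaround (the sink's name is made up here):

    #include <unistd.h>

    /* Global sink: assigning the result makes it "used", so gcc
     * stays quiet. Debug-only code, so the extra store is free. */
    int gcc_result_sink;

    #define DRAIN(expr) do { gcc_result_sink = (int)(expr); } while (0)

    static void debug_dump(const char *buf, size_t len)
    {
        DRAIN(write(1, buf, len));
    }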
2013-12-13 15:21:36 +01:00
Thierry FOURNIER
0b2fe4a5cd MINOR: pattern: add support for compiling patterns for lookups
With this patch, patterns can be compiled for two modes :
  - match
  - lookup

The match mode is used for example in ACLs or maps. The lookup mode
is used to lookup a key for pattern maintenance. For example, looking
up a network is different from looking up one address belonging to
this network.

A special case is made for regex. In lookup mode they return the input
regex string and do not compile the regex.
2013-12-12 15:44:02 +01:00
Thierry FOURNIER
7148ce6ef4 MEDIUM: pattern: Extract the index process from the pat_parse_*() functions
Now, the pat_parse_*() functions parse the incoming data. The input
"pattern" struct can be preallocated. If the parser needs to add some
buffers, it allocates memory.

The function pattern_register() runs the parser, processes the key
indexation and associates the "sample_storage" used by maps.
2013-12-12 15:42:11 +01:00
Thierry FOURNIER
cc0e0b3dbb MINOR: pattern: Each pattern sets the expected input type
This is used later for increasing the compatibility with incoming
sample types. When multiple compatible types are supported, one
is arbitrarily used (eg: UINT).
2013-12-12 11:07:33 +01:00
Willy Tarreau
2d400bb931 MINOR: stream_interface: add reporting of resource allocation errors
SSL and keep-alive will need to be able to fail on allocation errors,
and the stream interface did not allow reporting such a cause. The flag
will then be "RC" as already documented.
2013-12-09 17:12:18 +01:00
Willy Tarreau
3770f23a3a MINOR: http: switch the http state to an enum
This reduces its size, though the freed space is not reused by anything else. However it
will significantly improve the debugger's output since we'll now get
real state values.

The default case had to be enabled in the parsers because gcc tries
to optimize the switch/case and noticed some values were missing from
the enums and emitted a warning.
2013-12-09 16:06:22 +01:00
Willy Tarreau
c8987b3664 DIET/MINOR: http: reduce the size of struct http_txn by 8 bytes
Here again we had some oversized and misaligned entries. The method
and the status don't need 4 bytes each, and there was a hole after
the status that does not exist anymore. That's 8 additional bytes
saved from http_txn and as much for the session.

Also some fields were slightly moved to present better memory access
patterns resulting in a steady 0.5% performance increase.
2013-12-09 16:06:22 +01:00
Willy Tarreau
1fbe1c9ec8 MEDIUM: stream-int: return the allocated appctx in stream_int_register_handler()
The task returned by stream_int_register_handler() is never used, however we
always need to access the appctx afterwards. So make it return the appctx
instead. We already plan for it to fail, which is the reason for the addition
of a few tests and the possibility for the HTTP analyser to return a status
code 500.
2013-12-09 15:40:23 +01:00
Willy Tarreau
7b4b499fde MEDIUM: stream-int: replace occurrences of si->appctx with si_appctx()
We're about to remove si->appctx, so first let's replace all occurrences
of its usage with a dynamic extract from si->end. A lot of code was changed
by search-n-replace, but the behaviour was intentionally not altered.

The code surrounding calls to stream_int_register_handler() was slightly
changed since we can only use si->end *after* the registration.
2013-12-09 15:40:23 +01:00
Willy Tarreau
32e3c6a607 MAJOR: stream interface: dynamically allocate the outgoing connection
The outgoing connection is now allocated dynamically upon the first attempt
to touch the connection's source or destination address. If this allocation
fails, we fail on SN_ERR_RESOURCE.

As we didn't use si->conn anymore, it was removed. The endpoints are released
upon session_free(), on the error path, and upon a new transaction. That way
we are able to carry the existing server's address across retries.

The stream interfaces are not initialized anymore before session_complete(),
so we could even think about allocating them dynamically as well, though
that would not provide much savings.

The session initialization now makes use of conn_new()/conn_free(). This
slightly simplifies the code and makes it more logical. The connection
initialization code is now shorter by about 120 bytes because it's done
at once, allowing the compiler to remove all redundant initializations.

The si_attach_applet() function now takes care of first detaching the
existing endpoint, and it is called from stream_int_register_handler(),
so we can safely remove the calls to si_release_endpoint() in the
application code around this call.

A call to si_detach() was made upon stream_int_unregister_handler() to
ensure we always free the allocated connection if one was allocated in
parallel to setting an applet (eg: detect HTTP proxy while proceeding
with stats maybe).
2013-12-09 15:40:23 +01:00
Willy Tarreau
f79c8171b2 MAJOR: connection: add two new flags to indicate readiness of control/transport
Currently the control and transport layers of a connection are supposed
to be initialized when their respective pointers are not NULL. This will
not work anymore when we plan to reuse connections, because there is an
asymmetry between the accept() side and the connect() side :

  - on accept() side, the fd is set first, then the ctrl layer then the
    transport layer ; upon error, they must be undone in the reverse order,
    then the FD must be closed. The FD must not be deleted if the control
    layer was not yet initialized ;

  - on the connect() side, the fd is set last and there is no reliable way
    to know if it has been initialized or not. In practice it's initialized
    to -1 first but this is hackish and supposes that local FDs only will
    be used forever. Also, there are even less solutions for keeping trace
    of the transport layer's state.

Also it is possible to support delayed close() when something (eg: logs)
tracks some information requiring the transport and/or control layers,
making it even more difficult to clean them.

So the proposed solution is to add two flags to the connection :

  - CO_FL_CTRL_READY is set when the control layer is initialized (fd_insert)
    and cleared after it's released (fd_delete).

  - CO_FL_XPRT_READY is set when the transport layer is initialized (xprt->init)
    and cleared after it's released (xprt->close).

The functions have been adapted to rely on this and not on the pointers
anymore. conn_xprt_close() was unused and dangerous : it did not close
the control layer (eg: the socket itself) but still marks the transport
layer as closed, preventing any future call to conn_full_close() from
finishing the job.

The problem comes from conn_full_close() in fact. It needs to close the
xprt and ctrl layers independently. After that we still have an issue:
we don't know based on ->ctrl alone whether the fd was registered or not.
For this we use the two new flags CO_FL_XPRT_READY and CO_FL_CTRL_READY. We
now rely on this and not on conn->xprt nor conn->ctrl anymore to decide what
remains to be done on the connection.

In order not to miss some flag assignments, we introduce conn_ctrl_init()
to initialize the control layer, register the fd using fd_insert() and set
the flag, and conn_ctrl_close() which unregisters the fd and removes the
flag, but only if the transport layer was closed.

Similarly, at the transport layer, conn_xprt_init() calls ->init and sets
the flag, while conn_xprt_close() checks the flag, calls ->close and clears
the flag, regardless of xprt_ctx or xprt_st. This also ensures that the ->init
and the ->close functions are called only once each and in the correct order.
Note that conn_xprt_close() does nothing if the transport layer is still
tracked.

conn_full_close() now simply calls conn_xprt_close() then conn_ctrl_close()
in turn, which do nothing if CO_FL_XPRT_TRACKED is set.

In order to handle the error path, we also provide conn_force_close() which
ignores CO_FL_XPRT_TRACKED and closes the transport and the control layers
in turns. All relevant instances of fd_delete() have been replaced with
conn_force_close(). Now we always know what state the connection is in and
we can expect to split its initialization.
2013-12-09 15:40:23 +01:00
Willy Tarreau
4bd33a9e15 MINOR: http: use conn_init() to reinitialize the server connection
It's safer and easier to proceed using this function which sets all
the required fields.
2013-12-09 15:40:23 +01:00
Willy Tarreau
b363a1f469 MAJOR: stream-int: stop using si->conn and use si->end instead
The connection will only remain there as a pre-allocated entity whose
goal is to be placed in ->end when establishing an outgoing connection.
All connection initialization can be made on this connection, but all
information retrieved should be applied to the end point only.

This change is huge because there were many users of si->conn. Now the
only users are those who initialize the new connection. The difficulty
appears in a few places such as backend.c, proto_http.c, peers.c where
si->conn is used to hold the connection's target address before assigning
the connection to the stream interface. This is why we have to keep
si->conn for now. A future improvement might consist in dynamically
allocating the connection when it is needed.
2013-12-09 15:40:22 +01:00
Willy Tarreau
9b6c2c721e MINOR: stream-int: rename ->applet to ->appctx
Since this is the applet context, call it ->appctx to avoid the confusion
with the pointer to the applet. Many places were changed but it's only a
renaming.
2013-12-09 15:40:22 +01:00
Willy Tarreau
1e6902fd6a MINOR: connection: always initialize conn->objt_type to OBJ_TYPE_CONN
We do this everywhere we prepare a connection so that we can safely
switch to objt_conn() next.
2013-12-09 15:40:22 +01:00
Willy Tarreau
414e9bb806 MEDIUM: stats: move request argument processing to the final step
At the moment, stats require some preliminary storage just to store
some flags and codes that are parsed very early and used later. In
fact that doesn't make much sense and makes it very hard to allocate
the applet dynamically.

This patch changes this. Now stats_check_uri() only checks for the
validity of the request and the fact that it matches the stats uri.

It's handle_stats() which parses it. It makes more sense because
handle_stats() used to already perform some preliminary processing
such as verifying that POST contents are not missing, etc...

There is only one minor hiccup in doing so : the reqrep rules might
be processed in between. This has been addressed by moving
http_handle_stats() just after stats_check_uri() and setting s->target
at the same time. Now that s->target is totally operational, it's used
to mark the current request as being targeted at the stats, and this
information is used after the request processing to remove the HTTP
analysers and only let the applet handle the request.

Thus we guarantee that the storage for the applet is filled with the
relevant information and not overwritten when we switch to the applet.
2013-12-09 15:40:22 +01:00
Willy Tarreau
347a35d19e MAJOR: stats: move the HTTP stats handling to its applet
There is a big trouble with the way POST is handled for the admin
stats page. The POST parameters are extracted from some http-request
rules, and if not round they return zero hoping for being called again
when more data passes. This results in the HTTP analyser being called
several times and all the rules prior to the stats being executed
multiple times as well. That includes rewrite rules.

So instead of doing this, we now move all the processing of the stats
into the stats applet.

That way we just set the stats applet in the HTTP analyser when a stats
request is detected, and the applet takes the time it needs to read the
arguments and respond. We could even imagine improving the applet to
support requests larger than a single buffer.

The code was almost only moved and minimally changed. Several new HTTP
states were added to the stats applet to emit headers, redirects and
to read POST. It was necessary to do this because the headers sent
depend on the parsing of the POST request. In the end it's beneficial
because we removed two stream_int_retnclose() calls.
2013-12-09 15:40:22 +01:00
Willy Tarreau
96d44918f7 MEDIUM: stats: prepare the HTTP stats I/O handler to support more states
In preparation for moving the POST processing to the applet, we first
add new states to the HTTP I/O handler. Till now st0 was only 0/1 for
start/end. We now replace it with an enum.
2013-12-09 15:40:22 +01:00
Willy Tarreau
4c804ec6ee MINOR: http: prevent smp_fetch_url_{ip,port} from using si->conn
These two fetch methods predate the samples and used to store the
destination address into the server-facing connection's address field
because we had no other place at this time.

This will become problematic with the current connection changes, so
let's fix this.
2013-12-09 15:40:22 +01:00
Willy Tarreau
306f8306cb MEDIUM: stats: don't use conn->xprt_ctx anymore
This field was used by dumpstats to retrieve a pointer to the current
session, which may already be found from ->owner. With this change,
the stats code doesn't need the connection at all anymore.
2013-12-09 15:40:21 +01:00
Willy Tarreau
a94d2d7653 MEDIUM: stats: don't use conn->xprt_st anymore
We're trying to move the applets out of the struct connection. So
let's remove the dependence on xprt_st and introduce si->applet.st2
to store the missing contextual data instead.
2013-12-09 15:40:21 +01:00
Willy Tarreau
08382955fe CLEANUP: stream_interface: remove unused field err_loc
This field was still fed with a pointer to the server that caught an
error but was not used anymore. Let's remove it.
2013-12-09 15:40:21 +01:00
Willy Tarreau
0900bcbdbb BUG/MEDIUM: checks: also update the DRAIN state from the web interface
In commit 8c3d0be (MEDIUM: Add DRAIN state and report it on the stats page),
the drain state was updated on every weight change except those that can be
sent via the web interface. This caused inconsistent state combinations to
be reported in the stats depending on the sequence (web then cli vs cli
then web).

It would seem that a call to set_server_drain_state() from within
server_recalc_eweight() would simplify things but that's not completely
certain yet.
2013-12-04 00:54:18 +01:00
Willy Tarreau
60e0838f60 BUG/MINOR: http: usual deinit stuff in last commit
We need to initialize the rdr_fmt list unconditionally. Using only
a redirect rule without an http-redirect may cause a crash during
deinit because the list is iterated from a NULL pointer.
2013-12-03 00:48:45 +01:00
Thierry FOURNIER
d18cd0f110 MEDIUM: http: The redirect strings follows the log format rules.
We handle "http-request redirect" with a log-format string now, but we
leave "redirect" unaffected.

Note that the handling of the special "/" case is moved from runtime
execution to configuration parsing. If the format rule list is
empty, the build_logline() function does nothing.
2013-12-02 23:31:33 +01:00
Willy Tarreau
0cba607400 MINOR: acl/pattern: use types different from int to clarify who does what.
We now have the following enums and all related functions return them and
consume them :

   enum pat_match_res {
	PAT_NOMATCH = 0,         /* sample didn't match any pattern */
	PAT_MATCH = 3,           /* sample matched at least one pattern */
   };

   enum acl_test_res {
	ACL_TEST_FAIL = 0,           /* test failed */
	ACL_TEST_MISS = 1,           /* test may pass with more info */
	ACL_TEST_PASS = 3,           /* test passed */
   };

   enum acl_cond_pol {
	ACL_COND_NONE,		/* no polarity set yet */
	ACL_COND_IF,		/* positive condition (after 'if') */
	ACL_COND_UNLESS,	/* negative condition (after 'unless') */
   };

It's just in order to avoid doubts when reading some code.
2013-12-02 23:31:33 +01:00
Thierry FOURNIER
a65b343eee MEDIUM: pattern: rename "acl" prefix to "pat"
This patch just renames functions, types and enums. No code was changed.
A significant number of files were touched, especially the ACL arrays,
so it is likely that some external patches will not apply anymore.

One important thing is that we had to split ACL_PAT_* into two groups :
  - ACL_TEST_{PASS|MISS|FAIL}
  - PAT_{MATCH|UNMATCH}

A future patch will enforce enums on all these places to avoid confusion.
2013-12-02 23:31:33 +01:00
Thierry FOURNIER
ed66c297c2 REORG: acl/pattern: extract pattern matching from the acl file and create pattern.c
This patch just moves code without any change.

ACLs are just the association between a sample and a pattern. The pattern
contains the match method and the parse method. These two things are
different. This patch cleans up the code by splitting it.
2013-12-02 23:31:33 +01:00
Thierry FOURNIER
dd69a04666 MEDIUM: acl: associate "struct sample_storage" to each "struct acl_pattern"
This will be used later with maps. Each map will associate an entry with
a sample_storage value.

This patch changes the "parse" prototype and all the parsing methods.
The goal is to associate "struct sample_storage" to each entry of
"struct acl_pattern". Only the "parse" function can add the sample value
into the "struct acl_pattern".
2013-12-02 23:31:33 +01:00
Thierry FOURNIER
1c0054fe83 BUG/MINOR: arg: fix error reporting for add-header/set-header sample fetch arguments
The 'add-header %[samples]' parsing errors associated to http-request
and http-response are displayed with the wrong keyword.

Configuration entry:

   http-request set-header mon-header %[res.hdr(user-agent)]

Original error message:

   [WARNING] 323/150920 (16559) : parsing [haproxy.conf:36] : 'log-format' : sample fetch <res.hdr ...

After commit error message:

   [WARNING] 323/150929 (16580) : parsing [haproxy.conf:36] : 'http-request' : sample fetch <res.hdr ...
2013-11-28 18:25:18 +01:00
Simon Horman
58c32978b2 MEDIUM: Set rise and fall of agent checks to 1
This is achieved by moving rise and fall from struct server to struct check.

After this move, the behaviour of the primary check, server->check, is
unchanged. However, the secondary agent check, server->agent, now has
independent rise and fall values each of which are set to 1.

The result is that receiving "fail", "stopped" or "down" just once from the
agent will mark the server as down. And receiving a weight just once will
allow the server to be marked up if its primary check is in good health.

This opens up the scope to allow the rise and fall values of the agent
check to be configurable, however this has not been implemented at this
stage.

Signed-off-by: Simon Horman <horms@verge.net.au>
2013-11-25 07:31:16 +01:00
Willy Tarreau
004e045f31 BUG/MAJOR: server: weight calculation fails for map-based algorithms
A crash was reported by Igor at owind when changing a server's weight
on the CLI. Lukas Tribus could reproduce a related bug where setting
a server's weight would result in the new weight being multiplied by
the initial one. The two bugs are the same.

The incorrect weight calculation results in the total farm weight being
larger than what was initially allocated, causing the map index to be out
of bounds on some hashes. It's easy to reproduce using "balance url_param"
with a variable param, or with "balance static-rr".

It appears that the calculation is made in many places and is not always
right, nor always wrong in the same way. Thus, this patch introduces a
new function "server_recalc_eweight()" which is dedicated to this task
of computing ->eweight from many other elements including uweight and
current time (for slowstart), and all users now switch to use this
function.

The patch is a bit large but the code was not trivially fixable in a way
that could guarantee this situation would not occur anymore. The fix is
much more readable and has been verified to work with all algorithms,
with both consistent and map-based hashes, and even with static-rr.

Slowstart was tested as well, just like enable/disable server.

The same bug is very likely present in 1.4 as well, so the patch will
probably need to be backported even though it will not apply as-is.

Thanks to Lukas and Igor for the information they provided to reproduce it.
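
As a purely illustrative sketch of the idea (everything below other than
server_recalc_eweight(), uweight, eweight and the slowstart behaviour
described above is an assumption, not the committed code), the function
centralizes something like :

   void server_recalc_eweight(struct server *sv)
   {
        unsigned int w = sv->uweight;

        /* assumption : ramp the weight up linearly during slowstart */
        if (now.tv_sec < sv->last_change + sv->slowstart)
                w = 1 + (w * (now.tv_sec - sv->last_change)) / sv->slowstart;

        /* the single place deriving ->eweight from ->uweight */
        sv->eweight = w * BE_WEIGHT_SCALE;
   }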
2013-11-21 15:09:02 +01:00
Simon Horman
125d099662 MEDIUM: Move health element to struct check
This is in preparation for associating an agent check with a server,
which runs alongside the server's existing check.

Signed-off-by: Simon Horman <horms@verge.net.au>
2013-11-19 09:36:07 +01:00
Simon Horman
4a741432be MEDIUM: Paramatise functions over the check of a server
Parametrise the following functions over the check of a server

* set_server_down
* set_server_up
* srv_getinter
* server_status_printf
* set_server_check_status
* set_server_disabled
* set_server_enabled

Generally the server parameter of these functions has been removed.
Where it is still needed it is obtained using check->server.

This is in preparation for associating an agent check with a server,
which runs alongside the server's existing check.
By parametrising these functions, they may act on each of the checks
without further significant modification.

Explanation of the SSP_O_HCHK portion of this change:

* Prior to this patch SSP_O_HCHK serves a single purpose, which
  is to tell server_status_printf() whether it should print
  the details of the check of a server or not.

  With the paramatisation that this patch adds there are two cases.
  1) Printing the details of the check in which case a
     valid check parameter is needed.
  2) Not printing the details of the check, in which case
     the contents of the check parameter are unused.

  In case 1) we could pass SSP_O_HCHK and a valid check and;
  In case 2) we could pass !SSP_O_HCHK and any value for check
  including NULL.

  If NULL is used for case 2), then SSP_O_HCHK becomes superfluous,
  and as NULL is indeed used for case 2), SSP_O_HCHK has been removed.

Signed-off-by: Simon Horman <horms@verge.net.au>
2013-11-19 09:35:54 +01:00
Willy Tarreau
e155ec245a BUG/MINOR: http: fix build warning introduced with url32/url32_src
commit 39c63c5 "url32+src - like base32+src but whole url including parameters"
was missing the last argument "const char *kw", resulting in the build warning
below :

src/proto_http.c:10351:2: warning: initialization from incompatible pointer type [enabled by default]
src/proto_http.c:10351:2: warning: (near initialization for 'sample_fetch_keywords.kw[50].process') [enabled by default]
src/proto_http.c:10352:2: warning: initialization from incompatible pointer type [enabled by default]
src/proto_http.c:10352:2: warning: (near initialization for 'sample_fetch_keywords.kw[51].process') [enabled by default]

It's harmless since it's not needed there anyway.
2013-11-18 18:33:32 +01:00
Willy Tarreau
6d4890cfea BUG/MEDIUM: http: fix possible parser crash when parsing erroneous "http-request redirect" rules
Baptiste Assmann reported a bug affecting the "http-request redirect"
parser. It may randomly crash when reporting an error message if the
syntax is not OK. This is caused by the output error
message pointer, which was not initialized to NULL.

This bug is 1.5-specific (introduced in dev17), no backport is needed.
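
For illustration, a minimal hedged sketch of the pattern being fixed
(the parser and message names are hypothetical) :

   char *errmsg = NULL;  /* must start NULL : error paths test and free it */

   if (!parse_redirect_rule(args, &errmsg)) {  /* hypothetical parser */
        Alert("parsing [%s:%d] : %s\n", file, line,
              errmsg ? errmsg : "unknown error");
        free(errmsg);
   }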
2013-11-18 18:07:35 +01:00
Neil - HAProxy List
39c63c56d2 url32+src - like base32+src but whole url including parameters
I have a need to limit traffic to each url from each source address, much
like base32+src but with the whole url including parameters (this came from
looking at the recent 'Haproxy rate limit per matching request' thread).

Attached is a patch that seems to do the job; it's a copy-and-paste job of
the base32 functions.

The url32 function seems to work too : using 2 machines to request the
same url locks me out of both if I abuse from either when using the url32
key function, and only the one I abuse from when using url32_src.

Neil
2013-11-18 06:50:38 +01:00
Willy Tarreau
3b44e729e5 CLEANUP: http: merge error handling for req* and http-request *
The reqdeny/reqtarpit and http-request deny/tarpit were using
a copy-paste of the error handling code because originally the
req* actions used to maintain their own stats. This is not the
case anymore so we can use the same error blocks for both.

The http-request rulesets still have precedence over req* so no
functionality was changed.
2013-11-16 10:30:14 +01:00
Willy Tarreau
687ba13e92 CLEANUP: http: homogenize processing of denied req counter
The reqdeny/reqideny and reqtarpit/reqitarpit rules used to maintain
the stats counters themselves while http-request deny/tarpit and
rspdeny/rspideny used to centralize them at the point where the
error is processed.

Thus, let's do the same for reqdeny/reqtarpit so that the functions
which iterate over the rules do not have to deal with these counters
anymore.
2013-11-16 10:13:35 +01:00
Willy Tarreau
8ac7249611 BUG/MINOR: stats: don't count tarpitted connections twice
When a connection is tarpitted, a denied req is counted once when the
action is applied, and then a failed req is counted when the tarpit
timeout expires. This is completely wrong, as the tarpit is exactly
equivalent to a deny, being merely a disguised form of it.

So let's not increment the failed req anymore.

This fix may be backported to 1.4 which has the same issue.
2013-11-16 10:06:44 +01:00
Thierry FOURNIER
5068d96ac1 MINOR: http: change url_decode to return the size of the decoded string.
Currently url_decode returns 1 or 0 depending on whether it could decode
the string or not. For some future use cases, we will need to get the
decoded string length after a successful decoding, so let's make it return
that value, and fall back to a negative one in case of error.
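
A hedged sketch of a caller under the new contract (in-place decoding
assumed; the sample fields match 1.5-era naming) :

   int len = url_decode(buf);   /* >= 0 : decoded length ; < 0 : error */
   if (len < 0)
        return 0;               /* invalid %-encoding */
   smp->data.str.str = buf;
   smp->data.str.len = len;     /* the length is now directly available */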
2013-10-23 12:26:50 +02:00
Willy Tarreau
472b1ee115 BUG/MEDIUM: http: accept full buffers on smp_prefetch_http
Bertrand Jacquin reported a bug when using tcp_request content rules
on large POST HTTP requests. The issue is that smp_prefetch_http()
first tries to validate an input buffer, but only if the buffer is
not full. This test is wrong since it must only be performed after
the parsing has failed, otherwise we don't accept POST requests which
fill the buffer as valid HTTP requests.

This bug is 1.5-specific, no backport needed.
2013-10-14 22:47:00 +02:00
Willy Tarreau
7959a55e15 MINOR: http: compute response time before processing headers
At the moment, HTTP response time is computed after response headers are
processed. This can misleadingly assign to the server some heavy local
processing (eg: regex), and also prevents response headers from passing
information related to the response time (which can sometimes be useful
for stats).

Let's retrieve the response time before processing the headers instead.

Note that in order to remain compatible with what was previously done,
we disable the response time when we get a 502 or any bad response. This
should probably be changed in 1.6 since it does not make sense anymore
to lose this information.
2013-09-23 16:53:11 +02:00
William Lallemand
5b7ea3afa1 BUG/MEDIUM: unique_id: junk in log on empty unique_id
When a request failed, the unique_id was allocated but not generated.
The string was not initialized and junk was printed in the log with %ID.

This patch changes the behavior of the unique_id. The unique_id is now
generated when a request fails.

This bug was reported by Patrick Hemmer.
2013-08-31 08:01:14 +02:00
Willy Tarreau
9f09521f2d BUG/MEDIUM: unique_id: HTTP request counter must be unique!
The HTTP request counter is incremented non-atomically, which means that
many requests can log the same ID. Let's increment it when it is consumed
so that we avoid this case.

This bug was reported by Patrick Hemmer. It's 1.5-specific and does not
need to be backported.
2013-08-13 17:52:20 +02:00
Willy Tarreau
ef38c39287 MEDIUM: sample: systematically pass the keyword pointer to the keyword
We're having a lot of duplicate code just because of minor variants between
fetch functions that could be dealt with if the functions had the pointer to
the original keyword, so let's pass it as the last argument. An earlier
version used to pass a pointer to the sample_fetch element, but this is not
the best solution for two reasons :
  - fetch functions will solely rely on the keyword string
  - some other smp_fetch_* users do not have the pointer to the original
    keyword and were forced to pass NULL.

So finally we're passing a pointer to the keyword as a const char *, which
perfectly fits the original purpose.
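
A hedged sketch of the resulting shape (the helper names are
hypothetical ; only the trailing "const char *kw" argument is the
point) :

   static int smp_fetch_hdr_any(struct proxy *px, struct session *l4,
                                void *l7, unsigned int opt,
                                const struct arg *args, struct sample *smp,
                                const char *kw)
   {
        /* one function can now serve several keywords by testing <kw> */
        if (strncmp(kw, "res.", 4) == 0)
                return fetch_from_response(l7, args, smp); /* hypothetical */
        return fetch_from_request(l7, args, smp);          /* hypothetical */
   }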
2013-08-01 21:17:13 +02:00
Willy Tarreau
276fae9ab9 MINOR: samples: add the http_date([<offset>]) sample converter.
Converts an integer supposed to contain a date since epoch to
a string representing this date in a format suitable for use
in HTTP header fields. If an offset value is specified, then
it is a number of seconds that is added to the date before the
conversion is performed. This is particularly useful to emit
Date header fields, Expires values in responses when combined
with a positive offset, or Last-Modified values when the
offset is negative.
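
For instance, an illustrative configuration line (assuming the "date"
sample fetch is available) :

   http-response set-header Expires %[date(3600),http_date]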
2013-07-25 15:00:38 +02:00
Willy Tarreau
506d050600 BUG/MAJOR: http: sample prefetch code was not properly migrated
When ACLs and samples were converged in 1.5-dev18, function
"acl_prefetch_http" was not properly converted after commit 8ed669b1.
It used to return -1 when contents did not match HTTP traffic, which
was considered as a "true" boolean result by the ACL execution code,
possibly causing crashes due to missing data when checking for HTTP
traffic in TCP rules.

Another issue is that when the function returned zero, it did not
set the SMP_F_MAY_CHANGE flag, so it could randomly exit on partial
requests before waiting for a complete one.

Last issue is that when it returned 1, it did not set smp->data.uint,
so this last one would retain a random value from a past execution.
This could randomly cause some matches to fail as well.

Thanks to Remo Eichenberger for reporting this issue with a detailed
explanation and configuration.

This bug is 1.5-specific, no backport is needed.
2013-07-06 13:36:34 +02:00
Willy Tarreau
5b15f9004d BUG/MEDIUM: http: "option checkcache" fails with the no-cache header
The checkcache option checks for cacheable responses with a set-cookie
header. Since the response processing code was refactored in 1.3.8
(commit a15645d4), the check was broken because the no-cache value
is only checked as no-cache="set-cookie", and not alone.

Thanks to Hervé Commowick for reporting this stupid bug!

The fix should be backported to 1.4 and 1.3.
2013-07-04 12:49:28 +02:00
Lukas Tribus
67db8df12b MEDIUM: http: add IPv6 support for "set-tos"
As per RFC3260 #4 and BCP37 #4.2 and #5.2, the IPv6 counterpart of TOS
is "traffic class".

Add support for IPv6 traffic class in "set-tos" by moving the "set-tos"
related code to the new inline function inet_set_tos(), handling IPv4
(IP_TOS), IPv6 (IPV6_TCLASS) and IPv4-mapped sockets (IP_TOS, like
::ffff:127.0.0.1).

Also define - if missing - the IN6_IS_ADDR_V4MAPPED() macro in
include/common/compat.h for compatibility.
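
The likely shape of such a helper (a sketch consistent with the
description above, not necessarily the committed code ; needs
<sys/socket.h> and <netinet/in.h>) :

   static inline void inet_set_tos(int fd, struct sockaddr_storage from, int tos)
   {
   #ifdef IP_TOS
        if (from.ss_family == AF_INET)
                setsockopt(fd, IPPROTO_IP, IP_TOS, &tos, sizeof(tos));
   #endif
   #ifdef IPV6_TCLASS
        if (from.ss_family == AF_INET6) {
                if (IN6_IS_ADDR_V4MAPPED(&((struct sockaddr_in6 *)&from)->sin6_addr))
                        /* v4-mapped addresses still use IP_TOS */
                        setsockopt(fd, IPPROTO_IP, IP_TOS, &tos, sizeof(tos));
                else
                        setsockopt(fd, IPPROTO_IPV6, IPV6_TCLASS, &tos, sizeof(tos));
        }
   #endif
   }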
2013-06-23 18:01:38 +02:00
Lukas Tribus
2dd1d1a93f BUG/MINOR: http: fix "set-tos" not working in certain configurations
s->req->prod->conn->addr.to.ss_family contains only useful data if
conn_get_to_addr() is called early. If that's not the case (nothing in the
configuration needs the destination address like logs, transparent, ...)
then "set-tos" doesn't work.

Fix this by checking s->req->prod->conn->addr.from.ss_family instead.
Also fix a minor doc issue about set-tos in http-response.
2013-06-23 18:01:31 +02:00
Willy Tarreau
dc13c11c1e BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS
Benoit Dolez reported a failure to start haproxy 1.5-dev19. The
process would immediately report an internal error with missing
fetches from some crap instead of ACL names.

The cause is that some versions of gcc seem to trim static structs
containing a variable array when moving them to BSS, and only keep
the fixed size, which is just a list head for all ACL and sample
fetch keywords. This was confirmed at least with gcc 3.4.6. And we
can't move these structs to const because they contain a list element
which is needed to link all of them together during the parsing.

The bug indeed appeared with 1.5-dev19 because it's the first one
to have some empty ACL keyword lists.

One solution is to impose -fno-zero-initialized-in-bss to everyone
but this is not really nice. Another solution consists in ensuring
the struct is never empty so that it does not move there. The easy
solution consists in having a non-null list head since it's not yet
initialized.

A new "ILH" list head type was thus created for this purpose : create
an Initialized List Head so that gcc cannot move the struct to BSS.
This fixes the issue for this version of gcc and does not create any
burden for the declarations.
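
A hedged illustration of the idea (the exact constants are assumptions) :

   /* Initialized List Head : non-null pointers keep the struct in the
    * data section instead of BSS */
   #define ILH { .n = (struct list *)1, .p = (struct list *)2 }

   /* the list head is now non-null at build time, so gcc keeps the
    * whole struct (including its empty keyword array) out of BSS */
   static struct acl_kw_list acl_kws = { .list = ILH };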
2013-06-21 23:29:02 +02:00
Willy Tarreau
67dad2715b BUG/CRITICAL: fix a possible crash when using negative header occurrences
When a config makes use of hdr_ip(x-forwarded-for,-1) or any such thing
involving a negative occurrence count, the header is still parsed in the
order it appears, and an array of up to MAX_HDR_HISTORY entries is created.
When more entries are used, the entries simply wrap and continue this way.

A problem happens when the incoming header field count exactly divides
MAX_HDR_HISTORY, because the computation removes the number of requested
occurrences from the count, but does not care about the risk of wrapping
with a negative number. Thus we can dereference the array with a negative
number and randomly crash the process.

The bug is located in http_get_hdr() in haproxy 1.5, and get_ip_from_hdr2()
in haproxy 1.4. It affects configurations making use of one of the following
functions with a negative <value> occurrence number :

   - hdr_ip(<name>, <value>)  (in 1.4)
   - hdr_*(<name>, <value>)   (in 1.5)

It also affects "source" statements involving "hdr_ip(<name>)" since that
statement implicitly uses -1 for <value> :

   - source 0.0.0.0 usesrc hdr_ip(<name>)

A workaround consists in rejecting dangerous requests early using
hdr_cnt(<name>), which is available both in 1.4 and 1.5 :

   block if { hdr_cnt(<name>) ge 10 }

This bug has been present since the introduction of the negative offset
count in 1.4.4 via commit bce70882. It has been reported by David Torgerson
who offered some debugging traces showing where the crash happened, thus
making it significantly easier to find the bug!

CVE-2013-2175 was assigned to this bug.

This fix must absolutely be backported to 1.4.
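
A hedged sketch of the defensive computation (variable names are
illustrative, not the literal patch) :

   /* keep the history index within [0, MAX_HDR_HISTORY) even when the
    * requested occurrence <occ> is negative, instead of subtracting
    * blindly and possibly indexing the array with a negative value :
    */
   idx = (hist_idx + MAX_HDR_HISTORY + occ) % MAX_HDR_HISTORY;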
2013-06-17 12:00:22 +02:00
Willy Tarreau
3c4beb1feb CLEANUP: http: remove the bogus urlp_ip ACL match
This one is wrong, never matches and cannot work. It was brought by a blind
copy-paste from the url_* version in 1.5-dev9, but there is no underlying
fetch returning an IP type for this.
2013-06-12 22:26:04 +02:00
Willy Tarreau
c32484ed35 MEDIUM: acl: remove 15 additional useless ACLs that are equivalent to their fetches
The following 15 ACLs were missed from previous review, and are not needed
either.

hdr_cnt, hdr_ip, hdr_val, rep_ssl_hello_type, req_len, req_ssl_hello_type,
scook_cnt, scook_val, shdr_cnt, shdr_ip, shdr_val, url_ip, url_port,
urlp_val, req_proto_http.
2013-06-12 22:23:40 +02:00
Willy Tarreau
6d4e4e8dd2 MEDIUM: acl: remove a lot of useless ACLs that are equivalent to their fetches
The following 116 ACLs were removed because they're redundant with their
fetch function since the last commit, which allows the fetch function to be
used instead for types BOOL, INT and IP. Most places are now left with
an empty ACL keyword list that was not removed so that it's easier to
add other ACLs later.

always_false, always_true, avg_queue, be_conn, be_id, be_sess_rate, connslots,
nbsrv, queue, srv_conn, srv_id, srv_is_up, srv_sess_rate, res.comp, fe_conn,
fe_id, fe_sess_rate, dst_conn, so_id, wait_end, http_auth, http_first_req,
status, dst, dst_port, src, src_port, sc1_bytes_in_rate, sc1_bytes_out_rate,
sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0,
sc1_gpc0_rate, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt,
sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt,
sc1_sess_rate, sc1_tracked, sc1_trackers, sc2_bytes_in_rate,
sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate,
sc2_get_gpc0, sc2_gpc0_rate, sc2_http_err_cnt, sc2_http_err_rate,
sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in,
sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_tracked, sc2_trackers,
sc3_bytes_in_rate, sc3_bytes_out_rate, sc3_clr_gpc0, sc3_conn_cnt,
sc3_conn_cur, sc3_conn_rate, sc3_get_gpc0, sc3_gpc0_rate, sc3_http_err_cnt,
sc3_http_err_rate, sc3_http_req_cnt, sc3_http_req_rate, sc3_inc_gpc0,
sc3_kbytes_in, sc3_kbytes_out, sc3_sess_cnt, sc3_sess_rate, sc3_tracked,
sc3_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0,
src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_gpc0_rate,
src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate,
src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate,
src_updt_conn_cnt, table_avl, table_cnt, ssl_c_ca_err, ssl_c_ca_err_depth,
ssl_c_err, ssl_c_used, ssl_c_verify, ssl_c_version, ssl_f_version, ssl_fc,
ssl_fc_alg_keysize, ssl_fc_has_crt, ssl_fc_has_sni, ssl_fc_use_keysize,
2013-06-11 21:22:58 +02:00
Willy Tarreau
51347ed94c MEDIUM: http: add the "set-mark" action on http-request/http-response rules
"set-mark" is used to set the Netfilter MARK on all packets sent to the
client to the value passed in <mark> on platforms which support it. This
value is an unsigned 32 bit value which can be matched by netfilter and
by the routing table. It can be expressed both in decimal or hexadecimal
format (prefixed by "0x"). This can be useful to force certain packets to
take a different route (for example a cheaper network path for bulk
downloads). This works on Linux kernels 2.6.32 and above and requires
admin privileges.
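
An illustrative use (hypothetical condition) :

   http-request set-mark 0x2 if { path_beg /downloads }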
2013-06-11 19:34:13 +02:00
Willy Tarreau
42cf39e3b9 MEDIUM: http: add support for "set-tos" in http-request/http-response
This manipulates the TOS field of the IP header of outgoing packets sent
to the client. This can be used to set a specific DSCP traffic class based
on some request or response information. See RFC2474, 2597, 3260 and 4594
for more information.
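
An illustrative use (hypothetical condition ; 16 is the legacy
"low delay" TOS value) :

   http-request set-tos 16 if { src 10.0.0.0/8 }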
2013-06-11 19:04:37 +02:00
Willy Tarreau
9a355ec257 MEDIUM: http: add support for action "set-log-level" in http-request/http-response
Some users want to disable logging for certain non-important requests such as
stats requests or health-checks coming from other equipment. Other users want
to log with a higher importance (eg: notice) some special traffic (POST requests,
authenticated requests, requests coming from suspicious IPs) or some abnormally
large responses.

This patch responds to all these needs at once by adding a "set-log-level" action
to http-request/http-response. The 8 syslog levels are supported, as well as "silent"
to disable logging.
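
For example (illustrative configuration) :

   http-request  set-log-level silent if { path /health }
   http-response set-log-level notice if { status ge 500 }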
2013-06-11 17:50:26 +02:00
Willy Tarreau
abcd5145f8 MEDIUM: log: add a log level override value in struct session
This log level will be used in a further patch to change the log level
depending on the request or response.
2013-06-11 17:50:26 +02:00
Willy Tarreau
f4c43c13be MEDIUM: http: add the "set-nice" action to http-request and http-response
This new action changes the nice factor of the task processing the current
request.
2013-06-11 17:50:26 +02:00
Willy Tarreau
e365c0b92b MEDIUM: http: add a new "http-response" ruleset
Some actions were clearly missing to process response headers. This
patch adds a new "http-response" ruleset which provides the following
actions :
  - allow : stop evaluating http-response rules
  - deny : stop and reject the response with a 502
  - add-header : add a header in log-format mode
  - set-header : set a header in log-format mode
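
An illustrative ruleset (hypothetical values) :

   http-response set-header X-Frame-Options SAMEORIGIN
   http-response allow if { status 404 }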
2013-06-11 16:06:12 +02:00
Willy Tarreau
04ff9f105f MINOR: http: add full-length header fetch methods
The req.hdr and res.hdr fetch methods do not work well on headers which
are allowed to contain commas, such as User-Agent, Date or Expires.
More specifically, full-length matching is impossible if a comma is
present.

This patch introduces 4 new fetch functions which are designed to work
with these full-length headers :
  - req.fhdr, req.fhdr_cnt
  - res.fhdr, res.fhdr_cnt

These do not stop at commas and permit returning full-length header
values.
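
For example, fhdr_cnt counts full-length occurrences instead of
comma-separated values (illustrative) :

   # a User-Agent containing commas still counts as one full header
   acl single_ua req.fhdr_cnt(user-agent) eq 1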
2013-06-10 18:39:42 +02:00
Willy Tarreau
570f221cbb MINOR: log: add a new flag 'L' for locally processed requests
People who use "option dontlog-normal" are bothered with redirects and
stats being logged and reported as errors in the logs ("PR" = proxy
blocked the request).

This patch introduces a new flag 'L' for when a request is locally
processed, that is not considered as an error by the log filters. That
way we know a request was intercepted and processed by haproxy without
logging the line when "option dontlog-normal" is in effect.
2013-06-10 16:42:09 +02:00
Willy Tarreau
379357af58 BUG/MAJOR: http: always ensure response buffer has some room for a response
Since 1.5-dev12 and commit 3bf1b2b8 (MAJOR: channel: stop relying on
BF_FULL to take action), the HTTP parser switched to channel_full()
instead of BF_FULL to decide whether a buffer had enough room to start
parsing a request or response. The problem is that channel_full()
intentionally ignores outgoing data, so a corner case exists where a
large response might still be left in a response buffer with just a
few bytes left (much less than the reserve), enough to accept a second
response past the last data, but not enough to permit the HTTP processor
to add some headers. Since all the processing relies on this space being
available, we can get some random crashes when clients pipeline requests.

The analysis of a core from haproxy configured with 20480 bytes buffers
shows this : with enough "luck", when sending back the response for the
first request, the client is slow, the TCP window is congested, the socket
buffers are full, and haproxy's buffer fills up. We still have 20230 bytes
of response data in a 20480 response buffer. The second request is sent to
the server which returns 214 bytes which fit in the small 250 bytes left
in this buffer. And the buffer arrangement makes it possible to escape all
the controls in http_wait_for_response() :

    |<------ response buffer = 20480 bytes ------>|
    [ 2/2  | 3 | 4 |          1/2                 ]
           ^ start of circular buffer

      1/2 = beginning of previous response (18240)
      2/2 = end of previous response       (1990)
        3 = current response               (214)
        4 = free space                     (36)

  - channel_full() returns false (20230 bytes are going to leave)
  - the response headers do not wrap at the end of the buffer
  - the remaining linear room after the headers is larger than the
    reserve, because it's the previous response which wraps :
  => response is processed

Header rewriting causes it to reach 260 bytes, 10 bytes larger than what
the buffer could hold. So all computations during header addition are
wrong and lead to the corruption we've observed.

All the conditions are very hard to meet (which explains why it took
almost one year for this bug to show up) and are almost impossible to
reproduce on purpose on a test platform. But the bug is clearly there.

This issue was reported by Dinko Korunic who kindly devoted a lot of
time to provide countless traces and cores, and to experiment with
troubleshooting patches to knock the bug down. Thanks Dinko!

No backport is needed, but all 1.5-dev versions between dev12 and dev18
included must be upgraded. A workaround consists in setting option
forceclose to prevent pipelined requests from being processed.
2013-06-08 13:14:17 +02:00
Willy Tarreau
7fe3300b76 BUG/MEDIUM: stats: fix a regression when dealing with POST requests
In 1.5-dev17 (commit 1facd6d6), we reorganized the way HTTP stats
requests are handled. When moving the code, we dropped a "return 0"
which happens upon incomplete POST request, so we now end up with
the next return 1 which causes processing to go on with next
analyser. This causes incomplete POST requests to be forwarded
to the servers, resulting in either a 404 or a 503 depending
on the configuration.

This patch fixes this regression to restore the previous behaviour.
It's not enough though, as it happens that the stats code is handled
after all http header processing but in the same function. The net
effect is that incomplete requests cause the headers manipulation to
be performed multiple times, possibly resulting in multiple headers
in the request buffer. Since the stats requests are not meant to be
forwarded, it's not an issue yet but this is something to take care
of later.

A remaining issue that's not handled yet is that if the client does
not send the complete POST headers, then the request is finally
forwarded. This is not a regression, it has always been there and
seems to be caused by the lack of timeout processing when waiting
for the POST body. The solution to this issue would be to move the
handling of stats requests into a dedicated analyser placed after
http_process_request_body().

Bug reported by Guillaume de Lafond.
2013-04-21 08:16:10 +02:00
de Lafond Guillaume
88c278fadf MEDIUM: stats: add proxy name filtering on the statistic page
This patch adds a "scope" box in the statistics page in order to
display only proxies with a name that contains the requested value.
The scope filter is preserved across all clicks on the page.
2013-04-15 22:50:33 +02:00
Willy Tarreau
667c2a3d2a BUG/MAJOR: http: compression still has defects on chunked responses
The compression state machine happens to start work that it cannot undo if
there's no more data in the input buffer, and has trouble accounting
for it. Fixing it requires more than a few lines, as the confusion is
in part caused by the way the pointers to the various places in the
message are handled internally. So as a temporary fix, let's disable
compression on chunk-encoded responses. This will give us more time
to perform the required changes.
2013-04-14 23:32:53 +02:00
Willy Tarreau
8d1c5164f3 BUG/MINOR: http: add-header/set-header did not accept the ACL condition
Sander Klein reported this bug. The test for the extra argument on these
rules prevent any condition from being added. The bug was introduced with
the feature itself in 1.5-dev16.
2013-04-03 14:13:58 +02:00
Willy Tarreau
a4312fa28e MAJOR: sample: maintain a per-proxy list of the fetch args to resolve
While ACL args were resolved after all the config was parsed, it was not the
case with sample fetch args because they're almost everywhere now.

The issue is that ACLs now solely rely on sample fetches, so their args
resolving doesn't work anymore. And many fetches involving a server, a
proxy or a userlist don't work at all.

The real issue is that at the bottom layers we have no information about
proxies, line numbers, even ACLs in order to report understandable errors,
and that at the top layers we have no visibility over the locations where
fetches are referenced (think log node).

After multiple unsatisfying solution attempts, we now have a new
concept of args list. The principle is that every proxy has a list head
which contains a number of indications such as the config keyword, the
context where it's used, the file and line number, etc... and a list of
arguments. This list head is of the same type as the elements, so it
serves as a template for adding new elements. This way, it is filled from
top to bottom by the callers with the information they have (eg: line
numbers, ACL name, ...) and the lower layers just have to duplicate it and
add an element when they face an argument they cannot resolve yet.

Then at the end of the configuration parsing, a loop passes over each
proxy's list and resolves all the args in sequence. And this way there is
all necessary information to report verbose errors.

The first immediate benefit is that for the first time we got very precise
location of issues (arg number in a keyword in its context, ...). Second,
in order to do this we had to parse log-format and unique-id-format a bit
earlier, so that was a great opportunity for doing so when the directives
are encountered (unless it's a default section). This way, the recorded
line numbers for these args are the ones of the place where the log format
is declared, not the end of the file.

Userlists report slightly more information now. They're the only remaining
ones in the ACL resolving function.
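
A deliberately simplified sketch of the concept (this is not HAProxy's
actual definition ; every field below is an assumption) :

   struct arg_list {
        struct list list;   /* chains unresolved args; the head is a template */
        const char *file;   /* config file where the arg appeared */
        int line;           /* config line, for precise error reporting */
        const char *kw;     /* keyword context (eg: "http-request", ACL name) */
   };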
2013-04-03 02:13:02 +02:00
Willy Tarreau
0a0daecbb2 MEDIUM: http: remove val_usr() to validate user_lists
This one was incorrect since it tried to validate the user-lists
before the end of parsing.
2013-04-03 02:13:02 +02:00
Willy Tarreau
ff5afcc32b MINOR: http: replace acl_parse_ver with acl_parse_str
The HTTP version parser used in ACLs has long been a string and
still had its own parser. This makes no sense, switch it to use
the standard string parser.
2013-04-03 02:13:01 +02:00
Willy Tarreau
d86e29d2a1 CLEANUP: acl: remove unused references to ACL_USE_*
Now that acl->requires is not used anymore, we can remove all references
to it as well as all ACL_USE_* flags.
2013-04-03 02:13:00 +02:00
Willy Tarreau
18ed2569f5 MINOR: http: add new direction-explicit sample fetches for headers and cookies
Since "hdr" and "cookie" were ambiguously referring to the request or response
depending on the context, we need a way to explicitly specify the direction.
By prefixing the fetches names with "req." and "res.", we can now restrict such
fetches to the appropriate direction. At the moment the fetches are explicitly
declared, but later we might think about having an automatic match when "req." or
"res." appears. These explicit fetches are now used by the relevant ACLs.
2013-04-03 02:12:59 +02:00
Willy Tarreau
9baae63d8d MAJOR: acl: remove fetch argument validation from the ACL struct
ACL fetch being inherited from the sample fetch keyword, we don't need
anymore to specify what function to use to validate the fetch arguments.

Note that the job is still done in the ACL parsing code based on elements
from the sample fetch structs.
2013-04-03 02:12:59 +02:00
Willy Tarreau
c48c90dfa5 MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's
Now that ACLs solely rely on sample fetch functions, make them use the
same arg mask. All inconsistencies have been fixed separately prior to
this patch, so this patch almost only adds a new pointer indirection
and removes all references to ARG*() in the definitions.

The parsing is still performed by the ACL code though.
2013-04-03 02:12:58 +02:00
Willy Tarreau
8ed669b12a MAJOR: acl: make all ACLs reference the fetch function via a sample.
ACL fetch functions used to directly reference a fetch function. Now
that all ACL fetches have their sample fetches equivalent, we can make
ACLs reference a sample fetch keyword instead.

In order to simplify the code, a sample keyword name may be NULL if it
is the same as the ACL's, which is the most common case.

A minor change appeared : http_auth always expects one argument though
the ACL allowed it to be missing and reported it as such afterwards, so
the ACL was fixed to match this. This is not really a bug.
2013-04-03 02:12:58 +02:00
Willy Tarreau
409bcde176 MEDIUM: http: unify acl and sample fetch functions
The following sample fetch functions were only usable by ACLs but are now
usable by sample fetches too :

    cook, cook_cnt, cook_val, hdr_cnt, hdr_ip, hdr_val, http_auth,
    http_auth_group, http_first_req, method, req_proto_http, req_ver,
    resp_ver, scook, scook_cnt, scook_val, shdr, shdr_cnt, shdr_ip,
    shdr_val, status, urlp, urlp_val,

Most of them won't bring much benefit at the moment, or are even aliases of
existing ones, however they'll be needed for ACL->SMP convergence.

A new val_usr() function was added to resolve userlist names into pointers.

The http_auth_group ACL forgot to make its first argument mandatory, so
there was a check in cfgparse to report a vague error. Now that args are
correctly parsed, let's report something more precise.

All urlp* ACLs now support an optional 3rd argument like their sample
counterpart, which is the optional delimiter.

The fetch functions have been renamed "smp_fetch_*".

Some argument controls on the sample keywords have been relaxed so that we
can soon use them for ACLs :

  - cookie now accepts to have an optional name ; it will return the
    first matching cookie if the name is not set ;
  - same for set-cookie and hdr
2013-04-03 02:12:57 +02:00
Willy Tarreau
434c57c95c MINOR: log: indicate it when some unreliable sample fetches are logged
If a log-format involves some sample fetches that may not be present at
the logging instant, we can now report a warning.

Note that this is done both for log-format and for add-header and carefully
respects the original fetch keyword's capabilities.
2013-04-03 02:12:56 +02:00
Willy Tarreau
80aca90ad2 MEDIUM: samples: use new flags to describe compatibility between fetches and their usages
Samples fetches were relying on two flags SMP_CAP_REQ/SMP_CAP_RES to describe
whether they were compatible with requests rules or with response rules. This
was never reliable because we need a finer granularity (eg: an HTTP request
method needs to parse an HTTP request, and is available past this point).

Some fetches are also dependent on the context (eg: "hdr" uses request or
response depending on where it's involved, causing some ambiguity).

In order to solve this, we need to precisely indicate in fetches what they
use, and their users will have to compare with what they have.

So now we have a bunch of bits indicating where the sample is fetched in the
processing chain, with a few variants indicating for some of them if it is
permanent or volatile (eg: an HTTP status is stored into the transaction so
it is permanent, despite being caught in the response contents).

The fetches also have a second mask indicating their validity domain. This one
is computed from a conversion table at registration time, so there is no need
for doing it by hand. This validity domain consists in a bitmask with one bit
set for each usage point in the processing chain. Some provisions were made
for upcoming controls such as connection-based TCP rules which apply on top of
the connection layer but before instantiating the session.

Then everywhere a fetch is used, the bit for the control point is checked in
the fetch's validity domain, and it becomes possible to finely ensure that a
fetch will work or not.

Note that we need these two separate bitfields because some fetches are usable
both in request and response (eg: "hdr", "payload"). So the keyword will have
a "use" field made of a combination of several SMP_USE_* values, which will be
converted into a wider list of SMP_VAL_* flags.

The knowledge of permanent vs dynamic information has disappeared for now, as
it was never used. Later we'll probably reintroduce it differently when
dealing with variables. Its only use at the moment could have been to avoid
caching a dynamic rate measurement, but nothing is cached as of now.
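
A hedged sketch of the resulting check at each use point (<where> stands
for the single SMP_VAL_* bit describing the current config location) :

   if (!(fetch->val & where)) {
        memprintf(err, "fetch method '%s' cannot be used here", fetch->kw);
        return 0;
   }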
2013-04-03 02:12:56 +02:00
Willy Tarreau
e0db1e8946 MEDIUM: acl: remove flag ACL_MAY_LOOKUP which is improperly used
This flag is used on ACL matches that support looking up patterns
in trees. At the moment, only strings and IPs support tree-based lookups,
but the flag is randomly set also on integers and binary data, and is not
even always set on strings nor IPs.

Better get rid of this mess by only relying on the matching function to
decide whether or not it supports tree-based lookups, this is safer and
easier to maintain.
2013-04-03 02:12:56 +02:00
Willy Tarreau
aae75e3279 BUG/CRITICAL: using HTTP information in tcp-request content may crash the process
During normal HTTP request processing, request buffers are realigned if
there are less than global.maxrewrite bytes available after them, in
order to leave enough room for rewriting headers after the request. This
is done in http_wait_for_request().

However, if some HTTP inspection happens during a "tcp-request content"
rule, this realignment is not performed. In theory this is not a problem
because empty buffers are always aligned and TCP inspection happens at
the beginning of a connection. But with HTTP keep-alive, it also happens
at the beginning of each subsequent request. So if a second request was
pipelined by the client before the first one had a chance to be forwarded,
the second request will not be realigned. Then, http_wait_for_request()
will not perform such a realignment either because the request was
already parsed and marked as such. The consequence of this, is that the
rewrite of a sufficient number of such pipelined, unaligned requests may
leave less room past the request being processed than the configured
reserve, which can lead to a buffer overflow if request processing appends
some data past the end of the buffer.

A number of conditions are required for the bug to be triggered :
  - HTTP keep-alive must be enabled ;
  - HTTP inspection in TCP rules must be used ;
  - some request appending rules are needed (reqadd, x-forwarded-for)
  - since empty buffers are always realigned, the client must pipeline
    enough requests so that the buffer always contains something till
    the point where there is no more room for rewriting.

While such a configuration is quite unlikely to be met (which is
confirmed by the bug's lifetime), a few people do use these features
together for very specific usages. And more importantly, writing such
a configuration and the request to attack it is trivial.

A quick workaround consists in forcing keep-alive off by adding
"option httpclose" or "option forceclose" in the frontend. Alternatively,
disabling HTTP-based TCP inspection rules is enough if the application
supports it.

At first glance, this bug does not look like it could lead to remote code
execution, as the overflowing part is controlled by the configuration and
not by the user. But some deeper analysis should be performed to confirm
this. And anyway, corrupting the process' memory and crashing it is quite
trivial.

Special thanks go to Yves Lafon from the W3C who reported this bug and
deployed significant efforts to collect the relevant data needed to
understand it in less than one week.

CVE-2013-1912 was assigned to this issue.

Note that 1.4 is also affected so the fix must be backported.
2013-04-03 02:12:55 +02:00
Willy Tarreau
2d43e18b69 BUG/MAJOR: http: fix regression introduced by commit d655ffe
Sander Klein reported that since last snapshot, some downloads would
hang from nginx but succeed from apache. The culprit was not too hard
to find given the low number of recent changes affecting the data path.

Commit d655ffe slightly reorganized the HTTP state machine and
introduced this regression. The reason is that we must never jump
into the MSG_DONE case without first flushing remaining data because
this is not done anymore afterwards. This part is scheduled for
being reorganized since it's totally ugly especially since we added
compression, and this regression is an illustration of its lack of readability.

The issue is entirely dependent on the server close sequence, which
explains why it was reproducible only with nginx here.
2013-04-03 00:22:25 +02:00
Willy Tarreau
ffb6f08bab BUG/MAJOR: http: fix regression introduced by commit a890d072
This commit fixed a bug and introduced a new one at the same time.
It's a stupid typo, the index to store the context is [0], not [2].

The effect is that parsing the header can loop forever if multiple
headers are found. This issue was reported by Lukas Tribus.
2013-04-02 23:19:30 +02:00
Willy Tarreau
a890d072fc BUG/MAJOR: http: use a static storage for sample fetch context
Baptiste Assmann reported that the cook*() ACLs do not work anymore.

The reason is the way we store the hdr_ctx between subsequent calls
to smp_fetch_cookie() since commit 3740635b (1.5-dev10).

The smp->ctx.a[] storage holds up to 8 pointers. It is not meant for
generic storage. We used to store hdr_ctx in the ctx, but while it used
to just fit for smp_fetch_hdr(), it does not for smp_fetch_cookie()
since we stored it at offset 2.

The correct solution is to use this storage to store a pointer to the
current hdr_ctx struct which is statically allocated.
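
A hedged sketch of that convention (simplified ; 1.5 is single-threaded,
which is what makes static storage acceptable here) :

   static struct hdr_ctx static_hdr_ctx;

   /* store a single pointer to static storage instead of spreading
    * raw state across several ctx.a[] slots : */
   smp->ctx.a[0] = &static_hdr_ctx;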
2013-04-02 12:01:06 +02:00
Willy Tarreau
d655ffe863 OPTIM: http: optimize the response forward state machine
By replacing the if/else series with a switch/case, we could save
another 20% on the worst case (chunks of 1 byte).
2013-04-02 02:01:00 +02:00
Willy Tarreau
0161d62d23 OPTIM: http: improve branching in chunk size parser
By tweaking a bit some conditions in http_parse_chunk_size(), we could
improve the overall performance in the worst case by 15%.
2013-04-02 02:00:57 +02:00
Yves Lafon
e267421e93 MINOR: http: status 301 should not be marked non-cacheable
Also, browsers' behaviour is inconsistent regarding the Cache-Control
header field on a 301.
2013-03-30 11:22:41 +01:00
Yves Lafon
3e8d1ae2d2 MEDIUM: http: implement redirect 307 and 308
I needed to emit a 307 and noticed it was not available so I did it,
as well as 308.
2013-03-29 19:17:41 +01:00
Yves Lafon
4e8ec500e5 MINOR: http: status code 303 is HTTP/1.1 only
Don't return a 303 redirect with "HTTP/1.0" as it's HTTP/1.1 only.
2013-03-29 19:08:09 +01:00
Willy Tarreau
2fef9b1ef6 BUG/MEDIUM: http: fix another issue caused by http-send-name-header
An issue reported by David Coulson is that when using http-send-name-header,
the response processing would randomly be performed. The issue was first
diagnosed by Cyril Bonté as being related to a time race when processing
the closing of the response.

In practice, the issue is a bit trickier. It happens that
http_send_name_header() did not update msg->sol after a rewrite. This
counter is supposed to point to the beginning of the message's body
once headers are scheduled for being forwarded. And not updating it
means that the first forwarding of the request headers in
http_request_forward_body() does not send the correct count, leaving
some bytes in chn->to_forward.

Then if the server sends its response in a single packet with the
close, the stream interface switches to state SI_ST_DIS which in
turn moves to SI_ST_CLO in process_session(), causing the
outgoing connection to be closed. This is detected by http_request_forward_body(),
which then switches the request message to the error state, and syncs
all FSMs and removes any response analyser.

The response analyser being removed, no processing is performed on
the response buffer, which is tunnelled as-is to the client.

Of course, the correct fix consists in having http_send_name_header()
update msg->sol. Normally this ought not to have been needed, but it
is an abuse to modify data already scheduled for being forwarded, so
it is expected that such specific handling has to be done there. Better
not have generic functions deal with such cases, so that it does not
become the standard.

Note: 1.4 does not have this issue even if it does not update the
pointer either, because it forwards from msg->som which is not
updated at the moment the connect() succeeds. So no backport is
required.
2013-03-26 01:21:47 +01:00
Willy Tarreau
3bfeadb3f6 BUG/MEDIUM: http: add-header should not emit "-" for empty fields
Patch 6cbbdbf3 fixed the missing "-" delimiters in logs but it caused
them to be emitted with "http-request add-header", even though it was
correctly fixed for the unique-id format. Fix this by simply removing
LOG_OPT_MANDATORY in this case.
2013-03-24 07:33:22 +01:00
Willy Tarreau
6cbbdbf3f3 BUG/MEDIUM: log: emit '-' for empty fields again
Commit 2b0108ad accidentally got rid of the ability to emit a "-" for
empty log fields. This can happen for captured request and response
cookies, as well as for fetches. Since we don't want to have this done
for headers however, we set the default log method when parsing the
format. It is still possible to force the desired mode using +M/-M.
2013-02-05 18:55:09 +01:00
Willy Tarreau
192e59fb07 CLEANUP: http: don't try to deinitialize http compression if it fails before init
In select_compression_response_header(), some tests are rather confusing
as the "fail" label is used to deinitialize the compression context for
the session while it's branched only before initialization succeeds. The
test is always false here and the dereferencing of the comp_algo pointer
which might be null is also confusing. Remove that code which is not needed
anymore since commit ec3e3890 got rid of the latest issues.

Reported-by: Dinko Korunic <dkorunic@reflected.net>
2013-01-24 16:19:19 +01:00
Willy Tarreau
4521ba689c CLEANUP: http: remove a useless null check
srv cannot be null in http_perform_server_redirect(), as it's taken
from the stream interface's target which is always valid for a
server-based redirect, and it was already dereferenced above, so in
practice, gcc already removes the test anyway.

Reported-by: Dinko Korunic <dkorunic@reflected.net>
2013-01-24 16:19:18 +01:00
Baptiste Assmann
116eefed8f MINOR: config: http-request configuration error message misses new keywords
"redirect" and "tarpit" keywords were missing from http-request configuration
error message.
2013-01-05 16:53:49 +01:00
Willy Tarreau
56e9ffa6a6 BUG/MINOR: http-compression: lookup Cache-Control in the response, not the request
As stated in both RFC2616 and the http-bis drafts, Cache-Control:
no-transform must be looked up in the response since we're modifying
the response. However, its presence in the request is irrelevant to
any changes in the response :

  7.2.1.6. no-transform
   The "no-transform" request directive indicates that an intermediary
   (whether or not it implements a cache) MUST NOT change the Content-
   Encoding, Content-Range or Content-Type request header fields, nor
   the request representation.

  7.2.2.9. no-transform
   The "no-transform" response directive indicates that an intermediary
   (regardless of whether it implements a cache) MUST NOT change the
   Content-Encoding, Content-Range or Content-Type response header
   fields, nor the response representation.

Note: according to the specs, we're supposed to emit the following
response header :

  Warning: 214 transformation applied

However no other product seems to do it, so the effect on user agents
is unclear.
2013-01-05 16:31:58 +01:00
Willy Tarreau
ccbcc37a01 MEDIUM: http: add support for "http-request tarpit" rule
The "reqtarpit" rule is not very handy to use. Now that we have more
flexibility with "http-request", let's finally make the tarpit rules
usable there.

There are still semantical differences between apply_filters_to_request()
and http_req_get_intercept_rule() because the former updates the counters
while the latter does not. So we currently have nearly identical code leaves
for similar conditions, but this should be cleaned up later.
2012-12-28 14:47:19 +01:00
Willy Tarreau
81499eb67d MEDIUM: http: add support for "http-request redirect" rules
These are exactly the same as the classic redirect rules except
that they can be interleaved with other http-request rules for
more flexibility.

The redirect parser should probably be changed to stop at the condition
so that the caller puts its own condition pointer. At the moment, the
redirect rule and condition are parsed at once by build_redirect_rule()
and the condition is assigned to the http_req_rule.
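
An illustrative interleaving (hypothetical paths) :

   acl old_docs path_beg /old/
   http-request deny if { src 192.168.0.0/16 }
   http-request redirect location /new/ code 301 if old_docs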
2012-12-28 14:47:19 +01:00