haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
Willy Tarreau	973a54235f	MEDIUM: stream-int: simplify si_alloc_conn() Since we now always call this function with the reuse parameter cleared, let's simplify the function's logic as it cannot return the existing connection anymore. The savings on this inline function are appreciable (240 bytes) : $ size haproxy.old haproxy.new text data bss dec hex filename 1020383 40816 36928 1098127 10c18f haproxy.old 1020143 40816 36928 1097887 10c09f haproxy.new	2015-08-05 21:51:09 +02:00
Thierry FOURNIER	bf65cd4d77	MAJOR: arg: converts uint and sint in sint This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	07ee64ef4d	MAJOR: sample: converts uint and sint in 64 bits signed integer This patch removes the 32-bit unsigned integer and the 32-bit signed integer. It replaces these types with a single 64-bit signed type. This makes the usage of integers easier and clarifies signed and unsigned use. With the previous version, signed and unsigned were used interchangeably, and sometimes the converter lost the sign. For example, divisions are processed as "unsigned"; if one entry is negative, the result is wrong. Note that the integer pattern matching and dotted version pattern matching already work with signed 64-bit integer values. There is one user-visible change : the "uint()" and "sint()" sample fetch functions which used to return a constant integer have been replaced with a new more natural, unified "int()" function. These functions were only introduced in the latest 1.6-dev2 so there's no impact on regular deployments.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	fac9ccfb70	BUG/MINOR: http/sample: gmtime/localtime can fail The man page says that gmtime() and localtime() can return a NULL value. This is not tested. It appears that all the values of a 32-bit integer are valid, but it is better to check the return of these functions. However, if the integer moves from 32 bits to 64 bits, some 64-bit values may be unsupported.	2015-07-20 12:21:35 +02:00
Adis Nezirovic	2fbcafc9ce	MEDIUM: http: Add new 'set-src' option to http-request This option enables overriding the source IP address in an HTTP request. It is useful when we want to set a custom source IP (e.g. front proxy rewrites address, but provides the correct one in headers) or we want to mask the source IP address for privacy or compliance. It acts on any expression which produces a correct IP address.	2015-07-06 16:17:28 +02:00
Adis Nezirovic	79beb248b9	CLEANUP: sample: generalize sample_fetch_string() as sample_fetch_as_type() This modification makes possible to use sample_fetch_string() in more places, where we might need to fetch sample values which are not plain strings. This way we don't need to fetch string, and convert it into another type afterwards. When using aliased types, the caller should explicitly check which exact type was returned (e.g. SMP_T_IPV4 or SMP_T_IPV6 for SMP_T_ADDR). All usages of sample_fetch_string() are converted to use new function.	2015-07-06 16:17:25 +02:00
Thierry FOURNIER	4834bc773c	MEDIUM: vars: adds support of variables This patch adds support of variables during the processing of each stream. The variables scope can be set as 'session', 'transaction', 'request' or 'response'. The variable type is the type returned by the assignment expression. The type can change while the processing. The allocated memory can be controlled for each scope and each request, and for the global process.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	0e11863a6f	MINOR: tcp/http/conf: extends the keyword registration options This patch permits to register a new keyword with the keywords "tcp-request content", "tcp-request connection", "tcp-response content", "http-request" and "http-response" which is identified only by matching the start of the keyword. For example, we register the keyword "set-var" with the option "match_pfx" and the configuration keyword "set-var(var_name)" matches this entry.	2015-06-13 23:01:37 +02:00
Willy Tarreau	b8cdf52da0	BUG/MEDIUM: http: fix body processing for the stats applet Commit `9fbe18e` ("MEDIUM: http: add a new option http-buffer-request") introduced a regression due to a misplaced check causing the admin mode of the HTTP stats not to work anymore. This patch tried to ensure that when we need a request body for the stats applet, and we have already waited for this body, we don't wait for it again, but the condition was applied too early causing a disabling of the entire processing the body, and based on the wrong HTTP state (MSG_BODY) resulting in the test never matching. Thanks to Chad Lavoie for reporting the problem. This bug is 1.6-only, no backport is needed.	2015-05-29 01:12:38 +02:00
Willy Tarreau	2de8a50918	MEDIUM: http: no need to close the request on redirect if data was parsed There are two reasons for not keeping the client connection alive upon a redirect : - save the client from uploading all data - avoid keeping a connection alive if the redirect goes to another domain The first case should consider an exception when all the data from the client have been read already. This specifically happens on response redirects after a POST to a server. This is an easy situation to detect. It could later be improved to cover the cases where option http-buffer-request is used.	2015-05-28 17:45:43 +02:00
Willy Tarreau	51d861a44f	MEDIUM: http: implement http-response redirect rules Sometimes it's problematic not to have "http-response redirect" rules, for example to perform a browser-based redirect based on certain server conditions (eg: match of a header). This patch adds "http-response redirect location <fmt>" which gives enough flexibility for most imaginable operations. The connection to the server is closed when this is performed so that we don't risk to forward any pending data from the server. Any pending response data are trimmed so that we don't risk to forward anything pending to the client. It's harmless to also do that for requests so we don't need to consider the direction.	2015-05-28 17:45:43 +02:00
Willy Tarreau	be4653b6d4	MINOR: http: prepare support for parsing redirect actions on responses In order to support http-response redirect, the parsing needs to be adapted a little bit to only support the "location" type, and to adjust the log-format parser so that it knows the direction of the sample fetch calls.	2015-05-28 17:43:11 +02:00
Willy Tarreau	b329a312e3	CLEANUP: http: explicitly reference request in http_apply_redirect_rules() This function was made to perform a redirect on requests only, it was using a message or txn->req in an inconsistent way and did not consider the possibility that it could be used for the other direction. Let's clean it up to have both a request and a response messages.	2015-05-28 17:42:16 +02:00
Thierry FOURNIER	e80fadaaca	MEDIUM: capture: adds http-response capture This patch adds a http response capture keyword with the same behavior as the previous patch called "MEDIUM: capture: Allow capture with slot identifier".	2015-05-28 13:51:00 +02:00
Thierry FOURNIER	82bf70dff4	MEDIUM: capture: Allow capture with slot identifier This patch modifies the current http-request capture function and adds a new keyword "id" that permits to identify a capture slot. If the identifier doesn't exist, the action fails silently. Note that this patch removes an unused list initialisation, which seems to be inherited from a copy/paste. It's harmless and does not need to be backported. LIST_INIT((struct list *)&rule->arg.act.p[0]);	2015-05-28 13:50:29 +02:00
Thierry FOURNIER	35ab27561e	MINOR: capture: add two "capture" converters This patch adds "capture-req" and "capture-res". These two converters capture their entry in the allocated slot given in argument and pass the input on the output.	2015-05-28 13:50:29 +02:00
Willy Tarreau	98d0485a90	MAJOR: config: remove the deprecated reqsetbe / reqisetbe actions These ones were already obsoleted in 1.4, marked for removal in 1.5, and not documented anymore. They used to emit warnings, and do still require quite some code to stay in place. Let's remove them now.	2015-05-26 12:18:29 +02:00
Dragan Dosen	26f77e534c	BUG/MEDIUM: http: fix the url_param fetch The "name" and "name_len" arguments in function "smp_fetch_url_param" could be left uninitialized for subsequent calls. [wt: no backport needed, this is an 1.6 regression introduced by commit `4fdc74c` ("MINOR: http: split the url_param in two parts") ]	2015-05-25 19:01:39 +02:00
Thierry FOURNIER	8be451c52a	MEDIUM: http: url-encoded parsing function can run throught wrapped buffer The functions smp_fetch_param(), find_next_url_param() and find_url_param_pos() can look for argument in 2 chunks and not only one.	2015-05-20 16:05:38 +02:00
Thierry FOURNIER	e28c49975a	MINOR: http: add body_param fetch This fetch returns one body param or the list of each body param. This first version runs only with one chunk.	2015-05-20 15:56:23 +02:00
Thierry FOURNIER	0948d41a12	CLEANUP: http: bad indentation Some function argument uses space in place of tabulation for the indentation.	2015-05-20 15:56:23 +02:00
Thierry FOURNIER	4fdc74c22c	MINOR: http: split the url_param in two parts This patch is part of the body_param fetch. The goal is to have a generic url-encoded parser which can be used for parsing the query string and the body.	2015-05-20 15:56:23 +02:00
Willy Tarreau	1ede1daab6	MEDIUM: http: make url_param iterate over multiple occurrences There are some situations where it's desirable to scan multiple occurrences of a same parameter name in the query string. This change ensures this can work, even with an empty name which will then iterate over all parameters.	2015-05-19 13:16:07 +02:00
Thierry FOURNIER	0786d05a04	MEDIUM: sample: change the prototype of sample-fetches functions This patch removes the "opt" entry from the prototype of the sample-fetches functions. This permits to remove some weight in the prototype call.	2015-05-11 20:03:08 +02:00
Thierry FOURNIER	0a9a2b8cec	MEDIUM: sample change the prototype of sample-fetches and converters functions This patch removes the structs "session", "stream" and "proxy" from the sample-fetches and converters function prototypes. This permits to remove some weight in the prototype call.	2015-05-11 20:01:42 +02:00
Willy Tarreau	bbfb6c4085	BUG/MEDIUM: http: don't forward client shutdown without NOLINGER except for tunnels There's an issue related with shutting down POST transfers or closing the connection after the end of the upload : the shutdown is forwarded to the server regardless of the abortonclose option. The problem it causes is that during a scan, brute force or whatever, it becomes possible that all source ports are exhausted with all sockets in TIME_WAIT state. There are multiple issues at once in fact : - no action is done for the close, it automatically happens at the lower layers thanks to channel_auto_close(), so we cannot act on NOLINGER ; - we do want to continue to send a clean shutdown in tunnel mode because some protocols transported over HTTP may need this, regardless of option abortonclose, thus we can't set the option unconditionally - for all other modes, we do want to close the dirty way because we're certain whether we've sent everything or not, and we don't want to eat all source ports. The solution is a bit complex and applies to DONE/TUNNEL states : 1) disable automatic close for everything not a tunnel and not just keep-alive / server-close. Force-close is now covered, as is HTTP/1.0 which implicitly works in force-close mode ; 2) when processing option abortonclose, we know we can disable lingering if the client has closed and the connection is not in tunnel mode. Since the last case above leads to a situation where the client side reports an error, we know the connection will not be reused, so leaving the flag on the stream-interface is safe. A client closing in the middle of the data transmission already aborts the transaction so this case is not a problem. This fix must be backported to 1.5 where the problem was detected.	2015-05-11 19:05:42 +02:00
Thierry FOURNIER	82ff3c9b05	MINOR: sample: add url_dec converter This converter decodes an url-encoded string. It takes a string as input and returns string as output.	2015-05-11 11:40:36 +02:00
Willy Tarreau	3986ac1860	BUG/MEDIUM: http: fix the http-request capture parser Due to the code being mostly inspired from the tcp-request parser, it does some crap because both don't work the same way. The "len" argument could be mismatched and then the length could be used uninitialized.	2015-05-08 16:13:42 +02:00
Willy Tarreau	a9083d0722	MEDIUM: http: add new "capture" action for http-request This is only possible in frontends of course, but it will finally make it possible to capture arbitrary http parts, including URL parameters or parts of the message body. It's worth noting that an ugly (char **) cast had to be done to call sample_fetch_string() which is caused by a 5- or 6- levels of inheritance of this type in the API. Here it's harmless since the function uses it as a const, but this API madness must be fixed, starting with the one or two rare functions that modify the args and inflict this on each and every keyword parser. (cherry picked from commit 484a4f38460593919a1c1d9a047a043198d69f45)	2015-05-08 15:43:54 +02:00
Willy Tarreau	a5910cc6ef	MEDIUM: http: provide 3 fetches for the body Body processing is still fairly limited, but this is a start. It becomes possible to apply regex to find contents in order to decide where to route a request for example. Only the first chunk is parsed for now, and the response is not yet available (the parsing function must be duplicated for this). req.body : binary This returns the HTTP request's available body as a block of data. It requires that the request body has been buffered made available using "option http-buffer-request". In case of chunked-encoded body, currently only the first chunk is analyzed. req.body_len : integer This returns the length of the HTTP request's available body in bytes. It may be lower than the advertised length if the body is larger than the buffer. It requires that the request body has been buffered made available using "option http-buffer-request". req.body_size : integer This returns the advertised length of the HTTP request's body in bytes. It will represent the advertised Content-Length header, or the size of the first chunk in case of chunked encoding. In order to parse the chunks, it requires that the request body has been buffered made available using "option http-buffer-request".	2015-05-02 00:46:08 +02:00
Willy Tarreau	9fbe18e174	MEDIUM: http: add a new option http-buffer-request It is sometimes desirable to wait for the body of an HTTP request before taking a decision. This is what is being done by "balance url_param" for example. The first use case is to buffer requests from slow clients before connecting to the server. Another use case consists in taking the routing decision based on the request body's contents. This option placed in a frontend or backend forces the HTTP processing to wait until either the whole body is received, or the request buffer is full, or the first chunk is complete in case of chunked encoding. It can have undesired side effects with some applications abusing HTTP by expecting unbuffered transmissions between the frontend and the backend, so this should definitely not be used by default. Note that it would not work for the response because we don't reset the message state before starting to forward. For the response we need to 1) reset the message state to MSG_100_SENT or BODY , and 2) to reset body_len in case of chunked encoding to avoid counting it twice.	2015-05-02 00:10:44 +02:00
Willy Tarreau	e115b49c39	BUG/MEDIUM: http: wait for the exact amount of body bytes in wait_for_request_body Due to the fact that we were still considering only msg->sov for the first byte of data after calling http_parse_chunk_size(), we used to miscompute the input data size and to count the CRLF and the chunk size as part of the input data. The effect is that it was possible to release the processing with 3 or 4 missing bytes, especially if they're typed by hand during debugging sessions. This can cause the stats page to return some errors in admin mode, and the url_param balance algorithm to fail to properly hash a body input. This fix must be backported to 1.5.	2015-05-01 23:24:32 +02:00
Willy Tarreau	0f228a037a	MEDIUM: http: add option-ignore-probes to get rid of the floods of 408 Recently some browsers started to implement a "pre-connect" feature consisting in speculatively connecting to some recently visited web sites just in case the user would like to visit them. This results in many connections being established to web sites, which end up in 408 Request Timeout if the timeout strikes first, or 400 Bad Request when the browser decides to close them first. These ones pollute the log and feed the error counters. There was already "option dontlognull" but it's insufficient in this case. Instead, this option does the following things : - prevent any 400/408 message from being sent to the client if nothing was received over a connection before it was closed ; - prevent any log from being emitted in this situation ; - prevent any error counter from being incremented That way the empty connection is silently ignored. Note that it is better not to use this unless it is clear that it is needed, because it will hide real problems. The most common reason for not receiving a request and seeing a 408 is due to an MTU inconsistency between the client and an intermediary element such as a VPN, which blocks too large packets. These issues are generally seen with POST requests as well as GET with large cookies. The logs are often the only way to detect them. This patch should be backported to 1.5 since it avoids false alerts and makes it easier to monitor haproxy's status.	2015-05-01 15:39:23 +02:00
Willy Tarreau	13317669d5	MEDIUM: http: disable support for HTTP/0.9 by default There's not much reason for continuing to accept HTTP/0.9 requests nowadays except for manual testing. Now we disable support for these by default, unless option accept-invalid-http-request is specified, in which case they continue to be upgraded to 1.0.	2015-05-01 14:57:54 +02:00
Willy Tarreau	91852eb428	MEDIUM: http: restrict the HTTP version token to 1 digit as per RFC7230 While RFC2616 used to allow an undeterminate amount of digits for the major and minor components of the HTTP version, RFC7230 has reduced that to a single digit for each. If a server can't properly parse the version string and falls back to 0.9, it could then send a head-less response whose payload would be taken for headers, which could confuse downstream agents. Since there's no more reason for supporting a version scheme that was never used, let's upgrade to the updated version of the standard. It is still possible to enforce support for the old behaviour using options accept-invalid-http-request and accept-invalid-http-response. It would be wise to backport this to 1.5 as well just in case.	2015-05-01 14:57:01 +02:00
Willy Tarreau	b4d0c03aee	BUG/MEDIUM: http: remove content-length form responses with bad transfer-encoding The spec mandates that content-length must be removed from messages if Transfer-Encoding is present, not just for valid ones. This must be backported to 1.5 and 1.4.	2015-05-01 13:56:11 +02:00
Willy Tarreau	34dfc60571	BUG/MEDIUM: http: incorrect transfer-coding in the request is a bad request The rules related to how to handle a bad transfer-encoding header (one where "chunked" is not at the final place) have evolved to mandate an abort when this happens in the request. Previously it was only a close (which is still valid for the server side). This must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Willy Tarreau	4979d5c5d1	BUG/MEDIUM: http: do not restrict parsing of transfer-encoding to HTTP/1.1 While Transfer-Encoding is HTTP/1.1, we must still parse it in HTTP/1.0 in case an agent sends it, because it's likely that the other side might use it as well, causing confusion. This will also result in getting rid of the Content-Length header in such abnormal situations and in having a clean connection. This must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Willy Tarreau	557f199fb7	DOC: http: update the comments about the rules for determining transfer-length Let's now use the text from RFC7230 which is stricter and more precise. This must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Willy Tarreau	1c91391df4	BUG/MEDIUM: http: remove content-length from chunked messages RFC7230 clarified the behaviour to adopt when facing both a content-length and a transfer-encoding: chunked in a message. While haproxy already complied with the method for getting the message length right, and used to detect improper content-length duplicates, it still did not remove the content-length header when facing a transfer-encoding: chunked. Usually it is not a problem since other agents (clients and servers) are required to parse the message according to the rules that have been in place since RFC2616 in 1999. However Régis Leroy reported the existence of at least one such non-compliant agent so haproxy could be abused to get out of sync with it on pipelined requests (HTTP request smuggling attack), it consider part of a payload as a subsequent request. The best thing to do is then to remove the content-length according to RFC7230. It used to be in the todo list with a fixme in the code while waiting for the standard to stabilize, let's apply it now that it's published. Thanks to Régis for bringing that subject to our attention. This fix must be backported to 1.5 and 1.4.	2015-05-01 13:56:10 +02:00
Thierry FOURNIER	7f6192c0d3	BUG/MEDIUM: http: functions set-{path,query,method,uri} breaks the HTTP parser When one of these functions replaces a part of the query string by a shorter or longer new one, the header parsing is broken. This is because the start of the first header is not updated. In the same way, the total length of the request line is not updated. I don't see any bug caused by this omission, but I guess that it is better to store the correct length. This bug is only in the development version.	2015-04-27 11:56:52 +02:00
Willy Tarreau	ee335e65dc	BUG/MEDIUM: http: properly retrieve the front connection Commit `350f487` ("CLEANUP: session: simplify references to chn_{prod,cons}(&s->{req,res})") introduced a regression causing the cli_conn to be picked from the server side instead of the client side, so the XFF header is not appended anymore since the connection is NULL. Thanks to Reinis Rozitis for reporting this bug. No backport is needed as it's 1.6-specific.	2015-04-21 18:15:13 +02:00
Willy Tarreau	152b81e7b2	BUG/MAJOR: tcp/http: fix current_rule assignment when restarting over a ruleset Commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions") introduced the ability to interrupt and restart processing in the middle of a TCP/HTTP ruleset. But it doesn't do it in a consistent way : it checks current_rule_list, immediately dereferences current_rule, which is only set in certain cases and never cleared. So that broke the tcp-request content rules when the processing was interrupted due to missing data, because current_rule was not yet set (segfault) or could have been inherited from another ruleset if it was used in a backend (random behaviour). The proper way to do it is to always set current_rule before dereferencing it. But we don't want to set it for all rules because we don't want any action to provide a checkpointing mechanism. So current_rule is set to NULL before entering the loop, and only used if not NULL and if current_rule_list matches the current list. This way they both serve as a guard for the other one. This fix also makes the current rule point to the rule instead of its list element, as it's much easier to manipulate. No backport is needed, this is 1.6-specific.	2015-04-20 13:46:20 +02:00
CJ Ess	108b1dd69d	MEDIUM: http: configurable http result codes for http-request deny This patch adds support for error codes 429 and 405 to Haproxy and a "deny_status XXX" option to "http-request deny" where you can specify which code is returned with 403 being the default. We really want to do this the "haproxy way" and hope to have this patch included in the mainline. We'll be happy to address any feedback on how this is implemented.	2015-04-11 10:34:54 +02:00
Willy Tarreau	d0d8da989b	MINOR: stream: provide a few helpers to retrieve frontend, listener and origin Expressions are quite long when using strm_sess(strm)->whatever, so let's provide a few helpers : strm_fe(), strm_li(), strm_orig().	2015-04-06 11:37:29 +02:00
Willy Tarreau	192252e2d8	MAJOR: sample: pass a pointer to the session to each sample fetch function Many such function need a session, and till now they used to dereference the stream. Once we remove the stream from the embryonic session, this will not be possible anymore. So as of now, sample fetch functions will be called with this : - sess = NULL, strm = NULL : never - sess = valid, strm = NULL : tcp-req connection - sess = valid, strm = valid, strm->txn = NULL : tcp-req content - sess = valid, strm = valid, strm->txn = valid : http-req / http-res	2015-04-06 11:37:25 +02:00
Willy Tarreau	987e3fb868	MEDIUM: http: remove the now useless http_txn from {req/res} rules The registerable http_req_rules / http_res_rules used to require a struct http_txn at the end. It's redundant with struct stream and propagates very deep into some parts (ie: it was the reason for lua requiring l7). Let's remove it now.	2015-04-06 11:35:53 +02:00
Willy Tarreau	15e91e1b36	MAJOR: sample: don't pass l7 anymore to sample fetch functions All of them can now retrieve the HTTP transaction if it exists from the stream and be sure to get NULL there when called with an embryonic session. The patch is a bit large because many locations were touched (all fetch functions had to have their prototype adjusted). The opportunity was taken to also uniformize the call names (the stream is now always "strm" instead of "l4") and to fix indent where it was broken. This way when we later introduce the session here there will be less confusion.	2015-04-06 11:35:53 +02:00
Willy Tarreau	eee5b51248	MAJOR: http: move http_txn out of struct stream Now this one is dynamically allocated. It means that 280 bytes of memory are saved per TCP stream, but more importantly that it will become possible to remove the l7 pointer from fetches and converters since it will be deduced from the stream and will support being null. A lot of care was taken because it's easy to forget a test somewhere, and the previous code used to always trust s->txn for being valid, but all places seem to have been visited. All HTTP fetch functions check the txn first so we shouldn't have any issue there even when called from TCP. When branching from a TCP frontend to an HTTP backend, the txn is properly allocated at the same time as the hdr_idx.	2015-04-06 11:35:52 +02:00
Willy Tarreau	63986c72c8	MINOR: http: create a dedicated pool for http_txn This one will not necessarily be allocated for each stream, and we want to use the fact that it equals null to know it's not present so that we can always deduce its presence from the stream pointer. This commit only creates the new pool.	2015-04-06 11:35:52 +02:00
Willy Tarreau	cb7dd015be	MEDIUM: http: move header captures from http_txn to struct stream The header captures are now general purpose captures since tcp rules can use them to capture various contents. That removes a dependency on http_txn that appeared in some sample fetch functions and in the order by which captures and http_txn were allocated. Interestingly the reset of the header captures were done at too many places as http_init_txn() used to do it while it was done previously in every call place.	2015-04-06 11:35:52 +02:00
Willy Tarreau	53c9b4db41	CLEANUP: sample: remove useless tests in fetch functions for l4 != NULL The stream may never be null given that all these functions are called from sample_process(). Let's remove this now confusing test which sometimes happens after a dereference was already done.	2015-04-06 11:35:52 +02:00
Willy Tarreau	9ad7bd48d2	MEDIUM: session: use the pointer to the origin instead of s->si[0].end When s->si[0].end was dereferenced as a connection or anything in order to retrieve information about the originating session, we'll now use sess->origin instead so that when we have to chain multiple streams in HTTP/2, we'll keep accessing the same origin.	2015-04-06 11:34:29 +02:00
Willy Tarreau	e36cbcb3b0	MEDIUM: stream: move the frontend's pointer to the session Just like for the listener, the frontend is session-wide so let's move it to the session. There are a lot of places which were changed but the changes are minimal in fact.	2015-04-06 11:23:58 +02:00
Willy Tarreau	fb0afa77c9	MEDIUM: stream: move the listener's pointer to the session The listener is session-specific, move it there.	2015-04-06 11:23:57 +02:00
Willy Tarreau	e7dff02dd4	REORG/MEDIUM: stream: rename stream flags from SN_* to SF_* This is in order to keep things consistent.	2015-04-06 11:23:57 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved over there. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection | applet There will be at most one http_txn per stream, and a same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
Willy Tarreau	cb703b0352	BUG/MAJOR: http: null-terminate the http actions keywords list Commit `a0dc23f` ("MEDIUM: http: implement http-request set-{method,path,query,uri}") forgot to null-terminate the list, resulting in crashes when these actions are used if the platform doesn't pad the struct with nulls. Thanks to Gunay Arslan for reporting a detailed trace showing the origin of this bug. No backport to 1.5 is needed.	2015-04-03 09:58:02 +02:00
Willy Tarreau	601a4d1741	BUG/MEDIUM: http: hdr_cnt would not count any header when called without name It's documented that these sample fetch functions should count all headers and/or all values when called with no name but in practice it's not what is being done as a missing name causes an immediate return and an absence of result. This bug is present in 1.5 as well and must be backported.	2015-04-01 19:16:09 +02:00
Willy Tarreau	615105e7e8	MEDIUM: compression: add a distinction between UA- and config- algorithms Thanks to MSIE/IIS, the "deflate" name is ambiguous. According to the RFC it's a zlib-wrapped deflate stream, but IIS used to send only a raw deflate stream, which is the only format MSIE understands for "deflate". The other widely used browsers do support both formats. For this reason some people prefer to emit a raw deflate stream on "deflate" to serve more users even if that means violating the standards. Haproxy only follows the standard, so they cannot do this. This patch makes it possible to have one algorithm name in the configuration and another one in the protocol. This will make it possible to have a new configuration token to add a different algorithm so that users can decide if they want a raw deflate or the standard one.	2015-03-28 16:46:38 +01:00
Willy Tarreau	e7e49a8d0b	MINOR: http: check the algo name "identity" instead of the function pointer Next patch will make all compression functions static, so let's stop relying on a function pointer comparison and use the algo name instead.	2015-03-28 15:43:17 +01:00
Thierry FOURNIER	7fe75e0dab	MINOR: http: export function inet_set_tos() This is used by Lua.	2015-03-18 11:34:06 +01:00
Thierry FOURNIER	5531f87ace	MINOR: http: split http_transform_header() function in two parts. This function is a callback for HTTP actions. This function creates the replacement string from a build_logline() format and transforms the header. This patch splits this function in two parts. With this modification, the header transformation and the replacement string are separated. We can now transform the header with another replacement string source than a build_logline() format.	2015-03-18 11:34:06 +01:00
Thierry FOURNIER	b77aece24a	MINOR: http: split the function http_action_set_req_line() in two parts The first part is the replacement engine. It takes a replacement action number and a replacement string and processes the action. The second part is the function which is called by the 'http-request action' to replace a request line part. This function makes the string used as replacement. This split makes it possible to use the replacement engine in other parts of the code than the request action. Lua uses it for its own http action.	2015-03-18 11:34:06 +01:00
Thierry FOURNIER	63d692c037	MEDIUM: http: allows 'R' and 'S' in the protocol alphabet This patch allows the 'R' and the 'S' in the protocol/version alphabet. This makes it possible to process RTSP requests like HTTP.	2015-03-17 16:19:52 +01:00
Thierry FOURNIER	5a33ac78ad	MEDIUM/CLEANUP: http: rewrite and lighten http_transform_header() prototype The http_transform_header() function prototype uses some parameters which can be guessed from other parameters. This patch removes these parameters.	2015-03-17 11:42:43 +01:00
Thierry FOURNIER	191f9efdc5	BUG/MEDIUM: http: the function "(req\|res)-replace-value" doesn't respect the HTTP syntax These functions used an invalid header parser. - The trailing white-spaces were embedded in the replacement regex, - The double-quote (") containing comma (,) were not respected. This patch replaces this parser by the "official" parser http_find_header2().	2015-03-17 11:42:43 +01:00
Thierry FOURNIER	534101658d	BUG/MAJOR: http: don't read past buffer's end in http_replace_value The function http_replace_value uses a bad variable to detect the end of the input string. Regression introduced by the patch "MEDIUM: regex: Remove null terminated strings." (`c9c2daf2`). We need to backport this patch into the 1.5 stable branch. WT: there is no possibility to overwrite existing data as we only read past the end of the request buffer, to copy into the trash. The copy is bounded by buffer_replace2(), just like the replacement performed by exp_replace(). However if a buffer happens to contain non-zero data up to the next unmapped page boundary, there's a theoretical risk of crashing the process despite this not being reproducible in tests. The risk is low because "http-request replace-value" did not work due to this bug so that probably means it's not used yet.	2015-03-16 14:20:07 +01:00
Thierry FOURNIER	01c30124ae	BUG/MEDIUM: http: the action set-{method\|path\|query\|uri} doesn't run. This bug is introduced by the commit "MEDIUM: http/tcp: permit to resume http and tcp custom actions" ( `bc4c1ac6ad` ). Before this patch, the return code of the function was ignored. After this patch, if the function returns 0, it wants a YIELD. The function http_action_set_req_line() returns 0 in the success case. This patch changes the return code of this function.	2015-03-14 15:53:31 +01:00
Jesse Hathaway	2468d4e4f7	MEDIUM: http: Compress HTTP responses with status codes 201,202,203 in addition to 200 It is common for rest applications to return status codes other than 200, so compress the other common 200 level responses which might contain content.	2015-03-11 23:23:41 +01:00
Willy Tarreau	350f487300	CLEANUP: session: simplify references to chn_{prod,cons}(&s->{req,res}) These 4 combinations are needlessly complicated since the session already has direct access to the associated stream interfaces without having to check an indirect pointer.	2015-03-11 20:41:47 +01:00
Willy Tarreau	73796535a9	REORG/MEDIUM: channel: only use chn_prod / chn_cons to find stream-interfaces The purpose of these two macros will be to pass via the session to find the relevant stream interfaces so that we don't need to store the ->cons nor ->prod pointers anymore. Currently they're only defined so that all references could be removed. Note that many places need a second pass of clean up so that we don't have any chn_prod(&s->req) anymore and only &s->si[0] instead, and conversely for the 3 other cases.	2015-03-11 20:41:47 +01:00
Willy Tarreau	a5f5d8dc69	MEDIUM: stream-int: add a flag indicating which side the SI is on This new flag "SI_FL_ISBACK" is set only on the back SI and is cleared on the front SI. That way it's possible only by looking at the SI to know what side it is.	2015-03-11 20:41:46 +01:00
Willy Tarreau	2bb4a96f8f	REORG/MEDIUM: stream-int: introduce si_ic/si_oc to access channels We'll soon remove direct references to the channels from the stream interface since everything belongs to the same session, so let's first not dereference si->ib / si->ob anymore and use macros instead.	2015-03-11 20:41:46 +01:00
Willy Tarreau	22ec1eadd0	REORG/MAJOR: move session's req and resp channels back into the session The channels were pointers to outside structs and this is not needed anymore since the buffers have moved, but this complicates operations. Move them back into the session so that both channels and stream interfaces are always allocated for a session. Some places (some early sample fetch functions) used to validate that a channel was NULL prior to dereferencing it. Now instead we check if chn->buf is NULL and we force it to remain NULL until the channel is initialized.	2015-03-11 20:41:46 +01:00
Willy Tarreau	612adb8459	BUG/MAJOR: http: fix stats regression consecutive to HTTP_RULE_RES_YIELD Commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions") unfortunately broke the stats applet by moving the clearing of the analyser bit after processing the applet headers. It used to work only in HTTP/1.1 and not in HTTP/1.0. This is 1.6-specific, no backport is needed.	2015-03-10 15:33:55 +01:00
Thierry FOURNIER	bc4c1ac6ad	MEDIUM: http/tcp: permit to resume http and tcp custom actions Later, the processing of some actions needs to be interrupted and resumed later. This patch makes it possible to resume the actions. The actions that need to run in resume mode are not yet available. They will be soon with the Lua patches. So the code added by this patch is untestable for the moment. The list of "tcp_exec_req_rules" cannot resume because it is called by the unresumable function "accept_session".	2015-02-28 23:12:33 +01:00
Thierry FOURNIER	9e2ef999a9	MEDIUM: http: change the code returned by the response processing rule functions Currently, this function returns a pointer to the rule that stops the evaluation of the rule list. Later we will integrate the possibility to interrupt and resume the processing of some actions. The current response mode is not sufficient to return the "interrupt" information. The pointer returned is never used, so I change the return type of this function to an enum. With this enum, the function is ready to return the "interrupt" value.	2015-02-28 23:12:33 +01:00
Thierry FOURNIER	49f45af9aa	MINOR: global: export many symbols. The functions "val_payload_lv" and "val_hdr" are useful with Lua. The Lua automatic binding for sample fetches needs to compare check functions. The "arg_type_names" array makes it possible to display error messages.	2015-02-28 23:12:32 +01:00
Thierry FOURNIER	f41a809dc9	MINOR: sample: add private argument to the struct sample_fetch This private argument is added to prepare the integration of the Lua fetches.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	68a556e282	MINOR: converters: give the session pointer as converter argument Some usages of the converters need to know the attached session. Lua needs the session for retrieving its running context. This patch adds the "session" as an argument of the converters prototype.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	1edc971919	MINOR: converters: add a "void *private" argument to converters This makes it possible to store a specific configuration pointer. It will be useful for the future Lua integration.	2015-02-28 23:12:31 +01:00
Willy Tarreau	eb27ec7569	MINOR: http: add the new sample fetches req.hdr_names and res.hdr_names These new sample fetches retrieve the list of header names as they appear in the request or response. This can be used for debugging, for statistics as well as an aid to better detect the presence of proxies or plugins on some browsers, which alter the request compared to a regular browser by adding or reordering headers.	2015-02-20 14:00:44 +01:00
Willy Tarreau	c90dc23e99	MINOR: http: add a new function to iterate over each header line New function http_find_next_header() will be used to scan all the input headers for various processing and for http/1 to http/2 header mapping.	2015-02-20 14:00:44 +01:00
Willy Tarreau	34d4c3c13f	BUG/MINOR: http: abort request processing on filter failure Commit `c600204` ("BUG/MEDIUM: regex: fix risk of buffer overrun in exp_replace()") added a control of failure on the response headers, but forgot to check for the error during request processing. So if the filters fail to apply, we could keep the request. It might cause some headers to silently fail to be added for example. Note that it's tagged MINOR because a standard configuration cannot make this case happen. The fix should be backported to 1.5 and 1.4 though.	2015-01-30 20:58:58 +01:00
Willy Tarreau	aa435e7d7e	BUG/MINOR: http: fix incorrect header value offset in replace-hdr/replace-value The two http-req/http-resp actions "replace-hdr" and "replace-value" were expecting exactly one space after the colon, which is wrong. It was causing the first char not to be seen/modified when no space was present, and empty headers not to be modified either. Instead of using name->len+2, we must use ctx->val which points to the first character of the value even if there is no value. This fix must be backported into 1.5.	2015-01-29 14:01:34 +01:00
Willy Tarreau	a0dc23f093	MEDIUM: http: implement http-request set-{method,path,query,uri} This commit implements the following new actions : - "set-method" rewrites the request method with the result of the evaluation of format string <fmt>. There should be very few valid reasons for having to do so as this is more likely to break something than to fix it. - "set-path" rewrites the request path with the result of the evaluation of format string <fmt>. The query string, if any, is left intact. If a scheme and authority is found before the path, they are left intact as well. If the request doesn't have a path ("*"), this one is replaced with the format. This can be used to prepend a directory component in front of a path for example. See also "set-query" and "set-uri". Example : # prepend the host name before the path http-request set-path /%[hdr(host)]%[path] - "set-query" rewrites the request's query string which appears after the first question mark ("?") with the result of the evaluation of format string <fmt>. The part prior to the question mark is left intact. If the request doesn't contain a question mark and the new value is not empty, then one is added at the end of the URI, followed by the new value. If a question mark was present, it will never be removed even if the value is empty. This can be used to add or remove parameters from the query string. See also "set-query" and "set-uri". Example : # replace "%3D" with "=" in the query string http-request set-query %[query,regsub(%3D,=,g)] - "set-uri" rewrites the request URI with the result of the evaluation of format string <fmt>. The scheme, authority, path and query string are all replaced at once. This can be used to rewrite hosts in front of proxies, or to perform complex modifications to the URI such as moving parts between the path and the query string. See also "set-path" and "set-query". All of them are handled by the same parser and the same exec function, which is why they're merged all together. 
For once, instead of adding even more entries to the huge switch/case, we used the new facility to register action keywords. A number of the existing ones should probably move there as well.	2015-01-23 20:27:41 +01:00
Willy Tarreau	15a53a4384	MEDIUM: regex: add support for passing regex flags to regex_exec_match() This function (and its sister regex_exec_match2()) abstract the regex execution but make it impossible to pass flags to the regex engine. Currently we don't use them but we'll need to support REG_NOTBOL soon (to indicate that we're not at the beginning of a line). So let's add support for this flag and update the API accordingly.	2015-01-22 14:24:53 +01:00
Willy Tarreau	8560328211	BUG/MEDIUM: http: make http-request set-header compute the string before removal The way http-request/response set-header works is stupid. For a naive reuse of the del-header code, it removes all occurrences of the header to be set before computing the new format string. This makes it almost unusable because it is not possible to append values to an existing header without first copying them to a dummy header, performing the copy back and removing the dummy header. Instead, let's share the same code as add-header and perform the optional removal after the string is computed. That way it becomes possible to write things like : http-request set-header X-Forwarded-For %[hdr(X-Forwarded-For)],%[src] Note that this change is not expected to have any undesirable impact on existing configs since if they rely on the bogus behaviour, they don't work as they always retrieve an empty string. This fix must be backported to 1.5 to stop the spread of ugly configs.	2015-01-21 20:45:00 +01:00
Willy Tarreau	49ad95cc8e	MINOR: http: add a new fetch "query" to extract the request's query string This fetch extracts the request's query string, which starts after the first question mark. If no question mark is present, this fetch returns nothing. If a question mark is present but nothing follows, it returns an empty string. This means it's possible to easily know whether a query string is present using the "found" matching method. This fetch is the complement of "path" which stops before the question mark.	2015-01-20 19:47:47 +01:00
Willy Tarreau	319f745ba0	MINOR: channel: rename bi_erase() to channel_truncate() It applies to the channel and it doesn't erase outgoing data, only pending unread data, which is strictly equivalent to what recv() does with MSG_TRUNC, so that new name is more accurate and intuitive.	2015-01-14 20:32:59 +01:00
Willy Tarreau	ba0902ede4	CLEANUP: channel: rename channel_reserved -> channel_is_rewritable channel_reserved is confusingly named. It is used to know whether or not the rewrite area is left intact for situations where we want to ensure we can use it before proceeding. Let's rename it to fix this confusion.	2015-01-14 18:41:33 +01:00
Willy Tarreau	7c1c217426	BUG/MEDIUM: http: fix header removal when previous header ends with pure LF In 1.4-dev7, a header removal mechanism was introduced with commit `68085d8` ("[MINOR] http: add http_remove_header2() to remove a header value."). Due to a typo in the function, the beginning of the headers gets desynchronized if the header preceding the deleted one ends with an LF/CRLF combination different from the one of the removed header. The reason is that while rewinding the pointer, we go back by a number of bytes taking into account the LF/CRLF status of the removed header instead of the previous one. The case where it fails is in http-request del-header/set-header where the multiple occurrences of a header are present and their LF/CRLF ending differs from the preceding header. The loop then stops because no more headers are found given that the names and length do not match. Another point to take into consideration is that removing headers using a loop of http_find_header2() and this function is inefficient since we remove values one at a time while it could be simpler and faster to remove full header lines. This is something that should be addressed separately. This fix must be backported to 1.5 and 1.4. Note that http-send-name-header relies on this function as well so it could be possible that some of the issues encountered with it in 1.4 come from this bug.	2015-01-07 17:23:50 +01:00
Willy Tarreau	f2f7d6b27b	MEDIUM: buffer: add a new buf_wanted dummy buffer to report failed allocations Doing so ensures that even when no memory is available, we leave the channel in a sane condition. There's a special case in proto_http.c regarding the compression, we simply pre-allocate the tmpbuf to point to the dummy buffer. Not reusing &buf_empty for this allows the rest of the code to differentiate an empty buffer that's not used from an empty buffer that results from a failed allocation which has the same semantics as a buffer full.	2014-12-24 23:47:32 +01:00
Willy Tarreau	e583ea583a	MEDIUM: buffer: use b_alloc() to allocate and initialize a buffer b_alloc() now allocates a buffer and initializes it to the size specified in the pool minus the size of the struct buffer itself. This ensures that callers do not need to care about buffer details anymore. Also this never applies memory poisoning, which is slow and useless on buffers.	2014-12-24 23:47:32 +01:00
Godbach	d972203fbc	BUG/MINOR: parse: refer curproxy instead of proxy Since curproxy always represents the proxy being operated on during the parsing stage, referring to proxy there must be a mistake. Signed-off-by: Godbach <nylzhaowei@gmail.com>	2014-12-18 11:01:51 +01:00
Godbach	1f1fae6202	BUG/MINOR: http: fix typo: "401 Unauthorized" => "407 Unauthorized" 401 Unauthorized => 407 Unauthorized Signed-off-by: Godbach <nylzhaowei@gmail.com>	2014-12-17 17:05:49 +01:00
Willy Tarreau	5506e3f8b6	BUG/MINOR: stats: correctly set the request/response analysers When enabling stats, response analysers were set on the request analyser list, which 1) has no effect, and 2) means we don't have the response analysers properly set. In practice these response analysers are set when the connection to the server or applet is established so we don't need/must not set them here. Fortunately this bug had no impact since the flags are distinct, but it definitely is confusing. It should be backported to 1.5.	2014-11-21 17:53:08 +01:00
Cyril Bonté	a83a50bd7d	BUG/MINOR: log: fix request flags when keep-alive is enabled Colin Ingarfield reported some unexplainable flags in the logs. For example, a "LR" termination state was set on a request which was forwarded to a server, where "LR" means that the request should have been handled internally by haproxy. This case happens when at least client side keep-alive is enabled. Next requests in the connection will inherit the flags from the previous request. 2 fields are impacted : "termination_state" and "Tt" in the timing events, where a "+" can be added, when a previous request was redispatched. This is not critical for the service itself but can confuse troubleshooting. The fix must be backported to 1.5 and 1.4.	2014-10-22 22:37:30 +02:00
Willy Tarreau	7d59e90473	BUG/MEDIUM: http: don't dump debug headers on MSG_ERROR When the HTTP parser is in state HTTP_MSG_ERROR, we don't know if it was already initialized or not. If the error happens before HTTP_MSG_RQBEFORE, random offsets might be present and we don't want to display such random strings in debug mode. While it's theoretically possible to randomly crash the process when running in debug mode here, this bug was not tagged MAJOR because it would not make sense to run in debug mode in production. This fix must be backported to 1.5 and 1.4.	2014-10-22 19:25:09 +02:00

1099 Commits