haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 16:47:18 +02:00

Author	SHA1	Message	Date
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	18ed2569f5	MINOR: http: add new direction-explicit sample fetches for headers and cookies Since "hdr" and "cookie" were ambiguously referring to the request or response depending on the context, we need a way to explicitly specify the direction. By prefixing the fetches names with "req." and "res.", we can now restrict such fetches to the appropriate direction. At the moment the fetches are explicitly declared by later we might think about having an automatic match when "req." or "res." appears. These explicit fetches are now used by the relevant ACLs.	2013-04-03 02:12:59 +02:00
Willy Tarreau	9baae63d8d	MAJOR: acl: remove fetch argument validation from the ACL struct ACL fetch being inherited from the sample fetch keyword, we don't need anymore to specify what function to use to validate the fetch arguments. Note that the job is still done in the ACL parsing code based on elements from the sample fetch structs.	2013-04-03 02:12:59 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	409bcde176	MEDIUM: http: unify acl and sample fetch functions The following sample fetch functions were only usable by ACLs but are now usable by sample fetches too : cook, cook_cnt, cook_val, hdr_cnt, hdr_ip, hdr_val, http_auth, http_auth_group, http_first_req, method, req_proto_http, req_ver, resp_ver, scook, scook_cnt, scook_val, shdr, shdr_cnt, shdr_ip, shdr_val, status, urlp, urlp_val, Most of them won't bring much benefit at the moment, or are even aliases of existing ones, however they'll be needed for ACL->SMP convergence. A new val_usr() function was added to resolve userlist names into pointers. The http_auth_group ACL forgot to make its first argument mandatory, so there was a check in cfgparse to report a vague error. Now that args are correctly parsed, let's report something more precise. All urlp* ACLs now support an optional 3rd argument like their sample counter-part which is the optional delimiter. The fetch functions have been renamed "smp_fetch_*". Some args controls on the sample keywords have been relaxed so that we can soon use them for ACLs : - cookie now accepts to have an optional name ; it will return the first matching cookie if the name is not set ; - same for set-cookie and hdr	2013-04-03 02:12:57 +02:00
Willy Tarreau	434c57c95c	MINOR: log: indicate it when some unreliable sample fetches are logged If a log-format involves some sample fetches that may not be present at the logging instant, we can now report a warning. Note that this is done both for log-format and for add-header and carefully respects the original fetch keyword's capabilities.	2013-04-03 02:12:56 +02:00
Willy Tarreau	80aca90ad2	MEDIUM: samples: use new flags to describe compatibility between fetches and their usages Samples fetches were relying on two flags SMP_CAP_REQ/SMP_CAP_RES to describe whether they were compatible with requests rules or with response rules. This was never reliable because we need a finer granularity (eg: an HTTP request method needs to parse an HTTP request, and is available past this point). Some fetches are also dependant on the context (eg: "hdr" uses request or response depending where it's involved, causing some abiguity). In order to solve this, we need to precisely indicate in fetches what they use, and their users will have to compare with what they have. So now we have a bunch of bits indicating where the sample is fetched in the processing chain, with a few variants indicating for some of them if it is permanent or volatile (eg: an HTTP status is stored into the transaction so it is permanent, despite being caught in the response contents). The fetches also have a second mask indicating their validity domain. This one is computed from a conversion table at registration time, so there is no need for doing it by hand. This validity domain consists in a bitmask with one bit set for each usage point in the processing chain. Some provisions were made for upcoming controls such as connection-based TCP rules which apply on top of the connection layer but before instantiating the session. Then everywhere a fetch is used, the bit for the control point is checked in the fetch's validity domain, and it becomes possible to finely ensure that a fetch will work or not. Note that we need these two separate bitfields because some fetches are usable both in request and response (eg: "hdr", "payload"). So the keyword will have a "use" field made of a combination of several SMP_USE_* values, which will be converted into a wider list of SMP_VAL_* flags. The knowledge of permanent vs dynamic information has disappeared for now, as it was never used. Later we'll probably reintroduce it differently when dealing with variables. Its only use at the moment could have been to avoid caching a dynamic rate measurement, but nothing is cached as of now.	2013-04-03 02:12:56 +02:00
Willy Tarreau	e0db1e8946	MEDIUM: acl: remove flag ACL_MAY_LOOKUP which is improperly used This flag is used on ACL matches that support being looking up patterns in trees. At the moment, only strings and IPs support tree-based lookups, but the flag is randomly set also on integers and binary data, and is not even always set on strings nor IPs. Better get rid of this mess by only relying on the matching function to decide whether or not it supports tree-based lookups, this is safer and easier to maintain.	2013-04-03 02:12:56 +02:00
Willy Tarreau	aae75e3279	BUG/CRITICAL: using HTTP information in tcp-request content may crash the process During normal HTTP request processing, request buffers are realigned if there are less than global.maxrewrite bytes available after them, in order to leave enough room for rewriting headers after the request. This is done in http_wait_for_request(). However, if some HTTP inspection happens during a "tcp-request content" rule, this realignment is not performed. In theory this is not a problem because empty buffers are always aligned and TCP inspection happens at the beginning of a connection. But with HTTP keep-alive, it also happens at the beginning of each subsequent request. So if a second request was pipelined by the client before the first one had a chance to be forwarded, the second request will not be realigned. Then, http_wait_for_request() will not perform such a realignment either because the request was already parsed and marked as such. The consequence of this, is that the rewrite of a sufficient number of such pipelined, unaligned requests may leave less room past the request been processed than the configured reserve, which can lead to a buffer overflow if request processing appends some data past the end of the buffer. A number of conditions are required for the bug to be triggered : - HTTP keep-alive must be enabled ; - HTTP inspection in TCP rules must be used ; - some request appending rules are needed (reqadd, x-forwarded-for) - since empty buffers are always realigned, the client must pipeline enough requests so that the buffer always contains something till the point where there is no more room for rewriting. While such a configuration is quite unlikely to be met (which is confirmed by the bug's lifetime), a few people do use these features together for very specific usages. And more importantly, writing such a configuration and the request to attack it is trivial. A quick workaround consists in forcing keep-alive off by adding "option httpclose" or "option forceclose" in the frontend. Alternatively, disabling HTTP-based TCP inspection rules enough if the application supports it. At first glance, this bug does not look like it could lead to remote code execution, as the overflowing part is controlled by the configuration and not by the user. But some deeper analysis should be performed to confirm this. And anyway, corrupting the process' memory and crashing it is quite trivial. Special thanks go to Yves Lafon from the W3C who reported this bug and deployed significant efforts to collect the relevant data needed to understand it in less than one week. CVE-2013-1912 was assigned to this issue. Note that 1.4 is also affected so the fix must be backported.	2013-04-03 02:12:55 +02:00
Willy Tarreau	2d43e18b69	BUG/MAJOR: http: fix regression introduced by commit `d655ffe` Sander Klein reported that since last snapshot, some downloads would hang from nginx but succeed from apache. The culprit was not too hard to find given the low number of recent changes affecting the data path. Commit `d655ffe` slightly reorganized the HTTP state machine and introduced this regression. The reason is that we must never jump into the MSG_DONE case without first flushing remaining data because this is not done anymore afterwards. This part is scheduled for being reorganized since it's totally ugly especially since we added compression, and this regression is an illustration of its readability. The issue is entirely dependant on the server close sequence, which explains why it was reproducible only with nginx here.	2013-04-03 00:22:25 +02:00
Willy Tarreau	ffb6f08bab	BUG/MAJOR: http: fix regression introduced by commit `a890d072` This commit fixed a bug and introduced a new one at the same time. It's a stupid typo, the index to store the context is [0], not [2]. The effect is that parsing the header can loop forever if multiple headers are found. This issue was reported by Lukas Tribus.	2013-04-02 23:19:30 +02:00
Willy Tarreau	a890d072fc	BUG/MAJOR: http: use a static storage for sample fetch context Baptiste Assmann reported that the cook*() ACLs do not work anymore. The reason is the way we store the hdr_ctx between subsequent calls to smp_fetch_cookie() since commit `3740635b` (1.5-dev10). The smp->ctx.a[] storage holds up to 8 pointers. It is not meant for generic storage. We used to store hdr_ctx in the ctx, but while it used to just fit for smp_fetch_hdr(), it does not for smp_fetch_cookie() since we stored it at offset 2. The correct solution is to use this storage to store a pointer to the current hdr_ctx struct which is statically allocated.	2013-04-02 12:01:06 +02:00
Willy Tarreau	d655ffe863	OPTIM: http: optimize the response forward state machine By replacing the if/else series with a switch/case, we could save another 20% on the worst case (chunks of 1 byte).	2013-04-02 02:01:00 +02:00
Willy Tarreau	0161d62d23	OPTIM: http: improve branching in chunk size parser By tweaking a bit some conditions in http_parse_chunk_size(), we could improve the overall performance in the worst case by 15%.	2013-04-02 02:00:57 +02:00
Yves Lafon	e267421e93	MINOR: http: status 301 should not be marked non-cacheable Also, browsers behaviour is inconsistent regarding the Cache-Control header field on a 301.	2013-03-30 11:22:41 +01:00
Yves Lafon	3e8d1ae2d2	MEDIUM: http: implement redirect 307 and 308 I needed to emit a 307 and noticed it was not available so I did it, as well as 308.	2013-03-29 19:17:41 +01:00
Yves Lafon	4e8ec500e5	MINOR: http: status code 303 is HTTP/1.1 only Don't return a 303 redirect with "HTTP/1.0" as it's HTTP/1.1 only.	2013-03-29 19:08:09 +01:00
Willy Tarreau	2fef9b1ef6	BUG/MEDIUM: http: fix another issue caused by http-send-name-header An issue reported by David Coulson is that when using http-send-name-header, the response processing would randomly be performed. The issue was first diagnosed by Cyril Bont� as being related to a time race when processing the closing of the response. In practice, the issue is a bit trickier. It happens that http_send_name_header() did not update msg->sol after a rewrite. This counter is supposed to point to the beginning of the message's body once headers are scheduled for being forwarded. And not updating it means that the first forwarding of the request headers in http_request_forward_body() does not send the correct count, leaving some bytes in chn->to_forward. Then if the server sends its response in a single packet with the close, the stream interface switches to state SI_ST_DIS which in turn moves to SI_ST_CLO in process_session(), and to close the outgoing connection. This is detected by http_request_forward_body(), which then switches the request message to the error state, and syncs all FSMs and removes any response analyser. The response analyser being removed, no processing is performed on the response buffer, which is tunnelled as-is to the client. Of course, the correct fix consists in having http_send_name_header() update msg->sol. Normally this ought not to have been needed, but it is an abuse to modify data already scheduled for being forwarded, so it is expected that such specific handling has to be done there. Better not have generic functions deal with such cases, so that it does not become the standard. Note: 1.4 does not have this issue even if it does not update the pointer either, because it forwards from msg->som which is not updated at the moment the connect() succeeds. So no backport is required.	2013-03-26 01:21:47 +01:00
Willy Tarreau	3bfeadb3f6	BUG/MEDIUM: http: add-header should not emit "-" for empty fields Patch `6cbbdbf3` fixed the missing "-" delimitors in logs but it caused them to be emitted with "http-request add-header", eventhough it was correctly fixed for the unique-id format. Fix this by simply removing LOG_OPT_MANDATORY in this case.	2013-03-24 07:33:22 +01:00
Willy Tarreau	6cbbdbf3f3	BUG/MEDIUM: log: emit '-' for empty fields again Commit `2b0108ad` accidently got rid of the ability to emit a "-" for empty log fields. This can happen for captured request and response cookies, as well as for fetches. Since we don't want to have this done for headers however, we set the default log method when parsing the format. It is still possible to force the desired mode using +M/-M.	2013-02-05 18:55:09 +01:00
Willy Tarreau	192e59fb07	CLEANUP: http: don't try to deinitialize http compression if it fails before init In select_compression_response_header(), some tests are rather confusing as the "fail" label is used to deinitialize the compression context for the session while it's branched only before initialization succeeds. The test is always false here and the dereferencing of the comp_algo pointer which might be null is also confusing. Remove that code which is not needed anymore since commit `ec3e3890` got rid of the latest issues. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:19 +01:00
Willy Tarreau	4521ba689c	CLEANUP: http: remove a useless null check srv cannot be null in http_perform_server_redirect(), as it's taken from the stream interface's target which is always valid for a server-based redirect, and it was already dereferenced above, so in practice, gcc already removes the test anyway. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:18 +01:00
Baptiste Assmann	116eefed8f	MINOR: config: http-request configuration error message misses new keywords "redirect" and "tarpit" keywords were missing from http-request configuration error message.	2013-01-05 16:53:49 +01:00
Willy Tarreau	56e9ffa6a6	BUG/MINOR: http-compression: lookup Cache-Control in the response, not the request As stated in both RFC2616 and the http-bis drafts, Cache-Control: no-transform must be looked up in the response since we're modifying the response. However, its presence in the request is irrelevant to any changes in the response : 7.2.1.6. no-transform The "no-transform" request directive indicates that an intermediary (whether or not it implements a cache) MUST NOT change the Content- Encoding, Content-Range or Content-Type request header fields, nor the request representation. 7.2.2.9. no-transform The "no-transform" response directive indicates that an intermediary (regardless of whether it implements a cache) MUST NOT change the Content-Encoding, Content-Range or Content-Type response header fields, nor the response representation. Note: according to the specs, we're supposed to emit the following response header : Warning: 214 transformation applied However no other product seems to do it, so the effect on user agents is unclear.	2013-01-05 16:31:58 +01:00
Willy Tarreau	ccbcc37a01	MEDIUM: http: add support for "http-request tarpit" rule The "reqtarpit" rule is not very handy to use. Now that we have more flexibility with "http-request", let's finally make the tarpit rules usable there. There are still semantical differences between apply_filters_to_request() and http_req_get_intercept_rule() because the former updates the counters while the latter does not. So we currently have almost similar code leafs for similar conditions, but this should be cleaned up later.	2012-12-28 14:47:19 +01:00
Willy Tarreau	81499eb67d	MEDIUM: http: add support for "http-request redirect" rules These are exactly the same as the classic redirect rules except that they can be interleaved with other http-request rules for more flexibility. The redirect parser should probably be changed to stop at the condition so that the caller puts its own condition pointer. At the moment, the redirect rule and condition are parsed at once by build_redirect_rule() and the condition is assigned to the http_req_rule.	2012-12-28 14:47:19 +01:00
Willy Tarreau	4baae248fc	REORG: config: move the http redirect rule parser to proto_http.c We'll have to use this elsewhere soon, let's move it to the proper place.	2012-12-28 14:47:19 +01:00
Willy Tarreau	71241abfd3	MINOR: http: move redirect rule processing to its own function We now have http_apply_redirect_rule() which does all the redirect-specific job instead of having this inside http_process_req_common(). Also one of the benefit gained from uniformizing this code is that both keep-alive and close response do emit the PR-- flags. The fix for the flags could probably be backported to 1.4 though it's very minor. The previous function http_perform_redirect() was becoming confusing so it was renamed http_perform_server_redirect() since it only applies to server-based redirection.	2012-12-28 14:47:19 +01:00
Willy Tarreau	96257ec5c8	CLEANUP: http: rename the misleading http_check_access_rule Several bugs were introduced recently due to a misunderstanding of how this function works and what it was supposed to do. Since it's supposed to only return the pointer to a rule which aborts further processing of the request, let's rename it to avoid further issues. The function was also slightly cleaned up without any functional change.	2012-12-28 14:47:19 +01:00
Willy Tarreau	d79a3b248e	BUG/MINOR: log: make log-format, unique-id-format and add-header more independant It happens that all of them call parse_logformat_line() which sets proxy->to_log with a number of flags affecting the line format for all three users. For example, having a unique-id specified disables the default log-format since fe->to_log is tested when the session is established. Similarly, having "option logasap" will cause "+" to be inserted in unique-id or headers referencing some of the fields depending on LW_BYTES. This patch first removes most of the dependency on fe->to_log whenever possible. The first possible cleanup is to stop checking fe->to_log for being null, considering that it always contains at least LW_INIT when any such usage is made of the log-format! Also, some checks are wrong. s->logs.logwait cannot be nulled by "logwait &= ~LW_" since LW_INIT is always there. This results in getting the wrong log at the end of a request or session when a unique-id or add-header is set, because logwait is still not null but the log-format is not checked. Further cleanups are required. Most LW_ flags should be removed or at least replaced with what they really mean (eg: depend on client-side connection, depend on server-side connection, etc...) and this should only affect logging, not other mechanisms. This patch fixes the default log-format and tries to limit interferences between the log formats, but does not pretend to do more for the moment, since it's the most visible breakage.	2012-12-28 09:51:00 +01:00
Willy Tarreau	cbc743e36c	BUG/MEDIUM: stats: disable request analyser when processing POST or HEAD After the response headers are sent and the request processing is done, the buffers are wiped out and the stream interface is closed. We must then disable the request analysers, otherwise some processing will happen on a closed stream interface and empty buffers which do not match, causing all sort of crashes. This issue was introduced with recent work on the stats, and was reported by Seri.	2012-12-28 08:36:50 +01:00
Willy Tarreau	1a1e8072f9	BUG/MINOR: stats: http-request rules still don't cope with stats Since commit `20b0de5`, we also had another remaining issue : an "http-request allow" rule would prevent a stats rule from being processed.	2012-12-27 10:34:21 +01:00
Willy Tarreau	8b80f0c9a2	BUG/MINOR: stats: last fix was still wrong Previous commit was still wrong, it broke add-header and set-header because we don't want to leave on these actions. The http_check_access_rule() function should be redesigned, it was initially thought for allow/deny rules but now it is executing other non-final rules and at the same time returning a pointer to the last final rule. That becomes a bit confusing and will need to be addressed before we implement redirect and return.	2012-12-25 21:55:37 +01:00
Willy Tarreau	418c1a0a95	BUG/MEDIUM: stats: fix stats page regression introduced by commit `20b0de5` This commit adding http-request add-header/set-header unfortunately introduced a regression to the handling of the stats page which is not matched anymore. Thanks to Dmitry Sivachenko for reporting this.	2012-12-25 20:52:58 +01:00
Willy Tarreau	20b0de56d4	MEDIUM: http: add http-request 'add-header' and 'set-header' to build headers These two new statements allow to pass information extracted from the request to the server. It's particularly useful for passing SSL information to the server, but may be used for various other purposes such as combining headers together to emulate internal variables.	2012-12-24 15:56:20 +01:00
Willy Tarreau	5c2e198390	MINOR: http: prepare to support more http-request actions We'll need to support per-action arguments, so we need to have an "arg" union in http_req_rule.	2012-12-24 12:26:26 +01:00
Willy Tarreau	354898bba9	MINOR: stats: replace STAT_FMT_CSV with STAT_FMT_HTML We need to switch the default mode if we want to add new output formats later. Let CSV be the default and HTML be an option.	2012-12-23 21:46:30 +01:00
Willy Tarreau	47ca54505c	MINOR: chunks: centralize the trash chunk allocation At the moment, we need trash chunks almost everywhere and the only correctly implemented one is in the sample code. Let's move this to the chunks so that all other places can use this allocator. Additionally, the get_trash_chunk() function now really returns two different chunks. Previously it used to always overwrite the same chunk and point it to a different buffer, which was a bit tricky because it's not obvious that two consecutive results do alias each other.	2012-12-23 21:46:07 +01:00
Willy Tarreau	1facd6d67e	REORG: stats: move the HTTP header injection to proto_http The HTTP header injection that are performed in dumpstats when responding or when redirecting a POST request have nothing to do in dumpstats. They do not use any state from the stats, and are 100% HTTP. Let's make the headers there in the HTTP core, and have dumpstats only produce stats.	2012-12-22 22:50:01 +01:00
Willy Tarreau	d9bdcd5139	REORG: stats: massive code reorg and cleanup The dumpstats code looks like a spaghetti plate. Several functions are supposed to be able to do several things but rely on complex states to dispatch the work to independant functions. Most of the HTML output is performed within the switch/case statements of the whole state machine. Let's clean this up by adding new functions to emit the data and have a few more iterators to avoid relying on so complex states. The new stats dump sequence looks like this for CLI and for HTTP : cli_io_handler() -> stats_dump_sess_to_buffer() // "show sess" -> stats_dump_errors_to_buffer() // "show errors" -> stats_dump_raw_info_to_buffer() // "show info" -> stats_dump_raw_info() -> stats_dump_raw_stat_to_buffer() // "show stat" -> stats_dump_csv_header() -> stats_dump_proxy() -> stats_dump_px_hdr() -> stats_dump_fe_stats() -> stats_dump_li_stats() -> stats_dump_sv_stats() -> stats_dump_be_stats() -> stats_dump_px_end() http_stats_io_handler() -> stats_http_redir() -> stats_dump_http() // also emits the HTTP headers -> stats_dump_html_head() // emits the HTML headers -> stats_dump_csv_header() // emits the CSV headers (same as above) -> stats_dump_http_info() // note: ignores non-HTML output -> stats_dump_proxy() // same as above -> stats_dump_http_end() // emits HTML trailer	2012-12-22 20:45:02 +01:00
Willy Tarreau	40f151aa79	BUG/MINOR: http: don't abort client connection on premature responses When a server responds prematurely to a POST request, haproxy used to cause the transfer to be aborted before the end. This is problematic because this causes the client to receive a TCP reset when it tries to push more data, generally preventing it from receiving the response which contain the reason for the premature reponse (eg: "entity too large" or an authentication request). From now on we take care of allowing the upload traffic to flow to the server even when the response has been received, since the server is supposed to drain it. That way the client receives the server response. This bug has been present since 1.4 and the fix should probably be backported there.	2012-12-20 12:10:09 +01:00
Willy Tarreau	f26b252ee4	MINOR: http: make resp_ver and status ACLs check for the presence of a response The two ACL fetches "resp_ver" and "status", if used in a request despite the warning, would return a match of zero length. This is inappropriate, better return a non-match to be more consistent with other ACL processing.	2012-12-14 08:35:45 +01:00
Willy Tarreau	4a55060aa6	MINOR: http: add the "base32+src" fetch method. This returns the concatenation of the base32 fetch and the src fetch. The resulting type is of type binary, with a size of 8 or 20 bytes depending on the source address family. This can be used to track per-IP, per-URL counters.	2012-12-09 14:53:32 +01:00
Willy Tarreau	ab1f7b72fb	MINOR: http: add the "base32" pattern fetch function This returns a 32-bit hash of the value returned by the "base" fetch method above. This is useful to track per-URL activity on high traffic sites without having to store all URLs. Instead a shorter hash is stored, saving a lot of memory. The output type is an unsigned integer.	2012-12-09 14:08:48 +01:00
Willy Tarreau	5d5b5d8eaf	MEDIUM: proto_tcp: add support for tracking L7 information Until now it was only possible to use track-sc1/sc2 with "src" which is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch as well as any transformation type. It works just like the "stick" directive. Samples are automatically converted to the correct types for the table. Only "tcp-request content" rules may use L7 information, and such information must already be present when the tracking is set up. For example it becomes possible to track the IP address passed in the X-Forwarded-For header. HTTP request processing now also considers tracking from backend rules because we want to be able to update the counters even when the request was already parsed and tracked. Some more controls need to be performed (eg: samples do not distinguish between L4 and L6).	2012-12-09 14:08:47 +01:00
Willy Tarreau	dc979f2492	BUG/MINOR: http: don't log a 503 on client errors while waiting for requests If a client aborts a request with an error (typically a TCP reset), we must log a 400. Till now we did not set the status nor close the stream interface, causing the request to attempt to be forwarded and logging a 503. Should be backported to 1.4 which is affected as well.	2012-12-04 10:52:22 +01:00
Willy Tarreau	14cba4b0b1	MEDIUM: connection: add an error code in connections This will be needed to improve error reporting, especially for SSL.	2012-12-03 14:22:13 +01:00
Willy Tarreau	8139b9959f	MINOR: compression: make the stats a bit more robust To ensure that we only count when a response was compressed, we also check for the SN_COMP_READY flag which indicates that the compression was effectively initialized. Comp_algo alone is meaningless.	2012-11-27 09:34:00 +01:00
Willy Tarreau	9101535038	BUG/MINOR: http: disable compression when message has no body Compression was not disabled on 1xx, 204, 304 nor HEAD requests. This is not really a problem, but it reports more compressed responses than really done.	2012-11-27 09:34:00 +01:00
Willy Tarreau	0a80a8dbb2	MINOR: http: factor out the content-type checks Let's only look up the content-type header once. This involves inverting the condition which is not dramatic. Also, we now always check the value length before comparing it, and we always reset the ctx.idx before looking a header up. Otherwise that could make header lookups depend on their on-wire order. It would be a minor issue however since at worst it would cause some responses not to be compressed.	2012-11-26 16:36:00 +01:00
William Lallemand	d300261bab	MINOR: compression: disable on multipart or status != 200 The compression is disabled when the HTTP status code is not 200, indeed compression on some HTTP code can create issues (ex: 206, 416). Multipart message should not be compressed eitherway.	2012-11-26 16:02:58 +01:00
William Lallemand	859550e068	BUG/MINOR: compression: Content-Type is case insensitive The Content-Type parameter must be case insensitive.	2012-11-26 16:02:58 +01:00
Willy Tarreau	f003d375ec	BUG/MINOR: http: don't report client aborts as server errors If a client aborts with an abortonclose flag, the close is forwarded to the server and when server response is processed, the analyser thinks it's the server who has closed first, and logs flags "SD" or "SH" and counts a server error. In order to avoid this, we now first detect that the client has closed and log a client abort instead. This likely is the reason why many people have been observing a small rate of SD/SH flags without being able to find what the error was. This fix should probably be backported to 1.4.	2012-11-26 13:50:02 +01:00
Willy Tarreau	5e16cbc3bd	MINOR: stats: report the total number of compressed responses per front/back Depending on the content-types and accept-encoding fields, some responses might or might not be compressed. Let's have a counter of the number of compressed responses and report it in the stats to help improve compression usage. Some cosmetic issues were fixed in the CSV output too (missing commas at the end).	2012-11-24 14:54:13 +01:00
William Lallemand	00bf1dee9c	BUG/MEDIUM: compression: does not forward trailers The commit `bf3ae617` introduced a regression about the forward of the trailers in compression mode.	2012-11-23 11:12:33 +01:00
Willy Tarreau	193b8c6168	MINOR: http: allow the cookie capture size to be changed Some users need more than 64 characters to log large cookies. The limit was set to 63 characters (and not 64 as previously documented). Now it is possible to change this using the global "tune.http.cookielen" setting if required.	2012-11-22 00:44:27 +01:00
William Lallemand	072a2bf537	MINOR: compression: CPU usage limit New option 'maxcompcpuusage' in global section. Sets the maximum CPU usage HAProxy can reach before stopping the compression for new requests or decreasing the compression level of current requests. It works like 'maxcomprate' but with the Idle.	2012-11-21 02:15:16 +01:00
William Lallemand	8b52bb3878	MEDIUM: compression: use pool for comp_ctx Use pool for comp_ctx, it is allocated during the comp_algo->init(). The allocation of comp_ctx is accounted for in the zlib_memory_available.	2012-11-21 01:56:47 +01:00
William Lallemand	bf3ae61789	MEDIUM: compression: don't compress when no data This patch makes changes in the http_response_forward_body state machine. It checks if the compress algorithm had consumed data before swapping the temporary and the input buffer. So it prevents null sized zlib chunks.	2012-11-19 14:57:29 +01:00
Willy Tarreau	b97b6190e1	BUG: compression: properly disable compression when content-type does not match Disabling compression based on the content-type was improperly done since the introduction of the COMP_READY flag, sometimes resulting in truncated responses.	2012-11-19 14:55:02 +01:00
Willy Tarreau	543db62e1f	BUG/MEDIUM: compression: release the zlib pools between keep-alive requests There was a possible memory leak in the zlib code when the first response of a keep-alive session was compressed, because the next request would reset the compression algo, preventing a later call to session_free() from releasing it. The reason is that it is necessary to release the assigned resources in http_end_txn_clean_session().	2012-11-15 16:41:22 +01:00
William Lallemand	ec3e3890f0	BUG/MINOR: compression: deinit zlib only when required The zlib stream was deinitialized even when the init failed.	2012-11-15 15:42:17 +01:00
William Lallemand	c04ca58222	BUG/MEDIUM: compression: no Content-Type header but type in configuration HAProxy was compressing data when there was no Content-Type header in the response but a compression type specified in the configuration.	2012-11-15 15:42:11 +01:00
Willy Tarreau	3fdb366885	MAJOR: connection: replace struct target with a pointer to an enum Instead of storing a couple of (int, ptr) in the struct connection and the struct session, we use a different method : we only store a pointer to an integer which is stored inside the target object and which contains a unique type identifier. That way, the pointer allows us to retrieve the object type (by dereferencing it) and the object's address (by computing the displacement in the target structure). The NULL pointer always corresponds to OBJ_TYPE_NONE. This reduces the size of the connection and session structs. It also simplifies target assignment and compare. In order to improve the generated code, we try to put the obj_type element at the beginning of all the structs (listener, server, proxy, si_applet), so that the original and target pointers are always equal. A lot of code was touched by massive replaces, but the changes are not that important.	2012-11-12 00:42:33 +01:00
Willy Tarreau	50fc7777c6	MEDIUM: http: refrain from sending "Connection: close" when Upgrade is present Some servers are not totally HTTP-compliant when it comes to parsing the Connection header. This is particularly true with WebSocket where it happens from time to time that a server doesn't support having a "close" token along with the "Upgrade" token in the Connection header. This broken behaviour has also been noticed on some clients though the problem is less frequent on the response path. Sometimes the workaround consists in enabling "option http-pretend-keepalive" to leave the request Connection header untouched, but this is not always the most convenient solution. This patch introduces a new solution : haproxy now also looks for the "Upgrade" token in the Connection header and if it finds it, then it refrains from adding any other token to the Connection header (though "keep-alive" and "close" may still be removed if found). The same is done for the response headers. This way, WebSocket much with less changes even when facing non-compliant clients or servers. At least it fixes the DISCONNECT issue that was seen on the websocket.org test. Note that haproxy does not change its internal mode, it just refrains from adding new tokens to the connection header.	2012-11-11 22:40:00 +01:00
Willy Tarreau	7f7ad91056	BUILD: stream_interface: remove si_fd() and its references si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which defines this name for its own purpose. It's easy enough to remove this one-liner function, so let's do it.	2012-11-11 20:53:29 +01:00
William Lallemand	d85f917daf	MINOR: compression: maximum compression rate limit This patch adds input and output rate calcutation on the HTTP compresion feature. Compression can be limited with a maximum rate value in kilobytes per second. The rate is set with the global 'maxcomprate' option. You can change this value dynamicaly with 'set rate-limit http-compression global' on the UNIX socket.	2012-11-10 17:47:27 +01:00
William Lallemand	f3747837e5	MINOR: compression: tune.comp.maxlevel This option allows you to set the maximum compression level usable by the compression algorithm. It affects CPU usage.	2012-11-10 17:47:07 +01:00
Finn Arne Gangstad	0a410e81fb	BUG: http: revert broken optimisation from `82fe75c1a7` This optimisation causes haproxy to time out requests that result in two TCP packets, one packet containing the header, and one packet containing the actual data. This is a very typical type of response from a lot of servers. [Willy: I suspect the fix might have an impact on the compression code which I'm not sure completely handles calls with 0 bytes to forward]	2012-11-10 17:38:36 +01:00
William Lallemand	4c49fae985	MINOR: compression: init before deleting headers Init the compression algorithm before modifying the response headers. So if the compression init fail, the headers won't be modified.	2012-11-08 15:23:30 +01:00
William Lallemand	1c2d622d82	CLEANUP: use struct comp_ctx instead of union Replace union comp_ctx by struct comp_ctx. Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of compression.h functions.	2012-11-05 10:23:16 +01:00
Finn Arne Gangstad	cbb9a4b128	MINOR: compression: Enable compression for IE6 w/SP2, IE7 and IE8 Some old browsers that have a user-agent starting with "Mozilla/4" do not support compressison correctly, so disable compression for those. Internet explorer 6 after Windows XP service pack 2, IE 7, and IE 8, do however support compression and still have a user agent starting with Mozilla/4, so we try to enable compression for those. MSIE has a user-agent on this form: Mozilla/4.0 (compatible; MSIE <version>; ...) 98% of MSIE 6 SP2 user agents start with Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1 The remaining 2% have additional flags before "SV1". This simplified matching looking for MSIE at exactly position 25 and SV1 at exacly position 51 gives a few false negatives, so sometimes a compression opportunity is lost. A test against 3 hours of traffic to around 3000 news sites worldwide gives less than 0.007% (70ppm) missed compression opportunities.	2012-10-29 22:03:14 +01:00
Willy Tarreau	7e2c647ee7	MEDIUM: remove remains of BUFSIZE in HTTP auth and sample conversions Sample conversions rely on two alternative buffers which were previously allocated as static bufs of size BUFSIZE. Now they're initialized to the global buffer size. It was the same for HTTP authentication. Note that it seems that none of them was prone to any mistake when dealing with the buffer size, but better stay on the safe side by maintaining the old assumption that a trash buffer is always "large enough".	2012-10-29 20:44:36 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	08b4d79d31	BUG: compression: disable auto-close and enable MSG_MORE during transfer We don't want the lower layer to forward a close while we're compressing, and we want the system to fuse outgoing TCP segments using MSG_MORE as much as possible to save round trips that can emerge from sending short packets with a PUSH flag. A test on a remote busy DSL line consisting in compressing a 100MB file on the fly full of zeroes only showed a transfer rate of a few kB/s due to these round trips.	2012-10-27 01:36:34 +02:00
Willy Tarreau	70737d142f	MINOR: compression: add an offload option to remove the Accept-Encoding header This is used when it is desired that backend servers don't compress (eg: because of buggy implementations).	2012-10-27 01:13:24 +02:00
Willy Tarreau	f2943dccd0	MAJOR: session: detach the connections from the stream interfaces We will need to be able to switch server connections on a session and to keep idle connections. In order to achieve this, the preliminary requirement is that the connections can survive the session and be detached from them. Right now they're still allocated at exactly the same place, so when there is a session, there are always 2 connections. We could soon improve on this by allocating the outgoing connection only during a connect(). This current patch touches a lot of code and intentionally does not change any functionnality. Performance tests show no regression (even a very minor improvement). The doc has not yet been updated.	2012-10-26 20:15:20 +02:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
Willy Tarreau	3c7b97b9f9	BUG/MINOR: http: compression should consider all Accept-Encoding header values Right now commit `82fe75c1` came with a minor bug limiting the check to the first accept-encoding header value only.	2012-10-26 14:52:02 +02:00
Willy Tarreau	05d846092f	MINOR: compression: automatically disable compression for older browsers A number of older browsers have many issues with compressed contents. It happens that all these older browsers announce themselves as "Mozilla/4" and that despite not being all broken, the amount of working browsers announcing themselves this way compared to all other ones is so tiny that it's not worth wasting cycles trying to adapt to every specific one. So let's simply disable compression for these older browsers. More information on this very detailed article : http://zoompf.com/2012/02/lose-the-wait-http-compression	2012-10-26 02:54:31 +02:00
William Lallemand	82fe75c1a7	MEDIUM: HTTP compression (zlib library support) This commit introduces HTTP compression using the zlib library. http_response_forward_body has been modified to call the compression functions. This feature includes 3 algorithms: identity, gzip and deflate: * identity: this is mostly for debugging, and it was useful for developping the compression feature. With Content-Length in input, it is making each chunk with the data available in the current buffer. With chunks in input, it is rechunking, the output chunks will be bigger or smaller depending of the size of the input chunk and the size of the buffer. Identity does not apply any change on data. * gzip: same as identity, but applying a gzip compression. The data are deflated using the Z_NO_FLUSH flag in zlib. When there is no more data in the input buffer, it flushes the data in the output buffer (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in input, or when there is no more data to read, it writes the end of data with Z_FINISH and the ending chunk. * deflate: same as gzip, but with deflate algorithm and zlib format. Note that this algorithm has ambiguous support on many browsers and no support at all from recent ones. It is strongly recommended not to use it for anything else than experimentation. You can't choose the compression ratio at the moment, it will be set to Z_BEST_SPEED (1), as tests have shown very little benefit in terms of compression ration when going above for HTML contents, at the cost of a massive CPU impact. Compression will be activated depending of the Accept-Encoding request header. With identity, it does not take care of that header. To build HAProxy with zlib support, use USE_ZLIB=1 in the make parameters. This work was initially started by David Du Colombier at Exceliance.	2012-10-26 02:30:48 +02:00
Willy Tarreau	54d23dfc07	CLEANUP: http: rename HTTP_MSG_DATA_CRLF state This state's name is confusing as it is only used with chunked encoding and makes newcomers think it's also related to the content-length. Let's call it CHUNK_CRLF to clear any doubt on this.	2012-10-26 01:13:52 +02:00
Willy Tarreau	24e6d972aa	OPTIM: http: inline http_parse_chunk_size() and http_skip_chunk_crlf() These functions are not that long and the compiler inlines them well. Doing so has sped up the chunked encoding parser by 41% ! Note that http_forward_trailers was also declared static because it's not exported.	2012-10-26 01:12:40 +02:00
Cyril Bonté	69fa99292e	MEDIUM: http: accept IPv6 values with (s)hdr_ip acl Commit `ceb4ac9c` states that IPv6 values are accepted by "hdr_ip" acl, but the code didn't allow it. This patch provides the ability to accept IPv6 values.	2012-10-25 14:41:33 +02:00
Willy Tarreau	fc47f91c9c	BUG/MEDIUM: http: set DONTWAIT on data when switching to tunnel mode Jaroslaw Bojar diagnosed an issue when haproxy switches to tunnel mode after a transfer. The response data are sent with the MSG_MORE flag, causing them to be needlessly queued in the kernel. In order to fix this, we set the CF_NEVER_WAIT flag on the channels when switching to tunnel mode. One issue remained with client-side keep-alive : if the response is sent before the end of the request, it suffers the same issue for the same reason. This is easily addressed by setting the CF_SEND_DONTWAIT flag on the channel when the response has been parsed and we're waiting for the other side. The same issue is present in 1.4 so the fix must be backported.	2012-10-20 10:41:37 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	cdbdd52a38	CLEANUP: http: use 'chn' to name channel variables, not 'buf' These "buf" were confusing as they were really refering to channels. At most places, a buffer was really all what was needed, so a struct buffer was used instead. It is possible that the performance has slightly increased by the removal of pointer offset in many pointer operations by directly using the buffer pointer instead of the channel pointer.	2012-10-12 23:02:51 +02:00
Willy Tarreau	394db379eb	REORG: http: rename msg->buf to msg->chn since it's a channel It's extremely confusing to have all those msg->buf->buf everywhere after the extraction of the buffer from the channel. Let's clean this up.	2012-10-12 22:40:39 +02:00
Willy Tarreau	1b6c00cb99	BUG/MAJOR: ensure that hdr_idx is always reserved when L7 fetches are used Baptiste Assmann reported a bug causing a crash on recent versions when sticking rules were set on layer 7 in a TCP proxy. The bug is easier to reproduce with the "defer-accept" option on the "bind" line in order to have some contents to parse when the connection is accepted. The issue is that the acl_prefetch_http() function called from HTTP fetches relies on hdr_idx to be preinitialized, which is not the case if there is no L7 ACL. The solution consists in adding a new SMP_CAP_L7 flag to fetches to indicate that they are expected to work on L7 data, so that the proxy knows that the hdr_idx has to be initialized. This is already how ACL and HTTP mode are handled. The bug was present since 1.5-dev9.	2012-10-05 22:46:09 +02:00
bedis	4c75cca8ba	MINOR: samples: update the url_param fetch to match parameters in the path It now supports an optional delimiter to allow to look for the parameter before the query string.	2012-10-05 15:17:23 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	b8ffd378f0	BUG/MAJOR: http: chunk parser was broken with buffer changes Since at least commit `a458b679`, msg->sov could become negative in http_parse_chunk_size() if a chunk size wrapped around the buffer. The effect is that at some point channel_forward() was called with a negative size, causing all data to be transferred without being analyzed anymore. Since haproxy does not support keep-alive with the server yet, this issue is not really noticeable, as the server closes the connection in response. Still, when tunnel mode is used or when pretent-keepalive is used, it is possible to see the problem. This issue was reported and diagnosed by William Lallemand at Exceliance.	2012-09-27 15:08:56 +02:00
Willy Tarreau	e92693af26	BUG: http: do not print garbage on invalid requests in debug mode Cyril Bont� reported a mangled debug output when an invalid request was sent with a faulty request line. The reason was the use of the msg->sl.rq.l offset which was not yet initialized in this case. So we change the way to report such an error so that first we initialize it to zero before parsing a message, then we use that to know whether we can trust it or not. If it's still zero, then we display the whole buffer, truncated by debug_hdr() to the first CR or LF character, which results in the first line only. The same operation was performed for the response, which was wrong too.	2012-09-24 21:16:42 +02:00
Willy Tarreau	eb6cead1de	MINOR: standard: make memprintf() support a NULL destination Doing so removes many checks that were systematically made because the callees don't know if the caller passed a valid pointer.	2012-09-24 10:53:16 +02:00
Willy Tarreau	2e1dca8f52	MEDIUM: http: add "redirect scheme" to ease HTTP to HTTPS redirection For instance : redirect scheme https if !{ is_ssl }	2012-09-12 08:43:15 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	986a9d2d12	MAJOR: connection: move the addr field from the stream_interface We need to have the source and destination addresses in the connection. They were lying in the stream interface so let's move them. The flags SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well. It's worth noting that tcp_connect_server() almost does not use the stream interface anymore except for a few flags. It has been identified that once we detach the connection from the SI, it will probably be needed to keep a copy of the server-side addresses in the SI just for logging purposes. This has not been implemented right now though.	2012-09-03 20:47:34 +02:00
Willy Tarreau	3cefd521fa	REORG: connection: move the target pointer from si to connection The target is per connection and is directly used by the connection, so we need it there. It's not needed anymore in the SI however.	2012-09-03 20:47:34 +02:00

1 2 3 4 5 ...

786 Commits