haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 00:27:08 +02:00

Author	SHA1	Message	Date
Willy Tarreau	409bcde176	MEDIUM: http: unify acl and sample fetch functions The following sample fetch functions were only usable by ACLs but are now usable by sample fetches too : cook, cook_cnt, cook_val, hdr_cnt, hdr_ip, hdr_val, http_auth, http_auth_group, http_first_req, method, req_proto_http, req_ver, resp_ver, scook, scook_cnt, scook_val, shdr, shdr_cnt, shdr_ip, shdr_val, status, urlp, urlp_val, Most of them won't bring much benefit at the moment, or are even aliases of existing ones, however they'll be needed for ACL->SMP convergence. A new val_usr() function was added to resolve userlist names into pointers. The http_auth_group ACL forgot to make its first argument mandatory, so there was a check in cfgparse to report a vague error. Now that args are correctly parsed, let's report something more precise. All urlp* ACLs now support an optional 3rd argument like their sample counter-part which is the optional delimiter. The fetch functions have been renamed "smp_fetch_*". Some args controls on the sample keywords have been relaxed so that we can soon use them for ACLs : - cookie now accepts to have an optional name ; it will return the first matching cookie if the name is not set ; - same for set-cookie and hdr	2013-04-03 02:12:57 +02:00
Willy Tarreau	434c57c95c	MINOR: log: indicate it when some unreliable sample fetches are logged If a log-format involves some sample fetches that may not be present at the logging instant, we can now report a warning. Note that this is done both for log-format and for add-header and carefully respects the original fetch keyword's capabilities.	2013-04-03 02:12:56 +02:00
Willy Tarreau	80aca90ad2	MEDIUM: samples: use new flags to describe compatibility between fetches and their usages Samples fetches were relying on two flags SMP_CAP_REQ/SMP_CAP_RES to describe whether they were compatible with requests rules or with response rules. This was never reliable because we need a finer granularity (eg: an HTTP request method needs to parse an HTTP request, and is available past this point). Some fetches are also dependant on the context (eg: "hdr" uses request or response depending where it's involved, causing some abiguity). In order to solve this, we need to precisely indicate in fetches what they use, and their users will have to compare with what they have. So now we have a bunch of bits indicating where the sample is fetched in the processing chain, with a few variants indicating for some of them if it is permanent or volatile (eg: an HTTP status is stored into the transaction so it is permanent, despite being caught in the response contents). The fetches also have a second mask indicating their validity domain. This one is computed from a conversion table at registration time, so there is no need for doing it by hand. This validity domain consists in a bitmask with one bit set for each usage point in the processing chain. Some provisions were made for upcoming controls such as connection-based TCP rules which apply on top of the connection layer but before instantiating the session. Then everywhere a fetch is used, the bit for the control point is checked in the fetch's validity domain, and it becomes possible to finely ensure that a fetch will work or not. Note that we need these two separate bitfields because some fetches are usable both in request and response (eg: "hdr", "payload"). So the keyword will have a "use" field made of a combination of several SMP_USE_* values, which will be converted into a wider list of SMP_VAL_* flags. The knowledge of permanent vs dynamic information has disappeared for now, as it was never used. Later we'll probably reintroduce it differently when dealing with variables. Its only use at the moment could have been to avoid caching a dynamic rate measurement, but nothing is cached as of now.	2013-04-03 02:12:56 +02:00
Willy Tarreau	e0db1e8946	MEDIUM: acl: remove flag ACL_MAY_LOOKUP which is improperly used This flag is used on ACL matches that support being looking up patterns in trees. At the moment, only strings and IPs support tree-based lookups, but the flag is randomly set also on integers and binary data, and is not even always set on strings nor IPs. Better get rid of this mess by only relying on the matching function to decide whether or not it supports tree-based lookups, this is safer and easier to maintain.	2013-04-03 02:12:56 +02:00
Willy Tarreau	aae75e3279	BUG/CRITICAL: using HTTP information in tcp-request content may crash the process During normal HTTP request processing, request buffers are realigned if there are less than global.maxrewrite bytes available after them, in order to leave enough room for rewriting headers after the request. This is done in http_wait_for_request(). However, if some HTTP inspection happens during a "tcp-request content" rule, this realignment is not performed. In theory this is not a problem because empty buffers are always aligned and TCP inspection happens at the beginning of a connection. But with HTTP keep-alive, it also happens at the beginning of each subsequent request. So if a second request was pipelined by the client before the first one had a chance to be forwarded, the second request will not be realigned. Then, http_wait_for_request() will not perform such a realignment either because the request was already parsed and marked as such. The consequence of this, is that the rewrite of a sufficient number of such pipelined, unaligned requests may leave less room past the request been processed than the configured reserve, which can lead to a buffer overflow if request processing appends some data past the end of the buffer. A number of conditions are required for the bug to be triggered : - HTTP keep-alive must be enabled ; - HTTP inspection in TCP rules must be used ; - some request appending rules are needed (reqadd, x-forwarded-for) - since empty buffers are always realigned, the client must pipeline enough requests so that the buffer always contains something till the point where there is no more room for rewriting. While such a configuration is quite unlikely to be met (which is confirmed by the bug's lifetime), a few people do use these features together for very specific usages. And more importantly, writing such a configuration and the request to attack it is trivial. A quick workaround consists in forcing keep-alive off by adding "option httpclose" or "option forceclose" in the frontend. Alternatively, disabling HTTP-based TCP inspection rules enough if the application supports it. At first glance, this bug does not look like it could lead to remote code execution, as the overflowing part is controlled by the configuration and not by the user. But some deeper analysis should be performed to confirm this. And anyway, corrupting the process' memory and crashing it is quite trivial. Special thanks go to Yves Lafon from the W3C who reported this bug and deployed significant efforts to collect the relevant data needed to understand it in less than one week. CVE-2013-1912 was assigned to this issue. Note that 1.4 is also affected so the fix must be backported.	2013-04-03 02:12:55 +02:00
Willy Tarreau	2d43e18b69	BUG/MAJOR: http: fix regression introduced by commit `d655ffe` Sander Klein reported that since last snapshot, some downloads would hang from nginx but succeed from apache. The culprit was not too hard to find given the low number of recent changes affecting the data path. Commit `d655ffe` slightly reorganized the HTTP state machine and introduced this regression. The reason is that we must never jump into the MSG_DONE case without first flushing remaining data because this is not done anymore afterwards. This part is scheduled for being reorganized since it's totally ugly especially since we added compression, and this regression is an illustration of its readability. The issue is entirely dependant on the server close sequence, which explains why it was reproducible only with nginx here.	2013-04-03 00:22:25 +02:00
Willy Tarreau	ffb6f08bab	BUG/MAJOR: http: fix regression introduced by commit `a890d072` This commit fixed a bug and introduced a new one at the same time. It's a stupid typo, the index to store the context is [0], not [2]. The effect is that parsing the header can loop forever if multiple headers are found. This issue was reported by Lukas Tribus.	2013-04-02 23:19:30 +02:00
Willy Tarreau	a890d072fc	BUG/MAJOR: http: use a static storage for sample fetch context Baptiste Assmann reported that the cook*() ACLs do not work anymore. The reason is the way we store the hdr_ctx between subsequent calls to smp_fetch_cookie() since commit `3740635b` (1.5-dev10). The smp->ctx.a[] storage holds up to 8 pointers. It is not meant for generic storage. We used to store hdr_ctx in the ctx, but while it used to just fit for smp_fetch_hdr(), it does not for smp_fetch_cookie() since we stored it at offset 2. The correct solution is to use this storage to store a pointer to the current hdr_ctx struct which is statically allocated.	2013-04-02 12:01:06 +02:00
Willy Tarreau	d655ffe863	OPTIM: http: optimize the response forward state machine By replacing the if/else series with a switch/case, we could save another 20% on the worst case (chunks of 1 byte).	2013-04-02 02:01:00 +02:00
Willy Tarreau	0161d62d23	OPTIM: http: improve branching in chunk size parser By tweaking a bit some conditions in http_parse_chunk_size(), we could improve the overall performance in the worst case by 15%.	2013-04-02 02:00:57 +02:00
Yves Lafon	e267421e93	MINOR: http: status 301 should not be marked non-cacheable Also, browsers behaviour is inconsistent regarding the Cache-Control header field on a 301.	2013-03-30 11:22:41 +01:00
Yves Lafon	3e8d1ae2d2	MEDIUM: http: implement redirect 307 and 308 I needed to emit a 307 and noticed it was not available so I did it, as well as 308.	2013-03-29 19:17:41 +01:00
Yves Lafon	4e8ec500e5	MINOR: http: status code 303 is HTTP/1.1 only Don't return a 303 redirect with "HTTP/1.0" as it's HTTP/1.1 only.	2013-03-29 19:08:09 +01:00
Willy Tarreau	2fef9b1ef6	BUG/MEDIUM: http: fix another issue caused by http-send-name-header An issue reported by David Coulson is that when using http-send-name-header, the response processing would randomly be performed. The issue was first diagnosed by Cyril Bont� as being related to a time race when processing the closing of the response. In practice, the issue is a bit trickier. It happens that http_send_name_header() did not update msg->sol after a rewrite. This counter is supposed to point to the beginning of the message's body once headers are scheduled for being forwarded. And not updating it means that the first forwarding of the request headers in http_request_forward_body() does not send the correct count, leaving some bytes in chn->to_forward. Then if the server sends its response in a single packet with the close, the stream interface switches to state SI_ST_DIS which in turn moves to SI_ST_CLO in process_session(), and to close the outgoing connection. This is detected by http_request_forward_body(), which then switches the request message to the error state, and syncs all FSMs and removes any response analyser. The response analyser being removed, no processing is performed on the response buffer, which is tunnelled as-is to the client. Of course, the correct fix consists in having http_send_name_header() update msg->sol. Normally this ought not to have been needed, but it is an abuse to modify data already scheduled for being forwarded, so it is expected that such specific handling has to be done there. Better not have generic functions deal with such cases, so that it does not become the standard. Note: 1.4 does not have this issue even if it does not update the pointer either, because it forwards from msg->som which is not updated at the moment the connect() succeeds. So no backport is required.	2013-03-26 01:21:47 +01:00
Willy Tarreau	3bfeadb3f6	BUG/MEDIUM: http: add-header should not emit "-" for empty fields Patch `6cbbdbf3` fixed the missing "-" delimitors in logs but it caused them to be emitted with "http-request add-header", eventhough it was correctly fixed for the unique-id format. Fix this by simply removing LOG_OPT_MANDATORY in this case.	2013-03-24 07:33:22 +01:00
Willy Tarreau	6cbbdbf3f3	BUG/MEDIUM: log: emit '-' for empty fields again Commit `2b0108ad` accidently got rid of the ability to emit a "-" for empty log fields. This can happen for captured request and response cookies, as well as for fetches. Since we don't want to have this done for headers however, we set the default log method when parsing the format. It is still possible to force the desired mode using +M/-M.	2013-02-05 18:55:09 +01:00
Willy Tarreau	192e59fb07	CLEANUP: http: don't try to deinitialize http compression if it fails before init In select_compression_response_header(), some tests are rather confusing as the "fail" label is used to deinitialize the compression context for the session while it's branched only before initialization succeeds. The test is always false here and the dereferencing of the comp_algo pointer which might be null is also confusing. Remove that code which is not needed anymore since commit `ec3e3890` got rid of the latest issues. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:19 +01:00
Willy Tarreau	4521ba689c	CLEANUP: http: remove a useless null check srv cannot be null in http_perform_server_redirect(), as it's taken from the stream interface's target which is always valid for a server-based redirect, and it was already dereferenced above, so in practice, gcc already removes the test anyway. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:18 +01:00
Baptiste Assmann	116eefed8f	MINOR: config: http-request configuration error message misses new keywords "redirect" and "tarpit" keywords were missing from http-request configuration error message.	2013-01-05 16:53:49 +01:00
Willy Tarreau	56e9ffa6a6	BUG/MINOR: http-compression: lookup Cache-Control in the response, not the request As stated in both RFC2616 and the http-bis drafts, Cache-Control: no-transform must be looked up in the response since we're modifying the response. However, its presence in the request is irrelevant to any changes in the response : 7.2.1.6. no-transform The "no-transform" request directive indicates that an intermediary (whether or not it implements a cache) MUST NOT change the Content- Encoding, Content-Range or Content-Type request header fields, nor the request representation. 7.2.2.9. no-transform The "no-transform" response directive indicates that an intermediary (regardless of whether it implements a cache) MUST NOT change the Content-Encoding, Content-Range or Content-Type response header fields, nor the response representation. Note: according to the specs, we're supposed to emit the following response header : Warning: 214 transformation applied However no other product seems to do it, so the effect on user agents is unclear.	2013-01-05 16:31:58 +01:00
Willy Tarreau	ccbcc37a01	MEDIUM: http: add support for "http-request tarpit" rule The "reqtarpit" rule is not very handy to use. Now that we have more flexibility with "http-request", let's finally make the tarpit rules usable there. There are still semantical differences between apply_filters_to_request() and http_req_get_intercept_rule() because the former updates the counters while the latter does not. So we currently have almost similar code leafs for similar conditions, but this should be cleaned up later.	2012-12-28 14:47:19 +01:00
Willy Tarreau	81499eb67d	MEDIUM: http: add support for "http-request redirect" rules These are exactly the same as the classic redirect rules except that they can be interleaved with other http-request rules for more flexibility. The redirect parser should probably be changed to stop at the condition so that the caller puts its own condition pointer. At the moment, the redirect rule and condition are parsed at once by build_redirect_rule() and the condition is assigned to the http_req_rule.	2012-12-28 14:47:19 +01:00
Willy Tarreau	4baae248fc	REORG: config: move the http redirect rule parser to proto_http.c We'll have to use this elsewhere soon, let's move it to the proper place.	2012-12-28 14:47:19 +01:00
Willy Tarreau	71241abfd3	MINOR: http: move redirect rule processing to its own function We now have http_apply_redirect_rule() which does all the redirect-specific job instead of having this inside http_process_req_common(). Also one of the benefit gained from uniformizing this code is that both keep-alive and close response do emit the PR-- flags. The fix for the flags could probably be backported to 1.4 though it's very minor. The previous function http_perform_redirect() was becoming confusing so it was renamed http_perform_server_redirect() since it only applies to server-based redirection.	2012-12-28 14:47:19 +01:00
Willy Tarreau	96257ec5c8	CLEANUP: http: rename the misleading http_check_access_rule Several bugs were introduced recently due to a misunderstanding of how this function works and what it was supposed to do. Since it's supposed to only return the pointer to a rule which aborts further processing of the request, let's rename it to avoid further issues. The function was also slightly cleaned up without any functional change.	2012-12-28 14:47:19 +01:00
Willy Tarreau	d79a3b248e	BUG/MINOR: log: make log-format, unique-id-format and add-header more independant It happens that all of them call parse_logformat_line() which sets proxy->to_log with a number of flags affecting the line format for all three users. For example, having a unique-id specified disables the default log-format since fe->to_log is tested when the session is established. Similarly, having "option logasap" will cause "+" to be inserted in unique-id or headers referencing some of the fields depending on LW_BYTES. This patch first removes most of the dependency on fe->to_log whenever possible. The first possible cleanup is to stop checking fe->to_log for being null, considering that it always contains at least LW_INIT when any such usage is made of the log-format! Also, some checks are wrong. s->logs.logwait cannot be nulled by "logwait &= ~LW_" since LW_INIT is always there. This results in getting the wrong log at the end of a request or session when a unique-id or add-header is set, because logwait is still not null but the log-format is not checked. Further cleanups are required. Most LW_ flags should be removed or at least replaced with what they really mean (eg: depend on client-side connection, depend on server-side connection, etc...) and this should only affect logging, not other mechanisms. This patch fixes the default log-format and tries to limit interferences between the log formats, but does not pretend to do more for the moment, since it's the most visible breakage.	2012-12-28 09:51:00 +01:00
Willy Tarreau	cbc743e36c	BUG/MEDIUM: stats: disable request analyser when processing POST or HEAD After the response headers are sent and the request processing is done, the buffers are wiped out and the stream interface is closed. We must then disable the request analysers, otherwise some processing will happen on a closed stream interface and empty buffers which do not match, causing all sort of crashes. This issue was introduced with recent work on the stats, and was reported by Seri.	2012-12-28 08:36:50 +01:00
Willy Tarreau	1a1e8072f9	BUG/MINOR: stats: http-request rules still don't cope with stats Since commit `20b0de5`, we also had another remaining issue : an "http-request allow" rule would prevent a stats rule from being processed.	2012-12-27 10:34:21 +01:00
Willy Tarreau	8b80f0c9a2	BUG/MINOR: stats: last fix was still wrong Previous commit was still wrong, it broke add-header and set-header because we don't want to leave on these actions. The http_check_access_rule() function should be redesigned, it was initially thought for allow/deny rules but now it is executing other non-final rules and at the same time returning a pointer to the last final rule. That becomes a bit confusing and will need to be addressed before we implement redirect and return.	2012-12-25 21:55:37 +01:00
Willy Tarreau	418c1a0a95	BUG/MEDIUM: stats: fix stats page regression introduced by commit `20b0de5` This commit adding http-request add-header/set-header unfortunately introduced a regression to the handling of the stats page which is not matched anymore. Thanks to Dmitry Sivachenko for reporting this.	2012-12-25 20:52:58 +01:00
Willy Tarreau	20b0de56d4	MEDIUM: http: add http-request 'add-header' and 'set-header' to build headers These two new statements allow to pass information extracted from the request to the server. It's particularly useful for passing SSL information to the server, but may be used for various other purposes such as combining headers together to emulate internal variables.	2012-12-24 15:56:20 +01:00
Willy Tarreau	5c2e198390	MINOR: http: prepare to support more http-request actions We'll need to support per-action arguments, so we need to have an "arg" union in http_req_rule.	2012-12-24 12:26:26 +01:00
Willy Tarreau	354898bba9	MINOR: stats: replace STAT_FMT_CSV with STAT_FMT_HTML We need to switch the default mode if we want to add new output formats later. Let CSV be the default and HTML be an option.	2012-12-23 21:46:30 +01:00
Willy Tarreau	47ca54505c	MINOR: chunks: centralize the trash chunk allocation At the moment, we need trash chunks almost everywhere and the only correctly implemented one is in the sample code. Let's move this to the chunks so that all other places can use this allocator. Additionally, the get_trash_chunk() function now really returns two different chunks. Previously it used to always overwrite the same chunk and point it to a different buffer, which was a bit tricky because it's not obvious that two consecutive results do alias each other.	2012-12-23 21:46:07 +01:00
Willy Tarreau	1facd6d67e	REORG: stats: move the HTTP header injection to proto_http The HTTP header injection that are performed in dumpstats when responding or when redirecting a POST request have nothing to do in dumpstats. They do not use any state from the stats, and are 100% HTTP. Let's make the headers there in the HTTP core, and have dumpstats only produce stats.	2012-12-22 22:50:01 +01:00
Willy Tarreau	d9bdcd5139	REORG: stats: massive code reorg and cleanup The dumpstats code looks like a spaghetti plate. Several functions are supposed to be able to do several things but rely on complex states to dispatch the work to independant functions. Most of the HTML output is performed within the switch/case statements of the whole state machine. Let's clean this up by adding new functions to emit the data and have a few more iterators to avoid relying on so complex states. The new stats dump sequence looks like this for CLI and for HTTP : cli_io_handler() -> stats_dump_sess_to_buffer() // "show sess" -> stats_dump_errors_to_buffer() // "show errors" -> stats_dump_raw_info_to_buffer() // "show info" -> stats_dump_raw_info() -> stats_dump_raw_stat_to_buffer() // "show stat" -> stats_dump_csv_header() -> stats_dump_proxy() -> stats_dump_px_hdr() -> stats_dump_fe_stats() -> stats_dump_li_stats() -> stats_dump_sv_stats() -> stats_dump_be_stats() -> stats_dump_px_end() http_stats_io_handler() -> stats_http_redir() -> stats_dump_http() // also emits the HTTP headers -> stats_dump_html_head() // emits the HTML headers -> stats_dump_csv_header() // emits the CSV headers (same as above) -> stats_dump_http_info() // note: ignores non-HTML output -> stats_dump_proxy() // same as above -> stats_dump_http_end() // emits HTML trailer	2012-12-22 20:45:02 +01:00
Willy Tarreau	40f151aa79	BUG/MINOR: http: don't abort client connection on premature responses When a server responds prematurely to a POST request, haproxy used to cause the transfer to be aborted before the end. This is problematic because this causes the client to receive a TCP reset when it tries to push more data, generally preventing it from receiving the response which contain the reason for the premature reponse (eg: "entity too large" or an authentication request). From now on we take care of allowing the upload traffic to flow to the server even when the response has been received, since the server is supposed to drain it. That way the client receives the server response. This bug has been present since 1.4 and the fix should probably be backported there.	2012-12-20 12:10:09 +01:00
Willy Tarreau	f26b252ee4	MINOR: http: make resp_ver and status ACLs check for the presence of a response The two ACL fetches "resp_ver" and "status", if used in a request despite the warning, would return a match of zero length. This is inappropriate, better return a non-match to be more consistent with other ACL processing.	2012-12-14 08:35:45 +01:00
Willy Tarreau	4a55060aa6	MINOR: http: add the "base32+src" fetch method. This returns the concatenation of the base32 fetch and the src fetch. The resulting type is of type binary, with a size of 8 or 20 bytes depending on the source address family. This can be used to track per-IP, per-URL counters.	2012-12-09 14:53:32 +01:00
Willy Tarreau	ab1f7b72fb	MINOR: http: add the "base32" pattern fetch function This returns a 32-bit hash of the value returned by the "base" fetch method above. This is useful to track per-URL activity on high traffic sites without having to store all URLs. Instead a shorter hash is stored, saving a lot of memory. The output type is an unsigned integer.	2012-12-09 14:08:48 +01:00
Willy Tarreau	5d5b5d8eaf	MEDIUM: proto_tcp: add support for tracking L7 information Until now it was only possible to use track-sc1/sc2 with "src" which is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch as well as any transformation type. It works just like the "stick" directive. Samples are automatically converted to the correct types for the table. Only "tcp-request content" rules may use L7 information, and such information must already be present when the tracking is set up. For example it becomes possible to track the IP address passed in the X-Forwarded-For header. HTTP request processing now also considers tracking from backend rules because we want to be able to update the counters even when the request was already parsed and tracked. Some more controls need to be performed (eg: samples do not distinguish between L4 and L6).	2012-12-09 14:08:47 +01:00
Willy Tarreau	dc979f2492	BUG/MINOR: http: don't log a 503 on client errors while waiting for requests If a client aborts a request with an error (typically a TCP reset), we must log a 400. Till now we did not set the status nor close the stream interface, causing the request to attempt to be forwarded and logging a 503. Should be backported to 1.4 which is affected as well.	2012-12-04 10:52:22 +01:00
Willy Tarreau	14cba4b0b1	MEDIUM: connection: add an error code in connections This will be needed to improve error reporting, especially for SSL.	2012-12-03 14:22:13 +01:00
Willy Tarreau	8139b9959f	MINOR: compression: make the stats a bit more robust To ensure that we only count when a response was compressed, we also check for the SN_COMP_READY flag which indicates that the compression was effectively initialized. Comp_algo alone is meaningless.	2012-11-27 09:34:00 +01:00
Willy Tarreau	9101535038	BUG/MINOR: http: disable compression when message has no body Compression was not disabled on 1xx, 204, 304 nor HEAD requests. This is not really a problem, but it reports more compressed responses than really done.	2012-11-27 09:34:00 +01:00
Willy Tarreau	0a80a8dbb2	MINOR: http: factor out the content-type checks Let's only look up the content-type header once. This involves inverting the condition which is not dramatic. Also, we now always check the value length before comparing it, and we always reset the ctx.idx before looking a header up. Otherwise that could make header lookups depend on their on-wire order. It would be a minor issue however since at worst it would cause some responses not to be compressed.	2012-11-26 16:36:00 +01:00
William Lallemand	d300261bab	MINOR: compression: disable on multipart or status != 200 The compression is disabled when the HTTP status code is not 200, indeed compression on some HTTP code can create issues (ex: 206, 416). Multipart message should not be compressed eitherway.	2012-11-26 16:02:58 +01:00
William Lallemand	859550e068	BUG/MINOR: compression: Content-Type is case insensitive The Content-Type parameter must be case insensitive.	2012-11-26 16:02:58 +01:00
Willy Tarreau	f003d375ec	BUG/MINOR: http: don't report client aborts as server errors If a client aborts with an abortonclose flag, the close is forwarded to the server and when server response is processed, the analyser thinks it's the server who has closed first, and logs flags "SD" or "SH" and counts a server error. In order to avoid this, we now first detect that the client has closed and log a client abort instead. This likely is the reason why many people have been observing a small rate of SD/SH flags without being able to find what the error was. This fix should probably be backported to 1.4.	2012-11-26 13:50:02 +01:00
Willy Tarreau	5e16cbc3bd	MINOR: stats: report the total number of compressed responses per front/back Depending on the content-types and accept-encoding fields, some responses might or might not be compressed. Let's have a counter of the number of compressed responses and report it in the stats to help improve compression usage. Some cosmetic issues were fixed in the CSV output too (missing commas at the end).	2012-11-24 14:54:13 +01:00
William Lallemand	00bf1dee9c	BUG/MEDIUM: compression: does not forward trailers The commit `bf3ae617` introduced a regression about the forward of the trailers in compression mode.	2012-11-23 11:12:33 +01:00
Willy Tarreau	193b8c6168	MINOR: http: allow the cookie capture size to be changed Some users need more than 64 characters to log large cookies. The limit was set to 63 characters (and not 64 as previously documented). Now it is possible to change this using the global "tune.http.cookielen" setting if required.	2012-11-22 00:44:27 +01:00
William Lallemand	072a2bf537	MINOR: compression: CPU usage limit New option 'maxcompcpuusage' in global section. Sets the maximum CPU usage HAProxy can reach before stopping the compression for new requests or decreasing the compression level of current requests. It works like 'maxcomprate' but with the Idle.	2012-11-21 02:15:16 +01:00
William Lallemand	8b52bb3878	MEDIUM: compression: use pool for comp_ctx Use pool for comp_ctx, it is allocated during the comp_algo->init(). The allocation of comp_ctx is accounted for in the zlib_memory_available.	2012-11-21 01:56:47 +01:00
William Lallemand	bf3ae61789	MEDIUM: compression: don't compress when no data This patch makes changes in the http_response_forward_body state machine. It checks if the compress algorithm had consumed data before swapping the temporary and the input buffer. So it prevents null sized zlib chunks.	2012-11-19 14:57:29 +01:00
Willy Tarreau	b97b6190e1	BUG: compression: properly disable compression when content-type does not match Disabling compression based on the content-type was improperly done since the introduction of the COMP_READY flag, sometimes resulting in truncated responses.	2012-11-19 14:55:02 +01:00
Willy Tarreau	543db62e1f	BUG/MEDIUM: compression: release the zlib pools between keep-alive requests There was a possible memory leak in the zlib code when the first response of a keep-alive session was compressed, because the next request would reset the compression algo, preventing a later call to session_free() from releasing it. The reason is that it is necessary to release the assigned resources in http_end_txn_clean_session().	2012-11-15 16:41:22 +01:00
William Lallemand	ec3e3890f0	BUG/MINOR: compression: deinit zlib only when required The zlib stream was deinitialized even when the init failed.	2012-11-15 15:42:17 +01:00
William Lallemand	c04ca58222	BUG/MEDIUM: compression: no Content-Type header but type in configuration HAProxy was compressing data when there was no Content-Type header in the response but a compression type specified in the configuration.	2012-11-15 15:42:11 +01:00
Willy Tarreau	3fdb366885	MAJOR: connection: replace struct target with a pointer to an enum Instead of storing a couple of (int, ptr) in the struct connection and the struct session, we use a different method : we only store a pointer to an integer which is stored inside the target object and which contains a unique type identifier. That way, the pointer allows us to retrieve the object type (by dereferencing it) and the object's address (by computing the displacement in the target structure). The NULL pointer always corresponds to OBJ_TYPE_NONE. This reduces the size of the connection and session structs. It also simplifies target assignment and compare. In order to improve the generated code, we try to put the obj_type element at the beginning of all the structs (listener, server, proxy, si_applet), so that the original and target pointers are always equal. A lot of code was touched by massive replaces, but the changes are not that important.	2012-11-12 00:42:33 +01:00
Willy Tarreau	50fc7777c6	MEDIUM: http: refrain from sending "Connection: close" when Upgrade is present Some servers are not totally HTTP-compliant when it comes to parsing the Connection header. This is particularly true with WebSocket where it happens from time to time that a server doesn't support having a "close" token along with the "Upgrade" token in the Connection header. This broken behaviour has also been noticed on some clients though the problem is less frequent on the response path. Sometimes the workaround consists in enabling "option http-pretend-keepalive" to leave the request Connection header untouched, but this is not always the most convenient solution. This patch introduces a new solution : haproxy now also looks for the "Upgrade" token in the Connection header and if it finds it, then it refrains from adding any other token to the Connection header (though "keep-alive" and "close" may still be removed if found). The same is done for the response headers. This way, WebSocket much with less changes even when facing non-compliant clients or servers. At least it fixes the DISCONNECT issue that was seen on the websocket.org test. Note that haproxy does not change its internal mode, it just refrains from adding new tokens to the connection header.	2012-11-11 22:40:00 +01:00
Willy Tarreau	7f7ad91056	BUILD: stream_interface: remove si_fd() and its references si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which defines this name for its own purpose. It's easy enough to remove this one-liner function, so let's do it.	2012-11-11 20:53:29 +01:00
William Lallemand	d85f917daf	MINOR: compression: maximum compression rate limit This patch adds input and output rate calcutation on the HTTP compresion feature. Compression can be limited with a maximum rate value in kilobytes per second. The rate is set with the global 'maxcomprate' option. You can change this value dynamicaly with 'set rate-limit http-compression global' on the UNIX socket.	2012-11-10 17:47:27 +01:00
William Lallemand	f3747837e5	MINOR: compression: tune.comp.maxlevel This option allows you to set the maximum compression level usable by the compression algorithm. It affects CPU usage.	2012-11-10 17:47:07 +01:00
Finn Arne Gangstad	0a410e81fb	BUG: http: revert broken optimisation from `82fe75c1a7` This optimisation causes haproxy to time out requests that result in two TCP packets, one packet containing the header, and one packet containing the actual data. This is a very typical type of response from a lot of servers. [Willy: I suspect the fix might have an impact on the compression code which I'm not sure completely handles calls with 0 bytes to forward]	2012-11-10 17:38:36 +01:00
William Lallemand	4c49fae985	MINOR: compression: init before deleting headers Init the compression algorithm before modifying the response headers. So if the compression init fail, the headers won't be modified.	2012-11-08 15:23:30 +01:00
William Lallemand	1c2d622d82	CLEANUP: use struct comp_ctx instead of union Replace union comp_ctx by struct comp_ctx. Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of compression.h functions.	2012-11-05 10:23:16 +01:00
Finn Arne Gangstad	cbb9a4b128	MINOR: compression: Enable compression for IE6 w/SP2, IE7 and IE8 Some old browsers that have a user-agent starting with "Mozilla/4" do not support compressison correctly, so disable compression for those. Internet explorer 6 after Windows XP service pack 2, IE 7, and IE 8, do however support compression and still have a user agent starting with Mozilla/4, so we try to enable compression for those. MSIE has a user-agent on this form: Mozilla/4.0 (compatible; MSIE <version>; ...) 98% of MSIE 6 SP2 user agents start with Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1 The remaining 2% have additional flags before "SV1". This simplified matching looking for MSIE at exactly position 25 and SV1 at exacly position 51 gives a few false negatives, so sometimes a compression opportunity is lost. A test against 3 hours of traffic to around 3000 news sites worldwide gives less than 0.007% (70ppm) missed compression opportunities.	2012-10-29 22:03:14 +01:00
Willy Tarreau	7e2c647ee7	MEDIUM: remove remains of BUFSIZE in HTTP auth and sample conversions Sample conversions rely on two alternative buffers which were previously allocated as static bufs of size BUFSIZE. Now they're initialized to the global buffer size. It was the same for HTTP authentication. Note that it seems that none of them was prone to any mistake when dealing with the buffer size, but better stay on the safe side by maintaining the old assumption that a trash buffer is always "large enough".	2012-10-29 20:44:36 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	08b4d79d31	BUG: compression: disable auto-close and enable MSG_MORE during transfer We don't want the lower layer to forward a close while we're compressing, and we want the system to fuse outgoing TCP segments using MSG_MORE as much as possible to save round trips that can emerge from sending short packets with a PUSH flag. A test on a remote busy DSL line consisting in compressing a 100MB file on the fly full of zeroes only showed a transfer rate of a few kB/s due to these round trips.	2012-10-27 01:36:34 +02:00
Willy Tarreau	70737d142f	MINOR: compression: add an offload option to remove the Accept-Encoding header This is used when it is desired that backend servers don't compress (eg: because of buggy implementations).	2012-10-27 01:13:24 +02:00
Willy Tarreau	f2943dccd0	MAJOR: session: detach the connections from the stream interfaces We will need to be able to switch server connections on a session and to keep idle connections. In order to achieve this, the preliminary requirement is that the connections can survive the session and be detached from them. Right now they're still allocated at exactly the same place, so when there is a session, there are always 2 connections. We could soon improve on this by allocating the outgoing connection only during a connect(). This current patch touches a lot of code and intentionally does not change any functionnality. Performance tests show no regression (even a very minor improvement). The doc has not yet been updated.	2012-10-26 20:15:20 +02:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
Willy Tarreau	3c7b97b9f9	BUG/MINOR: http: compression should consider all Accept-Encoding header values Right now commit `82fe75c1` came with a minor bug limiting the check to the first accept-encoding header value only.	2012-10-26 14:52:02 +02:00
Willy Tarreau	05d846092f	MINOR: compression: automatically disable compression for older browsers A number of older browsers have many issues with compressed contents. It happens that all these older browsers announce themselves as "Mozilla/4" and that despite not being all broken, the amount of working browsers announcing themselves this way compared to all other ones is so tiny that it's not worth wasting cycles trying to adapt to every specific one. So let's simply disable compression for these older browsers. More information on this very detailed article : http://zoompf.com/2012/02/lose-the-wait-http-compression	2012-10-26 02:54:31 +02:00
William Lallemand	82fe75c1a7	MEDIUM: HTTP compression (zlib library support) This commit introduces HTTP compression using the zlib library. http_response_forward_body has been modified to call the compression functions. This feature includes 3 algorithms: identity, gzip and deflate: * identity: this is mostly for debugging, and it was useful for developping the compression feature. With Content-Length in input, it is making each chunk with the data available in the current buffer. With chunks in input, it is rechunking, the output chunks will be bigger or smaller depending of the size of the input chunk and the size of the buffer. Identity does not apply any change on data. * gzip: same as identity, but applying a gzip compression. The data are deflated using the Z_NO_FLUSH flag in zlib. When there is no more data in the input buffer, it flushes the data in the output buffer (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in input, or when there is no more data to read, it writes the end of data with Z_FINISH and the ending chunk. * deflate: same as gzip, but with deflate algorithm and zlib format. Note that this algorithm has ambiguous support on many browsers and no support at all from recent ones. It is strongly recommended not to use it for anything else than experimentation. You can't choose the compression ratio at the moment, it will be set to Z_BEST_SPEED (1), as tests have shown very little benefit in terms of compression ration when going above for HTML contents, at the cost of a massive CPU impact. Compression will be activated depending of the Accept-Encoding request header. With identity, it does not take care of that header. To build HAProxy with zlib support, use USE_ZLIB=1 in the make parameters. This work was initially started by David Du Colombier at Exceliance.	2012-10-26 02:30:48 +02:00
Willy Tarreau	54d23dfc07	CLEANUP: http: rename HTTP_MSG_DATA_CRLF state This state's name is confusing as it is only used with chunked encoding and makes newcomers think it's also related to the content-length. Let's call it CHUNK_CRLF to clear any doubt on this.	2012-10-26 01:13:52 +02:00
Willy Tarreau	24e6d972aa	OPTIM: http: inline http_parse_chunk_size() and http_skip_chunk_crlf() These functions are not that long and the compiler inlines them well. Doing so has sped up the chunked encoding parser by 41% ! Note that http_forward_trailers was also declared static because it's not exported.	2012-10-26 01:12:40 +02:00
Cyril Bonté	69fa99292e	MEDIUM: http: accept IPv6 values with (s)hdr_ip acl Commit `ceb4ac9c` states that IPv6 values are accepted by "hdr_ip" acl, but the code didn't allow it. This patch provides the ability to accept IPv6 values.	2012-10-25 14:41:33 +02:00
Willy Tarreau	fc47f91c9c	BUG/MEDIUM: http: set DONTWAIT on data when switching to tunnel mode Jaroslaw Bojar diagnosed an issue when haproxy switches to tunnel mode after a transfer. The response data are sent with the MSG_MORE flag, causing them to be needlessly queued in the kernel. In order to fix this, we set the CF_NEVER_WAIT flag on the channels when switching to tunnel mode. One issue remained with client-side keep-alive : if the response is sent before the end of the request, it suffers the same issue for the same reason. This is easily addressed by setting the CF_SEND_DONTWAIT flag on the channel when the response has been parsed and we're waiting for the other side. The same issue is present in 1.4 so the fix must be backported.	2012-10-20 10:41:37 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	cdbdd52a38	CLEANUP: http: use 'chn' to name channel variables, not 'buf' These "buf" were confusing as they were really refering to channels. At most places, a buffer was really all what was needed, so a struct buffer was used instead. It is possible that the performance has slightly increased by the removal of pointer offset in many pointer operations by directly using the buffer pointer instead of the channel pointer.	2012-10-12 23:02:51 +02:00
Willy Tarreau	394db379eb	REORG: http: rename msg->buf to msg->chn since it's a channel It's extremely confusing to have all those msg->buf->buf everywhere after the extraction of the buffer from the channel. Let's clean this up.	2012-10-12 22:40:39 +02:00
Willy Tarreau	1b6c00cb99	BUG/MAJOR: ensure that hdr_idx is always reserved when L7 fetches are used Baptiste Assmann reported a bug causing a crash on recent versions when sticking rules were set on layer 7 in a TCP proxy. The bug is easier to reproduce with the "defer-accept" option on the "bind" line in order to have some contents to parse when the connection is accepted. The issue is that the acl_prefetch_http() function called from HTTP fetches relies on hdr_idx to be preinitialized, which is not the case if there is no L7 ACL. The solution consists in adding a new SMP_CAP_L7 flag to fetches to indicate that they are expected to work on L7 data, so that the proxy knows that the hdr_idx has to be initialized. This is already how ACL and HTTP mode are handled. The bug was present since 1.5-dev9.	2012-10-05 22:46:09 +02:00
bedis	4c75cca8ba	MINOR: samples: update the url_param fetch to match parameters in the path It now supports an optional delimiter to allow to look for the parameter before the query string.	2012-10-05 15:17:23 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	b8ffd378f0	BUG/MAJOR: http: chunk parser was broken with buffer changes Since at least commit `a458b679`, msg->sov could become negative in http_parse_chunk_size() if a chunk size wrapped around the buffer. The effect is that at some point channel_forward() was called with a negative size, causing all data to be transferred without being analyzed anymore. Since haproxy does not support keep-alive with the server yet, this issue is not really noticeable, as the server closes the connection in response. Still, when tunnel mode is used or when pretent-keepalive is used, it is possible to see the problem. This issue was reported and diagnosed by William Lallemand at Exceliance.	2012-09-27 15:08:56 +02:00
Willy Tarreau	e92693af26	BUG: http: do not print garbage on invalid requests in debug mode Cyril Bont� reported a mangled debug output when an invalid request was sent with a faulty request line. The reason was the use of the msg->sl.rq.l offset which was not yet initialized in this case. So we change the way to report such an error so that first we initialize it to zero before parsing a message, then we use that to know whether we can trust it or not. If it's still zero, then we display the whole buffer, truncated by debug_hdr() to the first CR or LF character, which results in the first line only. The same operation was performed for the response, which was wrong too.	2012-09-24 21:16:42 +02:00
Willy Tarreau	eb6cead1de	MINOR: standard: make memprintf() support a NULL destination Doing so removes many checks that were systematically made because the callees don't know if the caller passed a valid pointer.	2012-09-24 10:53:16 +02:00
Willy Tarreau	2e1dca8f52	MEDIUM: http: add "redirect scheme" to ease HTTP to HTTPS redirection For instance : redirect scheme https if !{ is_ssl }	2012-09-12 08:43:15 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	986a9d2d12	MAJOR: connection: move the addr field from the stream_interface We need to have the source and destination addresses in the connection. They were lying in the stream interface so let's move them. The flags SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well. It's worth noting that tcp_connect_server() almost does not use the stream interface anymore except for a few flags. It has been identified that once we detach the connection from the SI, it will probably be needed to keep a copy of the server-side addresses in the SI just for logging purposes. This has not been implemented right now though.	2012-09-03 20:47:34 +02:00
Willy Tarreau	3cefd521fa	REORG: connection: move the target pointer from si to connection The target is per connection and is directly used by the connection, so we need it there. It's not needed anymore in the SI however.	2012-09-03 20:47:34 +02:00
Willy Tarreau	8263d2b259	CLEANUP: channel: use "channel" instead of "buffer" in function names This is a massive rename of most functions which should make use of the word "channel" instead of the word "buffer" in their names. In concerns the following ones (new names) : unsigned long long channel_forward(struct channel buf, unsigned long long bytes); static inline void channel_init(struct channel buf) static inline int channel_input_closed(struct channel buf) static inline int channel_output_closed(struct channel buf) static inline void channel_check_timeouts(struct channel b) static inline void channel_erase(struct channel buf) static inline void channel_shutr_now(struct channel buf) static inline void channel_shutw_now(struct channel buf) static inline void channel_abort(struct channel buf) static inline void channel_stop_hijacker(struct channel buf) static inline void channel_auto_connect(struct channel buf) static inline void channel_dont_connect(struct channel buf) static inline void channel_auto_close(struct channel buf) static inline void channel_dont_close(struct channel buf) static inline void channel_auto_read(struct channel buf) static inline void channel_dont_read(struct channel buf) unsigned long long channel_forward(struct channel *buf, unsigned long long bytes) Some functions provided by channel.[ch] have kept their "buffer" name because they are really designed to act on the buffer according to some information gathered from the channel. They have been moved together to the same place in the file for better readability but they were not changed at all. The "buffer" memory pool was also renamed "channel".	2012-09-03 20:47:33 +02:00
Willy Tarreau	03cdb7c678	CLEANUP: channel: usr CF_/CHN_ prefixes instead of BF_/BUF_ Get rid of these confusing BF_* flags. Now channel naming should clearly be used everywhere appropriate. No code was changed, only a renaming was performed. The comments about channel operations was updated.	2012-09-03 20:47:33 +02:00
Willy Tarreau	af81935b82	REORG: channel: move buffer_{replace,insert_line}* to buffer.{c,h} These functions do not depend on the channel flags anymore thus they're much better suited to be used on plain buffers. Move them from channel to buffer.	2012-09-03 20:47:33 +02:00
Willy Tarreau	3bf1b2b816	MAJOR: channel: stop relying on BF_FULL to take action This flag is quite complex to get right and updating it everywhere is a major pain, especially since the buffer/channel split. This is the first step of getting rid of it. Instead now it's dynamically computed whenever needed.	2012-09-03 20:47:33 +02:00
Willy Tarreau	a75bcef867	REORG: buffer: move buffer_flush, b_adv and b_rew to buffer.h These one now operate over real buffers, not channels anymore.	2012-09-03 20:47:32 +02:00
Willy Tarreau	8e21bb9e52	MAJOR: channel: remove the BF_OUT_EMPTY flag This flag was very problematic because it was composite in that both changes to the pipe or to the buffer had to cause this flag to be updated, which is not always simple (eg: there may not even be a channel attached to a buffer at all). There were not that many users of this flags, mostly setters. So the flag got replaced with a macro which reports whether the channel is empty or not, by checking both the pipe and the buffer. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's output size is more useful than checking the flag's replacement.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	75bf2c925f	REORG: sock_raw: rename the files raw_sock* The "raw_sock" prefix will be more convenient for naming functions as it will be prefixed with the data layer and suffixed with the data direction. So let's rename the files now to avoid any further confusion. The #include directive was also removed from a number of files which do not need it anymore.	2012-09-02 21:54:56 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	505e34a36d	MAJOR: get rid of fdtab[].state and use connection->flags instead fdtab[].state was only used to know whether a connection was in progress or an error was encountered. Instead we now use connection->flags to store a flag for both. This way, connection management will be able to update the connection status on I/O.	2012-09-02 21:51:26 +02:00
Willy Tarreau	a9fddca778	MINOR: http: add the urlp_val ACL match It's derived from other urlp_* matches, but there was no way to check for an integer value and it seems like it's significantly used.	2012-07-31 07:55:32 +02:00
Willy Tarreau	dae2a8a5a5	BUG/MINOR: tarpit: fix condition to return the HTTP 500 message Commit `fa7e1025` (1.3.16-rc1) introduced a minor bug by comparing req->flags with BF_READ_ERROR instead of checking for the bit. The result is that the error message is always returned even in case of client error. This has no real impact but this must be fixed. It may be backported to 1.4 and 1.3.	2012-07-31 07:55:31 +02:00
Willy Tarreau	a7ad50cdb1	MEDIUM: pattern: add the "base" sample fetch method This one returns the concatenation of the first Host header entry with the path. It can make content-switching rules easier, help with fighting DDoS on certain URLs and improve shared caches efficiency.	2012-07-26 19:08:38 +02:00
Willy Tarreau	6812bcfc94	MINOR: replace acl_fetch_{path,url}* with smp_fetch_* Doing so allows us to support sticking on URL, URL's IP, URL's port and path. Both fetch functions should be improved to support an optional depth allowing to stick to a server depending on just a few directory components. This would help with portals, some prefetch-capable caches and with outgoing connections using multiple internet links.	2012-07-26 19:06:40 +02:00
Willy Tarreau	a05903174f	BUG/MAJOR: cookie prefix doesn't support cookie-less servers Commit `827aee91` merged in 1.5-dev5 introduced a regression causing the srv pointer to be tested twice instead of srv then srv->cookie. The result is that if a server has no cookie in prefix mode, haproxy will crash when trying to modify it. Such a config is very unlikely to happen, except maybe with a backup server, which would cause haproxy to die with the last server in the farm. No backport is needed, only 1.5-dev was affected.	2012-06-06 16:07:00 +02:00
Willy Tarreau	4f8a83cb6e	MEDIUM: stats: add the ability to kill sessions from the admin interface It was not possible to kill remaining sessions from the admin interface, which is annoying especially when switching to maintenance mode. Now it's possible.	2012-06-04 00:26:23 +02:00
Willy Tarreau	d72822442d	MEDIUM: stats: add support for soft stop/soft start in the admin interface One important missing feature on the web interface is the ability to perform a soft stop/soft start. This is now possible.	2012-06-04 00:22:44 +02:00
Willy Tarreau	4992dd2d30	MINOR: http: add support for "httponly" and "secure" cookie attributes httponly This option tells haproxy to add an "HttpOnly" cookie attribute when a cookie is inserted. This attribute is used so that a user agent doesn't share the cookie with non-HTTP components. Please check RFC6265 for more information on this attribute. secure This option tells haproxy to add a "Secure" cookie attribute when a cookie is inserted. This attribute is used so that a user agent never emits this cookie over non-secure channels, which means that a cookie learned with this flag will be presented only over SSL/TLS connections. Please check RFC6265 for more information on this attribute.	2012-05-31 21:02:17 +02:00
Willy Tarreau	674021329c	REORG/MINOR: use dedicated proxy flags for the cookie handling Cookies were mixed with many other options while they're not used as options. Move them to a dedicated bitmask (ck_opts). This has released 7 flags in the proxy options and leaves some room for new proxy flags.	2012-05-31 20:40:20 +02:00
Willy Tarreau	cde18fc1ba	BUG/MINOR: perform_http_redirect also needs to rewind the buffer Commit `d1de8af362` was incomplete, because perform_http_redirect() also needs to rewind the buffer since it's called after data are scheduled for forwarding. No backport needed.	2012-05-30 08:00:56 +02:00
Cyril Bont�	a32d275ab0	BUG/MEDIUM: option forwardfor if-none doesn't work with some configurations When "option forwardfor" is enabled in a frontend that uses backends, "if-none" ignores the header name provided in the frontend. This prevents haproxy to add the X-Forwarded-For header if the option is not used in the backend. This may introduce security issues for servers/applications that rely on the header provided by haproxy. A minimal configuration which can reproduce the bug: defaults mode http listen OK bind :9000 option forwardfor if-none server s1 127.0.0.1:80 listen BUG-frontend bind :9001 option forwardfor if-none default_backend BUG-backend backend BUG-backend server s1 127.0.0.1:80	2012-05-30 06:43:24 +02:00
Willy Tarreau	949811319b	REORG/MEDIUM: stream_interface: move applet->state and private to connection The state and the private pointer are not specific to the applets, since SSL will require exactly both of them. Move them to the connection layer now and rename them. We also now ensure that both are NULL on first call.	2012-05-21 17:09:48 +02:00
Willy Tarreau	fb7508aefb	REORG/MINOR: stream_interface: move si->fd to struct connection The socket fd is used only when in socket mode and with a connection.	2012-05-21 16:47:54 +02:00
Willy Tarreau	73b013b070	MINOR: stream_interface: introduce a new "struct connection" type We start to move everything needed to manage a connection to a special entity "struct connection". We have the data layer operations and the control operations there. We'll also have more info in the future such as file descriptors and applet contexts, so that in the end it becomes detachable from the stream interface, which will allow connections to be reused between sessions. For now on, we start with minimal changes.	2012-05-21 16:31:45 +02:00
Willy Tarreau	ea95316bf1	MEDIUM: http: msg->sov and msg->sol will never wrap These ones are offsets now, so they cannot wrap. Let's remove the useless wrapping detection and simplify the forwarding code.	2012-05-18 23:50:43 +02:00
Willy Tarreau	2692736aa3	MEDIUM: http: get rid of msg->som which is not used anymore msg->som was zero before the body and was used to carry the beginning of a chunk size for chunked-encoded messages, at a moment when msg->sol is always zero. Remove msg->som and replace it with msg->sol where needed.	2012-05-18 23:50:43 +02:00
Willy Tarreau	06a000f56e	CLEANUP: http: make it more obvious that msg->som is always null outside of chunks Since the recent buffer reorg, msg->som is redundant with buf->p but still appears at a number of places. This tiny patch allows to confirm that som follows two states : - 0 from the moment the message starts to be parsed - relative offset to ->p for start of chunk when parsing chunks During this second state, ->sol is never used, so we should probably merge the two.	2012-05-18 23:04:32 +02:00
Willy Tarreau	09d1e254c9	MAJOR: http: stop using msg->sol outside the parsers This is a left-over from the buffer changes. Msg->sol is always null at the end of the parsing, so we must not use it anymore to read headers or find the beginning of a message. As a side effect, the dump of the request in debug mode is working again because it was relying on msg->sol not being null. Maybe it will even be mergeable with another of the message pointers.	2012-05-18 22:43:55 +02:00
Willy Tarreau	d1de8af362	BUG/MAJOR: fix regression on content-based hashing and http-send-name-header The recent split between the buffers and HTTP messages in 1.5-dev9 caused a major trouble : in the past, we used to keep a pointer to HTTP data in the buffer struct itself, which was the cause of most of the pain we had to deal with buffers. Now the two are split but we lost the information about the beginning of the HTTP message once it's being forwarded. While it seems normal, it happens that several parts of the code currently rely on this ability to inspect a buffer containing old contents : - balance uri - balance url_param - balance url_param check_post - balance hdr() - balance rdp-cookie() - http-send-name-header All these happen after the data are scheduled for being forwarded, which also causes a server to be selected. So for a long time we've been relying on supposedly sent data that we still had a pointer to. Now that we don't have such a pointer anymore, we only have one possibility : when we need to inspect such data, we have to rewind the buffer so that ->p points to where it previously was. We're lucky, no data can leave the buffer before it's being connecting outside, and since no inspection can begin until it's empty, we know that the skipped data are exactly ->o. So we rewind the buffer by ->o to get headers and advance it back by the same amount. Proceeding this way is particularly important when dealing with chunked- encoded requests, because the ->som and ->sov fields may be reused by the chunk parser before the connection attempt is made, so we cannot rely on them. Also, we need to be able to come back after retries and redispatches, which might change the size of the request if http-send-name-header is set. All of this is accounted for by the output queue so in the end it does not look like a bad solution. No backport is needed.	2012-05-18 22:23:01 +02:00
David du Colombier	7af4605ef7	BUG/MAJOR: trash must always be the size of a buffer Before it was possible to resize the buffers using global.tune.bufsize, the trash has always been the size of a buffer by design. Unfortunately, the recent buffer sizing at runtime forgot to adjust the trash, resulting in it being too short for content rewriting if buffers were enlarged from the default value. The bug was encountered in 1.4 so the fix must be backported there.	2012-05-16 14:21:55 +02:00
Willy Tarreau	7bb68abb9f	OPTIM/MEDIUM: stream_interface: add a new SI_FL_NOHALF flag This flag indicates that we're not interested in keeping half-open connections on a stream interface. It has the benefit of allowing the socket layer to cause an immediate write close when detecting an incoming read close. This releases resources much faster and saves one syscall (either a shutdown or setsockopt). This flag is only set by HTTP on the interface going to the server since we don't want to continue pushing data there when it has closed. Another benefit is that it responds with a FIN to a server's FIN instead of responding with an RST as it used to, which is much cleaner. Performance gains of 7.5% have been measured on HTTP connection rate on empty objects.	2012-05-13 14:52:22 +02:00
Willy Tarreau	93548be149	OPTIM: proto_http: don't enable quick-ack on empty buffers Commit `5e205524` was a bit overzealous by inconditionally enabling quick ack when a request is not yet in the buffer, because it also does so when nothing has been received yet, causing a useless ACK to be emitted. Improve the situation by doing this only if the input buffer is empty (indicating that nothing was sent by the client). In case of keep-alive, an empty buffer means we already have a response in flight which will serve as an ACK.	2012-05-13 08:44:16 +02:00
Willy Tarreau	59b9479667	BUG/MEDIUM: stream_interface: restore get_src/get_dst Commit e164e7a removed get_src/get_dst setting in the stream interfaces but forgot to set it in proto_tcp. Get the feature back because we need it for logging, transparent mode, ACLs etc... We now rely on the stream interface direction to know what syscall to use. One benefit of doing it this way is that we don't use getsockopt() anymore on outgoing stream interfaces nor on UNIX sockets.	2012-05-11 16:48:10 +02:00
Willy Tarreau	c63190d429	REORG: use the name sock_raw instead of stream_sock We'll soon have an SSL socket layer, and in order to ease the difference between the two, we use the name "sock_raw" to designate the one which directly talks to the sockets without any conversion.	2012-05-11 14:23:52 +02:00
Willy Tarreau	4a3fd4c8df	BUG/MAJOR: acl: http_auth_group() must not accept any user from the userlist http_auth and http_auth_group used to share the same fetch function, while they're doing very different things. The first one only checks whether the supplied credentials are valid wrt a userlist while the second not only checks this but also checks group ownership from a list of patterns. Recent acl/pattern merge caused a simplification here by which the fetch function would always return a boolean, so the group match was always fine if the user:password was valid, regardless of the patterns provided with the ACL. The proper solution consists in splitting the function in two, depending on what is desired. It's also worth noting that check_user() would probably be split, one to check user:password, and the other one to check for group ownership for an already valid user:password combination. At this point it is not certain if the group mask is still useful or not considering that the passwd check is always made. This bug was reported and diagnosed by Cyril Bont�. It first appeared in 1.5-dev9 so it does not need any backporting.	2012-05-10 23:18:26 +02:00
Cyril Bont�	20a804ac6d	BUG/MINOR: stats admin: "Unexpected result" was displayed unconditionally I introduced a regression in commit `19979e176e` while reworking the admin actions results. "Unexpected result" was displayed even if the action was applied due to a misplaced initialization. This small patch should fix it. Note: no need to backport.	2012-05-10 21:51:17 +02:00
Willy Tarreau	a7fe8e527c	MINOR: http: replace http_message_realign() with buffer_slow_realign() There is no more reason for the realign function being HTTP specific, it only operates on a buffer now. Let's move it to buffers.c instead. It's likely that buffer_bounce_realign is broken (not used), this will have to be inspected. The function is worth rewriting as it can be cheaper than buffer_slow_realign() to realign large wrapping buffers.	2012-05-08 21:28:17 +02:00
Willy Tarreau	515393649c	MINOR: acl: add the cook_val() match to match a cookie against an integer	2012-05-08 21:28:16 +02:00
Willy Tarreau	d04b1bce69	MEDIUM: http: improve error capture reports A number of important information were missing from the error captures, so let's improve them. Now we also log source port, session flags, transaction flags, message flags, pending output bytes, expected buffer wrapping position, total bytes transferred, message chunk length, and message body length. As such, the output format has slightly evolved and the source address moved to the third line : [08/May/2012:11:14:36.341] frontend echo (#1): invalid request backend echo (#1), server <NONE> (#-1), event #1 src 127.0.0.1:40616, session #4, session flags 0x00000000 HTTP msg state 26, msg flags 0x00000000, tx flags 0x00000000 HTTP chunk len 0 bytes, HTTP body len 0 bytes buffer flags 0x00909002, out 0 bytes, total 28 bytes pending 28 bytes, wrapping at 8030, error at position 7: 00000 GET / /?t=20000 HTTP/1.1\r\n 00026 \r\n [08/May/2012:11:13:13.426] backend echo (#1) : invalid response frontend echo (#1), server local (#1), event #0 src 127.0.0.1:40615, session #1, session flags 0x0000044e HTTP msg state 32, msg flags 0x0000000e, tx flags 0x08200000 HTTP chunk len 0 bytes, HTTP body len 20 bytes buffer flags 0x00008002, out 81 bytes, total 92 bytes pending 11 bytes, wrapping at 7949, error at position 9: 00000 Foo: bar\r\r\n	2012-05-08 21:28:16 +02:00
Willy Tarreau	69d8c5d99e	BUG/MINOR: http: ensure that msg->err_pos is always relative to buf->p Since the beginning of buffer&msg changes, the error position (err_pos) had not completely been converted and some offsets still appear wrong. Now we ensure that everywhere msg->err_pos is relative to buf->p and we always report buf->i bytes starting at buf->p in all error captures, which ensures that err_pos is there. This is not exactly a bug and is specific to latest changes so no backport is needed.	2012-05-08 21:28:15 +02:00
Willy Tarreau	d6c2e8c916	BUG/MINOR: http: error snapshots are wrong if buffer wraps Commit 81f2fb added support for wrapping buffer captures, but unfortunately the code used to perform two memcpy() over the same destination, causing a loss of the start of the buffer rendering some error snapshots unusable. This bug is present in 1.4 too and must be backported.	2012-05-08 21:28:15 +02:00
Willy Tarreau	060781fb4a	REORG: stream_interface: create a struct sock_ops to hold socket operations These operators are used regardless of the socket protocol family. Move them to a "sock_ops" struct. ->read and ->write have been moved there too as they have no reason to remain at the protocol level.	2012-05-08 21:28:14 +02:00
Willy Tarreau	cd3b094618	REORG: rename "pattern" files They're now called "sample" everywhere to match their description.	2012-05-08 20:57:21 +02:00
Willy Tarreau	1278578487	REORG: use the name "sample" instead of "pattern" to designate extracted data This is mainly a massive renaming in the code to get it in line with the calling convention. Next patch will rename a few files to complete this operation.	2012-05-08 20:57:20 +02:00
Willy Tarreau	7dcb6480db	MEDIUM: acl: extend the pattern parsers to report meaningful errors By passing the error pointer to all ACL parsers, we can make them report useful errors and not simply fail.	2012-05-08 20:57:20 +02:00
Willy Tarreau	b7451bb660	MEDIUM: acl: report parsing errors to the caller All parsing errors were known but impossible to return. Now by making use of memprintf(), we're able to build meaningful error messages that the caller can display.	2012-05-08 20:57:20 +02:00
Willy Tarreau	28376d62cb	MEDIUM: http: merge ACL and pattern cookie fetches into a single one It's easy to merge pattern and ACL fetches of cookies. It allows us to remove two distinct fetch functions. The new function internally uses an occurrence number to serve both purposes, but it didn't appear worth exposing it outside so there is no keyword argument to set it. However one of the benefits is that the "cookie" fetch for stick tables now automatically adapts to requests and responses, so there is no more need for set-cookie().	2012-05-08 20:57:19 +02:00
Willy Tarreau	185b5c4a7b	MEDIUM: http: merge acl and pattern header fetch functions HTTP header fetch is now done using smp_fetch_hdr() for both ACLs and patterns. This one also supports an occurrence number, making it possible to specify explicit occurrences for ACLs and patterns.	2012-05-08 20:57:19 +02:00
Willy Tarreau	25c1ebc0c9	MEDIUM: acl/pattern: start merging common sample fetch functions src_port, dst_port and url_param have converged between ACLs and patterns. This means that src_port is now available in patterns and that urlp_* has been added to ACLs. Some code has moved to accommodate for static function definitions, but there were little changes.	2012-05-08 20:57:17 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	b8c8f1f611	MEDIUM: pattern: retrieve the sample type in the sample, not in the keyword description We need the pattern fetchers and converters to correctly set the output type so that they can be used by ACL fetchers. By using the sample type instead of the keyword type, we also open the possibility to create some multi-type pattern fetch methods later (eg: "src" being v4/v6). Right now the type in the keyword is used to validate the configuration.	2012-05-08 20:57:16 +02:00
Willy Tarreau	342acb4775	MEDIUM: pattern: integrate pattern_data into sample and use sample everywhere Now there is no more reference to union pattern_data. All pattern fetch and conversion functions now make use of the common sample type. Note: none of them adjust the type right now so it's important to do it next otherwise we would risk sharing such functions with ACLs and seeing them fail.	2012-05-08 20:57:15 +02:00
Willy Tarreau	21e5b0e3cb	MEDIUM: get rid of SMP_F_READ_ONLY and SMP_F_MUST_FREE These ones were either unused or improperly used. Some integers were marked read-only, which does not make much sense. Buffers are not read-only, they're "constant" in that they must be kept intact after any possible change.	2012-05-08 20:57:15 +02:00
Willy Tarreau	197e10aaae	MEDIUM: acl: get rid of the SET_RES flags We now simply rely on a boolean result from a fetch to declare a match. Booleans are not compared against patterns, they fix the result.	2012-05-08 20:57:15 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	8f7406e9b4	MEDIUM: acl: remove the ACL_TEST_F_NULL_MATCH flag This flag was used to force a boolean match even if there was no pattern to match. It was used only by http_auth() and designed only for this one. It's easier and cleaner to make the fetch function perform the test and report the boolean result as a few other functions already do. It simplifies the acl_exec_cond() logic and will help merging ACLs and patterns.	2012-05-08 20:57:13 +02:00
Willy Tarreau	21d68a6895	MEDIUM: pattern: add an argument validation callback to pattern descriptors This is used to validate that arguments are coherent. For instance, payload_lv expects that the last arg (if any) is not more negative than the sum of the first two. The error is reported if any.	2012-05-08 20:57:13 +02:00
Willy Tarreau	9fcb984b17	MEDIUM: pattern: use the standard arg parser We don't need the pattern-specific args parsers anymore, make use of the common parser instead. We still need to improve this by adding a validation function to report abnormal argument values or combinations. We don't report precise parsing errors yet but this was not previously done either.	2012-05-08 20:57:13 +02:00
Willy Tarreau	f995410355	MEDIUM: pattern: get rid of arg_i in all functions making use of arguments arg_i was almost unused, and since we migrated to use struct arg everywhere, the rare cases where arg_i was needed could be replaced by switching to arg->type = ARGT_STOP.	2012-05-08 20:57:12 +02:00
Willy Tarreau	ecfb8e8ff9	MEDIUM: pattern: replace type pattern_arg with type arg arg is more complete than pattern_arg since it also covers ACL args, so let's use this one instead.	2012-05-08 20:57:12 +02:00
Willy Tarreau	61612d49a7	MAJOR: acl: store the ACL argument types in the ACL keyword declaration The types and minimal number of ACL keyword arguments are now stored in their declaration. This will allow many more fantasies if some ACL use several arguments or types. Doing so required to rework all ACL keyword declarations to add two parameters. So this was a good opportunity for a general cleanup and to sort all entries in alphabetical order. We still have two pending issues : - parse_acl_expr() checks for errors but has no way to report them to the user ; - the types of some arguments are still not resolved and kept as strings (eg: ARGT_FE/BE/TAB) for compatibility reasons, which must be resolved in acl_find_targets()	2012-05-08 20:57:11 +02:00
Willy Tarreau	34db108423	MAJOR: acl: make use of the new argument parsing framework The ACL parser now uses the argument parser to build a typed argument list. Right now arguments are all strings and only one argument is supported since this is what ACLs currently support.	2012-05-08 20:57:11 +02:00
Willy Tarreau	d53e2428d1	MEDIUM: http/acl: make acl_fetch_hdr_{ip,val} rely on acl_fetch_hdr() These two functions will now exploit the return of acl_fetch_hdr() instead of doing the same work again and again.	2012-05-08 20:57:10 +02:00
Willy Tarreau	e333ec9f24	MEDIUM: http/acl: merge all request and response ACL fetches of headers and cookies Latest changes have made it possible to remove all differences between request and response processing, making it worth merging request and response ACL fetch functions to reduce code size. Most likely with minor adaptation it will be possible to use the same hdr_* functions to match in the response path, and cook_* for the response cookie too.	2012-05-08 20:57:10 +02:00
Willy Tarreau	7744e0cf43	BUG/MINOR: http_auth: ACLs are volatile, not permanent ACLs are volatile since they require a fetch of request buffer data which is then copied to a temporary shared place. The issue is minor though since auth is generally checked very early.	2012-05-08 20:57:10 +02:00
Willy Tarreau	c0239e0425	MEDIUM: http: make all ACL fetch function use acl_prefetch_http() All ACLs which need to process HTTP contents first call this function which performs all the preliminary tests and also triggers the request parsing if needed. A macro was written to simplify the code. As a side effect, it's not required anymore to check for the HTTP ACL before checking for HTTP contents.	2012-05-08 20:57:10 +02:00
Willy Tarreau	14174bc4c0	MEDIUM: http: add a prefetch function for ACL pattern fetch This function will be called by all ACL fetch functions. Right now all ACL fetch functions have to perform the exact same tests to check whether data are available. Also, only one of them is able to actually parse an HTTP request. Using the prefetch function, it will be possible to try to parse a request on the fly and to avoid the fetch if some data are missing. This will significantly reduce the amount of tests in all ACL fetch functions.	2012-05-08 20:57:09 +02:00
Willy Tarreau	9dab5fc4d4	MEDIUM: buffers: rename a number of buffer management functions The following renaming took place : 1) buffer input functions buffer_put_block => bi_putblk buffer_put_char => bi_putchr buffer_put_string => bi_putstr buffer_put_chunk => bi_putchk buffer_feed => bi_putstr buffer_feed_chunk => bi_putchk buffer_cut_tail => bi_erase buffer_ignore => bi_fast_delete 2) buffer output functions buffer_get_char => bo_getchr buffer_get_line => bo_getline buffer_get_block => bo_getblk buffer_skip => bo_skip buffer_write => bo_inject 3) buffer input avail/full functions were introduced : bi_avail bi_full	2012-05-08 20:56:56 +02:00
Willy Tarreau	cc5cfcbcce	MEDIUM: buffers: add new pointer wrappers and get rid of almost all buffer_wrap_add calls buffer_wrap_add was convenient for the migration but is not handy at all. Let's have new wrappers that report input begin/end and output begin/end instead. It looks like we'll also need a b_adv(ofs) to advance a buffer's pointer.	2012-05-08 12:28:14 +02:00
Willy Tarreau	ec1bc82a1d	MEDIUM: buffers: fix unsafe use of buffer_ignore at some places buffer_ignore may only be used when the output of a buffer is empty, but it's not granted it is always the case when sending HTTP error responses. Better use buffer_cut_tail() instead, and use buffer_ignore only on non-wrapping data.	2012-05-08 12:28:14 +02:00
Willy Tarreau	8b1323e4cb	MINOR: http: remove useless wrapping checks in http_msg_analyzer The message cannot wrap here.	2012-05-08 12:28:14 +02:00
Willy Tarreau	4baf44ba67	MEDIUM: http: remove buffer arg in chunk parsing functions The buffer pointer is now taken from the http_msg in the following functions : http_parse_chunk_size http_forward_trailers http_skip_chunk_crlf Most internal pointers were converted to const as the result of the operation.	2012-05-08 12:28:14 +02:00
Willy Tarreau	21710ffa35	MEDIUM: http: remove buffer arg in http_buffer_heavy_realign The buffer pointer is now taken from the http_msg. The function has also been renamed "http_message_realign".	2012-05-08 12:28:13 +02:00
Willy Tarreau	418bfcc794	MEDIUM: http: remove buffer arg in http_upgrade_v09_to_v10 The buffer and http_msg pointers are now taken from the transaction.	2012-05-08 12:28:13 +02:00
Willy Tarreau	a560c21f54	MEDIUM: http: remove buffer arg in http_msg_analyzer The buffer pointer is now taken from the http_msg.	2012-05-08 12:28:13 +02:00
Willy Tarreau	8a0cef2dad	MEDIUM: http: remove buffer arg in http_capture_bad_message The buffer pointer is now taken from the http_msg.	2012-05-08 12:28:13 +02:00
Willy Tarreau	6acf7c9179	MEDIUM: http: remove buffer arg in a few header manipulation functions The buffer pointer is now taken from the http_msg in the following functions : - http_remove_header2 - http_header_add_tail - http_header_add_tail2 - http_parse_connection_header - http_change_connection_header	2012-05-08 12:28:12 +02:00
Willy Tarreau	45c0d98769	MEDIUM: http: http_send_name_header: remove references to msg and buffer They can be deduced from txn.	2012-05-08 12:28:12 +02:00
Willy Tarreau	3a215bedba	MAJOR: http: make http_msg->sol relative to buffer's origin msg->sol is now a relative pointer just like all other ones. There is no more absolute references to the buffer outside the struct buffer itself. Next two cleanups should include removing buffer references to functions which already have an msg, and removal of wrapping detection in request and response parsing which cannot wrap by definition.	2012-05-08 12:28:12 +02:00
Willy Tarreau	62f791ea6f	MEDIUM: http: add a pointer to the buffer in http_msg ACLs and patterns only rely on a struct http_msg and don't know the pointer to the actual data. struct http_msg will soon only hold relative references so that's not possible. We need http_msg to hold a reference to the struct buffer before having relative pointers everywhere. It is likely that doing so will also result in opportunities to simplify a number of functions arguments. The following functions are already candidate : http_buffer_heavy_realign http_capture_bad_message http_change_connection_header http_forward_trailers http_header_add_tail http_header_add_tail2 http_msg_analyzer http_parse_chunk_size http_parse_connection_header http_remove_header2 http_send_name_header http_skip_chunk_crlf http_upgrade_v09_to_v10	2012-05-08 12:28:12 +02:00
Willy Tarreau	12e48b36dd	MAJOR: http: turn http_msg->eol to a buffer-relative offset It was an absolute pointer to the buffer's data, now it's a pointer relative to the buffer's origin.	2012-05-08 12:28:12 +02:00
Willy Tarreau	fa4a03ca08	CLEANUP: http: remove unused http_msg->col The <col> element of the struct http_msg has not been used for a long time now, remove it.	2012-05-08 12:28:11 +02:00
Willy Tarreau	ea1175a687	MAJOR: http: change msg->{som,col,sov,eoh} to be relative to buffer origin These offsets were relative to the buffer itself. Now they're relative to the buffer's origin (buf->p) which normally corresponds to the start of current message. This saves a big dependency between the HTTP message struct and the buffers. It appeared during this change that ->col is not used anymore (it will have to be removed). Next step is to turn ->eol and ->sol from absolute to relative.	2012-05-08 12:28:11 +02:00
Willy Tarreau	a458b67965	MAJOR: http: move buffer->lr to http_msg->next The buffer's pointer <lr> was only used by HTTP parsers which also use a struct http_msg to keep track of the parser's state. We've reached a point where it makes no sense to keep ->lr in the buffer, as the split between buffer and msg is only arbitrary for historical reasons. This change ensures that touching buffers will not impact HTTP messages anymore, making the buffers more content-agnostic. However, it becomes very important not to forget to update msg->next when some data get forwarded or moved (and in general each time buf->p is updated). The new pointer in http_msg becomes relative to buffer->p so that parsing multiple messages becomes easier. It is possible that at one point ->som and ->next will be merged. Note: http_parse_reqline() and http_parse_stsline() have been temporarily modified to know the message starting point in the buffer (->p).	2012-05-08 12:28:11 +02:00
Willy Tarreau	363a5bb152	MAJOR: buffers: replace buf->r with buf->p + buf->i This change gets rid of buf->r which is always equal to buf->p + buf->i. It removed some wrapping detection at a number of places, but required addition of new relative offset computations at other locations. A large number of places can be simplified now with extreme care, since most of the time, either the pointer has to be computed once or we need a difference between the old ->w and old ->r to compute free space. The cleanup will probably happen with the rewrite of the buffer_input_* and buffer_output_* functions anyway. buf->lr still has to move to the struct http_msg and be relative to buf->p for the rework to be complete.	2012-05-08 12:28:11 +02:00
Willy Tarreau	89fa706d39	MAJOR: buffers: replace buf->w with buf->p - buf->o This change introduces the buffer's base pointer, which is the limit between incoming and outgoing data. It's the point where the parsing should start from. A number of computations have already been greatly simplified, but more simplifications are expected to come from the removal of buf->r. The changes appear good and have revealed occasional improper use of some pointers. It is possible that this patch has introduced bugs or revealed some, although preliminary testings tend to indicate that everything still works as it should.	2012-05-08 12:28:10 +02:00
Willy Tarreau	02d6cfc1d7	MAJOR: buffer: replace buf->l with buf->{o+i} We don't have buf->l anymore. We have buf->i for pending data and the total length is retrieved by adding buf->o. Some computation already become simpler. Despite extreme care, bugs are not excluded. It's worth noting that msg->err_pos as set by HTTP request/response analysers becomes relative to pending data and not to the beginning of the buffer. This has not been completed yet so differences might occur when outgoing data are left in the buffer.	2012-05-08 12:28:10 +02:00
Willy Tarreau	2e046c6017	MAJOR: buffer rework: replace ->send_max with ->o This is the first minor step of the buffer rework. It's only renaming, it should have no impact.	2012-04-30 11:57:00 +02:00
Willy Tarreau	a36fc4d7ed	MEDIUM: move message-related flags from transaction to message Too many flags are stored in the transaction structure. Some flags are clearly message-specific and exist in two versions (request and response). Move them to a new "flags" field in the http_message struct instead.	2012-04-30 11:57:00 +02:00
Willy Tarreau	21337825c0	CLEANUP: remove a few warning about unchecked return values in debug code There were a few unchecked write() calls in the debug code that cause gcc 4.x to emit warnings on recent libc. We don't want to check them as we can't make anything from the result, let's simply surround them with an empty if statement. Note that one of the warnings was for chdir("/") which normally cannot fail since it follows a successful chroot (which means the perms are necessarily there). Anyway let's move the call uppe to protect it too.	2012-04-30 11:56:30 +02:00
Willy Tarreau	b56928a74c	CLEANUP: http: message parser must ignore HTTP_MSG_ERROR The issue only happens when DEBUG_FULL is enabled, which causes http_msg_analyzer() to complain if it's called twice with an invalid message, for instance because of two consecutive ACLs using req_proto_http. The code is commented out when DEBUG_FULL is disabled, so this is not a bug, just an annoyance for the developer.	2012-04-30 11:51:59 +02:00
Willy Tarreau	46787ed700	BUILD: http: stop gcc-4.1.2 from complaining about possibly uninitialized values The three warnings below are totally wrong since the variables depend on another one which is only turned on when the variables are initialized. Still this gcc-4.1.2 isn't able to see this and prefers to complain wrongly. So let's initialize the variables to shut it up since we're not in the fast path. src/proto_http.c: In function 'acl_fetch_any_cookie_cnt': src/proto_http.c:8393: warning: 'val_end' may be used uninitialized in this function src/proto_http.c: In function 'http_process_req_stat_post': src/proto_http.c:2577: warning: 'st_next_param' may be used uninitialized in this function src/proto_http.c:2577: warning: 'st_cur_param' may be used uninitialized in this function	2012-04-30 00:19:31 +02:00
Willy Tarreau	3fb818c014	BUILD: http: make extract_cookie_value() return an int not size_t It's very annoying that we have to deal with the crappy size_t and with ints at some places because these ones don't mix well. Patch 6f61b2 changed the chunk len to int but its size remains size_t and some functions are having trouble being used by several callers depending on the type of their arguments. Let's turn extract_cookie_value() to int for now on, and plan a massive cleanup later to remove all size_t.	2012-04-30 00:19:28 +02:00
Willy Tarreau	9b061e3320	MEDIUM: stream_sock: add a get_src and get_dst callback and remove SN_FRT_ADDR_SET These callbacks are used to retrieve the source and destination address of a socket. The address flags are not hold on the stream interface and not on the session anymore. The addresses are collected when needed. This still needs to be improved to store the IP and port separately so that it is not needed to perform a getsockname() when only the IP address is desired for outgoing traffic.	2012-04-07 18:03:52 +02:00
William Lallemand	a73203e3dc	MEDIUM: log: Unique ID The Unique ID, is an ID generated with several informations. You can use a log-format string to customize it, with the "unique-id-format" keyword, and insert it in the request header, with the "unique-id-header" keyword.	2012-04-07 16:25:26 +02:00
William Lallemand	5f2324019d	MEDIUM: log: New format-log flags: %Fi %Fp %Si %Sp %Ts %rt %H %pid %Fi: Frontend IP %Fp: Frontend Port %Si: Server IP %Sp: Server Port %Ts: Timestamp %rt: HTTP request counter %H: hostname %pid: PID +X: Hexadecimal represenation The +X mode in logformat displays hexadecimal for the following flags %Ci %Cp %Fi %Fp %Bi %Bp %Si %Sp %Ts %ct %pid rename logformat_write_string() to lf_text() Optimize size computation	2012-04-07 16:05:39 +02:00
Willy Tarreau	04aa6a9ce8	MEDIUM: http: add cookie and scookie ACLs The ACL matches rely on the extract_cookie_value() function as used for for patterns. This permits ACLs to match cookie values based on the cookie name instead of having to perform substring matching on the cookie header.	2012-04-07 08:47:26 +02:00
Willy Tarreau	4573af939c	MEDIUM: http: make extract_cookie_value() iterate over cookie values This will make the function usable for ACLs.	2012-04-06 18:20:06 +02:00
Willy Tarreau	c89ccb6221	MEDIUM: log: add a new cookie flag 'U' to report situations where cookie is not used This happens when a "use-server" rule sets the server instead.	2012-04-05 21:18:22 +02:00
Willy Tarreau	4a5cadea40	MEDIUM: session: implement the "use-server" directive Sometimes it is desirable to forward a particular request to a specific server without having to declare a dedicated backend for this server. This can be achieved using the "use-server" rules. These rules are evaluated after the "redirect" rules and before evaluating cookies, and they have precedence on them. There may be as many "use-server" rules as desired. All of these rules are evaluated in their declaration order, and the first one which matches will assign the server.	2012-04-05 21:14:10 +02:00
Cyril Bonté	19979e176e	MINOR: stats admin: reduce memcmp()/strcmp() calls on status codes memcmp()/strcmp() calls were needed in different parts of code to determine the status code. Each new status code introduces new calls, which can become inefficient and source of bugs. This patch reorganizes the code to rely on a numeric status code internally and to be hopefully more generic.	2012-04-05 09:58:27 +02:00
Cyril Bonté	cf8d9ae3cd	MINOR: stats admin: allow unordered parameters in POST requests Previously, the stats admin page required POST parameters to be provided exactly in the same order as the HTML form. This patch allows to handle those parameters in any orders. Also, note that haproxy won't alter server states anymore if backend or server names are ambiguous (duplicated names in the configuration) to prevent unexpected results (the same should probably be applied to the stats socket).	2012-04-05 09:58:25 +02:00
Willy Tarreau	42f7d89156	BUG/MAJOR: possible crash when using capture headers on TCP frontends Olufemi Omojola provided a config and a core showing a possible crash when captures are configured on a TCP-mode frontend which branches to an HTTP backend. The reason is that being in TCP mode, the frontend does not allocate capture pools for the request, but the HTTP backend tries to use them and dies on the NULL. While such a config has long been unlikely to happen, it looks like people using websocket tend to do this more often now. Change the control to use the pointer instead of the number of captures to know when to log. This bug was reported in 1.4.20, so it must be backported there.	2012-03-24 08:35:36 +01:00
William Lallemand	bddd4fd93b	MEDIUM: log: use log_format for mode tcplog Merge http_sess_log() and tcp_sess_log() to sess_log() and move it to log.c A new field in logformat_type define if you can use a logformat variable in TCP or HTTP mode. doc: log-format in tcp mode Note that due to the way log buffer allocation currently works, trying to log an HTTP request without "option httplog" is still not possible. This will change in the near future.	2012-03-12 15:47:13 +01:00
Willy Tarreau	869fc1edc2	BUG: http: disable TCP delayed ACKs when forwarding content-length data Commits 5c6209 and 072930 were aimed at avoiding undesirable PUSH flags when forwarding chunked data, but had the undesired effect of causing data advertised by content-length to be affected by the delayed ACK too. This can happen when the data to be forwarded are small enough to fit into a single send() call, otherwise the BF_EXPECT_MORE flag would be removed. Content-length data don't need the BF_EXPECT_MORE flag since the low-level forwarder already knows it can safely rely on bf->to_forward to set the appropriate TCP flags. Note that the issue is only observed in requests at the moment, though the later introduction of server-side keep-alive could trigger the issue on the response path too. Special thanks to Randy Shults for reporting this issue with a lot of details helping to reproduce it. The fix must be backported to 1.4.	2012-03-05 08:46:34 +01:00
Willy Tarreau	2d5cd479bc	BUG: queue: fix dequeueing sequence on HTTP keep-alive sessions When a request completes on a server and the server connection is closed while the client connection stays open, the HTTP engine releases all server connection slots and scans the queues to offer the connection slot to another pending request. An issue happens when the released connection allows other requests to be dequeued : may_dequeue_tasks() relies on srv->served which is only decremented by sess_change_server() which itself is only called after may_dequeue_tasks(). This results in no connection being woken up until another connection terminates so that may_dequeue_tasks() is called again. This fix is minimalist and only moves sess_change_server() earlier (which is safe). It should be reworked and the code factored out so that the same occurrence in session.c shares the same code. This bug has been there since the introduction of option-http-server-close and the fix must be backported to 1.4.	2012-03-01 23:49:20 +01:00
Willy Tarreau	431946e961	MEDIUM: increase chunk-size limit to 2GB-1 Since commit `115acb97`, chunk size was limited to 256MB. There is no reason for such a limit and the comment on the code suggests a missing zero. However, increasing the limit past 2 GB causes trouble due to some 32-bit subtracts in various computations becoming negative (eg: buffer_max_len). So let's limit the chunk size to 2 GB - 1 max.	2012-02-27 09:51:52 +01:00
Willy Tarreau	53bf6af3f9	BUG: fix httplog trailing LF commit `a1cc3811` introduced an undesirable \0\n ending on HTTP log messages. This is because of an extra character count passed to __send_log() which causes the LF to be appended past the \0. Some syslog daemons thus log an extra empty line. The fix is obvious. Fix the function comments to remind what they expect on their input. This is past 1.5-dev7 regression so there's no backport needed.	2012-02-24 11:48:42 +01:00
William Lallemand	a1cc381151	MEDIUM: log: make http_sess_log use log_format http_sess_log now use the logformat linked list to make the log string, snprintf is not used for speed issue. CLF mode also uses logformat. NOTE: as of now, empty fields in CLF now are "" not "-" anymore.	2012-02-09 17:03:28 +01:00
William Lallemand	d9e9066e71	BUILD: fix declaration inside a scope block	2012-02-06 09:46:16 +01:00
Willy Tarreau	b05405a3a8	BUILD: fix build error on FreeBSD Marcello Gorlani reported that commit `5e205524ad` (BUG: http: re-enable TCP quick-ack upon incomplete HTTP requests) broke build on FreeBSD. Moving the include lower fixes the issue. This must be backported to 1.4 too.	2012-01-23 15:35:52 +01:00
Willy Tarreau	422246eb26	MEDIUM: http: block non-ASCII characters in URIs by default These ones are invalid and blocked unless "option accept-invalid-http-request" is specified in the frontend. In any case, the faulty request is logged. Note that some of the remaining invalid chars are still not checked against, those are the invalid ones between 32 and 127 : 34 ('"'), 60 ('<'), 62 ('>'), 92 ('\'), 94 ('^'), 96 ('`'), 123 ('{'), 124 ('\|'), 125 ('}') Using a lookup table might be better at some point.	2012-01-07 23:55:20 +01:00
Willy Tarreau	2e9506d771	BUG: http: tighten the list of allowed characters in a URI The HTTP request parser was considering that any non-LWS char was par of the URI. Unfortunately, this allows control chars to be sent in the URI, sometimes resulting in backend servers misbehaving, for instance when they interprete \0 as an end of string and respond with plain HTTP/0.9 without headers, that haproxy blocks as invalid responses. RFC3986 clearly states the list of allowed characters in a URI. Even non-ASCII chars are not allowed. Unfortunately, after having run 10 years with these chars allowed, we can't block them right now without an optional workaround. So the first step consists in only blocking control chars. A later patch will allow non-ASCII only when an appropriate option is enabled in the frontend. Control chars are 0..31 and 127, with the exception of 9, 10 and 13 (\t, \n, \r).	2012-01-07 23:22:31 +01:00
Mark Lamourine	c2247f0b8d	MEDIUM: http: add support for sending the server's name in the outgoing request New option "http-send-name-header" specifies the name of a header which will hold the server name in outgoing requests. This is the name of the server the connection is really sent to, which means that upon redispatches, the header's value is updated so that it always matches the server's name.	2012-01-05 15:17:31 +01:00
Willy Tarreau	e428fb7b4e	MEDIUM: patterns: the hdr() pattern is now of type string This pattern previously was limited to type IP. With the new header extraction function, it becomes possible to extract strings, so that the header can be returned as a string. This will not change anything to existing configs, as string will automatically be converted to IP when needed. However, new configs will be able to use IPv6 addresses from headers in stick-tables, as well as stick on any non-IP header (eg: host, user-agent, ...).	2011-12-30 17:33:27 +01:00
Willy Tarreau	294c473756	MEDIUM: http: replace get_ip_from_hdr2() with http_get_hdr() The new function does not return IP addresses but header values instead, so that the caller is free to make what it want of them. The conversion is not quite clean yet, as the previous test which considered that address 0.0.0.0 meant "no address" is still used. A different IP parsing function should be used to take this into account.	2011-12-30 17:33:26 +01:00
Willy Tarreau	664092ccc1	MEDIUM: acl: use temp_pattern to store any string-type information Now strings and data blocks are stored in the temp_pattern's chunk and matched against this one. The rdp_cookie currently makes extensive use of acl_fetch_rdp_cookie() and will be a good candidate for the initial rework so that ACLs use the patterns framework and not the other way around.	2011-12-30 17:33:26 +01:00
Willy Tarreau	f4362b3e3b	MEDIUM: acl: use temp_pattern to store any address-type information IPv4 and IPv6 addresses are now stored into temp_pattern instead of the dirty hack consisting into storing them into the consumer's target address. Some refactoring should now be possible since the methods used to fetch source and destination addresses are similar between patterns and ACLs.	2011-12-30 17:33:26 +01:00
Willy Tarreau	a5e375646c	MEDIUM: acl: use temp_pattern to store any integer-type information All ACL fetches which return integer value now store the result into the temporary pattern struct. All ACL matches which rely on integer also get their value there. Note: the pattern data types are not set right now.	2011-12-30 17:33:26 +01:00
Willy Tarreau	8e5e955c50	MEDIUM: acl: use temp_pattern to store fetched information in the "method" match This match was using both the int and ptr part of the acl_test struct. Let's change this to be able to store it into a chunk with a special encoding.	2011-12-30 17:33:25 +01:00
Willy Tarreau	5e205524ad	BUG: http: re-enable TCP quick-ack upon incomplete HTTP requests By default we disable TCP quick-acking on HTTP requests so that we avoid sending a pure ACK immediately followed by the HTTP response. However, if the client sends an incomplete request in a short packet, its TCP stack might wait for this packet to be ACKed before sending the rest of the request, delaying incoming requests by up to 40-200ms. We can detect this undesirable situation when parsing the request : - if an incomplete request is received - if a full request is received and uses chunked encoding or advertises a content-length larger than the data available in the buffer In these situations, we re-enable TCP quick-ack if we had previously disabled it.	2011-12-17 16:45:29 +01:00
William Lallemand	0f99e34978	MEDIUM: log: Use linked lists for loggers This patch settles the 2 loggers limitation. Loggers are now stored in linked lists. Using "global log", the global loggers list content is added at the end of the current proxy list. Each "log" entries are added at the end of the proxy list. "no log" flush a logger list.	2011-10-31 14:09:19 +01:00
Sagi Bashari	1611e2d4a1	BUG/MINOR: fix options forwardfor if-none when an alternative header name is specified	2011-10-09 08:10:30 +02:00
Willy Tarreau	6471afb43d	MINOR: remove the client/server side distinction in SI addresses Stream interfaces used to distinguish between client and server addresses because they were previously of different types (sockaddr_storage for the client, sockaddr_in for the server). This is not the case anymore, and this distinction is confusing at best and has caused a number of regressions to be introduced in the process of converting everything to full-ipv6. We can now remove this and have a much cleaner code.	2011-09-23 10:54:59 +02:00
Willy Tarreau	0e69854ed4	MINOR: acl: add new matches for header/path/url length This patch introduces hdr_len, path_len and url_len for matching these respective parts lengths against integers. This can be used to detect abuse or empty headers.	2011-09-16 08:32:32 +02:00
Willy Tarreau	275600b6c7	BUG/MEDIUM: don't trim last spaces from headers consisting only of spaces Commit 588bd4 fixed header parsing so that trailing spaces were not part of the returned string. Unfortunately, if a header only had spaces, the last spaces were trimmed past the beginning of the value, causing a negative length to be returned. A quick code review shows that there should be no impact since the only places where the vlen is used are either compared to a specific value or with explicit contents (eg: digits). This must be backported to 1.4.	2011-09-16 08:11:26 +02:00
Willy Tarreau	eabea0763b	[MINOR] stats: report the number of requests intercepted by the frontend These requests are mainly monitor requests, as well as stats requests when the stats are processed by the frontend. Having this counter helps explain the difference in number of sessions that is sometimes observed between a frontend and a backend.	2011-09-10 23:32:41 +02:00
Willy Tarreau	ad14f753ea	[MINOR] http: take a capture of bad content-lengths. Sometimes a bad content-length header is encountered and this causes an abort. It's hard to debug without a trace, so let's take a capture of the contents when this happens.	2011-09-05 00:54:57 +02:00
Willy Tarreau	3b8c08a174	[MINOR] http: take a capture of truncated responses If a server starts to respond but stops before the body, then we capture the truncated response. We don't do this on the request because it would happen too often upon stupid attacks.	2011-09-05 00:54:56 +02:00
Willy Tarreau	fec4d89b24	[MINOR] http: take a capture of too large requests and responses It's hard to prove a request or response is too large if there is no capture, so let's take a snapshot of those too.	2011-09-05 00:54:56 +02:00
Willy Tarreau	588bd4f813	[BUG] http: trailing white spaces must also be trimmed after headers Trailing spaces after headers were not trimmed, only the leading ones were. An issue was detected today with a content-length value which was padded with spaces and which was rejected. Recent updates to the http-bis draft made it a lot more clear that such spaces must be ignored, so this is what this patch does. It should be backported to 1.4.	2011-09-05 00:54:56 +02:00
Willy Tarreau	631f01c2f1	[MINOR] make use of addr_to_str() and get_host_port() to replace many inet_ntop() Many inet_ntop calls were partially right, which was hard to detect given the complex combinations. Some of them were relying on the listener's proto instead of the address itself, which could have been different when dealing with an accept-proxy connection. The new addr_to_str() function does the dirty job and returns the family, which makes it particularly suited to calls from switch/case statements. A large number of if/else statements were removed and the stats output could even be cleaned up in the case of session dump. As a side effect of doing this, the resulting code is smaller by almost 1kB. All changed parts have been tested and provided expected output.	2011-09-05 00:54:36 +02:00
Willy Tarreau	86ad42c5b7	[MINOR] make use of set_host_port() and get_host_port() to get rid of family mismatches This also simplifies the code and makes it more auditable.	2011-09-05 00:54:35 +02:00
Willy Tarreau	87cf51406c	[MEDIUM] http: make x-forwarded-for addition conditional If "option forwardfor" has the "if-none" argument, then the header is only added when the request did not already have one. This option has security implications, and should not be set blindly.	2011-08-19 22:57:24 +02:00
Willy Tarreau	b3eb221e78	[MEDIUM] http: add support for 'cookie' and 'set-cookie' patterns This is used to perform cookie-based stickiness with table replication between multiple masters and across restarts. This partially overrides some of the appsession capabilities.	2011-07-01 16:16:17 +02:00
Simon Horman	af51495397	[MINOR] Add active connection list to server The motivation for this is to allow iteration of all the connections of a server without the expense of iterating over the global list of connections. The first use of this will be to implement an option to close connections associated with a server when is is marked as being down or in maintenance mode.	2011-06-21 22:00:12 +02:00
Simon Horman	70735c98f7	[CLEANUP] Remove assigned but unused variables gcc (Debian 4.6.0-2) 4.6.1 20110329 (prerelease) Copyright (C) 2011 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. ... src/proto_http.c:3029:14: warning: variable ‘del_cl’ set but not used [-Wunused-but-set-variable] In file included from ebtree/eb64tree.c:23:0: ebtree/eb64tree.h: In function ‘__eb64_lookup’: ebtree/eb64tree.h:128:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] ebtree/eb64tree.h: In function ‘__eb64i_lookup’: ebtree/eb64tree.h:180:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] In file included from ebtree/ebpttree.h:26:0, from ebtree/ebimtree.c:23: ebtree/eb64tree.h: In function ‘__eb64_lookup’: ebtree/eb64tree.h:128:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] ebtree/eb64tree.h: In function ‘__eb64i_lookup’: ebtree/eb64tree.h:180:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] In file included from ebtree/ebpttree.h:26:0, from ebtree/ebistree.h:25, from ebtree/ebistree.c:23: ebtree/eb64tree.h: In function ‘__eb64_lookup’: ebtree/eb64tree.h:128:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] ebtree/eb64tree.h: In function ‘__eb64i_lookup’: ebtree/eb64tree.h:180:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable]	2011-06-18 20:21:33 +02:00
Willy Tarreau	bf9c2fcd93	[BUG] stats: support url-encoded forms Bashkim Kasa reported that the stats admin page did not work when colons were used in server or backend names. This was caused by url-encoding resulting in ':' being sent as '%3A'. Now we systematically decode the field names and values to fix this issue.	2011-05-31 22:44:28 +02:00
Willy Tarreau	0729303fb0	[OPTIM] http: optimize chunking again in non-interactive mode Now that we support the http-no-delay mode, we can optimize HTTP chunking again by always waiting for more data to come until the last chunk is met. This patch may or may not be backported to 1.4, it's not a big deal, it will mainly help for chunks which are aligned with the buffer size.	2011-05-30 18:42:41 +02:00
Willy Tarreau	96e312139a	[MEDIUM] http: add support for "http-no-delay" There are some very rare server-to-server applications that abuse the HTTP protocol and expect the payload phase to be highly interactive, with many interleaved data chunks in both directions within a single request. This is absolutely not supported by the HTTP specification and will not work across most proxies or servers. When such applications attempt to do this through haproxy, it works but they will experience high delays due to the network optimizations which favor performance by instructing the system to wait for enough data to be available in order to only send full packets. Typical delays are around 200 ms per round trip. Note that this only happens with abnormal uses. Normal uses such as CONNECT requests nor WebSockets are not affected. When "option http-no-delay" is present in either the frontend or the backend used by a connection, all such optimizations will be disabled in order to make the exchanges as fast as possible. Of course this offers no guarantee on the functionality, as it may break at any other place. But if it works via HAProxy, it will work as fast as possible. This option should never be used by default, and should never be used at all unless such a buggy application is discovered. The impact of using this option is an increase of bandwidth usage and CPU usage, which may significantly lower performance in high latency environments. This change should be backported to 1.4 since the first report of such a misuse was in 1.4. Next patch will also be needed.	2011-05-30 18:42:41 +02:00
Willy Tarreau	5c62092ca1	[MINOR] http: partially revert the chunking optimization for now Commit 57f5c1 used to provide a nice improvement on chunked encoding since it ensured that we did not set a PUSH flag for every chunk or buffer data part of a chunked transfer. Some applications appear to erroneously abuse HTTP chunking in order to get interactive exchanges between a user agent and an origin server with very small chunks. While it happens to work through haproxy, it's terribly slow due to the latency added after passing each chunk to the system, who could wait up to 200ms before pushing them onto the wire. So we need an interactive mode for such usages. In the mean time, step back on the optim, but not completely, so that we still keep the flag as long as we know we're not finished with the current chunk. This change should be backported to 1.4 too as the issue was discovered with it.	2011-05-11 20:17:42 +02:00
Willy Tarreau	ae94d4df8f	[MINOR] http: make the "HTTP 200" status code configurable. This status code is used in response to requests matching "monitor-uri". Some users need to adjust it to fit their needs (eg: make some strings appear there). As it's already defined as a chunked string and used exactly like other status codes, it makes sense to make it configurable with the usual "errorfile", "errorloc", ...	2011-05-11 16:31:43 +02:00
Willy Tarreau	027a85bb03	[MINOR] http: don't report the "haproxy" word on the monitoring response Some people like to make the monitoring URL testable from unsafe locations. Reporting haproxy's existence there can sometimes be problematic. This patch should not be backported to 1.4 because it is possible, eventhough unlikely, that some scripts rely on this word to appear there.	2011-05-11 16:31:43 +02:00
Willy Tarreau	1fc1f45618	[CRITICAL] fix risk of crash when dealing with space in response cookies When doing fix `24581bae02` to correctly handle response cookies, an unfortunate typo was inserted in the less likely code path, resulting in a risk of crash when cookie-based persistence is enabled and the server emits a cookie with several spaces around the equal sign. This bug was noticed during a code backport. Its effects were never reported because this situation is very unlikely to appear, but it can be provoked on purpose by the server. This patch must be backported to 1.4 versions which contain the fix above (anything > 1.4.8), and to similar 1.3 versions > 1.3.25. 1.5-dev versions after 1.5-dev2 are affected too.	2011-04-08 00:50:36 +02:00
Willy Tarreau	d8ee85a0a3	[BUG] http: fix content-length handling on 32-bit platforms Despite much care around handling the content-length as a 64-bit integer, forwarding was broken on 32-bit platforms due to the 32-bit nature of the ->to_forward member of the "buffer" struct. The issue is that this member is declared as a long, so while it works OK on 64-bit platforms, 32-bit truncate the content-length to the lower 32-bits. One solution could consist in turning to_forward to a long long, but it is used a lot in the critical path, so it's not acceptable to perform all buffer size computations on 64-bit there. The fix consists in changing the to_forward member to a strict 32-bit integer and ensure in buffer_forward() that only the amount of bytes that can fit into it is considered. Callers of buffer_forward() are responsible for checking that their data were taken into account. We arbitrarily ensure we never consider more than 2G at once. That's the way it was intended to work on 32-bit platforms except that it did not. This issue was tracked down hard at Exosec with Bertrand Jacquin, Thierry Fournier and Julien Thomas. It remained undetected for a long time because files larger than 4G are almost always transferred in chunked-encoded format, and most platforms dealing with huge contents these days run on 64-bit. The bug affects all 1.5 and 1.4 versions, and must be backported.	2011-03-28 16:25:16 +02:00
Willy Tarreau	26f0f17200	[BUG] http: fix possible incorrect forwarded wrapping chunk size (take 2) Fix `acd20f80` was incomplete, the computed "bytes" value was not used. This fix must be backported to 1.4.	2011-03-27 20:00:03 +02:00
Willy Tarreau	7b7a8e9d83	[BUG] log: retrieve the target from the session, not the SI Since we now have the copy of the target in the session, use it instead of relying on the SI for it. The SI drops the target upon unregister() so applets such as stats were logged as "NOSRV".	2011-03-27 19:53:06 +02:00
Willy Tarreau	0b3a411543	[BUG] session: conn_retries was not always initialized Johannes Smith reported some wrong retries count in logs associated with bad requests. The cause was that the conn_retries field in the stream interface was only initialized when attempting to connect, but is used when logging, possibly with an uninitialized value holding last connection's conn_retries. This could have been avoided by making use of a stream interface initializer. This bug is 1.5-specific.	2011-03-27 19:16:56 +02:00
Willy Tarreau	6da0f6d6dd	[BUG] http: stats were not incremented on http-request deny A counter increase was missing here. This should be backported to 1.4 with care, as the code has changed a bit.	2011-03-13 22:00:24 +01:00
Willy Tarreau	ff011f26e9	[REORG] http: move the http-request rules to proto_http And also rename "req_acl_rule" "http_req_rule". At the beginning that was a bit confusing to me, especially the "req_acl" list which in fact holds what we call rules. After some digging, it appeared that some part of the code is 100% HTTP and not just related to authentication anymore, so let's move that part to HTTP and keep the auth-only code in auth.c.	2011-03-13 22:00:24 +01:00
Willy Tarreau	f68a15a951	[MEDIUM] http: always evaluate http-request rules before stats http-request Right now, http-request rules are not evaluated if the URL matches the stats request. This is quite unexpected. For instance, in the config below, an abuser present in the abusers list will not be prevented access to the stats. listen pub bind :8181 acl abuser src -f abusers.lst http-request deny if abuser stats uri /stats It is not a big deal but it's not documented as such either. For 1.5, let's have both lists be evaluated in turn, until one blocks. For 1.4 we'll simply update the doc to indicate that. Also instead of duplicating the code, the patch factors out the list walking code. The HTTP auth has been moved slightly earlier, because it was set after the header addition code, but we don't need to add headers to a request we're dropping.	2011-03-13 22:00:24 +01:00
Willy Tarreau	7d0aaf39d1	[MEDIUM] stats: split frontend and backend stats It's very annoying that frontend and backend stats are merged because we don't know what we're observing. For instance, if a "listen" instance makes use of a distinct backend, it's impossible to know what the bytes_out means. Some points take care of not updating counters twice if the backend points to the frontend, indicating a "listen" instance. The thing becomes more complex when we try to add support for server side keep-alive, because we have to maintain a pointer to the backend used for last request, and to update its stats. But we can't perform such comparisons anymore because the counters will not match anymore. So in order to get rid of this situation, let's have both frontend AND backend stats in the "struct proxy". We simply update the relevant ones during activity. Some of them are only accounted for in the backend, while others are just for frontend. Maybe we can improve a bit on that later, but the essential part is that those counters now reflect what they really mean.	2011-03-13 22:00:23 +01:00
David du Colombier	6f5ccb1589	[MEDIUM] add internal support for IPv6 server addresses This patch turns internal server addresses to sockaddr_storage to store IPv6 addresses, and makes the connect() function use it. This code already works but some caveats with getaddrinfo/gethostbyname still need to be sorted out while the changes had to be merged at this stage of internal architecture changes. So for now the config parser will not emit an IPv6 address yet so that user experience remains unchanged. This change should have absolutely zero user-visible effect, otherwise it's a bug introduced during the merge, that should be reported ASAP.	2011-03-13 22:00:12 +01:00
Willy Tarreau	827aee913f	[MAJOR] session: remove the ->srv pointer from struct session This one has been removed and is now totally superseded by ->target. To get the server, one must use target_srv(&s->target) instead of s->srv now. The function ensures that non-server targets still return NULL.	2011-03-10 23:32:17 +01:00
Willy Tarreau	9e000c6ec8	[CLEANUP] stream_interface: use inline functions to manipulate targets The connection target involves a type and a union of pointers, let's make the code cleaner using simple wrappers.	2011-03-10 23:32:17 +01:00
Willy Tarreau	3d80d911aa	[MEDIUM] session: remove s->prev_srv which is not needed anymore s->prev_srv is used by assign_server() only, but all code paths leading to it now take s->prev_srv from the existing s->srv. So assign_server() can do that copy into its own stack. If at one point a different srv is needed, we still have a copy of the last server on which we failed a connection attempt in s->target.	2011-03-10 23:32:16 +01:00
Willy Tarreau	664beb8610	[MINOR] session: add a pointer to the new target into the session When dealing with HTTP keep-alive, we'll have to know if we can reuse an existing connection. For that, we'll have to check if the current connection was made on the exact same target (referenced in the stream interface). Thus, we need to first assign the next target to the session, then copy it to the stream interface upon connect(). Later we'll check for equivalence between those two operations.	2011-03-10 23:32:16 +01:00
Willy Tarreau	295a837726	[REORG] session: move the data_ctx struct to the stream interface's applet This is in fact where those parts belong to. The old data_state was replaced by applet.state and is now initialized when the applet is registered. It's worth noting that the applet does not need to know the session nor the buffer anymore since everything is brought by the stream interface. It is possible that having a separate applet struct would simplify the code but that's not a big deal.	2011-03-10 23:32:16 +01:00
Willy Tarreau	75581aebb0	[CLEANUP] session: remove data_source from struct session This one was only used for logging purposes, it's not needed anymore.	2011-03-10 23:32:15 +01:00
Willy Tarreau	71904a4ee8	[MEDIUM] log: take the logged server name from the stream interface With HTTP keep-alive, logging the right server name will be quite complex because the assigned server will possibly change before we log. Also, when we want to log accesses to an applet, it's not easy because the applet becomes NULL again before logging. The logged server's name is now taken from the target stored in the stream interface. That way we can log an applet, a server name, or we could even log a proxy or anything else if we wanted to. Ideally the session should contain a desired target which is the one which should be logged.	2011-03-10 23:32:15 +01:00
Willy Tarreau	957c0a5845	[REORG] session: move client and server address to the stream interface This will be needed very soon for the keep-alive.	2011-03-10 23:32:14 +01:00
Willy Tarreau	bc4af0573c	[REORG] stream_interface: move the st0, st1 and private members to the applet Those fields are only used by the applets, so let's move them to the struct.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b24281b0ff	[MINOR] stream_interface: make use of an applet descriptor for IO handlers I/O handlers are still delicate to manipulate. They have no type, they're just raw functions which have no knowledge of themselves. Let's have them declared as applets once for all. That way we can have multiple applets share the same handler functions and we can store their names there. When we later need to add more parameters (eg: usage stats), we'll be able to do so in the applets themselves. The CLI functions has been prefixed with "cli" instead of "stats" as it's clearly what is going on there. The applet descriptor in the stream interface should get all the applet specific data (st0, ...) but this will be done in the next patch so that we don't pollute this one too much.	2011-03-10 23:32:14 +01:00
Cyril Bonté	1e2a170cf8	[BUG] stats: admin web interface must check the proxy state Similar to the stats socket bug, we must check that the proxy is not disabled before trying to enable/disable a server. Even if a disabled proxy is not displayed, someone can inject a faulty proxy name in the POST parameters. So, we must ensure that no disabled proxy can be used.	2011-03-04 10:01:40 +01:00
Willy Tarreau	61a21a34da	[BUG] http: balance url_param did not work with first parameters on POST Bryan Talbot reported that POST requests with a query string were not correctly processed if the hash parameter was the first one, because the delimiter that was looked for to trigger the parsing was '&' instead of '?'. Also, while checking the code, it became apparent that it was enough for a query string to be present in the request for POST parameters to be ignored, even if the url_param was in the body and not in the URL. The code has then been fixed like this : 1) look for URL param. If found, return it. 2) if no URL param was found and method is POST, then look it up into the body The code now seems to pass all request combinations. This patch must be backported to 1.4 since 1.4 is equally broken right now.	2011-03-01 20:42:20 +01:00
Willy Tarreau	124d99181c	[BUG] http: fix computation of message body length after forwarding has started Till now, the forwarding code was making use of the hdr_content_len member to hold the size of the last chunk parsed. As such, it was reset after being scheduled for forwarding. The issue is that this entry was reset before the data could be viewed by backend.c in order to parse a POST body, so the "balance url_param check_post" did not work anymore. In order to fix this, we need two things : - the chunk size (reset upon every forward) - the total body size (not reset) hdr_content_len was thus replaced by the former (hence the size of the patch) as it makes more sense to have it stored that way than the way around. This patch should be backported to 1.4 with care, considering that it affects the forwarding code.	2011-03-01 20:30:48 +01:00
Willy Tarreau	acd20f80c1	[BUG] http: fix possible incorrect forwarded wrapping chunk size It seems like if a response message is chunked and the chunk size wraps at the end of the buffer and the crlf sequence is incomplete, then we can forward a wrong chunk size due to incorrect handling of the wrapped size. It seems extremely unlikely to occur on real traffic (no reason to have half of the CRLF after a chunk) but nothing prevents it from being possible. This fix must be backported to 1.4.	2011-03-01 20:04:36 +01:00
Willy Tarreau	910ef306bc	[BUG] http: use correct ACL pointer when evaluating authentication req_acl was used instead of req_acl_final. As a matter of luck, both happen to be the same at this point, but this is not granted in the future. This fix should be backported to 1.4.	2011-02-13 12:18:22 +01:00
Cyril Bont�	23b39d9859	[MINOR] stats: add support for several packets in stats admin Some browsers send POST requests in several packets, which was not supported by the "stats admin" function. This patch allows to wait for more data when they are not fully received (we are still limited to a certain size defined by the buffer size minus its reserved space). It also adds support for the "Expect: 100-Continue" header.	2011-02-12 13:10:18 +01:00
Willy Tarreau	5c4784f4b8	[BUG] http: update the header list's tail when removing the last header Stefan Behte reported a strange case where depending on the position of the Connection header in the header list, some headers added after it were or were not usable in "balance hdr()". The reason is that when the last header is removed, the list's tail was not updated, so any header added after that one was not visible from the list. This fix must be backported to 1.4 and possibly 1.3.	2011-02-12 13:07:35 +01:00
Willy Tarreau	0013433b09	[MINOR] http: improve url_param pattern extraction to ignore empty values It's better to avoid sticking on empty parameter values, as this almost always indicates a missing parameter. Otherwise it's easy to enter a situation where all new visitors stick to the same server.	2011-01-04 14:57:34 +01:00
David Cournapeau	16023eef0b	[MINOR] http: add pattern extraction method to stick on query string parameter This is an updated version of my patch for url parameter extraction on stick table. It adds "url_param(name)" as a possible stick method.	2011-01-03 13:26:02 +01:00
Cyril Bonté	9ea2b9ac75	[BUG] http: fix http-pretend-keepalive and httpclose/tunnel mode Since haproxy 1.4.9, combining option httpclose and option http-pretend-keepalive can leave the connections opened until the backend keep-alive timeout is reached, providing bad performances. The same can occur when the proxy is in tunnel mode. This patch ensures that the server side connection is closed after the response and ignore http-pretend-keepalive in tunnel mode.	2010-12-29 15:24:48 +01:00
Willy Tarreau	ed2fd2daea	[BUG] http: fix incorrect error reporting during data transfers We've had several issues related to data transfers. First, if a client aborted an upload before the server started to respond, it would get a 502 followed by a 400. The same was true (in the other way around) if the server suddenly aborted while the client was uploading the data. The flags reported in the logs were misleading. Request errors could be reported while the transfer was stopped during the data phase. The status codes could also be overwritten by a 400 eventhough the start of the response was transferred to the client. The stats were also wrong in case of data aborts. The server or the client could sometimes be miscredited for being the author of the abort depending on where the abort was detected. Some client aborts could also be accounted as request errors and some server aborts as response errors. Now it seems like all such issues are fixed. Since we don't have a specific state for data flowing from the client to the server before the server responds, we're still counting the client aborted transfers as "CH", and they become "CD" when the server starts to respond. Ideally a "P" state would be desired. This patch should be backported to 1.4.	2010-12-29 13:55:32 +01:00
Willy Tarreau	0499e3575c	[BUG] http: analyser optimizations broke pipelining HTTP pipelining currently needs to monitor the response buffer to wait for some free space to be able to send a response. It was not possible for the HTTP analyser to be called based on response buffer activity. Now we introduce a new buffer flag BF_WAKE_ONCE which is set when the HTTP request analyser is set on the response buffer and some activity is detected. This is not clean at all but once of the only ways to fix the issue before we make it possible to register events for analysers. Also it appeared that one realign condition did not cover all cases.	2010-12-17 07:15:57 +01:00
Willy Tarreau	10479e4bac	[MINOR] stats: add global event ID and count This counter will help quickly spot whether there are new errors or not. It is also assigned to each capture so that a script can keep trace of which capture was taken when.	2010-12-12 14:00:34 +01:00
Willy Tarreau	e1582eb7f6	[MINOR] http: capture incorrectly chunked message bodies It is possible to block on incorrectly chunked requests or responses, but this becomes very hard to debug when it happens once in a while. This patch adds the ability to also capture incorrectly chunked requests and responses. The chunk will appear in the error buffer and will be verifiable with the usual "show errors". The incorrect byte will match the error location.	2010-12-12 13:10:11 +01:00
Willy Tarreau	81f2fb97fe	[MINOR] http: support wrapping messages in error captures Error captures did only support contiguous messages. This is annoying for capturing chunking errors, so let's ensure the function is able to copy wrapped messages.	2010-12-12 13:09:08 +01:00
Willy Tarreau	3fe693b4d6	[BUG] http chunking: don't report a parsing error on connection errors When haproxy parses chunk-encoded data that are scheduled to be sent, it is possible that the other end is closed (mainly due to a client abort returning as an error). The message state thus changes to HTTP_MSG_ERROR and the error is reported as a chunk parsing error ("PD--") while it is not. Detect this case before setting the flags and set the appropriate flag in this case.	2010-12-12 12:50:05 +01:00
Willy Tarreau	078272e115	[MINOR] stats: report HTTP message state and buffer flags in error dumps Debugging parsing errors can be greatly improved if we know what the parser state was and what the buffer flags were (especially for closed inputs/outputs and full buffers). Let's add that to the error snapshots.	2010-12-12 12:46:33 +01:00
Willy Tarreau	57f5c12c04	[OPTIM] http: don't send each chunk in a separate packet When forwarding chunk-encoded data, each chunk gets a TCP PUSH flag when going onto the wire simply because the send() function does not know that some data remain after it (next chunk). Now we set the BF_EXPECT_MORE flag on the buffer if the chunk size is not null. That way we can reduce the number of packets sent, which is particularly noticeable when forwarding compressed data, especially as it requires less ACKs from the client.	2010-12-02 00:39:33 +01:00
Willy Tarreau	342b11c4d4	[BUG] http: do not re-enable the PROXY analyser on keep-alive The PROXY analyser is connection-oriented and must only be set once. When an HTTP transaction is done, we must not re-enable it.	2010-11-29 07:32:02 +01:00
Willy Tarreau	26db59ea6b	[BUG] http: correctly update the header list when removing two consecutive headers When a header is removed, the previous header's next pointer is updated to reflect the next of the current header. However, when cycling through the loop, we update the prev pointer to point to the deleted header, which means that if we delete another header, it's the deleted header's next pointer that will be updated, leaving the deleted header in the list with a null length, which is forbidden. We must just not update the prev pointer after a removal. This bug was present when either "reqdel" and "rspdel" removed two consecutive headers. It could also occur when removing cookies in either requests or responses, but since headers were the last header processing, the issue remained unnoticed. Issue reported by Hank A. Paulson. This fix must be ported to 1.4 and possibly 1.3.	2010-11-28 07:06:23 +01:00
Willy Tarreau	b810554f8f	[CRITICAL] cookies: mixing cookies in indirect mode and appsession can crash the process Cookies in indirect mode are removed from the cookie header. Three pointers ought to be updated when appsession cookies are processed next, but were not. The result is that a memcpy() can be called with a negative value causing the process to crash. It is not sure whether this can be remotely exploited or not. (cherry picked from commit c5f3749aa3ccfdebc4992854ea79823d26f66213)	2010-11-28 07:06:22 +01:00
Willy Tarreau	77eb9b8a2d	[BUG] appsession: fix possible double free in case of out of memory In out of memory conditions, the ->destroy function would free all possibly allocated pools from the current appsession, including those that were not yet allocated nor assigned, which used to point to a previous allocation, obviously resulting in a segfault. (cherry picked from commit 75eae485921d3a6ce197915c769673834ecbfa5c)	2010-11-19 13:25:11 +01:00
Willy Tarreau	f70fc75296	[BUG] capture: do not capture a cookie if there is no memory left In case of out of memory, it was possible to write to a null pointer when capturing response cookies due to a missing "else" block. The request handling was fine though. (cherry picked from commit 62e3604d7dd27741c0b4c9e27d9e7c73495dfc32)	2010-11-19 13:25:11 +01:00
Emeric Brun	485479d8e9	[MEDIUM] Create new protected pattern types CONSTSTRING and CONSTDATA to force memcpy if data from protected areas need to be manipulated. Enhance pattern convs and fetch argument parsing, now fetchs and convs callbacks used typed args. Add more details on error messages on parsing pattern expression function. Update existing pattern convs and fetchs to new proto. Create stick table key type "binary". Manage Truncation and padding if pattern's fetch-converted result don't match table key size.	2010-11-11 09:29:07 +01:00
Cyril Bont�	acd7d63ff9	[CLEANUP] Remove unneeded chars allocation Some arrays used to log addresses add some more bytes for ports but this space is never used.	2010-11-11 09:26:28 +01:00
Emeric Brun	5bd86a8ff5	[MINOR] Support listener's sockets unix on http logs. Enhance controls of sockets family on X-Forwarded-For and X-Original-To insert	2010-11-09 15:59:42 +01:00
Willy Tarreau	ba4c5be880	[MINOR] cookie: add support for the "preserve" option This option makes haproxy preserve any persistence cookie emitted by the server, which allows the server to change it or to unset it, for instance, after a logout request. (cherry picked from commit 52e6d75374c7900c1fe691c5633b4ae029cae8d5)	2010-10-30 19:04:36 +02:00
Willy Tarreau	7f18e52b13	[MINOR] acl: add the http_req_first match This match returns true when the request calling it is the first one of a connection. (cherry picked from commit 922ca979c50653c415852531f36fe409190ad76b)	2010-10-30 19:04:35 +02:00
Willy Tarreau	70461308fe	[MEDIUM] checks: set server state to one state from failure when leaving maintenance When we're enabling a server again (unix CLI or stats interface), we must not mark it completely up because it can take a while before a failure is detected. So we mark it one step above failure, which means it's up but will be marked down upon first failure. (cherry picked from commit 83c3e06452457ed5660fc814cbda5bf878bf19a2)	2010-10-30 19:04:34 +02:00
Cyril Bont�	474be415af	[MEDIUM] stats: add an admin level The stats web interface must be read-only by default to prevent security holes. As it is now allowed to enable/disable servers, a new keyword "stats admin" is introduced to activate this admin level, conditioned by ACLs. (cherry picked from commit 5334bab92ca7debe36df69983c19c21b6dc63f78)	2010-10-30 19:04:34 +02:00
Cyril Bont�	70be45dbdf	[MEDIUM] enable/disable servers from the stats web interface Based on a patch provided by Judd Montgomery, it is now possible to enable/disable servers from the stats web interface. This allows to select several servers in a backend and apply the action to them at the same time. Currently, there are 2 known limitations : - The POST data are limited to one packet (don't alter too many servers at a time). - Expect: 100-continue is not supported. (cherry picked from commit 7693948766cb5647ac03b48e782cfee2b1f14491)	2010-10-30 19:04:34 +02:00
Willy Tarreau	ef4f391cc4	[MEDIUM] cookie: set the date in the cookie if needed If a maxidle or maxlife parameter is set on the persistence cookie in insert mode and the client did not provide a recent enough cookie, then we emit a new cookie with a new last_seen date and the same first_seen (if maxlife is set). Recent enough here designates a cookie that would be rounded to the same date. That way, we can refresh a cookie when required without doing it in all responses. If the request did not contain such parameters, they are set anyway. This means that a monitoring request that is forced to a server will get an expiration date anyway, but this should not be a problem given that the client is able to set its cookie in this case. This also permits to force an expiration date on visitors who previously did not have one. If a request comes with a dated cookie while no date check is performed, then a new cookie is emitted with no date, so that we don't risk dropping the user too fast due to a very old date when we re-enable the date check. All requests that were targetting the correct server and which had their expiration date added/updated/removed in the response cookie are logged with the 'U' ("updated") flag instead of the 'I' ("inserted"). So very often we'll see "VU" instead of "VN". (cherry picked from commit 8b3c6ecab6d37be5f3655bc3a2d2c0f9f37325eb)	2010-10-30 19:04:33 +02:00
Willy Tarreau	f64d1410fc	[MEDIUM] cookie: check for maxidle and maxlife for incoming dated cookies If a cookie comes in with a first or last date, and they are configured on the backend, they're checked. If a date is expired or too far in the future, then the cookie is ignored and the specific reason appears in the cookie field of the logs. (cherry picked from commit faa3019107eabe6b3ab76ffec9754f2f31aa24c6)	2010-10-30 19:04:33 +02:00
Willy Tarreau	f1348310e8	[MEDIUM] cookie: reassign set-cookie status flags to store more states The set-cookie status flags were not very handy and limited. Reorder them to save some room for additional values and add the "U" flags (for Updated expiration date) that will be used with expirable cookies in insert mode. (cherry picked from commit 5bab52f821bb0fa99fc48ad1b400769e66196ece)	2010-10-30 19:04:33 +02:00
Willy Tarreau	b761ec4c94	[MINOR] cookie: add the expired (E) and old (O) flags for request cookies These flags will indicate the cookie status when an expiration date is set. (cherry picked from commit 3f0f0e4583a432d34b75bc7b9dd2c756b4e181a7)	2010-10-30 19:04:33 +02:00
Willy Tarreau	bca9969daf	[MEDIUM] cookie: support client cookies with some contents appended to their value In all cookie persistence modes but prefix, we now support cookies whose value is suffixed with some contents after a vertical bar ('\|'). This will be used to pass an optional expiration date. So as of now we only consider the part of the cookie value which is used before the vertical bar. (cherry picked from commit a4486bf4e5b03b5a980d03fef799f6407b2c992d)	2010-10-30 19:04:32 +02:00
Willy Tarreau	22a9534213	[MEDIUM] make it possible to combine http-pretend-keepalived with httpclose Some configs may involve httpclose in a frontend and http-pretend-keepalive in a backend. httpclose used to take priority over keepalive, thus voiding its effect. This change ensures that when both are combined, keepalive is still announced to the server while close is announced to the client. (cherry picked from commit 2be7ec90fa9caf66294f446423bbab2d00db9004)	2010-10-30 19:04:31 +02:00
Willy Tarreau	e3f284aa7b	[BUILD] proto_http: eliminate some build warnings with gcc-2.95 gcc-2.95 does not like labels before the first case in a switch statement. (cherry picked from commit e1c51a861ba0c389d31dfb010e8b188f5f43313a)	2010-10-30 19:04:31 +02:00
Willy Tarreau	58bd8fd46d	[BUG] stream_sock: try to flush any extra pending request data after a POST Some broken browsers still happen to send a CRLF after a POST. Those which send a CRLF in a second packet have it queued into the system's buffers, which causes an RST to be emitted by some systems upon close of the response (eg: Linux). The client may then receive the RST without the last response segments, resulting in a truncated response. This change leaves request polling enabled on a POST so that we can flush any late data from the request buffers. A more complete workaround would consist in reading from the request for a long time, until we get confirmation that the close has been ACKed. This is much more complex and should only be studied for newer versions. (cherry picked from commit 12e316af4f0245fde12dbc224ebe33c8fea806b2)	2010-10-30 19:04:30 +02:00
Willy Tarreau	24581bae02	[MEDIUM] http: fix space handling in the response cookie parser This patch addresses exactly the same issues as the previous one, but for responses this time. It also introduces implicit support for the Set-Cookie2 header, for which there's almost nothing specific to do since it is a clean header. This one allows multiple cookies in a same header, by respecting the HTTP messaging semantics. The new parser has been tested with insertion, rewrite, passive, removal, prefixing and captures, and it looks OK. It's still able to rewrite (or delete) multiple cookies at once. Just as with the request parser, it tries hard to fix formating of the cookies it displaces. This patch too should be backported to 1.4 and possibly to 1.3.	2010-09-01 00:02:44 +02:00
Willy Tarreau	eb7b0a2b56	[MEDIUM] http: fix space handling in the request cookie parser The request cookie parser did not allow spaces to appear in cookie values nor around the equal sign. The various RFCs on the subject say different things, some suggesting that a space is allowed after the equal sign and being worded in a way that lets one believe it is allowed before too. Some spaces may appear inside values and be part of the values. The quotes allow delimiters to be embedded in values. The spaces before and after attributes should be trimmed. The new parser addresses all those points and has been carefully tested. It fixes misplaced spaces around equal signs before processing the cookies or forwarding them. It also tries its best to perform clean removals by always keeping the delimiter after the value being removed and leaving one space after it. The variable inside the parser have been renamed to make the code a lot more understandable, and one multi-function pointer has been eliminated. Since this patch fixes real possible issues, it should be backported to 1.4 and possibly 1.3, since one (single) case of wrong spaces has been reported in 1.3. The code handling the Set-Cookie has not been touched yet.	2010-09-01 00:02:21 +02:00
Willy Tarreau	0f7f51fbe0	[BUG] http: don't consider commas as a header delimitor within quotes The header parser has a bug which causes commas to be matched within quotes while it was not expected. The way the code was written could make one think it was OK. The resulting effect is that the following config would use the second IP address instead of the third when facing this request : source 0.0.0.0 usesrc hdr_ip(X-Forwarded-For,2) GET / HTTP/1.0 X-Forwarded-for: "127.0.0.1, 127.0.0.2", 127.0.0.3 This fix must be backported to 1.4 and 1.3.	2010-08-30 11:06:34 +02:00
Willy Tarreau	92aa1fac0a	[BUG] http: don't set auto_close if more data are expected Fix `4fe4190278` was a bit too strong. It has caused some chunked-encoded responses to be truncated when a recv() call could return multiple chunks followed by a close. The reason is that when a chunk is parsed, only its contents are scheduled to be forwarded. Thus, the reader sees auto_close+shutr and sets shutw_now. The sender in turn sends the last scheduled data and does shutw(). Another nasty effect is that it has reduced the keep-alive rate. If a response did not completely fit into the buffer, then the auto_close bit was left on and the sender would close upon completion. The fix consists in not making use of auto_close when chunked encoding is used nor when keep-alive is used, which makes sense. However it is maintained on error processing. Thanks to Cyril Bont� for reporting the issue early.	2010-08-28 19:06:28 +02:00
Willy Tarreau	5c54c71463	[MEDIUM] http: forward client's close when abortonclose is set While it's usually desired to wait for a server response even when the client closes its request channel, it can be problematic with long polling requests. In order to let the server decide what to do in such a case, if option abortonclose is set, we simply forward the shutdown to the server. That way, it can decide to take the appropriate action. Most servers will still process the request, while some will probably want to abort. Obviously, this only works as long as the client has not sent another pipelined request over the same connection. (was commit 0e25d86da49827ff6aa3c94132c01292b5ba4854 in 1.4)	2010-08-17 21:37:51 +02:00
Willy Tarreau	f059a0f63a	[MAJOR] session-counters: split FE and BE track counters Having a single tracking pointer for both frontend and backend counters does not work. Instead let's have one for each. The keyword has changed to "track-be-counters" and "track-fe-counters", and the ACL "trk_" changed to "trkfe_" and "trkbe_*".	2010-08-10 18:04:15 +02:00
Willy Tarreau	da7ff64aa9	[MEDIUM] session-counters: add HTTP req/err tracking This patch adds support for the following session counters : - http_req_cnt : HTTP request count - http_req_rate: HTTP request rate - http_err_cnt : HTTP request error count - http_err_rate: HTTP request error rate The equivalent ACLs have been added to check the tracked counters for the current session or the counters of the current source.	2010-08-10 18:04:14 +02:00
Willy Tarreau	6df7a0e7d3	[MINOR] http: reset analysers to listener's, not frontend's When resetting a session's request analysers, we must take them from the listener, not from the frontend. At the moment there is no difference but this might change.	2010-08-10 14:04:42 +02:00
Willy Tarreau	bb695393da	[BUG] http: denied requests must not be counted as denied resps in listeners Socket stats had a wrong counter. This harmless bugfix must be backported to 1.4.	2010-08-10 14:02:54 +02:00
Willy Tarreau	ee55dc024b	[MINOR] frontend: rely on the frontend and not the backend for INDEPSTR Till now, the frontend relied on the backend's options for INDEPSTR, while at the time of accept, the frontend and backend are the same. So we now use the frontend's pointer instead of the backend and we don't have any dependency on the backend anymore in the frontend's accept code.	2010-06-14 10:53:17 +02:00
Willy Tarreau	070ceb6cfb	[MEDIUM] session: don't assign conn_retries upon accept() anymore The conn_retries attribute is now assigned when switching from SI_ST_INI to SI_ST_REQ. This eliminates one of the last dependencies on the backend in the frontend's accept() function.	2010-06-14 10:53:16 +02:00
Willy Tarreau	ee28de0a12	[MEDIUM] session: move the conn_retries attribute to the stream interface The conn_retries still lies in the session and its initialization depends on the backend when it may not yet be known. Let's first move it to the stream interface.	2010-06-14 10:53:16 +02:00
Willy Tarreau	d04e858db0	[MEDIUM] session: initialize server-side timeouts after connect() It was particularly embarrassing that the server timeout was assigned to buffers during an accept() just to be potentially changed later in case of a use_backend rule. The frontend side has nothing to do with server timeouts. Now we initialize them right after the connect() succeeds. Later this should change for a unique stream-interface timeout setting only.	2010-06-14 10:53:14 +02:00
Willy Tarreau	ace495e468	[CLEANUP] buffer->cto is not used anymore The connection timeout stored in the buffer has not been used since the stream interface were introduced. Let's get rid of it as it's one of the things that complicate factoring of the accept() functions.	2010-06-14 10:53:14 +02:00
Willy Tarreau	03fa5df64a	[CLEANUP] rename client -> frontend The 'client.c' file now only contained frontend-specific functions, so it has naturally be renamed 'frontend.c'. Same for client.h. This has also been an opportunity to remove some cross references from files that should not have depended on it. In the end, this file should contain a protocol-agnostic accept() code, which would initialize a session, task, etc... based on an accept() from a lower layer. Right now there are still references to TCP.	2010-06-14 10:53:10 +02:00
Willy Tarreau	663308bea1	[BUG] debug: correctly report truncated messages By using msg->sol as the beginning of a message, wrong messages were displayed in debug mode when they were truncated on the last line, because msg->sol points to the beginning of the last line. Use data+msg->som instead.	2010-06-07 22:43:55 +02:00
Willy Tarreau	1ba0e5f451	[BUG] debug: wrong pointer was used to report a status line This would only be wrong when the server has not completely responded yet. Fix two other occurrences of wrong rsp<->sl associations which were harmless but wrong anyway.	2010-06-07 22:43:55 +02:00
Willy Tarreau	79ebac602d	[BUG] http: report correct flags in case of client aborts during body Some client abort/timeouts during body transfer were reported as "PR--" instead of "CD--" or "cD--". This fix has to be ported to 1.5.	2010-06-07 22:43:54 +02:00
Willy Tarreau	4fe4190278	[BUG] http: automatically close response if req is aborted Latest BF_READ_ATTACHED fix has unveiled a nice issue with the way HTTP requests and responses are forwarded. The case where the request aborts after the response has responded (POST with early response) forgot to re-enable auto-close on the response. In fact it still worked thanks to a side effect as long as BF_READ_ATTACHED was there to force the states to be resynced (and the flags). Since last fix, the missing auto-close causes CLOSE_WAIT connections when the client aborts too late during a data transfer. The right fix consists in considering the situation where the client experiences an error and to explicitly abort the transfer. There is no need to wake the response analysers up for that since they'd have no added value and the analysers flags are cleared. However for a future usage, that might help (eg: stickiness, ...). This fix should be backported to 1.4 if the previous one is backported too. After all the non-reg tests, the risks to see a problem arise without both patches seems low, and both patches touch sensible areas of the code. So there's no hurry.	2010-06-07 22:42:44 +02:00
Willy Tarreau	d45b3d5aff	[BUG] http: dispatch and http_proxy modes were broken for a long time Both dispatch and http_proxy modes were broken since 1.4-dev5 when the adjustment of server health based on response codes was introduced. In fact, in these modes, s->srv == NULL. The result is a plain segfault. It should have been noted critical, but the fact that it remained 6 months without being noticed indicates that almost nobody uses these modes anymore. Also, the crash is immediate upon first request. Further versions should not be affected anymore since it's planned to have a dummy server instead of these annoying NULL pointers.	2010-05-23 08:56:02 +02:00
Willy Tarreau	4a568976c5	[MINOR] stick-tables: add support for "stick on hdr" It is now possible to stick on an IP address found in a HTTP header. Right now only the last occurrence of the header can be used, which is generally enough for most uses. Also, the header extraction rule only knows how to convert the header to IP. Later it will be usable as a plain string with an implicit conversion, and the syntax will not change.	2010-05-13 22:10:02 +02:00
Willy Tarreau	b337b532de	[MEDIUM] acl: add tree-based lookups of networks Networks patterns loaded from files for longest match ACL testing will now be arranged into a prefix tree. This is possible thanks to the new prefix features in ebtree v6.0. Longest match testing is slightly slower than exact data maching. However, the measured impact of running at 42000 requests per second and testing whether the IP address found in a header belongs to a list of 52000 networks or not is 3% CPU (increase from 66% to 69%). This is low enough to permit true geolocation based on huge tables.	2010-05-13 21:37:50 +02:00
Willy Tarreau	c4262961f8	[MEDIUM] acl: add tree-based lookups of exact strings Now if some ACL patterns are loaded from a file and the operation is an exact string match, the data will be arranged in a tree, yielding a significant performance boost on large data sets. Note that this only works when case is sensitive. A new dedicated function, acl_lookup_str(), has been created for this matching. It is called for every possible input data to test and it looks the tree up for the data. Since the keywords are loosely typed, we would have had to add a new columns to all keywords to adjust the function depending on the type. Instead, we just compare on the match function. We call acl_lookup_str() when we could use acl_match_str(). The tree lookup is performed first, then the remaining patterns are attempted if the tree returned nothing. A quick test shows that when matching a header against a list of 52000 network names, haproxy uses 68% of one core on a core2-duo 3.2 GHz at 42000 requests per second, versus 66% without any rule, which means only a 2% CPU increase for 52000 rules. Doing the same test without the tree leads to 100% CPU at 6900 requests/s. Also it was possible to run the same test at full speed with about 50 sets of 52000 rules without any measurable performance drop.	2010-05-13 21:37:45 +02:00
Willy Tarreau	c3bfeebdb4	[MINOR] fix possible crash in debug mode with invalid responses When trying to display an invalid request or response we received, we must at least check that we have identified something looking like a start of message, otherwise we can dereference a NULL pointer.	2010-04-29 07:09:25 +02:00
Cyril Bont�	47fdd8e993	[MINOR] add the "ignore-persist" option to conditionally ignore persistence This is used to disable persistence depending on some conditions (for example using an ACL matching static files or a specific User-Agent). You can see it as a complement to "force-persist". In the configuration file, the force-persist/ignore-persist declaration order define the rules priority. Used with the "appsesion" keyword, it can also help reducing memory usage, as the session won't be hashed the persistence is ignored.	2010-04-25 22:37:14 +02:00
Cyril Bont�	17530c34e4	[BUG] appsession should match the whole cookie name I met a strange behaviour with appsession. I firstly thought this was a regression due to one of my previous patch but after testing with a 1.3.15.12 version, I also could reproduce it. To illustrate, the configuration contains : appsession PHPSESSID len 32 timeout 1h Then I call a short PHP script containing : setcookie("P", "should not match") When calling this script thru haproxy, the cookie "P" matches the appsession rule : Dumping hashtable 0x11f05c8 table[1572]: should+not+match Shouldn't it be ignored ? If you confirm, I'll send a patch for 1.3 and 1.4 branches to check that the cookie length is equal to the appsession name length. This is due to the comparison length, where the cookie length is took into account instead of the appsession name length. Using the appsession name length would allow ASPSESSIONIDXXX (+ check that memcmp won't go after the buffer size). Also, while testing, I noticed that HEAD requests where not available for URIs containing the appsession parameter. 1.4.3 patch fixes an horrible segfault I missed in a previous patch when appsession is not in the configuration and HAProxy is compiled with DEBUG_HASH.	2010-04-07 21:56:10 +02:00
Willy Tarreau	8a8e1d99cb	[MINOR] http: make it possible to pretend keep-alive when doing close Some servers do not completely conform with RFC2616 requirements for keep-alive when they receive a request with "Connection: close". More specifically, they don't bother using chunked encoding, so the client never knows whether the response is complete or not. One immediately visible effect is that haproxy cannot maintain client connections alive. The second issue is that truncated responses may be cached on clients in case of network error or timeout. �scar Fr�as Barranco reported this issue on Tomcat 6.0.20, and Patrik Nilsson with Jetty 6.1.21. Cyril Bont� proposed this smart idea of pretending we run keep-alive with the server and closing it at the last moment as is already done with option forceclose. The advantage is that we only change one emitted header but not the overall behaviour. Since some servers such as nginx are able to close the connection very quickly and save network packets when they're aware of the close negociation in advance, we don't enable this behaviour by default. "option http-pretend-keepalive" will have to be used for that, in conjunction with "option http-server-close".	2010-04-05 16:26:34 +02:00
Willy Tarreau	bce7088275	[MEDIUM] add ability to connect to a server from an IP found in a header Using get_ip_from_hdr2() we can look for occurrence #X or #-X and extract the IP it contains. This is typically designed for use with the X-Forwarded-For header. Using "usesrc hdr_ip(name,occ)", it becomes possible to use the IP address found in <name>, and possibly specify occurrence number <occ>, as the source to connect to a server. This is possible both in a server and in a backend's source statement. This is typically used to use the source IP previously set by a upstream proxy.	2010-03-30 10:39:43 +02:00
Willy Tarreau	bf3f1de5b5	[BUG] http: fix truncated responses on chunk encoding when size divides buffer size Bernhard Krieger reported truncated HTTP responses in presence of some specific chunk-encoded data, and kindly offered complete traces of the issue which made it easy to reproduce it. Those traces showed that the chunks were of exactly 8192 bytes, chunk size and CRLF included, which was exactly half the size of the buffer. In this situation, the function http_chunk_skip_crlf() could erroneously try to parse a CRLF after the chunk believing there were more data pending, because the number of bytes present in the buffer was considered instead of the number of remaining bytes to be parsed.	2010-03-17 15:54:24 +01:00
Willy Tarreau	3965040898	[MINOR] http: don't mark a server as failed when it returns 501/505 Those two codes can be triggered on demand by client requests. We must not fail a server on them. Ideally we should ignore a certain amount of status codes which do not indicate life nor death.	2010-03-15 19:44:39 +01:00
Cyril Bont�	7f2c53938c	[BUG] clf logs segfault when capturing a non existant header Hi Willy, Please find a small patch to prevent haproxy segfaulting when logging captured headers in CLF format. Example config to reproduce the bug : listen test :10080 log 127.0.0.1 local7 debug err mode http option httplog clf capture request header NonExistantHeader len 16 -- Cyril Bont�	2010-03-14 20:02:10 +01:00
Willy Tarreau	6464841769	[BUG] http: don't wait for response data to leave buffer is client has left In case of pipelined requests, if the client aborts before reading response N-1, haproxy waits forever for the data to leave the buffer before parsing the next response.	2010-03-05 10:57:48 +01:00
Willy Tarreau	3e1b6d1ed0	[STATS] frontend requests were not accounted for failed requests But failed requests were accounted for, resulting in more failures than requests.	2010-03-04 23:02:38 +01:00
Willy Tarreau	ae52678444	[STATS] count transfer aborts caused by client and by server Often we need to understand why some transfers were aborted or what constitutes server response errors. With those two counters, it is now possible to detect an unexpected transfer abort during a data phase (eg: too short HTTP response), and to know what part of the server response errors may in fact be assigned to aborted transfers.	2010-03-04 20:34:23 +01:00
Willy Tarreau	40dba09343	[BUG] logs: don't report "proxy request" when server closes early A copy-paste typo and a missing check were causing the logs to report "PR" instead of "SD" when a server closes before sending full data. Also, the log would erroneously report 502 while in fact the correct response will already have been transmitted.	2010-03-04 18:45:47 +01:00
Willy Tarreau	8096de9a99	[MEDIUM] http: revert to use a swap buffer for realignment The bounce realign function was algorithmically good but as expected it was not cache-friendly. Using it with large requests caused so many cache thrashing that the function itself could drain 70% of the total CPU time for only 0.5% of the calls ! Revert back to a standard memcpy() using a specially allocated swap buffer. We're now back to 2M req/s on pipelined requests.	2010-02-26 11:12:27 +01:00
Willy Tarreau	2465779459	[STATS] separate frontend and backend HTTP stats It is wrong to merge FE and BE stats for a proxy because when we consult a BE's stats, it reflects the FE's stats eventhough the BE has received no traffic. The most common example happens with listen instances, where the backend gets credited for all the trafic even when a use_backend rule makes use of another backend.	2010-02-26 10:30:28 +01:00
Willy Tarreau	d9b587f260	[STATS] report HTTP requests (total and rate) in frontends Now that we support keep-alive, it's important to report a separate counter for requests. Right now it just appears in the CSV output.	2010-02-26 10:05:55 +01:00
Willy Tarreau	b97f199d4b	[MEDIUM] http: don't use trash to realign large buffers The trash buffer may now be smaller than a buffer because we can tune it at run time. This causes a risk when we're trying to use it as a temporary buffer to realign unaligned requests, because we may have to put up to a full buffer into it. Instead of doing a double copy, we're now relying on an open-coded bouncing copy algorithm. The principle is that we move one byte at a time to its final place, and if that place also holds a byte, then we move it too, and so on. We finish when we've moved all the buffer. It limits the number of memory accesses, but since it proceeds one byte at a time and with random walk, it's not cache friendly and should be slower than a double copy. However, it's only used in extreme situations and the difference will not be noticeable. It has been extensively tested and works reliably.	2010-02-25 23:54:31 +01:00
Willy Tarreau	0b89fbb076	[BUG] fix error response in case of server error The fix below was incomplete : commit `d5fd51c75b` [BUG] http_server_error() must not purge a previous pending response This can cause parts of responses to be truncated in case of pipelined requests if the second request generates an error before the first request is completely flushed. Pending response data being rejected was still sent, causing inappropriate error responses in case of error while parsing a response header. We must purge pending data from the response buffer that were not scheduled to be sent (l - send_max).	2010-02-02 10:04:19 +01:00
Willy Tarreau	dc008c57a4	[MEDIUM] http: stricter processing of the CONNECT method Now we establish the tunnel only once the status 200 reponse is received. That way we can still support an authentication request in response to a CONNECT, then a client's authentication response.	2010-02-01 16:20:08 +01:00
Willy Tarreau	5843d1a894	[MEDIUM] http: switch to tunnel mode after status 101 responses A 101 response is accompanied with an Upgrade header indicating a new protocol that is spoken on the connection after the exchange completes. At least we should switch to tunnel mode after such a response.	2010-02-01 15:13:32 +01:00
Krzysztof Olędzki	711ad9eb27	[MINOR] http-auth: last fix was wrong I'm not sure if the fix is correct: - if (req_acl->cond) - ret = acl_exec_cond(req_acl->cond, px, s, txn, ACL_DIR_REQ); + if (!req_acl->cond) + continue; Doesn't it ignore rules with no condition attached? I think that the proper solution would be the following.	2010-02-01 12:54:32 +01:00
Willy Tarreau	5142594dea	[MINOR] http-auth: make the 'unless' keyword work as expected One check was missing for the 'polarity' of the test. Now 'unless' works. BTW, 'unless' provides a nice way to perform one-line auth : acl valid-user http_auth(user-list) http-request auth unless valid-user	2010-02-01 10:40:19 +01:00
Willy Tarreau	844a7e76d2	[MEDIUM] http: add support for proxy authentication We're already able to know if a request is a proxy request or a normal one, and we have an option "http-use-proxy-header" which states that proxy headers must be checked. So let's switch to use the proxy authentication headers and responses when this option is set and we're facing a proxy request. That allows haproxy to enforce auth in front of a proxy.	2010-01-31 21:46:18 +01:00
Krzysztof Piotr Oledzki	8c8bd4593c	[MAJOR] use the new auth framework for http stats Support the new syntax (http-request allow/deny/auth) in http stats. Now it is possible to use the same syntax is the same like in the frontend/backend http-request access control: acl src_nagios src 192.168.66.66 acl stats_auth_ok http_auth(L1) stats http-request allow if src_nagios stats http-request allow if stats_auth_ok stats http-request auth realm LB The old syntax is still supported, but now it is emulated via private acls and an aditional userlist.	2010-01-31 19:14:09 +01:00
Krzysztof Piotr Oledzki	f9423ae43a	[MINOR] acl: add http_auth and http_auth_group Add two acls to match http auth data: acl <name> http_auth(userlist) acl <name> http_auth_hroup(userlist) group1 group2 (...)	2010-01-31 19:14:09 +01:00
Krzysztof Piotr Oledzki	59bb218b86	[MINOR] http-request: allow/deny/auth support for frontend/backend/listen Use the generic auth framework to control access to frontends/backends/listens	2010-01-31 19:14:08 +01:00
Willy Tarreau	fdb563c06f	[MEDIUM] http: add support for conditional response header rewriting Just as for the req* rules, we can now condition rsp* rules with ACLs. ACLs match on response, so volatile request information cannot be used. A warning is emitted if a configuration contains such an anomaly.	2010-01-31 15:43:27 +01:00
Willy Tarreau	8abd4cd526	[MEDIUM] http: add support for conditional request header addition Now the reqadd rules also support ACLs. All req* rules are converted now.	2010-01-31 15:12:45 +01:00
Willy Tarreau	6c123b15cb	[MEDIUM] http: make the request filter loop check for optional conditions From now on, if request filters have ACLs defined, these ACLs will be evaluated to condition the filter. This will be used to conditionally remove/rewrite headers based on ACLs.	2010-01-28 20:22:06 +01:00
Willy Tarreau	f4f04125d4	[MINOR] prepare req_/rsp_ to receive a condition It will be very handy to be able to pass conditions to req_* and rsp_*. For now, we just add the pointer to the condition in the affected structs.	2010-01-28 18:10:50 +01:00
Willy Tarreau	c3e8b25c79	[MINOR] http: disable keep-alive when process is going down Krzysztof Oledzki suggested to disable keep-alive when a process is going down due to a reload, in order to avoid ever-lasting sessions. This is a simple and very efficient solution as it ensures that at most one more request will be handled on a keep-alive connection after the process has received a SIGUSR1 signal.	2010-01-28 15:01:20 +01:00
Willy Tarreau	739cfbab6a	[BUG] http: trim any excess buffer data when recycling a connection We must trim any excess data from the response buffer when recycling a keep-alive connection, because we may have blocked an invalid response from a server that we don't want to accidentely forward once we disable the analysers, nor do we want those data to come along with next response. A typical example of such data would be from a buggy server responding to a HEAD with some data, or sending more than the advertised content-length.	2010-01-25 23:11:14 +01:00
Willy Tarreau	d08f82ebe2	[MINOR] http: remove a copy-paste typo in transaction cleaning For deciding to set the BF_EXPECT_MORE, we reused the same code as in http_wait_for_request(), but here we must ignore buf->lr which is not yet set and useless. This might only have caused random sub-optimal behaviours.	2010-01-25 22:46:30 +01:00
Willy Tarreau	88d349d25d	[MEDIUM] http: add support for Proxy-Connection header Despite what is explicitly stated in HTTP specifications, browsers still use the undocumented Proxy-Connection header instead of the Connection header when they connect through a proxy. As such, proxies generally implement support for this stupid header name, breaking the standards and making it harder to support keep-alive between clients and proxies. Thus, we add a new "option http-use-proxy-header" to tell haproxy that if it sees requests which look like proxy requests, it should use the Proxy-Connection header instead of the Connection header.	2010-01-25 12:48:26 +01:00
Willy Tarreau	2a6d88dafe	[MINOR] http: logs must report persistent connections to down servers When using "option persist" or "force-persist", we want to know from the logs if the cookie referenced a valid server or a down server. Till here the flag reported a valid server even if the server was down, which is misleading. Now we correctly report that the requested server was down. We can typically see "--DI" when using "option persist" with redispatch, ad "SCDN" when using force-persist on a down server.	2010-01-24 13:10:43 +01:00
Willy Tarreau	4de9149f87	[MINOR] add the "force-persist" statement to force persistence on down servers This is used to force access to down servers for some requests. This is useful when validating that a change on a server correctly works before enabling the server again.	2010-01-22 19:10:05 +01:00
Willy Tarreau	ff7b5883c0	[OPTIM] http: don't delay response if next request is incomplete We use to delay the response if there is a new request in the buffer. However, if the pending request is incomplete, we should not delay the pending responses.	2010-01-22 14:43:47 +01:00
Willy Tarreau	d5fd51c75b	[BUG] http_server_error() must not purge a previous pending response This can cause parts of responses to be truncated in case of pipelined requests if the second request generates an error before the first request is completely flushed.	2010-01-22 14:20:17 +01:00
Willy Tarreau	6046652253	[MAJOR] http: rework response Connection header handling This one is the next step of previous patch. It correctly computes the response mode and the Connection flag transformations depending on the request mode and version, and the response version and headers. We're now also able to add "Connection: keep-alive", and to convert server's close during a keep-alive connection to a server-close connection.	2010-01-22 11:49:41 +01:00
Willy Tarreau	bbf0b37f6c	[MAJOR] http: rework request Connection header handling We need to improve Connection header handling in the request for it to support the upcoming keep-alive mode. Now we have two flags which keep in the session the information about the presence of a Connection: close and a Connection: keep-alive headers in the initial request, as well as two others which keep the current state of those headers so that we don't have to parse them again. Knowing the initial value is essential to know when the client asked for keep-alive while we're forcing a close (eg in server-close mode). Also the Connection request parser is now able to automatically remove single header values at the same time they are parsed. This provides greater flexibility and reliability. All combinations of listen/front/back in all modes and with both 1.0 and 1.1 have been tested.	2010-01-22 11:49:35 +01:00
Willy Tarreau	68085d8cfb	[MINOR] http: add http_remove_header2() to remove a header value. Calling this function after http_find_header2() automatically deletes the current value of the header, and removes the header itself if the value is the only one. The context is automatically adjusted for a next call to http_find_header2() to return the next header. No other change nor test should be made on the transient context though.	2010-01-18 19:51:33 +01:00
Willy Tarreau	cce7fa4c81	[MEDIUM] http: don't switch to tunnel mode upon close The close mode of a transaction would be switched to tunnel mode at the end of the processing, letting a lot of pending data pass in the other direction if any. Let's fix that by checking for the close mode during state resync too.	2010-01-17 11:38:34 +01:00
Willy Tarreau	d3c343f8aa	[BUG] http: don't count req errors on client resets or t/o during keep-alive We must set the error flags when detecting that a client has reset a connection or timed out while waiting for a new request on a keep-alive connection, otherwise process_session() sets it itself and counts one request error. That explains why some sites were showing an increase in request errors with the keep-alive.	2010-01-16 10:26:19 +01:00
Emeric Brun	b982a3d23a	[MEDIUM] Add stick table configuration and init.	2010-01-12 16:01:24 +01:00
Willy Tarreau	b16a5746b7	[MINOR] http: add a separate "http-keep-alive" timeout This one is used to wait for next request after a response was sent to the client.	2010-01-10 14:46:16 +01:00
Willy Tarreau	fcffa6911c	[MINOR] http: differentiate waiting for new request and waiting for a complete requst While waiting in a keep-alive state for a request, we want to silently close if we don't get anything. However if we get a partial request it's different because that means the client has started to send something. This requires a new transaction flag. It will be used to implement a distinct timeout for keep-alive and requests.	2010-01-10 14:24:53 +01:00
Willy Tarreau	a3377eeeff	[MINOR] http: move appsession 'sessid' from session to http_txn This change, suggested by Cyril Bont�, makes a lot of sense and would have made it obvious that sessid was not properly initialized while switching to keep-alive. The code is now cleaner.	2010-01-10 10:49:11 +01:00
Willy Tarreau	75661457f7	[MINOR] http redirect: don't explicitly state keep-alive on 1.1 Do not set the "connection: keep-alive" header when the request is in HTTP 1.1, it's implicit.	2010-01-10 10:35:01 +01:00
Willy Tarreau	148d099406	[BUG] stream_interface: fix retnclose and remove cond_close The stream_int_cond_close() function was added to preserve the contents of the response buffer because stream_int_retnclose() was buggy. It flushed the response instead of flushing the request. This caused issues with pipelined redirects followed by error messages which ate the previous response. This might even have caused object truncation on pipelined requests followed by an error or by a server redirection. Now that this is fixed, simply get rid of the now useless function.	2010-01-10 10:21:21 +01:00
Cyril Bont�	41689c22da	[BUG] appsession: possible memory leak in case of out of memory condition I've tried to follow all the pool_alloc2/pool_free2 calls in the code to track memory leaks. I've found one which only happens when there's already no more memory when allocating a new appsession cookie.	2010-01-10 00:50:14 +01:00
Willy Tarreau	81e3b4f48d	[MINOR] http redirect: add the ability to append a '/' to the URL Sometimes it can be desired to return a location which is the same as the request with a slash appended when there was not one in the request. A typical use of this is for sending a 301 so that people don't reference links without the trailing slash. The name of the new option is "append-slash" and it can be used on "redirect" statements in prefix mode.	2010-01-10 00:42:19 +01:00
Willy Tarreau	dcb75c4a83	[MINOR] http: fix double slash prefix with server redirect When using server redirection, it is possible to specify a path consisting of only one slash. While this is discouraged (risk of loop) it may sometimes be useful combined with content switching. The prefixing of a '/' then causes two slashes to be returned in the response. So we now do as with the other redirects, don't prepend a slash if it's alone.	2010-01-10 00:24:22 +01:00
Willy Tarreau	962c3f4aab	[MEDIUM] http: fix handling of message pointers Some message pointers were not usable once the message reached the HTTP_MSG_DONE state. This is the case for ->som which points to the body because it is needed to parse chunks. There is one case where we need the beginning of the message : server redirect. We have to call http_get_path() after the request has been parsed. So we rely on ->sol without counting on ->som. In order to achieve this, we're making ->rq.{u,v} relative to the beginning of the message instead of the buffer. That simplifies the code and makes it cleaner. Preliminary tests show this is OK.	2010-01-10 00:15:35 +01:00
Willy Tarreau	59e0b0f972	[BUG] server redirection used an uninitialized string. This might have been introduced with chunk extensions. Note that the server redirect still does not work because http_get_path() cannot get the correct path once the request message is in the HTTP_MSG_DONE state (->som does not point to the start of message anymore).	2010-01-09 21:29:23 +01:00
Willy Tarreau	1fac75385a	[BUILD] appsession did not build anymore under gcc-2.95	2010-01-09 19:23:06 +01:00
Willy Tarreau	762a23618e	[BUG] appsession's sessid must be reset at end of transaction If we don't do that, we may corrupt the pools in keep-alive sessions.	2010-01-09 13:57:26 +01:00
Willy Tarreau	065e8338e8	[MEDIUM] http: wait for some flush of the response buffer before a new request If we accept a new request and that request produces an immediate response (error, redirect, ...), then we may fail to send it in case of pipelined requests if the response buffer is full. To avoid this, we check the availability of at least maxrewrite bytes in the response buffer before accepting a new pipelined request.	2010-01-08 00:36:57 +01:00
Willy Tarreau	ea65e68cc8	[MINOR] http redirect: use proper call to return last response During a redirect, we used to send the last chunk of response with stream_int_cond_close(). But this is wrong in case of pipeline, because if the response already contains something, this function will refrain from touching the buffer. Use a concatenation function instead. Also, this call might still fail when the buffer is full, we need a second fix to refrain from parsing an HTTP request as long as the response buffer is full, otherwise we may not even be able to return a pending redirect or an error code.	2010-01-08 00:36:57 +01:00
Willy Tarreau	4602363f6a	[BUG] http: fix for capture memory leak was incorrect That patch was incorrect because under some circumstances, the capture memory could be freed by session_free() and then again by http_end_txn(), causing a double free and an eventual segfault. The pool use count was also reported wrong due to this bug. The cleanup code was removed from session_free() to remain only in http_end_txn().	2010-01-07 22:51:47 +01:00
Willy Tarreau	6fe60182aa	[BUG] http: memory leak with captures when using keep-alive Hank A. Paulson reported a massive memory leak when using keep-alive mode. The information he provided made it easy to find that captured request and response headers were erased but not released when renewing a request.	2010-01-07 13:35:21 +01:00
Willy Tarreau	90deb18916	[MEDIUM] http: make safer use of the DONT_READ and AUTO_CLOSE flags Several HTTP analysers used to set those flags to values that were useful but without considering the possibility that they were not called again to clean what they did. First, replace direct flag manipulation with more explicit macros. Second, enforce a rule stating that any buffer which changes one of these flags from the default must restore it after completion, so that other analysers see correct flags. With both this fix and the previous one about analyser bits, we should not see any more stuck sessions.	2010-01-07 00:20:41 +01:00
Willy Tarreau	8db1c17634	[BUG] http: check options before the connection header Commit `0dfdf19b64` introduced a regression because the connection header is now parsed and checked depending on the configured options, but the options are set after calling it instead of being set before.	2010-01-05 23:12:12 +01:00
Willy Tarreau	0dfdf19b64	[MEDIUM] http: restore the original behaviour of option httpclose Historically, "option httpclose" has always worked the same way. It only mangles the "Connection" header in the request and the response if needed, but does not affect the connection by itself, and ignores any further data. It is dangerous to change this behaviour without leaving any other alternative. If an active close is desired, it's better to make use of "option forceclose" which does exactly what it intends to do. So as of now, "option httpclose" will only mangle the headers as before, and will only affect the connection by itself when combined with another connection-related option (eg: keepalive or server-close).	2010-01-05 11:33:11 +01:00
Willy Tarreau	2832d63874	[BUG] http: don't set no-linger on response in case of forced close This is a copy-paste error, it must only apply to the request.	2010-01-05 11:06:20 +01:00
Willy Tarreau	9300fb2c01	[BUG] http: redirect needed to be updated after recent changes The data forwarding fixes broke http redirection which relied on tricks.	2010-01-05 00:58:24 +01:00
Willy Tarreau	2fa144c66a	[BUG] http: some possible missed close remain in the forward chain We basically have to mimmic the code of process_session() here, so when the remote output is closed, we must abort otherwise we'll end up with data which cannot leave the buffer.	2010-01-04 23:16:01 +01:00
Willy Tarreau	e3fa6e5bd7	[BUG] http_process_res_common() must not skip the forward analyser By default this function returned 0 indicating an end of analysis. This was not a problem as long as it was the last analyser in the chain but becomes quite a big one now since it skips the forwarder with auto_close enabled, causing some data to pass under the nose of the last one undetected.	2010-01-04 22:57:43 +01:00
Willy Tarreau	610ecceef9	[MAJOR] http: fix again the forward analysers There were still several situations leading to CLOSE_WAIT sockets remaining there forever because some complex transitions were obviously not caught due to the impossibility to resync changes between the request and response FSMs. This patch now centralizes the global transaction state and feeds it from both request and response transitions. That way, whoever finishes first, there will be no issue for converging to the correct state. Some heavy use of the new debugging function has helped a lot. Maybe those calls could be removed after some time. First tests are very positive.	2010-01-04 21:15:02 +01:00
Willy Tarreau	e988a79c74	[DEBUG] add an http_silent_debug function to debug HTTP states This function outputs to fd #-1 the status of request and response buffers, the transaction states, the stream interface states, etc... That way, it's easy to find that output in an strace report, correctly placed WRT the other syscalls.	2010-01-04 21:13:14 +01:00
Willy Tarreau	f5c8bd6a99	[BUG] http: fix hopefully last closing issue on data forwarding The data forwarders are analysers. As such, the have to check for various situations on which they have to abort, one of them being the lack of data with closed input. Now we don't leave the functions anymore without performing these checks. This has solved the new CLOSE_WAIT issue that became more noticeable since last patch.	2010-01-04 07:10:34 +01:00
Willy Tarreau	d3347ee227	[BUG] http: disable auto-closing during chunk analysis It may happen that we forward a close just after we sent the last chunk, because we forgot to clear the AUTO_CLOSE flag. This issue caused some pages to be truncated depending on some timing races. Issue initially reported by Cyril Bont�.	2010-01-04 02:10:45 +01:00
Willy Tarreau	caabe41a15	[OPTIM] http: optimize a bit the construct of the forward loops By adjusting a few states and direct branches, we can save a few percents of CPU, increasing by as much the resulting data rate.	2010-01-03 23:08:28 +01:00
Willy Tarreau	bddaa4a2f7	[CLEANUP] http: remove a remaining impossible condition This test was there before we had the CLOSING and CLOSED states. It makes no sense now.	2010-01-03 22:13:35 +01:00
Willy Tarreau	deb9ed8f60	[MEDIUM] config: remove the limitation of 10 reqadd/rspadd statements Now we use a linked list, there is no limit anymore.	2010-01-03 21:22:14 +01:00
Willy Tarreau	f285f54311	[MINOR] redirect: add support for unconditional rules Sometimes it's useful to be able to specify an unconditional redirect rule without adding "if TRUE".	2010-01-03 21:22:08 +01:00
Willy Tarreau	305ae85957	[BUG] http: fix cookie parser to support spaces and commas in values The cookie parser could be fooled by spaces or commas in cookie names and values, causing the persistence cookie not to be matched if located just after such a cookie. Now spaces found in values are considered as part of the value, and spaces, commas and semi-colons found in values or names, are skipped till next cookie name. This fix must be backported to 1.3.	2010-01-03 19:45:54 +01:00
Willy Tarreau	a9679ac94b	[MINOR] http: make the conditional redirect support keep-alive It makes sense to permit a client to keep its connection when performing a redirect to the same host. We only detect the fact that the redirect location begins with a slash to use the keep-alive (if the client supports it).	2010-01-03 17:32:57 +01:00
Willy Tarreau	2be3939416	[MINOR] http: don't wait for sending requests to the server By default we automatically wait for enough data to fill large packets if buf->to_forward is not null. This causes a problem with POST/Expect requests which have a data size but no data immediately available. Instead of causing noticeable delays on such requests, simply add a flag to disable waiting when sending requests.	2010-01-03 17:24:51 +01:00
Willy Tarreau	6c2cbe14e4	[BUG] http: take care of errors, timeouts and aborts during the data phase In server-close mode particularly, the response buffer is marked for no-auto-close after a response passed through. This prevented a POST request from being aborted on errors, timeouts or anything if the response was received before the request was complete.	2010-01-03 17:07:49 +01:00
Willy Tarreau	bc5aa19e97	[MINOR] http: move redirect messages to HTTP/1.1 with a content-length This is cleaner and this tells clients we support 1.1.	2010-01-03 15:12:05 +01:00
Willy Tarreau	5e8949cf84	[OPTIM] http: don't immediately enable reading on request If we enable reading of a request immediately after completing another one, we end up performing small reads until the request buffer is complete. This takes time and makes it harder to realign the buffer when needed. Just enable reading when we need to.	2010-01-03 14:54:32 +01:00
Willy Tarreau	a95a1f4614	[BUG] http: the request URI pointer is relative to the buffer The rq.u field is relative to buf->data, not to msg->sol. We have to subtract msg->som everywhere this error was made. Maybe it will be simpler to have a pointer to the buffer in the message and find appropriate data there.	2010-01-03 13:04:35 +01:00
Willy Tarreau	3bb9c23bd6	[BUG] http: redirects were broken by chunk changes Redirects used to initialize a chunk whose size was not set (0). Also, the return code of chunk_strcpy() is 1 in case of success.	2010-01-03 12:24:37 +01:00
Willy Tarreau	face839296	[OPTIM] http: set MSG_MORE on response when a pipelined request is pending Many times we see a lot of short responses in HTTP (typically 304 on a reload). It is a waste of network bandwidth to send that many small packets when we know we can merge them. When we know that another HTTP request is following a response, we set BF_EXPECT_MORE on the response buffer, which will turn MSG_MORE on exactly once. That way, multiple short responses can leave pipelined if their corresponding requests were also pipelined.	2010-01-03 11:37:54 +01:00
Willy Tarreau	638cd02e9d	[BUG] http: fix erroneous trailers size computation We used to forward more trailers than required, causing a desynchronization of the output. Now we schedule all for forwarding as soon as we encounter them.	2010-01-03 07:42:04 +01:00
Willy Tarreau	21c5e4d85b	[BUG] last fix was overzealous and disabled server-close we must not close on remote shutdown but on remote error only.	2010-01-03 00:19:31 +01:00
Willy Tarreau	082b01c541	[BUG] http: ensure we abort data transfer on write error When a write error is encountered during a data phase, we must absolutely abort the pending data transfer, otherwise it will never complete.	2010-01-03 00:00:45 +01:00
Willy Tarreau	b608feb82a	[MAJOR] http: add support for option http-server-close This option enables HTTP keep-alive on the client side and close mode on the server side. This offers the best latency on the slow client side, and still saves as many resources as possible on the server side by actively closing connections. Pipelining is supported on both requests and responses, though there is currently no reason to get pipelined responses.	2010-01-02 22:47:18 +01:00
Willy Tarreau	2ab6eb1e24	[MEDIUM] http: make the parsers able to wait for a buffer flush When too large a message lies in a buffer before parsing a new request/response, we can now wait for previous outgoing data to leave the buffer before attempting to parse again. After that we can consider the opportunity to realign the buffer if needed.	2010-01-02 22:04:45 +01:00
Willy Tarreau	15de77e16e	[MEDIUM] http: make the analyser not rely on msg being initialized anymore The HTTP parser needed the msg structure to hold pre-initialized pointers. This causes a trouble with keep-alive because if some data is still in the buffer, the pointers can be anywhere after the data and later become invalid when the buffer gets realigned. It was not needed to rely on that since we have two valid information in the buffer itself : - buf->lr : last visited place - buf->w + buf->send_max : beginning of next message So by doing the maths only on those values, we can avoid doing tricks on msg->som.	2010-01-02 21:59:16 +01:00
Willy Tarreau	c88ea68ef1	[MEDIUM] http: add some SI_FL_NOLINGER around server errors When we catch an error from the server, speed up the connection abort since we don't want to remain long with pending data in the socket, and we want to be able to reuse our source port ASAP.	2009-12-29 14:56:36 +01:00
Willy Tarreau	9438c718ce	[MEDIUM] http: make forceclose use SI_FL_NOLINGER Option forceclose is not limited to the shortage of source ports anymore thanks to this flag.	2009-12-29 14:39:48 +01:00
Willy Tarreau	82eeaf2fae	[MEDIUM] http: properly handle "option forceclose" The "forceclose" option used to close the output channel to the server once it started to respond. While this happened to work with most servers, some of them considered this as a connection abort and immediately stopped responding. Now that we're aware of the end of a request and response, we're able to trivially handle this option and properly close both sides when the server's response is complete. During this change it appeared that forwarding could be allowed when the BF_SHUTW_NOW flag was set on a buffer, which obviously is not acceptable and was causing some trouble. This has been fixed too and is the reason for the MEDIUM status on this patch.	2009-12-29 14:26:42 +01:00
Willy Tarreau	5523b32cc6	[MEDIUM] http: add two more states for the closing period HTTP_MSG_CLOSING and HTTP_MSG_CLOSED are needed to know when it is safe to close a connection without risking to destroy pending data.	2009-12-29 12:05:52 +01:00
Willy Tarreau	83e3af0c86	[MEDIUM] http: rework the buffer alignment logic There were still issues with the buffer alignment. Now we ensure that we always align it before a request or response is completely parsed if there is less than maxrewrite bytes free at the end. In practice, it's not called that often and ensures we can always work as expected.	2009-12-28 17:39:57 +01:00
Willy Tarreau	58cc872848	[BUG] http: typos on several unlikely() around header insertion In many places where we perform header insertion, an error control is performed but due to a mistake, it cannot match any error : if (unlikely(error) < 0) instead of if (unlikely(error < 0)) This prevents error 400 responses from being sent when the buffer is full due to many header additions. This must be backported to 1.3.	2009-12-28 06:57:33 +01:00
Willy Tarreau	d98cf93395	[MAJOR] http: implement body parser The body parser will be used in close and keep-alive modes. It follows the stream to keep in sync with both the request and the response message. Both chunked transfer-coding and content-length are supported according to RFC2616. The multipart/byterange encoding has not yet been implemented and if not seconded by any of the two other ones, will be forwarded till the close, as requested by the specification. Both the request and the response analysers converge into an HTTP_MSG_DONE state where it will be possible to force a close (option forceclose) or to restart with a fresh new transaction and maintain keep-alive. This change is important. All tests are OK but any possible behaviour change with "option httpclose" might find its root here.	2009-12-27 22:54:55 +01:00
Willy Tarreau	7c96f678fa	[BUG] http: body parsing must consider the start of message When parsing body for URL parameters, we must not consider that data are available from buf->data but from buf->data + msg->som. This is not a problem right now but may become with keep-alive.	2009-12-27 22:47:25 +01:00
Willy Tarreau	aec571c2bb	[MEDIUM] http: automatically re-aling request buffer When parsing a request that does not start at the beginning of the buffer, we may experience a buffer full issue. In order to avoid this, we try to realign the buffer if it is not really full. That will be required when we have to deal with pipelined requests.	2009-12-27 17:18:11 +01:00
Willy Tarreau	1d3bcce4dd	[BUG] http: offsets are relative to the buffer, not to ->som Some wrong operations were performed on buffers, assuming the offsets were relative to the beginning of the request while they are relative to the beginning of the buffer. In practice this is not yet an issue since both are the same... until we add support for keep-alive.	2009-12-27 15:50:06 +01:00
Willy Tarreau	e8e785bb85	[MEDIUM] http: add a new transaction flags indicating if we know the transfer length It's not enough to know if the connection will be in CLOSE or TUNNEL mode, we still need to know whether we want to read a full message to a known length or read it till the end just as in TUNNEL mode. Some updates to the RFC clarify slightly better the corner cases, in particular for the case where a non-chunked encoding is used last. Now we also take care of adding a proper "connection: close" to messages whose size could not be determined.	2009-12-26 16:29:04 +01:00
Willy Tarreau	115acb9755	[MEDIUM] http: rework chunk-size parser Chunked encoding can be slightly more complex than what was implemented. Specifically, it supports some optional extensions that were not parsed till now if present, and would have caused an error to be returned. Also, now we enforce check for too large values in chunk sizes in order to ensure we never overflow. Last, we're now able to return a request error if we can't read the chunk size because the buffer is already full.	2009-12-26 13:56:06 +01:00
Willy Tarreau	0394594b06	[MINOR] http: introduce a new synchronisation state : HTTP_MSG_DONE This state indicates that an HTTP message (request or response) is complete. This will be used to know when we can re-initialize a new transaction. Right now we only switch to it after the end of headers if there is no data. When other analysers are implemented, we can switch to this state too. The condition to reuse a connection is when the response finishes after the request. This will have to be checked when setting the state.	2009-12-22 16:50:27 +01:00
Willy Tarreau	63c9e5ffa6	[MINOR] http: move 1xx handling earlier to eliminate a lot of ifs The response 1xx was set too low and required a lot of tests along the code in order to avoid some processing. We still left the test after the response rewrite rules so that we can eliminate unwanted headers if required.	2009-12-22 16:01:27 +01:00
Willy Tarreau	0937bc43cf	[MINOR] http: move the http transaction init/cleanup code to proto_http This code really belongs to the http part since it's transaction-specific. This will also make it easier to later reinitialize a transaction in order to support keepalive.	2009-12-22 15:03:09 +01:00
Willy Tarreau	7c3c54177a	[MAJOR] buffers: automatically compute the maximum buffer length We used to apply a limit to each buffer's size in order to leave some room to rewrite headers, then we used to remove this limit once the session switched to a data state. Proceeding that way becomes a problem with keepalive because we have to know when to stop reading too much data into the buffer so that we can leave some room again to process next requests. The principle we adopt here consists in only relying on to_forward+send_max. Indeed, both of those data define how many bytes will leave the buffer. So as long as their sum is larger than maxrewrite, we can safely fill the buffers. If they are smaller, then we refrain from filling the buffer. This means that we won't risk to fill buffers when reading last data chunk followed by a POST request and its contents. The only impact identified so far is that we must ensure that the BF_FULL flag is correctly dropped when starting to forward. Right now this is OK because nobody inflates to_forward without using buffer_forward().	2009-12-22 10:06:34 +01:00
Willy Tarreau	9e13c3c630	[MINOR] http: only consider chunk encoding with HTTP/1.1 This must be ignored in case of HTTP/1.0.	2009-12-22 09:59:58 +01:00
Willy Tarreau	5b15447672	[MAJOR] http: completely process the "connection" header Up to now, we only had a flag in the session indicating if it had to work in "connection: close" mode. This is not at all compatible with keep-alive. Now we ensure that both sides of a connection act independantly and only relative to the transaction. The HTTP version of the request and response is also correctly considered. The connection already knows several modes : - tunnel (CONNECT or no option in the config) - keep-alive (when permitted by configuration) - server-close (close the server side, not the client) - close (close both sides) This change carefully detects all situations to find whether a request can be fully processed in its mode according to the configuration. Then the response is also checked and tested to fix corner cases which can happen with different HTTP versions on both sides (eg: a 1.0 client asks for explicit keep-alive, and the server responds with 1.1 without a header). The mode is selected by a capability elimination algorithm which automatically focuses on the least capable agent between the client, the frontend, the backend and the server. This ensures we won't get undesired situtations where one of the 4 "agents" is not able to process a transaction. No "Connection: close" header will be added anymore to HTTP/1.0 requests or responses since they're already in close mode. The server-close mode is still not completely implemented. The response needs to be rewritten as keep-alive before being sent to the client if the connection was already in server-close (which implies the request was in keep-alive) and if the response has a content-length or a transfer-encoding (but only if client supports 1.1). A later improvement in server-close mode would probably be to detect some situations where it's interesting to close the response (eg: redirections with remote locations). But even then, the client might close by itself. It's also worth noting that in tunnel mode, no connection header is affected in either direction. A tunnelled connection should theorically be notified at the session level, but this is useless since by definition there will not be any more requests on it. Thus, we don't need to add a flag into the session right now.	2009-12-22 09:52:43 +01:00
Willy Tarreau	522d6c048f	[MEDIUM] http: process request body in a specific analyser The POST body analysis was split between two analysers for historical reasons. Now we only have one analyser which checks content length and waits for enough data to come. Right now this analyser waits for <url_param_post_limit> bytes of body to reach the buffer, or the first chunk. But this could be improved to wait for any other amount of data or any specific contents.	2009-12-22 09:52:42 +01:00
Krzysztof Piotr Oledzki	97f07b832f	[MEDIUM] Decrease server health based on http responses / events, version 3 Implement decreasing health based on observing communication between HAProxy and servers. Changes in this version 2: - documentation - close race between a started check and health analysis event - don't force fastinter if it is not set - better names for options - layer4 support Changes in this version 3: - add stats - port to the current 1.4 tree	2009-12-16 00:29:27 +01:00
Willy Tarreau	d0f06fc4b2	[MINOR] http: detect tunnel mode and set it in the session In order to support keepalive, we'll have to differentiate normal sessions from tunnel sessions, which are the ones we don't want to analyse further. Those are typically the CONNECT requests where we don't care about any form of content-length, as well as the requests which are forwarded on non-close and non-keepalive proxies.	2009-11-30 12:19:56 +01:00
Willy Tarreau	b86db34fe0	[BUG] x-original-to: name was not set in default instance This resulted in an empty header name when option originalto was declared in a default sections.	2009-11-30 11:50:16 +01:00
Cyril Bonté	b21570ae0f	[MEDIUM] appsession: add "len", "prefix" and "mode" options To sum up : - len : it's now the max number of characters for the value, preventing garbaged results. - a new option "prefix" is added, this allows to use dynamic cookie names (e.g. ASPSESSIONIDXXX). Previously in the thread, I wanted to use the value found with "capture cookie" but when i started to update the documentation, I found this solution quite weird. I've made a small rework to not depend on "capture cookie". - There's the posssiblity to define the URL parser mode (path parameters or query string).	2009-11-30 11:31:53 +01:00
Willy Tarreau	fa355d4a51	[MINOR] http: keep pointer to beginning of data We now set msg->col and msg->sov to the first byte of non-header. They will be used later when parsing chunks. A new macro was added to perform size additions on an http_msg in order to limit the risks of copy-paste in the long term. During this operation, it appeared that the http_msg struct was not optimal on 64-bit, so it was re-ordered to fill the holes.	2009-11-29 18:12:29 +01:00
Willy Tarreau	655dce90d4	[MINOR] http: create new MSG_BODY sub-states An HTTP message can be decomposed into several sub-states depending on the transfer-encoding. We'll have to keep these state information while parsing chunks, so we must extend the values. In order not to change everything, we'll now consider that anything >= MSG_BODY is the body, and that the value indicates the precise state. The MSG_ERROR status which was greater than MSG_BODY was moved for this.	2009-11-08 13:10:58 +01:00
Krzysztof Piotr Oledzki	de71d16ec0	[MINOR] Collect & provide http response codes for frontends, fix backends This patch extends and corrects the functionality introduced by "Collect & provide http response codes received from servers": - responses are now also accounted for frontends - backend's and frontend's counters are incremented based on responses sent to client, not received from servers	2009-10-27 21:56:47 +01:00
Willy Tarreau	8e89b84848	[MINOR] http: remove the last call to stream_int_return And remove the now unused function itself too.	2009-10-18 23:56:35 +02:00
Willy Tarreau	b50943e717	[MINOR] http response: update the TX_CLI_CONN_KA flag on rewrite If we modify the "connection" header we send to the client, update the TX_CLI_CONN_KA flag.	2009-10-18 23:53:19 +02:00
Willy Tarreau	b8c82c295b	[MEDIUM] http response: check body length and set transaction flags We also check the close status and terminate the server persistent connection if appropriate. Note that since this change, we'll not get any "Connection: close" headers added to HTTP/1.0 responses anymore, which is good.	2009-10-18 23:45:12 +02:00
Willy Tarreau	75a5fef4d2	[MINOR] http: pre-set the persistent flags in the transaction We should pre-set the persistent flags then try to clear them instead of the opposite.	2009-10-18 23:43:57 +02:00
Willy Tarreau	b37c27e28f	[MAJOR] http: create the analyser which waits for a response The code part which waits for an HTTP response has been extracted from the old function. We now have two analysers and the second one may re-enable the first one when an 1xx response is encountered. This has been tested and works. The calls to stream_int_return() that were remaining in the wait analyser have been converted to stream_int_retnclose().	2009-10-18 23:15:41 +02:00
Willy Tarreau	2225dd4421	[MEDIUM] http request: make use of pre-parsed transfer-encoding header This change should go a bit further. We should have a dedicated analyser to find and skip chunks.	2009-10-18 21:36:47 +02:00
Willy Tarreau	03a5633299	[MEDIUM] http request: simplify POST length detection We can now rely on the pre-parsed content-length and transfer-encoding to find what the supposed body length will be.	2009-10-18 21:28:29 +02:00
Willy Tarreau	349a0f62b5	[MINOR] http request: simplify the test of no-data Now we can rely on (chunked && !hdr_content_len) to stop forwarding data. We only do that for known methods that are not CONNECT though.	2009-10-18 21:19:33 +02:00
Willy Tarreau	4273664a1b	[MINOR] http request: update the TX_SRV_CONN_KA flag on rewrite If we modify the "connection" header we send to the server, update the TX_SRV_CONN_KA flag.	2009-10-18 21:10:21 +02:00
Willy Tarreau	32b47f42a0	[MEDIUM] http request: parse connection, content-length and transfer-encoding Store those elements in the transaction. RFC2616 is strictly followed. Note that requests containing two different content-length fields are discarded as invalid.	2009-10-18 20:59:20 +02:00
Cyril Bont�	bf47aeb946	[MEDIUM] appsession: add the "request-learn" option This patch has 2 goals : 1. I wanted to test the appsession feature with a small PHP code, using PHPSESSID. The problem is that when PHP gets an unknown session id, it creates a new one with this ID. So, when sending an unknown session to PHP, persistance is broken : haproxy won't see any new cookie in the response and will never attach this session to a specific server. This also happens when you restart haproxy : the internal hash becomes empty and all sessions loose their persistance (load balancing the requests on all backend servers, creating a new session on each one). For a user, it's like the service is unusable. The patch modifies the code to make haproxy also learn the persistance from the client : if no session is sent from the server, then the session id found in the client part (using the URI or the client cookie) is used to associated the server that gave the response. As it's probably not a feature usable in all cases, I added an option to enable it (by default it's disabled). The syntax of appsession becomes : appsession <cookie> len <length> timeout <holdtime> [request-learn] This helps haproxy repair the persistance (with the risk of losing its session at the next request, as the user will probably not be load balanced to the same server the first time). 2. This patch also tries to reduce the memory usage. Here is a little example to explain the current behaviour : - Take a Tomcat server where /session.jsp is valid. - Send a request using a cookie with an unknown value AND a path parameter with another unknown value : curl -b "JSESSIONID=12345678901234567890123456789012" http://<haproxy>/session.jsp;jsessionid=00000000000000000000000000000001 (I know, it's unexpected to have a request like that on a live service) Here, haproxy finds the URI session ID and stores it in its internal hash (with no server associated). But it also finds the cookie session ID and stores it again. - As a result, session.jsp sends a new session ID also stored in the internal hash, with a server associated. => For 1 request, haproxy has stored 3 entries, with only 1 which will be usable The patch modifies the behaviour to store only 1 entry (maximum).	2009-10-18 11:56:26 +02:00
Willy Tarreau	f1ba4b3de5	[MAJOR] buffer: flag BF_DONT_READ to disable reads when not required When processing a GET or HEAD request in close mode, we know we don't need to read anything anymore on the socket, so we can disable it. Doing this can save up to 40% of the recv calls, and half of the epoll_ctl calls. For this we need a buffer flag indicating that we're not interesting in reading anymore. Right now, this flag also disables both polled reads. We might benefit from disabling only speculative reads, but we will need at least this flag when we want to support keepalive anyway. Currently we don't disable the flag on completion, but it does not matter as we close ASAP when performing the shutw().	2009-10-18 08:52:24 +02:00
Willy Tarreau	7859991dd7	[MINOR] http: detect connection: close earlier Till now we would only set SN_CONN_CLOSED after rewriting it. Now we set it just after checking the Connection header so that we can use the result later if required.	2009-10-17 20:15:29 +02:00
Krzysztof Piotr Oledzki	5fb1882514	[MINOR] Collect & provide http response codes received from servers Additional data is provided on both html & csv stats: - html: when passing a mouse over Sessions -> Total (servers, backends) - cvs: by 6 additional fields (hrsp_1xx, hrsp_2xx, hrsp_3xx, hrsp_4xx, hrsp_5xx, hspr_other) Patch inspired by: http://www.formilux.org/archives/haproxy/0910/2528.html http://www.formilux.org/archives/haproxy/0910/2529.html	2009-10-14 21:49:53 +02:00
Krzysztof Piotr Oledzki	6f61b21524	[BUG] Fix NULL pointer dereference in stats_check_uri_auth(), v2 Recent "struct chunk rework" introduced a NULL pointer dereference and now haproxy segfaults if auth is required for stats but not found. The reason is that size_t cannot store negative values, but current code assumes that "len < 0" == uninitialized. This patch fixes it.	2009-10-04 23:44:45 +02:00
Krzysztof Piotr Oledzki	aeebf9ba65	[MEDIUM] Collect & provide separate statistics for sockets, v2 This patch allows to collect & provide separate statistics for each socket. It can be very useful if you would like to distinguish between traffic generate by local and remote users or between different types of remote clients (peerings, domestic, foreign). Currently no "Session rate" is supported, but adding it should be possible if we found it useful.	2009-10-04 18:56:02 +02:00
Krzysztof Piotr Oledzki	052d4fd07d	[CLEANUP] Move counters to dedicated structures Move counters from "struct proxy" and "struct server" to "struct pxcounters" and "struct svcounters". This patch should make no functional change.	2009-10-04 18:32:39 +02:00
Willy Tarreau	b0c9bc4f95	[MEDIUM] stats: make HTTP stats use an I/O handler Doing this, we can remove the last BF_HIJACK user and remove produce_content(). s->data_source could also be removed but it is currently used to detect if the stats or a server was used.	2009-10-04 15:56:38 +02:00
Krzysztof Piotr Oledzki	78abe618a8	[MAJOR] struct chunk rework Add size to struct chunk and simplify the code as there is no longer required to pass sizeof in chunk_printf().	2009-10-01 10:17:37 +02:00
Willy Tarreau	2f9cc8ab52	[BUG] http stats: large outputs sometimes got some parts chopped off Due to a misplaced call to stream_int_retnclose(), the stats output buffer was erased before each call to produce_content(), resulting in missing pieces in the stats output if the connection was not fast enough between haproxy and the client.	2009-09-24 22:22:18 +02:00
Willy Tarreau	56a560aef4	[MEDIUM] stats: prepare the connection for closing before dumping We will need to modify the stats dump functions so that they can be used in interactive mode. For this, we want their caller to prepare the connection for a close, not themselves to do it. Let's simply move the stream_int_retnclose() out.	2009-09-23 23:52:16 +02:00
Willy Tarreau	520d95e42b	[MAJOR] buffers: split BF_WRITE_ENA into BF_AUTO_CONNECT and BF_AUTO_CLOSE The BF_WRITE_ENA buffer flag became very complex to deal with, because it was used to : - enable automatic connection - enable close forwarding - enable data forwarding The last point was not very true anymore since we introduced ->send_max, but still the test remained everywhere. This was causing issues such as impossibility to connect without forwarding data, impossibility to prevent closing when data was forwarded, etc... This patch clarifies the situation by getting rid of this multi-purpose flag and replacing it with : - data forwarding based only on ->send_max \|\| ->pipe ; - a new BF_AUTO_CONNECT flag to allow automatic connection and only that ; - ability to perform an automatic connection when ->send_max or ->pipe indicate that data is waiting to leave the buffer ; - a new BF_AUTO_CLOSE flag to let the producer automatically set the BF_SHUTW_NOW flag when it gets a BF_SHUTR. During this cleanup, it was discovered that some tests were performed twice, or that the BF_HIJACK flag was still tested, which is not needed anymore since ->send_max replcaed it. These places have been fixed too. These cleanups have also revealed a few areas where the other flags such as BF_EMPTY are not cleanly used. This will be an opportunity for a second patch.	2009-09-19 21:14:54 +02:00
Willy Tarreau	816b979977	[MAJOR] http: add support for HTTP 1xx informational responses HTTP supports status codes 100 and 101 to report protocol indications, which are followed by the requests's response. Till now, haproxy would only see those responses without parsing subsequent ones. That means that cookie additions were only performed on 1xx messages for instance, which does not work since headers must be ignored with 1xx messages. Also, logs were not terribly useful with the common 100 status code in response to "Expect: 100-continue" during POST some requests. This change adds support for such messages. Now haproxy sees them, forwards them and skips them until it finds a correct response, which it logs and processes. As an exception, header removal/rewriting still work on 1xx responses in order to be able to strip out sensible information that may have accidentely been left by another equipment (possibly an older haproxy itself). But headers addition are disabled however. This change brings the ability to loop on response without data, which is a starting point to support keepalive. The change is marked as major as a few fixes had to be performed in the HTTP message parser.	2009-09-19 14:53:47 +02:00
Willy Tarreau	106f979bbd	[MINOR] acl: add support for hdr_ip to match IP addresses in headers For x-forwarded-for and such headers, it's sometimes needed to match based on network addresses. Let's use hdr_ip() for that.	2009-09-19 14:47:49 +02:00
Willy Tarreau	c465fd7836	[BUG] tarpit did not work anymore Tarpit was broken by recent splitting of analysers. It would still let the connection go to the server due to a missing buffer_write_dis(). Also, it was performed too late (after content switching rules).	2009-08-31 00:17:18 +02:00
Willy Tarreau	a07a34eb24	[MEDIUM] replace BUFSIZE with buf->size in computations The first step towards dynamic buffer size consists in removing all static definitions of the buffer size. Instead, we store a buffer's size in itself. Right now they're all preinitialized to BUFSIZE, but we will change that.	2009-08-16 23:27:46 +02:00
Willy Tarreau	52a0c60845	[MINOR] set s->srv_error according to the analysers s->srv_error was set depending on the frontend's protocol. Now it is set by the HTTP analyser, so that even when switching from a TCP frontend to an HTTP backend, we can have HTTP error messages.	2009-08-16 22:45:38 +02:00
Willy Tarreau	b55932ddaf	[MEDIUM] remove old experimental tcpsplice option This Linux-specific option was never really used in production and has since been superseded by new splicing options brought by recent Linux kernels. It caused several particular cases in the code because the kernel would take care of the session without haproxy being able to do anything on it, which became hard to handle in the new architecture. Let's simply get rid of it now that there is a replacement available.	2009-08-16 13:20:32 +02:00
Emeric Brun	3a058f3091	[MINOR] add a new CLF log format Appending the "clf" word after "option httplog" turns the HTTP log format into a CLF format, more suited for certain tools.	2009-07-14 12:50:40 +02:00
Willy Tarreau	cd7afc0a13	[MINOR] http: take http request timeout from the backend Since we can now switch from TCP to HTTP, we need to be able to apply the HTTP request timeout after switching. That means we need to take it from the backend and not from the frontend. Since the backend points to the frontend before switching, that changes nothing for the normal case.	2009-07-12 10:03:17 +02:00
Willy Tarreau	51aecc76f8	[MEDIUM] allow a TCP frontend to switch to an HTTP backend This patch allows a TCP frontend to switch to an HTTP backend. During the switch, missing structures are automatically allocated. The HTTP parser is enabled so that the backend first waits for a full HTTP request.	2009-07-12 09:47:04 +02:00
Willy Tarreau	2492d5b4d6	[MINOR] acl: add HTTP protocol detection (req_proto_http) Now that we can perform TCP-based content switching, it makes sense to be able to detect HTTP traffic and act accordingly. We already have an HTTP decoder, we just have to call it in order to detect HTTP protocol. Note that since the decoder will automatically fill in the interesting fields of the HTTP transaction, it would make sense to use this parsing to extend HTTP matching to TCP.	2009-07-12 08:06:20 +02:00
Willy Tarreau	1d0dfb155d	[MAJOR] http: complete splitting of the remaining stages The HTTP processing has been splitted into 7 steps, one of which is not anymore HTTP-specific (content-switching). That way, it becomes possible to use "use_backend" rules in TCP mode. A new "use_server" directive should follow soon.	2009-07-07 15:10:31 +02:00
Willy Tarreau	3a816293e9	[MEDIUM] session: tell analysers what bit they were called for Some stream analysers might become generic enough to be called for several bits. So we cannot have the analyser bit hard coded into the analyser itself. Let's make the caller inform the callee.	2009-07-07 10:55:49 +02:00
Willy Tarreau	d787e6648c	[MEDIUM] http: split request waiter from request processor We want to split several steps in HTTP processing so that we can call individual analysers depending on what processing we want to perform. The first step consists in splitting the part that waits for a request from the rest.	2009-07-07 10:14:51 +02:00
Willy Tarreau	571ec98baa	[CLEANUP] remove unused DEBUG_PARSE_NO_SPEEDUP define This one has become useless with the new HTTP parser.	2009-07-07 08:56:15 +02:00
Willy Tarreau	06b917c7ab	[BUG] http: redirect rules were processed too early redirect rules are documented as being processed last before use_backend but were mistakenly processed before block rules. Fortunately very few people use a mix of block and redirect rules, so this bug has never been reported yet.	2009-07-06 16:34:52 +02:00
Willy Tarreau	c9bd0cc224	[MINOR] add options dontlog-normal and log-separate-errors Some big traffic sites have trouble dealing with logs and tend to disable them. Here are two new options to help cope with massive logs. - dontlog-normal only disables logging for 100% successful connections, other ones will still be logged - log-separate-errors will cause non-100% successful connections to be logged at level "err" instead of level "info" so that a properly configured syslog daemon can send them to a different file for longer conservation.	2009-05-10 11:57:02 +02:00
Maik Broemme	2850cb42b6	[MINOR] add X-Original-To: header I have attached a patch which will add on every http request a new header 'X-Original-To'. If you have HAProxy running in transparent mode with a big number of SQUID servers behind it, it is very nice to have the original destination ip as a common header to make decisions based on it. The whole thing is configurable with a new option 'originalto'. I have updated the sourcecode as well as the documentation. The 'haproxy-en.txt' and 'haproxy-fr.txt' files are untouched, due to lack of my french language knowledge. ;) Also the patch adds this header for IPv4 only. I haven't any IPv6 test environment running here and don't know if getsockopt() with SO_ORIGINAL_DST will work on IPv6. If someone knows it and wants to test it I can modify the diff. Feel free to ask me questions or things which should be changed. :) --Maik	2009-05-01 16:22:33 +02:00
Willy Tarreau	2df8d713b3	[BUG] fix wrong pointer arithmetics in HTTP message captures The pointer arithmetics was wrong in http_capture_bad_message(). This has no impact right now because the error only msg->som was affected and right now it's always 0. But this was a bug waiting for keepalive support to strike.	2009-05-01 11:33:17 +02:00
Willy Tarreau	1772ece025	[MINOR] fix several printf formats and missing arguments Last patch revealed a number of mistakes in printf-like calls, mostly int/long mismatches, and a few missing arguments.	2009-04-03 14:49:12 +02:00
Willy Tarreau	4076a15255	[MEDIUM] http: capture invalid requests/responses even if accepted It's useful to be able to accept an invalid header name in a request or response but still be able to monitor further such errors. Now, when an invalid request/response is received and accepted due to an "accept-invalid-http-{request\|response}" option, the invalid request will be captured for later analysis with "show errors" on the stats socket.	2009-04-02 21:36:37 +02:00
Willy Tarreau	32a4ec0ed7	[MEDIUM] http: add options to ignore invalid header names Sometimes it is required to let invalid requests pass because applications sometimes take time to be fixed and other servers do not care. Thus we provide two new options : option accept-invalid-http-request (for the frontend) option accept-invalid-http-response (for the backend) When those options are set, invalid requests or responses do not cause a 403/502 error to be generated.	2009-04-02 21:36:34 +02:00
Willy Tarreau	2ab85e6fee	[BUG] don't set an expiration date directly from now_ms now_ms can be zero, don't set ->analyse_exp directly from it, we must use tick_add() instead.	2009-03-29 10:24:15 +02:00
Willy Tarreau	1b194fe03e	[OPTIM] buffer: new BF_READ_DONTWAIT flag reduces EAGAIN rates When the reader does not expect to read lots of data, it can set BF_READ_DONTWAIT on the request buffer. When it is set, the stream_sock_read callback will not try to perform multiple reads, it will return after only one, and clear the flag. That way, we can immediately return when waiting for an HTTP request without trying to read again. On pure request/responses schemes such as monitor-uri or redirects, this has completely eliminated the EAGAIN occurrences and the epoll_ctl() calls, resulting in a performance increase of about 10%. Similar effects should be observed once we support HTTP keep-alive since we'll immediately disable reads once we get a full request.	2009-03-21 21:57:30 +01:00
Willy Tarreau	8365f9335d	[CLEANUP] http: remove some commented out obsolete code in process_response	2009-03-15 23:11:49 +01:00
Willy Tarreau	6bf1736fb1	[BUILD] proto_http did not build on gcc-2.95 (again) move the DPRINTF below the local variable declarations. (cherry picked from commit `7b92db4cd5`) The patch accidently got reverted.	2009-03-08 23:10:34 +01:00
Willy Tarreau	6f0aa476bd	[CLEANUP] buffer_flush() was misleading, rename it as buffer_erase	2009-03-08 20:33:29 +01:00
Willy Tarreau	ec22b2c27a	[CLEANUP] remove last references to term_trace term_trace was very useful while reworking the lower layers but has almost completely been removed from every place it was referenced. Even the few remaining ones were not accurate, so it's better to completely remove those references and re-add them from scratch later if needed.	2009-03-06 13:07:40 +01:00
Willy Tarreau	7f062c4193	[MEDIUM] measure and report session rate on frontend, backends and servers With this change, all frontends, backends, and servers maintain a session counter and a timer to compute a session rate over the last second. This value will be very useful because it varies instantly and can be used to check thresholds. This value is also reported in the stats in a new "rate" column.	2009-03-05 18:43:00 +01:00
Willy Tarreau	defc52da95	[MINOR] errors dump must use user-visible date, not internal date.	2009-03-04 20:53:44 +01:00
Willy Tarreau	f073a83b1d	[MEDIUM] store a complete dump of request and response errors in proxies Each proxy instance, either frontend or backend, now has some room dedicated to storing a complete dated request or response in case of parsing error. This will make it possible to consult errors in order to find the exact cause, which is particularly important for troubleshooting faulty applications.	2009-03-04 10:26:38 +01:00
Willy Tarreau	7552c031c0	[MINOR] ensure that http_msg_analyzer updates pointer to invalid char If an invalid character is encountered while parsing an HTTP message, we want to get buf->lr updated to reflect it. Along this change, a few useless __label__ declarations have been removed because they caused gcc to consume stack space without putting anything there.	2009-03-01 11:10:40 +01:00
Willy Tarreau	7b92db4cd5	[BUILD] proto_http did not build on gcc-2.95 move the DPRINTF below the local variable declarations.	2009-02-24 10:48:35 +01:00
Willy Tarreau	fe651a50d6	[MINOR] redirect: in prefix mode a "/" means not to change the URI If the prefix is set to "/", it means the user does not want to alter the original URI, so we don't want to insert a new slash before the original URI. (cherry-picked from commit 02a35c74942c1bce762e996698add1270e6a5030)	2008-12-07 23:48:39 +01:00
Willy Tarreau	0140f2553c	[MINOR] redirect: add support for "set-cookie" and "clear-cookie" It is now possible to set or clear a cookie during a redirection. This is useful for logout pages, or for protecting against some DoSes. Check the documentation for the options supported by the "redirect" keyword. (cherry-picked from commit 4af993822e880d8c932f4ad6920db4c9242b0981)	2008-12-07 23:46:38 +01:00
Willy Tarreau	79da4697ca	[MINOR] redirect: add support for the "drop-query" option If "drop-query" is present on a "redirect" line using the "prefix" mode, then the returned Location header will be the request URI without the query-string. This may be used on some login/logout pages, or when it must be decided to redirect the user to a non-secure server. (cherry-picked from commit f2d361ccd73aa16538ce767c766362dd8f0a88fd)	2008-12-07 23:42:01 +01:00
Willy Tarreau	fd39ddaa3d	[BUG] cookie capture is declared in the frontend but checked on the backend Cookie capture would only work by pure luck on the request but did never work on responses since only the backend was checked. The fix consists in always checking frontend for cookie captures. (cherry picked from commit a83c5ba9315a7c47cda2698280b7e49a9d3eb374)	2008-12-07 23:36:52 +01:00
Willy Tarreau	0a46489228	[MINOR] slightly rebalance stats_dump_{raw,http} Both should process the response buffer equally. They now both clear the hijack bit once done, and both receive a pointer to the response buffer in their arguments.	2008-12-07 18:30:00 +01:00
Willy Tarreau	01bf8675ed	[MEDIUM] reference the current hijack function in the buffer itself Instead of calling a hard-coded function to produce data, let's reference this function into the buffer and call it from there when BF_HIJACK is set. This goes in the direction of more generic session management code.	2008-12-07 18:03:29 +01:00
Willy Tarreau	59234e91c2	[MEDIUM] rename process_request to http_process_request Now the function only does HTTP request and nothing else. Also pass the request buffer to it.	2008-11-30 23:51:27 +01:00
Willy Tarreau	d34af78a34	[MEDIUM] move the HTTP request body analyser out of process_request(). A new function http_process_request_body() has been created to process the request body. Next step is now to clean up process_request().	2008-11-30 23:36:37 +01:00
Willy Tarreau	60b85b0694	[MEDIUM] extract the HTTP tarpit code from process_request(). The tarpit is now an autonomous independant analyser.	2008-11-30 23:28:40 +01:00
Willy Tarreau	edcf6687d6	[MEDIUM] extract TCP request processing from HTTP The TCP analyser has moved to proto_tcp.c. Breaking the function has required finer use of the return value and adding some tests to process_session().	2008-11-30 23:15:34 +01:00
Willy Tarreau	0cac36f415	[MEDIUM] make the http server error function a pointer in the session It was a bit awkward to have session.c call return_srv_error() for HTTP error messages related to servers. The function has been adapted to be passed a pointer to the faulty stream interface, and is now a pointer in the session. It is possible that in the future, it will become a callback in the stream interface itself.	2008-11-30 20:44:17 +01:00
Willy Tarreau	2d3d94cf23	[MINOR] replace srv_close_with_err() with http_server_error() The new function looks like the previous one except that it operates at the stream interface level and assumes an already closed SI. Also remove some old unused occurrences of srv_close_with_err().	2008-11-30 20:28:57 +01:00
Willy Tarreau	dded32defa	[MINOR] replace client_retnclose() with stream_int_retnclose() This makes more sense to return a message to a stream interface than to a session. senddata.{c,h} have been removed.	2008-11-30 19:48:07 +01:00
Willy Tarreau	81acfab4fd	[MINOR] replace the ambiguous client_return function by stream_int_return This one applies to a stream interface, which makes more sense.	2008-11-30 19:22:53 +01:00
Willy Tarreau	a5555ec68a	[MINOR] call session->do_log() for logging In order to avoid having to call per-protocol logging function directly from session.c, it's better to assign the logging function when the session is created. This also eliminates a test when the function is needed, and opens the way to more complete logging functions.	2008-11-30 19:02:32 +01:00
Willy Tarreau	55a8d0e1bb	[CLEANUP] move the session-related functions to session.c proto_http.c was not suitable for session-related processing, it was just convenient for the tranformation. Some more splitting must occur: process_request/response in proto_http.c must be split again per protocol, and the caller must run a list. Some functions should be directly attached to the session or the buffer (eg: perform_http_redirect, return_srv_error, http_sess_log).	2008-11-30 18:47:21 +01:00
Willy Tarreau	fe3718ab79	[MAJOR] complete layer4/7 separation All the processing has now completely been split in layers. As of now, everything is still in process_session() which is not the right place, but the code sequence works. Timeouts, retries, errors, all work. The shutdown sequence has been strictly applied: BF_SHUTR/BF_SHUTW are only assigned by lower layers. Upper layers can only indicate their wish to close using BF_SHUTR_NOW and BF_SHUTW_NOW. When a shutdown is performed on a stream interface, the buffer flags are updated accordingly and re-checked by upper layers. A lot of care has been taken to ensure that aborts during intermediate connection setups are correctly handled and shutdowns correctly propagated to both buffers. A future evolution would consist in ensuring that BF_SHUT?_NOW may be set at any time, and applies only when the buffer is empty. This might help with error messages, but might complicate the processing of data remaining in buffers. Some useless buffer flag combinations have been removed. Stat counters are still broken (eg: per-server total number of sessions). Error messages should be delayed to the close instant and be produced by protocol. Many functions must now move to proper locations.	2008-11-30 18:14:12 +01:00
Willy Tarreau	99126c35c1	[MEDIUM] make the stream interface control the SHUT{R,W} bits It's better that the stream interface controls the BF_SHUT* bits so that they always reflect the real state of the interface.	2008-11-27 22:32:14 +01:00
Willy Tarreau	8bfa426cad	[MEDIUM] process shutw during connection attempt It sometimes happens that a connection is aborted at the exact same moment it establishes. We have to close the socket and not only to shut it down for writes. Some corner cases remain. We have to handle the shutr/shutw at the stream interface and only report the status to the buffer, not the opposite.	2008-11-27 09:25:45 +01:00
Willy Tarreau	0a5d5ddeb9	[MEDIUM] remove stream_sock_update_data() Two new functions are used instead : buffer_check_{shutr,shutw}. It is indeed more adequate to check for new closures only when the buffer reports them. Several remaining unclosed connections were detected after a test, even before this patch, so a bug remains. To reproduce, try the following during 30 seconds : inject30l4 -n 20000 -l -t 1000 -P 10 -o 4 -u 100 -s 100 -G 127.0.0.1:8000/	2008-11-23 19:31:35 +01:00
Willy Tarreau	74ab2ac7b0	[MEDIUM] stream_interface: added a DISconnected state between CON/EST and CLO There were rare situations where it was not easy to detect that a failed session attempt had occurred and needed some server cleanup. In particular, client aborts sometimes lead to session leaks on the server side. A new state "SI_ST_DIS" (disconnected) has been introduced for this. When a session has been closed at a stream interface but the server cleanup has not occurred, this state is entered instead of CLO. The cleanup is then performed there and the state goes to CLO. A new diagram has been added to show possible stream_interface state transitions that can occur in a stream-sock. It makes debugging easier.	2008-11-23 17:23:07 +01:00
Willy Tarreau	4351b3a4ca	[MEDIUM] continue layering cleanups. The server sessions are now only decremented when entering SI_ST_CER and SI_ST_CLO states. A state is clearly missing between EST and CLO, or after CLO (eg: END), because many cleanups are performed upon CLO and must rely on tricks to ensure being done only once. The goal of next changes will be to improve what has been started. Ideally, the FD should only notify the SI about the change, which should itself only notify the session when it has some news or when it needs help (eg: redispatch). The buffer's error processing should not change the FD's status immediately, otherwise we risk race conds between a pending connect and a shutw (for instance). Also, the new connect attempt should only be made after layer 7 and all the crap above buffers.	2008-11-12 01:51:41 +01:00
Willy Tarreau	1e62de615b	[MEDIUM] add the SN_CURR_SESS flag to the session to track open sessions It is quite hard to track when the current session has already been counted or discounted from the server's total number of established sessions. For this reason, we introduce a new session flag, SN_CURR_SESS, which indicates if the current session is one of those reported by the server or not. It simplifies session accounting and makes it far more robust. It also makes it possible to perform a last-minute cleanup during session_free(). Right now, with this fix and a few more buffer transitions fixes, no session were found to remain after a test.	2008-11-11 20:26:58 +01:00
Willy Tarreau	cff6411f9a	[MAJOR] add a connection error state to the stream_interface Tracking connection status changes was hard, and some code was redundant. A new SI_ST_CER state was added to the stream interface to indicate a past connection error, and an SI_FL_ERR flag was added to report past I/O error. The stream_sock code does not set the connection to SI_ST_CLO anymore in case of I/O error, it's the upper layer which does it. This makes it possible to know exactly when the file descriptors are allocated. The new SI_ST_CER state permitted to split tcp_connection_status() in two parts, one processing SI_ST_CON and the other one SI_ST_CER. Synchronous connection errors now make use of this last state, hence eliminating duplicate code. Some ib<->ob copy paste errors were found and fixed, and all entities setting SI_ST_CLO also shut the buffers down. Some of these stream_interface specific functions and structures have migrated to a new stream_interface.c file. Some types of errors are still not detected by the buffers. For instance, let's assume the following scenario in one single pass of process_session: a connection sits in SI_ST_TAR state during a retry. At TAR expiration, a new connection attempt is made, the connection is obtained and srv->cur_sess is increased. Then the buffer timeout is fires and everything is cleared, the new state becomes SI_ST_CLO. The cleaning code checks that previous state was either SI_ST_CON or SI_ST_EST to release the connection. But that's wrong because last state is still SI_ST_TAR. So the server's connection count does not get decreased. This means that prev_state must not be used, and must be replaced by some transition detection instead of level detection. The following debugging line was useful to track state changes : fprintf(stderr, "%s:%d: cs=%d ss=%d(%d) rqf=0x%08x rpf=0x%08x\n", __FUNCTION__, __LINE__, s->si[0].state, s->si[1].state, s->si[1].err_type, s->req->flags, s-> rep->flags);	2008-11-03 06:26:53 +01:00
Willy Tarreau	efb453c259	[MAJOR] migrate the connection logic to stream interface The connection setup code has been refactored in order to make it run only on low level (stream interface). Several complicated functions have been removed from backend.c, and we now have sess_update_stream_int() to manage an assigned connection, sess_prepare_conn_req() to assign a server to a connection request, perform_http_redirect() to redirect instead of connecting to server, and return_srv_error() to return connection error status messages. The stream_interface status changes are checked before adjusting buffer flags, so that the buffers can be informed about this lower level update. A new connection is initiated by changing si->state from SI_ST_INI to SI_ST_REQ. The code seems to work but is awfully dirty. Some functions need to be moved, and the layering is not yet quite clear. A lot of dead old code has simply been removed.	2008-11-02 10:19:10 +01:00
Willy Tarreau	d7704b5343	[MINOR] add an expiration flag to the stream_sock_interface This expiration flag is used to indicate that the timer has expired without having to check it everywhere.	2008-11-02 10:19:10 +01:00
Willy Tarreau	3c6ab2e28d	[MEDIUM] use buffer_check_timeouts instead of stream_sock_check_timeouts() It's more appropriate to use buffer_check_timeouts() to check for buffer timeouts and si->shutw/shutr to shutdown the stream interfaces.	2008-11-02 10:19:10 +01:00
Willy Tarreau	3537467679	[MEDIUM] move QUEUE and TAR timers to stream interfaces It was not practical to have QUEUE and TAR timers in buffers, as they caused triggering of the timeout flags. Move them to the stream interface where they belong.	2008-11-02 10:19:09 +01:00
Willy Tarreau	a37095b96f	[CLEANUP] process_session: move debug outputs out of the critical loop The if(debug&closed) printfs have moved outside of the loop. It also permitted to merge several of them.	2008-11-02 10:19:09 +01:00
Willy Tarreau	4ffd51a848	[MEDIUM] process_session: make use of the new buffer flags Now we have almost two distinct parts between tcp and http. Only the connection establishment code still requires some resynchronization, the rest does not.	2008-11-02 10:19:09 +01:00
Willy Tarreau	9a2d15429d	[MEDIUM] buffers: add BF_READ_ATTACHED and BF_ANA_TIMEOUT Those two flags will be used to wake up analysers only when needed.	2008-11-02 10:19:09 +01:00
Willy Tarreau	48adac5db9	[MEDIUM] stream interface: add the ->shutw method as well as in and out buffers Those entries were really needed for cleaner and better code. Using them has permitted to automatically close a file descriptor during a shut write, reducing by 20% the number of calls to process_session() and derived functions. Process_session() does not need to know the file descriptor anymore, though it still remains very complicated due to the special case for the connect mode.	2008-11-02 10:19:08 +01:00
Willy Tarreau	e5ed406715	[MAJOR] make stream sockets aware of the stream interface As of now, a stream socket does not directly wake up the task but it does contact the stream interface which itself knows the task. This allows us to perform a few cleanups upon errors and shutdowns, which reduces the number of calls to data_update() from 8 per session to 2 per session, and make all the functions called in the process_session() loop completely swappable. Some improvements are required. We need to provide a shutw() function on stream interfaces so that one side which closes its read part on an empty buffer can propagate the close to the remote side.	2008-11-02 10:19:08 +01:00
Willy Tarreau	cb651251f9	[OPTIM] ev_sepoll: detect newly created FDs and check them once When an accept() creates a new FD, it is already marked as set for reads. But the task will be woken up without first checking if the socket could be read. The speculative I/O gives us a chance to either read the FD if there are data pending on it, or immediately mark it for poll mode if nothing is pending. Simply doing this reduces the number of calls to process_session from 6 to 5 per session, 2 to 1 calls to process_request, 10% less calls to epoll_ctl, fd_clr, fd_set, stream_sock_data_update, 20% less eb32_insert/eb_delete, etc... General performance increase seems to be around 3%.	2008-11-02 10:19:07 +01:00
Willy Tarreau	3da77c5abd	[MINOR] re-arrange buffer flags and rename some of them The buffer flags became a big bazaar. Re-arrange them so that their names are more explicit and so that they are more easily readable in hex form. Some aggregates have also been adjusted.	2008-11-02 10:19:07 +01:00
Willy Tarreau	72b179a53c	[MEDIUM] reintroduce BF_HIJACK with produce_content The stats dump are back. Even very large config files with 5000 servers work fast and well. The SN_SELF_GEN flag has completely been removed.	2008-11-02 10:19:06 +01:00
Willy Tarreau	36e6a41bc8	[MINOR] only call flow analysers when their read side is connected. It's useless to call flow analysers when their read side has not seen a connection yet.	2008-11-02 10:19:06 +01:00
Willy Tarreau	3a16b2c9cd	[MEDIUM] split stream_sock_process_data It was a waste to constantly update the file descriptor's status and timeouts during a flags update. So stream_sock_process_data has been slit in two parts : stream_sock_data_update() => computes updated flags stream_sock_data_finish() => computes timeouts Only the first one is called during flag updates. The second one is only called upon completion. The number of calls to fd_set/fd_clr has now significantly dropped. Also, it's useless to check for errors and timeouts in the process_session() loop, it's enough to check for them at the beginning.	2008-11-02 10:19:06 +01:00
Willy Tarreau	f9839bdffe	[MAJOR] make the client side use stream_sock_process_data() The client side now relies on stream_sock_process_data(). One part has not yet been re-implemented, it concerns the calls to produce_content(). process_session() has been adjusted to correctly check for changing bits in order not to call useless functions too many times. It already appears that stream_sock_process_data() should be split so that the timeout computations are only performed at the exit of process_session().	2008-11-02 10:19:06 +01:00
Willy Tarreau	2d2127989c	[MEDIUM] stream_sock_process_data moved to stream_sock.c The old temporary process_srv_data function moved to stream_sock.c.	2008-11-02 10:19:05 +01:00
Willy Tarreau	8a8188301b	[MEDIUM] process_srv_data: ensure that we always correctly re-arm timeouts We really want to ensure that we don't miss a timeout update and do not update them for nothing. So the code takes care of updating the timeout in the two following circumstances : - it was not set - some I/O has been performed Maybe we'll be able to remove that from stream_sock_{read\|write}, or we'll find a way to ensure that we never have to re-enable this.	2008-11-02 10:19:05 +01:00
Willy Tarreau	2ac679d9aa	[MEDIUM] third cleanup and optimization of process_srv_data() Some repeated tests were factored out. Now the code makes sense and is fully understandable.	2008-11-02 10:19:05 +01:00
Willy Tarreau	8fbd3b4ce7	[MEDIUM] second level of code cleanup for process_srv_data Now the function is 100% server-independant. Next step will consist in using the same function for the client side too.	2008-11-02 10:19:05 +01:00
Willy Tarreau	376580a873	[MEDIUM] massive cleanup of process_srv() Server-specific calls were extracted and moved to the caller. The function is now nearly server-agnostic.	2008-11-02 10:19:05 +01:00
Willy Tarreau	8b46aa01ac	[OPTIM] remove useless fd_set(read) upon shutdown(write) Those old tricks are no longer needed and are overwritten anyway. Remove them.	2008-11-02 10:19:05 +01:00
Willy Tarreau	fa7e10251d	[MAJOR] rework of the server FSM srv_state has been removed from HTTP state machines, and states have been split in either TCP states or analyzers. For instance, the TARPIT state has just become a simple analyzer. New flags have been added to the struct buffer to compensate this. The high-level stream processors sometimes need to force a disconnection without touching a file-descriptor (eg: report an error). But if they touched BF_SHUTW or BF_SHUTR, the file descriptor would not be closed. Thus, the two SHUT?_NOW flags have been added so that an application can request a forced close which the stream interface will be forced to obey. During this change, a new BF_HIJACK flag was added. It will be used for data generation, eg during a stats dump. It prevents the producer on a buffer from sending data into it. BF_SHUTR_NOW /* the producer must shut down for reads ASAP / BF_SHUTW_NOW / the consumer must shut down for writes ASAP / BF_HIJACK / the producer is temporarily replaced / BF_SHUTW_NOW has precedence over BF_HIJACK. BF_HIJACK has precedence over BF_MAY_FORWARD (so that it does not need it). New functions buffer_shutr_now(), buffer_shutw_now(), buffer_abort() are provided to manipulate BF_SHUT flags. A new type "stream_interface" has been added to describe both sides of a buffer. A stream interface has states and error reporting. The session now has two stream interfaces (one per side). Each buffer has stream_interface pointers to both consumer and producer sides. The server-side file descriptor has moved to its stream interface, so that even the buffer has access to it. process_srv() has been split into three parts : - tcp_get_connection() obtains a connection to the server - tcp_connection_failed() tests if a previously attempted connection has succeeded or not. - process_srv_data() only manages the data phase, and in this sense should be roughly equivalent to process_cli. Little code has been removed, and a lot of old code has been left in comments for now.	2008-11-02 10:19:04 +01:00
Willy Tarreau	41f40ede3b	[MEDIUM] make it possible for analysers to follow the whole session Some analysers will need to remain present after connection is established. Change the way BF_MAY_FORWARD is set to allow this.	2008-11-02 10:19:04 +01:00
Willy Tarreau	c52164a1a8	[BUG] process_request: HTTP body analysis must return zero if missing data This missing return and timeout check caused an infinite loop too.	2008-08-17 19:27:11 +02:00
Willy Tarreau	2500981dc1	[BUG] process_cli/process_srv: don't call shutdown when already done A few missing checks of BF_SHUTR and BF_SHUTW caused busy loops upon some error paths.	2008-08-17 18:16:38 +02:00
Willy Tarreau	ffab5b4ab0	[MEDIUM] merge inspect_exp and txn->exp into request buffer Since we may have several analysers on a buffer, it's more convenient to have the analyser timeout attached to the buffer itself.	2008-08-17 18:03:28 +02:00
Willy Tarreau	6d2889ba3d	[OPTIM] process_cli/process_srv: reduce the number of tests We can skip a number of tests by simply checking a few flags, it saves a few CPU cycles in the fast path.	2008-08-17 16:25:06 +02:00
Willy Tarreau	2df28e8110	[MEDIUM] session: move the analysis bit field to the buffer It makes more sense to store the list of analysers in the buffer than in the session since they are precisely plugged onto one buffer.	2008-08-17 15:20:19 +02:00
Willy Tarreau	f495ddf9d4	[MINOR] ensure the termination flags are set by process_xxx When any processing remains on a buffer, it must be up to the processing functions to set the termination flags, because they are the only ones who know about higher levels.	2008-08-17 14:38:41 +02:00
Willy Tarreau	507385d0e1	[MEDIUM] centralize buffer timeout checks at the top of process_session it's more efficient and easier to check all the timeouts at once and always rely on the buffer flags than to check them everywhere.	2008-08-17 13:04:25 +02:00
Willy Tarreau	26ed74dadc	[MEDIUM] use buffer->wex instead of buffer->cex for connect timeout It's a shame not to use buffer->wex for connection timeouts since by definition it cannot be used till the connection is not established. Using it instead of ->cex also makes the buffer processing more symmetric.	2008-08-17 12:11:14 +02:00
Willy Tarreau	dafde43410	[MAJOR] process_session: rely only on buffer flags Instead of calling all functions in a loop, process_session now calls them according to buffer flags changes. This ensures that we almost never call functions for nothing. The flags settings are still quite coarse, but the number of average functions calls per session has dropped from 31 to 18 (the calls to process_srv dropped from 13 to 7 and the calls to process_cli dropped from 13 to 8). This could still be improved by memorizing which flags each function uses, but that would add a level of complexity which is not desirable and maybe even not worth the small gain.	2008-08-17 01:15:41 +02:00
Willy Tarreau	e393fe224b	[MEDIUM] buffers: add BF_EMPTY and BF_FULL to remove dependency on req/rep->l It is not always convenient to run checks on req->l in functions to check if a buffer is empty or full. Now the stream_sock functions set flags BF_EMPTY and BF_FULL according to the buffer contents. Of course, functions which touch the buffer contents adjust the flags too.	2008-08-16 22:18:07 +02:00

... 9 10 11 12 13 ...

1231 Commits