haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 08:37:04 +02:00

Author	SHA1	Message	Date
Willy Tarreau	cda7f3f5c2	MINOR: stream: don't prune variables if the list is empty The vars_prune() and vars_init() functions involve locking while most of the time there is no variable at all in streams nor sessions. Let's check for emptiness before calling these functions. Simply doing this has increased the multithreaded performance from 1.5 to 5% depending on the workload.	2018-10-28 13:46:47 +01:00
Lukas Tribus	80512b186f	BUG/MINOR: only auto-prefer last server if lb-alg is non-deterministic While "option prefer-last-server" only applies to non-deterministic load balancing algorithms, 401/407 responses actually caused haproxy to prefer the last server unconditionally. As this breaks deterministic load balancing algorithms like uri, this patch applies the same condition here. Should be backported to 1.8 (together with "BUG/MINOR: only mark connections private if NTLM is detected").	2018-10-27 22:10:32 +02:00
Lukas Tribus	fd9b68c48e	BUG/MINOR: only mark connections private if NTLM is detected Instead of marking all connections that see a 401/407 response private (for connection reuse), this patch detects a RFC4559/NTLM authentication scheme and restricts the private setting to those connections. This is so we can reuse connections with 401/407 responses with deterministic load balancing algorithms later (which requires another fix). This fixes the problem reported here by Elliot Barlas : https://discourse.haproxy.org/t/unable-to-configure-load-balancing-per-request-over-persistent-connection/3144 Should be backported to 1.8.	2018-10-27 22:10:29 +02:00
Willy Tarreau	ede3d884fc	MEDIUM: channel: merge back flags CF_WRITE_PARTIAL and CF_WRITE_EVENT The behaviour of the flag CF_WRITE_PARTIAL was modified by commit `95fad5ba4` ("BUG/MAJOR: stream-int: don't re-arm recv if send fails") due to a situation where it could trigger an immediate wake up of the other side, both acting in loops via the FD cache. This loss has caused the need to introduce CF_WRITE_EVENT as commit `c5a9d5bf`, to replace it, but both flags express more or less the same thing and this distinction creates a lot of confusion and complexity in the code. Since the FD cache now acts via tasklets, the issue worked around in the first patch no longer exists, so it's more than time to kill this hack and to restore CF_WRITE_PARTIAL's semantics (i.e.: there has been some write activity since we last left process_stream). This patch mostly reverts the two commits above. Only the part making use of CF_WROTE_DATA instead of CF_WRITE_PARTIAL to detect the loss of data upon connection setup was kept because it's more accurate and better suited.	2018-10-26 08:32:57 +02:00
Christopher Faulet	66943a4903	CLEANUP: http: Remove the unused function http_find_header	2018-10-23 10:22:36 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" have been remove. Now, "http_header_add_tail2" is always used. And the function "capture_headers" have been renamed into "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions's prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
Willy Tarreau	79e57336b5	REORG: http: move the code to different files The current proto_http.c file is huge and contains different processing domains making it very difficult to work on an alternative representation. This commit moves some parts to other files : - ACL registration code => http_acl.c This code only creates some ACL mappings and doesn't know anything about HTTP nor about the representation. This code could even have moved to acl.c but it was not worth polluting it again. - HTTP sample conversion => http_conv.c This code doesn't depend on the internal representation but definitely manipulates some HTTP elements, such as dates. It also has access to captures. - HTTP sample fetching => http_fetch.c This code does depend entirely on the internal representation but is totally independent on the analysers. Placing it into a different file will ease the transition to the new representation and the creation of a wrapper if required. An include file was created due to CHECK_HTTP_MESSAGE_FIRST() being used at various places. - HTTP action registration => http_act.c This code doesn't directly interact with the messages nor the transaction but it does so via some exported http functions like http_replace_req_line() or http_set_status() so it will be easier to change only this after the conversion. - a few very generic parts were found and moved to http.{c,h} as relevant. It is worth noting that the functions moved to these new files are not referenced anywhere outside of the files and are only called as registered callbacks, so these files do not even require associated include files.	2018-10-02 18:26:59 +02:00
Christopher Faulet	ca874b8d92	BUG/MEDIUM: http: Don't parse chunked body if there is no input data With recent modifications on the buffers API, when a buffer is released (calling b_free), we replace it by BUF_NULL where the area pointer is NULL. So many operations, like b_peek, must be avoided on a released or not allocated buffer. These changes were mainly made in the commit `c9fa048` ("MAJOR: buffer: finalize buffer detachment"). Since this commit, HAProxy can crash during the body parsing of chunked HTTP messages because there is no check on the channel's buffer in HTTP analyzers (http_request_forward_body and http_response_forward_body) nor in H1 functions reponsible to parse chunked content (h1_skip_chunk_crlf & co). If a stream is woken up after all input data were forwarded, its input channel's buffer is released (so set to BUF_NULL). In this case, if we resume the parsing of a chunk, HAProxy crashes. To fix this issue, we just skip the parsing of chunks if there is no input data for the corresponding channel. This is only done if the message state is strickly lower to HTTP_MSG_ENDING.	2018-09-20 14:37:58 +02:00
Willy Tarreau	b05e48a54d	BUILD: http: address a couple of null-deref warnings at -Wextra These two warnings are caused by the use of objt_server() without checking its result. These are turned to __objt_server() which is safe there.	2018-09-20 11:42:15 +02:00
Willy Tarreau	ab813a4b05	REORG: http: move some header value processing functions to http.c The following functions only deal with header field values and are agnostic to the HTTP version so they were moved to http.c : http_header_match2(), find_hdr_value_end(), find_cookie_value_end(), extract_cookie_value(), parse_qvalue(), http_find_url_param_pos(), http_find_next_url_param(). Those lacking the "http_" prefix were modified to have it.	2018-09-11 10:30:25 +02:00
Willy Tarreau	e10cd48a83	REORG: http: move the log encoding tables to log.c There are 3 tables in proto_http which are used exclusively by logs : hdr_encode_map[], url_encode_map[] and http_encode_map[]. They indicate what characters are safe to be emitted in logs depending on the part of the message where they are placed. Let's move this to log.c, as well as its initialization. It's worth noting that the rfc5424 map was already initialized there.	2018-09-11 10:30:25 +02:00
Willy Tarreau	04f1e2d202	REORG: http: move error codes production and processing to http.c These error codes and messages are agnostic to the version, even if they are represented as HTTP/1.0 messages. Ultimately they will have to be transformed into internal HTTP messages to be used everywhere. The HTTP/1.1 100 Continue message was turned to an IST and the local copy in the Lua code was removed.	2018-09-11 10:30:25 +02:00
Willy Tarreau	6b952c8101	REORG: http: move http_get_path() to http.c This function is purely HTTP once http_txn is put aside. So the original one was renamed to http_txn_get_path() and it extracts the relevant offsets from the txn to pass them to http_get_path(). One benefit of the new version is that it returns the length at the same time so that allowed to slightly simplify http_get_path_from_string() which had to look up the end pointer previously and which is not needed anymore.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	ddb68ac69e	REORG: cli: move the "show errors" handler from http to proxy There's nothing HTTP-specific there anymore at all, let's move this to the proxy where it belongs.	2018-09-07 18:36:50 +02:00
Willy Tarreau	fd9419d560	MINOR: http: remove the pointer to the error snapshot in http_capture_bad_message() It's not needed anymore as we know the side thanks to the channel. This will allow the proxy generic code to better manage the error snapshots.	2018-09-07 18:36:04 +02:00
Willy Tarreau	ef3ca73fc3	MINOR: http: make the HTTP error capture rely on the generic proxy code Now that we have a generic error capture function, let's simplify http_capture_bad_message() to make use of it. At this point the API is not changed at all, but it could be further simplified.	2018-09-07 18:36:04 +02:00
Willy Tarreau	7ccdd8dad9	MEDIUM: snapshot: implement a show() callback and use it for HTTP The HTTP dumps are now configurable in the code : "show errors" now calls a protocol-specific function to emit the decoded output. For now only HTTP is implemented.	2018-09-07 18:36:01 +02:00
Willy Tarreau	0b5b480594	MEDIUM: snapshot: start to reorder the HTTP snapshot output a little bit The output of "show errors" was slightly reordered to split the HTTP part in a single chunk_appendf() call. The useless buffer total input was replaced to report the buffer's start offset, which is the offset in the stream of the first input byte (thus not counting output). Also it was the opportunity to stop calling the stream "session".	2018-09-07 17:48:14 +02:00
Willy Tarreau	7480f323ff	MINOR: snapshot: split the error snapshots into common and proto-specific parts The idea will be to make the error snapshot feature accessible to other protocols than just HTTP. This patch only introduces an "http_snapshot" structure and renames a few fields to make things more explicit. The HTTP part was installed inside a union so that we can easily add more protocols in the future.	2018-09-07 16:13:45 +02:00
Willy Tarreau	5865a8fe69	MINOR: snapshot: restart on the event ID and not the stream ID The snapshots have the ability to restart a partial dump and they use the stream ID as the restart point. Since it's purely HTTP, let's use the event ID instead.	2018-09-07 15:00:43 +02:00
Willy Tarreau	e9e878a056	BUG/MINOR: http/threads: atomically increment the error snapshot ID Let's use an atomic increment for the error snapshot, as we'd rather not assign the same ID to two errors happening in parallel. It's very unlikely that it will ever happen though. This patch must be backported to 1.8 with the other one it relies on ("MINOR: thread: implement HA_ATOMIC_XADD()").	2018-09-07 11:31:58 +02:00
Willy Tarreau	90a7c03ec0	BUG/MINOR: stream: use atomic increments for the request counter The request counter is incremented when creating a new stream and when resetting a stream, preparing for a new request. Unfortunately during the thread migration this was missed, leading to non-atomic increments in case threads are in use. The most visible side effect is that two requests may have the same ID from time to time in the logs. However the SPOE also uses this ID to route responses back to the stream so it may also lead to occasional spurious SPOE timeouts. Note that it still doesn't guarantee temporal unicity in the stream identifiers since a long and a short connection could technically use the same ID. The likeliness that this happens at the same time is almost null (roughly threads*runqueue_depth/2^32 that it happens in the same poll loop), but it will have to be addressed later anyway. This patch must be backported to 1.8 with the other one it relies on ("MINOR: thread: implement HA_ATOMIC_XADD()").	2018-09-05 16:30:19 +02:00
Patrick Hemmer	e3faf02581	BUG/MEDIUM: lua: reset lua transaction between http requests Previously LUA code would maintain the transaction state between http requests, resulting in things like txn:get_priv() retrieving data from a previous request. This addresses the issue by ensuring the LUA state is reset between requests. Co-authored-by: Tim D�sterhus <tim@bastelstu.be>	2018-08-25 07:51:02 +02:00
Willy Tarreau	9c768fdca1	BUG/MEDIUM: http: don't store url_decode() result in the samples's length By convenience or laziness we used to store url_decode()'s return code into smp->data.u.str.data. The result checks applied there compare it to 0 while it's now unsigned since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	6e27be1a5d	BUG/MEDIUM: http: don't store exp_replace() result in the trash's length By convenience or laziness we used to store exp_replace()'s return code into trash.data. The result checks applied there compare trash.data to -1 while it's now unsigned since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	5f6333caca	BUG/MINOR: chunks: do not store -1 into chunk_printf() in case of error Since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct") a chunk length is unsigned so we can't reliably store -1 and check for negative values in the caller. Only one such location was found in proto_http's http-request auth rules (which cannot realistically fail). No backport is needed.	2018-08-22 05:16:31 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00
Willy Tarreau	66425e31b5	MINOR: queue: make sure the pendconn is released before logging We'll soon need to rely on the pendconn position at the time of dequeuing to figure the position a stream took in the queue. Usually it's not a problem since pendconn_free() is called once the connection starts, but it will make a difference for failed dequeues (eg: queue timeout reached). Thus it's important to call pendconn_free() before logging in cases we are not certain whether it was already performed, and to call pendconn_unlink() after we know the pendconn will not be used so that we collect the queue state as accurately as possible. As a benefit it will also make the server's and backend's queues count more accurate in these cases.	2018-08-10 15:04:08 +02:00
Willy Tarreau	d0ad4a87f0	MEDIUM: queue: make pendconn_free() work on the stream instead Now pendconn_free() takes a stream, checks that pend_pos is set, clears it, and uses pendconn_unlink() to complete the job. It's cleaner and centralizes all the bookkeeping work in pendconn_unlink() only and ensures that there's a single place where the stream's position in the queue is manipulated.	2018-07-26 17:32:51 +02:00
Christopher Faulet	4507351a2f	BUG/MINOR: build: Fix compilation with debug mode enabled It remained some fragments of the old buffers API in debug messages, here and there. This was caused by the recent buffer API changes, no backport is needed.	2018-07-20 10:45:20 +02:00
Christopher Faulet	005e79e5dd	BUG/MINOR: http: Set brackets for the unlikely macro at the right place When test on the header "Early-Data" is made, the unlikely macro must encompass the condition. This patch must be backported in 1.8.	2018-07-20 10:42:53 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later permit to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the othe ones.	2018-07-19 16:23:43 +02:00
Willy Tarreau	e3128024bf	MINOR: buffer: replace buffer_replace2() with b_rep_blk() This one is more generic and designed to work on a random block. It may later get a b_rep_ist() variant since many strings are already available as (ptr,len).	2018-07-19 16:23:43 +02:00
Willy Tarreau	4d893d440c	MINOR: buffers/channel: replace buffer_insert_line2() with ci_insert_line2() There was no point keeping that function in the buffer part since it's exclusively used by HTTP at the channel level, since it also automatically appends the CRLF. This further cleans up the buffer code.	2018-07-19 16:23:43 +02:00
Willy Tarreau	72a100b386	MINOR: buffer: replace bi_fast_delete() with b_del() There's no distinction between in and out data now. The latter covers the needs of the former and supports wrapping. The extra cost is negligible given the locations where it's used.	2018-07-19 16:23:43 +02:00
Willy Tarreau	f37954d4da	MEDIUM: http: use wrappers instead of directly manipulating buffers states This is aimed at easing the transition to the new API. There are a few places which deserve some simplifications afterwards because ci_head() is called often and may be placed into a local pointer.	2018-07-19 16:23:42 +02:00
Willy Tarreau	5ba65521a3	MINOR: buffer: replace buffer_pending() with ci_data() It used to return b->i for channels, which is what ci_data() does.	2018-07-19 16:23:41 +02:00
Willy Tarreau	2375233ef0	MINOR: buffer: replace buffer_full() with channel_full() It's only used by channels since we need to know the amount of output data.	2018-07-19 16:23:41 +02:00
Willy Tarreau	d760eecf61	MINOR: buffer: replace buffer_not_empty() with b_data() or c_data() It's mostly for consistency as many places already use one of these instead.	2018-07-19 16:23:41 +02:00
Willy Tarreau	84d6b7af87	MINOR: h1: make h1_parse_chunk_size() not depend on b_ptr() anymore It's similar to the previous commit so that the function doesn't rely on buf->p anymore.	2018-07-19 16:23:41 +02:00
Willy Tarreau	c0973c6742	MINOR: h1: make h1_skip_chunk_crlf() not depend on b_ptr() anymore It now takes offsets relative to the buffer's head. It's up to the callers to add this offset which corresponds to the buffer's output size.	2018-07-19 16:23:41 +02:00
Willy Tarreau	188e230704	MINOR: buffer: convert most b_ptr() calls to c_ptr() The latter uses the channel wherever a channel is known.	2018-07-19 16:23:40 +02:00
Willy Tarreau	8f9c72d301	MINOR: buffer: remove bi_end() It was replaced by ci_tail() when the channel is known, or b_tail() in other cases.	2018-07-19 16:23:40 +02:00
Willy Tarreau	bcbd39370f	MINOR: channel/buffer: replace b_{adv,rew} with c_{adv,rew} These ones manipulate the output data count which will be specific to the channel soon, so prepare the call points to use the channel only. The b_* functions are now unused and were removed.	2018-07-19 16:23:40 +02:00
Willy Tarreau	fd8d42f496	MEDIUM: channel: make channel_slow_realign() take a swap buffer The few call places where it's used can use the trash as a swap buffer, which is made for this exact purpose. This way we can rely on the generic b_slow_realign() call.	2018-07-19 16:23:40 +02:00
Willy Tarreau	4cf1300e6a	MINOR: channel/buffer: replace buffer_slow_realign() with channel_slow_realign() and b_slow_realign() Where relevant, the channel version is used instead. The buffer version was ported to be more generic and now takes a swap buffer and the output byte count to know where to set the alignment point. The H2 mux still uses buffer_slow_realign() with buf->o but it will change later.	2018-07-19 16:23:40 +02:00
Willy Tarreau	d5b343bf9e	MINOR: channel/buffer: use c_realign_if_empty() instead of buffer_realign() This patch removes buffer_realign() and replaces it with c_realign_if_empty() instead.	2018-07-19 16:23:40 +02:00
Tim Duesterhus	3fd1973d37	MINOR: http: Log warning if (add\|set)-header fails This patch adds a warning if an http-(request\|reponse) (add\|set)-header rewrite fails to change the respective header in a request or response. This usually happens when tune.maxrewrite is not sufficient to hold all the headers that should be added.	2018-05-28 14:53:59 +02:00
William Lallemand	8a16fe0d05	BUG/MEDIUM: cache: don't cache when an Authorization header is present RFC 7234 says: A cache MUST NOT store a response to any request, unless: [...] the Authorization header field (see Section 4.2 of [RFC7235]) does not appear in the request, if the cache is shared, unless the response explicitly allows it (see Section 3.2), [...] In this patch we completely disable the cache upon the receipt of an Authorization header in the request. In this case it's not possible to either use the cache or store into the cache anymore. Thanks to Adam Eijdenberg of Digital Transformation Agency for raising this issue. This patch must be backported to 1.8.	2018-05-23 10:36:44 +02:00
Willy Tarreau	ba20dfc501	BUG/MEDIUM: http: don't always abort transfers on CF_SHUTR Pawel Karoluk reported on Discourse[1] that HTTP/2 breaks url_param. Christopher managed to track it down to the HTTP_MSGF_WAIT_CONN flag which is set there to ensure the connection is validated before sending the headers, as we may need to rewind the stream and hash again upon redispatch. What happens is that in the forwarding code we refrain from forwarding when this flag is set and the connection is not yet established, and for this we go through the missing_data_or_waiting path. This exit path was initially designed only to wait for data from the client, so it rightfully checks whether or not the client has already closed since in that case it must not wait for more data. But it also has the side effect of aborting such a transfer if the client has closed after the request, which is exactly what happens in H2. A study on the code reveals that this whole combined check should be revisited : while it used to be true that waiting had the same error conditions as missing data, it's not true anymore. Some other corner cases were identified, such as the risk to report a server close instead of a client timeout when waiting for the client to read the last chunk of data if the shutr is already present, or the risk to fail a redispatch when a client uploads some data and closes before the connection establishes. The compression seems to be at risk of rare issues there if a write to a full buffer is not yet possible but a shutr is already queued. At the moment these risks are extremely unlikely but they do exist, and their impact is very minor since it mostly concerns an issue not being optimally handled, and the fixes risk to cause more serious issues. Thus this patch only focuses on how the HTTP_MSGF_WAIT_CONN is handled and leaves the rest untouched. This patch needs to be backported to 1.8, and could be backported to earlier versions to properly take care of HTTP/1 requests passing via url_param which are closed immediately after the headers, though this is unlikely as this behaviour is only exhibited by scripts. [1] https://discourse.haproxy.org/t/haproxy-1-8-x-url-param-issue-in-http2/2482/13	2018-05-16 11:35:05 +02:00
Tim Duesterhus	e2b10bf491	MINOR: http: Add support for 421 Misdirected Request This makes haproxy aware of HTTP 421 Misdirected Request, which is defined in RFC 7540, section 9.1.2.	2018-04-28 07:03:39 +02:00
Aur�lien Nephtali	abbf607105	MEDIUM: cli: Add payload support In order to use arbitrary data in the CLI (multiple lines or group of words that must be considered as a whole, for example), it is now possible to add a payload to the commands. To do so, the first line needs to end with a special pattern: <<\n. Everything that follows will be left untouched by the CLI parser and will be passed to the commands parsers. Per-command support will need to be added to take advantage of this feature. Signed-off-by: Aur�lien Nephtali <aurelien.nephtali@corp.ovh.com>	2018-04-26 14:19:33 +02:00
Rian McGuire	89fcb7d929	BUG/MINOR: log: t_idle (%Ti) is not set for some requests If TCP content inspection is used, msg_state can be >= HTTP_MSG_ERROR the first time http_wait_for_request is called. t_idle was being left unset in that case. In the example below : stick-table type string len 64 size 100k expire 60s tcp-request inspect-delay 1s tcp-request content track-sc1 hdr(X-Session) %Ti will always be -1, because the msg_state is already at HTTP_MSG_BODY when http_wait_for_request is called for the first time. This patch should backported to 1.8 and 1.7.	2018-04-25 08:59:23 +02:00
Christopher Faulet	11ebb2080e	BUG/MINOR: http: Return an error in proxy mode when url2sa fails In proxy mode, the result of url2sa is never checked. So when the function fails to resolve the destination server from the URL, we continue. Depending on the internal state of the connection, we get different behaviours. With a newly allocated connection, the field <addr.to> is not set. So we will get a HTTP error. The status code is 503 instead of 400, but it's not really critical. But, if it's a recycled connection, we will reuse the previous value of <addr.to>, opening a connection on an unexpected server. To fix the bug, we return an error when url2sa fails. This patch should be backported in all version from 1.5.	2018-04-16 15:31:18 +02:00
Christopher Faulet	5cd4bbd7ab	BUG/MAJOR: threads/queue: Fix thread-safety issues on the queues management The management of the servers and the proxies queues was not thread-safe at all. First, the accesses to <strm>->pend_pos were not protected. So it was possible to release it on a thread (for instance because the stream is released) and to use it in same time on another one (because we redispatch pending connections for a server). Then, the accesses to stream's information (flags and target) from anywhere is forbidden. To be safe, The stream's state must always be updated in the context of process_stream. So to fix these issues, the queue module has been refactored. A lock has been added in the pendconn structure. And now, when we try to dequeue a pending connection, we start by unlinking it from the server/proxy queue and we wake up the stream. Then, it is the stream reponsibility to really dequeue it (or release it). This way, we are sure that only the stream can create and release its <pend_pos> field. However, be careful. This new implementation should be thread-safe (hopefully...). But it is not optimal and in some situations, it could be really slower in multi-threaded mode than in single-threaded one. The problem is that, when we try to dequeue pending connections, we process it from the older one to the newer one independently to the thread's affinity. So we need to wait the other threads' wakeup to really process them. If threads are blocked in the poller, this will add a significant latency. This problem happens when maxconn values are very low. This patch must be backported in 1.8.	2018-03-19 10:03:06 +01:00
Christopher Faulet	fd04fcf5ed	BUG/MEDIUM: http: Switch the HTTP response in tunnel mode as earlier as possible When the body length is undefined (no Content-Length or Transfer-Encoding headers), The reponse remains in ending mode, waiting the request is done. So, most of time this is not a problem because the resquest is done before the response. But when a client sends data to a server that replies without waiting all the data, it is really not desirable to wait the end of the request to finish the response. This bug was introduced when the tunneling of the request and the reponse was refactored, in commit `4be980391` ("MINOR: http: Switch requests/responses in TUNNEL mode only by checking txn flag"). This patch should be backported in 1.8 and 1.7.	2018-02-19 16:47:12 +01:00
Olivier Houchard	25ae45a078	MINOR: early data: Never remove the CO_FL_EARLY_DATA flag. It may be useful to keep the CO_FL_EARLY_DATA flag, so that we know early data were used, so instead of doing this, only add the Early-data header, and have the sample fetch ssl_fc_has_early return 1, if CO_FL_EARLY_DATA is set, and if the handshake isn't done yet.	2018-02-05 14:24:50 +01:00
Fr�d�ric L�caille	a41d531e4e	MINOR: config: Enable tracking of up to MAX_SESS_STKCTR stick counters. This patch really adds support for up to MAX_SESS_STKCTR stick counters.	2018-01-29 13:53:56 +01:00
Willy Tarreau	ff47b3f41d	BUG/MEDIUM: http: don't automatically forward request close Maximilian B�hm, and Lucas Rolff reported some frequent HTTP/2 POST failures affecting version 1.8.2 that were not affecting 1.8.1. Lukas Tribus determined that these ones appeared consecutive to commit `a48c141` ("BUG/MAJOR: connection: refine the situations where we don't send shutw()"). It turns out that the HTTP request forwarding engine lets a shutr from the client be automatically forwarded to the server unless chunked encoding is in use. It's a bit tricky to meet this condition as it only happens if the shutr is not reported in the initial request. So if a request is large enough or the body is delayed after the headers (eg: Expect: 100-continue), the the function quits with channel_auto_close() left enabled. The patch above was not really related in fact. It's just that a previous bug was causing this shutw to be skipped at the lower layers, and the two bugs used to cancel themselves. In the HTTP request we should only pass the close in tunnel mode, as other cases either need to keep the connection alive (eg: for reuse) or will force-close it. Also the forced close will properly take care of avoiding the painful time-wait, which is not possible with the early close. This patch must be backported to 1.8 as it directly impacts HTTP/2, and may be backported to older version to save them from being abused by clients causing TIME_WAITs between haproxy and the server. Thanks to Lukas and Lucas for running many tests with captures allowing the bug to be narrowed down.	2017-12-29 17:23:40 +01:00
Willy Tarreau	0ad8e0dfea	MINOR: http: add a function to check request's cache-control header field The new function check_request_for_cacheability() is used to check if a request may be served from the cache, and/or allows the response to be stored into the cache. For this it checks the cache-control and pragma header fields, and adjusts the existing TX_CACHEABLE and a new TX_CACHE_IGNORE flags. For now, just like its response side counterpart, it only checks the first value of the header field. These functions should be reworked to improve their parsers and validate all elements.	2017-12-22 17:56:17 +01:00
Willy Tarreau	d3900cc31d	BUG/MINOR: http: properly detect max-age=0 and s-maxage=0 in responses In 1.3.8, commit `a15645d` ("[MAJOR] completed the HTTP response processing.") improved the response parser by taking care of the cache-control header field. The parser is wrong because it is split in two parts, one checking for elements containing an equal sign and the other one for those without. The "max-age=0" and "s-maxage=0" tests were located at the wrong place and thus have never matched. In practice the side effect was very minimal given that this code used to be enabled only when checking if a cookie had the risk of being cached or not. Recently in 1.8 it was also used to decide if the response could be cached but in practice the cache takes care of these values by itself so there is very limited impact. This fix can be backported to all stable versions.	2017-12-22 15:49:15 +01:00
Willy Tarreau	12b32f212f	BUG/MINOR: http: do not ignore cache-control: public In check_response_for_cacheability(), we don't check the cache-control flags if the response is already supposed not to be cacheable. This was introduced very early when cache-control:public was not checked, and it basically results in this last one not being able to properly mark the response as cacheable if it uses a status code which is non-cacheable by default. Till now the impact is very limited as it doesn't check that cookies set on non-default status codes are not cacheable, and it prevents the cache from caching such responses. Let's fix this by doing two things : - remove the test for !TX_CACHEABLE in the aforementionned function - however take care of 1xx status codes here (which used to be implicitly dealt with by the test above) and remove the explicit check for 101 in the caller This fix must be backported to 1.8.	2017-12-22 14:43:26 +01:00
Willy Tarreau	83ece462b4	MINOR: http: start to compute the transaction's cacheability from the request There has always been something odd with the way the cache-control flags are checked. Since it was made for checking for the risk of leaking cookies only, all the processing was done in the response. Because of this it is not possible to reuse the transaction flags correctly for use with the cache. This patch starts to change this by moving the method check in the request so that we know very early whether the transaction is expected to be cacheable and that this status evolves along with checked headers. For now it's not enough to use from the cache yet but at least it makes the flag more consistent along the transaction processing.	2017-12-22 14:43:26 +01:00
Willy Tarreau	c55ddce65c	MINOR: http: update the list of cacheable status codes as per RFC7231 Since RFC2616, the following codes were added to the list of codes cacheable by default : 204, 404, 405, 414, 501. For now this it only checked by the checkcache option to detect cacheable cookies.	2017-12-22 14:43:26 +01:00
Willy Tarreau	24ea0bcb1d	MINOR: http: adjust the list of supposedly cacheable methods We used to have a rule inherited from RFC2616 saying that the POST method was the only uncacheable one, but things have changed since and RFC7231+7234 made it clear that in fact only GET/HEAD/OPTIONS/TRACE are cacheable. Currently this rule is only used to detect cacheable cookies.	2017-12-22 14:43:26 +01:00
Willy Tarreau	7aa15b072e	BUG/MEDIUM: stream: don't consider abortonclose on muxes which close cleanly The H2 mux can cleanly report an error when a client closes, which is not the case for the pass-through mux which only reports shutr. That was the reason why "option abortonclose" was created since there was no way to distinguish a clean shutdown after sending the request from an abort. The problem is that in case of H2, the streams are always shut read after the request is complete (when the END_STREAM flag is received), and that when this lands on a backend configured with "option abortonclose", this aborts the request. Disabling abortonclose is not always an option when H1 and H2 have to coexist. This patch makes use of the newly introduced mux capabilities reported via the stream interface's SI_FL_CLEAN_ABRT indicating that the mux is safe and that there is no need to turn a clean shutread into an abort. This way abortonclose has no effect on requests initiated from an H2 mux. This patch as well as these 3 previous ones need to be backported to 1.8 : - BUG/MINOR: h2: properly report a stream error on RST_STREAM - MINOR: mux: add flags to describe a mux's capabilities - MINOR: stream-int: set flag SI_FL_CLEAN_ABRT when mux supports clean aborts	2017-12-20 17:01:24 +01:00
Cyril Bont�	9fc9e53763	BUG: MINOR: http: don't check http-request capture id when len is provided Randomly, haproxy could fail to start when a "http-request capture" action is defined, without any change to the configuration. The issue depends on the memory content, which may raise a fatal error like : unable to find capture id 'xxxx' referenced by http-request capture rule Commit `fd608dd2` already prevents the condition to happen, but this one should be included for completeness and to reclect the code on the response side. The issue was introduced recently by commit `29730ba5` and should only be backported to haproxy 1.8.	2017-12-14 22:46:27 +01:00
Willy Tarreau	805935147a	BUG/MEDIUM: http: don't disable lingering on requests with tunnelled responses The HTTP forwarding engine needs to disable lingering on requests in case the connection to the server has to be suddenly closed due to http-server-close being used, so that we don't accumulate lethal TIME_WAIT sockets on the outgoing side. A problem happens when the server doesn't advertise a response size, because the response message quickly goes through the MSG_DONE and MSG_TUNNEL states, and once the client has transferred all of its data, it turns to MSG_DONE and immediately sets NOLINGER and closes before the server has a chance to respond. The problem is that this destroys some of the pending DATA being uploaded, the server doesn't receive all of them, detects an error and closes. This early NOLINGER is inappropriate in this situation because it happens before the response is transmitted. This state transition to MSG_TUNNEL doesn't happen when the response size is known since we stay in MSG_DATA (and related states) during all the transfer. Given that the issue is only related to connections not advertising a response length and that by definition these connections cannot be reused, there's no need for NOLINGER when the response's transfer length is not known, which can be verified when entering the CLOSED state. That's what this patch does. This fix needs to be backported to 1.8 and very likely to 1.7 and older as it affects the very rare case where a client immediately closes after the last uploaded byte (typically a script). However given that the risk of occurrence in HTTP/1 is extremely low, it is probably wise to wait before backporting it before 1.8.	2017-12-14 13:43:52 +01:00
Christopher Faulet	fd608dd2d2	BUG/MINOR: action: Don't check http capture rules when no id is defined This is a regression in the commit `29730ba5` ("MINOR: action: Add a functions to check http capture rules"). We must check the capture id only when an id is defined. This patch must be backported in 1.8.	2017-12-04 10:39:56 +01:00
Emeric Brun	0fed0b0a38	BUG/MEDIUM: peers: fix some track counter rules dont register entries for sync. This BUG was introduced with: 'MEDIUM: threads/stick-tables: handle multithreads on stick tables' The API was reviewed to handle stick table entry updates asynchronously and the caller must now call a 'stkable_touch_' function each time the content of an entry is modified to register the entry to be synced. There was missing call to stktable_touch_ resulting in not propagated entries to remote peers (or local one during reload)	2017-11-29 19:16:22 +01:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Olivier Houchard	fbc74e8556	MINOR/CLEANUP: proxy: rename "proxy" to "proxies_list" Rename the global variable "proxy" to "proxies_list". There's been multiple proxies in haproxy for quite some time, and "proxy" is a potential source of bugs, a number of functions have a "proxy" argument, and some code used "proxy" when it really meant "px" or "curproxy". It worked by pure luck, because it usually happened while parsing the config, and thus "proxy" pointed to the currently parsed proxy, but we should probably not rely on this. [wt: some of these are definitely fixes that are worth backporting]	2017-11-24 17:21:27 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Willy Tarreau	53275e8b02	MINOR: http: implement the "http-request reject" rule This one acts similarly to its tcp-request counterpart. It immediately closes the request without emitting any response. It can be suitable in certain DoS conditions, as well as to close an HTTP/2 connection.	2017-11-24 07:52:01 +01:00
William Lallemand	71bd11a1f3	MEDIUM: cache: enable the HTTP analysers Enable the same analysers as the stats applet. Allows keepalive and termination flags to work.	2017-11-20 19:22:27 +01:00
Willy Tarreau	916597903c	MEDIUM: http: always reject the "PRI" method This method was reserved for the HTTP/2 connection preface, must never be used and must be rejected. In normal situations it doesn't happen, but it may be visible if a TCP frontend has alpn "h2" enabled, and forwards to an HTTP backend which tries to parse the request. Before this patch it would pass the wrong request to the backend server, now it properly returns 400 bad req. This patch should probably be backported to stable versions.	2017-11-10 19:38:10 +01:00
Christopher Faulet	c5a9d5bf23	BUG/MEDIUM: stream-int: Don't loss write's notifs when a stream is woken up When a write activity is reported on a channel, it is important to keep this information for the stream because it take part on the analyzers' triggering. When some data are written, the flag CF_WRITE_PARTIAL is set. It participates to the task's timeout updates and to the stream's waking. It is also used in CF_MASK_ANALYSER mask to trigger channels anaylzers. In the past, it was cleared by process_stream. Because of a bug (fixed in commit `95fad5ba4` ["BUG/MAJOR: stream-int: don't re-arm recv if send fails"]), It is now cleared before each send and in stream_int_notify. So it is possible to loss this information when process_stream is called, preventing analyzers to be called, and possibly leading to a stalled stream. Today, this happens in HTTP2 when you call the stat page or when you use the cache filter. In fact, this happens when the response is sent by an applet. In HTTP1, everything seems to work as expected. To fix the problem, we need to make the difference between the write activity reported to lower layers and the one reported to the stream. So the flag CF_WRITE_EVENT has been added to notify the stream of the write activity on a channel. It is set when a send succedded and reset by process_stream. It is also used in CF_MASK_ANALYSER. finally, it is checked in stream_int_notify to wake up a stream and in channel_check_timeouts. This bug is probably present in 1.7 but it seems to have no effect. So for now, no needs to backport it.	2017-11-09 15:16:05 +01:00
Olivier Houchard	522eea7110	MINOR: ssl: Handle sending early data to server. This adds a new keyword on the "server" line, "allow-0rtt", if set, we'll try to send early data to the server, as long as the client sent early data, as in case the server rejects the early data, we no longer have them, and can't resend them, so the only option we have is to send back a 425, and we need to be sure the client knows how to interpret it correctly.	2017-11-08 14:11:10 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Olivier Houchard	9aaf778129	MAJOR: connection : Split struct connection into struct connection and struct conn_stream. All the references to connections in the data path from streams and stream_interfaces were changed to use conn_streams. Most functions named "something_conn" were renamed to "something_cs" for this. Sometimes the connection still is what matters (eg during a connection establishment) and were not always renamed. The change is significant and minimal at the same time, and was quite thoroughly tested now. As of this patch, all accesses to the connection from upper layers go through the pass-through mux.	2017-10-31 18:03:23 +01:00
Christopher Faulet	1bc04c7664	BUG/MINOR: threads: Add missing THREAD_LOCAL on static here and there	2017-10-31 13:58:33 +01:00
Emeric Brun	b5997f740b	MAJOR: threads/map: Make acls/maps thread safe locks have been added in pat_ref and pattern_expr structures to protect all accesses to an instance of on of them. Moreover, a global lock has been added to protect the LRU cache used for pattern matching. Patterns are now duplicated after a successfull matching, to avoid modification by other threads when the result is used. Finally, the function reloading a pattern list has been modified to be thread-safe.	2017-10-31 13:58:32 +01:00
Emeric Brun	8c1aaa201a	MEDIUM: threads/http: Make http_capture_bad_message thread-safe This is done by passing the right stream's proxy (the frontend or the backend, depending on the context) to lock the error snapshot used to store the error info.	2017-10-31 13:58:31 +01:00
Emeric Brun	819fc6f563	MEDIUM: threads/stick-tables: handle multithreads on stick tables The stick table API was slightly reworked: A global spin lock on stick table was added to perform lookup and insert in a thread safe way. The handling of refcount on entries is now handled directly by stick tables functions under protection of this lock and was removed from the code of callers. The "stktable_store" function is no more externalized and users should now use "stktable_set_entry" in any case of insertion. This last one performs a lookup followed by a store if not found. So the code using "stktable_store" was re-worked. Lookup, and set_entry functions automatically increase the refcount of the returned/stored entry. The function "sticktable_touch" was renamed "sticktable_touch_local" and is now able to decrease the refcount if last arg is set to true. It is allowing to release the entry without taking the lock twice. A new function "sticktable_touch_remote" is now used to insert entries coming from remote peers at the right place in the update tree. The code of peer update was re-worked to use this new function. This function is also able to decrease the refcount if wanted. The function "stksess_kill" also handle a parameter to decrease the refcount on the entry. A read/write lock is added on each entry to protect the data content updates of the entry.	2017-10-31 13:58:31 +01:00
Christopher Faulet	29f77e846b	MEDIUM: threads/server: Add a lock per server and atomically update server vars The server's lock is use, among other things, to lock acces to the active connection list of a server.	2017-10-31 13:58:31 +01:00
Christopher Faulet	40a007cf2a	MEDIUM: threads/server: Make connection list (priv/idle/safe) thread-safe For now, we have a list of each type per thread. So there is no need to lock them. This is the easiest solution for now, but not the best one because there is no sharing between threads. An idle connection on a thread will not be able be used by a stream on another thread. So it could be a good idea to rework this patch later.	2017-10-31 13:58:30 +01:00
Christopher Faulet	ff8abcd31d	MEDIUM: threads/proxy: Add a lock per proxy and atomically update proxy vars Now, each proxy contains a lock that must be used when necessary to protect it. Moreover, all proxy's counters are now updated using atomic operations.	2017-10-31 13:58:30 +01:00
Christopher Faulet	8d8aa0d681	MEDIUM: threads/listeners: Make listeners thread-safe First, we use atomic operations to update jobs/totalconn/actconn variables, listener's nbconn variable and listener's counters. Then we add a lock on listeners to protect access to their information. And finally, listener queues (global and per proxy) are also protected by a lock. Here, because access to these queues are unusal, we use the same lock for all queues instead of a global one for the global queue and a lock per proxy for others.	2017-10-31 13:58:30 +01:00
Christopher Faulet	1b421eab87	MINOR: acl: Pass the ACLs as an explicit parameter of build_acl_cond So it is possible to use anothers ACLs to build ACL conditions than those of proxies.	2017-10-31 11:36:12 +01:00
Christopher Faulet	29730ba570	MINOR: action: Add a functions to check http capture rules "check_http_req_capture" and "check_http_res_capture" functions have been added to check validity of "http-request capture" and "http-response capture" rules. Code for these functions come from cfgparse.c.	2017-10-31 11:36:12 +01:00
Christopher Faulet	78880fb196	MINOR: action: Add function to check rules using an action ACT_ACTION_TRK_* The function "check_trk_action" has been added to find and check the target table for rules using an action ACT_ACTION_TRK_*.	2017-10-31 11:36:12 +01:00
Christopher Faulet	4fce0d8447	MINOR: action: Use trk_idx instead of tcp/http_trk_idx So tcp_trk_idx and http_trk_idx have been removed.	2017-10-31 11:36:12 +01:00
Olivier Houchard	c2aae74f01	MEDIUM: ssl: Handle early data with OpenSSL 1.1.1 When compiled with Openssl >= 1.1.1, before attempting to do the handshake, try to read any early data. If any early data is present, then we'll create the session, read the data, and handle the request before we're doing the handshake. For this, we add a new connection flag, CO_FL_EARLY_SSL_HS, which is not part of the CO_FL_HANDSHAKE set, allowing to proceed with a session even before an SSL handshake is completed. As early data do have security implication, we let the origin server know the request comes from early data by adding the "Early-Data" header, as specified in this draft from the HTTP working group : https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-replay	2017-10-27 10:54:05 +02:00
Olivier Houchard	51a76d84e4	MINOR: http: Mark the 425 code as "Too Early". This adds a new status code for use with the "http-request deny" ruleset. The use case for this code is currently handled by this draft dedicated to 0-RTT processing : https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-replay	2017-10-27 10:53:32 +02:00
Dragan Dosen	2ae327edaf	BUG/MEDIUM: prevent buffers being overwritten during build_logline() execution Calls to build_logline() are audited in order to use dynamic trash buffers allocated by alloc_trash_chunk() instead of global trash buffers. This is similar to commits `07a0fec` ("BUG/MEDIUM: http: Prevent replace-header from overwriting a buffer") and `0d94576` ("BUG/MEDIUM: http: prevent redirect from overwriting a buffer"). This patch should be backported in 1.7, 1.6 and 1.5. It relies on commit `b686afd` ("MINOR: chunks: implement a simple dynamic allocator for trash buffers") for the trash allocator, which has to be backported as well.	2017-10-27 10:02:33 +02:00

1 2 3 4 5 ...

1359 Commits