haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-18 13:06:58 +02:00

Author	SHA1	Message	Date
Christopher Faulet	e0768ebabc	MEDIUM: proto_htx: Add HTX analyzers and use it when the mux H1 is used For now, these analyzers are just copies of the legacy HTTP analyzers. But, during the HTTP refactoring, it will be the main place where it will be visible. And in legacy analyzers, the macro IS_HTX_STRM is used to know if the HTX version should be called or not. Note: the following commits were applied to proto_http.c after this patch was developed and need to be studied to see if an adaptation to htx is required : `fd9b68c` BUG/MINOR: only mark connections private if NTLM is detected	2018-11-18 21:45:50 +01:00
Christopher Faulet	27a3dc8fb2	MINOR: http: Call http_send_name_header with the stream instead of the txn This is just a minor change to ease integrartion of the HTX.	2018-11-18 21:45:49 +01:00
Olivier Houchard	7c6f8b146d	MAJOR: connections: Detach connections from streams. Do not destroy the connection when we're about to destroy a stream. This prevents us from doing keepalive on server connections when the client is using HTTP/2, as a new stream is created for each request. Instead, the session is now responsible for destroying connections. When reusing connections, the attach() mux method is now used to create a new conn_stream.	2018-11-18 21:45:45 +01:00
Christopher Faulet	3c0544efbf	BUG/MINOR: http: Be sure to sent fully formed HTTP 103 responses The previous commit fedceaf33 ("MINOR: http: Regroup return statements of http_req_get_intercept_rule at the end") partly fixes the problem. But not entierly. Because HTTP 103 reponses were sent line by line it is possible to mix them with others. For instance, an early-hint rule followed by a redirect rule leaving the response buffer totally messed up. Furthermore, if we fail to add the last CRLF to finish the HTTP 103 response because there is no more space in the buffer, it leave the buffer with an unfinished and invalid message. This patch fixes the bug by creating a fully formed HTTP 103 response before trying to push it in the response buffer. If an error occurred during the copy or if another response was already sent, the HTTP 103 response is ignored. However, the last point should never happened because, for redirects and authentication errors, we first try to copy any pending HTTP 103 response.	2018-11-16 16:05:51 +01:00
Christopher Faulet	6c243ebb9f	MINOR: http: Regroup return statements of http_res_get_intercept_rule at the end Instead of having multiple return statements spreaded here and there in middle of the function, we just exit from the loop setting the right return code. It let a chance to do some work before leaving the function. It is also less error prone.	2018-11-16 16:05:51 +01:00
Christopher Faulet	ea827bdcbc	MINOR: http: Regroup return statements of http_req_get_intercept_rule at the end Instead of having multiple return statements spreaded here and there in middle of the function, we just exit from the loop setting the right return code. It let a chance to do some work before leaving the function. It is also less error prone.	2018-11-16 16:05:51 +01:00
Fr�d�ric L�caille	9ca51aa288	MINOR: http: Implement "early-hint" http request rules. This patch implements http_apply_early_hint_rule() function is responsible of building HTTP 103 Early Hint responses each time a "early-hint" rule is matched.	2018-11-12 21:08:55 +01:00
Willy Tarreau	9d9ccdbf8b	BUG/MAJOR: http: http_txn_get_path() may deference an inexisting buffer When the "path" sample fetch function is called without any path, the function doesn't check that the request buffer is allocated. While this doesn't happen with the request during processing, it can definitely happen when mistakenly trying to reference a path from the response since the request channel is not allocated anymore. It's certain that this bug was emphasized by the buffer changes that went in 1.9 and the HTTP refactoring, but at first glance, 1.8 doesn't seem 100% safe either so it's possible that older version are affected as well. Thanks to PiBa-NL for reporting this bug with a reproducer.	2018-10-28 20:16:12 +01:00
Willy Tarreau	cda7f3f5c2	MINOR: stream: don't prune variables if the list is empty The vars_prune() and vars_init() functions involve locking while most of the time there is no variable at all in streams nor sessions. Let's check for emptiness before calling these functions. Simply doing this has increased the multithreaded performance from 1.5 to 5% depending on the workload.	2018-10-28 13:46:47 +01:00
Lukas Tribus	80512b186f	BUG/MINOR: only auto-prefer last server if lb-alg is non-deterministic While "option prefer-last-server" only applies to non-deterministic load balancing algorithms, 401/407 responses actually caused haproxy to prefer the last server unconditionally. As this breaks deterministic load balancing algorithms like uri, this patch applies the same condition here. Should be backported to 1.8 (together with "BUG/MINOR: only mark connections private if NTLM is detected").	2018-10-27 22:10:32 +02:00
Lukas Tribus	fd9b68c48e	BUG/MINOR: only mark connections private if NTLM is detected Instead of marking all connections that see a 401/407 response private (for connection reuse), this patch detects a RFC4559/NTLM authentication scheme and restricts the private setting to those connections. This is so we can reuse connections with 401/407 responses with deterministic load balancing algorithms later (which requires another fix). This fixes the problem reported here by Elliot Barlas : https://discourse.haproxy.org/t/unable-to-configure-load-balancing-per-request-over-persistent-connection/3144 Should be backported to 1.8.	2018-10-27 22:10:29 +02:00
Willy Tarreau	ede3d884fc	MEDIUM: channel: merge back flags CF_WRITE_PARTIAL and CF_WRITE_EVENT The behaviour of the flag CF_WRITE_PARTIAL was modified by commit `95fad5ba4` ("BUG/MAJOR: stream-int: don't re-arm recv if send fails") due to a situation where it could trigger an immediate wake up of the other side, both acting in loops via the FD cache. This loss has caused the need to introduce CF_WRITE_EVENT as commit `c5a9d5bf`, to replace it, but both flags express more or less the same thing and this distinction creates a lot of confusion and complexity in the code. Since the FD cache now acts via tasklets, the issue worked around in the first patch no longer exists, so it's more than time to kill this hack and to restore CF_WRITE_PARTIAL's semantics (i.e.: there has been some write activity since we last left process_stream). This patch mostly reverts the two commits above. Only the part making use of CF_WROTE_DATA instead of CF_WRITE_PARTIAL to detect the loss of data upon connection setup was kept because it's more accurate and better suited.	2018-10-26 08:32:57 +02:00
Christopher Faulet	66943a4903	CLEANUP: http: Remove the unused function http_find_header	2018-10-23 10:22:36 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" have been remove. Now, "http_header_add_tail2" is always used. And the function "capture_headers" have been renamed into "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions's prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
Willy Tarreau	79e57336b5	REORG: http: move the code to different files The current proto_http.c file is huge and contains different processing domains making it very difficult to work on an alternative representation. This commit moves some parts to other files : - ACL registration code => http_acl.c This code only creates some ACL mappings and doesn't know anything about HTTP nor about the representation. This code could even have moved to acl.c but it was not worth polluting it again. - HTTP sample conversion => http_conv.c This code doesn't depend on the internal representation but definitely manipulates some HTTP elements, such as dates. It also has access to captures. - HTTP sample fetching => http_fetch.c This code does depend entirely on the internal representation but is totally independent on the analysers. Placing it into a different file will ease the transition to the new representation and the creation of a wrapper if required. An include file was created due to CHECK_HTTP_MESSAGE_FIRST() being used at various places. - HTTP action registration => http_act.c This code doesn't directly interact with the messages nor the transaction but it does so via some exported http functions like http_replace_req_line() or http_set_status() so it will be easier to change only this after the conversion. - a few very generic parts were found and moved to http.{c,h} as relevant. It is worth noting that the functions moved to these new files are not referenced anywhere outside of the files and are only called as registered callbacks, so these files do not even require associated include files.	2018-10-02 18:26:59 +02:00
Christopher Faulet	ca874b8d92	BUG/MEDIUM: http: Don't parse chunked body if there is no input data With recent modifications on the buffers API, when a buffer is released (calling b_free), we replace it by BUF_NULL where the area pointer is NULL. So many operations, like b_peek, must be avoided on a released or not allocated buffer. These changes were mainly made in the commit `c9fa048` ("MAJOR: buffer: finalize buffer detachment"). Since this commit, HAProxy can crash during the body parsing of chunked HTTP messages because there is no check on the channel's buffer in HTTP analyzers (http_request_forward_body and http_response_forward_body) nor in H1 functions reponsible to parse chunked content (h1_skip_chunk_crlf & co). If a stream is woken up after all input data were forwarded, its input channel's buffer is released (so set to BUF_NULL). In this case, if we resume the parsing of a chunk, HAProxy crashes. To fix this issue, we just skip the parsing of chunks if there is no input data for the corresponding channel. This is only done if the message state is strickly lower to HTTP_MSG_ENDING.	2018-09-20 14:37:58 +02:00
Willy Tarreau	b05e48a54d	BUILD: http: address a couple of null-deref warnings at -Wextra These two warnings are caused by the use of objt_server() without checking its result. These are turned to __objt_server() which is safe there.	2018-09-20 11:42:15 +02:00
Willy Tarreau	ab813a4b05	REORG: http: move some header value processing functions to http.c The following functions only deal with header field values and are agnostic to the HTTP version so they were moved to http.c : http_header_match2(), find_hdr_value_end(), find_cookie_value_end(), extract_cookie_value(), parse_qvalue(), http_find_url_param_pos(), http_find_next_url_param(). Those lacking the "http_" prefix were modified to have it.	2018-09-11 10:30:25 +02:00
Willy Tarreau	e10cd48a83	REORG: http: move the log encoding tables to log.c There are 3 tables in proto_http which are used exclusively by logs : hdr_encode_map[], url_encode_map[] and http_encode_map[]. They indicate what characters are safe to be emitted in logs depending on the part of the message where they are placed. Let's move this to log.c, as well as its initialization. It's worth noting that the rfc5424 map was already initialized there.	2018-09-11 10:30:25 +02:00
Willy Tarreau	04f1e2d202	REORG: http: move error codes production and processing to http.c These error codes and messages are agnostic to the version, even if they are represented as HTTP/1.0 messages. Ultimately they will have to be transformed into internal HTTP messages to be used everywhere. The HTTP/1.1 100 Continue message was turned to an IST and the local copy in the Lua code was removed.	2018-09-11 10:30:25 +02:00
Willy Tarreau	6b952c8101	REORG: http: move http_get_path() to http.c This function is purely HTTP once http_txn is put aside. So the original one was renamed to http_txn_get_path() and it extracts the relevant offsets from the txn to pass them to http_get_path(). One benefit of the new version is that it returns the length at the same time so that allowed to slightly simplify http_get_path_from_string() which had to look up the end pointer previously and which is not needed anymore.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	ddb68ac69e	REORG: cli: move the "show errors" handler from http to proxy There's nothing HTTP-specific there anymore at all, let's move this to the proxy where it belongs.	2018-09-07 18:36:50 +02:00
Willy Tarreau	fd9419d560	MINOR: http: remove the pointer to the error snapshot in http_capture_bad_message() It's not needed anymore as we know the side thanks to the channel. This will allow the proxy generic code to better manage the error snapshots.	2018-09-07 18:36:04 +02:00
Willy Tarreau	ef3ca73fc3	MINOR: http: make the HTTP error capture rely on the generic proxy code Now that we have a generic error capture function, let's simplify http_capture_bad_message() to make use of it. At this point the API is not changed at all, but it could be further simplified.	2018-09-07 18:36:04 +02:00
Willy Tarreau	7ccdd8dad9	MEDIUM: snapshot: implement a show() callback and use it for HTTP The HTTP dumps are now configurable in the code : "show errors" now calls a protocol-specific function to emit the decoded output. For now only HTTP is implemented.	2018-09-07 18:36:01 +02:00
Willy Tarreau	0b5b480594	MEDIUM: snapshot: start to reorder the HTTP snapshot output a little bit The output of "show errors" was slightly reordered to split the HTTP part in a single chunk_appendf() call. The useless buffer total input was replaced to report the buffer's start offset, which is the offset in the stream of the first input byte (thus not counting output). Also it was the opportunity to stop calling the stream "session".	2018-09-07 17:48:14 +02:00
Willy Tarreau	7480f323ff	MINOR: snapshot: split the error snapshots into common and proto-specific parts The idea will be to make the error snapshot feature accessible to other protocols than just HTTP. This patch only introduces an "http_snapshot" structure and renames a few fields to make things more explicit. The HTTP part was installed inside a union so that we can easily add more protocols in the future.	2018-09-07 16:13:45 +02:00
Willy Tarreau	5865a8fe69	MINOR: snapshot: restart on the event ID and not the stream ID The snapshots have the ability to restart a partial dump and they use the stream ID as the restart point. Since it's purely HTTP, let's use the event ID instead.	2018-09-07 15:00:43 +02:00
Willy Tarreau	e9e878a056	BUG/MINOR: http/threads: atomically increment the error snapshot ID Let's use an atomic increment for the error snapshot, as we'd rather not assign the same ID to two errors happening in parallel. It's very unlikely that it will ever happen though. This patch must be backported to 1.8 with the other one it relies on ("MINOR: thread: implement HA_ATOMIC_XADD()").	2018-09-07 11:31:58 +02:00
Willy Tarreau	90a7c03ec0	BUG/MINOR: stream: use atomic increments for the request counter The request counter is incremented when creating a new stream and when resetting a stream, preparing for a new request. Unfortunately during the thread migration this was missed, leading to non-atomic increments in case threads are in use. The most visible side effect is that two requests may have the same ID from time to time in the logs. However the SPOE also uses this ID to route responses back to the stream so it may also lead to occasional spurious SPOE timeouts. Note that it still doesn't guarantee temporal unicity in the stream identifiers since a long and a short connection could technically use the same ID. The likeliness that this happens at the same time is almost null (roughly threads*runqueue_depth/2^32 that it happens in the same poll loop), but it will have to be addressed later anyway. This patch must be backported to 1.8 with the other one it relies on ("MINOR: thread: implement HA_ATOMIC_XADD()").	2018-09-05 16:30:19 +02:00
Patrick Hemmer	e3faf02581	BUG/MEDIUM: lua: reset lua transaction between http requests Previously LUA code would maintain the transaction state between http requests, resulting in things like txn:get_priv() retrieving data from a previous request. This addresses the issue by ensuring the LUA state is reset between requests. Co-authored-by: Tim D�sterhus <tim@bastelstu.be>	2018-08-25 07:51:02 +02:00
Willy Tarreau	9c768fdca1	BUG/MEDIUM: http: don't store url_decode() result in the samples's length By convenience or laziness we used to store url_decode()'s return code into smp->data.u.str.data. The result checks applied there compare it to 0 while it's now unsigned since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	6e27be1a5d	BUG/MEDIUM: http: don't store exp_replace() result in the trash's length By convenience or laziness we used to store exp_replace()'s return code into trash.data. The result checks applied there compare trash.data to -1 while it's now unsigned since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	5f6333caca	BUG/MINOR: chunks: do not store -1 into chunk_printf() in case of error Since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct") a chunk length is unsigned so we can't reliably store -1 and check for negative values in the caller. Only one such location was found in proto_http's http-request auth rules (which cannot realistically fail). No backport is needed.	2018-08-22 05:16:31 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00
Willy Tarreau	66425e31b5	MINOR: queue: make sure the pendconn is released before logging We'll soon need to rely on the pendconn position at the time of dequeuing to figure the position a stream took in the queue. Usually it's not a problem since pendconn_free() is called once the connection starts, but it will make a difference for failed dequeues (eg: queue timeout reached). Thus it's important to call pendconn_free() before logging in cases we are not certain whether it was already performed, and to call pendconn_unlink() after we know the pendconn will not be used so that we collect the queue state as accurately as possible. As a benefit it will also make the server's and backend's queues count more accurate in these cases.	2018-08-10 15:04:08 +02:00
Willy Tarreau	d0ad4a87f0	MEDIUM: queue: make pendconn_free() work on the stream instead Now pendconn_free() takes a stream, checks that pend_pos is set, clears it, and uses pendconn_unlink() to complete the job. It's cleaner and centralizes all the bookkeeping work in pendconn_unlink() only and ensures that there's a single place where the stream's position in the queue is manipulated.	2018-07-26 17:32:51 +02:00
Christopher Faulet	4507351a2f	BUG/MINOR: build: Fix compilation with debug mode enabled It remained some fragments of the old buffers API in debug messages, here and there. This was caused by the recent buffer API changes, no backport is needed.	2018-07-20 10:45:20 +02:00
Christopher Faulet	005e79e5dd	BUG/MINOR: http: Set brackets for the unlikely macro at the right place When test on the header "Early-Data" is made, the unlikely macro must encompass the condition. This patch must be backported in 1.8.	2018-07-20 10:42:53 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later permit to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the othe ones.	2018-07-19 16:23:43 +02:00
Willy Tarreau	e3128024bf	MINOR: buffer: replace buffer_replace2() with b_rep_blk() This one is more generic and designed to work on a random block. It may later get a b_rep_ist() variant since many strings are already available as (ptr,len).	2018-07-19 16:23:43 +02:00
Willy Tarreau	4d893d440c	MINOR: buffers/channel: replace buffer_insert_line2() with ci_insert_line2() There was no point keeping that function in the buffer part since it's exclusively used by HTTP at the channel level, since it also automatically appends the CRLF. This further cleans up the buffer code.	2018-07-19 16:23:43 +02:00
Willy Tarreau	72a100b386	MINOR: buffer: replace bi_fast_delete() with b_del() There's no distinction between in and out data now. The latter covers the needs of the former and supports wrapping. The extra cost is negligible given the locations where it's used.	2018-07-19 16:23:43 +02:00
Willy Tarreau	f37954d4da	MEDIUM: http: use wrappers instead of directly manipulating buffers states This is aimed at easing the transition to the new API. There are a few places which deserve some simplifications afterwards because ci_head() is called often and may be placed into a local pointer.	2018-07-19 16:23:42 +02:00
Willy Tarreau	5ba65521a3	MINOR: buffer: replace buffer_pending() with ci_data() It used to return b->i for channels, which is what ci_data() does.	2018-07-19 16:23:41 +02:00

1 2 3 4 5 ...

1317 Commits