haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-18 04:56:56 +02:00

Author	SHA1	Message	Date
Christopher Faulet	ed26fb8ac8	BUG/MINOR: http: Use out buffer instead of trash to display error snapshot the function http_show_error_snapshot() must not use the trash buffer to append the HTTP error description. Instead, it must use the <out> buffer, its first argument. Note that concretely, this function always succeeds because <out> is always the trash buffer.	2018-12-01 17:20:36 +01:00
Olivier Houchard	985f139aa2	MEDIUM: session: Steal owner-less connections on end of transaction. When a transaction ends, if we want to do keepalive, and the connection we used didn't have an owner, attach the connection to the session, so that we don't have to destroy it, and we can reuse it later.	2018-12-01 10:47:19 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	9efd7456e0	MEDIUM: tasks: collect per-task CPU time and latency Right now we measure for each task the cumulated time spent waiting for the CPU and using it. The timestamp uses a 64-bit integer to report a nanosecond-level date. This is only enabled when "profiling.tasks" is enabled, and consumes less than 1% extra CPU on x86_64 when enabled. The cumulated processing time and wait time are reported in "show sess". The task's counters are also reset when an HTTP transaction is reset since the HTTP part pretends to restart on a fresh new stream. This will make sure we always report correct numbers for each request in the logs.	2018-11-22 15:44:21 +01:00
Joseph Herlant	5ba8025976	CLEANUP: fix typos in the proto_http subsystem Fixes typos in the code comments of the proto_http subsystem.	2018-11-18 22:23:15 +01:00
Christopher Faulet	fefc73da34	MINOR: proto_htx: Add functions htx_perform_server_redirect It is more or less the same than legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	64159df1fb	MINOR: proto_htx: Add functions htx_send_name_header It is more or less the same than legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	25a02f65b1	MINOR: proto_htx: Add functions to check the cacheability of HTX messages It is more or less the same than legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	8d8ac191a7	MINOR: proto_htx: Add functions htx_req_replace_stline and htx_res_set_status It is more or less the same than legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch on the HTX code when applicable.	2018-11-18 22:08:56 +01:00
Christopher Faulet	9768c2660e	MAJOR: mux-h1/proto_htx: Switch mux-h1 and HTX analyzers on the HTX representation The mux-h1 now parses and formats HTTP/1 messages using the HTX representation. The HTX analyzers have been updated too. For now, only htx_wait_for_{request/response} and http_{request/response}_forward_body have been adapted. Others are disabled for now. Now, the HTTP messages are parsed by the mux on a side and then, after analysis, formatted on the other side. In the middle, in the stream, there is no more parsing. Among other things, the version parsing is now handled by the mux. During the data forwarding, depending the value of the "extra" field, we are able to know if the body length is known or not and if yes, how many bytes are still expected.	2018-11-18 22:08:54 +01:00
Christopher Faulet	0f226958b7	MINOR: proto_htx: Add some functions to handle HTX messages More functions will come, but it is the minimum to switch HTX analyzers on the HTX internal representation.	2018-11-18 22:08:54 +01:00
Christopher Faulet	f2824e6e10	MAJOR: mux-h1/proto_htx: Handle keep-alive connections in the mux Now, the connection mode is detected in the mux and not in HTX analyzers anymore. Keep-alive connections are now managed by the mux. A new stream is created for each transaction. This removes the most important part of the synchronization between channels and the HTTP transaction cleanup. These changes only affect the HTX part (proto_htx.c). Legacy HTTP analyzers remain untouched for now. On the client-side, the mux is responsible to create new streams when a new request starts. It is also responsible to parse and update the "Connection:" header of the response. On the server-side, the mux is responsible to parse and update the "Connection:" header of the request. Muxes on each side are independent. For now, there is no connection pool on the server-side, so it always close the server connection.	2018-11-18 22:02:42 +01:00
Christopher Faulet	e0768ebabc	MEDIUM: proto_htx: Add HTX analyzers and use it when the mux H1 is used For now, these analyzers are just copies of the legacy HTTP analyzers. But, during the HTTP refactoring, it will be the main place where it will be visible. And in legacy analyzers, the macro IS_HTX_STRM is used to know if the HTX version should be called or not. Note: the following commits were applied to proto_http.c after this patch was developed and need to be studied to see if an adaptation to htx is required : `fd9b68c` BUG/MINOR: only mark connections private if NTLM is detected	2018-11-18 21:45:50 +01:00
Christopher Faulet	27a3dc8fb2	MINOR: http: Call http_send_name_header with the stream instead of the txn This is just a minor change to ease integrartion of the HTX.	2018-11-18 21:45:49 +01:00
Olivier Houchard	7c6f8b146d	MAJOR: connections: Detach connections from streams. Do not destroy the connection when we're about to destroy a stream. This prevents us from doing keepalive on server connections when the client is using HTTP/2, as a new stream is created for each request. Instead, the session is now responsible for destroying connections. When reusing connections, the attach() mux method is now used to create a new conn_stream.	2018-11-18 21:45:45 +01:00
Christopher Faulet	3c0544efbf	BUG/MINOR: http: Be sure to sent fully formed HTTP 103 responses The previous commit fedceaf33 ("MINOR: http: Regroup return statements of http_req_get_intercept_rule at the end") partly fixes the problem. But not entierly. Because HTTP 103 reponses were sent line by line it is possible to mix them with others. For instance, an early-hint rule followed by a redirect rule leaving the response buffer totally messed up. Furthermore, if we fail to add the last CRLF to finish the HTTP 103 response because there is no more space in the buffer, it leave the buffer with an unfinished and invalid message. This patch fixes the bug by creating a fully formed HTTP 103 response before trying to push it in the response buffer. If an error occurred during the copy or if another response was already sent, the HTTP 103 response is ignored. However, the last point should never happened because, for redirects and authentication errors, we first try to copy any pending HTTP 103 response.	2018-11-16 16:05:51 +01:00
Christopher Faulet	6c243ebb9f	MINOR: http: Regroup return statements of http_res_get_intercept_rule at the end Instead of having multiple return statements spreaded here and there in middle of the function, we just exit from the loop setting the right return code. It let a chance to do some work before leaving the function. It is also less error prone.	2018-11-16 16:05:51 +01:00
Christopher Faulet	ea827bdcbc	MINOR: http: Regroup return statements of http_req_get_intercept_rule at the end Instead of having multiple return statements spreaded here and there in middle of the function, we just exit from the loop setting the right return code. It let a chance to do some work before leaving the function. It is also less error prone.	2018-11-16 16:05:51 +01:00
Fr�d�ric L�caille	9ca51aa288	MINOR: http: Implement "early-hint" http request rules. This patch implements http_apply_early_hint_rule() function is responsible of building HTTP 103 Early Hint responses each time a "early-hint" rule is matched.	2018-11-12 21:08:55 +01:00
Willy Tarreau	9d9ccdbf8b	BUG/MAJOR: http: http_txn_get_path() may deference an inexisting buffer When the "path" sample fetch function is called without any path, the function doesn't check that the request buffer is allocated. While this doesn't happen with the request during processing, it can definitely happen when mistakenly trying to reference a path from the response since the request channel is not allocated anymore. It's certain that this bug was emphasized by the buffer changes that went in 1.9 and the HTTP refactoring, but at first glance, 1.8 doesn't seem 100% safe either so it's possible that older version are affected as well. Thanks to PiBa-NL for reporting this bug with a reproducer.	2018-10-28 20:16:12 +01:00
Willy Tarreau	cda7f3f5c2	MINOR: stream: don't prune variables if the list is empty The vars_prune() and vars_init() functions involve locking while most of the time there is no variable at all in streams nor sessions. Let's check for emptiness before calling these functions. Simply doing this has increased the multithreaded performance from 1.5 to 5% depending on the workload.	2018-10-28 13:46:47 +01:00
Lukas Tribus	80512b186f	BUG/MINOR: only auto-prefer last server if lb-alg is non-deterministic While "option prefer-last-server" only applies to non-deterministic load balancing algorithms, 401/407 responses actually caused haproxy to prefer the last server unconditionally. As this breaks deterministic load balancing algorithms like uri, this patch applies the same condition here. Should be backported to 1.8 (together with "BUG/MINOR: only mark connections private if NTLM is detected").	2018-10-27 22:10:32 +02:00
Lukas Tribus	fd9b68c48e	BUG/MINOR: only mark connections private if NTLM is detected Instead of marking all connections that see a 401/407 response private (for connection reuse), this patch detects a RFC4559/NTLM authentication scheme and restricts the private setting to those connections. This is so we can reuse connections with 401/407 responses with deterministic load balancing algorithms later (which requires another fix). This fixes the problem reported here by Elliot Barlas : https://discourse.haproxy.org/t/unable-to-configure-load-balancing-per-request-over-persistent-connection/3144 Should be backported to 1.8.	2018-10-27 22:10:29 +02:00
Willy Tarreau	ede3d884fc	MEDIUM: channel: merge back flags CF_WRITE_PARTIAL and CF_WRITE_EVENT The behaviour of the flag CF_WRITE_PARTIAL was modified by commit `95fad5ba4` ("BUG/MAJOR: stream-int: don't re-arm recv if send fails") due to a situation where it could trigger an immediate wake up of the other side, both acting in loops via the FD cache. This loss has caused the need to introduce CF_WRITE_EVENT as commit `c5a9d5bf`, to replace it, but both flags express more or less the same thing and this distinction creates a lot of confusion and complexity in the code. Since the FD cache now acts via tasklets, the issue worked around in the first patch no longer exists, so it's more than time to kill this hack and to restore CF_WRITE_PARTIAL's semantics (i.e.: there has been some write activity since we last left process_stream). This patch mostly reverts the two commits above. Only the part making use of CF_WROTE_DATA instead of CF_WRITE_PARTIAL to detect the loss of data upon connection setup was kept because it's more accurate and better suited.	2018-10-26 08:32:57 +02:00
Christopher Faulet	66943a4903	CLEANUP: http: Remove the unused function http_find_header	2018-10-23 10:22:36 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" have been remove. Now, "http_header_add_tail2" is always used. And the function "capture_headers" have been renamed into "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions's prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
Willy Tarreau	79e57336b5	REORG: http: move the code to different files The current proto_http.c file is huge and contains different processing domains making it very difficult to work on an alternative representation. This commit moves some parts to other files : - ACL registration code => http_acl.c This code only creates some ACL mappings and doesn't know anything about HTTP nor about the representation. This code could even have moved to acl.c but it was not worth polluting it again. - HTTP sample conversion => http_conv.c This code doesn't depend on the internal representation but definitely manipulates some HTTP elements, such as dates. It also has access to captures. - HTTP sample fetching => http_fetch.c This code does depend entirely on the internal representation but is totally independent on the analysers. Placing it into a different file will ease the transition to the new representation and the creation of a wrapper if required. An include file was created due to CHECK_HTTP_MESSAGE_FIRST() being used at various places. - HTTP action registration => http_act.c This code doesn't directly interact with the messages nor the transaction but it does so via some exported http functions like http_replace_req_line() or http_set_status() so it will be easier to change only this after the conversion. - a few very generic parts were found and moved to http.{c,h} as relevant. It is worth noting that the functions moved to these new files are not referenced anywhere outside of the files and are only called as registered callbacks, so these files do not even require associated include files.	2018-10-02 18:26:59 +02:00
Christopher Faulet	ca874b8d92	BUG/MEDIUM: http: Don't parse chunked body if there is no input data With recent modifications on the buffers API, when a buffer is released (calling b_free), we replace it by BUF_NULL where the area pointer is NULL. So many operations, like b_peek, must be avoided on a released or not allocated buffer. These changes were mainly made in the commit `c9fa048` ("MAJOR: buffer: finalize buffer detachment"). Since this commit, HAProxy can crash during the body parsing of chunked HTTP messages because there is no check on the channel's buffer in HTTP analyzers (http_request_forward_body and http_response_forward_body) nor in H1 functions reponsible to parse chunked content (h1_skip_chunk_crlf & co). If a stream is woken up after all input data were forwarded, its input channel's buffer is released (so set to BUF_NULL). In this case, if we resume the parsing of a chunk, HAProxy crashes. To fix this issue, we just skip the parsing of chunks if there is no input data for the corresponding channel. This is only done if the message state is strickly lower to HTTP_MSG_ENDING.	2018-09-20 14:37:58 +02:00
Willy Tarreau	b05e48a54d	BUILD: http: address a couple of null-deref warnings at -Wextra These two warnings are caused by the use of objt_server() without checking its result. These are turned to __objt_server() which is safe there.	2018-09-20 11:42:15 +02:00
Willy Tarreau	ab813a4b05	REORG: http: move some header value processing functions to http.c The following functions only deal with header field values and are agnostic to the HTTP version so they were moved to http.c : http_header_match2(), find_hdr_value_end(), find_cookie_value_end(), extract_cookie_value(), parse_qvalue(), http_find_url_param_pos(), http_find_next_url_param(). Those lacking the "http_" prefix were modified to have it.	2018-09-11 10:30:25 +02:00
Willy Tarreau	e10cd48a83	REORG: http: move the log encoding tables to log.c There are 3 tables in proto_http which are used exclusively by logs : hdr_encode_map[], url_encode_map[] and http_encode_map[]. They indicate what characters are safe to be emitted in logs depending on the part of the message where they are placed. Let's move this to log.c, as well as its initialization. It's worth noting that the rfc5424 map was already initialized there.	2018-09-11 10:30:25 +02:00
Willy Tarreau	04f1e2d202	REORG: http: move error codes production and processing to http.c These error codes and messages are agnostic to the version, even if they are represented as HTTP/1.0 messages. Ultimately they will have to be transformed into internal HTTP messages to be used everywhere. The HTTP/1.1 100 Continue message was turned to an IST and the local copy in the Lua code was removed.	2018-09-11 10:30:25 +02:00
Willy Tarreau	6b952c8101	REORG: http: move http_get_path() to http.c This function is purely HTTP once http_txn is put aside. So the original one was renamed to http_txn_get_path() and it extracts the relevant offsets from the txn to pass them to http_get_path(). One benefit of the new version is that it returns the length at the same time so that allowed to slightly simplify http_get_path_from_string() which had to look up the end pointer previously and which is not needed anymore.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	ddb68ac69e	REORG: cli: move the "show errors" handler from http to proxy There's nothing HTTP-specific there anymore at all, let's move this to the proxy where it belongs.	2018-09-07 18:36:50 +02:00
Willy Tarreau	fd9419d560	MINOR: http: remove the pointer to the error snapshot in http_capture_bad_message() It's not needed anymore as we know the side thanks to the channel. This will allow the proxy generic code to better manage the error snapshots.	2018-09-07 18:36:04 +02:00
Willy Tarreau	ef3ca73fc3	MINOR: http: make the HTTP error capture rely on the generic proxy code Now that we have a generic error capture function, let's simplify http_capture_bad_message() to make use of it. At this point the API is not changed at all, but it could be further simplified.	2018-09-07 18:36:04 +02:00
Willy Tarreau	7ccdd8dad9	MEDIUM: snapshot: implement a show() callback and use it for HTTP The HTTP dumps are now configurable in the code : "show errors" now calls a protocol-specific function to emit the decoded output. For now only HTTP is implemented.	2018-09-07 18:36:01 +02:00
Willy Tarreau	0b5b480594	MEDIUM: snapshot: start to reorder the HTTP snapshot output a little bit The output of "show errors" was slightly reordered to split the HTTP part in a single chunk_appendf() call. The useless buffer total input was replaced to report the buffer's start offset, which is the offset in the stream of the first input byte (thus not counting output). Also it was the opportunity to stop calling the stream "session".	2018-09-07 17:48:14 +02:00
Willy Tarreau	7480f323ff	MINOR: snapshot: split the error snapshots into common and proto-specific parts The idea will be to make the error snapshot feature accessible to other protocols than just HTTP. This patch only introduces an "http_snapshot" structure and renames a few fields to make things more explicit. The HTTP part was installed inside a union so that we can easily add more protocols in the future.	2018-09-07 16:13:45 +02:00
Willy Tarreau	5865a8fe69	MINOR: snapshot: restart on the event ID and not the stream ID The snapshots have the ability to restart a partial dump and they use the stream ID as the restart point. Since it's purely HTTP, let's use the event ID instead.	2018-09-07 15:00:43 +02:00
Willy Tarreau	e9e878a056	BUG/MINOR: http/threads: atomically increment the error snapshot ID Let's use an atomic increment for the error snapshot, as we'd rather not assign the same ID to two errors happening in parallel. It's very unlikely that it will ever happen though. This patch must be backported to 1.8 with the other one it relies on ("MINOR: thread: implement HA_ATOMIC_XADD()").	2018-09-07 11:31:58 +02:00
Willy Tarreau	90a7c03ec0	BUG/MINOR: stream: use atomic increments for the request counter The request counter is incremented when creating a new stream and when resetting a stream, preparing for a new request. Unfortunately during the thread migration this was missed, leading to non-atomic increments in case threads are in use. The most visible side effect is that two requests may have the same ID from time to time in the logs. However the SPOE also uses this ID to route responses back to the stream so it may also lead to occasional spurious SPOE timeouts. Note that it still doesn't guarantee temporal unicity in the stream identifiers since a long and a short connection could technically use the same ID. The likeliness that this happens at the same time is almost null (roughly threads*runqueue_depth/2^32 that it happens in the same poll loop), but it will have to be addressed later anyway. This patch must be backported to 1.8 with the other one it relies on ("MINOR: thread: implement HA_ATOMIC_XADD()").	2018-09-05 16:30:19 +02:00
Patrick Hemmer	e3faf02581	BUG/MEDIUM: lua: reset lua transaction between http requests Previously LUA code would maintain the transaction state between http requests, resulting in things like txn:get_priv() retrieving data from a previous request. This addresses the issue by ensuring the LUA state is reset between requests. Co-authored-by: Tim D�sterhus <tim@bastelstu.be>	2018-08-25 07:51:02 +02:00
Willy Tarreau	9c768fdca1	BUG/MEDIUM: http: don't store url_decode() result in the samples's length By convenience or laziness we used to store url_decode()'s return code into smp->data.u.str.data. The result checks applied there compare it to 0 while it's now unsigned since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	6e27be1a5d	BUG/MEDIUM: http: don't store exp_replace() result in the trash's length By convenience or laziness we used to store exp_replace()'s return code into trash.data. The result checks applied there compare trash.data to -1 while it's now unsigned since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	5f6333caca	BUG/MINOR: chunks: do not store -1 into chunk_printf() in case of error Since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct") a chunk length is unsigned so we can't reliably store -1 and check for negative values in the caller. Only one such location was found in proto_http's http-request auth rules (which cannot realistically fail). No backport is needed.	2018-08-22 05:16:31 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00

1 2 3 4 5 ...

1329 Commits