haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-16 03:56:56 +02:00

Author	SHA1	Message	Date
Willy Tarreau	089eaa0ba7	BUG/MINOR: backend: don't use url_param_name as a hint for BE_LB_ALGO_PH At a few places in the code we used to rely on this variable to guess what LB algo was in place. This is wrong because if the defaults section presets "balance url_param foo" and a backend uses "balance roundrobin", these locations will still see this url_param_name set and consider it. The harm is limited, as this only causes the beginning of the request body to be buffered. And in general this is a bad practice which prevents us from cleaning the lbprm stuff. Let's explicitly check the LB algo instead. This may be backported to all currently maintained versions.	2019-01-14 19:33:17 +01:00
Willy Tarreau	c9036c0004	BUG/MAJOR: cache: fix confusion between zero and uninitialized cache key The cache uses the first 32 bits of the uri's hash as the key to reference the object in the cache. It makes a special case of the value zero to mean that the object is not in the cache anymore. The problem is that when an object hashes as zero, it's still inserted but the eb32_delete() call is skipped, resulting in the object still being chained in the memory area while the block has been reclaimed and used for something else. Then when objects which were chained below it (techically any object since zero is at the root) are deleted, the walk through the upper object may encounter corrupted values where valid pointers were expected. But while this should only happen statically once on 4 billion, the problem gets worse when the cache-use conditions don't match the cache-store ones, because cache-store runs with an uninitialized key, which can create objects that will never be found by the lookup code, or worse, entries with a zero key preventing eviction of the tree node and resulting in a crash. It's easy to accidently end up on such a config because the request rules generally can't be used to decide on the response : http-request cache-use cache if { path_beg /images } http-response cache-store cache In this test, mixing traffic with /images/$RANDOM and /foo/$RANDOM will result in random keys being inserted, some of them possibly being zero, and crashes will quickly happen. The fix consists in 1) always initializing the transaction's cache_hash to zero, and 2) never storing a response for which the hash has not been calculated, as indicated by the value zero. It is worth noting that objects hashing as value zero will never be cached, but given that there's only one chance among 4 billion that this happens, this is totally harmless. This fix must be backported to 1.9 and 1.8.	2019-01-14 10:31:31 +01:00
Olivier Houchard	351411facd	BUG/MAJOR: sessions: Use an unlimited number of servers for the conn list. When a session adds a connection to its connection list, we used to remove connections for an another server if there were not enough room for our server. This can't work, because those lists are now the list of connections we're responsible for, not just the idle connections. To fix this, allow for an unlimited number of servers, instead of using an array, we're now using a linked list.	2018-12-28 16:33:13 +01:00
Willy Tarreau	14bfe9af12	CLEANUP: stream-int: consistently call the si/stream_int functions As long-time changes have accumulated over time, the exported functions of the stream-interface were almost all prefixed "si_<something>" while most private ones (mostly callbacks) were called "stream_int_<something>". There were still a few confusing exceptions, which were addressed to follow this shcme : - stream_sock_read0(), only used internally, was renamed stream_int_read0() and made static - stream_int_notify() is only private and was made static - stream_int_{check_timeouts,report_error,retnclose,register_handler,update} were renamed si_<something>. Now it is clearer when checking one of these if it risks to be used outside or not.	2018-12-19 15:25:43 +01:00
Olivier Houchard	a4d4fdfaa3	MEDIUM: sessions: Don't keep an infinite number of idling connections. In session, don't keep an infinite number of connection that can idle. Add a new frontend parameter, "max-session-srv-conns" to set a max number, with a default value of 5.	2018-12-15 23:50:10 +01:00
Olivier Houchard	a70e176843	BUG/MEDIUM: connections: Don't attempt to reuse an unusable connection. Before trying to add a connection to the idle list, make sure it doesn't have the error, the shutr or the shutw flag. If any of them is present, don't bother trying to recycle the connection, it's going to be destroyed anyway.	2018-12-13 18:09:17 +01:00
Willy Tarreau	afba57ae80	REORG: h1: merge types+proto into common/h1.h These two files are self-contained and do not depend on other layers, so let's remerge them together for easier manipulation.	2018-12-11 17:15:13 +01:00
Willy Tarreau	c5a4fd5c30	REORG: http: create http_msg.c to place there some legacy HTTP parts Lots of HTTP code still uses struct http_msg. Not only this code is still huge, but it's part of the legacy interface. Let's move most of these functions to a separate file http_msg.c to make it more visible which file relies on what. It's mostly symmetrical with what is present in http_htx.c. The function http_transform_header_str() which used to rely on two function pointers to look up a header was simplified to rely on two variants http_legacy_replace_{,full_}header(), making both sides of the function much simpler. No code was changed beyond these moves.	2018-12-11 17:15:13 +01:00
Willy Tarreau	1a18b54142	REORG: connection: centralize the conn_set_{tos,mark,quickack} functions There were a number of ugly setsockopt() calls spread all over proto_http.c, proto_htx.c and hlua.c just to manipulate the front connection's TOS, mark or TCP quick-ack. These ones entirely relied on the connection, its existence, its control layer's presence, and its addresses. Worse, inet_set_tos() was placed in proto_http.c, exported and used from the two other ones, surrounded in #ifdefs. This patch moves this code to connection.h and makes the other ones rely on it without ifdefs.	2018-12-11 16:41:51 +01:00
Willy Tarreau	b54c40ac0b	BUILD: threads: fix minor build warnings when threads are disabled These potential null-deref warnings are emitted on gcc 7 and above when threads are disabled due to the use of objt_server() after an existing validity test. Let's switch to __objt_server() since we know the pointer is valid, it will not confuse the compiler. Some of these may be backported to 1.8.	2018-12-02 19:28:41 +01:00
Christopher Faulet	a7b677cd0d	MEDIUM: proto_htx: Convert all HTTP error messages into HTX During startup, after the configuration parsing, all HTTP error messages (errorloc, errorfile or default messages) are converted into HTX messages and stored in dedicated buffers. We use it to return errors in the HTX analyzers instead of using ugly OOB blocks.	2018-12-01 17:37:27 +01:00
Christopher Faulet	ed26fb8ac8	BUG/MINOR: http: Use out buffer instead of trash to display error snapshot the function http_show_error_snapshot() must not use the trash buffer to append the HTTP error description. Instead, it must use the <out> buffer, its first argument. Note that concretely, this function always succeeds because <out> is always the trash buffer.	2018-12-01 17:20:36 +01:00
Olivier Houchard	985f139aa2	MEDIUM: session: Steal owner-less connections on end of transaction. When a transaction ends, if we want to do keepalive, and the connection we used didn't have an owner, attach the connection to the session, so that we don't have to destroy it, and we can reuse it later.	2018-12-01 10:47:19 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	9efd7456e0	MEDIUM: tasks: collect per-task CPU time and latency Right now we measure for each task the cumulated time spent waiting for the CPU and using it. The timestamp uses a 64-bit integer to report a nanosecond-level date. This is only enabled when "profiling.tasks" is enabled, and consumes less than 1% extra CPU on x86_64 when enabled. The cumulated processing time and wait time are reported in "show sess". The task's counters are also reset when an HTTP transaction is reset since the HTTP part pretends to restart on a fresh new stream. This will make sure we always report correct numbers for each request in the logs.	2018-11-22 15:44:21 +01:00
Joseph Herlant	5ba8025976	CLEANUP: fix typos in the proto_http subsystem Fixes typos in the code comments of the proto_http subsystem.	2018-11-18 22:23:15 +01:00
Christopher Faulet	fefc73da34	MINOR: proto_htx: Add functions htx_perform_server_redirect It is more or less the same than legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	64159df1fb	MINOR: proto_htx: Add functions htx_send_name_header It is more or less the same than legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	25a02f65b1	MINOR: proto_htx: Add functions to check the cacheability of HTX messages It is more or less the same than legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch on the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	8d8ac191a7	MINOR: proto_htx: Add functions htx_req_replace_stline and htx_res_set_status It is more or less the same than legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch on the HTX code when applicable.	2018-11-18 22:08:56 +01:00
Christopher Faulet	9768c2660e	MAJOR: mux-h1/proto_htx: Switch mux-h1 and HTX analyzers on the HTX representation The mux-h1 now parses and formats HTTP/1 messages using the HTX representation. The HTX analyzers have been updated too. For now, only htx_wait_for_{request/response} and http_{request/response}_forward_body have been adapted. Others are disabled for now. Now, the HTTP messages are parsed by the mux on a side and then, after analysis, formatted on the other side. In the middle, in the stream, there is no more parsing. Among other things, the version parsing is now handled by the mux. During the data forwarding, depending the value of the "extra" field, we are able to know if the body length is known or not and if yes, how many bytes are still expected.	2018-11-18 22:08:54 +01:00
Christopher Faulet	0f226958b7	MINOR: proto_htx: Add some functions to handle HTX messages More functions will come, but it is the minimum to switch HTX analyzers on the HTX internal representation.	2018-11-18 22:08:54 +01:00
Christopher Faulet	f2824e6e10	MAJOR: mux-h1/proto_htx: Handle keep-alive connections in the mux Now, the connection mode is detected in the mux and not in HTX analyzers anymore. Keep-alive connections are now managed by the mux. A new stream is created for each transaction. This removes the most important part of the synchronization between channels and the HTTP transaction cleanup. These changes only affect the HTX part (proto_htx.c). Legacy HTTP analyzers remain untouched for now. On the client-side, the mux is responsible to create new streams when a new request starts. It is also responsible to parse and update the "Connection:" header of the response. On the server-side, the mux is responsible to parse and update the "Connection:" header of the request. Muxes on each side are independent. For now, there is no connection pool on the server-side, so it always close the server connection.	2018-11-18 22:02:42 +01:00
Christopher Faulet	e0768ebabc	MEDIUM: proto_htx: Add HTX analyzers and use it when the mux H1 is used For now, these analyzers are just copies of the legacy HTTP analyzers. But, during the HTTP refactoring, it will be the main place where it will be visible. And in legacy analyzers, the macro IS_HTX_STRM is used to know if the HTX version should be called or not. Note: the following commits were applied to proto_http.c after this patch was developed and need to be studied to see if an adaptation to htx is required : `fd9b68c` BUG/MINOR: only mark connections private if NTLM is detected	2018-11-18 21:45:50 +01:00
Christopher Faulet	27a3dc8fb2	MINOR: http: Call http_send_name_header with the stream instead of the txn This is just a minor change to ease integrartion of the HTX.	2018-11-18 21:45:49 +01:00
Olivier Houchard	7c6f8b146d	MAJOR: connections: Detach connections from streams. Do not destroy the connection when we're about to destroy a stream. This prevents us from doing keepalive on server connections when the client is using HTTP/2, as a new stream is created for each request. Instead, the session is now responsible for destroying connections. When reusing connections, the attach() mux method is now used to create a new conn_stream.	2018-11-18 21:45:45 +01:00
Christopher Faulet	3c0544efbf	BUG/MINOR: http: Be sure to sent fully formed HTTP 103 responses The previous commit fedceaf33 ("MINOR: http: Regroup return statements of http_req_get_intercept_rule at the end") partly fixes the problem. But not entierly. Because HTTP 103 reponses were sent line by line it is possible to mix them with others. For instance, an early-hint rule followed by a redirect rule leaving the response buffer totally messed up. Furthermore, if we fail to add the last CRLF to finish the HTTP 103 response because there is no more space in the buffer, it leave the buffer with an unfinished and invalid message. This patch fixes the bug by creating a fully formed HTTP 103 response before trying to push it in the response buffer. If an error occurred during the copy or if another response was already sent, the HTTP 103 response is ignored. However, the last point should never happened because, for redirects and authentication errors, we first try to copy any pending HTTP 103 response.	2018-11-16 16:05:51 +01:00
Christopher Faulet	6c243ebb9f	MINOR: http: Regroup return statements of http_res_get_intercept_rule at the end Instead of having multiple return statements spreaded here and there in middle of the function, we just exit from the loop setting the right return code. It let a chance to do some work before leaving the function. It is also less error prone.	2018-11-16 16:05:51 +01:00
Christopher Faulet	ea827bdcbc	MINOR: http: Regroup return statements of http_req_get_intercept_rule at the end Instead of having multiple return statements spreaded here and there in middle of the function, we just exit from the loop setting the right return code. It let a chance to do some work before leaving the function. It is also less error prone.	2018-11-16 16:05:51 +01:00
Fr�d�ric L�caille	9ca51aa288	MINOR: http: Implement "early-hint" http request rules. This patch implements http_apply_early_hint_rule() function is responsible of building HTTP 103 Early Hint responses each time a "early-hint" rule is matched.	2018-11-12 21:08:55 +01:00
Willy Tarreau	9d9ccdbf8b	BUG/MAJOR: http: http_txn_get_path() may deference an inexisting buffer When the "path" sample fetch function is called without any path, the function doesn't check that the request buffer is allocated. While this doesn't happen with the request during processing, it can definitely happen when mistakenly trying to reference a path from the response since the request channel is not allocated anymore. It's certain that this bug was emphasized by the buffer changes that went in 1.9 and the HTTP refactoring, but at first glance, 1.8 doesn't seem 100% safe either so it's possible that older version are affected as well. Thanks to PiBa-NL for reporting this bug with a reproducer.	2018-10-28 20:16:12 +01:00
Willy Tarreau	cda7f3f5c2	MINOR: stream: don't prune variables if the list is empty The vars_prune() and vars_init() functions involve locking while most of the time there is no variable at all in streams nor sessions. Let's check for emptiness before calling these functions. Simply doing this has increased the multithreaded performance from 1.5 to 5% depending on the workload.	2018-10-28 13:46:47 +01:00
Lukas Tribus	80512b186f	BUG/MINOR: only auto-prefer last server if lb-alg is non-deterministic While "option prefer-last-server" only applies to non-deterministic load balancing algorithms, 401/407 responses actually caused haproxy to prefer the last server unconditionally. As this breaks deterministic load balancing algorithms like uri, this patch applies the same condition here. Should be backported to 1.8 (together with "BUG/MINOR: only mark connections private if NTLM is detected").	2018-10-27 22:10:32 +02:00
Lukas Tribus	fd9b68c48e	BUG/MINOR: only mark connections private if NTLM is detected Instead of marking all connections that see a 401/407 response private (for connection reuse), this patch detects a RFC4559/NTLM authentication scheme and restricts the private setting to those connections. This is so we can reuse connections with 401/407 responses with deterministic load balancing algorithms later (which requires another fix). This fixes the problem reported here by Elliot Barlas : https://discourse.haproxy.org/t/unable-to-configure-load-balancing-per-request-over-persistent-connection/3144 Should be backported to 1.8.	2018-10-27 22:10:29 +02:00
Willy Tarreau	ede3d884fc	MEDIUM: channel: merge back flags CF_WRITE_PARTIAL and CF_WRITE_EVENT The behaviour of the flag CF_WRITE_PARTIAL was modified by commit `95fad5ba4` ("BUG/MAJOR: stream-int: don't re-arm recv if send fails") due to a situation where it could trigger an immediate wake up of the other side, both acting in loops via the FD cache. This loss has caused the need to introduce CF_WRITE_EVENT as commit `c5a9d5bf`, to replace it, but both flags express more or less the same thing and this distinction creates a lot of confusion and complexity in the code. Since the FD cache now acts via tasklets, the issue worked around in the first patch no longer exists, so it's more than time to kill this hack and to restore CF_WRITE_PARTIAL's semantics (i.e.: there has been some write activity since we last left process_stream). This patch mostly reverts the two commits above. Only the part making use of CF_WROTE_DATA instead of CF_WRITE_PARTIAL to detect the loss of data upon connection setup was kept because it's more accurate and better suited.	2018-10-26 08:32:57 +02:00
Christopher Faulet	66943a4903	CLEANUP: http: Remove the unused function http_find_header	2018-10-23 10:22:36 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" have been remove. Now, "http_header_add_tail2" is always used. And the function "capture_headers" have been renamed into "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions's prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
Willy Tarreau	79e57336b5	REORG: http: move the code to different files The current proto_http.c file is huge and contains different processing domains making it very difficult to work on an alternative representation. This commit moves some parts to other files : - ACL registration code => http_acl.c This code only creates some ACL mappings and doesn't know anything about HTTP nor about the representation. This code could even have moved to acl.c but it was not worth polluting it again. - HTTP sample conversion => http_conv.c This code doesn't depend on the internal representation but definitely manipulates some HTTP elements, such as dates. It also has access to captures. - HTTP sample fetching => http_fetch.c This code does depend entirely on the internal representation but is totally independent on the analysers. Placing it into a different file will ease the transition to the new representation and the creation of a wrapper if required. An include file was created due to CHECK_HTTP_MESSAGE_FIRST() being used at various places. - HTTP action registration => http_act.c This code doesn't directly interact with the messages nor the transaction but it does so via some exported http functions like http_replace_req_line() or http_set_status() so it will be easier to change only this after the conversion. - a few very generic parts were found and moved to http.{c,h} as relevant. It is worth noting that the functions moved to these new files are not referenced anywhere outside of the files and are only called as registered callbacks, so these files do not even require associated include files.	2018-10-02 18:26:59 +02:00
Christopher Faulet	ca874b8d92	BUG/MEDIUM: http: Don't parse chunked body if there is no input data With recent modifications on the buffers API, when a buffer is released (calling b_free), we replace it by BUF_NULL where the area pointer is NULL. So many operations, like b_peek, must be avoided on a released or not allocated buffer. These changes were mainly made in the commit `c9fa048` ("MAJOR: buffer: finalize buffer detachment"). Since this commit, HAProxy can crash during the body parsing of chunked HTTP messages because there is no check on the channel's buffer in HTTP analyzers (http_request_forward_body and http_response_forward_body) nor in H1 functions reponsible to parse chunked content (h1_skip_chunk_crlf & co). If a stream is woken up after all input data were forwarded, its input channel's buffer is released (so set to BUF_NULL). In this case, if we resume the parsing of a chunk, HAProxy crashes. To fix this issue, we just skip the parsing of chunks if there is no input data for the corresponding channel. This is only done if the message state is strickly lower to HTTP_MSG_ENDING.	2018-09-20 14:37:58 +02:00
Willy Tarreau	b05e48a54d	BUILD: http: address a couple of null-deref warnings at -Wextra These two warnings are caused by the use of objt_server() without checking its result. These are turned to __objt_server() which is safe there.	2018-09-20 11:42:15 +02:00
Willy Tarreau	ab813a4b05	REORG: http: move some header value processing functions to http.c The following functions only deal with header field values and are agnostic to the HTTP version so they were moved to http.c : http_header_match2(), find_hdr_value_end(), find_cookie_value_end(), extract_cookie_value(), parse_qvalue(), http_find_url_param_pos(), http_find_next_url_param(). Those lacking the "http_" prefix were modified to have it.	2018-09-11 10:30:25 +02:00
Willy Tarreau	e10cd48a83	REORG: http: move the log encoding tables to log.c There are 3 tables in proto_http which are used exclusively by logs : hdr_encode_map[], url_encode_map[] and http_encode_map[]. They indicate what characters are safe to be emitted in logs depending on the part of the message where they are placed. Let's move this to log.c, as well as its initialization. It's worth noting that the rfc5424 map was already initialized there.	2018-09-11 10:30:25 +02:00
Willy Tarreau	04f1e2d202	REORG: http: move error codes production and processing to http.c These error codes and messages are agnostic to the version, even if they are represented as HTTP/1.0 messages. Ultimately they will have to be transformed into internal HTTP messages to be used everywhere. The HTTP/1.1 100 Continue message was turned to an IST and the local copy in the Lua code was removed.	2018-09-11 10:30:25 +02:00
Willy Tarreau	6b952c8101	REORG: http: move http_get_path() to http.c This function is purely HTTP once http_txn is put aside. So the original one was renamed to http_txn_get_path() and it extracts the relevant offsets from the txn to pass them to http_get_path(). One benefit of the new version is that it returns the length at the same time so that allowed to slightly simplify http_get_path_from_string() which had to look up the end pointer previously and which is not needed anymore.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	ddb68ac69e	REORG: cli: move the "show errors" handler from http to proxy There's nothing HTTP-specific there anymore at all, let's move this to the proxy where it belongs.	2018-09-07 18:36:50 +02:00
Willy Tarreau	fd9419d560	MINOR: http: remove the pointer to the error snapshot in http_capture_bad_message() It's not needed anymore as we know the side thanks to the channel. This will allow the proxy generic code to better manage the error snapshots.	2018-09-07 18:36:04 +02:00
Willy Tarreau	ef3ca73fc3	MINOR: http: make the HTTP error capture rely on the generic proxy code Now that we have a generic error capture function, let's simplify http_capture_bad_message() to make use of it. At this point the API is not changed at all, but it could be further simplified.	2018-09-07 18:36:04 +02:00

1 2 3 4 5 ...

1340 Commits