haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 00:57:02 +02:00

Author	SHA1	Message	Date
Christopher Faulet	6e54095d0a	BUG/MINOR: flt_trace/compression: Use the right flag to add the HTX support Of course, the flag FLT_CFG_FL_HTX must be used and not STRM_FLT_FL_HAS_FILTERS. "Fortunately", these 2 flags have the same value, so everything worked as expected.	2018-12-04 16:43:30 +01:00
Christopher Faulet	e6902cd57c	MEDIUM: compression: Adapt to be compatible with the HTX representation Functions analyzing request and response headers have been duplicated and adapted to support HTX messages. The callback http_payload have been implemented to handle the data compression itself. It loops on HTX blocks and replace uncompressed value of DATA block by compressed one. Unlike the HTTP legacy version, there is no chunk at all. So HTX version is significantly easier.	2018-12-01 17:37:27 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	0108d90c6c	MEDIUM: init: convert all trivial registration calls to initcalls This switches explicit calls to various trivial registration methods for keywords, muxes or protocols from constructors to INITCALL1 at stage STG_REGISTER. All these calls have in common to consume a single pointer and return void. Doing this removes 26 constructors. The following calls were addressed : - acl_register_keywords - bind_register_keywords - cfg_register_keywords - cli_register_kw - flt_register_keywords - http_req_keywords_register - http_res_keywords_register - protocol_register - register_mux_proto - sample_register_convs - sample_register_fetches - srv_register_keywords - tcp_req_conn_keywords_register - tcp_req_cont_keywords_register - tcp_req_sess_keywords_register - tcp_res_cont_keywords_register - flt_register_keywords	2018-11-26 19:50:32 +01:00
Joseph Herlant	942eea3f5c	CLEANUP: Fix typos in the http subsystem Fix typos in code comment of the http subsystem.	2018-11-18 22:26:42 +01:00
Willy Tarreau	ab813a4b05	REORG: http: move some header value processing functions to http.c The following functions only deal with header field values and are agnostic to the HTTP version so they were moved to http.c : http_header_match2(), find_hdr_value_end(), find_cookie_value_end(), extract_cookie_value(), parse_qvalue(), http_find_url_param_pos(), http_find_next_url_param(). Those lacking the "http_" prefix were modified to have it.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later make it possible to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the other ones.	2018-07-19 16:23:43 +02:00
Olivier Houchard	08afac0fd7	MEDIUM: buffers: move "output" from struct buffer to struct channel Since we never access this field directly anymore, but only through the channel's wrappers, it can now move to the channel. The buffers are now completely free from the distinction between input and output data.	2018-07-19 16:23:43 +02:00
Willy Tarreau	892f1dbe4f	MINOR: buffer: rename the "data" field to "area" Since we use "_data" for the amount of data at many places, as opposed to "_space" for the amount of space, let's rename the "data" field to "area" so that we can reuse "data" later for the amount of data in the buffer (currently called "len" despite not being contiguous).	2018-07-19 16:23:43 +02:00
Willy Tarreau	d54a8ceb97	MAJOR: start to change buffer API This is intentionally the minimal and safest set of changes, some cleanups are still required. These changes are quite tricky and cannot be independently tested, so it's important to keep this patch as bisectable as possible. buf_empty and buf_wanted were changed and are now exactly similar since there's no <p> member in the structure anymore. Given that no test is ever made in the code to check that buf == &buf_wanted, it may be possible that we don't need to have two anymore, unless some buf_empty tests have precedence. This will have to be investigated. A significant part of this commit affects the HTTP compression code, which used to deeply manipulate the input and output buffers without any reasonable solution for a better abstraction. For this reason, if any regression is met and designates this patch as the culprit, it is important to run tests which specifically involve compression or which definitely don't use it in order to spot the issue. Cc: Olivier Houchard <ohouchard@haproxy.com>	2018-07-19 16:23:42 +02:00
Olivier Houchard	0b662843c8	MEDIUM: compression: start to move to the new buffer API This part is tricky, it passes a channel where we used to have a buffer, in order to reduce the API changes during the big switch. This way all the channel's wrappers to distinguish between input and output are available. It also makes sense given that the compression applies on a channel since it's in the forwarding path.	2018-07-19 16:23:42 +02:00
Olivier Houchard	acd1403794	MINOR: buffer: Use b_add()/bo_add() instead of accessing b->i/b->o. Use the newly available functions instead of using the buffer fields directly.	2018-07-19 16:23:42 +02:00
Willy Tarreau	eac5259888	MINOR: buffer: use b_room() to determine available space in a buffer We used to have variations around buffer_total_space() and size-buffer_len() or size-b_data(). Let's simplify all this. buffer_len() was also removed as not used anymore.	2018-07-19 16:23:41 +02:00
Willy Tarreau	0e11d59af6	MINOR: buffer: remove bo_contig_data() The two call places now make use of b_contig_data(0) and check by themselves that the returned size is no larger than the scheduled output data.	2018-07-19 16:23:40 +02:00
Willy Tarreau	8f9c72d301	MINOR: buffer: remove bi_end() It was replaced by ci_tail() when the channel is known, or b_tail() in other cases.	2018-07-19 16:23:40 +02:00
Willy Tarreau	89faf5d7c3	MINOR: buffer: remove bo_ptr() It was replaced by co_head() when a channel was known, otherwise b_head().	2018-07-19 16:23:40 +02:00
Willy Tarreau	dda2e41881	MINOR: buffer: remove bi_ptr() It's now been replaced by b_head() when b->o is null, ci_head() when the channel is known, or b_peek(b, b->o) in other situations.	2018-07-19 16:23:40 +02:00
Willy Tarreau	7194d3cc3b	MINOR: buffer: split bi_contig_data() into ci_contig_data() and b_contig_data() This function was sometimes used from a channel and sometimes from a buffer. In both cases it requires knowledge of the size of the output data (to skip them). Here the split ensures the channel can deal with this point, and that other places not having output data can continue to work.	2018-07-19 16:23:40 +02:00
Willy Tarreau	bcbd39370f	MINOR: channel/buffer: replace b_{adv,rew} with c_{adv,rew} These ones manipulate the output data count which will be specific to the channel soon, so prepare the call points to use the channel only. The b_* functions are now unused and were removed.	2018-07-19 16:23:40 +02:00
Willy Tarreau	4d452384a3	MINOR: compression: pass the channel to http_compression_buffer_end() This will be needed to access the output data count from the channel after the buffer/channel changes.	2018-07-19 16:23:39 +02:00
Willy Tarreau	506a29ac6e	MINOR: buffer: switch buffer sizes and offsets to size_t Passing unsigned ints everywhere is painful, and will cause some headache later when we'll want to integrate better with struct ist which already uses size_t. Let's switch buffers to use size_t instead.	2018-07-19 16:23:39 +02:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Christopher Faulet	8ca3b4bc46	MEDIUM: threads/compression: Make HTTP compression thread-safe	2017-10-31 13:58:32 +01:00
Christopher Faulet	ff8abcd31d	MEDIUM: threads/proxy: Add a lock per proxy and atomically update proxy vars Now, each proxy contains a lock that must be used when necessary to protect it. Moreover, all proxy's counters are now updated using atomic operations.	2017-10-31 13:58:30 +01:00
Christopher Faulet	3dc860d19d	BUG/MINOR: compression: Check response headers before http-response rules eval This is required if we want to use res.comp or res.comp_algo sample fetches in http-response rules. This patch must be backported in 1.7.	2017-09-15 18:42:23 +02:00
Christopher Faulet	03d85538b3	BUG/MEDIUM: compression: Fix check on txn in smp_fetch_res_comp_algo The check was totally messed up. In the worst case, it led to a crash when the res.comp_algo sample fetch was retrieved on an uncompressed response (with the compression enabled). This patch must be backported in 1.7.	2017-09-15 18:42:23 +02:00
Christopher Faulet	a03d4ada26	MINOR: compression: Use a memory pool to allocate compression states Instead of doing a malloc/free to each HTTP transaction to allocate the compression state (when the HTTP compression is enabled), we use a memory pool.	2017-06-30 14:05:29 +02:00
Christopher Faulet	d60b3cf431	BUG/MAJOR: compression: Be sure to release the compression state in all cases This patch fixes an obvious memory leak in the compression filter. The compression state (comp_state) is allocated when a HTTP transaction starts, in channel_start_analyze callback, whether we are able to compress the response or not. So it must be released when the transaction ends, in channel_end_analyze callback. But there is a bug here. The state is released on the response side only. So, if a transaction ends before the response is started, it is never released. This happens when a connection is closed before the response is started. To fix the bug, statistics about the HTTP compression are now updated in http_end callback, when the response parsing ends. It happens only if no error is encountered and when the response is compressed. So, it is safe to release the compression state in channel_end_analyze callback, regardless of the channel's type. This patch must be backported in 1.7.	2017-06-30 14:05:29 +02:00
Willy Tarreau	2235b261b6	OPTIM: http: move all http character classs tables into a single one We used to have 7 different character classes, each was 256 bytes long, resulting in almost 2kB being used in the L1 cache. It's as cheap to test a bit than to check the byte is not null, so let's store a 7-bit composite value and check for the respective bits there instead. The executable is now 4 kB smaller and the performance on small objects increased by about 1% to 222k requests/second with a config involving 4 http-request rules including 1 header lookup, one header replacement, and 2 variable assignments.	2016-11-05 15:58:08 +01:00
Christopher Faulet	06ecf3ab72	BUG/MEDIUM: http/compression: Fix how chunked data are copied during the HTTP body parsing When the compression is enabled on HTTP responses, the chunked data are copied in a temporary buffer during the HTTP body parsing and then compressed when everything is forwarded to the client. But the amount of data that can be copied was not correctly calculated. In many cases it worked, except in the edge case where the channel buffer was almost full. [wt: bug introduced by `b77c5c26` in 1.7-dev, no backport needed]	2016-09-23 16:01:14 +02:00
Christopher Faulet	1339d744d5	MEDIUM: filters: Move HTTP headers filtering in its own callback Instead of calling 'channel_analyze' callback with the flag AN_FLT_HTTP_HDRS, now we use the new callback 'http_headers'. This change is done because 'channel_analyze' callback will be removed in a next commit.	2016-05-18 15:11:54 +02:00
Thierry Fournier	3610c39c8c	MINOR: filters: add opaque data Add opaque data between the filter keyword registration and the parsing function. This opaque data allows the same parser to be used with different registered keywords. The opaque data carries the information that mainly distinguishes one keyword from another. It will be used when registering Lua keywords.	2016-04-27 10:48:15 +02:00
Vincent Bernat	02779b6263	CLEANUP: uniformize last argument of malloc/calloc Instead of repeating the type of the LHS argument (sizeof(struct ...)) in calls to malloc/calloc, we directly use the pointer name (sizeof(...)). The following Coccinelle patch was used: @@ type T; T x; @@ x = malloc( - sizeof(T) + sizeof(x) ) @@ type T; T x; @@ x = calloc(1, - sizeof(T) + sizeof(*x) ) When the LHS is not just a variable name, no change is made. Moreover, the following patch was used to ensure that "1" is consistently used as a first argument of calloc, not the last one: @@ @@ calloc( + 1, ... - ,1 )	2016-04-03 14:17:42 +02:00
Willy Tarreau	be508f1580	BUG/MAJOR: samples: check smp->strm before using it Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. The problem is that earlier commit `87b0966` ("REORG/MAJOR: session: rename the "session" entity to "stream"") had split the session and stream resulting in the possibility for smp->strm to be NULL before the stream was initialized. This is what happens in tcp-request connection rulesets, as discovered by Baptiste. The sample fetch functions must now check that smp->strm is valid before using it. An alternative could consist in using a dummy stream with nothing in it to avoid some checks but it would only result in deferring them to the next step anyway, and making it harder to detect that a stream is valid or the dummy one. There is still an issue with variables which requires a completely independent fix. They use strm->sess to find the session with strm possibly NULL and passed as an argument. All call places indirectly use smp->strm to build strm. So the problem is there but the API needs to be changed to remove this duplicate argument that makes it much harder to know what pointer to use. This fix must be backported to 1.6, as well as the next one fixing variables.	2016-03-10 16:42:58 +01:00
Christopher Faulet	443ea1a242	MINOR: filters: Extract proxy stuff from the struct filter Now, filter's configuration (.id, .conf and .ops fields) is stored in the structure 'flt_conf'. So proxies own a flt_conf list instead of a filter list. When a filter is attached to a stream, it gets a pointer on its configuration. This avoids mixing the filter's context (owns by a stream) and its configuration (owns by a proxy). It also saves 2 pointers per filter instance.	2016-02-09 14:53:15 +01:00
Christopher Faulet	b77c5c2693	MEDIUM: filters: Optimize the HTTP compression for chunk encoded response Instead of compressing all chunks as they come, we store them in a temporary buffer. The compression happens during the forwarding phase. This change speeds up the compression of chunked response.	2016-02-09 14:53:15 +01:00
Christopher Faulet	3e7bc67722	MINOR: filters: Remove unused or useless stuff and do small optimizations	2016-02-09 14:53:15 +01:00
Christopher Faulet	da02e17d42	MAJOR: filters: Require explicit registration to filter HTTP body and TCP data Before, functions to filter HTTP body (and TCP data) were called from the moment at least one filter was attached to the stream. If no filter is interested by these data, this uselessly slows data parsing. A good example is the HTTP compression filter. Depending of request and response headers, the response compression can be enabled or not. So it could be really nice to call it only when enabled. So, now, to filter HTTP/TCP data, a filter must use the function register_data_filter. For TCP streams, this function can be called only once. But for HTTP streams, when needed, it must be called for each HTTP request or HTTP response. Only registered filters will be called during data parsing. At any time, a filter can be unregistered by calling the function unregister_data_filter.	2016-02-09 14:53:15 +01:00
Christopher Faulet	fcf035cb5a	MINOR: filters: Add stream_filters structure to hide filters info From the stream point of view, this new structure is opaque. It hides the filters' implementation details. So, impact for future optimizations will be reduced (well, we hope so...). Some small improvements have been made in filters.c to avoid useless checks.	2016-02-09 14:53:15 +01:00
Christopher Faulet	309c6418b0	MEDIUM: filters: Replace filter_http_headers callback by an analyzer This new analyzer will be called for each HTTP request/response, before the parsing of the body. It is identified by AN_FLT_HTTP_HDRS. Special care was taken about the following condition : * the frontend is a TCP proxy * filters are defined in the frontend section * the selected backend is a HTTP proxy So, this patch explicitly adds the AN_FLT_HTTP_HDRS analyzer on the request and the response channels when the backend is a HTTP proxy and when there are filters attached to the stream. This patch simplifies http_request_forward_body and http_response_forward_body functions.	2016-02-09 14:53:15 +01:00
Christopher Faulet	2fb2880caf	MEDIUM: filters: remove http_start_chunk, http_last_chunk and http_chunk_end For chunked HTTP request/response, the body filtering can be really expensive. In the worst case (many chunks of 1 byte), the filters overhead is 3 calls per chunk. While the http_data callback is useful, the others are just informative, so these callbacks have been removed. Of course, existing filters (trace and compression) have been updated accordingly. For the HTTP compression filter, the update is quite large. Its implementation is closer to the old one.	2016-02-09 14:53:15 +01:00
Christopher Faulet	92d3638d2d	MAJOR: filters/http: Rewrite the HTTP compression as a filter HTTP compression has been rewritten to use the filter API. This is more a PoC than anything else for now. It allocates memory to work. So, if only for that, it should be rewritten. In the meantime, the implementation has been refactored to allow its use with other filters. However, there are limitations that should be respected: - No filter placed after the compression one is allowed to change input data (in 'http_data' callback). - No filter placed before the compression one is allowed to change forwarded data (in 'http_forward_data' callback). For now, these limitations are informal, so you should be careful when you use several filters. About the configuration, 'compression' keywords are still supported and must be used to configure the HTTP compression behavior. In absence of a 'filter' line for the compression filter, it is added in the filter chain when the first 'compression' line is parsed. This is the easy way when you do not use other filters. But if another filter exists, an error is reported so that the user must explicitly declare the filter. For example: listen tst ... compression algo gzip compression offload ... filter flt_1 filter compression filter flt_2 ...	2016-02-09 14:53:15 +01:00
Christopher Faulet	3d97c90974	REORG: filters: Prepare creation of the HTTP compression filter HTTP compression will be moved in a true filter. To prepare the ground, some functions have been moved in a dedicated file. Idea is to keep everything about compression algos in compression.c and everything related to the filtering in flt_http_comp.c. For now, a header has been added to help during the transition. It will be removed later. Unused empty ACL keyword list was removed. The "compression" keyword parser was moved from cfgparse.c to flt_http_comp.c.	2016-02-09 14:53:15 +01:00

96 Commits