haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-17 04:27:00 +02:00

Author	SHA1	Message	Date
Willy Tarreau	1e62df92e3	MEDIUM: stats: implement a typed output format for stats The output for each field is : field:<origin><nature><scope>:type:value where field reminds the type of the object being dumped as well as its position (pid, iid, sid), field number and field name. This way a monitoring utility may very well report all available information without knowing new fields in advance. This format is also supported in the HTTP version of the stats by adding ";typed" after the URI, instead of ";csv" for the CSV format. The doc was not updated yet.	2016-03-11 17:24:15 +01:00
Willy Tarreau	6204cd9f27	BUG/MAJOR: vars: always retrieve the stream and session from the sample This is the continuation of previous patch called "BUG/MAJOR: samples: check smp->strm before using it". It happens that variables may have a session-wide scope, and that their session is retrieved by dereferencing the stream. But nothing prevents them from being used from a streamless context such as tcp-request connection, thus crashing the process. Example : tcp-request connection accept if { src,set-var(sess.foo) -m found } In order to fix this, we have to always ensure that variable manipulation only happens via the sample, which contains the correct owner and context, and that we never use one from a different source. This results in quite a large change since a lot of functions are inderctly involved in the call chain, but the change is easy to follow. This fix must be backported to 1.6, and requires the last two patches.	2016-03-10 17:28:04 +01:00
Willy Tarreau	be508f1580	BUG/MAJOR: samples: check smp->strm before using it Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. The problem is that earlier commit `87b0966` ("REORG/MAJOR: session: rename the "session" entity to "stream"") had split the session and stream resulting in the possibility for smp->strm to be NULL before the stream was initialized. This is what happens in tcp-request connection rulesets, as discovered by Baptiste. The sample fetch functions must now check that smp->strm is valid before using it. An alternative could consist in using a dummy stream with nothing in it to avoid some checks but it would only result in deferring them to the next step anyway, and making it harder to detect that a stream is valid or the dummy one. There is still an issue with variables which requires a complete independant fix. They use strm->sess to find the session with strm possibly NULL and passed as an argument. All call places indirectly use smp->strm to build strm. So the problem is there but the API needs to be changed to remove this duplicate argument that makes it much harder to know what pointer to use. This fix must be backported to 1.6, as well as the next one fixing variables.	2016-03-10 16:42:58 +01:00
Christopher Faulet	75e2eb66e5	MINOR: filters/http: Forward remaining data when a channel has no "data" filters This is an improvement, especially when the message body is big. Before this patch, remaining data were forwarded when there is no filter on the stream. Now, the forwarding is triggered when there is no "data" filter on the channel. When no filter is used, there is no difference, but when at least one filter is used, it can be really significative.	2016-02-09 14:53:15 +01:00
Christopher Faulet	113f7decfc	MINOR: filters/http: Slightly update the parsing of chunks Now, http_parse_chunk_size and http_skip_chunk_crlf return the number of bytes parsed on success. http_skip_chunk_crlf does not use msg->sol anymore. On the other hand, http_forward_trailers is unchanged. It returns >0 if the end of trailers is reached and 0 if not. In all cases (except if an error is encountered), msg->sol contains the length of the last parsed part of the trailer headers. Internal doc and comments about msg->sol has been updated accordingly.	2016-02-09 14:53:15 +01:00
Christopher Faulet	da02e17d42	MAJOR: filters: Require explicit registration to filter HTTP body and TCP data Before, functions to filter HTTP body (and TCP data) were called from the moment at least one filter was attached to the stream. If no filter is interested by these data, this uselessly slows data parsing. A good example is the HTTP compression filter. Depending of request and response headers, the response compression can be enabled or not. So it could be really nice to call it only when enabled. So, now, to filter HTTP/TCP data, a filter must use the function register_data_filter. For TCP streams, this function can be called only once. But for HTTP streams, when needed, it must be called for each HTTP request or HTTP response. Only registered filters will be called during data parsing. At any time, a filter can be unregistered by calling the function unregister_data_filter.	2016-02-09 14:53:15 +01:00
Christopher Faulet	dbe34eb8cb	MEDIUM: filters/http: Move body parsing of HTTP messages in dedicated functions Now body parsing is done in http_msg_forward_body and http_msg_forward_chunked_body functions, regardless of whether we parse a request or a response. Parsing result is still handled in http_request_forward_body and http_response_forward_body functions. This patch will ease futur optimizations, mainly on filters.	2016-02-09 14:53:15 +01:00
Christopher Faulet	309c6418b0	MEDIUM: filters: Replace filter_http_headers callback by an analyzer This new analyzer will be called for each HTTP request/response, before the parsing of the body. It is identified by AN_FLT_HTTP_HDRS. Special care was taken about the following condition : * the frontend is a TCP proxy * filters are defined in the frontend section * the selected backend is a HTTP proxy So, this patch explicitly add AN_FLT_HTTP_HDRS analyzer on the request and the response channels when the backend is a HTTP proxy and when there are filters attatched on the stream. This patch simplifies http_request_forward_body and http_response_forward_body functions.	2016-02-09 14:53:15 +01:00
Christopher Faulet	2fb2880caf	MEDIUM: filters: remove http_start_chunk, http_last_chunk and http_chunk_end For Chunked HTTP request/response, the body filtering can be really expensive. In the worse case (many chunks of 1 bytes), the filters overhead is of 3 calls per chunk. If http_data callback is useful, others are just informative. So these callbacks has been removed. Of course, existing filters (trace and compression) has beeen updated accordingly. For the HTTP compression filter, the update is quite huge. Its implementation is closer to the old one.	2016-02-09 14:53:15 +01:00
Christopher Faulet	3e34429515	MEDIUM: filters: Use macros to call filters callbacks to speed-up processing When no filter is attached to the stream, the CPU footprint due to the calls to filters_* functions is huge, especially for chunk-encoded messages. Using macros to check if we have some filters or not is a great improvement. Furthermore, instead of checking the filter list emptiness, we introduce a flag to know if filters are attached or not to a stream.	2016-02-09 14:53:15 +01:00
Christopher Faulet	92d3638d2d	MAJOR: filters/http: Rewrite the HTTP compression as a filter HTTP compression has been rewritten to use the filter API. This is more a PoC than other thing for now. It allocates memory to work. So, if only for that, it should be rewritten. In the mean time, the implementation has been refactored to allow its use with other filters. However, there are limitations that should be respected: - No filter placed after the compression one is allowed to change input data (in 'http_data' callback). - No filter placed before the compression one is allowed to change forwarded data (in 'http_forward_data' callback). For now, these limitations are informal, so you should be careful when you use several filters. About the configuration, 'compression' keywords are still supported and must be used to configure the HTTP compression behavior. In absence of a 'filter' line for the compression filter, it is added in the filter chain when the first compression' line is parsed. This is an easy way to do when you do not use other filters. But another filter exists, an error is reported so that the user must explicitly declare the filter. For example: listen tst ... compression algo gzip compression offload ... filter flt_1 filter compression filter flt_2 ...	2016-02-09 14:53:15 +01:00
Christopher Faulet	3d97c90974	REORG: filters: Prepare creation of the HTTP compression filter HTTP compression will be moved in a true filter. To prepare the ground, some functions have been moved in a dedicated file. Idea is to keep everything about compression algos in compression.c and everything related to the filtering in flt_http_comp.c. For now, a header has been added to help during the transition. It will be removed later. Unused empty ACL keyword list was removed. The "compression" keyword parser was moved from cfgparse.c to flt_http_comp.c.	2016-02-09 14:53:15 +01:00
Christopher Faulet	d7c9196ae5	MAJOR: filters: Add filters support This patch adds the support of filters in HAProxy. The main idea is to have a way to "easely" extend HAProxy by adding some "modules", called filters, that will be able to change HAProxy behavior in a programmatic way. To do so, many entry points has been added in code to let filters to hook up to different steps of the processing. A filter must define a flt_ops sutrctures (see include/types/filters.h for details). This structure contains all available callbacks that a filter can define: struct flt_ops { /* * Callbacks to manage the filter lifecycle / int (init) (struct proxy p); void (deinit)(struct proxy p); int (check) (struct proxy p); / * Stream callbacks / void (stream_start) (struct stream s); void (stream_accept) (struct stream s); void (session_establish)(struct stream s); void (stream_stop) (struct stream s); / * HTTP callbacks / int (http_start) (struct stream s, struct http_msg msg); int (http_start_body) (struct stream s, struct http_msg msg); int (http_start_chunk) (struct stream s, struct http_msg msg); int (http_data) (struct stream s, struct http_msg msg); int (http_last_chunk) (struct stream s, struct http_msg msg); int (http_end_chunk) (struct stream s, struct http_msg msg); int (http_chunk_trailers)(struct stream s, struct http_msg msg); int (http_end_body) (struct stream s, struct http_msg msg); void (http_end) (struct stream s, struct http_msg msg); void (http_reset) (struct stream s, struct http_msg msg); int (http_pre_process) (struct stream s, struct http_msg msg); int (http_post_process) (struct stream s, struct http_msg msg); void (http_reply) (struct stream s, short status, const struct chunk msg); }; To declare and use a filter, in the configuration, the "filter" keyword must be used in a listener/frontend section: frontend test ... filter <FILTER-NAME> [OPTIONS...] The filter referenced by the <FILTER-NAME> must declare a configuration parser on its own name to fill flt_ops and filter_conf field in the proxy's structure. An exemple will be provided later to make it perfectly clear. For now, filters cannot be used in backend section. But this is only a matter of time. Documentation will also be added later. This is the first commit of a long list about filters. It is possible to have several filters on the same listener/frontend. These filters are stored in an array of at most MAX_FILTERS elements (define in include/types/filters.h). Again, this will be replaced later by a list of filters. The filter API has been highly refactored. Main changes are: * Now, HA supports an infinite number of filters per proxy. To do so, filters are stored in list. * Because filters are stored in list, filters state has been moved from the channel structure to the filter structure. This is cleaner because there is no more info about filters in channel structure. * It is possible to defined filters on backends only. For such filters, stream_start/stream_stop callbacks are not called. Of course, it is possible to mix frontend and backend filters. * Now, TCP streams are also filtered. All callbacks without the 'http_' prefix are called for all kind of streams. In addition, 2 new callbacks were added to filter data exchanged through a TCP stream: - tcp_data: it is called when new data are available or when old unprocessed data are still waiting. - tcp_forward_data: it is called when some data can be consumed. * New callbacks attached to channel were added: - channel_start_analyze: it is called when a filter is ready to process data exchanged through a channel. 2 new analyzers (a frontend and a backend) are attached to channels to call this callback. For a frontend filter, it is called before any other analyzer. For a backend filter, it is called when a backend is attached to a stream. So some processing cannot be filtered in that case. - channel_analyze: it is called before each analyzer attached to a channel, expects analyzers responsible for data sending. - channel_end_analyze: it is called when all other analyzers have finished their processing. A new analyzers is attached to channels to call this callback. For a TCP stream, this is always the last one called. For a HTTP one, the callback is called when a request/response ends, so it is called one time for each request/response. * 'session_established' callback has been removed. Everything that is done in this callback can be handled by 'channel_start_analyze' on the response channel. * 'http_pre_process' and 'http_post_process' callbacks have been replaced by 'channel_analyze'. * 'http_start' callback has been replaced by 'http_headers'. This new one is called just before headers sending and parsing of the body. * 'http_end' callback has been replaced by 'channel_end_analyze'. * It is possible to set a forwarder for TCP channels. It was already possible to do it for HTTP ones. * Forwarders can partially consumed forwardable data. For this reason a new HTTP message state was added before HTTP_MSG_DONE : HTTP_MSG_ENDING. Now all filters can define corresponding callbacks (http_forward_data and tcp_forward_data). Each filter owns 2 offsets relative to buf->p, next and forward, to track, respectively, input data already parsed but not forwarded yet by the filter and parsed data considered as forwarded by the filter. A any time, we have the warranty that a filter cannot parse or forward more input than previous ones. And, of course, it cannot forward more input than it has parsed. 2 macros has been added to retrieve these offets: FLT_NXT and FLT_FWD. In addition, 2 functions has been added to change the 'next size' and the 'forward size' of a filter. When a filter parses input data, it can alter these data, so the size of these data can vary. This action has an effet on all previous filters that must be handled. To do so, the function 'filter_change_next_size' must be called, passing the size variation. In the same spirit, if a filter alter forwarded data, it must call the function 'filter_change_forward_size'. 'filter_change_next_size' can be called in 'http_data' and 'tcp_data' callbacks and only these ones. And 'filter_change_forward_size' can be called in 'http_forward_data' and 'tcp_forward_data' callbacks and only these ones. The data changes are the filter responsability, but with some limitation. It must not change already parsed/forwarded data or data that previous filters have not parsed/forwarded yet. Because filters can be used on backends, when we the backend is set for a stream, we add filters defined for this backend in the filter list of the stream. But we must only do that when the backend and the frontend of the stream are not the same. Else same filters are added a second time leading to undefined behavior. The HTTP compression code had to be moved. So it simplifies http_response_forward_body function. To do so, the way the data are forwarded has changed. Now, a filter (and only one) can forward data. In a commit to come, this limitation will be removed to let all filters take part to data forwarding. There are 2 new functions that filters should use to deal with this feature: * flt_set_http_data_forwarder: This function sets the filter (using its id) that will forward data for the specified HTTP message. It is possible if it was not already set by another filter _AND_ if no data was yet forwarded (msg->msg_state <= HTTP_MSG_BODY). It returns -1 if an error occurs. * flt_http_data_forwarder: This function returns the filter id that will forward data for the specified HTTP message. If there is no forwarder set, it returns -1. When an HTTP data forwarder is set for the response, the HTTP compression is disabled. Of course, this is not definitive.	2016-02-09 14:53:15 +01:00
Willy Tarreau	53f9685b72	BUG/MEDIUM: http-reuse: do not share private connections across backends When working on the previous bug, it appeared that it the case that was triggering the bug would also work between two backends, one of which doesn't support http-reuse. The reason is that while the idle connection is moved to the private pool, upon reuse we only check if it holds the CO_FL_PRIVATE flag. And we don't set this flag when there's no reuse. So let's always set it in this case, it will guarantee that no undesired connection sharing may happen. This fix must be backported to 1.6.	2016-02-03 21:23:08 +01:00
Cyril Bonté	f78d8967d7	BUG/MEDIUM: sample: http_date() doesn't provide the right day of the week Gregor Kovač reported that http_date() did not return the right day of the week. For example "Sat, 22 Jan 2016 17:43:38 GMT" instead of "Fri, 22 Jan 2016 17:43:38 GMT". Indeed, gmtime() returns a 'struct tm' result, where tm_wday begins on Sunday, whereas the code assumed it began on Monday. This patch must be backported to haproxy 1.5 and 1.6.	2016-01-22 19:52:31 +01:00
Christopher Faulet	a94e5a548c	MINOR: filters/http: Use a wrapper function instead of stream_int_retnclose The function http_reply_and_close has been added in proto_http.c to wrap calls to stream_int_retnclose. This functions will be modified when the filters will be added.	2015-12-28 16:49:36 +01:00
Christopher Faulet	a46bbd893a	BUG/MINOR: http: Be sure to process all the data received from a server When the response body is forwarded, if the server closes the input before the end, an error is thrown. But if the data processing is too slow, all data could already be received and pending in the input buffer. So this is a bug to stop processing in this context. The server doesn't really closed the input before the end. As an example, this could happen when HAProxy is configured to do compression offloading. If the server closes the connection explicitly after the response (keep-alive disabled by the server) and if HAProxy receives the data faster than they are compressed, then the response could be truncated. This patch fixes the bug by checking if some pending data remain in the input buffer before returning an error. If yes, the processing continues.	2015-12-28 16:49:36 +01:00
Willy Tarreau	f66258237c	BUG/MINOR: http: fix several off-by-one errors in the url_param parser Several cases of "<=" instead of "<" were found in the url_param parser, mostly affecting the case where the parameter is wrapping. They shouldn't affect header operations, just body parsing in a wrapped pipelined request. The code is a bit complicated with certain operations done multiple times in multiple functions, so it's not sure others are not left. This code must be re-audited. It should only be backported to 1.6 once carefully tested, because it is possible that other bugs relied on these ones.	2015-12-27 14:51:01 +01:00
Willy Tarreau	858b103631	BUG/MEDIUM: http: fix http-reuse when frontend and backend differ Krishna Kumar reported that the following configuration doesn't permit HTTP reuse between two clients : frontend private-frontend mode http bind :8001 default_backend private-backend backend private-backend mode http http-reuse always server bck 127.0.0.1:8888 The reason for this is that in http_end_txn_clean_session() we check the stream's backend backend's http-reuse option before deciding whether the backend connection should be moved back to the server's pool or not. But since we're doing this after the call to http_reset_txn(), the backend is reset to match the frontend, which doesn't have the option. However it will work fine in a setup involving a "listen" section. We just need to keep a pointer to the current backend before calling http_reset_txn(). The code does that and replaces the few remaining references to s->be inside the same function so that if any part of code were to be moved later, this trap doesn't happen again. This fix must be backported to 1.6.	2015-12-07 17:04:59 +01:00
Cyril Bont�	ce1ef4df01	BUG/MEDIUM: sample: urlp can't match an empty value Currently urlp fetching samples were able to find parameters with an empty value, but the return code depended on the value length. The final result was that acls using urlp couldn't match empty values. Example of acl which always returned "false": acl MATCH_EMPTY urlp(foo) -m len 0 The fix consists in unconditionally return 1 when the parameter is found. This fix must be backported to 1.6 and 1.5.	2015-11-26 23:51:42 +01:00
Willy Tarreau	714ea78c9a	BUG/MEDIUM: http: don't enable auto-close on the response side There is a bug where "option http-keep-alive" doesn't force a response to stay in keep-alive if the server sends the FIN along with the response on the second or subsequent response. The reason is that the auto-close was forced enabled when recycling the HTTP transaction and it's never disabled along the response processing chain before the SHUTR gets a chance to be forwarded to the client side. The MSG_DONE state of the HTTP response properly disables it but too late. There's no more reason for enabling auto-close here, because either it doesn't matter in non-keep-alive modes because the connection is closed, or it is automatically enabled by process_stream() when it sees there's no analyser on the stream. This bug also affects 1.5 so a backport is desired.	2015-11-26 10:25:11 +01:00
Willy Tarreau	7f876a1eeb	BUG/MEDIUM: http: switch the request channel to no-delay once done. There's an issue when sending POST data that came in a second packet, the CF_NEVER_WAIT flag is not always set on the request channel, while the server is waiting for the request. We must always set this flag in this case since we're not going to shut down after sending, contrary to the response side. Note that option http-no-delay works around this issue. Reproducer : listen px mode http timeout client 10s timeout server 5s timeout connect 3s option http-server-close #option http-no-delay bind :8001 server s1 127.0.0.1:8003 $ (printf "POST / HTTP/1.1\r\nTransfer-encoding: chunked\r\n\r\n"; sleep 0.01; printf "10\r\nAZERTYUIOPQSDFGH\r\n0\r\n\r\n") \| nc6 0 8001 Before this fix : 12:03:31.946763 epoll_wait(3, {{EPOLLIN, {u32=5, u64=5}}}, 200, 1000) = 1 12:03:32.634175 accept4(5, {sa_family=AF_INET, sin_port=htons(53849), sin_addr=inet_addr("127.0.0.1")}, [16], SOCK_NONBLOCK) = 6 12:03:32.634318 setsockopt(6, SOL_TCP, TCP_NODELAY, [1], 4) = 0 12:03:32.634434 accept4(5, 0x7ffccfbb2cf0, [128], SOCK_NONBLOCK) = -1 EAGAIN (Resource temporarily unavailable) 12:03:32.634574 recvfrom(6, "POST / HTTP/1.1\r\nTransfer-encodi"..., 8192, 0, NULL, NULL) = 47 12:03:32.634809 setsockopt(6, SOL_TCP, TCP_QUICKACK, [1], 4) = 0 12:03:32.634952 socket(PF_INET, SOCK_STREAM, IPPROTO_TCP) = 7 12:03:32.635031 fcntl(7, F_SETFL, O_RDONLY\|O_NONBLOCK) = 0 12:03:32.635089 setsockopt(7, SOL_TCP, TCP_NODELAY, [1], 4) = 0 12:03:32.635153 connect(7, {sa_family=AF_INET, sin_port=htons(8003), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 12:03:32.635315 epoll_wait(3, {}, 200, 0) = 0 12:03:32.635394 sendto(7, "POST / HTTP/1.1\r\nTransfer-encodi"..., 66, MSG_DONTWAIT\|MSG_NOSIGNAL, NULL, 0) = 66 12:03:32.635527 recvfrom(6, 0x7f0224e66024, 8192, 0, 0, 0) = -1 EAGAIN (Resource temporarily unavailable) 12:03:32.635651 epoll_ctl(3, EPOLL_CTL_ADD, 6, {EPOLLIN\|0x2000, {u32=6, u64=6}}) = 0 12:03:32.635782 epoll_wait(3, {}, 200, 0) = 0 12:03:32.635842 recvfrom(7, 0x7f0224e66024, 8192, 0, 0, 0) = -1 EAGAIN (Resource temporarily unavailable) 12:03:32.635924 epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLIN\|0x2000, {u32=7, u64=7}}) = 0 12:03:32.636027 epoll_wait(3, {{EPOLLIN, {u32=6, u64=6}}}, 200, 1000) = 1 12:03:32.644892 recvfrom(6, "10\r\nAZERTYUIOPQSDFGH\r\n0\r\n\r\n", 8192, 0, NULL, NULL) = 27 12:03:32.645016 epoll_wait(3, {}, 200, 0) = 0 12:03:32.645105 sendto(7, "10\r\nAZERTYUIOPQSDFGH\r\n0\r\n\r\n", 27, MSG_DONTWAIT\|MSG_NOSIGNAL\|MSG_MORE, NULL, 0) = 27 After the fix : 11:59:12.538617 connect(7, {sa_family=AF_INET, sin_port=htons(8003), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 11:59:12.538787 epoll_wait(3, {}, 200, 0) = 0 11:59:12.538867 sendto(7, "POST / HTTP/1.1\r\nTransfer-encodi"..., 66, MSG_DONTWAIT\|MSG_NOSIGNAL, NULL, 0) = 66 11:59:12.539031 recvfrom(6, 0x7f832ce45024, 8192, 0, 0, 0) = -1 EAGAIN (Resource temporarily unavailable) 11:59:12.539161 epoll_ctl(3, EPOLL_CTL_ADD, 6, {EPOLLIN\|0x2000, {u32=6, u64=6}}) = 0 11:59:12.539259 epoll_wait(3, {}, 200, 0) = 0 11:59:12.539337 recvfrom(7, 0x7f832ce45024, 8192, 0, 0, 0) = -1 EAGAIN (Resource temporarily unavailable) 11:59:12.539421 epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLIN\|0x2000, {u32=7, u64=7}}) = 0 11:59:12.539499 epoll_wait(3, {{EPOLLIN, {u32=6, u64=6}}}, 200, 1000) = 1 11:59:12.548519 recvfrom(6, "10\r\nAZERTYUIOPQSDFGH\r\n0\r\n\r\n", 8192, 0, NULL, NULL) = 27 11:59:12.548844 epoll_wait(3, {}, 200, 0) = 0 11:59:12.549012 sendto(7, "10\r\nAZERTYUIOPQSDFGH\r\n0\r\n\r\n", 27, MSG_DONTWAIT\|MSG_NOSIGNAL, NULL, 0) = 27 11:59:12.549454 epoll_wait(3, {}, 200, 1000) = 0 This fix must be backported to 1.6, 1.5 and 1.4.	2015-11-18 12:50:38 +01:00
lsenta	1e1f41d0f3	BUG: http: do not abort keep-alive connections on server timeout When a server timeout is detected on the second or nth request of a keep-alive connection, HAProxy closes the connection without writing a response. Some clients would fail with a remote disconnected exception and some others would retry potentially unsafe requests. This patch removes the special case and makes sure a 504 timeout is written back whenever a server timeout is handled. Signed-off-by: lsenta <laurent.senta@gmail.com>	2015-11-13 14:41:51 +01:00
Willy Tarreau	1c59bd5abc	BUG/MAJOR: http: don't requeue an idle connection that is already queued Cyril Bont� reported a reproduceable sequence which can lead to a crash when using backend connection reuse. The problem comes from the fact that we systematically add the server connection to an idle pool at the end of the HTTP transaction regardless of the fact that it might already be there. This is possible for example when processing a request which doesn't use a server connection (typically a redirect) after a request which used a connection. Then after the first request, the connection was already in the idle queue and we're putting it a second time at the end of the second request, causing a corruption of the idle pool. Interestingly, the memory debugger in 1.7 immediately detected a suspicious double free on the connection, leading to a very early detection of the cause instead of its consequences. Thanks to Cyril for quickly providing a working reproducer. This fix must be backported to 1.6 since connection reuse was introduced there.	2015-11-02 22:28:25 +01:00
Christopher Faulet	d57ad64873	BUG/MINOR: http: Add OPTIONS in supported http methods (found by find_http_meth) The 'OPTIONS' method was not in the list of supported HTTP methods and find_http_meth return HTTP_METH_OTHER instead of HTTP_METH_OPTIONS. [wt: this fix needs to be backported at least to 1.5, 1.4 and 1.3]	2015-10-09 10:18:09 +02:00
Thierry FOURNIER	ab95e656ea	MINOR: http/tcp: fill the avalaible actions This patch adds a function that generates the list of avalaible actions for the error message.	2015-10-02 22:56:11 +02:00
David Carlier	4686f792b4	MINOR: proto_http: Externalisation of previously internal functions Needs to expose the HTTP headers 'iterator' and the client's cookie value extraction functions.	2015-09-28 14:01:27 +02:00
Willy Tarreau	acc980036f	MEDIUM: action: add a new flag ACT_FLAG_FIRST This flag is used by custom actions to know that they're called for the first time. The only case where it's not set is when they're resuming from a yield. It will be needed to let them know when they have to allocate some resources.	2015-09-27 23:34:39 +02:00
Willy Tarreau	394586836f	MEDIUM: http: pass ACT_FLAG_FINAL to custom actions In HTTP it's more difficult to know when to pass the flag or not because all actions are supposed to be final and there's no inspection delay. Also, the input channel may very well be closed without this being an error. So we only set the flag when option abortonclose is set and the input channel is closed, which is the only case where the user explicitly wants to forward a close down the chain.	2015-09-27 11:04:06 +02:00
Willy Tarreau	658b85b68d	MEDIUM: actions: pass a new "flags" argument to custom actions Since commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions"), some actions may yield and be called back when new information are available. Unfortunately some of them may continue to yield because they simply don't know that it's the last call from the rule set. For this reason we'll need to pass a flag to the custom action to pass such information and possibly other at the same time.	2015-09-27 11:04:06 +02:00
Thierry FOURNIER	fd50f0bcc8	MINOR: http: split initialization The goal is to export the http txn initialisation functions for using it in the Lua code.	2015-09-25 23:39:48 +02:00
Thierry FOURNIER	3c3317849f	MINOR: http: export http_get_path() function This patch simply exports the http_get_path() function from the proto_http.c file.	2015-09-25 23:39:27 +02:00
Thierry FOURNIER	ed08d6a9be	MEDIUM: proto_http: smp_prefetch_http initialize txn When we call the function smp_prefetch_http(), if the txn is not initialized, it doesn't work. This patch fix this. Now, smp_prefecth_http() permits to use http with any proxy mode.	2015-09-25 23:27:23 +02:00
Thierry FOURNIER	85c6c97830	MINOR: action: add reference to the original keywork matched for the called parser. This is usefull because the keyword can contains some condifiguration data set while the keyword registration.	2015-09-23 21:44:23 +02:00
Willy Tarreau	c29d0cda4b	BUG/MEDIUM: http: do not dereference strm_li(stream) Some streams do not have a listener (eg: Lua's cosockets) so let's check for this. For now this problem cannot happen but it's definitely unsafe.	2015-09-23 13:42:08 +02:00
James Rosewell	91a41cb32d	MINOR: http: made CHECK_HTTP_MESSAGE_FIRST accessible to other functions Added the definition of CHECK_HTTP_MESSAGE_FIRST and the declaration of smp_prefetch_http to the header. Changed smp_prefetch_http implementation to remove the static qualifier.	2015-09-21 12:05:26 +02:00
Willy Tarreau	b7ce424be2	BUG/MINOR: http: remove stupid HTTP_METH_NONE entry When converting the "method" fetch to a string, we used to get an empty string if the first character was not an upper case. This was caused by the lookup function which returns HTTP_METH_NONE when a lookup is not possible, and this method being mapped to an empty string in the array. This is a totally stupid mechanism, there's no reason for having the result depend on the first char. In fact the message parser already checks that the syntax matches an HTTP token so we can only land there with a valid token, hence only HTTP_METH_OTHER should be returned. This fix should be backported to all actively supported branches.	2015-09-03 17:15:21 +02:00
Thierry FOURNIER	42148735bc	MEDIUM: actions: remove ACTION_STOP Before this patch, two type of custom actions exists: ACT_ACTION_CONT and ACT_ACTION_STOP. ACT_ACTION_CONT is a non terminal action and ACT_ACTION_STOP is a terminal action. Note that ACT_ACTION_STOP is not used in HAProxy. This patch remove this behavior. Only type type of custom action exists, and it is called ACT_CUSTOM. Now, the custion action can return a code indicating the required behavior. ACT_RET_CONT wants that HAProxy continue the current rule list evaluation, and ACT_RET_STOP wants that HAPRoxy stops the the current rule list evaluation.	2015-09-02 18:36:38 +02:00
Willy Tarreau	bd99d5818d	BUG/MAJOR: http: don't manipulate the server connection if it's killed Jesse Hathaway reported a crash that Cyril Bont� diagnosed as being caused by the manipulation of srv_conn after setting it to NULL. This happens in http-server-close mode when the server returns either a 401 or a 407, because the connection was previously closed then it's being assigned the CO_FL_PRIVATE flag. This bug only affects 1.6-dev as it was introduced by connection reuse code with commit `387ebf8` ("MINOR: connection: add a new flag CO_FL_PRIVATE").	2015-09-02 10:52:05 +02:00
Thierry FOURNIER	35d70efc33	MINOR: http: Action for manipulating the returned status code. This patch is inspired by Bowen Ni's proposal and it is based on his first implementation: With Lua integration in HAProxy 1.6, one can change the request method, path, uri, header, response header etc except response line. I'd like to contribute the following methods to allow modification of the response line. [...] There are two new keywords in 'http-response' that allows you to rewrite them in the native HAProxy config. There are also two new APIs in Lua that allows you to do the same rewriting in your Lua script. Example: Use it in HAProxy config: http-response set-code 404 Or use it in Lua script: txn.http:res_set_reason("Redirect") I dont take the full patch because the manipulation of the "reason" is useless. standard reason are associated with each returned code, and unknown code can take generic reason. So, this patch can set the status code, and the reason is automatically adapted.	2015-08-27 14:29:44 +02:00
Thierry FOURNIER	3f4bc65a22	DOC: fix "http_action_set_req_line()" comments Bowen repports errors about http_action_set_req_line() comments. Some other errors appears from the patches about "actions" reorganisation.	2015-08-27 11:31:19 +02:00
Thierry FOURNIER	afa80496db	MEDIUM: actions: Normalize the return code of the configuration parsers This patch normalize the return code of the configuration parsers. Before these changes, the tcp action parser returned -1 if fail and 0 for the succes. The http action returned 0 if fail and 1 if succes. The normalisation does: - ACT_RET_PRS_OK for succes - ACT_RET_PRS_ERR for failure	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	322a124867	MINOR: actions: mutualise the action keyword lookup Each (http\|tcp)-(request\|response) action use the same method for looking up the action keyword during the cofiguration parsing. This patch mutualize the code.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	36481b8667	MEDIUM: actions: Merge (http\|tcp)-(request\|reponse) keywords structs This patch merges the conguration keyword struct. Each declared configuration keyword struct are similar with the others. This patch simplify the code.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	24ff6c6fce	MEDIUM: actions: Add standard return code for the action API Action function can return 3 status: - error if the action encounter fatal error (like out of memory) - yield if the action must terminate his work later - continue in other cases	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	0ea5c7fafa	MINOR: actions: change actions names For performances considerations, some actions are not processed by remote function. They are directly processed by the function. Some of these actions does the same things but for different processing part (request / response). This patch give the same name for the same actions, and change the normalization of the other actions names. This patch is ONLY a rename, it doesn't modify the code.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	91f6ba0f2c	MINOR: actions: Declare all the embedded actions in the same header file This patch group the action name in one file. Some action are called many times and need an action embedded in the action caller. The main goal is to have only one header file grouping all definitions.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	22e49011b1	MINOR: actions: remove the mark indicating the last entry in enum This mark permit to detect if the action tag is over the allowed range. - Normally, this case doesn't appear - If it appears, it is processed by ded fault case of the switch	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	5563e4b469	MINOR: actions: add "from" information This struct member is used to specify who is the rule caller. It permits to use one function for differents callers.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	5ec63e008d	MEDIUM: track-sc: Move the track-sc configuration storage in the union This patch moves the track-sc configuration struct (track_ctr_prm) in the main "arg" union. This reduce the size od the struct.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	e209797ef0	MINOR: proto_http: replace generic opaque types by real used types in "http_capture" by id This patch removes the generic opaque type for storing the configuration of the action "http_capture" by id.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	32b15003fe	MINOR: proto_http: replace generic opaque types by real used types in "http_capture" This patch removes the generic opaque type for storing the configuration of the action "http_capture"".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8855a92d8c	MINOR: proto_http: replace generic opaque types by real used types for the actions on thr request line This patch removes the generic opaque type for storing the configuration of the action "set-method", "set-path", "set-query" and "set-uri".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	a002dc9df8	MINOR: proto_http: use an "expr" type in place of generic opaque type. This patch removes the generic opaque type for storing the configuration of the acion "set-src" (HTTP_REQ_ACT_SET_SRC), and use the dedicated type "struct expr"	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	a28a9429b2	MEDIUM: actions: Merge (http\|tcp)-(request\|reponse) action structs This patch is the first of a serie which merge all the action structs. The function "tcp-request content", "tcp-response-content", "http-request" and "http-response" have the same values and the same process for some defined actions, but the struct and the prototype of the declared function are different. This patch try to unify all of these entries.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	136f9d34a9	MINOR: samples: rename union from "data" to "u" The union name "data" is a little bit heavy while we read the source code because we can read "data.data.sint". The rename from "data" to "u" makes the read easiest like "data.u.sint".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8c542cac07	MEDIUM: samples: Use the "struct sample_data" in the "struct sample" This patch remove the struct information stored both in the struct sample_data and in the striuct sample. Now, only thestruct sample_data contains data, and the struct sample use the struct sample_data for storing his own data.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	a6b6343cff	CLEANUP: http/tcp actions: remove the scope member The scope member is not used. This patch removes this entry.	2015-08-11 13:44:53 +02:00
Thierry FOURNIER	9b49f589ed	CLEANUP: proto_http: remove useless initialisation This initialisation of the opaque array is useless.	2015-08-11 13:44:51 +02:00
Willy Tarreau	53a09d520e	MAJOR: http: remove references to appsession appsessions started to be deprecated with the introduction of stick tables, and the latter are much more powerful and flexible, and in addition they are replicated between nodes and maintained across reloads. Let's now remove appsession completely.	2015-08-10 19:16:18 +02:00
Willy Tarreau	449d74a906	MEDIUM: backend: add the "http-reuse aggressive" strategy This strategy is less extreme than "always", it only dispatches first requests to validated reused connections, and moves a connection from the idle list to the safe list once it has seen a second request, thus proving that it could be reused.	2015-08-06 16:29:01 +02:00
Willy Tarreau	8dff998b91	MAJOR: backend: initial work towards connection reuse In connect_server(), if we don't have a connection attached to the stream-int, we first look into the server's idle_conns list and we pick the first one there, we detach it from its owner if it had one. If we used to have a connection, we close it. This mechanism works well but doesn't scale : as servers increase, the likeliness that the connection attached to the stream interface doesn't match the server and gets closed increases.	2015-08-06 11:34:21 +02:00
Willy Tarreau	387ebf84dd	MINOR: connection: add a new flag CO_FL_PRIVATE This flag is set on an outgoing connection when this connection gets some properties that must not be shared with other connections, such as dynamic transparent source binding, SNI or a proxy protocol header, or an authentication challenge from the server. This will be needed later to implement connection reuse.	2015-08-06 11:14:17 +02:00
Willy Tarreau	4320eaac62	MINOR: stream-int: make si_idle_conn() only accept valid connections This function is now dedicated to idle connections only, which means that it must not be used without any endpoint nor anything not a connection. The connection remains attached to the stream interface.	2015-08-06 11:11:10 +02:00
Willy Tarreau	323a2d925c	MEDIUM: stream-int: queue idle connections at the server Now we get a per-server list of all idle connections. That way we'll be able to reclaim them upon shortage later.	2015-08-06 11:06:25 +02:00
Willy Tarreau	973a54235f	MEDIUM: stream-int: simplify si_alloc_conn() Since we now always call this function with the reuse parameter cleared, let's simplify the function's logic as it cannot return the existing connection anymore. The savings on this inline function are appreciable (240 bytes) : $ size haproxy.old haproxy.new text data bss dec hex filename 1020383 40816 36928 1098127 10c18f haproxy.old 1020143 40816 36928 1097887 10c09f haproxy.new	2015-08-05 21:51:09 +02:00
Thierry FOURNIER	bf65cd4d77	MAJOR: arg: converts uint and sint in sint This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	07ee64ef4d	MAJOR: sample: converts uint and sint in 64 bits signed integer This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed. This makes easy the usage of integer and clarify signed and unsigned use. With the previous version, signed and unsigned are used ones in place of others, and sometimes the converter loose the sign. For example, divisions are processed with "unsigned", if one entry is negative, the result is wrong. Note that the integer pattern matching and dotted version pattern matching are already working with signed 64 bits integer values. There is one user-visible change : the "uint()" and "sint()" sample fetch functions which used to return a constant integer have been replaced with a new more natural, unified "int()" function. These functions were only introduced in the latest 1.6-dev2 so there's no impact on regular deployments.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	fac9ccfb70	BUG/MINOR: http/sample: gmtime/localtime can fail The man said that gmtime() and localtime() can return a NULL value. This is not tested. It appears that all the values of a 32 bit integer are valid, but it is better to check the return of these functions. However, if the integer move from 32 bits to 64 bits, some 64 values can be unsupported.	2015-07-20 12:21:35 +02:00
Adis Nezirovic	2fbcafc9ce	MEDIUM: http: Add new 'set-src' option to http-request This option enables overriding source IP address in a HTTP request. It is useful when we want to set custom source IP (e.g. front proxy rewrites address, but provides the correct one in headers) or we wan't to mask source IP address for privacy or compliance. It acts on any expression which produces correct IP address.	2015-07-06 16:17:28 +02:00
Adis Nezirovic	79beb248b9	CLEANUP: sample: generalize sample_fetch_string() as sample_fetch_as_type() This modification makes possible to use sample_fetch_string() in more places, where we might need to fetch sample values which are not plain strings. This way we don't need to fetch string, and convert it into another type afterwards. When using aliased types, the caller should explicitly check which exact type was returned (e.g. SMP_T_IPV4 or SMP_T_IPV6 for SMP_T_ADDR). All usages of sample_fetch_string() are converted to use new function.	2015-07-06 16:17:25 +02:00
Thierry FOURNIER	4834bc773c	MEDIUM: vars: adds support of variables This patch adds support of variables during the processing of each stream. The variables scope can be set as 'session', 'transaction', 'request' or 'response'. The variable type is the type returned by the assignment expression. The type can change while the processing. The allocated memory can be controlled for each scope and each request, and for the global process.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	0e11863a6f	MINOR: tcp/http/conf: extends the keyword registration options This patch permits to register a new keyword with the keyword "tcp-request content" 'tcp-request connection", tcp-response content", http-request" and "http-response" which is identified only by matching the start of the keyword. for example, we register the keyword "set-var" with the option "match_pfx" and the configuration keyword "set-var(var_name)" matchs this entry.	2015-06-13 23:01:37 +02:00
Willy Tarreau	b8cdf52da0	BUG/MEDIUM: http: fix body processing for the stats applet Commit `9fbe18e` ("MEDIUM: http: add a new option http-buffer-request") introduced a regression due to a misplaced check causing the admin mode of the HTTP stats not to work anymore. This patch tried to ensure that when we need a request body for the stats applet, and we have already waited for this body, we don't wait for it again, but the condition was applied too early causing a disabling of the entire processing the body, and based on the wrong HTTP state (MSG_BODY) resulting in the test never matching. Thanks to Chad Lavoie for reporting the problem. This bug is 1.6-only, no backport is needed.	2015-05-29 01:12:38 +02:00
Willy Tarreau	2de8a50918	MEDIUM: http: no need to close the request on redirect if data was parsed There are two reasons for not keeping the client connection alive upon a redirect : - save the client from uploading all data - avoid keeping a connection alive if the redirect goes to another domain The first case should consider an exception when all the data from the client have been read already. This specifically happens on response redirects after a POST to a server. This is an easy situation to detect. It could later be improved to cover the cases where option http-buffer-request is used.	2015-05-28 17:45:43 +02:00
Willy Tarreau	51d861a44f	MEDIUM: http: implement http-response redirect rules Sometimes it's problematic not to have "http-response redirect" rules, for example to perform a browser-based redirect based on certain server conditions (eg: match of a header). This patch adds "http-response redirect location <fmt>" which gives enough flexibility for most imaginable operations. The connection to the server is closed when this is performed so that we don't risk to forward any pending data from the server. Any pending response data are trimmed so that we don't risk to forward anything pending to the client. It's harmless to also do that for requests so we don't need to consider the direction.	2015-05-28 17:45:43 +02:00
Willy Tarreau	be4653b6d4	MINOR: http: prepare support for parsing redirect actions on responses In order to support http-response redirect, the parsing needs to be adapted a little bit to only support the "location" type, and to adjust the log-format parser so that it knows the direction of the sample fetch calls.	2015-05-28 17:43:11 +02:00
Willy Tarreau	b329a312e3	CLEANUP: http: explicitly reference request in http_apply_redirect_rules() This function was made to perform a redirect on requests only, it was using a message or txn->req in an inconsistent way and did not consider the possibility that it could be used for the other direction. Let's clean it up to have both a request and a response messages.	2015-05-28 17:42:16 +02:00
Thierry FOURNIER	e80fadaaca	MEDIUM: capture: adds http-response capture This patch adds a http response capture keyword with the same behavior as the previous patch called "MEDIUM: capture: Allow capture with slot identifier".	2015-05-28 13:51:00 +02:00
Thierry FOURNIER	82bf70dff4	MEDIUM: capture: Allow capture with slot identifier This patch modifies the current http-request capture function and adds a new keyword "id" that permits to identify a capture slot. If the identified doesn't exists, the action fails silently. Note that this patch removs an unused list initilisation, which seems to be inherited from a copy/paste. It's harmless and does not need to be backported. LIST_INIT((struct list *)&rule->arg.act.p[0]);	2015-05-28 13:50:29 +02:00
Thierry FOURNIER	35ab27561e	MINOR: capture: add two "capture" converters This patch adds "capture-req" and "capture-res". These two converters capture their entry in the allocated slot given in argument and pass the input on the output.	2015-05-28 13:50:29 +02:00
Willy Tarreau	98d0485a90	MAJOR: config: remove the deprecated reqsetbe / reqisetbe actions These ones were already obsoleted in 1.4, marked for removal in 1.5, and not documented anymore. They used to emit warnings, and do still require quite some code to stay in place. Let's remove them now.	2015-05-26 12:18:29 +02:00
Dragan Dosen	26f77e534c	BUG/MEDIUM: http: fix the url_param fetch The "name" and "name_len" arguments in function "smp_fetch_url_param" could be left uninitialized for subsequent calls. [wt: no backport needed, this is an 1.6 regression introduced by commit `4fdc74c` ("MINOR: http: split the url_param in two parts") ]	2015-05-25 19:01:39 +02:00
Thierry FOURNIER	8be451c52a	MEDIUM: http: url-encoded parsing function can run throught wrapped buffer The functions smp_fetch_param(), find_next_url_param() and find_url_param_pos() can look for argument in 2 chunks and not only one.	2015-05-20 16:05:38 +02:00
Thierry FOURNIER	e28c49975a	MINOR: http: add body_param fetch This fetch returns one body param or the list of each body param. This first version runs only with one chunk.	2015-05-20 15:56:23 +02:00
Thierry FOURNIER	0948d41a12	CLEANUP: http: bad indentation Some function argument uses space in place of tabulation for the indentation.	2015-05-20 15:56:23 +02:00
Thierry FOURNIER	4fdc74c22c	MINOR: http: split the url_param in two parts This patch is the part of the body_param fetch. The goal is to have generic url-encoded parser which can used for parsing the query string and the body.	2015-05-20 15:56:23 +02:00
Willy Tarreau	1ede1daab6	MEDIUM: http: make url_param iterate over multiple occurrences There are some situations hwere it's desirable to scan multiple occurrences of a same parameter name in the query string. This change ensures this can work, even with an empty name which will then iterate over all parameters.	2015-05-19 13:16:07 +02:00
Thierry FOURNIER	0786d05a04	MEDIUM: sample: change the prototype of sample-fetches functions This patch removes the "opt" entry from the prototype of the sample-fetches fucntions. This permits to remove some weight in the prototype call.	2015-05-11 20:03:08 +02:00
Thierry FOURNIER	0a9a2b8cec	MEDIUM: sample change the prototype of sample-fetches and converters functions This patch removes the structs "session", "stream" and "proxy" from the sample-fetches and converters function prototypes. This permits to remove some weight in the prototype call.	2015-05-11 20:01:42 +02:00
Willy Tarreau	bbfb6c4085	BUG/MEDIUM: http: don't forward client shutdown without NOLINGER except for tunnels There's an issue related with shutting down POST transfers or closing the connection after the end of the upload : the shutdown is forwarded to the server regardless of the abortonclose option. The problem it causes is that during a scan, brute force or whatever, it becomes possible that all source ports are exhausted with all sockets in TIME_WAIT state. There are multiple issues at once in fact : - no action is done for the close, it automatically happens at the lower layers thanks for channel_auto_close(), so we cannot act on NOLINGER ; - we do want to continue to send a clean shutdown in tunnel mode because some protocols transported over HTTP may need this, regardless of option abortonclose, thus we can't set the option inconditionally - for all other modes, we do want to close the dirty way because we're certain whether we've sent everything or not, and we don't want to eat all source ports. The solution is a bit complex and applies to DONE/TUNNEL states : 1) disable automatic close for everything not a tunnel and not just keep-alive / server-close. Force-close is now covered, as is HTTP/1.0 which implicitly works in force-close mode ; 2) when processing option abortonclose, we know we can disable lingering if the client has closed and the connection is not in tunnel mode. Since the last case above leads to a situation where the client side reports an error, we know the connection will not be reused, so leaving the flag on the stream-interface is safe. A client closing in the middle of the data transmission already aborts the transaction so this case is not a problem. This fix must be backported to 1.5 where the problem was detected.	2015-05-11 19:05:42 +02:00
Thierry FOURNIER	82ff3c9b05	MINOR: sample: add url_dec converter This converter decodes an url-encoded string. It takes a string as input and returns string as output.	2015-05-11 11:40:36 +02:00
Willy Tarreau	3986ac1860	BUG/MEDIUM: http: fix the http-request capture parser Due to the code being mostly inspired from the tcp-request parser, it does some crap because both don't work the same way. The "len" argument could be mismatched and then the length could be used uninitialized.	2015-05-08 16:13:42 +02:00
Willy Tarreau	a9083d0722	MEDIUM: http: add new "capture" action for http-request This is only possible in frontends of course, but it will finally make it possible to capture arbitrary http parts, including URL parameters or parts of the message body. It's worth noting that an ugly (char **) cast had to be done to call sample_fetch_string() which is caused by a 5- or 6- levels of inheritance of this type in the API. Here it's harmless since the function uses it as a const, but this API madness must be fixed, starting with the one or two rare functions that modify the args and inflict this on each and every keyword parser. (cherry picked from commit 484a4f38460593919a1c1d9a047a043198d69f45)	2015-05-08 15:43:54 +02:00
Willy Tarreau	a5910cc6ef	MEDIUM: http: provide 3 fetches for the body Body processing is still fairly limited, but this is a start. It becomes possible to apply regex to find contents in order to decide where to route a request for example. Only the first chunk is parsed for now, and the response is not yet available (the parsing function must be duplicated for this). req.body : binary This returns the HTTP request's available body as a block of data. It requires that the request body has been buffered made available using "option http-buffer-request". In case of chunked-encoded body, currently only the first chunk is analyzed. req.body_len : integer This returns the length of the HTTP request's available body in bytes. It may be lower than the advertised length if the body is larger than the buffer. It requires that the request body has been buffered made available using "option http-buffer-request". req.body_size : integer This returns the advertised length of the HTTP request's body in bytes. It will represent the advertised Content-Length header, or the size of the first chunk in case of chunked encoding. In order to parse the chunks, it requires that the request body has been buffered made available using "option http-buffer-request".	2015-05-02 00:46:08 +02:00
Willy Tarreau	9fbe18e174	MEDIUM: http: add a new option http-buffer-request It is sometimes desirable to wait for the body of an HTTP request before taking a decision. This is what is being done by "balance url_param" for example. The first use case is to buffer requests from slow clients before connecting to the server. Another use case consists in taking the routing decision based on the request body's contents. This option placed in a frontend or backend forces the HTTP processing to wait until either the whole body is received, or the request buffer is full, or the first chunk is complete in case of chunked encoding. It can have undesired side effects with some applications abusing HTTP by expecting unbufferred transmissions between the frontend and the backend, so this should definitely not be used by default. Note that it would not work for the response because we don't reset the message state before starting to forward. For the response we need to 1) reset the message state to MSG_100_SENT or BODY , and 2) to reset body_len in case of chunked encoding to avoid counting it twice.	2015-05-02 00:10:44 +02:00
Willy Tarreau	e115b49c39	BUG/MEDIUM: http: wait for the exact amount of body bytes in wait_for_request_body Due to the fact that we were still considering only msg->sov for the first byte of data after calling http_parse_chunk_size(), we used to miscompute the input data size and to count the CRLF and the chunk size as part of the input data. The effect is that it was possible to release the processing with 3 or 4 missing bytes, especially if they're typed by hand during debugging sessions. This can cause the stats page to return some errors in admin mode, and the url_param balance algorithm to fail to properly hash a body input. This fix must be backported to 1.5.	2015-05-01 23:24:32 +02:00
Willy Tarreau	0f228a037a	MEDIUM: http: add option-ignore-probes to get rid of the floods of 408 Recently some browsers started to implement a "pre-connect" feature consisting in speculatively connecting to some recently visited web sites just in case the user would like to visit them. This results in many connections being established to web sites, which end up in 408 Request Timeout if the timeout strikes first, or 400 Bad Request when the browser decides to close them first. These ones pollute the log and feed the error counters. There was already "option dontlognull" but it's insufficient in this case. Instead, this option does the following things : - prevent any 400/408 message from being sent to the client if nothing was received over a connection before it was closed ; - prevent any log from being emitted in this situation ; - prevent any error counter from being incremented That way the empty connection is silently ignored. Note that it is better not to use this unless it is clear that it is needed, because it will hide real problems. The most common reason for not receiving a request and seeing a 408 is due to an MTU inconsistency between the client and an intermediary element such as a VPN, which blocks too large packets. These issues are generally seen with POST requests as well as GET with large cookies. The logs are often the only way to detect them. This patch should be backported to 1.5 since it avoids false alerts and makes it easier to monitor haproxy's status.	2015-05-01 15:39:23 +02:00
Willy Tarreau	13317669d5	MEDIUM: http: disable support for HTTP/0.9 by default There's not much reason for continuing to accept HTTP/0.9 requests nowadays except for manual testing. Now we disable support for these by default, unless option accept-invalid-http-request is specified, in which case they continue to be upgraded to 1.0.	2015-05-01 14:57:54 +02:00
Willy Tarreau	91852eb428	MEDIUM: http: restrict the HTTP version token to 1 digit as per RFC7230 While RFC2616 used to allow an undeterminate amount of digits for the major and minor components of the HTTP version, RFC7230 has reduced that to a single digit for each. If a server can't properly parse the version string and falls back to 0.9, it could then send a head-less response whose payload would be taken for headers, which could confuse downstream agents. Since there's no more reason for supporting a version scheme that was never used, let's upgrade to the updated version of the standard. It is still possible to enforce support for the old behaviour using options accept-invalid-http-request and accept-invalid-http-response. It would be wise to backport this to 1.5 as well just in case.	2015-05-01 14:57:01 +02:00

1 2 3 4 5 ...

1164 Commits