haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-12 10:06:58 +02:00

Author	SHA1	Message	Date
Willy Tarreau	b26881a5d5	BUG/MEDIUM: checks: properly set servers to stopping state on 404 Paul Lockaby reported that since 1.8, disable-on-404 doesn't work anymore in that the server stay up despite returning 404. Cyril spotted that this was caused by a copy-paste error introduced by commit `5a13351` ("BUG/MEDIUM: log: check result details truncated.") causing set_server_running() to be called instead of set_server_stopping() in this case. It can be reproduced with the simple test config below : defaults mode http timeout connect 1s timeout client 10s timeout server 10s listen http bind :8888 option httpchk GET / http-check disable-on-404 server s1 127.0.0.1:9001 check server s2 127.0.0.1:9002 check http-response add-header x-served-by %s listen s1 bind :9001 server next 127.0.0.1:9002 http-response set-status 404 frontend s2 bind :9002 http-request redirect location / S1 is supposed to be stopping and s2 up, which is not the case. After calling the correct function, only S2 is used now. This needs to be backported to 1.8.	2017-12-23 11:16:49 +01:00
Willy Tarreau	a48c141f44	BUG/MAJOR: connection: refine the situations where we don't send shutw() Since commit `f9ce57e` ("MEDIUM: connection: make conn_sock_shutw() aware of lingering"), we refrain from performing the shutw() on the socket if there is no lingering risk. But there is a problem with this in tunnel and in TCP modes where a client is explicitly allowed to send a shutw to the server, eventhough it it risky. Not doing it creates this situation reported by Ricardo Fraile and diagnosed by Christopher : a typical HTTP client (eg: curl) connecting via the config below to an HTTP server would receive its response, immediately close while the server remains in keep-alive mode. The shutr() received by haproxy from the client is "propagated" to the server side but not acted upon because fdtab[fd].linger_risk is set, so we expect that the next close will immediately complete this operation. listen proxy-tcp bind 127.0.0.1:8888 mode tcp timeout connect 5s timeout server 10s timeout client 10s server server1 127.0.0.1:8000 But since the whole stream will not end until the server closes in turn, the server doesn't close and haproxy expires on server timeout. This problem has already struck by waking up an older bug and was partially fixed with commit `8059351` ("BUG/MEDIUM: http: don't disable lingering on requests with tunnelled responses") though it was not enough. The problem is that linger_risk is not suited here. In fact we need to know whether or not it is desired to close normally or silently, and whether or not a shutr() has already been received on this connection. This is the approach this patch takes, and it solves the problem for the various difficult modes (tcp, http-server-close, pretend-keepalive). This fix needs to be backported to 1.8. Many thanks to Ricardo for providing very detailed traces and configurations.	2017-12-22 18:54:05 +01:00
Willy Tarreau	d4569d1937	BUG/MEDIUM: cache: don't cache the response on no-cache="set-cookie" If the server mentions no-cache="set-cookie" in the response headers, we must guarantee that any set-cookie field will not be stored. We cannot edit the stored response on the fly to trim the set-cookie header so we can refrain from storing a response containing such a header. In theory we could use TX_SCK_PRESENT for this but this one is only set when the cookie is being watched by the configuration. Since these responses are not very frequent and often accompanied with a set-cookie header, let's simply refrain from caching whenever such directive is present. This needs to be backported to 1.8.	2017-12-22 18:03:04 +01:00
Willy Tarreau	504455c533	BUG/MEDIUM: cache: respect the request cache-control header Till now if a client emitted a request featureing a cache-control header, this one was not respected and a stale object could still be delievered.r This patch ensures that : - cache-control: no-cache disables retrieval from the cache but does not prevent the newly fetched object from being stored ; - cache-control: no-store can safely retrieve from the cache but prevents from storing any fetched object - cache-control: max-age/max-stale/min-fresh act like no-cache - pragma: no-cache acts like cache-control: no-cache. This needs to be backported to 1.8.	2017-12-22 17:56:18 +01:00
Willy Tarreau	c9bd34c7e0	BUG/MEDIUM: cache: replace old object on store Currently the cache aborts a store operation if the object to store already exists in the cache. This is used to avoid storing multiple copies at the same time on concurrent accesses. It causes an issue though, which is that existing unexpired objects cannot be updated. This happens when any request criterion disables the retrieval from the cache (eg: with max-age or any other cache-control condition). For now, let's simply replace the previous existing entry by unlinking it from the index. This could possibly be improved in the future if needed. This fix needs to be backported to 1.8.	2017-12-22 17:56:18 +01:00
Willy Tarreau	7704b1e89a	BUG/MEDIUM: cache: do not try to retrieve host-less requests from the cache All HTTP/1.1 requests the Host header share the same hash key 0 and will be return the first cached object. Let's add the check on the call to sha1_hosturi() to prevent this from happening. This must be backported to 1.8.	2017-12-22 17:56:17 +01:00
Willy Tarreau	0ad8e0dfea	MINOR: http: add a function to check request's cache-control header field The new function check_request_for_cacheability() is used to check if a request may be served from the cache, and/or allows the response to be stored into the cache. For this it checks the cache-control and pragma header fields, and adjusts the existing TX_CACHEABLE and a new TX_CACHE_IGNORE flags. For now, just like its response side counterpart, it only checks the first value of the header field. These functions should be reworked to improve their parsers and validate all elements.	2017-12-22 17:56:17 +01:00
Willy Tarreau	faf2909f9f	BUG/MINOR: cache: do not force the TX_CACHEABLE flag before checking cacheability The cache used to set this flag before calling check_response_for_cacheability() due to the way the flags were previously set (too late), but this is a bad idea as it loses the information of the implicit caching rules related to the method and the status code. Let's only rely on what was determined during the request and response parsing instead and not change it. This fix must be backported to 1.8, and it requires that the following patches are also merged : - MINOR: http: adjust the list of supposedly cacheable methods - MINOR: http: update the list of cacheable status codes as per RFC7231 - MINOR: http: start to compute the transaction's cacheability from the request - BUG/MINOR: http: do not ignore cache-control: public	2017-12-22 15:49:15 +01:00
Willy Tarreau	d3900cc31d	BUG/MINOR: http: properly detect max-age=0 and s-maxage=0 in responses In 1.3.8, commit `a15645d` ("[MAJOR] completed the HTTP response processing.") improved the response parser by taking care of the cache-control header field. The parser is wrong because it is split in two parts, one checking for elements containing an equal sign and the other one for those without. The "max-age=0" and "s-maxage=0" tests were located at the wrong place and thus have never matched. In practice the side effect was very minimal given that this code used to be enabled only when checking if a cookie had the risk of being cached or not. Recently in 1.8 it was also used to decide if the response could be cached but in practice the cache takes care of these values by itself so there is very limited impact. This fix can be backported to all stable versions.	2017-12-22 15:49:15 +01:00
Willy Tarreau	12b32f212f	BUG/MINOR: http: do not ignore cache-control: public In check_response_for_cacheability(), we don't check the cache-control flags if the response is already supposed not to be cacheable. This was introduced very early when cache-control:public was not checked, and it basically results in this last one not being able to properly mark the response as cacheable if it uses a status code which is non-cacheable by default. Till now the impact is very limited as it doesn't check that cookies set on non-default status codes are not cacheable, and it prevents the cache from caching such responses. Let's fix this by doing two things : - remove the test for !TX_CACHEABLE in the aforementionned function - however take care of 1xx status codes here (which used to be implicitly dealt with by the test above) and remove the explicit check for 101 in the caller This fix must be backported to 1.8.	2017-12-22 14:43:26 +01:00
Willy Tarreau	83ece462b4	MINOR: http: start to compute the transaction's cacheability from the request There has always been something odd with the way the cache-control flags are checked. Since it was made for checking for the risk of leaking cookies only, all the processing was done in the response. Because of this it is not possible to reuse the transaction flags correctly for use with the cache. This patch starts to change this by moving the method check in the request so that we know very early whether the transaction is expected to be cacheable and that this status evolves along with checked headers. For now it's not enough to use from the cache yet but at least it makes the flag more consistent along the transaction processing.	2017-12-22 14:43:26 +01:00
Willy Tarreau	c55ddce65c	MINOR: http: update the list of cacheable status codes as per RFC7231 Since RFC2616, the following codes were added to the list of codes cacheable by default : 204, 404, 405, 414, 501. For now this it only checked by the checkcache option to detect cacheable cookies.	2017-12-22 14:43:26 +01:00
Willy Tarreau	24ea0bcb1d	MINOR: http: adjust the list of supposedly cacheable methods We used to have a rule inherited from RFC2616 saying that the POST method was the only uncacheable one, but things have changed since and RFC7231+7234 made it clear that in fact only GET/HEAD/OPTIONS/TRACE are cacheable. Currently this rule is only used to detect cacheable cookies.	2017-12-22 14:43:26 +01:00
Eric Salama	fe7456f3b7	BUG/MEDIUM: lua: fix crash when using bogus mode in register_service() When using an incorrect 'mode' as 2nd argument of core.register_service(), HAProxy crashes while displaying the error message. To be backported to 1.8, 1.7 and 1.6.	2017-12-22 14:34:54 +01:00
Emeric Brun	e31148031f	BUG/MEDIUM: checks: a server passed in maint state was not forced down. Setting a server in maint mode, the required next_state was not set before calling the 'lb_down' function and so the system state was never commited. This patch should be backported in 1.8	2017-12-21 15:23:55 +01:00
Willy Tarreau	7aa15b072e	BUG/MEDIUM: stream: don't consider abortonclose on muxes which close cleanly The H2 mux can cleanly report an error when a client closes, which is not the case for the pass-through mux which only reports shutr. That was the reason why "option abortonclose" was created since there was no way to distinguish a clean shutdown after sending the request from an abort. The problem is that in case of H2, the streams are always shut read after the request is complete (when the END_STREAM flag is received), and that when this lands on a backend configured with "option abortonclose", this aborts the request. Disabling abortonclose is not always an option when H1 and H2 have to coexist. This patch makes use of the newly introduced mux capabilities reported via the stream interface's SI_FL_CLEAN_ABRT indicating that the mux is safe and that there is no need to turn a clean shutread into an abort. This way abortonclose has no effect on requests initiated from an H2 mux. This patch as well as these 3 previous ones need to be backported to 1.8 : - BUG/MINOR: h2: properly report a stream error on RST_STREAM - MINOR: mux: add flags to describe a mux's capabilities - MINOR: stream-int: set flag SI_FL_CLEAN_ABRT when mux supports clean aborts	2017-12-20 17:01:24 +01:00
Willy Tarreau	984fca9363	MINOR: stream-int: set flag SI_FL_CLEAN_ABRT when mux supports clean aborts By copying the info in the stream interface that the mux cleanly reports aborts, we'll have the ability to check this flag wherever needed regardless of the presence of a mux or not.	2017-12-20 16:56:32 +01:00
Willy Tarreau	28f1cb9da2	MINOR: mux: add flags to describe a mux's capabilities This new field will be used to describe certain properties of some muxes. For now we only add MX_FL_CLEAN_ABRT to indicate that a mux is able to unambiguously report aborts using CS_FL_ERROR contrary to others who may only report it via a read0. This will be used to improve handling of the abortonclose option with H2. Other flags may come later to report multiplexing capabilities or not, support of client/server sides etc.	2017-12-20 16:31:30 +01:00
Willy Tarreau	2153d3ce73	BUG/MINOR: h2: properly report a stream error on RST_STREAM We want to report such an error since H2 allows to differenciate between an end of stream and an abort. To be backported to 1.8.	2017-12-20 14:38:19 +01:00
Ryan O'Hara	8cb9993469	CONTRIB: halog: Fix compiler warnings in halog.c There were several unused variables in halog.c that each caused a compiler warning [-Wunused-but-set-variable]. This patch simply removes the declaration of said vairables and any instance where the unused variable was assigned a value.	2017-12-20 09:36:58 +01:00
Ryan O'Hara	957d12028e	CONTRIB: iprange: Fix compiler warning in iprange.c The declaration of main() in iprange.c did not specify a type, causing a compiler warning [-Wimplicit-int]. This patch simply declares main() to be type 'int' and calls exit(0) at the end of the function.	2017-12-20 09:36:58 +01:00
Etienne Carriere	aec8989e53	MINOR: spoe: add force-set-var option in spoe-agent configuration For security reasons, the spoe filter was only able to change values of existing variables. In specific cases (ex : with LUA code), the name of variables are unknown at the configuration parsing phase. The force-set-var option can be enabled to register all variables.	2017-12-20 08:55:18 +01:00
Bertrand Jacquin	72fa1ec24e	MEDIUM: netscaler: add support for standard NetScaler CIP protocol It looks like two version of the protocol exist as reported by Andreas Mahnke. This patch add support for both legacy and standard CIP protocol according to NetScaler specifications.	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	a341a2f479	MEDIUM: netscaler: do not analyze original IP packet size Original informations about the client are stored in the CIP encapsulated IP header, hence there is no need to consider original IP packet length to determine if data are missing. Instead this change detect missing data if the remaining buffer is large enough to contain a minimal IP and TCP header and if the buffer has as much data as CIP is telling.	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	67de5a295c	MINOR: netscaler: check in one-shot if buffer is large enough for IP and TCP header There is minimal gain in checking first the IP header length and then the TCP header length since we always want to capture information about both protocols. IPv4 length calculation was incorrect since IPv4 ip_len actually defines the total length of IPv4 header and following data.	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	43a66a96b3	BUG/MAJOR: netscaler: address truncated CIP header detection Buffer line is manually incremented in order to progress in the trash buffer but calculation are made omitting this manual offset. This leads to random packets being rejected with the following error: HTTP/1: Truncated NetScaler Client IP header received Instead, once original IP header is found, use the IP header length without considering the CIP encapsulation.	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	c7cc69ac36	BUG/MEDIUM: netscaler: use the appropriate IPv6 header size IPv6 header has a fixed size of 40 bytes, not 20.	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	7d668f9e76	MINOR: netscaler: rename cip_len to clarify its uage cip_len was meant to be the length of the data encapsulated in the CIP protocol, the size the IP and TCP header	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	4b4c286bee	MINOR: netscaler: remove the use of cip_magic only used once	2017-12-20 07:04:07 +01:00
Bertrand Jacquin	b387591f32	MINOR: netscaler: respect syntax As per doc/coding-style.txt	2017-12-20 07:04:07 +01:00
Davor Ocelic	4094ce1a23	DOC/MINOR: intro: typo, wording, formatting fixes - Fix a couple typos - Introduce a couple simple rewordings - Eliminate > 80 column lines Changes do not affect technical content and can be backported.	2017-12-20 07:01:36 +01:00
Christopher Faulet	789691778f	BUG/MEDIUM: mworker: Set FD_CLOEXEC flag on log fd A log socket (UDP or UNIX) is opened by the master during its startup, when the first log message is sent. So, to prevent FD leaks, we must ensure we correctly close it during a reload. By setting FD_CLOEXEC bit on it, we are sure it will be automatically closed it during a reload. This patch must be backported in 1.8.	2017-12-19 14:03:30 +01:00
Willy Tarreau	60a2ee7945	MINOR: sample: rename the "len" converter to "length" This converter was recently introduced by commit `ed0d24e` ("MINOR: sample: add len converter"). As found by Cyril, it causes an issue in "http-request capture" statements. The non-obvious problem is that an old syntax for sample expressions and converters used to support a series of words, each representing a converter. This used to be how the "stick" directives were created initially. By having a converter called "len", a statement such as "http-request capture foo len 10" considers "len" as a converter and not as the capture length. This obsolete syntax needs to be changed in 1.9 but it's too late for other versions. It's worth noting that the same problem can happen if converters are registered on the fly using Lua. Other language keywords that currently have to be avoided in converters include "id", "table", "if", "unless".	2017-12-15 07:13:48 +01:00
Cyril Bont�	9fc9e53763	BUG: MINOR: http: don't check http-request capture id when len is provided Randomly, haproxy could fail to start when a "http-request capture" action is defined, without any change to the configuration. The issue depends on the memory content, which may raise a fatal error like : unable to find capture id 'xxxx' referenced by http-request capture rule Commit `fd608dd2` already prevents the condition to happen, but this one should be included for completeness and to reclect the code on the response side. The issue was introduced recently by commit `29730ba5` and should only be backported to haproxy 1.8.	2017-12-14 22:46:27 +01:00
Cyril Bont�	3906d5739c	BUG: MAJOR: lb_map: server map calculation broken Adrian Williams reported that several balancing methods were broken and sent all requests to one backend. This is a regression in haproxy 1.8 where the server score was not correctly recalculated. This fix must be backported to the 1.8 branch.	2017-12-14 17:36:39 +01:00
Etienne Carriere	ed0d24ebed	MINOR: sample: add len converter Add len converter that returns the length of a string	2017-12-14 14:36:10 +01:00
Willy Tarreau	b78b80efe5	BUG/MINOR: stream-int: don't try to receive again after receiving an EOS When an end of stream has been reported, we should not try to receive again as the mux layer might not be prepared to this and could report unexpected errors. This is more of a strengthening measure that follows the introduction of conn_stream that came in 1.8. It's desired to backport this into 1.8 though it's uncertain at this time whether it may have caused real issues.	2017-12-14 13:43:52 +01:00
Willy Tarreau	91bfdd7e04	BUG/MEDIUM: h2: fix stream limit enforcement Commit `4974561` ("BUG/MEDIUM: h2: enforce the per-connection stream limit") implemented a stream limit enforcement on the connection but it was not correctly done as it would count streams still known by the connection, which includes the lingering ones that are already marked close. We need to count only the non-closed ones, which this patch does. The effect is that some streams are rejected a bit before the limit. This fix needs to be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	805935147a	BUG/MEDIUM: http: don't disable lingering on requests with tunnelled responses The HTTP forwarding engine needs to disable lingering on requests in case the connection to the server has to be suddenly closed due to http-server-close being used, so that we don't accumulate lethal TIME_WAIT sockets on the outgoing side. A problem happens when the server doesn't advertise a response size, because the response message quickly goes through the MSG_DONE and MSG_TUNNEL states, and once the client has transferred all of its data, it turns to MSG_DONE and immediately sets NOLINGER and closes before the server has a chance to respond. The problem is that this destroys some of the pending DATA being uploaded, the server doesn't receive all of them, detects an error and closes. This early NOLINGER is inappropriate in this situation because it happens before the response is transmitted. This state transition to MSG_TUNNEL doesn't happen when the response size is known since we stay in MSG_DATA (and related states) during all the transfer. Given that the issue is only related to connections not advertising a response length and that by definition these connections cannot be reused, there's no need for NOLINGER when the response's transfer length is not known, which can be verified when entering the CLOSED state. That's what this patch does. This fix needs to be backported to 1.8 and very likely to 1.7 and older as it affects the very rare case where a client immediately closes after the last uploaded byte (typically a script). However given that the risk of occurrence in HTTP/1 is extremely low, it is probably wise to wait before backporting it before 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	13e4e94dae	BUG/MEDIUM: h2: don't close after the first DATA frame on tunnelled responses Tunnelled responses are those without a content-length nor a chunked encoding. They are specially dealt with in the current code but the behaviour is not correct. The fact that the chunk size is left to zero with a state artificially set to CHUNK_SIZE validates the test on whether or not to set the end of stream flag. Thus the first DATA frame always carries the ES flag and subsequent ones remain blocked. This patch fixes it in two ways : - update h1m->curr_len to the size of the current buffer so that it is properly subtracted later to find the real end ; - don't set the state to CHUNK_SIZE when there's no content-length and instead set it to CHUNK_SIZE only when there's chunking. This fix needs to be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	c4134ba8b0	BUG/MEDIUM: h2: don't switch the state to HREM before end of DATA frame We used to switch the stream's state to HREM when seeing and ES bit on the DATA frame before actually being able to process that frame, possibly resulting in the DATA frame being processed after the stream was seen as half-closed and possibly being rejected. The state must not change before the frame is really processed. Also fixes a harmless typo in the flag name which should have DATA and not HEADERS in its name (but all values are equal). Must be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	6847262211	MINOR: h2: don't demand that a DATA frame is complete before processing it Since last commit it's not required that the DATA frames are complete anymore so better start with what we have. Only the HEADERS frame requires this. This may be backported as part of the upload fixes.	2017-12-14 13:43:52 +01:00
Willy Tarreau	8fc016d0fe	BUG/MEDIUM: h2: support uploading partial DATA frames We currently have a problem with DATA frames when they don't fit into the destination buffer. While it was imagined that in theory this never happens, in practice it does when "option http-buffer-request" is set, because the headers don't leave the target buffer before trying to read so if the frame is full, there's never enough room. This fix consists in reading what can be read from the frame and advancing the input buffer. Once the contents left are only the padding, the frame is completely processed. This also solves another problem we had which is that it was possible to fill a request buffer beyond its reserve because the <count> argument was not respected in h2_rcv_buf(). Thus it's possible that some POST requests sent at once with a headers+body filling exactly a buffer could result in "400 bad req" when trying to add headers. This fix must be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	05e5dafe9a	MINOR: h2: store the demux padding length in the h2c struct We'll try to process partial frames and for this we need to know the padding length. The first step requires to extract it during the parsing and store it in the demux context in the connection. Till now it was only processed at once.	2017-12-14 13:43:52 +01:00
Willy Tarreau	d13bf27e78	BUG/MEDIUM: h2: debug incoming traffic in h2_wake() Even after previous commit ("BUG/MEDIUM: h2: work around a connection API limitation") there is still a problem with some requests. Sometimes when polling for more request data while some pending data lies in the buffer, there's no way to enter h2_recv() because the FD is not marked ready for reading. We need to slightly change the approach and make h2_recv() only receive from the buffer and h2_wake() always attempt to demux if the demux is not blocked. However, if the connection is already being polled for reading, it will not wake up from polling. For this reason we need to cheat and also pretend a request for sending data, which ensures that as soon as any direction may move, we can continue to demux. This shows that in the long term we probably need a better way to resume an interrupted operation at the mux level. With this fix, no more hangups happen during uploads. Note that this time the setup required to provoke the hangups was a bit complex : - client is "curl" running on local host, uploading 1.7 MB of data via haproxy - haproxy running on local host, forwarding to a remote server through a 100 Mbps only switch - timeouts disabled on haproxy - remote server made of thttpd executing a cgi reading request data through "dd bs=10" to slow down everything. With such a setup, around 3-5% of the connections would hang up. This fix needs to be backported to 1.8.	2017-12-14 13:43:24 +01:00
Willy Tarreau	6042aeb1e8	BUG/MEDIUM: h2: work around a connection API limitation The connection API permits us to enable or disable receiving on a connection. The underlying FD layer arranges this with the polling and the fd cache. In practice, if receiving was allowed and an end of buffer was reached, the FD is subscribed to the polling. If later we want to process pending data from the buffer, we have to enable receiving again, but since it's already enabled (in polled mode), nothing happens and the pending data remain stuck until a new event happens on the connection to wake the FD up. This is a limitation of the internal connection API which is not very friendly to the new mux architecture. The visible effect is that certain uploads to slow servers experience truncation on timeout on their last blocks because nothing new comes from the connection to wake it up while it's being polled. In order to work around this, there are two solutions : - either cheat on the connection so that conn_update_xprt_polling() always performs a call to fd_may_recv() after fd_want_recv(), that we can trigger from the mux by always calling conn_xprt_stop_recv() before conn_xprt_want_recv(), but that's a bit tricky and may have side effects on other parts (eg: SSL) - or we refrain from receiving in the mux as soon as we're busy on anything else, regardless of whether or not some room is available in the receive buffer. This patch takes the second approach above. This way once we read some data, as soon as we detect that we're stuck, we immediately stop receiving. This ensures the event doesn't go into polled mode for this period and that as soon as we're unstuck we can continue. In fact this guarantees that we can only wait on one side of the mux for a given direction. A future improvement of the connection layer should make it possible to resume processing of an interrupted receive operation. This fix must be backported to 1.8.	2017-12-14 13:43:24 +01:00
Willy Tarreau	315d807cbc	BUG/MEDIUM: h2: enable recv polling whenever demuxing is possible In order to allow demuxing when the dmux buffer is full, we need to enable data receipt in multiple conditions. Since the conditions are a bit complex, they have been delegated to a new function h2_recv_allowed() which follows these rules : - if an error or a shutdown was detected on the connection and the buffer is empty, we must not attempt to receive - if the demux buf failed to be allocated, we must not try to receive and we know there is nothing pending - if the buffer is not full, we may attempt to receive - if no flag indicates a blocking condition, we may attempt to receive - otherwise must may not attempt No more truncated payloads are detected in tests anymore, which seems to indicate that the issue was worked around. A better connection API will have to be created for new versions to make this stuff simpler and more intuitive. This fix needs to be backported to 1.8 along with the rest of the patches related to CS_FL_RCV_MORE.	2017-12-10 22:17:57 +01:00
Willy Tarreau	c9ede6c43e	BUG/MEDIUM: h2: automatically set CS_FL_RCV_MORE when the output buffer is full If we can't demux pending data due to a stream buffer full condition, we now set CS_FL_RCV_MORE on the conn_stream so that the stream layer knows it must call back as soon as possible to restart demuxing. Without this, some uploaded payloads are truncated if the server does not consume them fast enough and buffers fill up. Note that this is still not enough to solve the problem, some changes are required on the recv() and update_poll() paths to allow to restart reading even with a buffer full condition. This patch must be backported to 1.8.	2017-12-10 21:28:43 +01:00
Willy Tarreau	6577b48613	BUG/MEDIUM: stream-int: always set SI_FL_WAIT_ROOM on CS_FL_RCV_MORE When a stream interface tries to read data from a mux using rcv_buf(), sometimes it sees 0 as the return value and concludes that there's no more data while there are, resulting in the connection being polled for more data and no new attempt being made at reading these pending data. Now it will automatically check for flag CS_FL_RCV_MORE to know if the mux really did not have anything available or was not able to provide these data by lack of room in the destination buffer, and will set SI_FL_WAIT_ROOM accordingly. This will ensure that once current data lying in the buffer are forwarded to the other side, reading chk_rcv() will be called to re-enable reading. It's important to note that in practice it will rely on the mux's update_poll() function to re-enable reading and that where the calls are placed in the stream interface, it's not possible to perform a new synchronous rcv_buf() call. Thus a corner case remains where the mux cannot receive due to a full buffer or any similar condition, but needs to be able to wake itself up to deliver pending data. This is a limitation of the current connection/conn_stream API which will likely need a new event subscription to at least call ->wake() asynchronously (eg: mux->{kick,restart,touch,update} ?). For now the affected mux (h2 only) will have to take care of the extra logic to carefully enable polling to restart processing incoming data. This patch relies on previous one (MINOR: conn_stream: add new flag CS_FL_RCV_MORE to indicate pending data) and both must be backported to 1.8.	2017-12-10 21:19:33 +01:00
Willy Tarreau	3c8294b607	MINOR: conn_stream: add new flag CS_FL_RCV_MORE to indicate pending data Due to the nature of multiplexed protocols, it will often happen that some operations are only performed on full frames, preventing any partial operation from being performed. HTTP/2 is one such example. The current MUX API causes a problem here because the rcv_buf() function has no way to let the stream layer know that some data could not be read due to a lack of room in the buffer, but that data are definitely present. The problem with this is that the stream layer might not know it needs to call the function again after it has made some room. And if the frame in the buffer is not followed by any other, nothing will move anymore. This patch introduces a new conn_stream flag CS_FL_RCV_MORE whose purpose is to indicate on the stream that more data than what was received are already available for reading as soon as more room will be available in the buffer. This patch doesn't make use of this flag yet, it only declares it. It is expected that other similar flags may come in the future, such as reports of pending end of stream, errors or any such event that might save the caller from having to poll, or simply let it know that it can take some actions after having processed data.	2017-12-10 21:13:25 +01:00

1 2 3 4 5 ...

7307 Commits