haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 16:47:18 +02:00

Author	SHA1	Message	Date
Willy Tarreau	fb07b3f825	BUG/MEDIUM: mux-h2/htx: never wait for EOM when processing trailers In message https://www.mail-archive.com/haproxy@formilux.org/msg33541.html Patrick Hemmer reported an interesting bug affecting H2 and trailers. The problem is that in order to close the stream we have to see the EOM block, but nothing guarantees it will atomically be delivered with the trailers block(s). So the code currently waits for it by returning zero when it was not found, resulting in the caller (h2_snd_buf()) to loop forever calling it again. The current internal connection/connstream API doesn't allow a send actor to notify its caller that it cannot process the data until it gets more, so even returning zero will only lead to calls in loops without any guarantee that any progress will be made. Some late amendments to HTX already guaranteed the atomicity of the trailers block during snd_buf(), which is currently ensured by the fact that producers create exactly one such trailers block for all trailers. So in practice we can only loop between trailers and EOM. This patch changes the behaviour by making h2s_htx_make_trailers() become atomic by not consuming the EOM block. This way either it finds the end of trailers marker (empty line) or it fails. Once it sends the trailers block, ES is set so the stream turns HLOC or CLOSED. Thanks to previous patch "MEDIUM: mux-h2: discard contents that are to be sent after a shutdown" is is now safe to interrupt outgoing data processing, and the late EOM block will silently be discarded when the caller finally sends it. This is a bit tricky but should remain solid by design, and seems like the only option we have that is compatible with 1.9, where it must be backported along with the aforementioned patch.	2019-05-07 11:08:02 +02:00
Willy Tarreau	2b77848418	MEDIUM: mux-h2: discard contents that are to be sent after a shutdown In h2_snd_buf() we discard any possible buffer contents requested to be sent after a close or an error. But in practice we can extend this to any case where the stream is locally half-closed since it means we will never be able to send these data anymore. For now it must not change anything, but it will be used by subsequent patches to discard lone a HTX EOM block arriving after the trailers block.	2019-05-07 11:08:02 +02:00
Willy Tarreau	aab1a60977	BUG/MEDIUM: h2/htx: always fail on too large trailers In case a header frame carrying trailers just fits into the HTX buffer but leaves no room for the EOM block, we used to return the same code as the one indicating we're missing data. This could would result in such frames causing timeouts instead of immediate clean aborts. Now they are properly reported as stream errors (since the frame was decoded and the compression context is still synchronized). This must be backported to 1.9.	2019-05-07 11:08:02 +02:00
Willy Tarreau	5121e5d750	BUG/MINOR: mux-h2: rely on trailers output not input to turn them to empty data When sending trailers, we may face an empty HTX trailers block or even have to discard some of the headers there and be left with nothing to send. RFC7540 forbids sending of empty HEADERS frames, so in this case we turn to DATA frames (which is possible since after other DATA). The code used to only check the input frame's contents to decide whether or not to switch to a DATA frame, it didn't consider the possibility that the frame only used to contain headers discarded later, thus it could still emit an empty HEADERS frame in such a case. This patch makes sure that the output frame size is checked instead to take the decision. This patch must be backported to 1.9. In practice this situation is never encountered since the discarded headers have really nothing to do in a trailers block.	2019-05-07 11:07:59 +02:00
Willy Tarreau	97215ca284	BUG/MEDIUM: mux-h2: properly deal with too large headers frames In h2c_decode_headers(), now that we support CONTINUATION frames, we try to defragment all pending frames at once before processing them. However if the first is exactly full and the second cannot be parsed, we don't detect the problem and we wait for the next part forever due to an incorrect check on exit; we must abort the processing as soon as the current frame remains full after defragmentation as in this case there is no way to make forward progress. Thanks to Yves Lafon for providing traces exhibiting the problem. This must be backported to 1.9.	2019-04-29 10:20:21 +02:00
Olivier Houchard	e179d0e88f	MEDIUM: connections: Provide a xprt_ctx for each xprt method. For most of the xprt methods, provide a xprt_ctx. This will be useful later when we'll want to be able to stack xprts. The init() method now has to create and provide the said xprt_ctx if needed.	2019-04-18 14:56:24 +02:00
Olivier Houchard	3f795f76e8	MEDIUM: tasks: Merge task_delete() and task_free() into task_destroy(). task_delete() was never used without calling task_free() just after, and task_free() was only used on error pathes to destroy a just-created task, so merge them into task_destroy(), that will remove the task from the wait queue, and make sure the task is either destroyed immediately if it's not in the run queue, or destroyed when it's supposed to run.	2019-04-18 10:10:04 +02:00
Olivier Houchard	998410a41b	BUG/MEDIUM: h2: Revamp the way send subscriptions works. Instead of abusing the SUB_CALL_UNSUBSCRIBE flag, revamp the H2 code a bit so that it just checks if h2s->sending_list is empty to know if the tasklet of the stream_interface has been waken up or not. send_wait is now set to NULL in h2_snd_buf() (ideally we'd set it to NULL as soon as we're waking the tasklet, but it can't be done, because we still need it in case we have to remove the tasklet from the task list).	2019-04-15 19:27:57 +02:00
Olivier Houchard	9a0f559676	BUG/MEDIUM: h2: Make sure we're not already in the send_list in h2_subscribe(). In h2_subscribe(), don't add ourself to the send_list if we're already in it. That may happen if we try to send and fail twice, as we're only removed from the send_list if we managed to send data, to promote fairness. Failing to do so can lead to either an infinite loop, or some random crashes, as we'd get the same h2s in the send_list twice. This should be backported to 1.9.	2019-04-15 19:27:57 +02:00
Olivier Houchard	0e0793715c	BUG/MEDIUM: muxes: Make sure we unsubcribed when destroying mux ctx. In the h1 and h2 muxes, make sure we unsubscribed before destroying the mux context. Failing to do so will lead in a segfault later, as the connection will attempt to dereference its conn->send_wait or conn->recv_wait, which pointed to the now-free'd mux context. This was introduced by commit `39a96ee16e`, so should only be backported if that commit gets backported.	2019-04-15 19:27:57 +02:00
Christopher Faulet	61840e715f	BUG/MEDIUM: muxes: Don't dereference mux context if null in release functions When a mux context is released, we must be sure it exists before dereferencing it. The bug was introduced in the commit `39a96ee16` ("MEDIUM: muxes: Be prepared to don't own connection during the release"). No need to backport this patch, expect if the commit `39a96ee16` is backported too.	2019-04-15 09:47:10 +02:00
Christopher Faulet	39a96ee16e	MEDIUM: muxes: Be prepared to don't own connection during the release This happens during mux upgrades. In such case, when the destroy() callback is called, the connection points to a different mux's context than the one passed to the callback. It means the connection is owned by another mux. The old mux is then released but the connection is not closed.	2019-04-12 22:06:53 +02:00
Christopher Faulet	73c1207c71	MINOR: muxes: Pass the context of the mux to destroy() instead of the connection It is mandatory to handle mux upgrades, because during a mux upgrade, the connection will be reassigned to another multiplexer. So when the old one is destroyed, it does not own the connection anymore. Or in other words, conn->ctx does not point to the old mux's context when its destroy() callback is called. So we now rely on the multiplexer context do destroy it instead of the connection. In addition, h1_release() and h2_release() have also been updated in the same way.	2019-04-12 22:06:53 +02:00
Christopher Faulet	51f73eb11a	MEDIUM: muxes: Add an optional input buffer during mux initialization The mux's callback init() now take a pointer to a buffer as extra argument. It must be used by the multiplexer as its input buffer. This buffer is always NULL when a multiplexer is initialized with a fresh connection. But if a mux upgrade is performed, it may be filled with existing data. Note that, for now, mux upgrades are not supported. But this commit is mandatory to do so.	2019-04-12 22:06:53 +02:00
Christopher Faulet	e9b7072e9e	MINOR: muxes: Rely on conn_is_back() during init to handle front/back conn Instead of using the connection context to make the difference between a frontend connection and a backend connection, we now rely on the function conn_is_back().	2019-04-12 22:06:53 +02:00
Christopher Faulet	9f38f5aa80	MINOR: muxes: Add a flag to specify a multiplexer uses the HTX A multiplexer must now set the flag MX_FL_HTX when it uses the HTX to structured the data exchanged with channels. the muxes h1 and h2 set this flag. Of course, for the mux h2, it is set on h2_htx_ops only.	2019-04-12 22:06:53 +02:00
Christopher Faulet	9b579106fe	MINOR: mux-h2: Add a mux_ops dedicated to the HTX mode Instead of using the same mux_ops structure for the legacy HTTP mode and the HTX mode, a dedicated mux_ops is now used for the HTX mode. Same callbacks are used for both. But the flags may be different depending on the mode used.	2019-04-12 22:06:53 +02:00
Olivier Houchard	3ca18bf0bd	BUG/MEDIUM: h2: Don't attempt to recv from h2_process_demux if we subscribed. Modify h2c_restart_reading() to add a new parameter, to let it know if it should consider if the buffer isn't empty when retrying to read or not, and call h2c_restart_reading() using 0 as a parameter from h2_process_demux(). If we're leaving h2_process_demux() with a non-empty buffer, it means the frame is incomplete, and we're waiting for more data, and if we already subscribed, we'll be waken when more data are available. Failing to do so means we'll be waken up in a loop until more data are available. This should be backported to 1.9.	2019-04-05 16:03:54 +02:00
Willy Tarreau	a27db38f12	BUG/MEDIUM: mux-h2: make sure to always notify streams of EOS condition Recent commit `63768a63d` ("MEDIUM: mux-h2: Don't mix the end of the message with the end of stream") introduced a race which may manifest itself with small connection counts on large objects and large server timeouts in legacy mode. Sometimes h2s_close() is called while the data layer is subscribed to read events but nothing in the chain can cause this wake-up to happen and some streams stall for a while at the end of a transfer until the server timeout strikes and ends the stream completes. We need to wake the stream up if it's subscribed to rx events there, which is what this patch does. When the patch above is backported to 1.9, this patch will also have to be backported.	2019-03-25 18:13:16 +01:00
Willy Tarreau	e73256fd2a	BUG/MEDIUM: task/h2: add an idempotent task removal fucntion Previous commit `3ea351368` ("BUG/MEDIUM: h2: Remove the tasklet from the task list if unsubscribing.") uncovered an issue which needs to be addressed in the scheduler's API. The function task_remove_from_task_list() was initially designed to remove a task from the running tasklet list from within the scheduler, and had to be used in h2 to abort pending I/O events. However this function was not designed to be idempotent, occasionally causing a double removal from the tasklet list, with the second doing nothing but affecting the apparent tasks count and making haproxy use 100% CPU on some tests consisting in stopping the client during some transfers. The h2_unsubscribe() function can sometimes be called upon stream exit after an error where the tasklet was possibly already removed, so it. This patch does 2 things : - it renames task_remove_from_task_list() to __task_remove_from_tasklet_list() to discourage users from calling it. Also note the fix in the naming since it's a tasklet list and not a task list. This function is still uesd from the scheduler. - it adds a new, idempotent, task_remove_from_tasklet_list() function which does nothing if the task is already not in the tasklet list. This patch will need to be backported where the commit above is backported.	2019-03-25 18:02:54 +01:00
Olivier Houchard	3ea3513689	BUG/MEDIUM: h2: Remove the tasklet from the task list if unsubscribing. In h2_unsubscribe(), if we unsubscribe on SUB_CALL_UNSUBSCRIBE, then remove ourself from the sending_list, and remove the tasklet from the task list. We're probably about to destroy the stream anyway, so we don't want the tasklet to run, or to stay in the sending_list, or it could lead to a crash. This should be backpored to 1.9.	2019-03-25 14:34:26 +01:00
Olivier Houchard	afc7cb85c4	BUG/MEDIUM: h2: Follow the same logic in h2_deferred_shut than in h2_snd_buf. In h2_deferred_shut(), don't just set h2s->send_wait to NULL, instead, use the same logic as in h2_snd_buf() and only do so if we successfully sent data (or if we don't want to send them anymore). Setting it to NULL can lead to crashes. This should be backported to 1.9.	2019-03-25 14:34:26 +01:00
Olivier Houchard	fd1e96d2fb	BUG/MEDIUM: h2: Use the new sending_list in h2s_notify_send(). In h2s_notify_send(), use the new sending_list instead of using the old way of setting hs->send_wait to NULL, failing to do so may lead to crashes. This should be backported to 1.9.	2019-03-25 14:34:26 +01:00
Olivier Houchard	01d4cb5339	BUG/MEDIUM: h2: only destroy the h2s if h2s->cs is NULL. In h2_deferred_shut(), only attempt to destroy the h2s if h2s->cs is NULL. h2s->cs being non-NULL means it's still referenced by the stream interface, so it may try to use it later, and that could lead to a crash. This should be backported to 1.9.	2019-03-25 13:35:02 +01:00
Christopher Faulet	87a8f353f1	CLEANUP: muxes/stream-int: Remove flags CS_FL_READ_NULL and SI_FL_READ_NULL Since the flag CF_SHUTR is no more set to mark the end of the message, these flags become useless. This patch should be backported to 1.9.	2019-03-25 06:55:23 +01:00
Christopher Faulet	63768a63d7	MEDIUM: mux-h2: Don't mix the end of the message with the end of stream The H2 multiplexer now sets CS_FL_EOI when it receives a frame with the ES flag. And when the H2 streams is closed, it set the flag CS_FL_REOS. This patch should be backported to 1.9.	2019-03-25 06:26:30 +01:00
Christopher Faulet	3ab07c35b4	MINOR: mux-h2: Remove useless test on ES flag in h2_frt_transfer_data() Same test is already performed in the caller function, h2c_frt_handle_data(). This patch should be backported to 1.9.	2019-03-22 18:06:17 +01:00
Olivier Houchard	d360ac60f4	BUG/MEDIUM: h2: Try to be fair when sending data. On the send path, try to be fair, and make sure the first to attempt to send data will actually be the first to send data when it's possible (ie when the mux' buffer is not full anymore). To do so, use a separate list element for the sending_list, and only remove the h2s from the send_list/fctl_list if we successfully sent data. If we did not, we'll keep our place in the list, and will be able to try again next time. This should be backported to 1.9.	2019-03-22 18:05:03 +01:00
Willy Tarreau	749f5cab83	CLEANUP: mux-h2: add some comments to help understand the code Some functions' roles and usage are far from being obvious, and diving into this part each time requires deep concentration before starting to understand who does what. Let's add a few comments which help figure some of the useful pieces.	2019-03-21 19:19:36 +01:00
Willy Tarreau	8ab128c06a	MINOR: mux-h2: copy small data blocks more often and reduce the number of pauses We tend to refrain from sending data a bit too much in the H2 mux : whenever there are pending data in the buffer and we try to copy something larger than 1/4 of the buffer we prefer to pause. This is suboptimal for medium-sized objects which have to send their headers and later their data. This patch slightly changes this by allowing a copy of a large block if it fits at once and if the realign cost is small, i.e. the pending data are small or the block fits in the contiguous area. Depending on the object size this measurably improves the download performance by between 1 and 10%, and possibly lowers the transfer latency for medium objects.	2019-03-21 18:28:31 +01:00
Olivier Houchard	fd8bd4521a	BUG/MEDIUM: mux-h2: Use the right list in h2_stop_senders(). In h2_stop_senders(), when we're about to move the h2s about to send back to the send_list, because we know the mux is full, instead of putting them all in the send_list, put them back either in the fctl_list or the send_list depending on if they are waiting for the flow control or not. This also makes sure they're inserted in their arrival order and not reversed. This should be backported to 1.9.	2019-03-21 18:28:31 +01:00
Olivier Houchard	16ff261633	BUG/MEDIUM: mux-h2: Don't bother keeping the h2s if detaching and nothing to send. In h2_detach(), don't bother keeping the h2s even if it was waiting for flow control if we no longer are subscribed for receiving or sending, as nobody will do anything once we can write in the mux, anyway. Failing to do so may lead to h2s being kept opened forever. This should be backported to 1.9.	2019-03-21 18:28:31 +01:00
Olivier Houchard	7a977431ca	BUG/MEDIUM: mux-h2: Make sure we destroyed the h2s once shutr/shutw is done. If we're waiting until we can send a shutr and/or a shutw, once we're done and not considering sending anything, destroy the h2s, and eventually the h2c if we're done with the whole connection, or it will never be done. This should be backported to 1.9.	2019-03-21 18:28:31 +01:00
Christopher Faulet	203b2b0a5a	MINOR: muxes: Report the Last read with a dedicated flag For conveniance, in HTTP muxes (h1 and h2), the end of the stream and the end of the message are reported the same way to the stream, by setting the flag CS_FL_EOS. In the stream-interface, when CS_FL_EOS is detected, a shutdown for read is reported on the channel side. This is historical. With the legacy HTTP layer, because the parsing is done by the stream in HTTP analyzers, the EOS really means a shutdown for read. Most of time, for muxes h1 and h2, it works pretty well, especially because the keep-alive is handled by the muxes. The stream is only used for one transaction. So mixing EOS and EOM is good enough. But not everytime. For now, client aborts are only reported if it happens before the end of the request. It is an error and it is properly handled. But because the EOS was already reported, client aborts after the end of the request are silently ignored. Eventually an error can be reported when the response is sent to the client, if the sending fails. Otherwise, if the server does not reply fast enough, an error is reported when the server timeout is reached. It is the expected behaviour, excpect when the option abortonclose is set. In this case, we must report an error when the client aborts. But as said before, this event can be ignored. So to be short, for now, the abortonclose is broken. In fact, it is a design problem and we have to rethink all channel's flags and probably the conn-stream ones too. It is important to split EOS and EOM to not loose information anymore. But it is not a small job and the refactoring will be far from straightforward. So for now, temporary flags are introduced. When the last read is received, the flag CS_FL_READ_NULL is set on the conn-stream. This way, we can set the flag SI_FL_READ_NULL on the stream interface. Both flags are persistant. And to be sure to wake the stream, the event CF_READ_NULL is reported. So the stream will always have the chance to handle the last read. This patch must be backported to 1.9 because it will be used by another patch to fix the option abortonclose.	2019-03-18 15:50:23 +01:00
Christopher Faulet	35757d38ce	MINOR: mux-h2: Set REFUSED_STREAM error to reset a stream if no data was never sent According to the H2 spec (see #8.1.4), setting the REFUSED_STREAM error code is a way to indicate that the stream is being closed prior to any processing having occurred, such as when a server-side H1 keepalive connection is closed without sending anything (which differs from the regular error case since haproxy doesn't even generate an error message). Any request that was sent on the reset stream can be safely retried. So, when a stream is closed, if no data was ever sent back (ie. the flag H2_SF_HEADERS_SENT is not set), we can set the REFUSED_STREAM error code on the RST_STREAM frame. This patch may be backported to 1.9.	2019-03-18 15:50:23 +01:00
Christopher Faulet	f02ca00a36	BUG/MEDIUM: mux-h2: Always wakeup streams with no id to avoid frozen streams This only happens for server streams because their id is assigned when the first message is sent. If these streams are not woken up, some events can be lost leading to frozen streams. For instance, it happens when a server closes its connection before sending its preface. This patch must be backported to 1.9.	2019-03-18 15:50:23 +01:00
Willy Tarreau	7196dd6071	MINOR: mux-h2: always pass HTX_FL_PARSING_ERROR between h2s and buf on RX In order to allow the H2 parser to report parsing errors, we must make sure to always pass the HTX_FL_PARSING_ERROR flag from the h2s htx to the conn_stream's htx.	2019-03-05 10:56:34 +01:00
Willy Tarreau	927b88ba00	BUG/MAJOR: mux-h2: fix race condition between close on both ends A crash in H2 was reported in issue #52. It turns out that there is a small but existing race by which a conn_stream could detach itself using h2_detach(), not being able to destroy the h2s due to pending output data blocked by flow control, then upon next h2s activity (transfer_data or trailers parsing), an ES flag may need to be turned into a CS_FL_REOS bit, causing a dereference of a NULL stream. This is a side effect of the fact that we still have a few places which incorrectly depend on the CS flags, while these flags should only be set by h2_rcv_buf() and h2_snd_buf(). All candidate locations along this path have been secured against this risk, but the code should really evolve to stop depending on CS anymore. This fix must be backported to 1.9 and possibly partially to 1.8.	2019-03-04 08:17:12 +01:00
Willy Tarreau	0bbad6bb06	BUG/MEDIUM: h2: advertise to servers that we don't support push The h2c_send_settings() function was initially made to serve on the frontend. Here we don't need to advertise that we don't support PUSH since we don't do that ourselves. But on the backend side it's different because PUSH is enabled by default so we must announce that we don't want the server to use it. This must be backported to 1.9.	2019-02-26 16:07:27 +01:00
Willy Tarreau	67b8caefc9	BUG/MEDIUM: mux-h2/htx: send an empty DATA frame on empty HTX trailers When chunked-encoding is used in HTX mode, a trailers HTX block is always made due to the way trailers are currently implemented (verbatim copy of the H1 representation). Because of this it's not possible to know when processing data that we've reached the end of the stream, and it's up to the function encoding the trailers (h2s_htx_make_trailers) to put the end of stream. But when there are no trailers and only an empty HTX block, this one cannot produce a HEADERS frame, thus it cannot send the END_STREAM flag either, leaving the other end with an incomplete message, waiting for either more data or some trailers. This is particularly visible with POST requests where the server continues to wait. What this patch does is transform the HEADERS frame into an empty DATA frame when meeting an empty trailers block. It is possible to do this because we've not sent any trailers so the other end is still waiting for DATA frames. The check is made after attempting to encode the list of headers, so as to minimize the specific code paths. Thanks to Dragan Dosen for reporting the issue with a reproducer. This fix must be backported to 1.9.	2019-02-21 18:22:26 +01:00
Willy Tarreau	a24b35ca18	MINOR: mux-h2: make the H2 MAX_FRAME_SIZE setting configurable This creates a new tunable "tune.h2.max-frame-size" to adjust the advertised max frame size. When not set it still defaults to the buffer size. It is convenient to advertise sizes lower than the buffer size, for example when using very large buffers.	2019-02-21 17:30:59 +01:00
Christopher Faulet	eaf0d2a936	MINOR: mux-h2: Set HTX extra value when possible For now, this can be only done when a content-length is specified. In fact, it is the same value than h2s->body_len, the remaining body length according to content-length. Setting this field allows the fast forwarding at the channel layer, improving significantly data transfer for big objects. This patch may be backported to 1.9.	2019-02-19 16:26:14 +01:00
Christopher Faulet	0b46548a68	BUG/MEDIUM: h2/htx: Correctly handle interim responses when HTX is enabled 1xx responses does not work in HTTP2 when the HTX is enabled. First of all, when a response is parsed, only one HEADERS frame is expected. So when an interim response is received, the flag H2_SF_HEADERS_RCVD is set and the next HEADERS frame (for another interim repsonse or the final one) is parsed as a trailers one. Then when the response is sent, because an EOM block is found at the end of the interim HTX response, the ES flag is added on the frame, closing too early the stream. Here, it is a design problem of the HTX. Iterim responses are considered as full messages, leading to some ambiguities when HTX messages are processed. This will not be fixed now, but we need to keep it in mind for future improvements. To fix the parsing bug, the flag H2_MSGF_RSP_1XX is added when the response headers are decoded. When this flag is set, an EOM block is added into the HTX message, despite the fact that there is no ES flag on the frame. And we don't set the flag H2_SF_HEADERS_RCVD on the corresponding H2S. So the next HEADERS frame will not be parsed as a trailers one. To fix the sending bug, the ES flag is not set on the frame when an interim response is processed and the flag H2_SF_HEADERS_SENT is not set on the corresponding H2S. This patch must be backported to 1.9.	2019-02-19 16:26:14 +01:00
Christopher Faulet	fd74267264	BUG/MINOR: mux-h2: Don't add ":status" pseudo-header on trailers It is a cut-paste bug. Pseudo-header fields MUST NOT appear in trailers. This patch must be backported to 1.9.	2019-02-18 16:25:06 +01:00
Christopher Faulet	37070b2b15	BUG/MEDIUM: mux-h2/htx: Always set CS flags before exiting h2_rcv_buf() It is especially important when some data are blocked in the RX buf and the channel buffer is already full. In such case, instead of exiting the function directly, we need to set right flags on the conn_stream. CS_FL_RCV_MORE and CS_FL_WANT_ROOM must be set, otherwise, the stream-interface will subscribe to receive events, thinking it is not blocked. This bug leads to connection freeze when everything was received with some data blocked in the RX buf and a channel full. This patch must be backported to 1.9.	2019-02-18 16:25:06 +01:00
Willy Tarreau	053c15750b	BUG/MEDIUM: mux-h2: always set :authority on request output PiBa-NL reported that some servers don't fall back to the Host header when :authority is absent. After studying all the combinations of Host and :authority, it appears that we always have to send the latter, hence we never need the former. In case of CONNECT method, the authority is retrieved from the URI part, otherwise it's extracted from the Host field. The tricky part is that we have to scan all headers for the Host header before dumping other headers. This is due to the fact that we must emit pseudo headers before other ones. One improvement could possibly be made later in the request parser to search and emit the Host header immediately if authority was not found. This would cost nothing on the vast marjority of requests and make the lookup faster on output since Host would appear first. This fix must be backported to 1.9.	2019-02-01 16:47:46 +01:00
Willy Tarreau	5be92ff23f	BUG/MEDIUM: mux-h2: always omit :scheme and :path for the CONNECT method This is mandated by RFC7540 #8.3, these pseudo-headers must not be emitted with the CONNECT method. This must be backported to 1.9.	2019-02-01 16:47:46 +01:00
Olivier Houchard	9c9da5ee89	MINOR: muxes: Don't bother to LIST_DEL(&conn->list) before calling conn_free(). conn_free() already removes the connection from any idle list, so there's no need to do it in the mux code, just before calling conn_free().	2019-01-31 19:38:25 +01:00
Willy Tarreau	8694978892	BUG/MEDIUM: mux-h2: properly consider the peer's advertised max-concurrent-streams Till now we used to only rely on tune.h2.max-concurrent-streams but if a peer advertises a lower limit this can cause streams to be reset or even the conection to be killed. Let's respect the peer's value for outgoing streams. This patch should be backported to 1.9, though it depends on the following ones : BUG/MINOR: server: fix logic flaw in idle connection list management MINOR: mux-h2: max-concurrent-streams should be unsigned MINOR: mux-h2: make sure to only check concurrency limit on the frontend MINOR: mux-h2: learn and store the peer's advertised MAX_CONCURRENT_STREAMS setting	2019-01-31 19:38:25 +01:00
Willy Tarreau	2e2083ae5b	MINOR: mux-h2: learn and store the peer's advertised MAX_CONCURRENT_STREAMS setting We used not to take it into account because we only used the configured parameter everywhere. This patch makes sure we can actually learn the value advertised by the peer. We still enforce our own limit on top of it however, to make sure we can actually limit resources or stream concurrency in case of suboptimal server settings.	2019-01-31 19:38:25 +01:00
Willy Tarreau	fa1d357f05	MINOR: mux-h2: make sure to only check concurrency limit on the frontend h2_has_too_many_cs() was renamed to h2_frt_has_too_many_cs() to make it clear it's only used to throttle the frontend connection, and the call places were adjusted to only call this code on a front connection. In practice it was already the case since the H2_CF_DEM_TOOMANY flag is only set there. But now the ambiguity is removed.	2019-01-31 19:38:25 +01:00
Willy Tarreau	5a490b669e	MINOR: mux-h2: max-concurrent-streams should be unsigned We compare it to other unsigned values, let's make it unsigned as well.	2019-01-31 19:38:25 +01:00
Willy Tarreau	00f18a36b6	BUG/MINOR: server: fix logic flaw in idle connection list management With variable connection limits, it's not possible to accurately determine whether the mux is still in use by comparing usage and max to be equal due to the fact that one determines the capacity and the other one takes care of the context. This can cause some connections to be dropped before they reach their stream ID limit. It seems it could also cause some connections to be terminated with streams still alive if the limit was reduced to match the newly computed avail_streams() value, though this cannot yet happen with existing muxes. Instead let's switch to usage reports and simply check whether connections are both unused and available before adding them to the idle list. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Willy Tarreau	180590409f	BUG/MEDIUM: mux-h2: do not close the connection on aborted streams We used to rely on a hint that a shutw() or shutr() without data is an indication that the upper layer had performed a tcp-request content reject and really wanted to kill the connection, but sadly there is another situation where this happens, which is failed keep-alive request to a server. In this case the upper layer stream silently closes to let the client retry. In our case this had the side effect of killing all the connection. Instead of relying on such hints, let's address the problem differently and rely on information passed by the upper layers about the intent to kill the connection. During shutr/shutw, this is detected because the flag CS_FL_KILL_CONN is set on the connstream. Then only in this case we send a GOAWAY(ENHANCE_YOUR_CALM), otherwise we only send the reset. This makes sure that failed backend requests only fail frontend requests and not the whole connections anymore. This fix relies on the two previous patches adding SI_FL_KILL_CONN and CS_FL_KILL_CONN as well as the fix for the connection close, and it must be backported to 1.9 and 1.8, though the code in 1.8 could slightly differ (cs is always valid) : BUG/MEDIUM: mux-h2: wait for the mux buffer to be empty before closing the connection MINOR: stream-int: add a new flag to mention that we want the connection to be killed MINOR: connstream: have a new flag CS_FL_KILL_CONN to kill a connection	2019-01-31 19:38:25 +01:00
Willy Tarreau	4dbda620f2	BUG/MEDIUM: mux-h2: wait for the mux buffer to be empty before closing the connection When finishing to respond on a stream, a shutw() is called (resulting in either an end of stream or RST), then h2_detach() is called, and may decide to kill the connection is a number of conditions are satisfied. Actually one of these conditions is that a GOAWAY frame was already sent or attempted to be sent. This one is wrong, because it can happen in at least these two situations : - a shutw() sends a GOAWAY to obey tcp-request content reject - a graceful shutdown is pending In both cases, the connection will be aborted with the mux buffer holding some data. In case of a strong abort the client will not see the GOAWAY or RST and might want to try again, which is counter-productive. In case of the graceful shutdown, it could result in truncated data. It looks like a valid candidate for the issue reported here : https://www.mail-archive.com/haproxy@formilux.org/msg32433.html A backport to 1.9 and 1.8 is necessary.	2019-01-31 19:38:25 +01:00
Willy Tarreau	a9b7796862	MINOR: mux-h2: consistently rely on the htx variable to detect the mode In h2_frt_transfer_data(), we support both HTX and legacy modes. The HTX mode is detected from the proxy option and sets a valid pointer into the htx variable. Better rely on this variable in all the function rather than testing the option again. This way the code is clearer and even the compiler knows this pointer is valid when it's dereferenced. This should be backported to 1.9 if the b_is_null() patch is backported.	2019-01-31 08:07:17 +01:00
Willy Tarreau	1f035507af	BUG/MINOR: mux-h2: make sure request trailers on aborted streams don't break the connection We used to respond a connection error in case we received a trailers frame on a closed stream, but it's a problem to do this if the error was caused by a reset because the sender has not yet received it and is just a victim of the timing. Thus we must not close the connection in this case. This patch may be backported to 1.9 but then it requires the following previous ones : MINOR: h2: add a generic frame checker MEDIUM: mux-h2: check the frame validity before considering the stream state CLEANUP: mux-h2: remove stream ID and frame length checks from the frame parsers	2019-01-30 19:37:20 +01:00
Willy Tarreau	b860c73756	CLEANUP: mux-h2: remove stream ID and frame length checks from the frame parsers It's not convenient to have such structural checks mixed with the ones related to the stream state. Let's remove all these basic tests that are already covered once for all when reading the frame header.	2019-01-30 19:37:20 +01:00
Willy Tarreau	54f46e53dd	MEDIUM: mux-h2: check the frame validity before considering the stream state There are some uneasy situation where it's difficult to validate a frame's format without being in an appropriate state. This patch makes sure that each frame passes through h2_frame_check() before being checked in the context of the stream's state. This makes sure we can always return a GOAWAY for protocol violations even if we can't process the frame.	2019-01-30 19:37:20 +01:00
Willy Tarreau	08bb1d6109	BUG/MINOR: mux-h2: make sure response HEADERS are not received in other states than OPEN and HLOC RFC7540#5.1 states that these are the only states allowing any frame type. For response HEADERS frames, we cannot accept that they are delivered on idle streams of course, so we're left with these two states only. It is important to test this so that we can remove the generic CLOSE_STREAM test for such frames in the main loop. This must be backported to 1.9 (1.8 doesn't have response HEADERS).	2019-01-30 19:37:14 +01:00
Willy Tarreau	8d9ac3ed8b	BUG/MEDIUM: mux-h2: do not abort HEADERS frame before decoding them If a response HEADERS frame arrives on a closed connection (due to a client abort sending an RST_STREAM), it's currently immediately rejected with an RST_STREAM, like any other frame. This is incorrect, as HEADERS frames must first be decoded to keep the HPACK decoder synchronized, possibly breaking subsequent responses. This patch excludes HEADERS/CONTINUATION/PUSH_PROMISE frames from the central closed state test and leaves to the respective frame parsers the responsibility to decode the frame then send RST_STREAM. This fix must be backported to 1.9. 1.8 is not directly impacted since it doesn't have response HEADERS nor trailers thus cannot recover from such situations anyway.	2019-01-30 19:36:21 +01:00
Willy Tarreau	24ff1f8341	BUG/MEDIUM: mux-h2: make sure never to send GOAWAY on too old streams The H2 spec requires to send GOAWAY when the client sends a frame after it has already closed using END_STREAM. Here the corresponding case was the fallback of a series of tests on the stream state, but it unfortunately also catches old closed streams which we don't know anymore. Thus any late packet after we've sent an RST_STREAM will trigger this GOAWAY and break other streams on the connection. This can happen when launching two tabs in a browser targetting the same slow page through an H2-to-H2 proxy, and pressing Escape to stop one of them. The other one gets an error when the page finally responds (and it generally retries), and the logs in the middle indicate SD-- flags since the late response was cancelled. This patch takes care to only send GOAWAY on streams we still know. It must be backported to 1.9 and 1.8.	2019-01-30 19:35:42 +01:00
Willy Tarreau	fc10f599cc	BUG/MEDIUM: mux-h2: fix two half-closed to closed transitions When receiving a HEADERS or DATA frame with END_STREAM set, we would inconditionally switch to half-closed(remote). This is wrong because we could already have been in half-closed(local) and need to switch to closed. This happens in the following situations : - receipt of the end of a client upload after we've already responded (e.g. redirects to POST requests) - receipt of a response on the backend side after we've already finished sending the request (most common case). This may possibly have caused some streams to stay longer than needed at the end of a transfer, though this is not apparent in tests. This must be backported to 1.9 and 1.8.	2019-01-30 19:34:40 +01:00
Willy Tarreau	b1c9edc579	BUG/MEDIUM: mux-h2: wake up flow-controlled streams on initial window update When a settings frame updates the initial window, all affected streams's window is updated as well. However the streams are not put back into the send list if they were already blocked on flow control. The effect is that such a stream will only be woken up by a WINDOW_UPDATE message but not by a SETTINGS changing the initial window size. This can be verified with h2spec's test http2/6.9.2/1 which occasionally fails without this patch. It is unclear whether this situation is really met in field, but the fix is trivial, it consists in adding each unblocked streams to the wait list as is done for the window updates. This fix must be backported to 1.9. For 1.8 the patch needs quite a few adaptations. It's better to copy-paste the code block from h2c_handle_window_update() adding the stream to the send_list when its mws is > 0.	2019-01-30 16:21:39 +01:00
Willy Tarreau	6432dc8783	CLEANUP: mux-h2: remove misleading leftover test on h2s' nullity The WINDOW_UPDATE and DATA frame handlers used to still have a check on h2s to return either h2s_error() or h2c_error(). This is a leftover from the early code. The h2s cannot be null there anymore as it has already been dereferenced before reaching these locations.	2019-01-30 15:45:02 +01:00
Olivier Houchard	2b09443e04	BUG/MEDIUM: h2: In h2_send(), stop the loop if we failed to alloc a buf. In h2_send(), make sure we break the loop if we failed to alloc a buffer, or we'd end up looping endlessly. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Willy Tarreau	f1e6fa35de	CLEANUP: mux-h2: remove two useless but misleading assignments h2c->st0 was assigned to H2_CS_ERROR right after returning from h2c_error(), which had already done it. It's useless and confusing, let's remove this.	2019-01-29 18:51:41 +01:00
Willy Tarreau	3ad5d31bdf	BUG/MEDIUM: mux-h2: only close connection on request frames on closed streams A subtle bug was introduced with H2 on the backend. RFC7540 states that an attempt to create a stream on an ID not higher than the max known is a connection error. This was translated into rejecting HEADERS frames for closed streams. But with H2 on the backend, if the client aborts and causes an RST_STREAM to be emitted, the stream is effectively closed, and if/once the server responds, it starts by emitting a HEADERS frame with this ID thus it is interpreted as a connection error. This test must of course consider the side the mux is installed on and not take this for a connection error on responses. The effect is that an aborted stream on an outgoing H2 connection, for example due to a client stopping a transfer with option abortonclose set, would lead to an abort of all other streams. In the logs, this appears as one or several CD-- line(s) followed by one or several SD-- lines which are victims. Thanks to Luke Seelenbinder for reporting this problem and providing enough elements to help understanding how to reproduce it. This fix must be backported to 1.9.	2019-01-29 18:49:27 +01:00
Willy Tarreau	6afec46ba3	BUG/MINOR: mux-h2: do not report available outgoing streams after GOAWAY The calculation of available outgoing H2 streams was improved by commit `d64a3ebe6` ("BUG/MINOR: mux-h2: always check the stream ID limit in h2_avail_streams()"), but it still is incorrect because RFC7540#6.8 specifically forbids the creation of new streams after a GOAWAY frame was received. Thus we must not mark the connection as available anymore in order to be able to handle a graceful shutdown. This needs to be backported to 1.9.	2019-01-28 06:44:53 +01:00
Tim Duesterhus	4707033932	CLEANUP: h2: Remove debug printf in mux_h2.c It was introduced by `1915ca2738` and should be backported to 1.9.	2019-01-25 05:22:07 +01:00
Willy Tarreau	1915ca2738	BUG/MINOR: mux-h2: always compare content-length to the sum of DATA frames This is mandated by RFC7541#8.1.2.6. Till now we didn't have a copy of the content-length header field. But now that it's already parsed, it's easy to add the check. The reg-test was updated to match the new behaviour as the previous one expected unadvertised data to be silently discarded. This should be backported to 1.9 along with previous patch (MEDIUM: h2: always parse and deduplicate the content-length header) after it has got a bit more exposure.	2019-01-24 19:45:27 +01:00
Willy Tarreau	4790f7c907	MEDIUM: h2: always parse and deduplicate the content-length header The header used to be parsed only in HTX but not in legacy. And even in HTX mode, the value was dropped. Let's always parse it and report the parsed value back so that we'll be able to store it in the streams.	2019-01-24 19:07:26 +01:00
Willy Tarreau	e9634bdc22	MINOR: mux-h2: always consider a server's max-reuse parameter This parameter allows to limit the number of successive requests sent on a connection. Let's compare it to the number of streams already sent on the connection to decide if the connection may still appear in the idle list or not. This may be used to help certain servers work around resource leaks, and also helps dealing with the issue of the GOAWAY in flight which requires to set a usage limit on the client to be reliable. This must be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	a80dca8535	BUG/MINOR: mux-h2: refuse to allocate a stream with too high an ID One of the reasons for the excessive number of aborted requests when a server sets a limit on the highest stream ID is that we don't check this limit while allocating a new stream. This patch does this at two locations : - when a backend stream is allocated, we verify that there are still IDs left ; - when the ID is assigned, we verify that it's not higher than the advertised limit. This should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	d64a3ebe64	BUG/MINOR: mux-h2: always check the stream ID limit in h2_avail_streams() This function is used to decide whether to put an idle connection back into the idle pool. While it considers the limit in number of concurrent requests, it does not consider the limit in number of streams, so if a server announces a low limit in a GOAWAY frame, it will be ignored. However there is a caveat : since we assign the stream IDs when sending them, we have a number of allocated streams which max_id doesn't take care of. This can be addressed by adding a new nb_reserved count on each connection to keep track of the ID-less streams. This patch makes sure we take care of the remaining number of streams if such a limit was announced, or of the number of streams before the highest ID. Now it is possible to accurately know how many streams can be allocated, and the number of failed outgoing streams has dropped in half. This must be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	175cebb38a	BUG/MINOR: mux-h2: make it possible to set the error code on an already closed stream When sending RST_STREAM in response to a frame delivered on an already closed stream, we used not to be able to update the error code and deliver an RST_STREAM with a wrong code (e.g. H2_ERR_CANCEL). Let's always allow to update the code so that RST_STREAM is always sent with the appropriate error code (most often H2_ERR_STREAM_CLOSED). This should be backported to 1.9 and possibly to 1.8.	2019-01-24 15:27:06 +01:00
Willy Tarreau	5b4eae33de	BUG/MINOR: mux-h2: headers-type frames in HREM are always a connection error There are incompatible MUST statements in the HTTP/2 specification. Some require a stream error and others a connection error for the same situation. As discussed in the thread below, let's always apply the connection error when relevant (headers-like frame in half-closed(remote)) : https://mailarchive.ietf.org/arch/msg/httpbisa/pOIWRBRBdQrw5TDHODZXp8iblcE This must be backported to 1.9, possibly to 1.8 as well.	2019-01-24 15:27:06 +01:00
Willy Tarreau	113c7a2794	BUG/MINOR: mux-h2: CONTINUATION in closed state must always return GOAWAY Since we now support CONTINUATION frames, we must take care of properly aborting the connection when they are sent on a closed stream. By default we'd get a stream error which is not sufficient since the compression context is modified and unrecoverable. More info in this discussion : https://mailarchive.ietf.org/arch/msg/httpbisa/azZ1jiOkvM3xrpH4jX-Q72KoH00 This needs to be backported to 1.9 and possibly to 1.8 (less important there).	2019-01-24 15:27:06 +01:00
Willy Tarreau	31e846a071	BUG/MEDIUM: mux-h2: properly abort on trailers decoding errors There was an incomplete test in h2c_frt_handle_headers() resulting in negative return values from h2c_decode_headers() not being taken as errors. The effect is that the stream is then aborted on timeout only. This fix must be backported to 1.9.	2019-01-24 15:27:06 +01:00
Willy Tarreau	759ca1eacc	BUG/MAJOR: mux-h2: don't destroy the stream on failed allocation in h2_snd_buf() In case we cannot allocate a stream ID for an outgoing stream, the stream will be aborted. The problem is that we also release it and it will be destroyed again by the application detecting the error, leading to a NULL dereference in h2_shutr() and h2_shutw(). Let's only mark the error on the CS and let the rest of the code handle the close. This should be backported to 1.9.	2019-01-24 13:52:10 +01:00
Christopher Faulet	a413e958fd	BUG/MEDIUM: mux-h2/htx: Respect the channel's reserve When data are pushed in the channel's buffer, in h2_rcv_buf(), the mux-h2 must respect the reserve if the flag CO_RFL_KEEP_RSV is set. In HTX, because the stream-interface always sees the buffer as full, there is no other way to know the reserve must be respected. This patch must be backported to 1.9.	2019-01-23 11:27:34 +01:00
Willy Tarreau	a01f45e3ce	BUG/CRITICAL: mux-h2: re-check the frame length when PRIORITY is used Tim D�sterhus reported a possible crash in the H2 HEADERS frame decoder when the PRIORITY flag is present. A check is missing to ensure the 5 extra bytes needed with this flag are actually part of the frame. As per RFC7540#4.2, let's return a connection error with code FRAME_SIZE_ERROR. Many thanks to Tim for responsibly reporting this issue with a working config and reproducer. This issue was assigned CVE-2018-20615. This fix must be backported to 1.9 and 1.8.	2019-01-08 13:20:59 +01:00
Willy Tarreau	1bb812fd80	MEDIUM: mux-h2: emit HEADERS frames when facing HTX trailers blocks Now the H2 mux will parse and encode the HTX trailers blocks and send the corresponding HEADERS frame. Since these blocks contain pure H1 trailers which may be fragmented on line boundaries, if first needs to collect all of them, parse them using the H1 parser, build a list and finally encode all of them at once once the EOM is met. Note that this HEADERS frame always carries the end-of-headers and end-of-stream flags. This was tested using the helloworld examples from the grpc project, as well as with the h2c tools. It doesn't seem possible at the moment to test tailers using varnishtest though.	2019-01-04 10:56:26 +01:00
Willy Tarreau	7eeb10a5b5	MINOR: mux-h2: make HTX_BLK_EOM processing idempotent We want to make sure we won't emit another empty DATA frame if we meet HTX_BLK_EOM after and end of stream was already sent. For now it cannot happen as far as HTX is respected, but with trailers it may become ambiguous.	2019-01-04 09:28:17 +01:00
Willy Tarreau	5255f283f6	MEDIUM: mux-h2: pass trailers to HTX When receiving an H2 message in HTX mode, trailers present in chunked messages are now properly appended to the HTX block.	2019-01-03 18:45:38 +01:00
Willy Tarreau	e2b05ccff5	MEDIUM: mux-h2: pass trailers to H1 (legacy mode) When forwarding an H2 request to an H1 server, if the request doesn't have a content-length header field, it is chunked. In this case it is possible to send trailers to the server, which is what this patch does. If the transfer is performed without chunking, then the trailers are silently discarded.	2019-01-03 18:45:38 +01:00
Willy Tarreau	88d138ef6d	BUG/MEDIUM: mux-h2: decode trailers in HEADERS frames This is not exactly a bug but a long-time design limitation. We used not to decode trailers in H2, resulting in broken connections each time a trailer was sent, since it was impossible to keep the HPACK decompressor synchronized. Now that the sequencing of operations permits it, we must make sure to at least properly decode them. What we try to do is to identify if a HEADERS frame was already seen and use this indication to know if it's a headers or a trailers. For this, h2c_decode_headers() checks if the stream indicates that a HEADERS frame was already received. If so, it decodes it and emits the trailing 0 CRLF CRLF in case of H1, or the HTX_EOD + HTX_EOM blocks in case of HTX, to terminate the data stream. The trailers contents are still deleted for now but the request works, and the connection remains synchronized and usable for subsequent streams. The correctness may be tested using a simple config and h2spec : h2spec -o 1000 -v -t -S -k -h 127.0.0.1 -p 4443 generic/4/4 This should definitely be backported to 1.9 given the low impact for the benefit. However it cannot be backported to 1.8 since the operations cannot be resumed. The following patches are also needed with this one : MINOR: mux-h2: make h2c_decode_headers() return a status, not a count MINOR: mux-h2: add a new dummy stream : h2_error_stream MEDIUM: mux-h2: make h2c_decode_headers() support recoverable errors BUG/MINOR: mux-h2: detect when the HTX EOM block cannot be added after headers MINOR: mux-h2: check for too many streams only for idle streams MINOR: mux-h2: set H2_SF_HEADERS_RCVD when a HEADERS frame was decoded	2019-01-03 18:45:38 +01:00
Willy Tarreau	6cc85a5abb	MINOR: mux-h2: set H2_SF_HEADERS_RCVD when a HEADERS frame was decoded Doing this will be needed to be able to tell the difference between a headers block and a trailers block.	2019-01-03 18:45:38 +01:00
Willy Tarreau	415b1ee18b	MINOR: mux-h2: check for too many streams only for idle streams The HEADERS frame parser checks if we still have too many streams, but this should only be done for idle streams, otherwise it would prevent us from processing trailer frames.	2019-01-03 18:45:38 +01:00
Willy Tarreau	b8c4dd3320	CLEANUP: mux-h2: clean the stream error path on HEADERS frame processing In h2c_frt_handle_headers() and h2c_bck_handle_headers() we have an unused error path made of the strm_err label, while send_rst is used to emit an RST upon stream error after forcing the stream to h2_refused_stream. Let's remove this unused strm_err block now.	2019-01-03 18:45:38 +01:00
Willy Tarreau	3a429f04cb	MINOR: mux-h2: remove a misleading and impossible test In h2c_frt_handle_headers(), we test the stream for SS_ERROR just after setting it to SS_OPEN, this makes no sense and creates confusion in the error path. Remove this misleading test.	2019-01-03 18:45:38 +01:00
Willy Tarreau	b30d0f914e	BUG/MINOR: mux-h2: detect when the HTX EOM block cannot be added after headers In case we receive a very large HEADERS frame which doesn't leave enough room to place the EOM block after the decoded headers, we must fail the stream. This test was missing, resulting in the loss of the EOM, possibly leaving the stream waiting for a time-out. Note that we also clear h2c->dfl here so that we don't attempt to clear it twice when going back to the demux. If this is backported to 1.9, it also requires that the following patches are backported as well : MINOR: mux-h2: make h2c_decode_headers() return a status, not a count MINOR: mux-h2: add a new dummy stream : h2_error_stream MEDIUM: mux-h2: make h2c_decode_headers() support recoverable errors	2019-01-03 18:45:38 +01:00
Willy Tarreau	259192370f	MEDIUM: mux-h2: make h2c_decode_headers() support recoverable errors When a decoding error is recoverable, we should emit a stream error and not a connection error. This patch does this by carefully checking the connection state before deciding to send a connection error. If only the stream is in error, an RST_STREAM is sent.	2019-01-03 18:45:38 +01:00
Willy Tarreau	ecb9dcdf93	MINOR: mux-h2: add a new dummy stream : h2_error_stream This dummy stream will be used to send stream errors that must not be retried, such as undecodable headers frames.	2019-01-03 18:45:38 +01:00
Willy Tarreau	86277d4453	MINOR: mux-h2: make h2c_decode_headers() return a status, not a count This function used to return a byte count for the output produced, or zero on failure. Not only this value is not used differently than a boolean, but it prevents us from returning stream errors when a frame cannot be extracted because it's too large, or from parsing a frame and producing nothing on output. This patch modifies its API to return <0 on errors, 0 on inability to proceed, or >0 on success, irrelevant to the amount of output data.	2019-01-03 18:45:38 +01:00
Willy Tarreau	8319593005	BUG/MINOR: mux-h2: only update rxbuf's length for H1 headers In h2c_decode_headers() we update the buffer's length according to the amount of data produced (outlen). But in case of HTX this outlen value is not a quantity, just an indicator of success, resulting in the buffer being added one extra byte and temporarily showing .data > .size, which is wrong. Fortunately this is overridden when leaving the function by htx_to_buf() so the impact only exists in step-by-step debugging, but it definitely needs to be fixed. This must be backported to 1.9.	2019-01-03 10:30:10 +01:00
Willy Tarreau	45ffc0ca34	BUG/MINOR: mux-h2: mark end-of-stream after processing response HEADERS, not before When dealing with a server's H2 response, we used to set the end-of-stream flag on the conn_stream and the stream before parsing the response, which is incorrect since we can fail to process this response by lack of room, buffer or anything. The extend of this problem is still limited to a few rare cases, but with trailers it will cause a systematic failure. This fix must be backported to 1.9.	2019-01-03 09:34:19 +01:00
Willy Tarreau	c1fc95f850	BUG/MINOR: mux-h2: don't check the CS count in h2c_bck_handle_headers() This function handles response HEADERS frames, it is not responsible for creating new streams thus it must not check if we've reached the stream count limit, otherwise it could lead to some undesired pauses which bring no benefit. This must be backported to 1.9.	2019-01-03 09:28:59 +01:00
Willy Tarreau	8dbb1705fd	BUG/MINOR: mux-h2: set the stream-full flag when leaving h2c_decode_headers() If we exit this function because some data are pending in the rxbuf, we currently don't indicate any blocking flag, which will prevent the operation from being attempted again. Let's set H2_CF_DEM_SFULL in this case to indicate there's not enough room in the stream buffer so that the operation may be attempted again once we make room. It seems that this issue cannot be triggered right now but it definitely will with trailers. This fix should be backported to 1.9 for completeness.	2019-01-03 09:28:59 +01:00
Willy Tarreau	872e2fac39	BUG/MEDIUM: mux-h2: always restart reading if data are available h2c_restart_reading() is used at various place to resume processing of demux data, but this one refrains from doing so if the mux is already subscribed for receiving. It just happens that even if some incoming frame processing is interrupted, the mux is always subscribed for receiving, so this condition alone is not enough, it must be combined with the fact that the demux buffer is empty, otherwise some resume events are lost. This typically happens when we refrain from processing some incoming data due to missing room in the stream's rxbuf, and want to resume in h2c_rcv_buf(). It will become even more visible with trailers since these ones want to have an empty rxbuf before proceeding. This must be backported to 1.9.	2019-01-03 09:28:59 +01:00
Willy Tarreau	880f580492	CLEANUP: mux-h2: fix end-of-stream flag name when processing headers In h2c_decode_headers() we mistakenly check for H2_F_DATA_END_STREAM while we should check for H2_F_HEADERS_END_STREAM. Both have the same value (1) but better stick to the correct flag.	2019-01-03 08:12:54 +01:00
Olivier Houchard	351411facd	BUG/MAJOR: sessions: Use an unlimited number of servers for the conn list. When a session adds a connection to its connection list, we used to remove connections for an another server if there were not enough room for our server. This can't work, because those lists are now the list of connections we're responsible for, not just the idle connections. To fix this, allow for an unlimited number of servers, instead of using an array, we're now using a linked list.	2018-12-28 16:33:13 +01:00
Olivier Houchard	855ac25d82	BUG/MEDIUM: mux_h2: Don't add to the idle list if we're full. In h2_detach(), don't add the connection to the idle list if nb_streams is at the max. This can happen if we already closed that stream before, so its slot became available and was used by another stream. This should be backported to 1.9.	2018-12-28 15:48:52 +01:00
Willy Tarreau	48507ef558	CLEANUP: mux-h2: remove misleading comments about CONTINUATION These ones were left-over from copy-pastes that are unrelated to CONTINUATION frames.	2018-12-24 11:45:00 +01:00
Willy Tarreau	ea18f86364	MEDIUM: mux-h2: handle decoding of CONTINUATION frames Now that the HEADERS frame decoding is retryable, we can safely try to fold CONTINUATION frames into a HEADERS frame when the END_OF_HEADERS flag is missing. In order to do this, h2c_decode_headers() moves the frames payloads in-situ and leaves a hole that is plugged when leaving the function. There is no limit to the number of CONTINUATION frames handled this way provided that all of them fit into the buffer. The error reported when meeting isolated CONTINUATION frames has now changed from INTERNAL_ERROR to PROTOCOL_ERROR. Now there is only one (unrelated) remaining failure in h2spec.	2018-12-24 11:45:00 +01:00
Willy Tarreau	a4428bd531	MINOR: mux-h2: make h2_peek_frame_hdr() support an offset This function will be used to parse multiple subsequent frames so it needs to support an offset.	2018-12-24 11:45:00 +01:00
Willy Tarreau	96a10c24cf	MINOR: mux-h2: fail stream creation more cleanly using RST_STREAM The H2 demux only checks for too many streams in h2c_frt_stream_new(), then refuses to create a new stream and causes the connection to be aborted by sending a GOAWAY frame. This will also happen if any error happens during the stream creation (e.g. memory allocation). RFC7540#5.1.2 says that attempts to create streams in excess should instead be dealt with using an RST_STREAM frame conveying either the PROTOCOL_ERROR or REFUSED_STREAM reason (the latter being usable only if it is guaranteed that the stream was not processed). In theory it should not happen for well behaving clients, though it may if we configure a low enough h2.max_concurrent_streams limit. This error however may definitely happen on memory shortage. Previously it was not possible to use RST_STREAM due to the fact that the HPACK decompressor would be desynchronized. But now we first decode and only then try to allocate the stream, so the decompressor remains synchronized regardless of policy or resources issues. With this patch we enforce stream termination with RST_STREAM and REFUSED_STREAM if this protocol violation happens, as well as if there is a temporary condition like a memory allocation issue. It will allow a client to recover cleanly. This could possibly be backported to 1.9. Note that this requires that these five previous patches are merged as well : MINOR: h2: add a bit-based frame type representation MEDIUM: mux-h2: remove padlen during headers phase MEDIUM: mux-h2: decode HEADERS frames before allocating the stream MINOR: mux-h2: make h2c_send_rst_stream() use the dummy stream's error code MINOR: mux-h2: add a new dummy stream for the REFUSED_STREAM error code	2018-12-24 11:45:00 +01:00
Willy Tarreau	8d0d58bf6a	MINOR: mux-h2: add a new dummy stream for the REFUSED_STREAM error code This patch introduces a new dummy stream, h2_refused_stream, in CLOSED status with the aforementioned error code. It will be usable to reject unexpected extraneous streams.	2018-12-24 11:45:00 +01:00
Willy Tarreau	e6888fff75	MINOR: mux-h2: make h2c_send_rst_stream() use the dummy stream's error code We currently have 2 dummy streams allowing us to send an RST_STREAM message with an error code matching this one. However h2c_send_rst_stream() still enforces the STREAM_CLOSED error code for these dummy streams, ignoring their respective errcode fields which however are properly set. Let's make the function always use the stream's error code. This will allow to create other dummy streams for different codes.	2018-12-24 11:45:00 +01:00
Willy Tarreau	5c8cafae39	MEDIUM: mux-h2: decode HEADERS frames before allocating the stream It's hard to recover from a HEADERS frame decoding error after having already created the stream, and it's not possible to recover from a stream allocation error without dropping the connection since we can't maintain the HPACK context, so let's decode it before allocating the stream, into a temporary buffer that will then be offered to the newly created stream.	2018-12-24 11:45:00 +01:00
Willy Tarreau	6fa380dbba	MINOR: mux-h2: remove useless check for empty frame length in h2s_decode_headers() This test for an empty frame was already performed in the callers, there is no need for checking it again.	2018-12-24 11:45:00 +01:00
Willy Tarreau	3bf6918cb2	MEDIUM: mux-h2: remove padlen during headers phase Three types of frames may be padded : DATA, HEADERS and PUSH_PROMISE. Currently, each of these independently deals with padding and needs to wait for and skip the initial padlen byte. Not only this complicates frame processing, but it makes it very hard to process CONTINUATION frames after a padded HEADERS frame, and makes it complicated to perform atomic calls to h2s_decode_headers(), which are needed if we want to be able to maintain the HPACK decompressor's context even when dropping streams. This patch takes a different approach : the padding is checked when parsing the frame header, the padlen byte is waited for and parsed, and the dpl value is updated with this padlen value. This will allow the frame parsers to decide to overwrite the padding if needed when merging adjacent frames.	2018-12-24 11:45:00 +01:00
Willy Tarreau	a875466243	BUG/MEDIUM: mux-h2: mark that we have too many CS once we have more than the max Since commit `f210191` ("BUG/MEDIUM: h2: don't accept new streams if conn_streams are still in excess") we're refraining from reading input frames if we've reached the limit of number of CS. The problem is that it prevents such situations from working fine. The initial purpose was in fact to prevent from reading new HEADERS frames when this happens, and causes some occasional transfer hiccups and pauses with large concurrencies. Given that we now properly reject extraneous streams before checking this value, we can be sure never to have too many streams, and that any higher value is only caused by a scheduling reason and will go down after the scheduler calls the code. This fix must be backported to 1.9 and possibly to 1.8. It may be tested using h2spec this way with an h2spec config : while :; do h2spec -o 5 -v -t -S -k -h 127.0.0.1 -p 4443 http2/5.1.2 done	2018-12-24 08:13:16 +01:00
Willy Tarreau	c4ea04c2b6	BUG/MINOR: mux-h2: make empty HEADERS frame return a connection error We were returning a stream error of type PROTOCOL_ERROR on empty HEADERS frames, but RFC7540#4.2 stipulates that we should instead return a connection error of type FRAME_SIZE_ERROR. This may be backported to 1.9 and 1.8 though it's unlikely to have any real life effect.	2018-12-23 10:02:38 +01:00
Willy Tarreau	97aaa67658	MINOR: mux-h2: only increase the connection window with the first update Commit `dc57236` ("BUG/MINOR: mux-h2: advertise a larger connection window size") caused a WINDOW_UPDATE message to be sent early with the connection to increase the connection's window size. It turns out that it causes some minor trouble that need to be worked around : - varnishtest cannot transparently cope with the WU frames during the handshake, forcing all tests to explicitly declare the handshake sequence ; - some vtc scripts randomly fail if the WU frame is sent after another expected response frame, adding uncertainty to some tests ; - h2spec doesn't correctly identify these WU at the connection level that it believes are the responses to some purposely erroneous frames it sends, resulting in some errors being reported None of these are a problem with real clients but they add some confusion during troubleshooting. Since the fix above was intended to increase the upload bandwidth, we have another option which is to increase the window size with the first WU frame sent for the connection. This way, no WU frame is sent until one is really needed, and this first frame will adjust the window to the maximum value. It will make the window increase slightly later, so the client will experience the first round trip when uploading data, but this should not be perceptible, and is not worth the extra hassle needed to maintain our debugging abilities. As an extra bonus, a few extra bytes are saved for each connection until the first attempt to upload data. This should possibly be backported to 1.9 and 1.8.	2018-12-23 09:49:04 +01:00
Willy Tarreau	47b515a462	BUG/MEDIUM: mux-h2: don't needlessly wake up the demux on short frames In some situations, if too short a frame header is received, we may leave h2_process_demux() waking up the task again without checking that we were already subscribed. In order to avoid this once for all, let's introduce an h2_restart_reading() function which performs the control and calls the task up. This way we won't needlessly wake the task up if it's already waiting for I/O. Must be backported to 1.9.	2018-12-21 16:12:33 +01:00
Willy Tarreau	645b33d233	BUG/MEDIUM: mux-h2: Don't forget to quit the send list on error reports Similar to last fix, we need to quit the send list when reporting an error via the send side. This should be backported to 1.9.	2018-12-20 15:35:57 +01:00
Olivier Houchard	f29cd5c8a8	BUG/MEDIUM: h2: Don't forget to quit the sending_list if SUB_CALL_UNSUBSCRIBE. In mux_h2_unsubscribe, don't forget to leave the sending_list if SUB_CALL_UNSUBSCRIBE was set. SUB_CALL_UNSUBSCRIBE means we were about to be woken up for writing, unless the mux was too full to get more data. If there's an unsubscribe call in the meanwhile, we should leave the list, or we may be put back in the send_list. This should be backported to 1.9.	2018-12-20 12:24:43 +01:00
Olivier Houchard	6dea2ee939	BUG/MEDIUM: h2: Don't wait for flow control if the connection had a shutr. In h2_snd_buf(), if we couldn't send the data because of flow control, and the connection got a shutr, then add CS_FL_ERROR (or CS_FL_ERR_PENDING). We will never get any window update, so we will never be unlocked, anyway. No backport is needed.	2018-12-19 18:35:40 +01:00
Willy Tarreau	fde287cc76	BUG/MINOR: mux-h2: make sure we check the conn_stream in early data When dealing with early data we scan the list of stream to notify them. We're not supposed to have h2s->cs == NULL here but it doesn't cost much to make the scan more robust and verify it before notifying. No backport is needed.	2018-12-19 18:33:16 +01:00
Willy Tarreau	ec988c7a0f	CLEANUP: mux-h2: make use of cs_set_error() It's cleaner than open-coding the conditions and error bits.	2018-12-19 18:13:52 +01:00
Willy Tarreau	f830f018cf	BUG/MEDIUM: mux-h2: make use of h2s_alert() to report aborts If we had no pending read, it could be complicated to report an RST_STREAM to a sender since we used to only report it via the rx side if subscribed. Similarly in h2_wake_some_streams() we now try all methods, hoping to catch all possible events. No backport is needed.	2018-12-19 18:13:52 +01:00
Willy Tarreau	8b2757c339	MINOR: mux-h2: add a new function h2s_alert() to call the data layer In order to report an error to the data layer, we have different ways depending on the situation. At a lot of places it's open-coded and not always correct. Let's create a new function h2s_alert() to handle this task. It tries to wake on recv() first, then on send(), then using wake().	2018-12-19 18:13:48 +01:00
Willy Tarreau	7e094451d0	CLEANUP: mux-h2: implement h2s_notify_{send,recv} to report events to subscribers Till now we had to open-code all the manipulation of the wait_event, let's use standarized functions for this and reduce the risk of bugs.	2018-12-19 18:11:35 +01:00
Olivier Houchard	251064b02d	BUG/MEDIUM: h2: Make sure we don't set CS_FL_ERROR if there's still data. In the mux h2, make sure we set CS_FL_ERR_PENDING and wake the recv task, instead of setting CS_FL_ERROR, if CS_FL_EOS is not set, so if there's potentially still some data to be sent.	2018-12-19 17:28:54 +01:00
Olivier Houchard	9117780bfd	BUG/MEDIUM: mux-h2: pass CS_FL_ERR_PENDING to h2_wake_some_streams() Commiy 8519357c ("BUG/MEDIUM: mux-h2: report asynchronous errors in h2_wake_some_streams()") addressed an issue with synchronous errors but forgot to fix the call places to also pass CS_FL_ERR_PENDING instead of CS_FL_ERROR. No backport is needed.	2018-12-19 17:06:49 +01:00
Olivier Houchard	2f30883793	BUG/MEDIUM: H2: Make sure htx is set even on empty frames. When transfering data, make sure htx is set even on empty frames, or we will never add a HTX_BLK_EOM block.	2018-12-19 17:00:14 +01:00
Willy Tarreau	3d2ee55ebd	CLEANUP: connection: rename conn->mux_ctx to conn->ctx We most often store the mux context there but it can also be something else while setting up the connection. Better call it "ctx" and know that it's the owner's context than misleadingly call it mux_ctx and get caught doing suspicious tricks.	2018-12-19 14:13:07 +01:00
Willy Tarreau	4f6516d677	CLEANUP: connection: rename subscription events values and event field The SUB_CAN_SEND/SUB_CAN_RECV enum values have been confusing a few times, especially when checking them on reading. After some discussion, it appears that calling them SUB_RETRY_SEND/SUB_RETRY_RECV more accurately reflects their purpose since these events may only appear after a first attempt to perform the I/O operation has failed or was not completed. In addition the wait_reason field in struct wait_event which carries them makes one think that a single reason may happen at once while it is in fact a set of events. Since the struct is called wait_event it makes sense that this field is called "events" to indicate it's the list of events we're subscribed to. Last, the values for SUB_RETRY_RECV/SEND were swapped so that value 1 corresponds to recv and 2 to send, as is done almost everywhere else in the code an in the shutdown() call.	2018-12-19 14:09:21 +01:00
Willy Tarreau	567beb8a91	BUG/MEDIUM: mux-h2: make sure the demux also wakes streams up on errors Today the demux only wakes a stream up after receiving some contents, but not necessarily on close or error. Let's do it based on both error flags and both EOS flags. With a bit of refinement we should be able to only do it when the pending bits are there but not the static ones. No backport is needed.	2018-12-18 16:52:44 +01:00
Willy Tarreau	a8519357c5	BUG/MEDIUM: mux-h2: report asynchronous errors in h2_wake_some_streams() This function is called when dealing with a connection error or a GOAWAY frame. It used to report a synchronous error instead of an asycnhronous error, which can lead to data truncation since whatever is still available in the rxbuf will be ignored. Let's correctly use CS_FL_ERR_PENDING instead and only fall back to CS_FL_ERROR if CS_FL_EOS was already delivered. No backport is needed.	2018-12-18 16:46:24 +01:00
Willy Tarreau	7ecb6f10a4	BUG/MEDIUM: mux-h2: make sure to report synchronous errors after EOS If EOS has already been reported on the conn_stream, there won't be any read anymore to turn ERR_PENDING into ERROR, so we have to do report it directly. No backport is needed.	2018-12-18 16:46:19 +01:00
Willy Tarreau	3af3771bf3	BUG/MINOR: mux-h2: don't report a fantom h2s in "show fd" The h2s pointer was used to scan fctl lists prior to being used to scan the send list by ID, so it could appear non-null eventhough the list is empty, resulting in misleading information on empty connections. No backport is needed.	2018-12-18 14:34:41 +01:00
Willy Tarreau	987c0633fa	MINOR: mux-h2: report more h2c, last h2s and cs information on "show fd" Most of the time when we issue "show fd" to dump a mux's state, it's to figure why a transfer is frozen. Connection, stream and conn_stream states are critical there. And most of the time when this happens there is a single stream left in the H2 mux, so let's always dump the last known stream on show fd, as most of the time it will be the one of interest.	2018-12-18 11:03:11 +01:00
Willy Tarreau	cef5c8e2aa	BUG/MEDIUM: mux-h2: restart demuxing as soon as demux data are available Commit `7505f94f9` ("MEDIUM: h2: Don't use a wake() method anymore.") changed the conditions to restart demuxing so that this happens as soon as something is read. But similar to previous fix, at an end of stream we may be woken up with nothing to read but data still available in the demux buffer, so we must also use this as a valid condition for demuxing. No backport is needed, this is purely 1.9.	2018-12-18 11:03:11 +01:00
Willy Tarreau	c5b1004fbe	BUG/MEDIUM: mux-h2: also restart demuxing when data are pending in demux Commit `082f559d3` ("BUG/MEDIUM: h2: restart demuxing after releasing buffer space") tried to address a situation where transfers could stall after a read, but the condition was not completely covered : some stalls may still happen at end of stream because there's nothing anymore to receive and the last data lie in the demux buffer. Thus we must also consider this state as a valid condition to restart demuxing. No backport is needed.	2018-12-18 11:03:11 +01:00
Olivier Houchard	71748cb91b	BUG/MEDIUM: connection: Add a new CS_FL_ERR_PENDING flag to conn_streams. Add a new flag to conn_streams, CS_FL_ERR_PENDING. This is to be set instead of CS_FL_ERR in case there's still more data to be read, so that we read all the data before closing.	2018-12-17 21:54:14 +01:00
Olivier Houchard	ffda58b546	BUG/MEDIUM: h2: Don't destroy the h2s if it still has a cs attached. In h2_deferred_shut, if we're done sending the shutr/shutw, don't destroy the h2s if it still has a conn_stream attached, or the conn_stream may try to access it again.	2018-12-16 08:22:01 +01:00
Olivier Houchard	746fb772f1	MEDIUM: mux_h2: Always set CS_FL_NOT_FIRST for new conn_streams. When creating new conn_streams, always set the CS_FL_NOT_FIRST flag. We don't really care about being the first request for HTTP/2, this only really makes sense for HTTP/1, and that way we can reuse connections.	2018-12-15 23:50:11 +01:00
Olivier Houchard	a4d4fdfaa3	MEDIUM: sessions: Don't keep an infinite number of idling connections. In session, don't keep an infinite number of connection that can idle. Add a new frontend parameter, "max-session-srv-conns" to set a max number, with a default value of 5.	2018-12-15 23:50:10 +01:00
Olivier Houchard	f502aca5c2	MEDIUM: mux: provide the session to the init() and attach() method. Instead of trying to get the session from the connection, which is not always there, and of course there could be multiple sessions per connection, provide it with the init() and attach() methods, so that we know the session for each outgoing stream.	2018-12-15 23:50:09 +01:00
Olivier Houchard	8a78690229	MEDIUM: mux: Destroy the stream before trying to add the conn to the idle list. In the mux_h1 and mux_h2, move the test to see if we should add the connection in the idle list until after we destroyed the h1s/h2s, that way later we'll be able to check if the connection has no stream at all, and if it should be added to the server idling list.	2018-12-15 23:50:09 +01:00
Olivier Houchard	2c68a462e1	BUG/MEDIUM: h2: Don't forget to destroy the h2s after deferred shut. If we had to defer shutr/shutw, and we're now done, destroy the h2s, or nobody will do so.	2018-12-15 23:50:07 +01:00
Olivier Houchard	84cca66ea3	BUG/MEDIUM: htx: When performing zero-copy, start from the right offset. When using zerocopy, start from the beginning of the data, not from the beginning of the buffer, it may have contained headers, and so the data won't start at the beginning of the buffer.	2018-12-14 17:02:11 +01:00
Willy Tarreau	c0960d1185	MINOR: mux_h1/h2: simplify the zero-copy Rx alignment The transpory layer now respects buffer alignment, so we don't need to cheat anymore pretending we have some data at the head, adjusting the buffer's head is enough.	2018-12-14 10:59:15 +01:00
Willy Tarreau	e0f24ee149	MINOR: connection: realign empty buffers in muxes, not transport layers For a long time we've been realigning empty buffers in the transport layers, where the I/Os were performed based on callbacks. Doing so is optimal for higher data throughput but makes it trickier to optimize unaligned data, where mux_h1/h2 have to claim some data are present in the buffer to force unaligned accesses to skip the frame's header or the chunk header. We don't need to do this anymore since the I/O calls are now always performed from top to bottom, so it's only the mux's responsibility to realign an empty buffer if it wants to. In practice it doesn't change anything, it's just a convention, and it will allow the code to be simplified in a next patch.	2018-12-14 10:51:23 +01:00
Olivier Houchard	44d59146a6	MEDIUM: htx: Try to take a connection over if it has no owner. In the mux detach function, when using HTX, take the connection over if it no longer has an owner (ie because the session that was the owner left). It is done for legacy code in proto_http.c, but not for HTX. Also when using HTX, in H2, try to add the connection back to idle_conns if it was not already (ie we used to use all the available streams, and we're freeing one). That too was done in proto_http.c.	2018-12-13 18:54:27 +01:00
Willy Tarreau	2a59e87735	MINOR: mux-h2: force reads to be HTX-aligned in HTX mode H2 has a 9-byte frame header, and HTX has a 40-byte frame header. By artificially advancing the Rx header and limiting the amount of bytes read to protect the end of the buffer, we can make the data payload perfectly aligned with HTX blocks and optimize the copy.	2018-12-12 11:52:45 +01:00
Willy Tarreau	98de12a5d1	MEDIUM: mux-h2: implement true zero-copy send of large HTX DATA blocks This is similar to what was done for the H1 mux : when the mux's buffer is empty and the htx area contains exactly one data block of the same size as the requested count, and all window and frame size conditions are satisfied, then it's possible to simply swap the caller's buffer with the mux's output buffer and adjust offsets and length to match the entire DATA HTX block in the middle. An H2 frame header has to be prepended before the block but this always fits in an HTX frame header. In this case we perform a true zero-copy operation from end-to-end. This is the situation that happens all the time with large files. When using HTX over H2 over TLS, this brings a 3% extra performance gain. TLS remains a limiting factor here but the copy definitely has a cost. Also since haproxy can now use H2 in clear, the savings can be higher.	2018-12-12 11:52:45 +01:00
Willy Tarreau	06ae84a8ac	MINOR: mux-h2: avoid copying large blocks into full buffers Due to blocking factor being different on H1 and H2, we regularly end up with tails of data blocks that leave room in the mux buffer, making it tempting to copy the pending frame into the remaining room left, and possibly realigning the output buffer. Here we check if the output buffer contains data, and prefer to wait if either the current frame doesn't fit or if it's larger than 1/4 of the buffer. This way upon next call, either a zero copy, or a larger and aligned copy will be performed, taking the whole chunk at once. Doing so increases the H2 bandwidth by slightly more than 1% on large objects.	2018-12-12 11:52:45 +01:00
Willy Tarreau	dc572364c6	BUG/MINOR: mux-h2: advertise a larger connection window size By default H2 uses a 65535 bytes window for the connection, and changing it requires sending a WINDOW_UPDATE message. We only used to update the window when receiving data, thus never increasing it further. As reported by user klzgrad on the mailing list, this seriously limits the upload bitrate, and will have an even higher impact on the backend H2 connections to origin servers. There is no technical reason for keeping this window so low, so let's increase it to the maximum possible value (2G-1). We do this by pretending we've already received that many data minus the maximum data the client might already send (65535), so that an early WINDOW_UPDATE message is sent right after the SETTINGS frame. This should be backported to 1.8. This patch depends on previous patch "BUG/MINOR: mux-h2: refrain from muxing during the preface".	2018-12-12 09:23:41 +01:00
Willy Tarreau	75a930affb	BUG/MINOR: mux-h2: refrain from muxing during the preface The condition to refrain from processing the mux was insufficient as it would only handle the outgoing connections. In essence it is not that much of a problem since we don't have streams yet on an incoming connetion. But it prevents waiting for the end of the preface before sending an early WINDOW_UPDATE message, thus causing the connections to fail in this case. This must be backported to 1.8 with a few minor adaptations.	2018-12-12 09:23:41 +01:00
Willy Tarreau	afba57ae80	REORG: h1: merge types+proto into common/h1.h These two files are self-contained and do not depend on other layers, so let's remerge them together for easier manipulation.	2018-12-11 17:15:13 +01:00
Willy Tarreau	b96b77ed6e	REORG: htx: merge types+proto into common/htx.h All the HTX definition is self-contained and doesn't really depend on anything external since it's a mostly protocol. In addition, some external similar files (like h2) also placed in common used to rely on it, making it a bit awkward. This patch moves the two htx.h files into a single self-contained one. The historical dependency on sample.h could be also removed since it used to be there only for http_meth_t which is now in http.h.	2018-12-11 17:15:04 +01:00
Willy Tarreau	907998194b	MEDIUM: mux-h2: make use of hpack_encode_path() to encode the path The HTTP path encoding was open-coded with a HPACK byte matching the "/" or "/index.html" paths. Let's make use of the new functions to avoid this.	2018-12-11 09:07:02 +01:00
Willy Tarreau	7561bcbb36	MEDIUM: mux-h2: make use of hpack_encode_scheme() to encode the scheme The HTTP scheme encoding was open-coded with a HPACK byte matching the "https" scheme. Let's make use of the new functions to avoid this.	2018-12-11 09:07:02 +01:00
Willy Tarreau	bdabc3a25f	MEDIUM: mux-h2: make use of hpack_encode_method() to encode the method The HTTP method encoding was open-coded with raw HPACK bytes, which is not suitable there. Let's make use of the new functions to avoid this.	2018-12-11 09:07:02 +01:00
Willy Tarreau	aafdf58333	MEDIUM: mux-h2: make use of standard HPACK encoding functions for the status This way we don't open-code the HPACK status codes anymore in the H2 code. Special care was taken not to cause any slowdown as this code is very sensitive.	2018-12-11 09:07:02 +01:00
Olivier Houchard	56b0348ea7	BUG/MEDIUM: mux-h2: Don't forget to set the CS_FL_EOS flag with htx. When running with HTX, if we got an empty answer, don't forget to set CS_FL_EOS, or the stream will never be destroyed.	2018-12-10 20:53:31 +01:00
Willy Tarreau	ac77b6f441	BUG/MEDIUM: mux-h2: fix encoding of non-GET/POST methods Jerome reported that outgoing H2 failed for methods different from GET or POST. It turns out that the HPACK encoding is performed by hand in the outgoing headers encoding function and that the data length was not incremented to cover the literal method value, resulting in a corrupted HEADERS frame. Admittedly this code should move to the generic HPACK code. No backport is needed.	2018-12-10 11:08:04 +01:00
Willy Tarreau	e2778a43d4	BUILD: h2: mark the start line already checked to avoid warnings Gcc 7 warns about a potential null pointer deref that cannot happen since the start line block is guaranteed to be present in the functions where it's dereferenced. Let's mark it as already checked.	2018-12-08 15:31:57 +01:00
Olivier Houchard	50d660c545	BUG/MEDIUM: h2: Don't try to chunk data when using HTX. When we're using HTX, we don't have to generate chunk header/trailers, and that ultimately leads to a crash when we try to access a buffer that contains just chunk trailers. This should not be backported.	2018-12-08 08:22:04 +01:00
Willy Tarreau	c2a10d4b4c	MINOR: h2: don't turn HTX header names to lower case anymore Since HTX stores header names in lower case already, we don't need to do it again anymore. This increased H2 performance by 2.7% on quick tests, now making H2 overr HTX about 5.5% faster than H2 over H1.	2018-12-07 13:25:59 +01:00
Olivier Houchard	d247be0620	BUG/MEDIUM: connections: Split CS_FL_RCV_MORE into 2 flags. CS_FL_RCV_MORE is used in two cases, to let the conn_stream know there may be more data available, and to let it know that it needs more room. We can't easily differentiate between the two, and that may leads to hangs, so split it into two flags, CS_FL_RCV_MORE, that means there may be more data, and CS_FL_WANT_ROOM, that means we need more room. This should not be backported.	2018-12-06 16:36:05 +01:00
Willy Tarreau	c14999b3bc	BUG/MEDIUM: mux-h2: stop sending using HTX on errors We didn't take care of the stream error in the HTX send loop, causing some errors (like buffer full) to provoke 100% CPU. No backport is needed.	2018-12-06 14:09:09 +01:00
Willy Tarreau	8e162ee1f9	BUG/MEDIUM: mux-h2: use the correct offset for the HTX start line Due to a thinko, I used sl_off as the start line index number but it's not it, it's its offset. The first index is obtained using htx_get_head(), and the start line is obtained using htx_get_sline(). This caused crashes to happen when forwarding HTX traffic via the H2 mux once the HTX buffer started to wrap. No backport is needed.	2018-12-06 14:07:27 +01:00
Christopher Faulet	27ba2dc6d6	MEDIUM: htx: Rework conversion from a buffer to an htx structure Now, the function htx_from_buf() will set the buffer's length to its size automatically. In return, the caller should call htx_to_buf() at the end to be sure to leave the buffer hosting the HTX message in the right state. When the caller can use the function htxbuf() to get the HTX message without any update on the underlying buffer.	2018-12-05 17:10:16 +01:00
Willy Tarreau	2fb1d4caaa	MINOR: mux-h2: stop on non-DATA and non-EOM HTX blocks We don't want to send such blocks as DATA frames if they were ever to appear, let's quit when meeting them.	2018-12-04 18:32:39 +01:00
Willy Tarreau	ee57376ffb	BUG/MEDIUM: mux-h2: don't send more HTX data than requested It's incorrect to send more bytes than requested, because some filters (e.g. compression) might intentionally hold on some blocks, so DATA blocks must not be processed past the advertised byte count. It is not the case for headers however. No backport is needed.	2018-12-04 18:32:39 +01:00
Willy Tarreau	b08d91fbc5	BUG/MEDIUM: mux-h2: stop sending HTX once the mux is blocked If we're blocking on mux full, mux busy or whatever, we must get out of the loop. In legacy mode this problem doesn't exist as we can normally return 0 but here it's not a sufficient condition to stop sending, so we must inspect the blocking flags as well. No backport is needed.	2018-12-04 18:32:39 +01:00
Willy Tarreau	0c22fa7d6f	BUG/MEDIUM: mux-h2: make sure to always report HTX EOM when consumed by headers The way htx_xfer_blks() was used is wrong, if we receive data, we must report everything we found, not just the headers blocks. This ways causing the EOM to be postponed and some fast responses (or errors) to be incorrectly delayed. No backport is needed.	2018-12-04 18:32:39 +01:00
Willy Tarreau	0f799ca4df	BUG/MEDIUM: mux-h2: properly update the window size in HTX mode When sending data in HTX mode, we forgot to update the window size, it was the cause of the limitation to 1 GB in testing. No backport is needed.	2018-12-04 18:32:39 +01:00
Olivier Houchard	8122a8d681	BUG/MEDIUM: h2: When sending in HTX, make sure the caller knows we sent all. In h2_snd_buf(), when running with htx, make sure we return the amount of data the caller specified, if we emptied the buffer, as it is what the caller expects, and will lead to him properly consider the buffer to be empty.	2018-12-04 18:32:39 +01:00
Olivier Houchard	435ce2d71d	BUG/MEDIUM: h2: Don't forget to wake the tasklet after shutr/shutw. When reaching h2_shutr/h2_shutw, as we may have generated an empty frame, a goaway or a rst, make sure we wake the I/O tasklet, or we may not send what we just generated. Also in h2_shutw(), don't forget to return if all went well, we don't want to subscribe the h2s to wait events.	2018-12-04 05:57:34 +01:00
Joseph Herlant	d77575d03e	CLEANUP: Fix typos in the h2 subsystem Fixes typos in the code comments of the h2 subsystem.	2018-12-02 18:38:08 +01:00
Olivier Houchard	8defe4b51a	MINOR: mux: add a "max_streams" method. Add a new method to muxes, "max_streams", that returns the max number of streams the mux can handle. This will be used to know if a mux is in use or not.	2018-12-02 17:48:32 +01:00
Olivier Houchard	a6cf7112bb	MEDIUM: mux-h2: Don't bother flagging outgoing connections as TOOMANY. When creating a new stream, don't bother flagging a connection with H2_CF_DEM_TOOMANY if we created the last available stream. We won't create any other anyway, because h2_avail_streams() would return 0 available streams, and has it is a blocking flag, it prevents us from reading data after.	2018-12-02 13:31:53 +01:00
Olivier Houchard	7a57e8a67a	MEDIUM: mux-h2: Implement h2_attach(). Implement h2_attach(), so that we can have multiple streams in one outgoin h2 connection.	2018-12-02 13:31:53 +01:00
Willy Tarreau	c12f38fe32	MEDIUM: mux-h2: make h2_process_demux() capable of processing responses as well The function now calls h2c_bck_handle_headers() or h2c_frt_handle_headers() depending on the connection's side. The former doesn't create a new stream but feeds an existing one. At this point it's possible to forward an H2 request to a backend server and retrieve the response headers.	2018-12-02 13:31:52 +01:00
Willy Tarreau	c3e18f3448	MEDIUM: mux-h2: make h2_frt_decode_headers() direction-agnostic This function does not really depend on the request, all it does is also valid for H2 responses found on the backend side, so this patch renames it and makes it call the appropriate decoder based on the direction.	2018-12-02 13:31:52 +01:00
Willy Tarreau	8073969376	MEDIUM: mux-h2: implement encoding of H2 request on the backend side This creates an H2 HEADERS frame from an HTX request. The code is very similar to the response encoding, so probably that in the future we'll have to factor these functions differently. The HTX's start line type is used to decide on the direction. We also purposely error out when trying to encode an H2 request from an H1 message since it's not implemented.	2018-12-02 13:31:52 +01:00
Willy Tarreau	01b4482b46	MEDIUM: mux-h2: start to create the outgoing mux For now it reports an immediate error when trying to encode the request since it doesn't parse as a response. We take care of sending the preface and settings frame with the outgoing connection, and not to wait for a preface during the H2_CS_PREFACE phase for outgoing connections.	2018-12-02 13:31:51 +01:00
Willy Tarreau	751f2d0ddf	MINOR: mux-h2: implement an outgoing stream allocator : h2c_bck_stream_new() For the backend we'll need to allocate streams as well. Let's do this with h2c_bck_stream_new(). The stream ID allocator was split from it so that the caller can decide whether or not to stay on the same connection or create a new one. It possibly isn't the best way to do this as once we're on the mux it's too late to give up creation of a new stream. Another approach would possibly consist in detaching muxes that reached their connection count limit before they can be reused. Instead of choosing the stream id as soon as the stream is created, wait until data is about to be sent. If we don't do that, the stream may send data out of order, and so the stream 3 may send data before the stream 1, and then when the stream 1 will try to send data, the other end will consider that an error, as stream ids should always be increased. Cc: Olivier Houchard <ohouchard@haproxy.com>	2018-12-02 13:31:51 +01:00
Willy Tarreau	f8957277ff	MINOR: mux-h2: mention that the mux is compatible with both sides We declare two configurations for the H2 mux. One supporting only the frontend in HTTP mode and one supporting both sides in HTX mode. This is only to ease development at this point. Trying to assign an h2 mux on the server side will still fail during h2_init() anyway instead of at config parsing time.	2018-12-02 13:31:03 +01:00
Willy Tarreau	c5753aedf7	BUG/MEDIUM: mux-h2: remove the HTX EOM block on H2 response headers If we decided to emit the end of stream flag on the H2 response headers frame, we must remove the EOM block from the HTX stream, otherwise it will lead to an extra DATA frame being sent with the ES flag and will violate the protocol.	2018-12-02 12:31:51 +01:00
Willy Tarreau	fab9bb08fc	BUG/MEDIUM: mux-h2: don't lose the first response header in HTX mode When converting response headers from HTX to H2, we accidently skipped the first header block.	2018-12-02 12:31:20 +01:00
Willy Tarreau	61ea7dc005	MEDIUM: mux-h2: support passing H2 DATA frames to HTX blocks This is used for uploads, we can now convert H2 DATA frames to HTX DATA blocks. It's uncertain whether it's better to reuse the same function or to split it in two at this point. For now the same function was added with some paths specific to HTX. In this mode we loop back to the same or next frame in order to try to complete DATA blocks.	2018-12-01 23:31:13 +01:00
Willy Tarreau	0c535fd1b5	MEDIUM: mux-h2: implement the emission of DATA frames from HTX DATA blocks At the moment the way it's done is not optimal. We should aggregate multiple blocks into a single DATA frame, and we should merge the ES flag with the last one when we already know we've reached the end. For now and for an easier tracking of the HTX stream, an individual empty DATA frame is sent with the ES bit when EOM is met. The DATA function is called for DATA, EOD and EOM since these stats indicate that a previous frame was already produced without the ES flag (typically a headers frame or another DATA frame). Thus it makes sense to handle all these blocks there. There's still an uncertainty on the way the EOD and EOM HTX blocks must be accounted for, as they're counted as one byte in the HTX stream, but if we count that byte off when parsing these blocks, we end up sending too much and desynchronizing the HTX stream. Maybe it hides an issue somewhere else. At least it's possible to reliably retrieve payloads up to 1 GB over H2/HTX now. It's still unclear why larger ones are interrupted at 1 GB.	2018-12-01 23:27:08 +01:00
Willy Tarreau	115e83b071	MEDIUM: mux-h2: implement emission of H2 headers frames from HTX blocks When using HTX, we need a separate function to emit a headers frame. The code is significantly different from the H1 to H2 conversion, though it borrows some parts there. It looks like the part building the H2 frame from the headers list could be factored out, however some of the logic around dealing with end of stream or block sizes remains different. With this patch it becomes possible to retrieve bodyless HTTP responses using H2 over HTX.	2018-12-01 23:27:08 +01:00
Willy Tarreau	bd4a6b675c	MEDIUM: mux-h2: add basic H2->HTX transcoding support for headers When the proxy is configured to use HTX mode, the headers frames will be converted to HTX header blocks instead of HTTP/1 messages. This requires very little modifications to the existing function so it appeared better to do it this way than to duplicate it. Only the request headers are handled, responses are not processed yet and data frames are not processed yet either. The return value is inaccurate but this is not an issue since we're using it as a boolean : data received or not.	2018-12-01 23:27:08 +01:00
Willy Tarreau	bcd3bb3ca2	MEDIUM: mux-h2: make h2_snd_buf() HTX-aware Now h2_snd_buf() will check the proxy's mode to decide whether to use HTX-specific send functions or legacy functions. In HTX mode, the HTX blocks of the output buffer will be parsed and the related functions will be called accordingly based on the block type, and unimplemented blocks will be skipped. For now all blocks are skipped, this is only helpful for debugging.	2018-12-01 23:27:07 +01:00
Willy Tarreau	86724e2e8a	MEDIUM: mux-h2: make h2_rcv_buf() support HTX transfers The function needs to be slightly adapted to transfer HTX blocks, since it may face a full buffer on the receive path, thus it needs to transfer HTX blocks between the two sides ignoring the <count> argument in this mode.	2018-12-01 23:25:55 +01:00
Willy Tarreau	5ae9600950	MEDIUM: mux-h2: register mux for both HTTP and HTX modes The H2 mux will now be called for both HTTP and HTX modes. For now the data transferr functions are not HTX-aware so this will lead to problems if used as-is but it's convenient for development and debugging.	2018-12-01 19:03:20 +01:00
Olivier Houchard	93c8852572	MEDIUM: h2: Destroy a connection with no stream if it has no owner. In h2_detach(), if the connection has no stream left, and no associated owner, then destroy it, as nobody else will be able to.	2018-12-01 10:47:18 +01:00
Olivier Houchard	4667773a8a	BUG/MEDIUM: h2: Call h2_process() if there's an error on the connection. In h2_recv(), return 1 if there's an error on the connection, not just if there's a read0 pending, so that h2_process() can be called and act as a janitor.	2018-11-29 17:39:04 +01:00
Olivier Houchard	0024a98640	BUG/MEDIUM: h2: Don't bogusly error if the previous stream was closed. In h2_process_demux(), if we're demuxing multiple frames, and the previous frame led to a stream getting closed, don't bogusly consider that an error, and destroy the next stream, as there are valid cases where the stream could be closed.	2018-11-28 14:09:55 +01:00
Willy Tarreau	680b2bdf2f	MINOR: h2: make struct h2_ops static There's no reason to export this descriptor, it used to be needed during early H2 development and will complicate porting to HTX.	2018-11-27 09:59:48 +01:00
Willy Tarreau	2455cebe00	MEDIUM: memory: use pool_destroy_all() to destroy all pools on deinit() Instead of exporting a number of pools and having to manually delete them in deinit() or to have dedicated destructors to remove them, let's simply kill all pools on deinit(). For this a new function pool_destroy_all() was introduced. As its name implies, it destroys and frees all pools (provided they don't have any user anymore of course). This allowed to remove 4 implicit destructors, 2 explicit ones, and 11 individual calls to pool_destroy(). In addition it properly removes the mux_pt_ctx pool which was not cleared on exit (no backport needed here since it's 1.9 only). The sig_handler pool doesn't need to be exported anymore and became static now.	2018-11-26 19:50:32 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	172f5ce948	MINOR: initcall: use initcalls for most post_{check,deinit} and per_thread* Most calls to hap_register_post_check(), hap_register_post_deinit(), hap_register_per_thread_init(), hap_register_per_thread_deinit() can be done using initcalls and will not require a constructor anymore. Let's create a set of simplified macros for this, called respectively REGISTER_POST_CHECK, REGISTER_POST_DEINIT, REGISTER_PER_THREAD_INIT, and REGISTER_PER_THREAD_DEINIT. Some files were not modified because they wouldn't benefit from this or because they conditionally register (e.g. the pollers).	2018-11-26 19:50:32 +01:00
Willy Tarreau	0108d90c6c	MEDIUM: init: convert all trivial registration calls to initcalls This switches explicit calls to various trivial registration methods for keywords, muxes or protocols from constructors to INITCALL1 at stage STG_REGISTER. All these calls have in common to consume a single pointer and return void. Doing this removes 26 constructors. The following calls were addressed : - acl_register_keywords - bind_register_keywords - cfg_register_keywords - cli_register_kw - flt_register_keywords - http_req_keywords_register - http_res_keywords_register - protocol_register - register_mux_proto - sample_register_convs - sample_register_fetches - srv_register_keywords - tcp_req_conn_keywords_register - tcp_req_cont_keywords_register - tcp_req_sess_keywords_register - tcp_res_cont_keywords_register - flt_register_keywords	2018-11-26 19:50:32 +01:00
Willy Tarreau	082f559d36	BUG/MEDIUM: h2: restart demuxing after releasing buffer space Since the connection changes in 1.9, some breakage happened to the H2 mux whose initial design was heavily relying on the fact that connection-level functions were woken up after data were transferred to the stream layer. We need to wake the demux up after receiving such data if the demux is blocked. This at least allows to receive POSTs again. One issue remains, it looks like the end of the uploaded data is silently discarded if the server responds before the end of the transfer (H2 in half-closed(local) state), which doesn't happen with 1.8.14 and nghttp as the client. No backport is needed.	2018-11-25 09:06:42 +01:00
Willy Tarreau	1ed87b77b4	BUG/MEDIUM: h2: wake the processing task up after demuxing After the changes to the connection layer in 1.9, some wake up calls need to be introduced to re-activate reading from the connection. One such place is at the end of h2_process_demux(), otherwise processing of input data stops after a few frames. No backport is needed.	2018-11-25 08:52:11 +01:00
Olivier Houchard	7c6f8b146d	MAJOR: connections: Detach connections from streams. Do not destroy the connection when we're about to destroy a stream. This prevents us from doing keepalive on server connections when the client is using HTTP/2, as a new stream is created for each request. Instead, the session is now responsible for destroying connections. When reusing connections, the attach() mux method is now used to create a new conn_stream.	2018-11-18 21:45:45 +01:00
Olivier Houchard	060ed43361	MINOR: mux: Add a destroy() method. Add a new method to muxes, destroy(), that is responsible for destroying the mux and the associated connection, to be used for server connections.	2018-11-18 21:44:53 +01:00
Olivier Houchard	d540b36e8a	MINOR: mux: Add a new "avail_streams" method. Add a new method for mux, avail_streams, that returns the number of streams still available for a mux. For the mux_pt, it'll return 1 if the connection is in idle, or 0. For the H2 mux, it'll return the max number of streams allowed, minus the number of streams currently in use.	2018-11-18 21:44:06 +01:00
Willy Tarreau	fafd3984b9	MINOR: mux: implement a get_first_cs() method This method is used to retrieve the first known good conn_stream from the mux. It will be used to find the other end of a connection when dealing with the proxy protocol for example.	2018-11-18 21:29:20 +01:00
Willy Tarreau	479998adbf	CLEANUP: h2: minimum documentation for recent API changes Commit `d4dd22d` ("MINOR: h2: Let user of h2_recv() and h2_send() know xfer has been done") changed the API without documenting the expected returned values which appear to come out of nowhere in the code :-( Please don't do that anymore! The description was recovered from the commit message.	2018-11-18 06:35:29 +01:00
Olivier Houchard	d846c267d5	MINOR: h2: Don't run tasks that are waiting to send if mux in full. We wake up all the streams waiting to send data when we have space available in the mux buffer. Doing so means we probably wake way too many streams, because after a few the buffer will probably be full instead. So keep a list of all the streams that are about to send data, and if we detect that the buffer is full, unschedule the tasks and put the streams back to the send_list.	2018-10-21 06:00:13 +02:00
Olivier Houchard	53216e7db9	MEDIUM: connections: Don't directly mess with the polling from the upper layers. Avoid using conn_xprt_want_send/recv, and totally nuke cs_want_send/recv, from the upper layers. The polling is now directly handled by the connection layer, it is activated on subscribe(), and unactivated once we got the event and we woke the related task.	2018-10-21 05:58:40 +02:00
Olivier Houchard	81a15af6bc	MINOR: h2: Make sure to return 1 in h2_recv() when needed. In h2_recv(), return 1 if we have data available, or if h2_recv_allowed() failed, to be sure h2_process() is called. Also don't subscribe if our buffer is full.	2018-10-21 05:58:33 +02:00
Olivier Houchard	52b946686c	BUG/MEDIUM: h2: Close connection if no stream is left an GOAWAY was sent. When we're closing a stream, is there's no stream left and a goaway was sent, close the connection, there's no reason to keep it open. [wt: it's likely that this is needed in 1.8 as well, though it's unclear how to trigger this issue, some tests are needed]	2018-10-21 05:53:09 +02:00
Willy Tarreau	b3fb56db10	MINOR: h2: add a new flag to quickly distinguish front vs back connection We will need to know if a mux was created for a front or a back connection and once it's established it's much harder, so let's introduce H2_CF_IS_BACK for this.	2018-10-12 16:58:41 +02:00
Willy Tarreau	a8e4954856	MINOR: h2: split h2c_stream_new() into h2s_new() + h2c_frt_stream_new() For backend connections we'll have to initialize streams but not allocate conn_streams since they'll already be there. Thus this patch splits the h2c_stream_new() function into one dedicated to allocation of a new stream and another one supposed to attach this stream to an existing frontend connection.	2018-10-12 16:58:01 +02:00
Willy Tarreau	0b37d658e6	MINOR: h2: retrieve the front proxy from the caller instead of the session Till now in order to figure the timeouts, we used to retrieve the proxy from the session's owner, but the new API provides it so it's better to simply take it from the caller at init time. We take this opportunity to store the pointer to the proxy into the h2 connection so that we can reuse it later when needed.	2018-10-12 16:58:01 +02:00
Willy Tarreau	7dc24e49cc	MINOR: h2: unify the mux init function The init function was split into the mux init and the front init, but it appears that most of the code will be common between the two sides when implementing the backend init. Thus let's simply make this a unique h2_init() function.	2018-10-12 16:58:01 +02:00
Willy Tarreau	6bf641a61d	MINOR: h2: don't try to send data before preface h2_snd_buf() must not accept to send data if the preface was not yet received nor sent. At the moment it doesn't happen but it can with server-side H2.	2018-10-12 16:58:01 +02:00
Willy Tarreau	7f0cc49645	CLEANUP: h2: rename h2c_snd_settings() to h2c_send_settings() It's the only function not called h2c_send_<something>() and it took me a while to find it.	2018-10-12 16:58:01 +02:00
Willy Tarreau	ab0e1da3a9	MEDIUM: h2: stop relying on H2_SS_IDLE / H2_SS_CLOSED At a few places we check these states to detect if a stream has valid data/errcode or is one of the two dummy streams (idle or closed). It will become problematic for outgoing streams as it will not be possible to report errors for example since the stream will switch from IDLE state only after sending a HEADERS frame. There is a safer solution consisting in checking the stream ID, which may only be zero in the dummy streams. This patch changes the test to only rely on the stream ID.	2018-10-12 16:58:01 +02:00
Olivier Houchard	dddfe31265	BUG/MEDIUM: h2: Make sure we're not in the send list on flow control. If we can't send data for a stream because of its flow control, make sure not to put it in the send_list, until the flow control lets it send again. This is specific to 1.9, and should not be backported.	2018-10-11 15:35:05 +02:00
Olivier Houchard	fa8aa867b9	MEDIUM: connections: Change struct wait_list to wait_event. When subscribing, we don't need to provide a list element, only the h2 mux needs it. So instead, Add a list element to struct h2s, and use it when a list is needed. This forces us to use the unsubscribe method, since we can't just unsubscribe by using LIST_DEL anymore. This patch is larger than it should be because it includes some renaming.	2018-10-11 15:34:39 +02:00
Olivier Houchard	83a0cd8a36	MINOR: connections: Introduce an unsubscribe method. As we don't know how subscriptions are handled, we can't just assume we can use LIST_DEL() to unsubscribe, so introduce a new method to mux and connections to do so.	2018-10-11 15:34:21 +02:00
mildis	cd2d7de44e	BUG/MINOR: h2: null-deref h2c can be null if pool_alloc() failed. Bypass tasklet_free and pool_free if pool_alloc did fail.	2018-10-11 15:17:27 +02:00
Dirkjan Bussink	c26c72d89b	CLEANUP: h1: Fix debug warnings for h1 headers The wrong method was used to debug the h1m state here. This fixes both the signature of the h1m method and also fixes the invocation to be correct.	2018-10-09 15:09:29 +02:00
Willy Tarreau	45efc07cb5	BUG/MEDIUM: h2: make h2_stream_new() return an error on memory allocation failure Commit `8ae735da0` ("MEDIUM: mux_h2: Revamp the send path when blocking.") added a tasklet allocation in h2_stream_new(), however the error exit path fails to reset h2s in case the tasklet cannot be allocated, resulting in the h2s pointer to be returned as valid to the caller. Let's readjust the exit path to always return NULL on error and to always log as well (since there is no reason for not logging on such important errors). No backport is needed, this is strictly 1.9-dev.	2018-10-03 18:30:39 +02:00
Willy Tarreau	0f3835878d	BUG/MEDIUM: h2: check that the connection is still valid at the end of init() Since commit `7505f94f9` ("MEDIUM: h2: Don't use a wake() method anymore."), the H2 mux's init() calls h2_process(). But this last one may detect an early error and call h2_release(), destroying the connection, and return -1. At this point we're screwed because the caller will still dereference the connection for various things ranging from the configuration of the proxy protocol header to the retries. We could simply return -1 here upon failure but that's not enough since the stream layer really needs to keep its connection structure allocated (to clean it up in session_kill_embryonic or for example because it holds the destination address to reconnect to when the connection goes to the backend). Thus the correct solution here is to only schedule a wakeup of the I/O callback so that the init succeeds, and that the connection is only handled later. No backport is needed, this is 1.9-specific.	2018-10-03 18:09:58 +02:00
Olivier Houchard	61d322fa9e	BUG/MEDIUM: h2: Wake the task instead of calling h2_recv()/h2_process(). In a number of cases, we may end up recursively calling h2_recv() via h2_process(), so just wake the tasklet up instead.	2018-09-26 14:21:54 +02:00
Olivier Houchard	21df6cc2f9	MINOR: h2/stream_interface: Reintroduce te wake() method. For the time being, reintroduce the wake methods, it may be revisited later.h	2018-09-26 14:21:54 +02:00
Willy Tarreau	db72da0432	BUG/MINOR: h1: don't consider the status for each header While it was possible to consider the status before parsing response headers, it's wrong to do it for request headers and could lead to random behaviours due to this status matching other fields instead. Additionnally there is little to no value in doing this for each and every new header field. It's much better to reset the content-length at once in the callerwhen seeing such statuses (which currently is only the H2 mux). No backport is needed, this is purely 1.9.	2018-09-13 14:30:23 +02:00
Willy Tarreau	b5b7d4a532	BUG/MAJOR: h2: reset the parser's state on mux buffer full The h2 parser has this specificity that if it cannot send the headers frame resulting from the headers it just parsed, it needs to drop it and parse it again later. Since commit 8852850 ("MEDIUM: h1: let the caller pass the initial parser's state"), when this happens the parser remains in the data state and the headers are not parsed again next time, resulting in a parse error. Let's reset the parser on exit there. No backport is needed.	2018-09-12 18:55:29 +02:00
Olivier Houchard	70d0d18d41	BUG/MEDIUM: h2: Don't forget to set recv_wait_list to NULL in h2_detach. If we're detaching the conn_stream, and it was subscribed to be waken up when more data was available to receive, unsubscribe it. No backport is needed.	2018-09-12 18:55:25 +02:00
Olivier Houchard	251f6a23ad	BUG/MEDIUM: h2: Don't forget to empty the wait lists on destroy. Empty both send_list and fctl_list when destroying the h2 context, so that if we're freeing the stream after, it doesn't try to remove itself from the now-deleted list. No backport is needed.	2018-09-12 18:55:18 +02:00
Willy Tarreau	175a2bb507	MINOR: connection: pass the proxy when creating a connection Till now it was very difficult for a mux to know what proxy it was working for. Let's pass the proxy when the mux is instanciated at init() time. It's not yet used but the H1 mux will definitely need it, just like the H2 mux when dealing with backend connections.	2018-09-12 17:39:22 +02:00
Willy Tarreau	eb528db60b	MINOR: h1: add H1_MF_TOLOWER to decide when to turn header names to lower case The h1 parser used to systematically turn header field names to lower case because it was designed for H2. Let's add a flag which is off by default to condition this behaviour so that when using it from an H1 parser it will not affect the message.	2018-09-12 17:38:26 +02:00
Willy Tarreau	9c5e22e436	MINOR: h2: store the HTTP status into the H2S, not the H1M The HTTP status is not relevant to the H1 message but to the H2 stream itself. It used to be placed there by pure convenience but better move it before it's too hard to remove.	2018-09-12 17:38:25 +02:00
Willy Tarreau	001823c304	MEDIUM: h1: remove the useless H1_MSG_BODY state This state was only a delimiter between headers and body but it now causes more harm than good because it requires someone to change it. Since the H1 parser knows if we're in DATA or CHUNK_SIZE, simply let it set the right next state so that h1m->state constantly matches what is expected afterwards.	2018-09-12 17:38:25 +02:00
Willy Tarreau	4433c083ec	MEDIUM: h1: let the caller pass the initial parser's state This way the caller controls if it's the request or response which has to be used, and it will allow to restart after an incomplete parsing.	2018-09-12 17:38:25 +02:00
Willy Tarreau	a41393fc61	MEDIUM: h1: make the parser support a pointer to a start line This will allow the parser to fill some extra fields like the method or status without having to store them permanently in the HTTP message. At this point however the parser cannot restart from an interrupted read.	2018-09-12 17:38:25 +02:00
Willy Tarreau	9b8cd1f183	MINOR: h2: pre-initialize h1m->err_pos to -1 on the output path We don't want to trigger an error while parsing a response coming from haproxy (it could be an errorfile for example), so let's set this to -1.	2018-09-12 17:38:25 +02:00
Willy Tarreau	a40704ab05	MINOR: mux_h2: replace the req,res h1 messages with a single h1 message There's no reason to have the two sides in H1 format since we only use one at a time (the response at the moment). While completely removing the request declaration, let's rename the response to "h1m" to clarify that it's the unique h1 message there.	2018-09-12 17:38:25 +02:00
Willy Tarreau	25173a7bcc	MINOR: h2: make sure h1m->err_pos field is correct on chunk error This never happens but in case it would, it's better to report the correct offset of the error instead of a negative value.	2018-09-12 17:38:25 +02:00
Willy Tarreau	7f437ff81c	MINOR: h1: provide a distinct init() function for request and response h1m_init() used to handle response only since it was used by the H1 client code. Let's have one init per direction.	2018-09-12 17:38:25 +02:00
Willy Tarreau	801250e07d	REORG: h1: create a new h1m_state This is the parsing state of an HTTP/1 message. Currently the h1_state is composite as it's made both of parsing and control (100SENT, BODY, DONE, TUNNEL, ENDING etc). The purpose here is to have a purely H1 state that can be used by H1 parsers. For now it's equivalent to h1_state.	2018-09-12 17:38:25 +02:00
Olivier Houchard	c2aa71108a	MEDIUM: stream_interfaces: Starts receiving from the upper layers. Instead of waiting for the connection layer to let us know we can read, attempt to receive as soon as process_stream() is called, and subscribe to receive events if we can't receive yet. Now, except for idle connections, the recv(), send() and wake() methods are no more, all the lower layers do is waking tasklet for anybody waiting for I/O events.	2018-09-12 17:37:55 +02:00
Olivier Houchard	8ae735da05	MEDIUM: mux_h2: Revamp the send path when blocking. Change fctl_list and send_list to be lists of struct wait_list, and nuke send_wait_list, as it's now redundant. Make the code responsible for shutr/shutw subscribe to those lists.	2018-09-12 17:37:55 +02:00
Olivier Houchard	7505f94f90	MEDIUM: h2: Don't use a wake() method anymore. Instead of having our wake() method called each time a fd event happens, just subscribe to recv/send events, and get our tasklet called when that happens. If any recv/send was possible, the equivalent of what h2_wake_cb() will be done.	2018-09-12 17:37:55 +02:00
Olivier Houchard	a1411e62e4	MEDIUM: h2: always subscribe to receive if allowed. Let the connection layer know we're always interested in getting more data, so that we get scheduled as soon as data is available, instead of relying on the wake() method.	2018-09-12 17:37:55 +02:00
Olivier Houchard	d4dd22d0ab	MINOR: h2: Let user of h2_recv() and h2_send() know xfer has been done. Make h2_recv() and h2_send() return 1 if data has been sent/received, or 0 if it did not. That way the caller will be able to know if more work may have to be done.	2018-09-12 17:37:55 +02:00
Olivier Houchard	af4021e680	MEDIUM: connections: Get rid of the recv() method. Remove the recv() method from mux and conn_stream. The goal is to always receive from the upper layers, instead of waiting for the connection later. For now, recv() is still called from the wake() method, but that should change soon.	2018-09-12 17:37:55 +02:00
Olivier Houchard	4cf7fb148f	MEDIUM: connections/mux: Add a recv and a send+recv wait list. For struct connection, struct conn_stream, and for the h2 mux, add 2 new lists, one that handles waiters for recv, and one that handles waiters for recv and send. That way we can ask to subscribe for either recv or send.	2018-09-12 17:37:55 +02:00
Willy Tarreau	2c096c3b7a	BUG/MINOR: h2: report asynchronous end of stream on closed connections Christopher noticed that the CS_FL_EOS to CS_FL_REOS conversion was incomplete : when the connectionis closed, we mark the streams with EOS instead of REOS, causing the loss of any possibly pending data. At the moment it's not an issue since H2 is used only with a client, but with servers it could be a real problem if servers close the connection right after sending their response. This patch should be backported to 1.8.	2018-09-12 09:45:54 +02:00
Willy Tarreau	22de8d3e01	MEDIUM: h2: produce some logs on early errors that prevent streams from being created The h2 mux currently lacks some basic transparency. Some errors cause the connection to be aborted but they couldn't be reported. With this patch, almost all situations where an error will cause a stream or connection to be aborted without the ability for an existing stream to report it will be reported in the logs. This at least provides a solution to monitor the activity and abnormal traffic.	2018-09-06 09:43:41 +02:00
Willy Tarreau	a0d11b6fd5	BUG/MEDIUM: h2: fix risk of memory leak on malformated wrapped frames While parsing a headers frame, if the frame is wrapped in the buffer and needs to be unwrapped, it will be duplicated before being processed. But if it contains certain combinations of invalid flags, the parser returns without releasing the temporary buffer leading to a memory leak. This fix needs to be backported to 1.8.	2018-09-05 20:01:14 +02:00
Willy Tarreau	590a0514f2	BUG/MEDIUM: session: fix reporting of handshake processing time in the logs The handshake processing time used to be stored per stream, which was valid when there was exactly one stream per session. With H2 and multiplexing it's not the case anymore and the reported handshake times are wrong in the logs as it's computed between the TCP accept() and the stream creation. Let's first move the handshake where it belongs, which is the session. However, this is not enough because we don't want to report an excessive idle time either for H2 (since many requests use the connection). So the solution used here is to have the stream retrieve sess->tv_accept and the handshake duration when the stream is created, and let the mux immediately reset them. This way, the handshake time becomes zero for the second and subsequent requests in H2 (which was already the case in H1), and the idle time exactly counts how long the connection remained unused while it could be used, so in H1 it runs from the end of the previous response and in H2 it runs from the end of the previous request since the channel is already available. This patch will need to be backported to 1.8.	2018-09-05 16:30:23 +02:00
Olivier Houchard	fab7c7e91c	BUG/MEDIUM: H2: Activate polling after successful h2_snd_buf(). Make sure h2_send() is called after h2_snd_buf() by activating polling. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:57 +02:00
Olivier Houchard	29fb89dc5e	MINOR: mux_h2: Don't use h2_send() as a callback. Instead of using h2_send() directly as a callback, introcude h2_io_cb(), that will call h2_send() if it is possible to send data.	2018-08-16 17:29:54 +02:00
Olivier Houchard	e1c6dbcd70	MINOR: connections/mux: Add the wait reason(s) to wait_list. Add a new element to the wait_list, that let us know which event(s) we are waiting on.	2018-08-16 17:29:53 +02:00
Olivier Houchard	638b799b09	MINOR: connections: Move rxbuf from the conn_stream to the h2s. As the mux_h2 is the only user of rxbuf, move it to the struct h2s, instead of conn_stream.	2018-08-16 17:28:11 +02:00
Olivier Houchard	511efeae7e	MINOR: connections: Make rcv_buf mandatory and nuke cs_recv(). Reintroduce h2_rcv_buf(), right now it just does what cs_recv() did, but should be modified later.	2018-08-16 17:23:44 +02:00
Christopher Faulet	32f61c0421	MINOR: mux: Unlink ALPN and multiplexers to rather speak of mux protocols Multiplexers are not necessarily associated to an ALPN. ALPN is a TLS extension, so it is not always defined or used. Instead, we now rather speak of multiplexer's protocols. So in this patch, there are no significative changes, some structures and functions are just renamed.	2018-08-08 09:54:22 +02:00
Christopher Faulet	2d5292a412	MINOR: mux: Add info about the supported side in alpn_mux_list structure Now, a multiplexer can specify if it can be install on incoming connections (ALPN_SIDE_FE), on outgoing connections (ALPN_SIDE_BE) or both (ALPN_SIDE_BOTH). These flags are compatible with proxies' ones.	2018-08-08 09:54:22 +02:00
Christopher Faulet	d44a9b3627	MEDIUM: mux: Remove const on the buffer in mux->snd_buf() This is a partial revert of the commit `deccd1116` ("MEDIUM: mux: make mux->snd_buf() take the byte count in argument"). It is a requirement to do zero-copy transfers. This will be mandatory when the TX buffer of the conn_stream will be used. So, now, data are consumed by mux->snd_buf() and not only sent. So it needs to update the buffer state. On its side, the caller must be aware the buffer can be replaced y an empty or unallocated one. As a side effet of this change, the function co_set_data() is now only responsible to update the channel set, by update ->output field.	2018-08-07 14:36:52 +02:00
Willy Tarreau	a2b5181e7a	BUG/MEDIUM: h2: prevent orphaned streams from blocking a connection forever Some h2 connections remaining in CLOSE_WAIT state forever have been reported for a while. Thanks to detailed captures provided by Milan Petruzelka, the sequence where this happens became clearer : 1) multiple streams compete for the mux and are queued in the send_list 2) at this point the mux has to emit a GOAWAY for any reason (for example because it received a bad message) 3) the streams are woken up, notified about the error 4) h2_detach() is called for each of them 5) the CS they are detached from the H2S 6) since the streams are marked as blocked for some room, they are orphaned and nothing more is done on them. 7) at this point, any activity on the connection goes through h2_wake() which sees the conneciton in ERROR2 state, tries again to release the streams, cannot, and stops polling (thus even connection errors cannot be detected anymore). => from this point, no more events can be received on the connection, and the streams remain orphaned forever. This patch makes sure that we never return without doing anything once an error was met. It has to act both on the h2_detach() side (for h2 streams being detached after the error was emitted) and on the h2_wake() side (for errors reported after h2s have already been orphaned). Many thanks to Milan Petruzelka and Janusz Dziemidowicz for their awesome work on this issue, collecting traces and testing patches, and to Olivier Doucet for extra testing and confirming the fix. This fix must be backported to 1.8.	2018-07-27 09:55:14 +02:00
Willy Tarreau	616ac81dec	MINOR: h2: add the error code and the max/last stream IDs to "show fd" This is intented to help debugging H2 in field.	2018-07-24 14:12:42 +02:00
Willy Tarreau	842ed9b1cb	MEDIUM: h2: use the default conn_stream's receive function This removes h2_rcv_buf() now that the generic code can handle it fine.	2018-07-20 19:37:12 +02:00
Willy Tarreau	39d68508c3	MINOR: h2: make use of CS_FL_REOS to indicate that end of stream was seen This allows h2_rcv_buf() not to depend anymore on h2s at all and to become generic.	2018-07-20 19:35:14 +02:00
Willy Tarreau	2df65e7194	MEDIUM: h2: don't call data_cb->recv() anymore Now we simply call data_cb->wake() which will automatically perform the recv() call if required.	2018-07-20 19:31:36 +02:00
Willy Tarreau	2a761dcf0d	MEDIUM: h2: perform a single call to the data layer in demux() Instead of calling the data layer from each individual frame processing function, we now call it from demux. This requires to know the h2s that was created inside h2c_frt_handle_headers(), which is why the pointer is now returned. This results in a small performance boost from 58k to 60k POST requests/s compared to -master, thanks to half the number of si_cs_recv_cb() calls and 66% calls to si_cs_wake_cb(). It's interesting to note that all calls to data_cb->recv() are now always immediately followed by a call to data_cb->wake(). The next step should be to let the ->wake handler perform the recv() call itself. For this it will be useful to have some info on the CS to indicate whether or not it is ready to be read (ie: contains a non-empty input buffer).	2018-07-20 19:30:03 +02:00
Willy Tarreau	a56a6def91	MEDIUM: h2: move headers and data frame decoding to their respective parsers Now we entirely process the input frame before transfering it above, so that h2_rcv_buf() doesn't have to "speak" h2 anymore.	2018-07-20 19:21:43 +02:00
Willy Tarreau	454b57b347	MEDIUM: h2: centralize transfer of decoded frames in h2_rcv_buf() We still call the parser but it should soon not be needed anymore. The decode functions don't need the buffer nor the max size anymore. They must also not touch the CS_FL_EOS or CS_FL_RCV_MORE flags either, so this is done within h2_rcv_buf() after transmission. The "flags" argument to h2_frt_decode_headers() and h2_frt_transfer_data() has been removed since it's not used anymore.	2018-07-20 19:21:43 +02:00
Willy Tarreau	d755ea6c7d	MEDIUM: h2: make h2_frt_transfer_data() copy via an intermediary buffer The purpose here is also to ensure we can split the lower from the top layers. The way the CS_FL_MSG_MORE flag is set was updated so that it's set or cleared upon exit depending on the buffer's remaining contents.	2018-07-20 19:21:43 +02:00
Willy Tarreau	937f760e1e	MEDIUM: h2: make h2_frt_decode_headers() use an intermediary buffer The purpose is to decode to a temporary buffer and then to copy this buffer to the caller. This double-copy definitely has an impact on performance, the test code goes down from 220k to 140k req/s, but this memcpy() will disappear soon. The test on CO_RFL_BUF_WET has become irrelevant now since we only use the cs' rxbuf, so we cannot be blocked by "output" data that has to be forwarded first. Thus instead we don't start until the rxbuf is empty (it will be drained from any input data once the stream processes it).	2018-07-20 19:21:43 +02:00
Willy Tarreau	0b559071dd	MINOR: h2: make each H2 stream support an intermediary input buffer The purpose is to decode to a temporary buffer and then to copy this buffer to the caller upon request to avoid having to process frames on the fly when called from the higher level. For now the buffer is only initialized on stream creation via cs_new() and allocated if the buffer_wait's callback is called.	2018-07-20 19:21:43 +02:00
Olivier Houchard	f495fc460e	BUG/MEDIUM: mux_h2: Call h2_send() before updating polling. In h2_wake(), make sure we call h2_send() before we try to update the polling flags, and detect connection errors, or errors will never be detected.	2018-07-20 19:07:49 +02:00
Olivier Houchard	910b2bc829	MEDIUM: connections/mux: Revamp the send direction. Totally nuke the "send" method, instead, the upper layer decides when it's time to send data, and if it's not possible, uses the new subscribe() method to be called when it can send data again.	2018-07-19 18:31:07 +02:00
Olivier Houchard	6ff2039d13	MINOR: connections/mux: Add a new "subscribe" method. Add a new "subscribe" method for connection, conn_stream and mux, so that upper layer can subscribe to them, to be called when the event happens. Right now, the only event implemented is "SUB_CAN_SEND", where the upper layer can register to be called back when it is possible to send data. The connection and conn_stream got a new "send_wait_list" entry, which required to move a few struct members around to maintain an efficient cache alignment (and actually this slightly improved performance).	2018-07-19 16:23:43 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later permit to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the othe ones.	2018-07-19 16:23:43 +02:00
Willy Tarreau	ea1b06d5bb	MINOR: buffer: add a new file for ist + buffer manipulation functions The new file istbuf.h links the indirect strings (ist) with the buffers. The purpose is to encourage addition of more standard buffer manipulation functions that rely on this in order to improve the overall ease of use along all the code. Just like ist.h and buf.h, this new file is not expected to depend on anything beyond these two files. A few functions were added and/or converted from buffer.h : - b_isteq() : indicates if a buffer and a string match - b_isteat() : consumes a string from the buffer if it matches - b_istput() : appends a small string to a buffer (all or none) - b_putist() : appends part of a large string to a buffer The equivalent functions were removed from buffer.h and changed at the various call places.	2018-07-19 16:23:43 +02:00
Willy Tarreau	55372f646f	MINOR: buffer: replace b{i,o}_put* with b_put* The two variants now do exactly the same (appending at the tail of the buffer) so let's not keep the distinction between these classes of functions and have generic ones for this. It's also worth noting that b{i,o}_putchk() wasn't used at all and was removed.	2018-07-19 16:23:43 +02:00
Willy Tarreau	b7b5fe1a14	MEDIUM: h2: update to the new buffer API There is no more distinction between ->i and ->o for the mux's buffers, we always use b_data() to know the buffer's length since only one side is used for each direction.	2018-07-19 16:23:42 +02:00
Olivier Houchard	acd1403794	MINOR: buffer: Use b_add()/bo_add() instead of accessing b->i/b->o. Use the newly available functions instead of using the buffer fields directly.	2018-07-19 16:23:42 +02:00
Willy Tarreau	591d445049	MINOR: buffer: use b_orig() to replace most references to b->data This patch updates most users of b->data to use b_orig().	2018-07-19 16:23:42 +02:00
Willy Tarreau	337ea57cfc	MINOR: connection: add a new receive flag : CO_RFL_BUF_WET With this flag we introduce the notion of "dry" vs "wet" buffers : some demultiplexers like the H2 mux require as much room as possible for some operations that are not retryable like decoding a headers frame. For this they need to know if the buffer is congested with data scheduled for leaving soon or not. Since the new API will not provide this information in the buffer itself, the caller must indicate it. We never need to know the amount of such data, just the fact that the buffer is not in its optimal condition to be used for receipt. This "CO_RFL_BUF_WET" flag is used to mention that such outgoing data are still pending in the buffer and that a sensitive receiver should better let it "dry" before using it.	2018-07-19 16:23:41 +02:00
Willy Tarreau	7f3225f251	MINOR: connection: add a flags argument to rcv_buf() The mux and transport rcv_buf() now takes a "flags" argument, just like the snd_buf() one or like the equivalent syscall lower part. The upper layers will use this to pass some information such as indicating whether the buffer is free from outgoing data or if the lower layer may allocate the buffer itself.	2018-07-19 16:23:41 +02:00
Willy Tarreau	d9cf540457	MEDIUM: mux: make mux->rcv_buf() take a size_t for the count It also returns a size_t. This is in order to clean the API. Note that the H2 mux still uses some ints in the functions called from h2_rcv_buf(), though it's not really a problem given that H2 frames are smaller. It may deserve a general cleanup later though.	2018-07-19 16:23:41 +02:00
Willy Tarreau	deccd1116d	MEDIUM: mux: make mux->snd_buf() take the byte count in argument This way the mux doesn't need to modify the buffer's metadata anymore nor to know the output's size. The mux->snd_buf() function now takes a const buffer and it's up to the caller to update the buffer's state. The return type was updated to return a size_t to comply with the count argument.	2018-07-19 16:23:41 +02:00
Willy Tarreau	787db9a6a4	MEDIUM: connection: make xprt->snd_buf() take the byte count in argument This way the senders don't need to modify the buffer's metadata anymore nor to know about the output's split point. This way the functions can take a const buffer and it's clearer who's in charge of updating the buffer after a send. That's why the buffer realignment is now performed by the caller of the transport's snd_buf() functions. The return type was updated to return a size_t to comply with the count argument.	2018-07-19 16:23:41 +02:00
Willy Tarreau	55f3ce1c91	MINOR: buffer: make b_getblk_nc() take size_t for the block sizes Till now we used to reimplement it using ints to limit external changes but we must adjust it and the various users to switch to size_t.	2018-07-19 16:23:41 +02:00
Willy Tarreau	206ba834ef	MINOR: buffer: make b_getblk_nc() take const pointers Now that there are no more users requiring to modify the buffer anymore, switch these ones to const char and const buffer. This will make it more obvious next time send functions are tempted to modify the buffer's output count. Minor adaptations were necessary at a few call places which were using char due to the function's previous prototype.	2018-07-19 16:23:41 +02:00
Willy Tarreau	9c7f2d19bf	MEDIUM: h2: don't use b_ptr() nor b_end() anymore The few places where they were still used were replaced with b_peek() and b_wrap() respectively. The parts making use of ->i and ->o should now be convertible to the new API.	2018-07-19 16:23:41 +02:00
Willy Tarreau	0bad0439f4	MEDIUM: h2: do not use buf->o anymore inside h2_snd_buf's loop buf->o is only retrieved at the loop entry and modified using b_del() on exit. We're close to being able to change the API to take a count argument.	2018-07-19 16:23:41 +02:00
Willy Tarreau	f40e68227b	MINOR: h1: make h1_measure_trailers() use an offset and a count This will be needed by the H2 encoder to restart after wrapping.	2018-07-19 16:23:41 +02:00
Willy Tarreau	84d6b7af87	MINOR: h1: make h1_parse_chunk_size() not depend on b_ptr() anymore It's similar to the previous commit so that the function doesn't rely on buf->p anymore.	2018-07-19 16:23:41 +02:00
Willy Tarreau	c0973c6742	MINOR: h1: make h1_skip_chunk_crlf() not depend on b_ptr() anymore It now takes offsets relative to the buffer's head. It's up to the callers to add this offset which corresponds to the buffer's output size.	2018-07-19 16:23:41 +02:00
Willy Tarreau	5dd17353d5	MEDIUM: h2: prevent the various mux encoders from modifying the buffer Functions h2s_frt_make_resp_headers() and h2s_frt_make_resp_data() used to modify the buffer's output data count. This is problematic for the buffer's rework as we don't want to rely on this anymore. This commit modifies these functions to take an offset (relative to the buffer's head) and a maximum byte count. Thus h2_snd_buf() now calls them with buf->o and takes care of removing deleted data itself. The send functions now almost support being passed const buffers (except for the data part which is still embedded).	2018-07-19 16:23:41 +02:00
Willy Tarreau	1dc41e75d8	MINOR: h2: clarify the fact that the send functions are unsigned There's no more error return combined with the send output, though the comments were misleading. Let's fix this as well as the functions' prototypes. h2_snd_buf()'s return value wasn't changed yet since it has to match the ->snd_buf prototype.	2018-07-19 16:23:40 +02:00
Willy Tarreau	7314be8e2c	MINOR: h1: make h1_measure_trailers() take the byte count in argument The principle is that it should not have to take this value from the buffer itself anymore.	2018-07-19 16:23:40 +02:00
Willy Tarreau	e5f12ce7f2	MINOR: buffer: replace bi_del() and bo_del() with b_del() Till now the callers had to know which one to call for specific use cases. Let's fuse them now since a single one will remain after the API migration. Given that bi_del() may only be used where o==0, just combine the two tests by first removing output data then only input.	2018-07-19 16:23:40 +02:00
Willy Tarreau	a1f78fb652	MINOR: buffer: replace bo_getblk_nc() with b_getblk_nc() which takes an offset This will be important so that we can parse a buffer without touching it. Now we indicate where from the buffer's head we plan to start to copy, and for how many bytes. This will be used by send functions to loop at the end of the buffer without having to update the buffer's output byte count.	2018-07-19 16:23:40 +02:00
Willy Tarreau	e4d5a036ed	MINOR: buffer: merge b{i,o}_contig_space() These ones were merged into a single b_contig_space() that covers both (the bo_ case was a simplified version of the other one). The function doesn't use ->i nor ->o anymore.	2018-07-19 16:23:40 +02:00
Willy Tarreau	8f9c72d301	MINOR: buffer: remove bi_end() It was replaced by ci_tail() when the channel is known, or b_tail() in other cases.	2018-07-19 16:23:40 +02:00
Willy Tarreau	41e38ac0ee	MINOR: buffer: remove bo_end() It was replaced by either b_tail() when the buffer has no input data, or b_peek(b, b->o).	2018-07-19 16:23:40 +02:00
Willy Tarreau	89faf5d7c3	MINOR: buffer: remove bo_ptr() It was replaced by co_head() when a channel was known, otherwise b_head().	2018-07-19 16:23:40 +02:00
Willy Tarreau	dda2e41881	MINOR: buffer: remove bi_ptr() It's now been replaced by b_head() when b->o is null, ci_head() when the channel is known, or b_peek(b, b->o) in other situations.	2018-07-19 16:23:40 +02:00
Willy Tarreau	7194d3cc3b	MINOR: buffer: split bi_contig_data() into ci_contig_data and b_config_data() This function was sometimes used from a channel and sometimes from a buffer. In both cases it requires knowledge of the size of the output data (to skip them). Here the split ensures the channel can deal with this point, and that other places not having output data can continue to work.	2018-07-19 16:23:40 +02:00
Willy Tarreau	aa7af7213d	MINOR: buffer: replace calls to buffer_space_wraps() with b_space_wraps() And remove the unused function.	2018-07-19 16:23:40 +02:00
Willy Tarreau	0db4d10efc	MINOR: h2: use b_slow_realign() with the trash as a swap buffer H2 doesn't use the trash so it can make use of it as a swap area when calling b_slow_realign(). This way we don't need buffer_slow_realign() anymore.	2018-07-19 16:23:40 +02:00
Willy Tarreau	4cf1300e6a	MINOR: channel/buffer: replace buffer_slow_realign() with channel_slow_realign() and b_slow_realign() Where relevant, the channel version is used instead. The buffer version was ported to be more generic and now takes a swap buffer and the output byte count to know where to set the alignment point. The H2 mux still uses buffer_slow_realign() with buf->o but it will change later.	2018-07-19 16:23:40 +02:00
Willy Tarreau	506a29ac6e	MINOR: buffer: switch buffer sizes and offsets to size_t Passing unsigned ints everywhere is painful, and will cause some headache later when we'll want to integrate better with struct ist which already uses size_t. Let's switch buffers to use size_t instead.	2018-07-19 16:23:39 +02:00
Willy Tarreau	42d55b9b6a	BUG/MEDIUM: h2: make sure the last stream closes the connection after a timeout If a timeout strikes on the connection side with some active streams, there is a corner case which can sometimes cause the following sequence to happen : - There are active streams but there are data in the mux buffer (eg: a client suddenly disconnected during a download with pending requests). The timeout is active. - The timeout strikes, h2_timeout_task() is called, kills the task and doesn't close the connection since there are streams left ; The connection is marked in H2_CS_ERROR ; - the streams are woken up and closed ; - when the last stream closes, calling h2_detach(), it sees the tree list is empty, but there is no condition allowing the connection to be closed (mbuf->o > 0), thus it does nothing ; - since the task is dead, there's no more hope to clear this situation later For now we can take care of this by adding a test for the presence of H2_CS_ERROR and !task, implying the timeout task triggered already and will not be able to handle this again. Over the long term it seems like a more reliable test on should be made, so that it is possible to know whether or not someone is still able to close this connection. A big thanks to Janusz Dziemidowicz and Milan Petruzelka for providing many details helping in figuring this bug.	2018-07-19 14:31:47 +02:00
Willy Tarreau	00610960a1	BUG/MEDIUM: h2: never leave pending data in the output buffer on close We currently don't process trailers on H2, but this has an impact : on chunked HTTP/1 responses, we decide to emit the ES bit once we see the 0CRLF. From this point the stream switches to the CLOSED state, which aborts processing of the remaining bytes. Thus the extra CRLF which ends trailers is not processed and remains in the buffer. This prevents the stream from being notified about end of transmission, which in turn keeps the mux busy and prevents the connection from quitting. The case of the trailers is not the root cause of this issue, though it is what triggers it. The root cause is that upon error and/or close, once we know we're not going to process any more data, we must absolutely flush any remaining bytes from the output buffer, otherwise there is no way the stream can quit. This is what this patch does. It looks very likely related to the issues reported and debugged by Janusz Dziemidowicz and Milan Petruzelka. One way to reproduce it is to chain two proxies with the last one emitting chunked data (typically using the stats page) : global stats socket /tmp/sock1 mode 666 level admin stats timeout 1h tune.ssl.default-dh-param 1024 tune.bufsize 16384 defaults mode http timeout connect 4s timeout client 10s timeout server 20s listen px1 bind :4443 ssl crt rsa+dh2048.pem npn h2 alpn h2 server s1 127.0.0.1:4445 listen px2 bind :4444 ssl crt rsa+dh2048.pem npn h2 alpn h2 bind :4445 stats uri / Then use curl to fetch the stats through px1 : curl --http2 -k "https://127.0.0.1:4443/" When curl is sent to the first one, "show sess" issued to the CLI will show a remaining session during the client timeout. When curl is aimed at port 4444 (px2), there is no such remaining session. This fix needs to be backported to 1.8.	2018-07-19 11:09:12 +02:00
Willy Tarreau	c65edac804	MINOR: h2: add the mux and demux buffer lengths on "show fd" It is convenient during debugging sessions to know if the mux and demux buffers are empty/full/other. Let's report this on "show fd" output.	2018-07-19 10:54:43 +02:00
Willy Tarreau	f210191dcd	BUG/MEDIUM: h2: don't accept new streams if conn_streams are still in excess The streams bookkeeping made in H2 is used for protocol compliance only but it doesn't consider the number of conn_streams still attached to the mux. It causes an issue when http-request set-nice rules are applied on H2 requests processed on a saturated machine. Indeed, in this case, the requests are accepted and assigned a default nice value of zero. When they are processed, their nice value changes to a higher one (say 1024). The response is sent through the H2 mux, which detects the end of stream and decrements the protocol-level stream count (h2c->nb_streams). The client may then send a new request. But the conn_stream is still attached and will require a new call to process_stream() to finish, which is made through the scheduler. Given that the machine is saturated, it is assumed that many tasks are present in the scheduler. Thus the closing tasks holding a higher nice value will pass after the new stream creations. If the client is fast enough with a low latency link, it may add a lot of new stream creations before the stream terminations have a chance to disappear due to their high nice value, resulting in a huge amount of memory being used. The solution consists in letting a mux always monitor its conn_streams and refrain from creating new ones when it is full. Here the H2 mux checks the nb_cs counter and sets a new blocked flag (H2_CF_DEM_TOOMANY) if the limit was reached, so that the frame parser requests a pause in the new stream creation, leaving some time for the pending conn_streams to vanish. Several experiments were made using varying thresholds to see if overbooking would provide any benefit here but it turned out not to be the case, so the conn_stream limit remains set to the exact streams limit. Interestingly various performance measurements showed that the code tends to be slightly faster now than without the limit, probably due to the smoother memory usage. This commit requires previous patch ("MINOR: h2: keep a count of the number of conn_streams attached to the mux"). It needs to be backported to 1.8.	2018-07-19 10:23:15 +02:00
Willy Tarreau	7ac60e836a	MINOR: h2: keep a count of the number of conn_streams attached to the mux The h2 mux only knows about the number of H2 streams which are not in a CLOSED state. This is used for protocol compliance. But it doesn't hold the number of really attached streams. It is a problem because depending on scheduling, it is possible that more streams are attached to the mux than the ones seen at the protocol level, due to some streams taking some time to be detached. Let's add this count based on the conn_streams. Note: this patch is part of a series of fixes which will have to be backported to 1.8.	2018-07-19 09:06:37 +02:00
Olivier Houchard	673867c357	MAJOR: applets: Use tasks, instead of rolling our own scheduler. There's no real reason to have a specific scheduler for applets anymore, so nuke it and just use tasks. This comes with some benefits, the first one being that applets cannot induce high latencies anymore since they share nice values with other tasks. Later it will be possible to configure the applets' nice value. The second benefit is that the applet scheduler was not very thread-friendly, having a big lock around it in prevision of this change. Thus applet-intensive workloads should now scale much better with threads. Some more improvement is possible now : some applets also use a task to handle timers and timeouts. These ones could now be simplified to use only one task.	2018-05-26 20:03:30 +02:00
Olivier Houchard	9f6af33222	MINOR: tasks: Change the task API so that the callback takes 3 arguments. In preparation for thread-specific runqueues, change the task API so that the callback takes 3 arguments, the task itself, the context, and the state, those were retrieved from the task before. This will allow these elements to change atomically in the scheduler while the application uses the copied value, and even to have NULL tasks later.	2018-05-26 19:23:57 +02:00
Willy Tarreau	eba10f24b7	BUG/MEDIUM: h2: implement missing support for chunked encoded uploads Upload requests not carrying a content-length nor tunnelling data must be sent chunked-encoded over HTTP/1. The code was planned but for some reason forgotten during the implementation, leading to such payloads to be sent as tunnelled data. Browsers always emit a content length in uploads so this problem doesn't happen for most sites. However some applications may send data frames after a request without indicating it earlier. The only way to detect that a client will need to send data is that the HEADERS frame doesn't hold the ES bit. In this case it's wise to look for the content-length header. If it's not there, either we're in tunnel (CONNECT method) or chunked-encoding (other methods). This patch implements this. The following request is sent using content-length : curl --http2 -sk https://127.0.0.1:4443/s2 -XPOST -T /large/file and these ones using chunked-encoding : curl --http2 -sk https://127.0.0.1:4443/s2 -XPUT -T /large/file curl --http2 -sk https://127.0.0.1:4443/s2 -XPUT -T - < /dev/urandom Thanks to Robert Samuel Newson for raising this issue with details. This fix must be backported to 1.8.	2018-04-26 10:20:44 +02:00
Willy Tarreau	174b06a572	MINOR: h2: detect presence of CONNECT and/or content-length We'll need this in order to support uploading chunks. The h2 to h1 converter checks for the presence of the content-length header field as well as the CONNECT method and returns these information to the caller. The caller indicates whether or not a body is detected for the message (presence of END_STREAM or not). No transfer-encoding header is emitted yet.	2018-04-26 10:15:14 +02:00
Willy Tarreau	3f0e1ec701	BUG/CRITICAL: h2: fix incorrect frame length check The incoming H2 frame length was checked against the max_frame_size setting instead of being checked against the bufsize. The max_frame_size only applies to outgoing traffic and not to incoming one, so if a large enough frame size is advertised in the SETTINGS frame, a wrapped frame will be defragmented into a temporary allocated buffer where the second fragment my overflow the heap by up to 16 kB. It is very unlikely that this can be exploited for code execution given that buffers are very short lived and their address not realistically predictable in production, but the likeliness of an immediate crash is absolutely certain. This fix must be backported to 1.8. Many thanks to Jordan Zebor from F5 Networks for reporting this issue in a responsible way.	2018-04-19 10:35:30 +02:00
Willy Tarreau	b2e290acb6	BUG/MEDIUM: h2: always add a stream to the send or fctl list when blocked When a stream blocks on a mux buffer full/unallocated or on connection flow control, a flag among H2_SF_MUX_M* is set, but the stream is not always added to the connection's list. It's properly done when the operations are performed from the connection handler but not always when done from the stream handler. For instance, a simple shutr or shutw may fail by lack of room. If it's immediately followed by a call to h2_detach(), the stream remains lying around in no list at all, and prevents the connection from ending. This problem is actually quite difficult to trigger and seems to require some large objects and low server-side timeouts. This patch covers all identified paths. Some are redundant but since the code will change and will be simplified in 1.9, it's better to stay on the safe side here for now. It must be backported to 1.8.	2018-03-30 17:43:49 +02:00
Willy Tarreau	1a1dd6066f	BUG/MINOR: h2: remove accidental debug code introduced with show_fd function Commit `e3f36cd` ("MINOR: h2: implement a basic "show_fd" function") accidently brought one surrounding debugging part that was in the same context. No backport needed.	2018-03-30 17:41:19 +02:00
Willy Tarreau	e3f36cd479	MINOR: h2: implement a basic "show_fd" function The purpose here is to dump some information regarding an H2 connection, and a few statistics about its streams. The output looks like this : 35 : st=0x55(R:PrA W:PrA) ev=0x00(heopi) [lc] cache=0 owner=0x7ff49ee15e80 iocb=0x588a61(conn_fd_handler) tmask=0x1 umask=0x0 cflg=0x00201366 fe=decrypt mux=H2 mux_ctx=0x7ff49ee16f30 st0=2 flg=0x00000002 fctl_cnt=0 send_cnt=33 tree_cnt=33 orph_cnt=0 - st0 is the connection's state (FRAME_H here) - flg is the connection's flags (MUX_MFULL here) - fctl_cnt is the number of streams in the fctl_list - send_cnt is the number of streams in the send_list - tree_cnt is the number of streams in the streams_by_id tree - orph_cnt is the number of orphaned streams (cs==0) in the tree	2018-03-30 14:43:13 +02:00
Willy Tarreau	3041fcc2fd	BUG/MEDIUM: h2: don't consider pending data on detach if connection is in error Interrupting an h2load test shows that some connections remain active till the client timeout. This is due to the fact that h2_detach() immediately returns if the h2s flags indicate that the h2s is still waiting for some buffer room in the output mux (possibly to emit a response or to send some window updates). If the connection is broken, these data will never leave and must not prevent the stream from being terminated nor the connection from being released. This fix must be backported to 1.8.	2018-03-29 15:41:32 +02:00
Willy Tarreau	0975f11d55	BUG/MEDIUM: h2/threads: never release the task outside of the task handler Currently, h2_release() will release all resources assigned to the h2 connection, including the timeout task if any. But since the multi-threaded scheduler, the timeout task could very well be queued in the thread-local list of running tasks without any way to remove it, so task_delete() will have no effect and task_free() will cause this undefined object to be dereferenced. In order to prevent this from happening, we never release the task in h2_release(), instead we wake it up after marking its context NULL so that the task handler can release the task. Future improvements could consist in modifying the scheduler so that a task_wakeup() has to be done on any task having to be killed, letting the scheduler take care of it. This fix must be backported to 1.8. This bug was apparently not reported so far.	2018-03-29 15:22:59 +02:00
Willy Tarreau	71049cce3f	MINOR: h2: fuse h2s_detach() and h2s_free() into h2s_destroy() Since these two functions are always used together, let's simplify the code by having a single one for both operations. It also ensures we don't leave wandering elements that risk to leak later.	2018-03-29 13:22:15 +02:00
Willy Tarreau	e323f3458c	MINOR: h2: always call h2s_detach() in h2_detach() The code is safer and more robust this way, it avoids multiple paths. This is possible due to the idempotence of LIST_DEL() and eb32_delete() that are called in h2s_detach().	2018-03-29 13:22:15 +02:00
Willy Tarreau	4a333d3d53	BUG/MAJOR: h2: remove orphaned streams from the send list before closing Several people reported very strange occasional crashes when using H2. Every time it appeared that either an h2s or a task was corrupted. The outcome is that a missing LIST_DEL() when removing an orphaned stream from the list in h2_wake_some_streams() can cause this stream to remain present in the send list after it was freed. This may happen when receiving a GOAWAY frame for example. In the mean time the send list may be processed due to pending streams, and the just released stream is still found. If due to a buffer full condition we left the h2_process_demux() loop before being able to process the pending stream, the pool entry may be reassigned somewhere else. Either another h2 connection will get it, or a task, since they are the same size and are shared. Then upon next pass in h2_process_mux(), the stream is processed again. Either it crashes here due to modifications, or the contents are harmless to it and its last changes affect the other object reasigned to this area (typically a struct task). In the case of a collision with struct task, the LIST_DEL operation performed on h2s corrupts the task's wait queue's leaf_p pointer, thus all the wait queue's structure. The fix consists in always performing the LIST_DEL in h2s_detach(). It will also make h2s_stream_new() more robust against a possible future situation where stream_create_from_cs() could have sent data before failing. Many thanks to all the reporters who provided extremely valuable information, traces and/or cores, namely Thierry Fournier, Yves Lafon, Holger Amann, Peter Lindegaard Hansen, and discourse user "slawekc". This fix must be backported to 1.8. It is probably better to also backport the following code cleanups with it as well to limit the divergence between master and 1.8-stable : `00dd078` CLEANUP: h2: rename misleading h2c_stream_close() to h2s_close() `0a10de6` MINOR: h2: provide and use h2s_detach() and h2s_free()	2018-03-29 13:22:15 +02:00
Willy Tarreau	8adae7c15f	BUG/MINOR: h2: ensure we can never send an RST_STREAM in response to an RST_STREAM There are some corner cases where this could happen by accident. Since the spec explicitly forbids this (RFC7540#5.4.2), let's add a test in the two only functions which make the RST to avoid this. Thanks to user klzgrad for reporting this problem. Usually it is expected to be harmless but may result in browsers issuing a warning. This fix must be backported to 1.8.	2018-03-22 17:37:05 +01:00
Willy Tarreau	d1023bbab3	BUG/MEDIUM: h2: properly account for DATA padding in flow control Recent fixes made to process partial frames broke the flow control on DATA frames, as the padding is not considered anymore, only the actual data is. Let's simply take account of the padding once the transfer ends. The probability to meet this bug is low because, when used, padding is small and it can require a large number of padded transfers before the window is completely depleted. Thanks to user klzgrad for reporting this bug and confirming the fix. This fix must be backported to 1.8.	2018-03-22 16:53:12 +01:00
Willy Tarreau	84b118f312	BUG/MEDIUM: h2: also arm the h2 timeout when sending Right now the h2 idle timeout is only set when there is no stream. If we fail to send because the socket buffers are full (generally indicating the client has left), we also need to arm it so that we can properly expire such connections, otherwise some failed transfers might leave H2 connections pending forever. Thanks to Thierry Fournier for the diag and the traces. This patch needs to be backported to 1.8.	2018-03-08 18:43:56 +01:00
Willy Tarreau	44e973f508	MEDIUM: h2: use a single buffer allocator We used to have one buffer allocator per direction while we can never block on two buffers at once. Let's have a single one and rely on the connection's flags to know which one we're waitinf for.	2018-03-01 17:58:15 +01:00
Willy Tarreau	0a10de6066	MINOR: h2: provide and use h2s_detach() and h2s_free() These ones save us from open-coding the cleanup functions on each and every error path. The code was updated to use them with no functional change.	2018-03-01 16:35:01 +01:00
Willy Tarreau	00dd07895a	CLEANUP: h2: rename misleading h2c_stream_close() to h2s_close() This function takes an h2c and an h2s but it never uses the h2c, which is a bit confusing at some places in the code. Let's make it clear that it only operates on the h2s instead by renaming it and removing the unused h2c argument.	2018-03-01 16:31:34 +01:00
Willy Tarreau	35a62705df	BUG/MEDIUM: h2: always consume any trailing data after end of output buffers In case a stream tries to emit more data than advertised by the chunks or content-length headers, the extra data remains in the channel's output buffer until the channel's timeout expires. It can easily happen when sending malformed error files making use of a wrong content-length or having extra CRLFs after the empty chunk. It may also be possible to forge such a bad response using Lua. The H1 to H2 encoder must protect itself against this by marking the data presented to it as consumed if it decides to discard them, so that the sending stream doesn't wait for the timeout to trigger. The visible effect of this problem is a huge memory usage and a high concurrent connection count during benchmarks when using such bad data (a typical place where this easily happens). This fix must be backported to 1.8.	2018-02-27 15:37:25 +01:00
Christopher Faulet	929b52d8a1	BUG/MINOR: h2: Set the target of dbuf_wait to h2c In h2_get_dbuf, when the buffer allocation was failing, dbuf_wait.target was errornously set to the connection (h2c->conn) instead of the h2 connection descriptor (h2c). This patch must be backported to 1.8.	2018-02-26 17:33:16 +01:00
Tim Duesterhus	66888f907c	CLEANUP: h2: Remove unused labels from mux_h2.c This removes the unused next_header_block and try_again labels from mux_h2.c. try_again is unused as of `a76e4c2183`, which first appeared in haproxy 1.8.0. next_header_block is unused as of `872855998b`, which was backported to haproxy 1.8.0 as 59fcb216085a7aa9744cffe39567c80de4ebd6bf.	2018-02-20 08:30:13 +01:00
Olivier Houchard	6fa63d9852	MINOR: early data: Don't rely on CO_FL_EARLY_DATA to wake up streams. Instead of looking for CO_FL_EARLY_DATA to know if we have to try to wake up a stream, because it is waiting for a SSL handshake, instead add a new conn_stream flag, CS_FL_WAIT_FOR_HS. This way we don't have to rely on CO_FL_EARLY_DATA, and we will only wake streams that are actually waiting.	2018-02-05 14:24:50 +01:00
Willy Tarreau	4a28da1e9d	BUG/MEDIUM: h2: properly handle the END_STREAM flag on empty DATA frames Peter Lindegaard Hansen reported a problem affecting some POST requests sent by MSIE on 1.8.3. Lukas found that we incorrectly dealt with the END_STREAM flag on empty DATA frames. What happens in fact is that while we correctly report that we've read a zero-byte frame, since commit `8fc016d` ("BUG/MEDIUM: h2: support uploading partial DATA frames") backported into 1.8.2, we've been able to return without updating the parser's state nor checking the frame flags in this case. The fix is trival, we just need not to return too early. This fix must be backported to 1.8.	2018-01-04 14:41:00 +01:00
Willy Tarreau	8ec140604a	MEDIUM: h2: prepare a graceful shutdown when the frontend is stopped During a reload operation, instead of keeping the H2 connections opened forever causing confusion during configuration changes, let's send a graceful shutdown so that the client knows that it would better open a new connection for future requests. We can't really catch the signal from H2, but we can advertise this graceful shutdown upon the next I/O event (eg: a WINDOW_UPDATE from the client or a new request). One of the visible effect is that the old process quits much faster. This patch should be backported to 1.8 since it is affected by this problem.	2017-12-30 18:08:13 +01:00
Willy Tarreau	d790143d99	BUG/MEDIUM: h2: ensure we always know the stream before sending a reset The recent patch introducing the H2_CS_FRAME_E state to emit stream resets was not totally correct in that in the rare case where there is no room left to emit the reset, the next call to process it later could use an uninitialized stream. This only affects responses to frames that are sent on closed streams though. This fix must be backported to 1.8.	2017-12-29 11:34:40 +01:00
Willy Tarreau	ab83750a29	BUG/MEDIUM: h2: improve handling of frames received on closed streams The h2spec utility found certain situations where we're returning an RST_STREAM while a GOAWAY is expected. While we can't always reliably decide which one to use (eg: after a stream has been closed for a long time), in practice we often still have the stream available until it's destroyed at the application level. This provides the flags we need to verify the conditions that led to its closure, namely if RST was sent or received, or if it was regularly closed using a double ES. The first step consists in marking all closed streams as having already sent an RST_STREAM frame. This will ensure that we can send an RST_STREAM for a late transmission on a stream we have forgotten about instead of risking to break the connection. The next steps consist in re-arranging the H2_SS_CLOSED checks so that we can deliver a GOAWAY frame for the few cases where an unexpected frame was received after a double ES. By carefully taking care of these specificities, we can reduce by 4 the number of remaining compliance issues. Note: some tests start to become a bit long and to be repeated at various places. Probably that adding a bitmask of allowed/forbidden frame types per state and/or per situation could significantly help. It's likely that some deeper tests in the frame handlers could also be removed now as they can't be triggered anymore. This fix should be backported to 1.8.	2017-12-27 18:44:22 +01:00
Willy Tarreau	a20a519b8f	BUG/MEDIUM: h2: properly handle and report some stream errors Some stream errors applied to half-closed and closed streams are not properly reported, especially after the stream transistions to the closed state. The reason is that the code checks for this "error" stream state in order to send an RST frame. But if the stream was just closed or was already closed, there's no way to validate this condition, and the error is never reported to the peer. In order to address this situation, we'll add a new FRAME_E demux state which indicates that the previously parsed frame triggered a stream error of type STREAM CLOSED that needs to be reported. Proceeding like this will ensure that we don't lose that information even if we can't immediately send the message. It also removes the confusion where FRAME_A could be used either for ACKs or for RST. The state transition has been added after every h2s_error() on the demux path. It seems that we might need to have two distinct h2s_error() functions, one for the mux and another one for the demux, though it would provide little benefit. It also becomes more apparent that the H2_SS_ERROR state is only used to detect the need to report an error on the mux direction. Maybe this will have to be revisited later. This simple change managed to eliminate 5 bugs reported by h2spec. This fix must be backported to 1.8.	2017-12-27 18:34:50 +01:00
Willy Tarreau	28f1cb9da2	MINOR: mux: add flags to describe a mux's capabilities This new field will be used to describe certain properties of some muxes. For now we only add MX_FL_CLEAN_ABRT to indicate that a mux is able to unambiguously report aborts using CS_FL_ERROR contrary to others who may only report it via a read0. This will be used to improve handling of the abortonclose option with H2. Other flags may come later to report multiplexing capabilities or not, support of client/server sides etc.	2017-12-20 16:31:30 +01:00
Willy Tarreau	2153d3ce73	BUG/MINOR: h2: properly report a stream error on RST_STREAM We want to report such an error since H2 allows to differenciate between an end of stream and an abort. To be backported to 1.8.	2017-12-20 14:38:19 +01:00
Willy Tarreau	91bfdd7e04	BUG/MEDIUM: h2: fix stream limit enforcement Commit `4974561` ("BUG/MEDIUM: h2: enforce the per-connection stream limit") implemented a stream limit enforcement on the connection but it was not correctly done as it would count streams still known by the connection, which includes the lingering ones that are already marked close. We need to count only the non-closed ones, which this patch does. The effect is that some streams are rejected a bit before the limit. This fix needs to be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	13e4e94dae	BUG/MEDIUM: h2: don't close after the first DATA frame on tunnelled responses Tunnelled responses are those without a content-length nor a chunked encoding. They are specially dealt with in the current code but the behaviour is not correct. The fact that the chunk size is left to zero with a state artificially set to CHUNK_SIZE validates the test on whether or not to set the end of stream flag. Thus the first DATA frame always carries the ES flag and subsequent ones remain blocked. This patch fixes it in two ways : - update h1m->curr_len to the size of the current buffer so that it is properly subtracted later to find the real end ; - don't set the state to CHUNK_SIZE when there's no content-length and instead set it to CHUNK_SIZE only when there's chunking. This fix needs to be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	c4134ba8b0	BUG/MEDIUM: h2: don't switch the state to HREM before end of DATA frame We used to switch the stream's state to HREM when seeing and ES bit on the DATA frame before actually being able to process that frame, possibly resulting in the DATA frame being processed after the stream was seen as half-closed and possibly being rejected. The state must not change before the frame is really processed. Also fixes a harmless typo in the flag name which should have DATA and not HEADERS in its name (but all values are equal). Must be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	6847262211	MINOR: h2: don't demand that a DATA frame is complete before processing it Since last commit it's not required that the DATA frames are complete anymore so better start with what we have. Only the HEADERS frame requires this. This may be backported as part of the upload fixes.	2017-12-14 13:43:52 +01:00
Willy Tarreau	8fc016d0fe	BUG/MEDIUM: h2: support uploading partial DATA frames We currently have a problem with DATA frames when they don't fit into the destination buffer. While it was imagined that in theory this never happens, in practice it does when "option http-buffer-request" is set, because the headers don't leave the target buffer before trying to read so if the frame is full, there's never enough room. This fix consists in reading what can be read from the frame and advancing the input buffer. Once the contents left are only the padding, the frame is completely processed. This also solves another problem we had which is that it was possible to fill a request buffer beyond its reserve because the <count> argument was not respected in h2_rcv_buf(). Thus it's possible that some POST requests sent at once with a headers+body filling exactly a buffer could result in "400 bad req" when trying to add headers. This fix must be backported to 1.8.	2017-12-14 13:43:52 +01:00
Willy Tarreau	05e5dafe9a	MINOR: h2: store the demux padding length in the h2c struct We'll try to process partial frames and for this we need to know the padding length. The first step requires to extract it during the parsing and store it in the demux context in the connection. Till now it was only processed at once.	2017-12-14 13:43:52 +01:00
Willy Tarreau	d13bf27e78	BUG/MEDIUM: h2: debug incoming traffic in h2_wake() Even after previous commit ("BUG/MEDIUM: h2: work around a connection API limitation") there is still a problem with some requests. Sometimes when polling for more request data while some pending data lies in the buffer, there's no way to enter h2_recv() because the FD is not marked ready for reading. We need to slightly change the approach and make h2_recv() only receive from the buffer and h2_wake() always attempt to demux if the demux is not blocked. However, if the connection is already being polled for reading, it will not wake up from polling. For this reason we need to cheat and also pretend a request for sending data, which ensures that as soon as any direction may move, we can continue to demux. This shows that in the long term we probably need a better way to resume an interrupted operation at the mux level. With this fix, no more hangups happen during uploads. Note that this time the setup required to provoke the hangups was a bit complex : - client is "curl" running on local host, uploading 1.7 MB of data via haproxy - haproxy running on local host, forwarding to a remote server through a 100 Mbps only switch - timeouts disabled on haproxy - remote server made of thttpd executing a cgi reading request data through "dd bs=10" to slow down everything. With such a setup, around 3-5% of the connections would hang up. This fix needs to be backported to 1.8.	2017-12-14 13:43:24 +01:00
Willy Tarreau	6042aeb1e8	BUG/MEDIUM: h2: work around a connection API limitation The connection API permits us to enable or disable receiving on a connection. The underlying FD layer arranges this with the polling and the fd cache. In practice, if receiving was allowed and an end of buffer was reached, the FD is subscribed to the polling. If later we want to process pending data from the buffer, we have to enable receiving again, but since it's already enabled (in polled mode), nothing happens and the pending data remain stuck until a new event happens on the connection to wake the FD up. This is a limitation of the internal connection API which is not very friendly to the new mux architecture. The visible effect is that certain uploads to slow servers experience truncation on timeout on their last blocks because nothing new comes from the connection to wake it up while it's being polled. In order to work around this, there are two solutions : - either cheat on the connection so that conn_update_xprt_polling() always performs a call to fd_may_recv() after fd_want_recv(), that we can trigger from the mux by always calling conn_xprt_stop_recv() before conn_xprt_want_recv(), but that's a bit tricky and may have side effects on other parts (eg: SSL) - or we refrain from receiving in the mux as soon as we're busy on anything else, regardless of whether or not some room is available in the receive buffer. This patch takes the second approach above. This way once we read some data, as soon as we detect that we're stuck, we immediately stop receiving. This ensures the event doesn't go into polled mode for this period and that as soon as we're unstuck we can continue. In fact this guarantees that we can only wait on one side of the mux for a given direction. A future improvement of the connection layer should make it possible to resume processing of an interrupted receive operation. This fix must be backported to 1.8.	2017-12-14 13:43:24 +01:00
Willy Tarreau	315d807cbc	BUG/MEDIUM: h2: enable recv polling whenever demuxing is possible In order to allow demuxing when the dmux buffer is full, we need to enable data receipt in multiple conditions. Since the conditions are a bit complex, they have been delegated to a new function h2_recv_allowed() which follows these rules : - if an error or a shutdown was detected on the connection and the buffer is empty, we must not attempt to receive - if the demux buf failed to be allocated, we must not try to receive and we know there is nothing pending - if the buffer is not full, we may attempt to receive - if no flag indicates a blocking condition, we may attempt to receive - otherwise must may not attempt No more truncated payloads are detected in tests anymore, which seems to indicate that the issue was worked around. A better connection API will have to be created for new versions to make this stuff simpler and more intuitive. This fix needs to be backported to 1.8 along with the rest of the patches related to CS_FL_RCV_MORE.	2017-12-10 22:17:57 +01:00
Willy Tarreau	c9ede6c43e	BUG/MEDIUM: h2: automatically set CS_FL_RCV_MORE when the output buffer is full If we can't demux pending data due to a stream buffer full condition, we now set CS_FL_RCV_MORE on the conn_stream so that the stream layer knows it must call back as soon as possible to restart demuxing. Without this, some uploaded payloads are truncated if the server does not consume them fast enough and buffers fill up. Note that this is still not enough to solve the problem, some changes are required on the recv() and update_poll() paths to allow to restart reading even with a buffer full condition. This patch must be backported to 1.8.	2017-12-10 21:28:43 +01:00
Willy Tarreau	0249219be8	BUG/MEDIUM: h2: fix handling of end of stream again Commit `9470d2c` ("BUG/MINOR: h2: try to abort closed streams as soon as possible") tried to address the situations where a stream is closed by the client, but caused a side effect which is that in some cases, a regularly closed stream reports an error to the stream layer. The reason is that we purposely matched H2_SS_CLOSED in the test for H2_SS_ERROR to report this so that we can check for RST, but it accidently catches certain end of transfers as well. This results in valid requests to report flags "CD" in the logs. Instead, let's roll back to detecting H2_SS_ERROR and explicitly check for a received RST. This way we can correctly abort transfers without mistakenly reporting errors in normal situations. This fix needs to be backported to 1.8 as the fix above was merged into 1.8.1.	2017-12-07 19:20:35 +01:00
Willy Tarreau	7912781a30	BUG/MINOR: h2: use the H2_F_DATA_* macros for DATA frames A typo resulted in H2_F_HEADERS_* being used there, but it's harmless as they are equal. Better fix the confusion though. Should be backported to 1.8.	2017-12-03 21:09:38 +01:00
Willy Tarreau	92153fccd3	BUG/MINOR: h2: properly check PRIORITY frames We don't use them right now but it's better to ensure they're properly checked. This removes another 3 warnings in h2spec. To backport to 1.8.	2017-12-03 21:08:43 +01:00
Willy Tarreau	18b86cd074	BUG/MINOR: h2: reject incorrect stream dependencies on HEADERS frame We currently don't use stream dependencies, but as reported by h2spec, the spec requires that we reject streams that depend on themselves in HEADERS frames. To backport to 1.8.	2017-12-03 21:08:42 +01:00
Willy Tarreau	1b38b46ab7	BUG/MINOR: h2: do not accept SETTINGS_ENABLE_PUSH other than 0 or 1 We don't use yet it but for correctness, let's enforce the check. To backport to 1.8.	2017-12-03 21:08:42 +01:00
Willy Tarreau	497456154e	BUG/MEDIUM: h2: enforce the per-connection stream limit h2spec reports that we unfortunately didn't enforce the per-connection stream limit that we advertise. It's important to ensure it's never crossed otherwise it's cheap for a client to create many streams. This requires the addition of a stream count. The h2c struct could be cleaned up a bit, just like the h2_detach() function where an "if" block doesn't make sense anymore since it's always true. To backport to 1.8.	2017-12-03 21:08:42 +01:00
Willy Tarreau	68ed64148a	BUG/MINOR: h2: fix a typo causing PING/ACK to be responded to The ACK flag was tested on the frame type instead of the frame flag. To backport to 1.8.	2017-12-03 21:08:41 +01:00
Willy Tarreau	9470d2cd35	BUG/MINOR: h2: try to abort closed streams as soon as possible The purpose here is to be able to signal receipt of RST_STREAM to streams when they start to provide a response so that the response can be aborted ASAP. Given that RST_STREAM immediately switches the stream to the CLOSED state, we must check for CLOSED in addition to the existing ERROR check. To be backported to 1.8.	2017-12-03 21:08:41 +01:00
Willy Tarreau	11cc2d6031	BUG/MINOR: h2: immediately close if receiving GOAWAY after the last stream The h2spec test suite reveals that a GOAWAY frame received after the last stream doesn't cause an immediate close, because we count on the last stream to quit to do so. By simply setting the last_sid to the received value in case it was not set, we can ensure to properly close an idle connection during h2_wake(). To be backported to 1.8.	2017-12-03 21:08:40 +01:00
Willy Tarreau	872855998b	BUG/MEDIUM: h2: don't report an error after parsing a 100-continue response Yves Lafon reported a breakage with 100-continue. In fact the problem is caused when an 1xx is the last response in the buffer (which commonly is the case). We loop back immediately into the parser with what remains of the input buffer (ie: nothing), while it is not expected to be called with an empty response, so it fails. Let's simply get back to the caller to decide whether or not more data are expected to be sent. This fix needs to be backported to 1.8.	2017-11-29 15:41:32 +01:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Willy Tarreau	599391a7c2	MINOR: h2: make use of client-fin timeout after GOAWAY At the moment, the "client" timeout is used on an HTTP/2 connection once it's idle with no active stream. With this patch, this timeout is replaced by client-fin once a GOAWAY frame is sent. This closely matches what is done on HTTP/1 since the principle is the same, as it indicates a willing ness to quickly close a connection on which we don't expect to see anything anymore.	2017-11-24 10:16:00 +01:00
Willy Tarreau	a76e4c2183	MEDIUM: h2: don't gracefully close the connection anymore on Connection: close As reported by Lukas, it causes more harm than good, for example on prompt for authentication. Now we have an "http-request reject" rule to use instead of "http-request deny" if we absolutely want to close the connection.	2017-11-24 08:17:28 +01:00
Willy Tarreau	90c3232e54	MINOR: h2: send RST_STREAM before GOAWAY on reject Apparently the h2c client has trouble reading the RST_STREAM frame after a GOAWAY was sent, so it's likely that other clients may face the same difficulty. Curl and Firefox don't care about this ordering, so let's send it first.	2017-11-24 08:00:30 +01:00
Olivier Houchard	7fc96d5a01	MINOR: mux: Make sure every string is woken up after the handshake. In case any stream was waiting for the handshake after receiving early data, we have to wake all of them. Do so by making the mux responsible for removing the CO_FL_EARLY_DATA flag after all of them are woken up, instead of doing it in si_cs_wake_cb(), which would then only work for the first one. This makes wait_for_handshake work with HTTP/2.	2017-11-23 19:35:42 +01:00
Willy Tarreau	541dd82879	BUG/MAJOR: h2: always remove a stream from the send list before freeing it When a stream is aborted on timeout or any reason initiated by the stream, and this stream was subscribed to the send list, we forgot to detach it when freeing it, resulting in a dead node remaining present in the send list with all usual funny consequences (memory corruption, crashes, etc). Let's simply unconditionally delete the stream.	2017-11-23 18:12:50 +01:00
Willy Tarreau	59a10fb53d	MEDIUM: h2: change hpack_decode_headers() to only provide a list of headers The current H2 to H1 protocol conversion presents some issues which will require to perform some processing on certain headers before writing them so it's not possible to convert HPACK to H1 on the fly. This commit modifies the headers decoding so that it now works in two phases : hpack_decode_headers() only decodes the HPACK stream in the HEADERS frame and puts the result into a list. Headers which require storage (huffman-compressed or from the dynamic table) are stored in a chunk allocated by the H2 demuxer. Then once the headers are properly decoded into this list, h2_make_h1_request() is called with this list to produce the HTTP/1.1 request into the destination buffer. The list necessarily enforces a limit. Here we use 2*MAX_HTTP_HDR, which means that we can have as many individual cookies as we have regular headers if a client decides to break their cookies into multiple values. This seams reasonable and will allow the H1 parser to decide whether it's too much or not. Thus the output stream is not produced on the fly anymore and this will permit to deal with certain corner cases like reparing the Cookie header (which for now is not done). In order to limit header duplication and parsing, the known pseudo headers continue to be passed by their index : the name element in the list then has a NULL pointer and the value is the pseudo header's index. Given that these ones represent about half of the incoming requests and need to be found quickly, it maintains an acceptable level of performance. The code was significantly reduced by doing this because the orignal code had to deal with HPACK and H1 combinations (eg: index vs not indexed, etc) and now the HPACK decoding is totally focused on the decompression, and the H1 encoding doesn't have to deal with the issue of wrapping input for example. One bug was addressed here (though it couldn't happen at the moment). The H2 demuxer used to detect a failure to write the request into the H1 buffer and would then detect if the output buffer wraps, realign it and try again. The problem by doing so was that the HPACK context was already modified and not rewindable. Thus the size check is now performed first and a failure is reported if it doesn't fit.	2017-11-21 21:13:36 +01:00
Willy Tarreau	8f650c369d	BUG/MEDIUM: h2: properly report connection errors in headers and data handlers We used to return >0 indicating a success when an error was present on the connection, preventing the caller from detecting and handling it. This for example happens when sending too many headers in a frame, making the request impossible to decompress.	2017-11-21 19:36:21 +01:00
Willy Tarreau	1f09467114	BUILD: h2: mark some inlined functions "unused" Clang complains that h2_get_n64() is not used, and a few other protocol specific functions may fall in that category depending on how the code evolves. Better mark them unused to silence the warning since it's on purpose.	2017-11-20 21:27:45 +01:00
Willy Tarreau	28b55c6fed	CLEANUP: mux: remove the unused "release()" function In commit `53a4766` ("MEDIUM: connection: start to introduce a mux layer between xprt and data") we introduced a release() function which ends up never being used. Let's get rid of it now.	2017-11-10 16:43:05 +01:00
Willy Tarreau	22cf59bbba	BUG/MEDIUM: h2: support orphaned streams When a stream_interface performs a shutw() then a shutr(), the stream is marked closed. Then cs_destroy() calls h2_detach() and it cannot fail since we're on the leaving path of the caller. The problem is that in order to close streams we usually have to send either an emty DATA frame with the ES flag set or an RST_STREAM frame, and the mux buffer might already be full, forcing the stream to be queued. The forced removal of this stream causes this last message to silently disappear, and the client to wait forever for a response. This commit ensures we can detach the conn_stream from the h2 stream if the stream is blocked, effectively making the h2 stream an orphan, ensures that the mux can deal with orphaned streams after processing them, and that the demux can kill them upon receipt of GOAWAY.	2017-11-10 11:48:15 +01:00
Willy Tarreau	8c0ea7d21a	BUG/MEDIUM: h2: split the function to send RST_STREAM There is an issue with how the RST_STREAM frames are sent. Some of them are sent from the demux, either for valid or for closed streams, and some are sent from the mux always for valid streams. At the moment the demux stream ID is used, which is wrong for all streams being muxed, and sometimes results in certain bad HTTP responses causing the emission of an RST_STREAM referencing stream zero. In addition, the stream's blocked flags could be updated even if the stream was the closed or idle ones. We really need to split the function for the two distinct use cases where one is used to send an RST on a condition detected at the connection level (such as a closed stream) and the other one is used to send an RST for a condition detected at the stream level. The first one is used only in the demux, and the other one only by a valid stream.	2017-11-10 10:05:24 +01:00
Willy Tarreau	a87f202b49	BUG/MEDIUM: h2: reject non-3-digit status codes If the H1 parser would report a status code length not consisting in exactly 3 digits, the error case was confused with a lack of buffer room and was causing the parser to loop infinitely.	2017-11-09 11:23:00 +01:00
Willy Tarreau	926fa4c098	BUG/MINOR: h2: don't send GOAWAY on failed response As part of the detection for intentional closes, we can kill the connection if a shutw() happens before the headers. But it can also happen that an invalid response is not properly parsed, preventing any headers frame from being sent and making the function believe it was an abort. Now instead we check if any response was received from the stream, regardless of the fact that it was properly converted.	2017-11-07 14:47:04 +01:00
Willy Tarreau	c4312d3dfd	MINOR: h2: add new stream flag H2_SF_OUTGOING_DATA This one indicates whether we've received data to mux out. It helps make the difference between a clean close and a an erroneous one.	2017-11-07 14:47:04 +01:00
Willy Tarreau	58e3208714	BUG/MINOR: h2: correctly check for H2_SF_ES_SENT before closing In h2_shutw() we must not send another empty frame (nor RST) after one has been sent, as the stream is already in HLOC/CLOSED state.	2017-11-07 14:47:04 +01:00
Willy Tarreau	6d8b682f9a	BUG/MEDIUM: h2: properly set H2_SF_ES_SENT when sending the final frame When sending DATA+ES, it's important to set H2_SF_ES_SENT as we don't want to emit is several times nor to send an RST afterwards.	2017-11-07 14:47:04 +01:00
Willy Tarreau	e6ae77f64f	MINOR: h2: don't re-enable the connection's task when we're closing It's pointless to requeue the task when we're closing, so swap the order of the task_queue() and h2_release(). It also matches what was written in the comment regarding re-arming the timer.	2017-11-07 14:47:04 +01:00
Willy Tarreau	83906c2f91	BUG/MEDIUM: h2: don't close the connection is there are data left h2_detach() is called after a stream was closed, and it evaluates if it's worth closing the connection. The issue there is that the connection is closed too early in case there's demand for closing after the last stream, even if some data remain in the mux. Let's change the condition to check for this.	2017-11-07 14:47:04 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Willy Tarreau	7d8e4af46a	BUG/MEDIUM: h2: fix some wrong error codes on connections When the assignment of the connection state was moved into h2c_error(), 3 of them were missed because they were wrong, using H2_SS_ERROR instead. This resulted in the connection's state being set to H2_CS_ERROR2 in fact, so the error was not properly sent.	2017-11-07 11:08:28 +01:00
Willy Tarreau	721c974e5e	MEDIUM: h2: remove the H2_SS_RESET intermediate state This one was created to maintain the knowledge that a stream was closed after having sent an RST_STREAM frame but that's not needed anymore and it confuses certain conditions on the error processing path. It's time to get rid of it.	2017-11-07 11:05:42 +01:00
Willy Tarreau	319994a2e9	BUG/MEDIUM: h2: don't try (and fail) to send non-existing data in the mux The call to xprt->snd_buf() was not conditionned on the presence of data in the buffer, resulting in snd_buf() returning 0 and never disabling the polling. It was revealed by the previous bug on error processing but must properly be handled.	2017-11-07 11:03:56 +01:00
Willy Tarreau	3eabe9b174	BUG/MEDIUM: h2: properly send the GOAWAY frame in the mux A typo on a condition prevented H2_CS_ERROR from being processed, leading to an infinite loop on connection error.	2017-11-07 11:03:01 +01:00
Willy Tarreau	c6795ca7c1	BUG/MEDIUM: h2: properly send an RST_STREAM on mux stream error Some stream errors are detected on the MUX path (eg: H1 response encoding). The ones forgot to emit an RST_STREAM frame, causing the client to wait and/or to see the connection being immediately closed. This is now fixed.	2017-11-07 09:43:06 +01:00
Willy Tarreau	6743420778	BUG/MINOR: h2: set the "HEADERS_SENT" flag on stream, not connection This flag was added after the GOAWAY flags were introduced and mistakenly placed in the connection, but that doesn't make sense as it's specific to the stream. The main impact is the risk of returning a DATA0+ES frame for an error instead of an RST_STREAM.	2017-11-06 20:20:51 +01:00
Willy Tarreau	3340029b97	BUG/MAJOR: h2: set the connection's task to NULL when no client timeout is set If "timeout client" is missing from the frontend, the task is not initialized, causing a crash on connection teardown.	2017-11-05 11:23:40 +01:00
Willy Tarreau	f13ef96e70	BUG/MEDIUM: h2: don't try to parse incomplete H1 responses This situation which must not happen does in fact happen when feeding artificial responses using errorfiles, Lua or an applet. For now it causes the H1 response parser to loop forever trying to get a more complete response. Since it cannot progress, let's return an error.	2017-11-02 15:53:04 +01:00
Willy Tarreau	3f133570b8	BUG/MEDIUM: h2: fix incorrect timeout handling on the connection Previous commit ea3928 (MEDIUM: h2: apply a timeout to h2 connections) was wrong for two reasons. The first one is that if the client timeout is not set, it's used as zero, preventing connections from establishing. The second reason is that if the timeout triggers with active streams (normally it should not since the task is supposed to be disabled), the task is removed (h2c->task=NULL), and the last quitting stream might try to dereference it. Instead of doing this, we simply not register the task if there's no timeout (it's useless) and we always control its presence in the streams.	2017-10-31 19:21:06 +01:00
Willy Tarreau	ea39282e85	MEDIUM: h2: apply a timeout to h2 connections Till now there was no way to deal with a dead H2 connection. Now each connection creates a task that wakes up to kill the connection. Its timeout is constantly refreshed when there's some activity. In case the timeout triggers, the best effort attempts are made at sending a clean GOAWAY message before closing and signaling the streams. The timeout is automatically disabled when there's an active stream on the connection, and restarted when the last stream finishes. This way it should not affect long sessions.	2017-10-31 18:16:19 +01:00
Willy Tarreau	a1349f0207	MEDIUM: h2: send a GOAWAY frame when dealing with an empty response Given that we're processing data produced by haproxy, we know that the situations where haproxy doesn't return anything are : - request timeout with option http-ignore-probes : there's no reason to hit this since we're creating the stream with the request into it ; - tcp-request content reject : this definitely means we want to kill the connection and abort keep-alive and any further processing ; - using /dev/null as the error file to hide an error In practice it appears that using the abort on empty response as a hint to trigger a connection close is very appropriate to continue to give the control over the connection management. This patch thus tries to send a GOAWAY frame with the max_id presented as the last stream ID, then sends an RST_STREAM for the current stream. For the client, this means that the connection must be shut down immediately after processing the last pending streams and that the current stream is aborted. This way it's still possible to force connections to be closed using tcp-request rules.	2017-10-31 18:16:19 +01:00
Willy Tarreau	af1e4f5167	MEDIUM: h2: perform a graceful shutdown on "Connection: close" After some long brainstorming sessions, it appears that "Connection: close" seems to be the best signal from the L7 layer to indicate the need to close the connection. Indeed, in H1 it is only present in very rare cases (eg: certain unrecoverable errors, some of which could remove it now by the way). It will also be added when the L7 layer wants to force the connection to terminate. By default when running in keep-alive mode it is not present. It's worth mentionning that in H1 with persistent connections, we have sort of a concurrency-1 mux and this header field is used the same way. Thus here this patch detects "Connection: close" in response headers and if seen, sends a GOAWAY frame with the highest possible ID so that the client knows that it can quit whenever it wants to. If more aggressive closures are needed in the future, we may decide to advertise the max_id to abort after the current requests and better honor "http-request deny".	2017-10-31 18:16:19 +01:00
Willy Tarreau	1c661986a8	MINOR: h2: properly reject PUSH_PROMISE frames coming from the client These ones deserve a connection error as per 5.1.	2017-10-31 18:16:19 +01:00
Willy Tarreau	c0da1964ba	MEDIUM: h2: silently ignore frames higher than last_id after GOAWAY For a graceful shutdown, the specs requries to discard frames with a stream ID higher than the advertised last_id. (RFC7540#6.8). Well, finally for now the code is disabled (see last page of #6.8). Some frames need to be processed anyway to maintain the compression state and the flow control window state, but we don't have any trivial way to do this and ignore them at the same time. For the headers it's the worst case where we can't parse headers frames without coming from the streams, and we don't want to create such streams as we'd have to abort them, and aborting would cause errors to flow back. Possibly that a longterm solution might involve using some dummy streams and dummy buffers for this and calling the parsers directly.	2017-10-31 18:16:19 +01:00
Willy Tarreau	f182a9a8b4	MINOR: h2: centralize the check for the half-closed(remote) streams RFC7540#5.1 is pretty clear : "any frame other than WINDOW_UPDATE, PRIORITY, or RST_STREAM in this state MUST be treated as a connection error of type STREAM_CLOSED". Instead of dealing with this for each and every frame type, let's do it once for all in the main demux loop.	2017-10-31 18:16:19 +01:00
Willy Tarreau	f65b80dd47	MINOR: h2: centralize the check for the idle streams RFC7540#5.1 is pretty clear : "any frame other than HEADERS or PRIORITY in this state MUST be treated as a connection error". Instead of dealing with this for each and every frame type, let's do it once for all in the main demux loop.	2017-10-31 18:16:19 +01:00
Willy Tarreau	e96b0922e9	MEDIUM: h2: handle GOAWAY frames The ID is respected, and only IDs greater than the advertised last_id are woken up, with a CS_FL_ERROR flag to signal that the stream is aborted. This is necessary for a browser to abort a download or to reject a bad response that affects the connection's state.	2017-10-31 18:16:19 +01:00
Willy Tarreau	23b92aa2bb	MINOR: h2: use a common function to signal some and all streams. Let's replace h2_wake_all_streams() with h2_wake_some_streams(), to support signaling only streams by their ID (for GOAWAY frames) and to pass the flags to add on the conn_stream.	2017-10-31 18:16:19 +01:00
Willy Tarreau	c7576eac46	MEDIUM: h2: send DATA+ES or RST_STREAM on shutw/shutr When a stream sends a shutw, we send an empty DATA frame with the ES flag set, except if no HEADERS were sent, in which case we rather send RST_STREAM. On shutr(1) to abort a request, an RST_STREAM frame is sent if the stream is OPEN and the stream is closed. Care is taken to switch the stream's state accordingly and to avoid sending an ES bit again or another RST once already done.	2017-10-31 18:16:19 +01:00
Willy Tarreau	cd234e9fb0	MINOR: h2: handle RST_STREAM frames These ones are received when the browser aborts a page load, it's the only moment we can abort the stream.	2017-10-31 18:16:19 +01:00
Willy Tarreau	454f905084	MEDIUM: h2: handle request body in DATA frames Data frames are received and transmitted. The per-connection and per-stream amount of data to ACK is automatically updated. Each DATA frame is ACKed because usually the downstream link is large and the upstream one is small, so it seems better to waste a few bytes every few kilobytes to maintain a low ACK latency and help the sender keep the link busy. The connection's ACK however is sent at the end of the demux loop and at the beginning of the mux loop so that a single aggregated one is emitted (connection windows tend to be much larger than stream windows). A future improvement would consist in sending a single ACK for multiple subsequent DATA frames of the same stream (possibly interleaved with window updates frames), but this is much trickier as it also requires to remember the ID of the stream for which DATA frames have to be sent. Ideally in the near future we should chunk-encode the body sent to HTTP/1 when there's no content length and when the request is not a CONNECT. It's just uncertain whether it's the best option or not for now.	2017-10-31 18:16:19 +01:00
Willy Tarreau	cc0b8c34a6	MEDIUM: h2: send WINDOW_UPDATE frames for connection When it is detected that the number of received bytes is > 0 on the connection at the end of the demux call or before starting to process pending output data, an attempt is made at sending a WINDOW UPDATE on the connection. In case of failure, it's attempted later.	2017-10-31 18:16:19 +01:00
Willy Tarreau	c199faf5bd	MEDIUM: h2: properly continue to parse header block when facing a 1xx response We still didn't handle the 1xx responses properly.	2017-10-31 18:16:19 +01:00
Willy Tarreau	9d89ac8f42	MEDIUM: h2: skip the response trailers if any For now we don't build a HEADERS frame with them, but at least we remove them from the response so that the L7 chunk parser inside isn't blocked on these (often two) remaining bytes that don't want to leave the buffer. It also ensures that trailers delivered progressively will correctly be skipped.	2017-10-31 18:16:19 +01:00
Willy Tarreau	c652dbde9d	MEDIUM: h2: send the H1 response body as DATA frames The H1 response data are processed (either following content-length or chunks) and emitted as H2 DATA frames. In the case of content-length, the maximum size permitted by the mux buffer, the max frame size, the connection's window and the stream's window it used to determine the frame size. For chunked encoding, the same limitation applies, but in addition, each chunk leads to a distinct frame. This could be improved in the future to aggregate chunks into larger frames. Streams blocked on the connection's flow control subscribe to the connection's fctl_list to be woken up when the window opens again. Streams blocked on their own flow control don't subscribe to anything, they just sit waiting for window update frames to reopen the window. The connection-close mode (without content-length) partially works thanks to the fact that the SHUTW event leads to a close of the stream. In practice an empty DATA frame should be sent in this case though.	2017-10-31 18:16:19 +01:00
Willy Tarreau	9e5ae1d721	MEDIUM: h2: implement the response HEADERS frame to encode the H1 response This calls the h1 response parser and feeds the output through the hpack encoder to produce stateless HPACK bytecode into an output chunk. For now it's a bit naive but reasonably efficient. The HPACK encoder relies on hpack_encode_header() so that the most common response header fields are encoded based on the static header table. The forbidden header field names (connection, proxy-connection, upgrade, transfer-encoding, keep-alive) are dropped before calling the hpack encoder. A new flag (H2_CF_HEADERS_SENT) is set once such a frame is emitted. It will be used to know if we can send an empty DATA+ES frame to use as a shutdown() signal or if we have to use RST_STREAM.	2017-10-31 18:16:19 +01:00
Willy Tarreau	68dd9856ce	MEDIUM: h2: don't use trash to decode headers! The trash is already used by the hpack layer and for Huffman decoding, it's unsafe to use here as a buffer and results in corrupted data. Use a safely allocated trash instead.	2017-10-31 18:16:18 +01:00
Willy Tarreau	13278b44b1	MEDIUM: h2: basic processing of HEADERS frame This takes care of creating a new h2s and a new conn_stream when a HEADERS frame arrives. The recv() callback from the data layer is then called to extract the frame into the stream's buffer. It is verified that the stream ID is strictly greater than the known max stream ID. And the last_id is updated if the current request is properly converted. The streams are created in open or half-closed(remote) states. For now there are some limitations : - frames without END_HEADERS are rejected (CONTINUATION not supported yet, will require some more changes so that the stream processor checks the H2 frame header by itself and steals the frames from the connection) - padding/stream_dep/priority are currently ignored - limited error handling, could be improved But at least the request is properly decoded, transcoded and processed.	2017-10-31 18:16:18 +01:00
Willy Tarreau	45f752e037	MEDIUM: h2: unblock a connection when its current stream detaches If a stream is killed for whatever reason and it happens to be the one currently blocking the connection, we must unblock the connection and enable polling again so that it can attempt to make progress. This may happen for example on upload timeout, where the demux is blocked due to a full stream buffer, and the stream dies on server timeout and quits.	2017-10-31 18:16:18 +01:00
Willy Tarreau	6093514933	MEDIUM: h2: partial implementation of h2_detach() This does the very minimum required to release a stream and/or a connection upon the stream's request. The only thing is that it doesn't kill the connection unless it's already closed or in error or the stream ID reached the one specified in GOAWAY frame. We're supposed to arm a timer to close after some idle timeout but it's not done.	2017-10-31 18:16:18 +01:00
Willy Tarreau	61290ec774	MINOR: h2: handle CONTINUATION frames For now we have nowhere to store partial header frames so we can't handle CONTINUATION frames and we must reject them. In this case we respond with a stream error of type INTERNAL_ERROR.	2017-10-31 18:16:18 +01:00
Willy Tarreau	27a84c90ce	MINOR: h2: implement h2_send_rst_stream() to send RST_STREAM frames This one sends an RST_STREAM for a given stream, using the current demux stream ID. It's also used to send RST_STREAM for streams which have lost their CS part (ie were aborted).	2017-10-31 18:16:18 +01:00
Willy Tarreau	26f95954fe	MEDIUM: h2: honor WINDOW_UPDATE frames Now they really increase the window size of connections and streams. If a stream was not queued but requested to send, it means it was flow-controlled so it's added again into the connection's send list.	2017-10-31 18:16:18 +01:00
Willy Tarreau	f3ee0697f3	MINOR: h2: lookup the stream during demuxing Several stream-oriented functions will need to perform this lookup, so better centralize it.	2017-10-31 18:16:18 +01:00
Willy Tarreau	3421aba3de	MEDIUM: h2: decode SETTINGS frames and extract relevant settings The INITIAL_WINDOW_SIZE and MAX_FRAME_SIZE settings are now extracted from the settings frame, assigned to the connection, and attempted to be propagated to all existing streams as per the specification. In practice clients rarely update the settings after sending the first stream, so the propagation will rarely be used. The ACK is properly sent after the frame is completely parsed.	2017-10-31 18:16:18 +01:00
Willy Tarreau	cf68c787ae	MINOR: h2: implement PING frames Now we can detect and properly parse PING frames as well as emit a response containing the same payload.	2017-10-31 18:16:18 +01:00
Willy Tarreau	7e98c057ff	MINOR: h2: create a stream parser for the demuxer The function h2_process_demux() now tries to parse the incoming bytes to process as many streams as possible. For now it does nothing but dropping all incoming frames.	2017-10-31 18:16:18 +01:00
Willy Tarreau	4c3690bf96	MEDIUM: h2: detect the presence of the first settings frame Instead of doing a special processing of the first SETTINGS frame, we simply parse its header, check that it matches the expected frame type and flags (ie no ACK), and switch to FRAME_P to parse it as any regular frame. The regular frame parser will take care of decoding it.	2017-10-31 18:16:18 +01:00
Willy Tarreau	be5b715fb2	MINOR: h2: send a real SETTINGS frame based on the configuration An initial settings frame is emitted upon receipt of the connection preface, which takes care of configured values. These settings are only emitted when they differ from the protocol's default value : - header_table_size (defaults to 4096) - initial_window_size (defaults to 65535) - max_concurrent_streams (defaults to unlimited) - max_frame_size (defaults to 16384) The max frame size is a copy of tune.bufsize. Clients will most often reject values lower than 16384 and currently there's no trivial way to check if H2 is going to be used at boot time.	2017-10-31 18:16:18 +01:00
Willy Tarreau	bacdf5a49b	MEDIUM: h2: process streams pending for sending The send() callback calls h2_process_mux() which iterates over the list of flow controlled streams first, then streams waiting for room in the send_list. If a stream from the send_list ends up being flow controlled, it is then moved to the fctl_list. This way we can maintain the most accurate fairness by ensuring that flows are always processed in order of arrival except when they're blocked by flow control, in which case only the other ones may pass in front of them. It's a bit tricky as we want to remove a stream from the active lists if it doesn't block (ie it has no reason for staying there).	2017-10-31 18:16:18 +01:00
Willy Tarreau	d7739c8820	MEDIUM: h2: enable reading again on the connection if it was blocked on stream buffer full If the polling update function is called with RD_ENA while H2_CF_DEM_SFULL indicates the demux had to block on a stream buffer full condition, we can remove the flag and re-enable polling for receiving because this is the indication that a consumer stream has made some room in the buffer. Probably that we should improve this to ensure that h2s->id == h2c->dsi and avoid trying to receive multiple times in a row for the wrong stream.	2017-10-31 18:16:18 +01:00
Willy Tarreau	1d393228e0	MEDIUM: h2: enable connection polling for send when a cs wants to emit A conn_stream indicates its intent to send by setting the WR_ENA flag and calling mux->update_poll(). There's no synchronous write so the only way to emit a response from a stream is to proceed this way. The sender h2s is then queued into the h2c's send_list if it was not yet queued. Once the connection is ready, it will enter its send() callback to visit writers, calling their data->send_cb() callback to complete the operation using mux->snd_buf(). Also we enable polling if the mux contains data and wasn't enabled. This may happen just after a response has been transmitted using chk_snd(). It likely is incomplete for now and should probably be refined.	2017-10-31 18:16:18 +01:00
Willy Tarreau	52eed75ced	MINOR: h2: match the H2 connection preface on init The H2 preface is properly detected to switch to the settings state. It's important to note that for now we don't send out settings frame so the operation is not complete yet.	2017-10-31 18:16:18 +01:00
Willy Tarreau	081d472f79	MINOR: h2: add a function to send a GOAWAY error frame For now it's only used to report immediate errors by announcing the highest known stream-id on the mux's error path. The function may be used both while processing a stream or directly in relation with the connection. The wake() callback will automatically ask for send access if an error is reported. The function should be usable for graceful shutdowns as well by simply setting h2c->last_sid to the highest acceptable stream-id (2^31-1) prior to calling the function. A connection flag (H2_CF_GOAWAY_SENT) is set once the frame was successfully sent. It will be usable to detect when it's safe to close the connection. Another flag (H2_CF_GOAWAY_FAILED) is set in case of unrecoverable error while trying to send. It will also be used to know when it's safe to close the connection.	2017-10-31 18:16:18 +01:00
Willy Tarreau	bc933930a7	MEDIUM: h2: start to implement the frames processing loop The rcv_buf() callback now calls h2_process_demux() after an recv() call leaving some data in the buffer, and the snd_buf() callback calls h2_process_mux() to try to process pending data from streams.	2017-10-31 18:16:18 +01:00
Willy Tarreau	5160683fc7	MEDIUM: h2: wake the connection up for send on pending streams If some streams were blocked on flow control and the connection's window was recently opened, or if some streams are waiting while no block flag remains, we immediately want to try to send again. This can happen if a recv() for a stream wants to send after the send() loop has already been processed.	2017-10-31 18:16:17 +01:00
Willy Tarreau	29a9824144	MEDIUM: h2: properly consider all conditions for end of connection During h2_wake(), there are various situations that can lead to the connection being closed : - low-level connection error - read0 received - fatal error (ERROR2) - failed to emit a GOAWAY - empty stream list with max_id >= last_sid In such cases, all streams are notified and we have to wait for all streams to leave while doing nothing, or if the last stream is gone, we can simply terminate the connection. It's important to do this test there again because an error might arise while trying to send a pending GOAWAY after the last stream for example, thus there's possibly no way to get notified of a closing stream.	2017-10-31 18:16:17 +01:00
Willy Tarreau	26bd761f01	MINOR: h2: also terminate the connection on shutr It happens that an H2 mux is totally unusable once the client has shut, so we must consider this situation equivalent to the connection error, and let the possible streams drain their data if needed then stop.	2017-10-31 18:16:17 +01:00
Willy Tarreau	fbe3b4fcbe	MEDIUM: h2: start to consider the H2_CF_{MUX,DEM}_* flags for polling Now we start to set the flags to indicate that the response buffer is being awaited or that it is full, it makes it possible to centralize a little bit the polling management into the wake() callback. In case of error, we wake all the streams up so that they are aware of the nature of the event and are able to detach if needed.	2017-10-31 18:16:17 +01:00
Willy Tarreau	1b62c5caef	MINOR: h2: update the {MUX,DEM}_{M,D}ALLOC flags on buffer availability Flag H2_CF_DEM_DALLOC is set when the demux buffer fails to be allocated in the recv() callback, and is cleared when it succeeds. Both flags H2_CF_MUX_MALLOC and H2_CF_DEM_MROOM are cleared when the mux buffer allocation succeeds. In both cases it will be up to the callers to report allocation failures.	2017-10-31 18:16:17 +01:00
Willy Tarreau	3ccf4b2a20	MINOR: h2: add the function to create a new stream This one will be used by the HEADERS frame handler and maybe later by the PUSH frame handler. It creates a conn_stream in the mux's connection. The create streams are inserted in the h2c's tree sorted by IDs. The caller is expected to have verified that the stream doesn't exist yet.	2017-10-31 18:16:17 +01:00
Willy Tarreau	2a8561895d	MINOR: h2: create dummy idle and closed streams It will be more convenient to always manipulate existing streams than null pointers. Here we create one idle stream and one closed stream. The idea is that we can easily point any stream to one of these states in order to merge maintenance operations.	2017-10-31 18:15:51 +01:00
Willy Tarreau	2373acc384	MINOR: h2: add stream lookup function based on the stream ID The function performs a simple lookup in the tree and returns either the matching h2s or NULL if not found.	2017-10-31 18:12:14 +01:00
Willy Tarreau	54c150653d	MINOR: h2: add a few functions to retrieve contents from a wrapping buffer Functions h2_get_buf_n{16,32,64}() and h2_get_buf_bytes() respectively extract a network-ordered 16/32/64 bit value from a possibly wrapping buffer, or any arbitrary size. They're convenient to retrieve a PING payload or to parse SETTINGS frames. Since they copy one byte at a time, they will be less efficient than a memcpy-based implementation on large blocks.	2017-10-31 18:12:14 +01:00
Willy Tarreau	715d5316e5	MINOR: h2: new function h2_peek_frame_hdr() to retrieve a new frame header This function extracts the next frame header but doesn't consume it. This will allow to detect a stream-id change and to perform a yielding window update without losing information. The result is stored into a temporary frame descriptor. We could also store the next frame header into the connection but parsing the header again is much cheaper than wasting bytes in the connection for a rare use case. A function (h2_skip_frame_hdr()) is also provided to skip the parsed header (always 9 bytes) and another one (h2_get_frame_hdr()) to do both at once.	2017-10-31 18:12:14 +01:00
Willy Tarreau	e482074c96	MINOR: h2: add h2_set_frame_size() to update the size in a binary frame This function is called after preparing a frame, in order to update the frame's size in the frame header. It takes the frame payload length in argument. It simply writes a 24-bit frame size into a buffer, making use of the net_helper functions which try to optimize per platform (this is a frequently used operation).	2017-10-31 18:12:14 +01:00
Willy Tarreau	2e43f08c60	MINOR: h2: new function h2s_error() to mark an error on a stream This one will store the error into the stream's errcode if it's neither idle nor closed (since these ones are read-only) and switch its state to H2_SS_ERROR. If a conn_stream is attached, it will be flagged with CS_FL_ERROR.	2017-10-31 18:12:14 +01:00
Willy Tarreau	741d6df870	MINOR: h2: new function h2c_error to mark an error on the connection This one sets the error code in h2c->errcode and changes the connection's stat to H2_CS_ERROR.	2017-10-31 18:12:14 +01:00
Willy Tarreau	5b5e68741a	MINOR: h2: small function to know when the mux is busy A mux is busy when any stream id >= 0 is currently being handled and the current stream's id doesn't match. When no stream is involved (ie: demuxer), stream 0 is considered. This will be necessary to know when it's possible to send frames.	2017-10-31 18:12:14 +01:00
Willy Tarreau	71681174f3	MINOR: h2: add function h2s_id() to report a stream's ID This one supports being called with NULL and returns 0 in this case, making it easier to check for stream IDs in various send functions.	2017-10-31 18:12:14 +01:00
Willy Tarreau	2e5b60ee18	MINOR: h2: add the connection and stream flags listing the causes for blocking A demux may be prevented from receiving for the following reasons : - no receive buffer could be allocated - the receive buffer is full - a response is needed and the mux is currently being used by a stream - a response is needed and some room could not be found in the mux buffer (either full or waiting for allocation) - the stream buffer is waiting for allocation - the stream buffer is full A mux may stop accepting data for the following reasons : - the buffer could not be allocated - the buffer is full A stream may stop sending data to a mux for the following reaons : - the mux is busy processing another stream - the mux buffer lacks room (full or not allocated) - the mux's flow control prevents from sending - the stream's flow control prevents from sending All these conditions were turned into flags for use by the respective places.	2017-10-31 18:12:14 +01:00
Willy Tarreau	1439812da8	MEDIUM: h2: implement the mux buffer allocator The idea is that we may need a mux buffer for anything, ranging from receiving to sending traffic. For now it's unclear where exactly the calls will be placed so let's block both send and recv when a buffer is missing, and re-enable both of them at the end. This will have to be changed later.	2017-10-31 18:12:14 +01:00
Willy Tarreau	35dbd5d719	MEDIUM: h2: dynamically allocate the demux buffer on Rx This patch implements a very basic Rx buffer management. The mux needs an rx buffer to decode the connection's stream. If this buffer it available upon Rx events, we fill it with whatever input data are available. Otherwise we try to allocate it and subscribe to the buffer wait queue in case of failure. In such a situation, a function "h2_dbuf_available()" will be called once a buffer may be allocated. The buffer is released if it's still empty after recv().	2017-10-31 18:12:14 +01:00
Willy Tarreau	a2af51291f	MEDIUM: h2: implement basic recv/send/wake functions For now they don't do much since the buffers are not yet allocated, but the squeletton is here.	2017-10-31 18:12:14 +01:00

... 7 8 9 10 11 ...

859 Commits