haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-22 03:11:32 +01:00

Author	SHA1	Message	Date
Christopher Faulet	46e058dda5	BUG/MEDIUM: mux-h1: Adjust conditions to ask more space in the channel buffer When a message is parsed and copied into the channel buffer, in h1_process_demux(), more space is requested if some pending data remain after the parsing while the channel buffer is not empty. To do so, CS_FL_WANT_ROOM flag is set. It means the H1 parser needs more space in the channel buffer to continue. In the stream-interface, when this flag is set, the SI is considered as blocked on the RX path. It is only unblocked when some data are sent. However, it is not accurrate because the parsing may be stopped because there is not enough data to continue. For instance in the middle of a chunk size. In this case, some data may have been already copied but the parser is blocked because it must receive more data to continue. If the calling SI is blocked on RX at this stage when the stream is waiting for the payload (because http-buffer-request is set for instance), the stream remains stuck infinitely. To fix the bug, we must request more space to the app layer only when it is not possible to copied more data. Actually, this happens when data remain in the input buffer while the H1 parser is in states MSG_DATA or MSG_TUNNEL, or when we are unable to copy headers or trailers into a non-empty buffer. The first condition is quite easy to handle. The second one requires an API refactoring. h1_parse_msg_hdrs() and h1_parse_msg_tlrs() fnuctions have been updated. Now it is possible to know when we need more space in the buffer to copy headers or trailers (-2 is returned). In the H1 mux, a new H1S flag (H1S_F_RX_CONGESTED) is used to track this state inside h1_process_demux(). This patch is part of a series related to the issue #1362. It should be backported as far as 2.0, probably with some adaptations. So be careful during backports.	2021-09-23 16:13:17 +02:00
Christopher Faulet	216d3352b1	BUG/MINOR: h1-htx: Fix a typo when request parser is reset In h1_postparse_req_hdrs(), if we need more space to copy headers, the request parser is reset. However, because of a typo, it was reset as a response parser instead of a request one. h1m_init_req() must be called. This patch must be backported as far as 2.2.	2021-09-23 16:10:36 +02:00
Christopher Faulet	52a5ec2d18	BUG/MEDIUM: mux-h1: Remove "Upgrade:" header for requests with payload Instead of returning a 501-Not-implemented error when "Ugrade:" header is found for a request with a payload, the header is removed. This way, the upgrade is disabled and the request is still sent to the server. It is required because some frameworks seem to try to perform H2 upgrade on every requests, including POST ones. The h2 mux was slightly fixed to convert Upgrade requests to extended connect ones only if the rigth HTX flag is set. This patch should fix the issue #1381. It must be backported to 2.4.	2021-09-10 09:17:51 +02:00
Amaury Denoyelle	852d78c232	MEDIUM: h1-htx: apply scheme-based normalization on h1 requests Apply the rfc 3986 scheme-based normalization on h1 requests. It is executed only for requests which uses absolute-form target URI, which is not the standard case.	2021-07-07 15:34:01 +02:00
Ilya Shipitsin	213bb99f9e	CLEANUP: assorted typo fixes in the code and comments This is 24th iteration of typo fixes	2021-06-17 09:02:16 +02:00
Christopher Faulet	bf76df12a6	BUG/MINOR: h1-htx: Fix a signess bug with char data type when parsing chunk size On some platform, a char may be unsigned. Of course, we should not rely on the signess of a char to be portable. Unfortunatly, since the commit a835f3cb ("MINOR: h1-htx: Use a correlation table to speed-up small chunks parsing") we rely on it to test the value retrieved from the hexadecimal correlation table when the size of a chunk is parsed. To fix the bug, we now test the result is in the range [0,15] with a bitwise AND. This patch should fix the issue #1272. It is 2.5-specific, no backport is needed except if the commit above is backported.	2021-06-11 14:15:48 +02:00
Christopher Faulet	00d7cde551	MINOR: muxes/h1-htx: Realign input buffer using b_slow_realign_ofs() Input buffers have never output data. So, use b_slow_realign_ofs() function instead of b_slow_realign(). It is a slighly simpler function. And in the H1 mux, it allows a realign by setting the input buffer head to permit zero-copies.	2021-05-25 10:41:50 +02:00
Christopher Faulet	7a835f3cb0	MINOR: h1-htx: Use a correlation table to speed-up small chunks parsing Instead of using hex2i() to convert an hexa digit to an integer in the function parsing small chunks, we now use a table because it is faster.	2021-05-25 10:41:50 +02:00
Christopher Faulet	bdcefe58b7	MEDIUM: h1-htx: Add a function to parse contiguous small chunks Add h1_parse_full_contig_chunks() function to parse full contiguous chunks. This function neither handles incomplete chunks nor wrapping buffers. It is designed to efficiently parse a buffer with several small chunks. Of course, there is no zero copy here because it is not possible. This function is a bit tricky and all changes may a have a impact. This one may probably be optimized, but it is good enough for now and not too complex. The main function (h1_parse_msg_chunks) always tries to use this function when the HTTP parser is waiting for a chunk size. In this case, there is no zero-copy, so there is no reason to call the generic version to parse the chunk. However, if some unparsed data remain after this step, the generic function is called. This way, wrapping data and incomplete chunks may be parsed. Quick tests show it is now slightly faster in all cases than the legacy mode.	2021-05-25 10:41:50 +02:00
Christopher Faulet	0d4c924c34	MEDIUM: h1-htx: Split function to parse a chunk and the loop on the buffer A generic function is now used to only parse the current chunk (h1_parse_chunk) and the main one (h1_parse_msg_chunks) is used to loop on the buffer and relies on the first one. This change is mandatory to be able to use an optimized function to parse contiguous small chunks.	2021-05-25 10:41:50 +02:00
Christopher Faulet	140691baf9	MINOR: h1-htx: Move HTTP chunks parsing into a dedicated function Chunked data are now parsed in a dedicated function. This way, it will be possible to have two functions to parse chunked messages. The current one for messages with large chunks and an other one to parse messages with small chunks. The parsing of small chunks is really sensitive because it may be used as a DoS attack. So we must be carefull to have an optimized function to parse such messages.	2021-05-25 10:41:50 +02:00
Christopher Faulet	f7c2044f8f	MEDIUM: h1-htx: Adapt H1 data parsing to copy wrapping data in one call Since the beginning, wrapping input data are parsed and copied in 2 steps to not deal with the wrapping in H1 parsing functions. But there is no reason to do so. This needs 2 calls to parsing functions. This also means, most of time, when the input buffer does not wrap, there is an extra call for nothing. Thus, now, the data parsing functions try to copy as much data as possible, handling wrapping buffer if necessary.	2021-05-25 10:41:50 +02:00
Christopher Faulet	de471a4a8d	MINOR: h1-htx: Update h1 parsing functions to return result as a size_t h1 parsing functions (h1_parse_msg_*) returns the number of bytes parsed or 0 if nothing is parsed because an error occurred or some data are missing. But they never return negative values. Thus, instead of a signed integer, these function now return a size_t value. The H1 and FCGI muxes are updated accordingly. Note that h1_parse_msg_data() has been slightly adapted because the parsing of chunked messages still need to handle negative values when a parsing error is reported by h1_parse_chunk_size() or h1_skip_chunk_crlf().	2021-05-25 10:41:50 +02:00
Christopher Faulet	2b78f0bfc4	CLEANUP: htx: Remove unsued hdrs_bytes field from the HTX start-line Thanks to the htx_xfer_blks() refactoring, it is now possible to remove hdrs_bytes field from the start-line because no function rely on it anymore.	2021-04-28 10:51:08 +02:00
Christopher Faulet	fb38c910f8	BUG/MINOR: mux-fcgi: Don't send normalized uri to FCGI application A normalized URI is the internal term used to specify an URI is stored using the absolute format (scheme + authority + path). For now, it is only used for H2 clients. It is the default and recommended format for H2 request. However, it is unusual for H1 servers to receive such URI. So in this case, we only send the path of the absolute URI. It is performed for H1 servers, but not for FCGI applications. This patch fixes the difference. Note that it is not a real bug, because FCGI applications should support abosolute URI. Note also a normalized URI is only detected for H2 clients when a request is received. There is no such test on the H1 side. It means an absolute URI received from an H1 client will be sent without modification to an H1 server or a FCGI application. To make it possible, a dedicated function has been added to get the H1 URI. This function is called by the H1 and the FCGI multiplexer when a request is sent to a server. This patch should fix the issue #1232. It must be backported as far as 2.2.	2021-04-26 10:23:18 +02:00
Christopher Faulet	405f054652	MINOR: h1: Raise the chunk size limit up to (2^52 - 1) The allowed chunk size was historically limited to 2GB to avoid risk of overflow. This restriction is no longer necessary because the chunk size is immediately stored into a 64bits integer after the parsing. Thus, it is now possible to raise this limit. However to never fed possibly bogus values from languages that use floats for their integers, we don't get more than 13 hexa-digit (2^52 - 1). 4 petabytes is probably enough ! This patch should fix the issue #1065. It may be backported as far as 2.1. For the 2.0, the legacy HTTP part must be reviewed. But there is honestely no reason to do so.	2021-01-28 16:37:14 +01:00
Christopher Faulet	d1ac2b90cd	MAJOR: htx: Remove the EOM block type and use HTX_FL_EOM instead The EOM block may be removed. The HTX_FL_EOM flags is enough. Most of time, to know if the end of the message is reached, we just need to have an empty HTX message with HTX_FL_EOM flag set. It may also be detected when the last block of a message with HTX_FL_EOM flag is manipulated. Removing EOM blocks simplifies the HTX message filling. Indeed, there is no more edge problems when the message ends but there is no more space to write the EOM block. However, some part are more tricky. Especially the compression filter or the FCGI mux. The compression filter must finish the compression on the last DATA block. Before it was performed on the EOM block, an extra DATA block with the checksum was added. Now, we must detect the last DATA block to be sure to finish the compression. The FCGI mux on its part must be sure to reserve the space for the empty STDIN record on the last DATA block while this record was inserted on the EOM block. The H2 multiplexer is probably the part that benefits the most from this change. Indeed, it is now fairly easier to known when to set the ES flag. The HTX documentaion has been updated accordingly.	2021-01-28 16:37:14 +01:00
Christopher Faulet	42432f347f	MINOR: htx: Rename HTX_FL_EOI flag into HTX_FL_EOM The HTX_FL_EOI flag is not well named. For now, it is not very used. But that will change. It will replace the EOM block. Thus, it is renamed.	2021-01-28 16:37:14 +01:00
Christopher Faulet	5be651d4d7	BUG/MAJOR: mux-h1/mux-h2/htx: Fix HTTP tunnel management at the mux level Tunnel management between the H1 and H2 multiplexers is a bit blurred. And the HTX is not enough well defined on this point to make things clear. In fact, Establishing a tunnel between an H2 client and an H1 server, or the opposite is buggy because the both multiplexers don't handle the EOM block the same way when a tunnel is established. In fact, the H2 multiplexer is pretty strict and add an END_STREAM flag when an EOM block is found, while the H1 multiplexer is more flexible. The purpose of this patch is to make the EOM block usage pretty clear and to fix the HTTP multiplexers to really handle HTTP tunnels in the right way. Now, an EOM block is used to mark the end of an HTTP message, semantically speaking. That means it may be followed by tunneled data. Thus, CONNECT requests are now finished by an EOM block, just after the EOH block. On the H1 multiplexer side, a tunnel is now only established on the response path. So a CONNECT request remains in a DONE state waiting for the 2xx response. On the H2 multiplexer side, a flag is used to know an HTTP tunnel is requested, to not immediately add the END_STREAM flag on the EOM block. All these changes are sensitives and not backportable because of recent changes. The same problem exists on earlier versions and should be addressed. But it will only be possible with a specific patchset. This patch relies on the following ones : * MEDIUM: mux-h1: Properly handle tunnel establishments and aborts * MEDIUM: mux-h2: Close streams when processing data for an aborted tunnel * MEDIUM: mux-h2: Block client data on server side waiting tunnel establishment * MINOR: mux-h2: Add 2 flags to help to properly handle tunnel mode * MINOR: mux-h1: Split H1C_F_WAIT_OPPOSITE flag to separate input/output sides * MINOR: mux-h1/mux-fcgi: Don't set TUNNEL mode if payload length is unknown	2021-01-28 16:37:14 +01:00
Christopher Faulet	1e857785e9	MINOR: mux-h1/mux-fcgi: Don't set TUNNEL mode if payload length is unknown Responses with no C-L and T-E headers are no longer switched in TUNNEL mode and remains in DATA mode instead. The H1 and FCGI muxes are updated accordingly. This change reflects the real message state. It is not a true tunnel. Data received are still part of the message. It is not a bug. However, this message may be backported after some observation period (at least as far as 2.2).	2021-01-28 16:37:14 +01:00
Christopher Faulet	576c358508	MINOR: htx/http-ana: Save info about Upgrade option in the Connection header Add an HTX start-line flag and its counterpart into the HTTP message to track the presence of the Upgrade option into the Connection header. This way, without parsing the Connection header again, it will be easy to know if a client asks for a protocol upgrade and if the server agrees to do so. It will also be easy to perform some conformance checks when a 101-switching-protocols is received.	2021-01-28 16:27:48 +01:00
Christopher Faulet	c75668ebff	BUG/MINOR: http: Establish a tunnel for all 2xx responses to a CONNECT As stated in the rfc7231, section 4.3.6, an HTTP tunnel via a CONNECT method is successfully established if the server replies with any 2xx status code. However, only 200 responses are considered as valid. With this patch, any 2xx responses are now considered to estalish the tunnel. This patch may be backported on demand to all stable versions and adapted for the legacy HTTP. It works this way since a very long time and nobody complains.	2020-12-18 15:05:57 +01:00
Christopher Faulet	c31bc724d4	MINOR: h1-htx/http-ana: Set BODYLESS flag on message in TUNNEL state When a H1 message is parsed, if the parser state is switched to TUNNEL mode just after the header parsing, the BODYLESS flag is set on the HTX start-line. By transitivity, the corresponding flag is set on the message in HTTP analysers. Thus it is possible to rely on it to not wait for the request body.	2020-12-04 14:41:49 +01:00
Christopher Faulet	810df06145	MEDIUM: htx: Add a flag on a HTX message when no more data are expected The HTX_FL_EOI flag must now be set on a HTX message when no more data are expected. Most of time, it must be set before adding the EOM block. Thus, if there is no space for the EOM, there is still an information to know all data were received and pushed in the HTX message. There is only an exception for the HTTP replies (deny, return...). For these messages, the flag is set after all blocks are pushed in the message, including the EOM block, because, on error, we remove all inserted data.	2020-07-22 16:43:32 +02:00
Ilya Shipitsin	47d17182f4	CLEANUP: assorted typo fixes in the code and comments This is 10th iteration of typo fixes	2020-06-26 11:27:28 +02:00
Willy Tarreau	36979d9ad5	REORG: include: move the error reporting functions to from log.h to errors.h Most of the files dealing with error reports have to include log.h in order to access ha_alert(), ha_warning() etc. But while these functions don't depend on anything, log.h depends on a lot of stuff because it deals with log-formats and samples. As a result it's impossible not to embark long dependencies when using ha_warning() or qfprintf(). This patch moves these low-level functions to errors.h, which already defines the error codes used at the same places. About half of the users of log.h could be adjusted, sometimes revealing other issues such as missing tools.h. Interestingly the total preprocessed size shrunk by 4%.	2020-06-11 10:18:59 +02:00
Willy Tarreau	6be7849f39	REORG: include: move cfgparse.h to haproxy/cfgparse.h There's no point splitting the file in two since only cfgparse uses the types defined there. A few call places were updated and cleaned up. All of them were in C files which register keywords. There is nothing left in common/ now so this directory must not be used anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	dfd3de8826	REORG: include: move stream.h to haproxy/stream{,-t}.h This one was not easy because it was embarking many includes with it, which other files would automatically find. At least global.h, arg.h and tools.h were identified. 93 total locations were identified, 8 additional includes had to be added. In the rare files where it was possible to finalize the sorting of includes by adjusting only one or two extra lines, it was done. But all files would need to be rechecked and cleaned up now. It was the last set of files in types/ and proto/ and these directories must not be reused anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c6fe884c74	REORG: include: move h1_htx.h to haproxy/h1_htx.h This one didn't have a type file. A few missing includes were added (htx, types).	2020-06-11 10:18:57 +02:00
Willy Tarreau	16f958c0e9	REORG: include: split common/htx.h into haproxy/htx{,-t}.h Most of the file was a large set of HTX elements manipulation functions and few types, so splitting them allowed to further reduce dependencies and shrink the build time. Doing so revealed that a few files (h2.c, mux_pt.c) needed haproxy/buf.h and were previously getting it through htx.h. They were fixed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	5413a87ad3	REORG: include: move common/h1.h to haproxy/h1.h The file was moved as-is. There was a wrong dependency on dynbuf.h instead of buf.h which was addressed. There was no benefit to splitting this between types and functions.	2020-06-11 10:18:57 +02:00
Willy Tarreau	cd72d8c981	REORG: include: split common/http.h into haproxy/http{,-t}.h So the enums and structs were placed into http-t.h and the functions into http.h. This revealed that several files were dependeng on http.h but not including it, as it was silently inherited via other files.	2020-06-11 10:18:57 +02:00
Willy Tarreau	58017eef3f	REORG: include: move the BUG_ON() code to haproxy/bug.h This one used to be stored into debug.h but the debug tools got larger and require a lot of other includes, which can't use BUG_ON() anymore because of this. It does not make sense and instead this macro should be placed into the lower includes and given its omnipresence, the best solution is to create a new bug.h with the few surrounding macros needed to trigger bugs and place assertions anywhere. Another benefit is that it won't be required to add include <debug.h> anymore to use BUG_ON, it will automatically be covered by api.h. No less than 32 occurrences were dropped. The FSM_PRINTF macro was dropped since not used at all anymore (probably since 1.6 or so).	2020-06-11 10:18:56 +02:00
Willy Tarreau	4c7e4b7738	REORG: include: update all files to use haproxy/api.h or api-t.h if needed All files that were including one of the following include files have been updated to only include haproxy/api.h or haproxy/api-t.h once instead: - common/config.h - common/compat.h - common/compiler.h - common/defaults.h - common/initcall.h - common/tools.h The choice is simple: if the file only requires type definitions, it includes api-t.h, otherwise it includes the full api.h. In addition, in these files, explicit includes for inttypes.h and limits.h were dropped since these are now covered by api.h and api-t.h. No other change was performed, given that this patch is large and affects 201 files. At least one (tools.h) was already freestanding and didn't get the new one added.	2020-06-11 10:18:42 +02:00
Christopher Faulet	76014fd118	MEDIUM: h1-htx: Add HTX EOM block when the message is in H1_MSG_DONE state During H1 parsing, the HTX EOM block is added before switching the message state to H1_MSG_DONE. It is an exception in the way to convert an H1 message to HTX. Except for this block, the message is first switched to the right state before starting to add the corresponding HTX blocks. For instance, the message is switched in H1_MSG_DATA state and then the HTX DATA blocks are added. With this patch, the message is switched to the H1_MSG_DONE state when all data blocks or trailers were processed. It is the caller responsibility to call h1_parse_msg_eom() when the H1_MSG_DONE state is reached. This way, it is far easier to catch failures when the HTX buffer is full. The H1 and FCGI muxes have been updated accordingly. This patch may eventually be backported to 2.1 if it helps other backports.	2019-12-11 16:46:16 +01:00
Christopher Faulet	53a899b946	CLEANUP: h1-htx: Move htx-to-h1 formatting functions from htx.c to h1_htx.c The functions "htx__to_h1()" have been renamed into "h1_format_htx_()" and moved in the file h1_htx.c. It is the right place for such functions.	2019-10-14 22:28:50 +02:00
Christopher Faulet	fe451fb9ef	MINOR: h1-htx: Set the flag HTX_SL_F_HAS_AUTHORITY during the request parsing When an h1 request is received and parsed, this flag is set if it is a CONNECT request or if an absolute URI is detected.	2019-10-09 11:05:31 +02:00
Christopher Faulet	af542635f7	MINOR: h1-htx: Update h1_copy_msg_data() to ease the traces in the mux-h1 This function now uses the address of the pointer to the htx message where the copy must be performed. This way, when a zero-copy is performed, there is no need to refresh the caller's htx message. It is a bit easier to do that way, especially to add traces in the mux-h1.	2019-10-04 15:46:59 +02:00
Christopher Faulet	cc3124cf44	MINOR: h1-htx: Use the same function to copy message payload in all cases This function will try to do a zero-copy transfer. Otherwise, it adds a data block. The same is used for messages with a content-length, chunked messages and messages with unknown body length.	2019-09-17 10:18:54 +02:00
Christopher Faulet	4f0f88a9d0	MEDIUM: mux-h1/h1-htx: move HTX convertion of H1 messages in dedicated file To avoid code duplication in the futur mux FCGI, functions parsing H1 messages and converting them into HTX have been moved in the file h1_htx.c. Some specific parts remain in the mux H1. But most of the parsing is now generic.	2019-09-17 10:18:54 +02:00

40 Commits