haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-17 20:46:58 +02:00

Author	SHA1	Message	Date
Willy Tarreau	e56cdd3629	MEDIUM: http: make the chunk size parser only depend on the buffer The chunk parser used to depend on the channel and on the HTTP message but it's not really needed as they're only used to retrieve the buffer as well as to return the number of bytes parsed and the chunk size. Here instead we pass the (few) relevant information in arguments so that the function may be reused without a channel nor an HTTP message (ie from the H2 to H1 gateway). As part of this API change, it was renamed to h1_parse_chunk_size() to mention that it doesn't depend on http_msg anymore.	2017-10-22 09:54:14 +02:00
Willy Tarreau	8740c8b1b2	REORG: http: move the HTTP/1 header block parser to h1.c Since it still depends on http_msg, it was not renamed yet.	2017-10-22 09:54:13 +02:00
Willy Tarreau	db4893d6a4	REORG: http: move the HTTP/1 chunk parser to h1.{c,h} Functions http_parse_chunk_size(), http_skip_chunk_crlf() and http_forward_trailers() were moved to h1.h and h1.c respectively so that they can be called from outside. The parts that were inline remained inline as it's critical for performance (+41% perf difference reported in an earlier test). For now the "http_" prefix remains in their name since they still depend on the http_msg type.	2017-10-22 09:54:13 +02:00
Willy Tarreau	0da5b3bddc	REORG: http: move some very http1-specific parts to h1.{c,h} Certain types and enums are very specific to the HTTP/1 parser, and we'll need to share them with the HTTP/2 to HTTP/1 translation code. Let's move them to h1.c/h1.h. Those with very few occurrences or only used locally were renamed to explicitly mention the relevant HTTP version : enum ht_state -> h1_state. http_msg_state_str -> h1_msg_state_str HTTP_FLG_* -> H1_FLG_* http_char_classes -> h1_char_classes Others like HTTP_IS_, HTTP_MSG_ are left to be done later.	2017-10-22 09:54:13 +02:00
Willy Tarreau	0621da5f5b	MINOR: buffer: make bo_getblk_nc() not return 2 for a full buffer Thus function returns the number of blocks. When a buffer is full and properly aligned, buf->p loops back the beginning, and the test in the code doesn't cover that specific case, so it returns two chunks, a full one and an empty one. It's harmless but can sometimes have a small impact on performance and definitely makes the code hard to debug.	2017-10-22 09:54:12 +02:00
Emeric Brun	5a1335110c	BUG/MEDIUM: log: check result details truncated. Fix regression introduced by commit: 'MAJOR: servers: propagate server status changes asynchronously.' The building of the log line was re-worked to be done at the postponed point without lack of data. [wt: this only affects 1.8-dev, no backport needed]	2017-10-19 18:51:32 +02:00
Willy Tarreau	e67c4e5744	MINOR: ist: add ist0() to add a trailing zero to a string. This function modifies the string to add a zero after the end, and returns the start pointer. The purpose is to use it on strings extracted by parsers from larger strings cut with delimiters that are not important and can be destroyed. It allows any such string to be used with regular string functions. It's also convenient to use with printf() to show data extracted from writable areas.	2017-10-19 15:01:08 +02:00
Willy Tarreau	41ab86898e	MINOR: channel: make the channel be a const in all {ci,co}_get* functions There's no point having the channel marked writable as these functions only extract data from the channel. The code was retrieved from their ci/co ancestors.	2017-10-19 15:01:08 +02:00
Willy Tarreau	6b3f353bcf	MINOR: channel: make use of bo_getblk{,_nc} for their channel equivalents Let's reuse the buffer-level functions to perform the operations.	2017-10-19 15:01:08 +02:00
Willy Tarreau	e0e734ccc5	MINOR: buffer: add bo_getblk() and bo_getblk_nc() These functions respectively extract a block from an output buffer by copying it or by just passing pointers and lengths for zero copy operation.	2017-10-19 15:01:08 +02:00
Willy Tarreau	06d80a9a9c	REORG: channel: finally rename the last bi_* / bo_* functions For HTTP/2 we'll need some buffer-only equivalent functions to some of the ones applying to channels and still squatting the bi_* / bo_* namespace. Since these names have kept being misleading for quite some time now and are really getting annoying, it's time to rename them. This commit will use "ci/co" as the prefix (for "channel in", "channel out") instead of "bi/bo". The following ones were renamed : bi_getblk_nc, bi_getline_nc, bi_putblk, bi_putchr, bo_getblk, bo_getblk_nc, bo_getline, bo_getline_nc, bo_inject, bi_putchk, bi_putstr, bo_getchr, bo_skip, bi_swpbuf	2017-10-19 15:01:08 +02:00
Willy Tarreau	5b9834f12a	MINOR: buffer: add buffer_space_wraps() This function returns true if the available buffer space wraps. This will be used to detect if it's worth realigning a buffer when it lacks contigous space.	2017-10-19 15:01:08 +02:00
Willy Tarreau	e5676e7103	MINOR: buffer: add two functions to inject data into buffers bi_istput() injects the ist string into the input region of the buffer, it will be used to feed small data chunks into the conn_stream. bo_istput() does the same into the output region of the buffer, it will be used to send data via the transport layer and assumes there's no input data.	2017-10-19 15:01:08 +02:00
Willy Tarreau	6634b63c78	MINOR: buffer: add a function to match against string patterns In order to match known patterns in wrapping buffer, we'll introduce new string manipulation functions for buffers. The new function b_isteq() relies on an ist string for the pattern and compares it against any location in the buffer relative to <p>. The second function bi_eat() is specially designed to match input contents.	2017-10-19 15:01:07 +02:00
Willy Tarreau	7f564d2b60	MINOR: buffer: add bo_del() to delete a number of characters from output This simply reduces the amount of output data from the buffer after they have been transferred, in a way that is more natural than by fiddling with buf->o. b_del() was renamed to bi_del() to avoid any ambiguity (it's not yet used).	2017-10-19 15:01:07 +02:00
Emeric Brun	253e53e661	BUG/MAJOR: lua: scheduled task is freezing. Since commit 'MAJOR: task: task scheduler rework' `0194897e54`. LUA's scheduling tasks are freezing. A running task should not handle the scheduling itself but let the task scheduler to handle it based on the 'expire' field. [wt: no backport needed]	2017-10-18 19:23:33 +02:00
Olivier Houchard	00bc3cb59f	BUG/MINOR: stats: Clear a bit more counters with in cli_parse_clear_counters(). Clear MaxSslRate, SslFrontendMaxKeyRate and SslBackendMaxKeyRate when clear counters is used, it was probably forgotten when those counters were added. [wt: this can probably be backported as far as 1.5 in dumpstats.c]	2017-10-18 18:36:08 +02:00
Willy Tarreau	dea7c5c03d	BUG/MINOR: tools: fix my_htonll() on x86_64 Commit `36eb3a3` ("MINOR: tools: make my_htonll() more efficient on x86_64") brought an incorrect asm statement missing the input constraints, causing the input value not necessarily to be placed into the same register as the output one, resulting in random output. It happens to work when building at -O0 but not above. This was only detected in the HTTP/2 parser, but in mainline it could only affect the integer to binary sample cast. No backport is needed since this bug was only introduced in the development branch.	2017-10-18 11:46:17 +02:00
Olivier Houchard	9130a9605d	MINOR: checks: Add a new keyword to specify a SNI when doing SSL checks. Add a new keyword, "check-sni", to be able to specify the SNI to be used when doing health checks over SSL.	2017-10-17 18:10:24 +02:00
Olivier Houchard	f8eb8d56a7	MINOR: server: Handle weight increase in consistent hash. When the server weight is rised using the CLI, extra nodes have to be allocated, or the weight will be effectively the same as the original one. [wt: given that the doc made no explicit mention about this limitation, this patch could even be backported as it fixes an unexpected behaviour]	2017-10-17 18:08:38 +02:00
Willy Tarreau	4ac4928718	BUG/MINOR: stream-int: don't set MSG_MORE on SHUTW_NOW without AUTO_CLOSE Since around 1.5-dev12, we've been setting MSG_MORE on send() on various conditions, including the fact that SHUTW_NOW is present, but we don't check that it's accompanied with AUTO_CLOSE. The result is that on requests immediately followed by a close (where AUTO_CLOSE is not set), the request gets delayed in the TCP stack before being sent to the server. This is visible with the H2 code where the end-of-stream flag is set on requests, but probably happens when a POLL_HUP is detected along with the request. The (lack of) presence of option abortonclose has no effect here since we never send the SHUTW along with the request. This fix can be backported to 1.7, 1.6 and 1.5.	2017-10-17 16:38:21 +02:00
Frederik Deweerdt	953917abc9	BUG/MEDIUM: ssl: fix OCSP expiry calculation The hour part of the timezone offset was multiplied by 60 instead of 3600, resulting in an inaccurate expiry. This bug was introduced in 1.6-dev1 by commit `4f3c87a` ("BUG/MEDIUM: ssl: Fix to not serve expired OCSP responses."), so this fix must be backported into 1.7 and 1.6.	2017-10-16 18:14:36 +02:00
Emeric Brun	64cc49cf7e	MAJOR: servers: propagate server status changes asynchronously. In order to prepare multi-thread development, code was re-worked to propagate changes asynchronoulsy. Servers with pending status changes are registered in a list and this one is processed and emptied only once 'run poll' loop. Operational status changes are performed before administrative status changes. In a case of multiple operational status change or admin status change in the same 'run poll' loop iteration, those changes are merged to reach only the targeted status.	2017-10-13 12:00:27 +02:00
Willy Tarreau	d716f9bacf	MINOR: payload: add new sample fetch functions to process distcc protocol When using haproxy in front of distccd, it's possible to provide significant improvements by only connecting when the preprocessing is completed, and by selecting different farms depending on the payload size. This patch provides two new sample fetch functions : distcc_param(<token>[,<occ>]) : integer distcc_body(<token>[,<occ>]) : binary	2017-10-13 11:47:19 +02:00
Willy Tarreau	ff2b7afe0b	MINOR: server: add the srv_queue() sample fetch method srv_queue([<backend>/]<server>) : integer Returns an integer value corresponding to the number of connections currently pending in the designated server's queue. If <backend> is omitted, then the server is looked up in the current backend. It can sometimes be used together with the "use-server" directive to force to use a known faster server when it is not much loaded. See also the "srv_conn", "avg_queue" and "queue" sample fetch methods.	2017-10-13 11:47:18 +02:00
Patrick Starr	dce734e10f	DOC: fix some typos [wt: ~25 typos, most of which should be eligible for backporting]	2017-10-11 04:26:07 +02:00
Willy Tarreau	bf08beb2a3	MINOR: session: remove the list of streams from struct session Commit `bcb86ab` ("MINOR: session: add a streams field to the session struct") added this list of streams that is not needed anymore. Let's get rid of it now.	2017-10-08 22:32:05 +02:00
Willy Tarreau	c939835f77	MINOR: compiler: restore the likely() wrapper for gcc 5.x After some tests, gcc 5.x produces better code with likely() than without, contrary to gcc 4.x where it was better to disable it. Let's re-enable it for 5 and above.	2017-10-08 22:32:05 +02:00
Ben51Degrees	636e6afcfa	DOC: 51d: Updated git URL and instructions for getting Hash Trie data files. Use branch, not tag for download URL, and recommend switching to Hash Trie.	2017-10-06 16:47:25 +02:00
Dragan Dosen	16586e635b	DOC: 51d: add 51Degrees git URL that points to release version 3.2.12.12 The 51Degrees C library version 3.2.12.12 has support for a new Hash Trie algorithm. This patch can be backported in 1.7.	2017-10-05 11:24:25 +02:00
Dragan Dosen	483b93cc9a	BUILD/MINOR: 51d: fix warning when building with 51Degrees release version 3.2.12.12 The warning appears when building with 51Degrees release that uses a new Hash Trie algorithm (release version 3.2.12.12): src/51d.c: In function init_51degrees: src/51d.c:566:2: warning: enumeration value DATA_SET_INIT_STATUS_TOO_MANY_OPEN_FILES not handled in switch [-Wswitch] switch (_51d_dataset_status) { ^ This patch can be backported in 1.7.	2017-10-05 11:23:38 +02:00
Bin Wang	95fad5ba4b	BUG/MAJOR: stream-int: don't re-arm recv if send fails When 1) HAProxy configured to enable splice on both directions 2) After some high load, there are 2 input channels with their socket buffer being non-empty and pipe being full at the same time, sitting in `fd_cache` without any other fds. The 2 channels will repeatedly be stopped for receiving (pipe full) and waken for receiving (data in socket), thus getting out and in of `fd_cache`, making their fd swapping location in `fd_cache`. There is a `if (entry < fd_cache_num && fd_cache[entry] != fd) continue;` statement in `fd_process_cached_events` to prevent frequent polling, but since the only 2 fds are constantly swapping location, `fd_cache[entry] != fd` will always hold true, thus HAProxy can't make any progress. The root cause of the issue is dual : - there is a single fd_cache, for next events and for the ones being processed, while using two distinct arrays would avoid the problem. - the write side of the stream interface wakes the read side up even when it couldn't write, and this one really is a bug. Due to CF_WRITE_PARTIAL not being cleared during fast forwarding, a failed send() attempt will still cause ->chk_rcv() to be called on the other side, re-creating an entry for its connection fd in the cache, causing the same sequence to be repeated indefinitely without any opportunity to make progress. CF_WRITE_PARTIAL used to be used for what is present in these tests : check if a recent write operation was performed. It's part of the CF_WRITE_ACTIVITY set and is tested to check if timeouts need to be updated. It's also used to detect if a failed connect() may be retried. What this patch does is use CF_WROTE_DATA() to check for a successful write for connection retransmits, and to clear CF_WRITE_PARTIAL before preparing to send in stream_int_notify(). This way, timeouts are still updated each time a write succeeds, but chk_rcv() won't be called anymore after a failed write. It seems the fix is required all the way down to 1.5. Without this patch, the only workaround at this point is to disable splicing in at least one direction. Strictly speaking, splicing is not absolutely required, as regular forwarding could theorically cause the issue to happen if the timing is appropriate, but in practice it appears impossible to reproduce it without splicing, and even with splicing it may vary. The following config manages to reproduce it after a few attempts (haproxy going 100% CPU and having to be killed) : global maxpipes 50000 maxconn 10000 listen srv1 option splice-request option splice-response bind :8001 server s1 127.0.0.1:8002 server$ tcploop 8002 L N20 A R10 S1000000 R10 S1000000 R10 S1000000 R10 S1000000 R10 S1000000 client$ tcploop 8001 N20 C T S1000000 R10 J	2017-10-05 11:20:16 +02:00
Christopher Faulet	a258479e3f	BUG/MEDIUM: http: Return an error when url_dec sample converter failed url_dec sample converter uses url_decode function to decode an URL. This function fails by returning -1 when an invalid character is found. But the sample converter never checked the return value and it used it as length for the decoded string. Because it always succeeded, the invalid sample (with a string length set to -1) could be used by other sample fetches or sample converters, leading to undefined behavior like segfault. The fix is pretty simple, url_dec sample converter just needs to return an error when url_decode fails. This patch must be backported in 1.7 and 1.6.	2017-10-05 11:11:34 +02:00
Willy Tarreau	017af2477e	BUG/MEDIUM: cli: fix "show fd" crash when dumping closed FDs I misplaced the "if (!fdt.owner)" test so it can occasionally crash when dumping an fd that's already been closed but still appears in the table. It's not critical since this was not pushed into any release nor backported though.	2017-10-04 20:28:26 +02:00
Willy Tarreau	00149121b7	MEDIUM: checks: do not allocate a permanent connection anymore Health check currently cheat, they allocate a connection upon startup and never release it, it's only recycled. The problem with doing this is that this code is preventing the connection code from evolving towards multiplexing. This code ensures that it's safe for the checks to run without a connection all the time. Given that the code heavily relies on CO_FL_ERROR to signal check errors, it is not trivial but in practice this is the principle adopted here : - the connection is not allocated anymore on startup - new checks are not supposed to have a connection, so an attempt is made to allocate this connection in the check task's context. If it fails, the check is aborted on a resource error, and the rare code on this path verifying the connection was adjusted to check for its existence (in practice, avoid to close it) - returning checks necessarily have a valid connection (which may possibly be closed). - a "tcp-check connect" rule tries to allocate a new connection before releasing the previous one (but after closing it), so that if it fails, it still keeps the previous connection in a closed state. This ensures a connection is always valid here Now it works well on all tested cases (regular and TCP checks, even with multiple reconnections), including when the connection is forced to NULL or randomly allocated.	2017-10-04 19:36:29 +02:00
Willy Tarreau	6bdcab0149	MEDIUM: checks: make tcpcheck_main() indicate if it recycled a connection The tcp-checks are very fragile. They can modify a connection's FD by closing and reopening a socket without informing the connection layer, which may then possibly touch the wrong fd. Given that the events are only cleared and that the fd is just created, there should be no visible side effect because the old fd is deleted so even if its flags get cleared they were already, and the new fd already has them cleared as well so it's a NOP. Regardless, this is too fragile and will not resist to threads. In order to address this situation, this patch makes tcpcheck_main() indicate if it closed a connection and report it to wake_srv_chk(), which will then report it to the connection's fd handler so that it refrains from updating the connection polling and the fd. Instead the connection polling status is updated in the wake() function.	2017-10-04 18:49:22 +02:00
Willy Tarreau	f411cce456	MINOR: checks: don't create then kill a dummy connection before tcp-checks When tcp-checks are in use, a connection starts to be created, then it's destroyed so that tcp-check can recreate its own. Now we directly move to tcpcheck_main() when it's detected that tcp-check is in use.	2017-10-04 16:29:19 +02:00
Willy Tarreau	be74b88be8	MINOR: tcp-check: make tcpcheck_main() take a check, not a connection We want this one to allocate its own connection so it must not take a connection but a check.	2017-10-04 16:29:19 +02:00
Willy Tarreau	668730fd00	TESTS: checks: add a simple test config for tcp-checks tcp-check.cfg tests various arrangements of initial tcp-check rules.	2017-10-04 16:29:19 +02:00
Willy Tarreau	894c642fbf	BUG/MINOR: tcp-check: don't initialize then break a connection starting with a comment The following config : backend tcp9000 option tcp-check tcp-check comment "this is a comment" tcp-check connect port 10000 server srv 127.0.0.1:9000 check inter 1s will result in a connection being first made to port 9000 then immediately destroyed and re-created on port 10000, because the first rule is a comment and doesn't match the test for the first rule being a connect(). It's mostly harmless (unless the server really must not receive empty connections) and the workaround simply consists in removing the comment. Let's proceed like in other places where we simply skip leading comments. A new function was made to make this lookup les boring. The fix should be backported to 1.7 and 1.6.	2017-10-04 16:13:57 +02:00
Willy Tarreau	59070784fc	TESTS: checks: add a simple test config for external checks ext-check.cfg tests both for success and failure in two different backends.	2017-10-04 15:42:00 +02:00
Willy Tarreau	b398e643d4	CLEANUP: checks: do not allocate a connection for process checks Since this connection is not used at all anymore, do not allocate it. It was verified that check successes and failures (both synchronous and asynchronous) continue to be properly reported.	2017-10-04 15:25:38 +02:00
Willy Tarreau	d7c3fbd5c3	CLEANUP: checks: don't report report the fork() error twice Upon fork() error, a first report is immediately made by connect_proc_chk() via set_server_check_status(), then process_chk_proc() detects the error code and makes up a dummy connection error to call chk_report_conn_err(), which tries to retrieve the errno code from the connection, fails, then saves the status message from the check, fails all "if" tests on its path related to the connection then resets the check's state to the current one with the current status message. All this useless chain is the only reason why process checks require a connection! Let's simply get rid of this second useless call.	2017-10-04 15:19:26 +02:00
Willy Tarreau	1e62e2a780	CLEANUP: checks: remove misleading comments and statuses for external process The external process check code abused a little bit from copy-pasting to the point of making think it requires a connection... The initialization code only returns SF_ERR_NONE and SF_ERR_RESOURCE, so the other one can be folded there. The code now only uses the connection to report the error status.	2017-10-04 15:07:02 +02:00
Willy Tarreau	b5259bf44f	MINOR: checks: make chk_report_conn_err() take a check, not a connection Amazingly, this function takes a connection to report an error and is used by process checks, placing a hard dependency between the connection and the check preventing the mux from being completely implemented. Let's first get rid of this.	2017-10-04 14:47:29 +02:00
Willy Tarreau	a1a247bd90	BUG/MINOR: unix: properly check for octal digits in the "mode" argument A config containing "stats socket /path/to/socket mode admin" used to silently start and be unusable (mode 0, level user) because the "mode" parser doesn't take care of non-digits. Now it properly reports : [ALERT] 276/144303 (7019) : parsing [ext-check.cfg:4] : 'stats socket' : ''mode' : missing or invalid mode 'admin' (octal integer expected)' This can probably be backported to 1.7, 1.6 and 1.5, though reporting parsing errors in very old versions probably isn't a good idea if the feature was left unused for years.	2017-10-04 14:43:44 +02:00
Willy Tarreau	c09572fd8b	BUG/MEDIUM: tcp-check: don't call tcpcheck_main() from the I/O handlers! This function can destroy a socket and create a new one, resulting in a change of FD on the connection between recv() and send() for example, which is absolutely not permitted, and can result in various funny games like polling not being properly updated (or with the flags from a previous fd) etc. Let's only call this from the wake() callback which is more tolerant. Ideally the operations should be made even more reliable by returning a specific value to indicate that the connection was released and that another one was created. But this is hasardous for stable releases as it may reveal other issues. This fix should be backported to 1.7 and 1.6.	2017-10-04 13:41:20 +02:00
Willy Tarreau	82feaaf042	BUG/MINOR: tcp-check: don't quit with pending data in the send buffer In the rare case where the "tcp-check send" directive is the last one in the list, it leaves the loop without sending the data. Fortunately, the polling is still enabled on output, resulting in the connection handler calling back to send what remains, but this is ugly and not very reliable. This may be backported to 1.7 and 1.6.	2017-10-04 13:41:20 +02:00
Willy Tarreau	a3782e7594	BUG/MEDIUM: tcp-check: properly indicate polling state before performing I/O While porting the connection to use the mux layer, it appeared that tcp-checks wouldn't receive anymore because the polling is not enabled before attempting to call xprt->rcv_buf() nor xprt->snd_buf(), and it is illegal to call these functions with polling disabled as they directly manipulate the FD state, resulting in an inconsistency where the FD is enabled and the connection's polling flags disabled. Till now it happened to work only because when recv() fails on EAGAIN it calls fd_cant_recv() which enables polling while signaling the failure, so that next time the message is received. But the connection's polling is never enabled, and any tiny change resulting in a call to conn_data_update_polling() immediately disables reading again. It's likely that this problem already happens on some corner cases such as multi-packet responses. It definitely breaks as soon as the response buffer is full but we don't support consuming more than one response buffer. This fix should be backported to 1.7 and 1.6. In order to check for the proper behaviour, this tcp-check must work and clearly show an SSH banner in recvfrom() as observed under strace, otherwise it's broken : tcp-check connect port 22 tcp-check expect rstring SSH tcp-check send blah	2017-10-04 13:41:17 +02:00
Willy Tarreau	3cad394520	CLEANUUP: checks: don't set conn->handle.fd to -1 This used to be needed to know whether there was a check in progress a long time ago (before tcp_checks) but this is not true anymore and even becomes wrong after the check is reused as conn_init() initializes it to DEAD_FD_MAGIC.	2017-10-04 07:53:19 +02:00

... 36 37 38 39 40 ...

8506 Commits