haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-14 02:57:01 +02:00

Author	SHA1	Message	Date
Willy Tarreau	e35d1d4f42	BUILD: http_act: cast file sizes when reporting file size error As seen in issue #496, st_size may be of varying types on different systems. Let's simply cast it to long long and use long long for all size outputs.	2020-02-11 10:58:56 +01:00
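The cast described in this commit can be sketched as follows. This is a minimal illustration, not the HAProxy code: `format_size_error()`, its parameters and the message text are hypothetical.

```c
#include <assert.h>
#include <stdio.h>
#include <sys/types.h>

/* Hypothetical helper (not from HAProxy): formats a "file too large" message.
 * off_t, the type of st_size, has a platform-dependent width, so it is cast
 * to long long and printed with %lld to match the format on every system.
 */
static int format_size_error(char *buf, size_t len, const char *path,
                             off_t size, long long limit)
{
	assert(buf != NULL && path != NULL);
	return snprintf(buf, len, "file '%s' is too large (%lld bytes, max %lld)",
	                path, (long long)size, limit);
}
```

Passing `st.st_size` from a `struct stat` as `size` then builds without format warnings whatever the underlying type is.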
Willy Tarreau	157788c7b1	BUG/MINOR: connection: correctly retry I/O on signals Issue #490 reports that there are a few bogus constructs of the famous "do { if (cond) continue; } while (0)" in the connection code, that are used to retry on I/O failures caused by receipt of a signal. Let's turn them into the more correct "while (1) { if (cond) continue; break }" instead. This may or may not be backported, it shouldn't have any visible effect.	2020-02-11 10:26:39 +01:00
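The difference between the two constructs named in this commit can be sketched as below. The names `io_fn`, `call_once_bogus()`, `call_retry()` and `flaky_op()` are hypothetical and only serve the illustration; they are not HAProxy functions.

```c
#include <errno.h>

/* Hypothetical I/O callback: returns -1 and sets errno on failure. */
typedef int (*io_fn)(void *ctx);

/* Bogus form: "continue" inside do { } while (0) jumps to the loop
 * condition, which is 0, so the loop exits without retrying.
 */
static int call_once_bogus(io_fn fn, void *ctx)
{
	int ret;

	do {
		ret = fn(ctx);
		if (ret < 0 && errno == EINTR)
			continue; /* does NOT retry */
	} while (0);
	return ret;
}

/* Corrected form: "continue" restarts the body, "break" leaves the loop. */
static int call_retry(io_fn fn, void *ctx)
{
	int ret;

	while (1) {
		ret = fn(ctx);
		if (ret < 0 && errno == EINTR)
			continue; /* retry after a signal */
		break;
	}
	return ret;
}

/* Test double: fails with EINTR until the counter in <ctx> reaches 0. */
static int flaky_op(void *ctx)
{
	int *remaining = ctx;

	if (*remaining > 0) {
		(*remaining)--;
		errno = EINTR;
		return -1;
	}
	return 0;
}
```

With a counter of 2, `call_once_bogus(flaky_op, &n)` gives up after the first interruption, while `call_retry(flaky_op, &n)` keeps calling until the operation succeeds.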
Willy Tarreau	327ea5aec8	BUG/MINOR: unix: better catch situations where the unix socket path length is close to the limit We do have some checks for the UNIX socket path length to validate the full pathname of a unix socket but the pathname extension is only taken into account when using a bind_prefix. The second check only matches against MAXPATHLEN. So this means that path names between 98 and 108 might successfully parse but fail to bind. Let's adjust the check in the address parser and refine the error checking at the bind() step. This addresses bug #493.	2020-02-11 06:49:42 +01:00
Willy Tarreau	508f989758	BUG/MAJOR: mux-h2: don't wake streams after connection was destroyed In commit `477902b` ("MEDIUM: connections: Get ride of the xprt_done callback.") we added an unconditional call to h2_wake_some_streams() in h2_wake(), though we must not do it if the connection is destroyed or we end up with a use-after-free. In this case it's already done in h2_process() before destroying the connection anyway. Let's just add this test for now. A cleaner approach might consist in doing it in the h2_process() function itself when a connection status change is detected. No backport is needed, this is purely 2.2.	2020-02-11 04:42:05 +01:00
Christopher Faulet	67307796e6	BUG/MEDIUM: tcp-rules: Fix track-sc* actions for L4/L5 TCP rules A bug was introduced during TCP rules refactoring by the commit `ac98d81f4` ("MINOR: http-rule/tcp-rules: Make track-sc* custom actions"). There is no stream when L4/L5 TCP rules are evaluated. For these rulesets, in track-sc* actions, we must take care to rely on the session instead of the stream. Because of this bug, any evaluation of L4/L5 TCP rules using a track-sc* action leads to a crash of HAProxy. No backport needed, except if the above commit is backported.	2020-02-10 10:09:58 +01:00
William Lallemand	696f317f13	BUG/MEDIUM: ssl/cli: 'commit ssl cert' wrong SSL_CTX init The code which is supposed to apply the bind_conf configuration on the SSL_CTX was not called correctly. Indeed it was called with the previous SSL_CTX so the new ones were left with default settings. For example the ciphers were not changed. This patch fixes #429. Must be backported in 2.1.	2020-02-07 20:55:35 +01:00
Christopher Faulet	817c4e39e5	BUG/MINOR: http-act: Fix bugs on error path during parsing of return actions This patch fixes memory leaks and a null pointer dereference found by coverity on the error path when an HTTP return action is parsed. See issue #491. No need to backport this patch unless the HTTP return action is backported too.	2020-02-07 10:37:59 +01:00
Christopher Faulet	692a6c2e69	BUG/MINOR: http-act: Set stream error flag before returning an error In action_http_set_status(), when a rewrite error occurred, the stream error flag must be set before returning the error. No need to backport this patch except if commit `333bf8c33` ("MINOR: http-rules: Set SF_ERR_PRXCOND termination flag when a header rewrite fails") is backported. This bug was reported in issue #491.	2020-02-07 10:37:53 +01:00
Tim Duesterhus	f1bc24cb27	BUG/MINOR: acl: Fix type of log message when an acl is named 'or' The patch adding this check initially only issued a warning, instead of being fatal. It was changed before committing. However when making this change the type of the log message was not changed from `ha_warning` to `ha_alert`. This patch makes this forgotten adjustment. See commit `0cf811a5f9`. No backport needed. The initial patch was backported as a warning, thus the log message type is correct.	2020-02-06 22:16:07 +01:00
Tim Duesterhus	0cf811a5f9	MINOR: acl: Warn when an ACL is named 'or' Consider a configuration like this: > acl t always_true > acl or always_false > > http-response set-header Foo Bar if t or t The 'or' within the condition will be treated as a logical disjunction and the header will be set, despite the ACL 'or' being falsy. This patch makes it an error to declare such an ACL that will never work. This patch may be backported to stable releases, turning the error into a warning only (the code was written in a way to make this trivial). It should not break anything and might improve the users' lives.	2020-02-06 16:08:36 +01:00
Willy Tarreau	9d6bb5a546	BUILD: lua: silence a warning on systems where longjmp is not marked as noreturn If the longjmp() call is not flagged as "noreturn", for example, because the operating system doesn't target a gcc-compatible compiler, we may get this warning when building Lua : src/hlua.c: In function 'hlua_panic_ljmp': src/hlua.c:128:1: warning: no return statement in function returning non-void [-Wreturn-type] static int hlua_panic_ljmp(lua_State *L) { longjmp(safe_ljmp_env, 1); } ^~~~~~ The function's prototype cannot be changed because it must be compatible with Lua's callbacks. Let's simply enclose the call inside WILL_LJMP() which we created exactly to signal a call to longjmp(). It lets the compiler know we won't get back into the function and that the return statement is not needed.	2020-02-06 16:01:04 +01:00
Christopher Faulet	700d9e88ad	MEDIUM: lua: Add ability for actions to intercept HTTP messages It is now possible to intercept HTTP messages from a lua action and reply to clients. To do so, a reply object must be provided to the function txn:done(). It may contain a status code with a reason, a header list and a body. By default, if an empty reply object is used, an empty 200 response is returned. If no reply is passed when txn:done() is called, the previous behaviour is respected, the transaction is terminated and nothing is returned to the client. The same is done for TCP streams. When txn:done() is called, the action is terminated with the code ACT_RET_DONE on success and ACT_RET_ERR on error, interrupting the message analysis. The reply object may be created from Lua, by hand. Or txn:reply() may be called. If so, this object provides some methods to fill it: * Reply:set_status(<status> [, <reason>]) : Set the status and optionally the reason. If no reason is provided, the default one corresponding to the status code is used. * Reply:add_header(<name>, <value>) : Add a header. For a given name, the values are stored in an ordered list. * Reply:del_header(<name>) : Removes all occurrences of a header name. * Reply:set_body(<body>) : Set the reply body. Here are some examples, all doing the same: -- ex. 1 txn:done{ status = 400, reason = "Bad request", headers = { ["content-type"] = { "text/html" }, ["cache-control"] = { "no-cache", "no-store" }, }, body = "<html><body><h1>invalid request</h1></body></html>" } -- ex. 2 local reply = txn:reply{ status = 400, reason = "Bad request", headers = { ["content-type"] = { "text/html" }, ["cache-control"] = { "no-cache", "no-store" } }, body = "<html><body><h1>invalid request</h1></body></html>" } txn:done(reply) -- ex. 3 local reply = txn:reply() reply:set_status(400, "Bad request") reply:add_header("content-type", "text/html") reply:add_header("cache-control", "no-cache") reply:add_header("cache-control", "no-store") reply:set_body("<html><body><h1>invalid request</h1></body></html>") txn:done(reply)	2020-02-06 15:13:04 +01:00
Christopher Faulet	2c2c2e381b	MINOR: lua: Add act:wake_time() function to set a timeout when an action yields This function may be used to define a timeout when a lua action returns act:YIELD. It is a way to force the script to be reexecuted after a short time (defined in milliseconds). Unlike core:sleep() or core:yield(), the script is fully reexecuted if it returns act:YIELD. With core functions to yield, the script is interrupted and restarts from the yield point. When a script returns act:YIELD, it is finished but the message analysis is blocked on the action waiting for its end.	2020-02-06 15:13:04 +01:00
Christopher Faulet	0f3c8907c3	MINOR: lua: Create the global 'act' object to register all action return codes ACT_RET_* codes are now available from lua scripts. The global object "act" is used to register these codes as constants. Now, lua actions can return any of the following codes : * act.CONTINUE for ACT_RET_CONT * act.STOP for ACT_RET_STOP * act.YIELD for ACT_RET_YIELD * act.ERROR for ACT_RET_ERR * act.DONE for ACT_RET_DONE * act.DENY for ACT_RET_DENY * act.ABORT for ACT_RET_ABRT * act.INVALID for ACT_RET_INV For instance, the following script denies all requests : core.register_action("deny", { "http-req" }, function (txn) return act.DENY end) Thus "http-request lua.deny" does exactly the same as "http-request deny".	2020-02-06 15:13:03 +01:00
Christopher Faulet	7716cdf450	MINOR: lua: Get the action return code on the stack when an action finishes When an action successfully finishes, the action return code (ACT_RET_*) is now retrieved from the stack, if the first element is an integer. In addition, in hlua_txn_done(), the value ACT_RET_DONE is pushed on the stack before exiting. Thus, when a script uses this function, the corresponding action still finishes with the right code. Thanks to this change, the flag HLUA_STOP is now useless. So it has been removed. It is a mandatory step to allow a lua action to return any action return code.	2020-02-06 15:13:03 +01:00
Christopher Faulet	a20a653e07	BUG/MINOR: http-ana: Increment failed_resp counters on invalid response In http_process_res_common() analyzer, when an invalid response is reported, the failed_resp counters must be incremented. No need to backport this patch, except if the commit `b8a5371a` ("MEDIUM: http-ana: Properly handle internal processing errors") is backported too.	2020-02-06 15:13:03 +01:00
Christopher Faulet	07a718e712	CLEANUP: lua: Remove consistency check for sample fetches and actions It is not possible anymore to alter the HTTP parser state from lua sample fetches or lua actions. So there is no reason to still check for the parser state consistency.	2020-02-06 15:13:03 +01:00
Christopher Faulet	4a2c142779	MEDIUM: http-rules: Support extra headers for HTTP return actions It is now possible to append extra headers to the responses generated by HTTP return actions, as long as they are not based on an errorfile. For return actions based on errorfiles, these extra headers are ignored. To define an extra header, a "hdr" argument must be used with a name and a value. The value is a log-format string. For instance: http-request return status 200 hdr "x-src" "%[src]" hdr "x-dst" "%[dst]"	2020-02-06 15:13:03 +01:00
Christopher Faulet	24231ab61f	MEDIUM: http-rules: Add the return action to HTTP rules Thanks to this new action, it is now possible to return any response from HAProxy, with any status code, based on an errorfile, a file or a string. Unlike the other internal messages generated by HAProxy, these ones are not interpreted as errors. And it is not necessary to use a file containing a full HTTP response, although it is still possible. In addition, using a log-format string or a log-format file, it is possible to have responses with a dynamic content. This action can be used on the request path or the response path. The only constraint is to have a response smaller than a buffer. And to avoid any warning the buffer space reserved to the header rewriting should also be free. When a response is returned with a file or a string as payload, it only contains the content-length header and the content-type header, if applicable. Here are examples: http-request return content-type image/x-icon file /var/www/favicon.ico \ if { path /favicon.ico } http-request return status 403 content-type text/plain \ lf-string "Access denied. IP %[src] is blacklisted." \ if { src -f /etc/haproxy/blacklist.lst }	2020-02-06 15:12:54 +01:00
Christopher Faulet	6d0c3dfac6	MEDIUM: http: Add a ruleset evaluated on all responses just before forwarding This patch introduces the 'http-after-response' rules. These rules are evaluated at the end of the response analysis, just before the data forwarding, on ALL HTTP responses, the server ones but also all responses generated by HAProxy. Thanks to this ruleset, it is now possible for instance to add some headers to the responses generated by the stats applet. Following actions are supported : * allow * add-header * del-header * replace-header * replace-value * set-header * set-status * set-var * strict-mode * unset-var	2020-02-06 14:55:34 +01:00
Christopher Faulet	a72a7e49e8	MINOR: http-ana/http-rules: Use dedicated function to forward internal responses Call http_forward_proxy_resp() function when an internal response is returned. It concerns redirect, auth and error responses. But also 100-Continue and 103-Early-Hints responses. For errors, there is a subtlety. If the forwarding fails, an HTTP 500 error is generated if it is not already an internal error. For now http_forward_proxy_resp() cannot fail. But it will be possible once the new ruleset applied on all responses is added.	2020-02-06 14:55:34 +01:00
Christopher Faulet	ef70e25035	MINOR: http-ana: Add a function for forward internal responses Operations performed when internal responses (redirect/deny/auth/errors) are returned are always the same. The http_forward_proxy_resp() function is added to group all of them under a unique function.	2020-02-06 14:55:34 +01:00
Christopher Faulet	72c7d8d040	MINOR: http-ana: Rely on http_reply_and_close() to handle server error The http_server_error() function now relies on http_reply_and_close(). Both do almost the same actions. In addition, http_server_error() sets the error flag and the final state flag on the stream.	2020-02-06 14:55:34 +01:00
Christopher Faulet	60b33a5a62	MINOR: http-rules: Handle the rule direction when a redirect is evaluated The rule direction must be tested to do specific processing on the request path. intercepted_req counter should be updated if the rule is evaluated on the frontend and remaining request's analyzers must be removed. But only on the request path. The rule direction must also be tested to set the right final stream state flag. This patch depends on the commit "MINOR: http-rules: Add a flag on redirect rules to know the rule direction". Both must be backported to all stable versions.	2020-02-06 14:55:34 +01:00
Christopher Faulet	c87e468816	MINOR: http-rules: Add a flag on redirect rules to know the rule direction HTTP redirect rules can be evaluated on the request or the response path. So when a redirect rule is evaluated, it is important to have this information because some specific processing may be performed depending on the direction. So the REDIRECT_FLAG_FROM_REQ flag has been added. It is set when applicable on the redirect rule during the parsing. This patch is mandatory to fix a bug on redirect rule. It must be backported to all stable versions.	2020-02-06 14:55:34 +01:00
Christopher Faulet	c20afb810f	BUG/MINOR: http-ana: Set HTX_FL_PROXY_RESP flag if a server perform a redirect It is important to not forget to specify the HTX response was internally generated when a server performs a redirect. This information is used by the H1 multiplexer to choose the right connection mode when the response is sent to the client. This patch must be backported to 2.1.	2020-02-06 14:55:34 +01:00
Christopher Faulet	7a138dc908	BUG/MINOR: http-ana: Reset HTX first index when HAPRoxy sends a response The first index in an HTX message is the HTX block index from which the HTTP analysis must be performed. When HAProxy sends an HTTP response, on error or redirect, this index must be reset because all pending incoming data are considered as forwarded. For now, it is only a bug for 103-Early-Hints response. For other responses, it is not a problem. But it will be once the new ruleset applied on all responses is added. For 103 responses, if the first index is not reset and there are rewriting rules on server responses, the generated 103 responses, if any, are evaluated too. This patch must be backported and probably adapted, at least for 103 responses, as far as 1.9.	2020-02-06 14:55:34 +01:00
Christopher Faulet	3b2bb63ded	MINOR: dns: Add function to release memory allocated for a do-resolve rule Memory allocated when a do-resolve rule is parsed is now released when HAProxy exits.	2020-02-06 14:55:34 +01:00
Christopher Faulet	a4168434a7	MINOR: dns: Dynamically allocate dns options to reduce the act_rule size <.arg.dns.dns_opts> field in the act_rule structure is now dynamically allocated when a do-resolve rule is parsed. This drastically reduces the structure size.	2020-02-06 14:55:34 +01:00
Christopher Faulet	637259e044	BUG/MINOR: http-ana: Don't overwrite outgoing data when an error is reported When an error is returned to a client, the right message is injected into the response buffer. It is performed by http_server_error() or http_reply_and_close(). Both ignore any data already present in the channel's buffer. While it is legitimate to remove all input data, it is important to not remove any outgoing data. So now, we try to append the error message to the response buffer, only removing input data. We rely on the channel_htx_copy_msg() function to do so. So this patch depends on the following two commits: * MINOR: htx: Add a function to append an HTX message to another one * MINOR: htx/channel: Add a function to copy an HTX message in a channel's buffer This patch must be backported as far as 1.9. However, above patches must be backported first.	2020-02-06 14:55:34 +01:00
Christopher Faulet	0ea0c86753	MINOR: htx: Add a function to append an HTX message to another one The htx_append_msg() function can now be used to append an HTX message to another one. Either the whole message is copied or nothing is. If an error occurs during the copy, all changes are rolled back. This patch is mandatory to fix a bug in http_reply_and_close() function. Be careful to backport it first.	2020-02-06 14:54:47 +01:00
Christopher Faulet	0a589fde7c	MINOR: http-htx: Emit a warning if an error file runs over the buffer's reserve If an error file is too big and, once converted to HTX, runs over the buffer space reserved to header rewriting, a warning is emitted. Because a new set of rules will be added to allow header rewriting on all responses, including HAProxy ones, it is important to always keep this space free for error files.	2020-02-06 09:36:36 +01:00
Christopher Faulet	333bf8c33f	MINOR: http-rules: Set SF_ERR_PRXCOND termination flag when a header rewrite fails When a header rewrite fails, an internal error is triggered. But SF_ERR_INTERNAL is documented to be the consequence of a bug and must be reported to the dev team. So, when this happens, the SF_ERR_PRXCOND termination flag is set now.	2020-02-06 09:36:36 +01:00
Christopher Faulet	546c4696bb	MINOR: global: Set default tune.maxrewrite value during global structure init When the global structure is initialized, instead of setting tune.maxrewrite to -1, its default value can be immediately set. This way, it is always defined during the configuration validity check. Otherwise, the only way to have it at this stage is to explicitly set it in the global section.	2020-02-06 09:36:36 +01:00
Christopher Faulet	91e31d83c9	BUG/MINOR: http-act: Use the good message to test strict rewritting mode Since the strict rewriting mode was introduced, actions manipulating headers (set/add/replace) always rely on the request message to test if the HTTP_MSGF_SOFT_RW flag is set or not. But, of course, we must only rely on the request for http-request rules. For http-response rules, we must use the response message. This patch must be backported if the strict rewriting mode is backported too.	2020-02-06 09:36:36 +01:00
Tim Duesterhus	d02ffe9b6d	CLEANUP: peers: Remove unused static function `free_dcache_tx` The function was added in commit `6c39198b57`, but was also used within a single function `free_dcache` which was unused itself. see issue #301 see commit `10ce0c2f31` which removed `free_dcache`	2020-02-05 23:40:17 +01:00
Tim Duesterhus	10ce0c2f31	CLEANUP: peers: Remove unused static function `free_dcache` The function was changed to be static in commit `6c39198b57`, but even that commit no longer uses it. The purpose of the change vs. outright removal is unclear. see issue #301	2020-02-05 18:49:29 +01:00
Willy Tarreau	077d366ef7	CLEANUP: hpack: remove a redundant test in the decoder As reported in issue #485 the test for !len at the end of the loop in get_var_int() is useless since it was already done inside the loop. Actually the code is more readable if we remove the first one so let's do this instead. The resulting code is exactly the same since the compiler already optimized the test away.	2020-02-05 15:39:08 +01:00
William Lallemand	4dd145a888	BUG/MINOR: ssl: clear the SSL errors on DH loading failure In ssl_sock_load_dh_params(), if haproxy failed to apply the dhparam with SSL_CTX_set_tmp_dh(), it will apply the DH with SSL_CTX_set_dh_auto(). The problem is that we don't clean the OpenSSL errors when leaving this function so it could fail to load the certificate, even if it's only a warning. Fixes bug #483. Must be backported in 2.1.	2020-02-05 15:32:24 +01:00
Willy Tarreau	731248f0db	BUG/MINOR: ssl: we may only ignore the first 64 errors We have the ability per bind option to ignore certain errors (CA, crt, ...), and for this we use a 64-bit field. In issue #479 coverity reports a risk of too large a left shift. For now as of OpenSSL 1.1.1 the highest error value that may be reported by X509_STORE_CTX_get_error() seems to be around 50 so there should be no risk yet, but it's enough of a warning to add a check so that we don't accidentally hide random errors in the future. This may be backported to relevant stable branches.	2020-02-04 14:04:36 +01:00
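The kind of range check this commit refers to can be sketched as follows. The helpers `ignore_err_set()` and `ignore_err_test()` are hypothetical names for the illustration and do not reproduce HAProxy's actual code.

```c
#include <stdint.h>

/* Hypothetical helper: marks error code <err> as ignorable in a 64-bit mask.
 * Shifting a 64-bit value by 64 or more is undefined behaviour in C, so
 * codes outside 0..63 are rejected instead of being shifted.
 * Returns 1 if the bit was set, 0 if <err> is out of range.
 */
static int ignore_err_set(uint64_t *mask, int err)
{
	if (err < 0 || err >= 64)
		return 0;
	*mask |= 1ULL << err;
	return 1;
}

/* Hypothetical helper: returns 1 if <err> is marked as ignorable.
 * Out-of-range codes are never considered ignorable.
 */
static int ignore_err_test(uint64_t mask, int err)
{
	if (err < 0 || err >= 64)
		return 0;
	return (int)((mask >> err) & 1);
}
```

With such a guard, an error code of 64 or above is reported normally rather than being silently matched against an arbitrary bit.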
William Lallemand	3af48e706c	MINOR: ssl: ssl-load-extra-files configure loading of files This new setting in the global section alters the way HAProxy will look for unspecified files (.ocsp, .sctl, .issuer, bundles) during the loading of the SSL certificates. By default, HAProxy discovers automatically a lot of files not specified in the configuration, and you may want to disable this behavior if you want to optimize the startup time. This patch sets flags in global_ssl.extra_files and then check them before trying to load an extra file.	2020-02-03 17:50:26 +01:00
Olivier Houchard	04f5fe87d3	BUG/MEDIUM: memory: Add a rwlock before freeing memory. When using lockless pools, add a new rwlock, flush_pool. read-lock it when getting memory from the pool, so that concurrent accesses are still authorized, but write-lock it when we're about to free memory, in pool_flush() and pool_gc(). The problem is, when removing an item from the pool, we dereference it to get the next one, however, that pointer may have been freed in the meanwhile, and that could provoke a crash if the pointer has been unmapped. It should be OK to use a rwlock, as normal operations will still be able to access the pool concurrently, and calls to pool_flush() and pool_gc() should be pretty rare. This should be backported to 2.1, 2.0 and 1.9.	2020-02-01 18:08:34 +01:00
Olivier Houchard	8af97eb4a1	MINOR: memory: Only init the pool spinlock once. In pool_create(), only initialize the pool spinlock if we just created the pool, in the event we're reusing it, there's no need to initialize it again.	2020-02-01 18:08:34 +01:00
Olivier Houchard	b6fa08bc7b	BUG/MEDIUM: memory_pool: Update the seq number in pool_flush(). In pool_flush(), we can't just set the free_list to NULL, or we may suffer the ABA problem. Instead, use a double-width CAS and update the sequence number. This should be backported to 2.1, 2.0 and 1.9. This may, or may not, be related to github issue #476.	2020-02-01 18:08:34 +01:00
Willy Tarreau	952c2640b0	MINOR: task: don't set TASK_RUNNING on tasklets We can't clear flags on tasklets because we don't know if they're still present upon return (they all return NULL, maybe that could change in the future). As a side effect, once TASK_RUNNING is set, it's never cleared anymore, which is misleading and resulted in some incorrect flagging of bulk tasks in the recent scheduler changes. And the only reason for setting TASK_RUNNING on tasklets was to detect self-wakers, which is now done using a dedicated flag. So instead of setting this flag with no opportunity to clear it, let's simply not set it.	2020-01-31 18:37:03 +01:00
Willy Tarreau	1dfc9bbdc6	OPTIM: task: readjust CPU bandwidth distribution since last update Now that we can more accurately watch which connection is really being woken up from itself, it was desirable to re-adjust the CPU BW thresholds based on measurements. New tests with 60000 concurrent connections were run at 100 Gbps with unbounded queues and showed the following distribution: scenario TC0 TC1 TC2 observation -------------------+---+---+----+--------------------------- TCP conn rate : 32, 51, 17 HTTP conn rate : 34, 41, 25 TCP byte rate : 2, 3, 95 (2 MB objects) splicing byte rate: 11, 6, 83 (2 MB objects) H2 10k object : 44, 23, 33 client-limited mixed traffic : 18, 10, 72 21m+10: 11kcps, 36 Gbps The H2 experienced a huge change since it uses a persistent connection that was accidentally flagged in the previous test. The splicing test exhibits a higher need for short tasklets, so does the mixed traffic test. Given that latency mainly matters for conn rate and H2 here, the ratios were readjusted as 33% for TC0, 50% for TC1 and 17% for TC2, keeping in mind that whatever is not consumed by one class is automatically shared in equal proportions by the next one(s). This setting immediately provided a nice improvement as with the default settings (maxpollevents=200, runqueue-depth=200), the same ratios as above are still reported, while the time to request "show activity" on the CLI dropped to 30-50ms. The average loop time is around 5.7ms on the mixed traffic. In addition, one extra stress test at 90.5 Gbps with 5100 conn/s shows 70-100ms CLI request time, with an average loop time of 17 ms.	2020-01-31 18:37:01 +01:00
Willy Tarreau	d23d413e38	MINOR: task: make sched->current also reflect tasklets sched->current is used to know the current task/tasklet, and is currently only used by the panic dump code. However it turns out it was not set for tasklets, which prevents us from using it for more usages, despite the panic handling code already handling this case very well. Let's make sure it's now set.	2020-01-31 17:45:10 +01:00
Willy Tarreau	bb238834da	MINOR: task: permanently flag tasklets waking themselves up Commit `a17664d829` ("MEDIUM: tasks: automatically requeue into the bulk queue an already running tasklet") tried to inflict a penalty to self-requeuing tasks/tasklets which correspond to those involved in large, high-latency data transfers, for the benefit of all other processing which requires a low latency. However, it turns out that while it ought to do this on a case-by-case basis, basing itself on the RUNNING flag isn't accurate because this flag doesn't leave for tasklets, so we'd rather need a distinct flag to tag such tasklets. This commit introduces TASK_SELF_WAKING to mark tasklets acting like this. For now it's still set when TASK_RUNNING is present but this will have to change. The flag is kept across wakeups.	2020-01-31 17:45:10 +01:00
Olivier Houchard	849d4f047f	BUG/MEDIUM: connections: Don't forget to unlock when killing a connection. Commit `140237471e` made sure we hold the toremove_lock for the corresponding thread before removing a connection from its idle_orphan_conns list, however it failed to unlock it if we found a connection, leading to a deadlock, so add the missing unlock. This should be backported to 2.1 and 2.0.	2020-01-31 17:25:37 +01:00
Willy Tarreau	c633607c06	OPTIM: task: refine task classes default CPU bandwidth ratios Measures with unbounded execution ratios under 40000 concurrent connections at 100 Gbps showed the following CPU bandwidth distribution between task classes depending on traffic scenarios: scenario TC0 TC1 TC2 observation -------------------+---+---+----+--------------------------- TCP conn rate : 29, 48, 23 221 kcps HTTP conn rate : 29, 47, 24 200 kcps TCP byte rate : 3, 5, 92 53 Gbps splicing byte rate: 5, 10, 85 70 Gbps H2 10k object : 10, 21, 74 client-limited mixed traffic : 4, 7, 89 21m+10: 11kcps, 36 Gbps Thus it seems that we always need a bit of bulk tasks even for short connections, which seems to imply a suboptimal processing somewhere, and that there are roughly twice as many tasks (TC1=normal) as regular tasklets (TC0=urgent). This ratio stands even when data forwarding increases. So at first glance it looks reasonable to enforce the following ratio by default: - 16% for TL_URGENT - 33% for TL_NORMAL - 50% for TL_BULK With this, the TCP conn rate climbs to ~225 kcps, and the mixed traffic pattern shows a more balanced 17kcps + 35 Gbps with 35ms CLI request time instead of 11kcps + 36 Gbps and 400 ms response time. The byte rate tests (1M objects) are not affected at all. This setting looks "good enough" to allow immediate merging, and could be refined later. It's worth noting that it resists very well to massive increase of run queue depth and maxpollevents: with the run queue depth changed from 200 to 10000 and maxpollevents to 10000 as well, the CLI's request time is back to the previous ~400ms, but the mixed traffic test reaches 52 Gbps + 7500 CPS, which was never met with the previous scheduling model, while the CLI used to show ~1 minute response time. The reason is that in the bulk class it becomes possible to perform multiple rounds of recv+send and eliminate objects at once, increasing the L3 cache hit ratio, and keeping the connection count low, without degrading too much the latency. Another test with mixed traffic involving 2/3 splicing on huge objects and 1/3 on empty objects without touching any setting reports 51 Gbps + 5300 cps and 35ms CLI request time.	2020-01-31 07:09:10 +01:00
Willy Tarreau	a62917b890	MEDIUM: tasks: implement 3 different tasklet classes with their own queues We used to mix high latency tasks and low latency tasklets in the same list, and to even refill bulk tasklets there, causing some unfairness in certain situations (e.g. poll-less transfers between many connections saturating the machine with similarly-sized in and out network interfaces). This patch changes the mechanism to split the load into 3 lists depending on the task/tasklet's desired classes : - URGENT: this is mainly for tasklets used as deferred callbacks - NORMAL: this is for regular tasks - BULK: this is for bulk tasks/tasklets Arbitrary ratios of max_processed are picked from each of these lists in turn, with the ability to complete in one list from what was not picked in the previous one. After some quick tests, the following setup gave apparently good results both for raw TCP with splicing and for H2-to-H1 request rate: - 0 to 75% for urgent - 12 to 50% for normal - 12 to what remains for bulk Bulk is not used yet.	2020-01-30 18:59:33 +01:00
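The idea of picking a share of max_processed from each class in turn, and letting a class use what the previous one left, can be sketched as below. This is only an illustration under assumptions: `split_budget()`, `NB_CLASSES` and the fixed percentages are hypothetical and do not reproduce the scheduler's actual code or ratios.

```c
/* Hypothetical sketch, not HAProxy's scheduler code. */
#define NB_CLASSES 3 /* urgent, normal, bulk */

/* Splits <max_processed> between the classes according to <ratio_pct>
 * (percentages). <pending> holds the number of queued entries per class.
 * Whatever a class does not consume from its budget is carried over to
 * the next class. The per-class result is stored in <picked>.
 */
static void split_budget(int max_processed, const int ratio_pct[NB_CLASSES],
                         const int pending[NB_CLASSES], int picked[NB_CLASSES])
{
	int leftover = 0;
	int i;

	for (i = 0; i < NB_CLASSES; i++) {
		int budget = max_processed * ratio_pct[i] / 100 + leftover;

		picked[i] = pending[i] < budget ? pending[i] : budget;
		leftover = budget - picked[i];
	}
}
```

For example, with a limit of 200, illustrative ratios of 16/33/50 and only 10 urgent entries pending, the 22 unused urgent slots are added to the normal class budget.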
Willy Tarreau	4ffa0b526a	MINOR: tasks: move the list walking code to its own function New function run_tasks_from_list() will run over a tasklet list and will run all the tasks and tasklets it finds there within a limit of <max> that is passed in argument. This is preliminary work for scheduler QoS improvements.	2020-01-30 18:13:13 +01:00
Willy Tarreau	876b411f2b	BUG/MEDIUM: pipe/thread: fix atomicity of pipe counters Previous patch `160287b676` ("MEDIUM: pipe/thread: maintain a per-thread local cache of recently used pipes") didn't replace all pipe counter updates with atomic ops since some were already under a lock, which is obviously not a valid reason since these ones can be updated in parallel to other atomic ops. The result was that the pipes_used could seldom be seen as negative in the stats (harmless) but also this could result in slightly more pipes being allocated than permitted, thus stealing a few file descriptors that were not usable for connections anymore. Let's use pure atomic ops everywhere these counters are updated. No backport is needed.	2020-01-30 09:15:37 +01:00
Willy Tarreau	160287b676	MEDIUM: pipe/thread: maintain a per-thread local cache of recently used pipes In order to completely remove the pipe locking cost and try to reuse hot pipes, each thread now maintains a local cache of recently used pipes that is no larger than its share (maxpipes/nbthreads). All extra pipes are instead refilled into the global pool. Allocations are made from the local pool first, and fall back to the global one before allocating one. This completely removes the observed pipe locking cost at high bit rates, which was still around 5-6%.	2020-01-29 11:12:07 +01:00
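The lookup order described in this entry (local cache first, then the global pool, then a fresh allocation, with the local cache capped at its share) might look roughly like the sketch below. Everything prefixed with `ex_` or `EX_` is a hypothetical name, the share of 2 is an arbitrary stand-in for maxpipes/nbthreads, and a plain heap object stands in for a real pipe; the actual implementation may differ.

```c
#include <stdlib.h>
#include <pthread.h>

/* Stand-in for a pipe descriptor; a real one would hold the two FDs. */
struct ex_pipe {
    struct ex_pipe *next;
};

#define EX_LOCAL_SHARE 2   /* assumed value of maxpipes / nbthreads */

static _Thread_local struct ex_pipe *ex_local;      /* per-thread cache  */
static _Thread_local unsigned int ex_local_count;
static struct ex_pipe *ex_global;                   /* shared pool       */
static pthread_mutex_t ex_global_lock = PTHREAD_MUTEX_INITIALIZER;

/* Take from the local cache without locking, else from the global
 * pool under the lock, else allocate a new entry.
 */
struct ex_pipe *ex_get(void)
{
    struct ex_pipe *p = ex_local;

    if (p) {
        ex_local = p->next;
        ex_local_count--;
        p->next = NULL;
        return p;
    }
    pthread_mutex_lock(&ex_global_lock);
    p = ex_global;
    if (p)
        ex_global = p->next;
    pthread_mutex_unlock(&ex_global_lock);

    if (!p)
        return calloc(1, sizeof(*p));
    p->next = NULL;
    return p;
}

/* Keep the entry locally while under the per-thread share, otherwise
 * refill it into the global pool.
 */
void ex_put(struct ex_pipe *p)
{
    if (ex_local_count < EX_LOCAL_SHARE) {
        p->next = ex_local;
        ex_local = p;
        ex_local_count++;
        return;
    }
    pthread_mutex_lock(&ex_global_lock);
    p->next = ex_global;
    ex_global = p;
    pthread_mutex_unlock(&ex_global_lock);
}
```

The lock is only taken on the slow paths, which is the point of the change: a thread that keeps reusing its own hot entries never touches the shared pool.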
Willy Tarreau	a945cfdfe0	MEDIUM: pipe/thread: reduce the locking overhead In a quick test involving splicing, we can see that get_pipe() and put_pipe() together consume up to 12% of the CPU. That's not surprising considering how much work is performed under the lock, including the pipe struct allocation, the pipe creation and its initialization. Same for releasing, we don't need a lock there to call close() nor to free to the pool. Changing this alone was enough to cut the overhead in half. A better approach should consist in having a per-thread pipe cache, which will also help keep pages hot in the CPU caches.	2020-01-29 10:44:00 +01:00
William Lallemand	a25a19fdee	BUG/MINOR: ssl/cli: fix unused variable with openssl < 1.0.2 src/ssl_sock.c: In function ‘cli_io_handler_show_cert’: src/ssl_sock.c:10214:6: warning: unused variable ‘n’ [-Wunused-variable] int n; ^ Fix this problem in the io handler of the "show ssl cert" function.	2020-01-29 00:08:10 +01:00
Willy Tarreau	1113116b4a	MEDIUM: raw-sock: remove obsolete calls to fd_{cant,cond,done}_{send,recv} Given that raw_sock's functions solely act on connections and that all its callers properly use subscribe() when they want to receive/send more, there is no reason left for calling fd_{cant,cond,done}_{send,recv} as this call is immediately overridden by the subscribe call. It's also worth noting that fd_cond_recv(), whose purpose was to speculatively enable reading in the FD cache if the FD was active but not yet polled, was made to save on expensive epoll_ctl() calls and was implicitly covered more cleanly by recent commit `5d7dcc2a8e` ("OPTIM: epoll: always poll for recv if neither active nor ready"). No change on the number of calls to epoll_ctl() was noticed consecutive to this change.	2020-01-28 19:06:41 +01:00
William Dauchy	1e2256d4d3	MINOR: proxy: clarify number of connections log when stopping This log could sometimes be a bit confusing (depending on the number, in fact) when you read it (e.g. is it the number of active connections?); only trained eyes know that haproxy outputs a different log when closing active connections while stopping. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-28 13:10:03 +01:00
William Dauchy	aecd5dcac2	BUG/MINOR: dns: allow 63 char in hostname hostname were limited to 62 char, which is not RFC1035 compliant; - the parsing loop should stop when above max label char - fix len label test where d[i] was wrongly used - simplify the whole function to avoid using two extra char* variable this should fix github issue #387 Signed-off-by: William Dauchy <w.dauchy@criteo.com> Reviewed-by: Tim Duesterhus <tim@bastelstu.be> Acked-by: Baptiste <bedis9@gmail.com>	2020-01-28 13:08:08 +01:00
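The 63-character limit discussed in this entry is the RFC 1035 maximum size of a single label. A minimal length-only check could be sketched as follows; `ex_labels_within_limit()` and `EX_DNS_MAX_LABEL_SIZE` are hypothetical names, the sketch does not validate which characters are allowed, and it deliberately rejects empty labels (including a trailing dot), which is an assumption rather than what the patched function does.

```c
#include <stddef.h>

#define EX_DNS_MAX_LABEL_SIZE 63   /* RFC 1035: labels are 63 octets or less */

/* Return 1 if every dot-separated label of <name> is between 1 and 63
 * characters long, 0 otherwise. Only lengths are checked here.
 */
int ex_labels_within_limit(const char *name)
{
    size_t len = 0;

    for (; *name; name++) {
        if (*name == '.') {
            if (len == 0)
                return 0;          /* empty label */
            len = 0;
            continue;
        }
        if (++len > EX_DNS_MAX_LABEL_SIZE)
            return 0;              /* stop as soon as the label is too long */
    }
    return len > 0;
}
```

Note the comparison is `> 63`, not `>= 63`: an off-by-one in that test is exactly the kind of mistake that limits labels to 62 characters.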
William Dauchy	bd8bf67102	BUG/MINOR: connection: fix ip6 dst_port copy in make_proxy_line_v2 triggered by coverity; src_port is set earlier. this should fix github issue #467 Fixes: `7fec021537` ("MEDIUM: proxy_protocol: Convert IPs to v6 when protocols are mixed") This should be backported to 1.8. Signed-off-by: William Dauchy <w.dauchy@criteo.com> Reviewed-by: Tim Duesterhus <tim@bastelstu.be>	2020-01-28 13:02:58 +01:00
Christopher Faulet	c20b37112b	BUG/MINOR: http-rules: Always init log-format expr for common HTTP actions Many HTTP actions rely on <.arg.http> in the act_rule structure. Not all actions use the log-format expression, but it must be initialized anyway. Otherwise, HAProxy may crash during the deinit when the release function is called. No backport needed. This patch should fix issue #468.	2020-01-27 15:51:57 +01:00
Willy Tarreau	74ab7d2b80	BUG/MINOR: tcpchecks: fix the connect() flags regarding delayed ack In issue #465, we see that Coverity detected dead code in checks.c which is in fact a missing parenthesis to build the connect() flags consecutive to the API change in commit `fdcb007ad8` ("MEDIUM: proto: Change the prototype of the connect() method."). The impact should be imperceptible as in the best case it may have resulted in a missed optimization trying to save a syscall or to merge outgoing packets. It may be backported as far as 2.0 though it's not critical.	2020-01-24 17:52:37 +01:00
Olivier Houchard	1fc5a648bf	MEDIUM: streams: Don't close the connection in back_handle_st_rdy(). In back_handle_st_rdy(), don't bother trying to close the connection, it should be taken care of somewhere else.	2020-01-24 15:40:34 +01:00
Olivier Houchard	7c30642ede	MEDIUM: streams: Don't close the connection in back_handle_st_con(). In back_handle_st_con(), don't bother trying to close the connection, it should be taken care of elsewhere.	2020-01-24 15:40:34 +01:00
Olivier Houchard	b43589cac5	BUG/MEDIUM: stream: Don't install the mux in back_handle_st_con(). In back_handle_st_con(), don't bother setting up the mux, it is now done by conn_fd_handler().	2020-01-24 15:40:34 +01:00
Olivier Houchard	efe5e8e998	BUG/MEDIUM: ssl: Don't forget to free ctx->ssl on failure. In ssl_sock_init(), if we fail to allocate the BIO, don't forget to free the SSL *, or we'd end up with a memory leak. This should be backported to 2.1 and 2.0.	2020-01-24 15:17:38 +01:00
Olivier Houchard	6d53cd6978	MINOR: ssl: Remove dead code. Now that we don't call the handshake function directly, but merely wake the tasklet, we can no longer have CO_FL_ERR, so don't bother checking it.	2020-01-24 15:13:57 +01:00
Frédéric Lécaille	3139c1b198	BUG/MINOR: ssl: Possible memleak when allowing the 0RTT data buffer. As the server early data buffer is allocated in the middle of the loop used to allocate the SSL session without being freed before retrying, this leads to a memory leak. To fix this we move the section of code responsible for this early data buffer allocation after the one responsible for allocating the SSL session. Must be backported to 2.1 and 2.0.	2020-01-24 15:12:21 +01:00
Olivier Houchard	ecffb7d841	BUG/MEDIUM: streams: Move the conn_stream allocation outside #IF USE_OPENSSL. When commit `477902bd2e` made the conn_stream allocation unconditional, it unfortunately moved the code doing the allocation inside #if USE_OPENSSL, which means anybody compiling haproxy without openssl wouldn't allocate any conn_stream, and would get a segfault later. Fix that by moving the code that does the allocation outside #if USE_OPENSSL.	2020-01-24 14:14:35 +01:00
Christopher Faulet	99ac8a1aa4	BUG/MINOR: stream: Be sure to have a listener to increment its counters In process_stream(), when a client or a server abort is handled, the corresponding listener's counter is incremented. But, we must be sure to have a listener attached to the session. This bug was introduced by the commit `cff0f739e5`. Thanks to Fred for reporting the bug. No need to backport this patch, except if commit `cff0f739e5` is backported.	2020-01-24 11:55:17 +01:00
Christopher Faulet	be20cf36af	BUG/MINOR: http-ana: Increment the backend counters on the backend A stupid cut-paste bug was introduced in the commit `cff0f739e5`. Backend counters must of course be incremented on the stream's backend. Not the frontend. No need to backport this patch, except if commit `cff0f739e5` is backported.	2020-01-24 11:55:17 +01:00
Willy Tarreau	645c588e71	BUILD: cfgparse: silence a bogus gcc warning on 32-bit machines A first patch was made during 2.0-dev to silence a bogus warning emitted by gcc : `dd1c8f1f72` ("MINOR: cfgparse: Add a cast to make gcc happier."), but it happens it was not sufficient as the warning re-appeared on 32-bit machines under gcc-8 and gcc-9 : src/cfgparse.c: In function 'check_config_validity': src/cfgparse.c:3642:33: warning: argument 1 range [2147483648, 4294967295] exceeds maximum object size 2147483647 [-Walloc-size-larger-than=] newsrv->idle_orphan_conns = calloc((unsigned int)global.nbthread, sizeof(*newsrv->idle_orphan_conns)); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This warning doesn't trigger in other locations, and it immediately vanishes if the previous or subsequent loops do not depend on global.nbthread anymore, or if the field ordering of the struct server changes! As discussed in the thread at: https://www.mail-archive.com/haproxy@formilux.org/msg36107.html playing with -Walloc-size-larger-than has no effect. And a minimal reproducer could be isolated, indicating it's pointless to circle around this one. Let's just cast nbthread to ushort so that gcc cannot make this wrong detection. It's unlikely we'll use more than 65535 threads in the near future anyway. This may be backported to older releases if they are also affected, at least to ease the job of distro maintainers. Thanks to Ilya for testing.	2020-01-24 11:30:06 +01:00
Tim Duesterhus	541fe1ec52	MINOR: lua: Add HLUA_PREPEND_C?PATH build option This complements the lua-prepend-path configuration option to allow distro maintainers to add a default path for HAProxy specific Lua libraries.	2020-01-24 09:22:03 +01:00
Tim Duesterhus	dd74b5f237	MINOR: lua: Add lua-prepend-path configuration option lua-prepend-path allows the administrator to specify a custom Lua library path to load custom Lua modules that are useful within the context of HAProxy without polluting the global Lua library folder.	2020-01-24 09:22:03 +01:00
Tim Duesterhus	c9fc9f2836	MINOR: lua: Add hlua_prepend_path function This function is added in preparation for following patches.	2020-01-24 09:21:35 +01:00
Willy Tarreau	bb2c4ae065	BUG/MEDIUM: mux-h2: make sure we don't emit TE headers with anything but "trailers" While the H2 parser properly checks for the absence of anything but "trailers" in the TE header field, we forget to check this when sending the request to an H2 server. The problem is that an H2->H2 conversion may keep "gzip" and fail on the next stage. This patch makes sure that we only send "TE: trailers" if the TE header contains the "trailers" token, otherwise it's dropped. This fixes issue #464 and should be backported till 1.9.	2020-01-24 09:07:53 +01:00
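The rule stated in this entry (only emit "TE: trailers" when the incoming TE value contains the "trailers" token, otherwise drop the header) could be sketched as a small token scan. `ex_te_has_trailers()` is a hypothetical helper, not the function used in mux-h2, and it assumes a comma-separated value where parameters after `;` can be ignored.

```c
#include <string.h>
#include <strings.h>   /* strncasecmp() (POSIX) */

/* Return 1 if the comma-separated TE header value <v> contains the
 * "trailers" token (case-insensitive), 0 otherwise. A caller would
 * then send exactly "TE: trailers" on 1 and drop the header on 0.
 */
int ex_te_has_trailers(const char *v)
{
    while (*v) {
        size_t n;

        /* skip separators and optional whitespace before the token */
        while (*v == ' ' || *v == '\t' || *v == ',')
            v++;

        /* token ends at a comma, a parameter or whitespace */
        n = strcspn(v, ",; \t");
        if (n == 8 && strncasecmp(v, "trailers", 8) == 0)
            return 1;

        /* jump to the next list element, if any */
        v += strcspn(v, ",");
    }
    return 0;
}
```

For a value such as "gzip, trailers" the helper returns 1, so only "trailers" would be forwarded and "gzip" would never reach the next H2 stage.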
Willy Tarreau	508d232a06	BUG/MINOR: stktable: report the current proxy name in error messages Since commit `1b8e68e89a` ("MEDIUM: stick-table: Stop handling stick-tables as proxies."), a rule referencing the current proxy with no table leads to the following error : [ALERT] 023/071924 (16479) : Proxy 'px': unable to find stick-table '(null)'. [ALERT] 023/071914 (16479) : Fatal errors found in configuration. for a config like this one: backend px stick on src This patch fixes it and should be backported as far as 2.0.	2020-01-24 07:19:34 +01:00
Willy Tarreau	f22758d12a	MINOR: connection: remove some unneeded checks for CO_FL_SOCK_WR_SH A few places in health checks and stream-int on the send path were still checking for this flag. Now we do not and instead we rely on snd_buf() to report the error if any. It's worth noting that all 3 real muxes still use CO_FL_SOCK_WR_SH and CO_FL_ERROR interchangeably at various places to decide to abort and/or free their data. This should be clarified and fixed so that only CO_FL_ERROR is used, and this will render the error paths simpler and more accurate.	2020-01-23 19:01:37 +01:00
Willy Tarreau	a8c7e8e3a8	MINOR: raw-sock: always check for CO_FL_SOCK_WR_SH before sending The test was added before splice() and send() to make sure we never accidentally send after a shutdown, because upper layers do not all check and it's not their job to do it. In such a case we also set errno to EPIPE so that the error can be accurately reported, e.g., in health checks.	2020-01-23 19:01:37 +01:00
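The guard described in this entry amounts to refusing to send once the write side is shut, and reporting EPIPE so the caller sees a meaningful error. A minimal sketch follows, where `EX_FL_SOCK_WR_SH` is a placeholder bit (not the real value of CO_FL_SOCK_WR_SH) and `ex_check_send()` is a hypothetical helper.

```c
#include <errno.h>

#define EX_FL_SOCK_WR_SH 0x00000001u   /* placeholder bit, value assumed */

/* Return 0 if sending is still allowed on a connection carrying
 * <flags>, or -1 with errno set to EPIPE if the write side was
 * already shut down.
 */
int ex_check_send(unsigned int flags)
{
    if (flags & EX_FL_SOCK_WR_SH) {
        errno = EPIPE;
        return -1;
    }
    return 0;
}
```

A sending function would call such a check before splice() or send() and return early on failure, leaving errno for the upper layer to report.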
Willy Tarreau	49139cb914	MINOR: connection: don't check for CO_FL_SOCK_WR_SH too early in handshakes Just like with CO_FL_SOCK_RD_SH, we don't need to check for this flag too early because conn_sock_send() already does it. No error was lost so it was harmless, it was only useless code.	2020-01-23 19:01:37 +01:00
Willy Tarreau	d838fb840c	MINOR: connection: do not check for CO_FL_SOCK_RD_SH too early The handshake functions dedicated to proxy proto, netscaler and socks4 all check for this flag before proceeding. This is wrong, they must not do and instead perform the call to recv() then report the close. The reason for this is that the current construct managed to lose the CO_ER_CIP_EMPTY error code in case the connection was already shut, thus causing a race condition with some errors being reported correctly or as unknown depending on the timing.	2020-01-23 18:05:18 +01:00
Willy Tarreau	6d015724ec	MINOR: connection: remove checks for CO_FL_HANDSHAKE before I/O There are still leftovers from the pre-xprt_handshake era with lots of places where I/O callbacks refrain from receiving/sending if they see that a handshake is present. This needlessly duplicates the subscribe calls as it will automatically be done by the underlying xprt_handshake code when attempting the operation. The only reason for still checking CO_FL_HANDSHAKE is when we decide to instantiate xprt_handshake. This patch removes all other ones.	2020-01-23 17:30:42 +01:00
Willy Tarreau	911db9bd29	MEDIUM: connection: use CO_FL_WAIT_XPRT more consistently than L4/L6/HANDSHAKE As mentioned in commit `c192b0ab95` ("MEDIUM: connection: remove CO_FL_CONNECTED and only rely on CO_FL_WAIT_*"), there is a lack of consistency on which flags are checked among L4/L6/HANDSHAKE depending on the code areas. A number of sample fetch functions only check for L4L6 to report MAY_CHANGE, some places only check for HANDSHAKE and many check both L4L6 and HANDSHAKE. This patch starts to make all of this more consistent by introducing a new mask CO_FL_WAIT_XPRT which is the union of L4/L6/HANDSHAKE and reports whether the transport layer is ready or not. All inconsistent call places were updated to rely on this one each time the goal was to check for the readiness of the transport layer.	2020-01-23 16:34:26 +01:00
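The composite mask introduced in this entry can be pictured as a plain bitwise union of the three wait conditions. In the sketch below the `EX_` names mirror the flags named in the commit message, but the bit values are placeholders and `ex_xprt_ready()` is a hypothetical helper; the real CO_FL_HANDSHAKE is itself a mask of several handshake flags, which is simplified here to a single bit.

```c
/* Placeholder bit values: these are NOT HAProxy's real flag values. */
#define EX_FL_WAIT_L4_CONN  0x00000001u  /* waiting for L4 to be connected */
#define EX_FL_WAIT_L6_CONN  0x00000002u  /* waiting for L6 to be connected */
#define EX_FL_HANDSHAKE     0x00000004u  /* some handshake still pending   */

/* Union of the three: set as long as the transport layer is not ready. */
#define EX_FL_WAIT_XPRT \
    (EX_FL_WAIT_L4_CONN | EX_FL_WAIT_L6_CONN | EX_FL_HANDSHAKE)

/* Return non-zero once none of the wait bits remain in <flags>. */
int ex_xprt_ready(unsigned int flags)
{
    return !(flags & EX_FL_WAIT_XPRT);
}
```

Checking one mask everywhere avoids the inconsistency the commit describes, where some call sites tested only L4/L6 and others only the handshake.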
Willy Tarreau	4450b587dd	MINOR: connection: remove CO_FL_SSL_WAIT_HS from CO_FL_HANDSHAKE Most places continue to check CO_FL_HANDSHAKE while in fact they should check CO_FL_HANDSHAKE_NOSSL, which contains all handshakes but the one dedicated to SSL renegotiation. In fact the SSL layer should be the only one checking CO_FL_SSL_WAIT_HS, so as to avoid processing data when a renegotiation is in progress, but other ones randomly include it without knowing. And ideally it should even be an internal flag that's not exposed in the connection. This patch takes CO_FL_SSL_WAIT_HS out of CO_FL_HANDSHAKE, uses this flag consistently all over the code, and gets rid of CO_FL_HANDSHAKE_NOSSL. In order to limit the confusion that has accumulated over time, the CO_FL_SSL_WAIT_HS flag which indicates an ongoing SSL handshake, possibly used by a renegotiation was moved after the other ones.	2020-01-23 16:34:26 +01:00
Willy Tarreau	18955db43d	MINOR: stream-int: always report received shutdowns As mentioned in `c192b0ab95` ("MEDIUM: connection: remove CO_FL_CONNECTED and only rely on CO_FL_WAIT_*"), si_cs_recv() currently does not propagate CS_FL_EOS to CF_READ_NULL if CO_FL_WAIT_L4L6 is set, while this situation doesn't exist anymore. Let's get rid of this confusing test.	2020-01-23 16:34:26 +01:00
Olivier Houchard	220a26c316	BUG/MEDIUM: 0rtt: Only consider the SSL handshake. We only add the Early-data header, or get ssl_fc_has_early to return 1, if we haven't already done the SSL handshake, as otherwise, we know the early data were fine, and there's no risk of replay attack. But to do so, we wrongly checked CO_FL_HANDSHAKE, we have to check CO_FL_SSL_WAIT_HS instead, as we don't care about the status of any other handshake. This should be backported to 2.1, 2.0, and 1.9. When deciding if we should add the Early-Data header, or if the sample fetch should return	2020-01-23 15:01:11 +01:00
Willy Tarreau	c192b0ab95	MEDIUM: connection: remove CO_FL_CONNECTED and only rely on CO_FL_WAIT_* Commit `477902bd2e` ("MEDIUM: connections: Get ride of the xprt_done callback.") broke the master CLI for a very obscure reason. It happens that short requests immediately terminated by a shutdown are properly received, CS_FL_EOS is correctly set, but in si_cs_recv(), we refrain from setting CF_SHUTR on the channel because CO_FL_CONNECTED was not yet set on the connection since we've not passed again through conn_fd_handler() and it was not done in conn_complete_session(). While commit `a8a415d31a` ("BUG/MEDIUM: connections: Set CO_FL_CONNECTED in conn_complete_session()") fixed the issue, such an accident may happen again as the root cause is deeper and actually comes down to the fact that CO_FL_CONNECTED is lazily set at various check points in the code but not every time we drop one wait bit. It is not the first time we face this situation. Originally this flag was used to detect the transition between WAIT_* and CONNECTED in order to call ->wake() from the FD handler. But since at least 1.8-dev1 with commit `7bf3fa3c23` ("BUG/MAJOR: connection: update CO_FL_CONNECTED before calling the data layer"), CO_FL_CONNECTED is always synchronized against the two others before being checked. Moreover, with the I/Os moved to tasklets, the decision to call the ->wake() function is performed after the I/Os in si_cs_process() and equivalent, which don't care about this transition either. So in essence, checking for CO_FL_CONNECTED has become a lazy way to check for (CO_FL_WAIT_L4_CONN | CO_FL_WAIT_L6_CONN), but that always relies on someone else having synchronized it. This patch addresses it once and for all by killing this flag and only checking the two others (for which a composite mask CO_FL_WAIT_L4L6 was added).
This revealed a number of inconsistencies that were purposely not addressed here for the sake of bisectability: - while most places do check both L4+L6 and HANDSHAKE at the same time, some places like assign_server() or back_handle_st_con() and a few sample fetches looking for proxy protocol do check for L4+L6 but don't care about HANDSHAKE ; these ones will probably fail on TCP request session rules if the handshake is not complete. - some handshake handlers do validate that a connection is established at L4 but didn't clear CO_FL_WAIT_L4_CONN - the ->ctl method of mux_fcgi, mux_pt and mux_h1 only checks for L4+L6 before declaring the mux ready while the snd_buf function also checks for the handshake's completion. Likely the former should validate the handshake as well and we should get rid of these extra tests in snd_buf. - raw_sock_from_buf() would directly set CO_FL_CONNECTED and would only later clear CO_FL_WAIT_L4_CONN. - xprt_handshake would set CO_FL_CONNECTED itself without actually clearing CO_FL_WAIT_L4_CONN, which could apparently happen only if waiting for a pure Rx handshake. - most places in ssl_sock that were checking CO_FL_CONNECTED don't need to include the L4 check as an L6 check is enough to decide whether to wait for more info or not. It also becomes obvious when reading the test in si_cs_recv() that caused the failure mentioned above that once converted it doesn't make any sense anymore: having CS_FL_EOS set while still waiting for L4 and L6 to complete cannot happen since for CS_FL_EOS to be set, the other ones must have been validated. Some of these parts will still deserve further cleanup, and some of the observations above may induce some backports of potential bug fixes once totally analyzed in their context. The risk of breaking existing stuff is too high to blindly backport everything.	2020-01-23 14:41:37 +01:00
Emmanuel Hocdet	078156d063	BUG/MINOR: ssl/cli: ocsp_issuer must be set w/ "set ssl cert" ocsp_issuer is primarily set from ckch->chain when PEM is loaded from file, but not set when PEM is loaded via CLI payload. Set ckch->ocsp_issuer in ssl_sock_load_pem_into_ckch to fix that. Should be backported in 2.1.	2020-01-23 14:33:14 +01:00
Olivier Houchard	a8a415d31a	BUG/MEDIUM: connections: Set CO_FL_CONNECTED in conn_complete_session(). We can't just assume conn_create_mux() will be called, and set CO_FL_CONNECTED, conn_complete_session() might be called synchronously if we're not using SSL, so we have no choice but to set CO_FL_CONNECTED in there. This should fix the recent breakage of the mcli reg tests.	2020-01-23 13:20:03 +01:00
William Lallemand	dad239d08b	BUG/MINOR: ssl: typo in previous patch The previous patch `5c3c96f` ("BUG/MINOR: ssl: memory leak w/ the ocsp_issuer") contains a typo that prevents it from building. Should be backported in 2.1.	2020-01-23 11:59:02 +01:00
William Lallemand	5c3c96fd36	BUG/MINOR: ssl: memory leak w/ the ocsp_issuer This patch frees the ocsp_issuer in ssl_sock_free_cert_key_and_chain_contents(). Should be backported in 2.1.	2020-01-23 11:57:39 +01:00
William Lallemand	b829dda57b	BUG/MINOR: ssl: increment issuer refcount if in chain When using the OCSP response, if the issuer of the response is in the certificate chain, its address will be stored in ckch->ocsp_issuer. However, since the ocsp_issuer could be filled by a separate file, this pointer is free'd. The refcount of the X509 need to be incremented to avoid a double free if we free the ocsp_issuer AND the chain.	2020-01-23 11:57:39 +01:00
Willy Tarreau	027d206b57	CLEANUP: stats: shut up a wrong null-deref warning from gcc 9.2 As reported in bug #447, gcc 9.2 invents impossible code paths and then complains that we don't check for our pointers to be NULL... This code path is not critical, better add the test to shut it up than try to help it being less creative. This code hasn't changed for a while, so it could help distros to backport this to older releases.	2020-01-23 11:49:02 +01:00
Willy Tarreau	79fd577ac1	CLEANUP: backend: shut another false null-deref in back_handle_st_con() objt_conn() may return a NULL though here we don't have this situation anymore since the connection is always there, so let's simply switch to the unchecked __objt_conn(). This addresses issue #454.	2020-01-23 11:40:40 +01:00
Willy Tarreau	b1a40c72e7	CLEANUP: backend: remove useless test for inexistent connection Coverity rightfully reported that it's pointless to test for "conn" to be null while all code paths leading to it have already dereferenced it. This addresses issue #461.	2020-01-23 11:37:43 +01:00
William Lallemand	75b15f790f	BUG/MINOR: ssl/cli: free the previous ckch content once a PEM is loaded When using "set ssl cert" on the CLI, if we load a new PEM, the previous sctl, issuer and OCSP response are still loaded. This doesn't make any sense since they won't be usable with a new private key. This patch frees the previous data. Should be backported in 2.1.	2020-01-23 11:08:46 +01:00
Adis Nezirovic	d0142e7224	MINOR: cli: Report location of errors or any extra data for "show table" When using multiple filters with "show table", it can be useful to report which filter entry failed > show table MY_TABLE data.gpc0 gt 0 data.gpc0a lt 1000 Filter entry #2: Unknown data type > show table MY_TABLE data.gpc0 gt 0 data.gpc0 lt 1000a Filter entry #2: Require a valid integer value to compare against We now also catch garbage data after the filter > show table MY_TABLE data.gpc0 gt 0 data.gpc0 lt 1000 data.gpc0 gt 1\ data.gpc0 gt 10 a Detected extra data in filter, 16th word of input, after '10' Even before multi-filter feature we've also silently accepted garbage after the input, hiding potential bugs > show table MY_TABLE data.gpc0 gt 0 data.gpc0 or > show table MY_TABLE data.gpc0 gt 0 a In both cases, only first filter entry would be used, silently ignoring extra filter entry or garbage data. Last, but not the least, it is now possible to detect multi-filter feature from cli with something like the following: > show table MY_TABLE data.blah Filter entry #1: Unknown data type	2020-01-23 10:43:52 +01:00
Olivier Houchard	477902bd2e	MEDIUM: connections: Get ride of the xprt_done callback. The xprt_done_cb callback was used to defer some connection initialization until we're connected and the handshake are done. As it mostly consists of creating the mux, instead of using the callback, introduce a conn_create_mux() function, that will just call conn_complete_session() for frontend, and create the mux for backend. In h2_wake(), make sure we call the wake method of the stream_interface, as we no longer wakeup the stream task.	2020-01-22 18:56:05 +01:00
Olivier Houchard	8af03b396a	MEDIUM: streams: Always create a conn_stream in connect_server(). In connect_server(), when creating a new connection for which we don't yet know the mux (because it'll be decided by the ALPN), instead of associating the connection to the stream_interface, always create a conn_stream. This way, we have less special-casing needed. Store the conn_stream in conn->ctx, so that we can reach the upper layers if needed.	2020-01-22 18:55:59 +01:00
Adis Nezirovic	56dd354b3c	BUG/MINOR: cli: Missing arg offset for filter data values. We don't properly check for missing data values for additional filter entries, passing out of bounds index to args[], then passing to strlen. Introduced in commit `1a693fc2`: (MEDIUM: cli: Allow multiple filter entries for "show table")	2020-01-22 18:09:06 +01:00
Willy Tarreau	2b64a35184	BUILD: stick-table: fix build errors introduced by last stick-table change Last commit `1a693fc2fd` ("MEDIUM: cli: Allow multiple filter entries for "show table"") broke the build at two places: src/stick_table.c: In function 'table_prepare_data_request': src/stick_table.c:3620:33: warning: ordered comparison of pointer with integer zero [-Wextra] src/stick_table.c: In function 'cli_io_handler_table': src/stick_table.c:3763:5: error: 'for' loop initial declarations are only allowed in C99 mode src/stick_table.c:3763:5: note: use option -std=c99 or -std=gnu99 to compile your code make: *** [src/stick_table.o] Error 1 This patch fixes both. No backport needed.	2020-01-22 17:11:00 +01:00
Emmanuel Hocdet	6b5b44e10f	BUG/MINOR: ssl: ssl_sock_load_pem_into_ckch is not consistent "set ssl cert <filename> <payload>" CLI command should have the same result as reloading HAProxy with the updated pem file (<filename>). This is not the case: DHparams/cert-chain is kept from the previous context if no DHparams/cert-chain is set in the context (<payload>). This patch should be backported to 2.1	2020-01-22 15:55:55 +01:00
Olivier Houchard	1a9dbe58a6	BUG/MEDIUM: netscaler: Don't forget to allocate storage for conn->src/dst. In conn_recv_netscaler_cip(), don't forget to allocate conn->src and conn->dst, as those are now dynamically allocated. Not doing so results in getting a crash when using netscaler. This should fix github issue #460. This should be backported to 2.1.	2020-01-22 15:33:03 +01:00
Adis Nezirovic	1a693fc2fd	MEDIUM: cli: Allow multiple filter entries for "show table" For complex stick tables with many entries/columns, it can be beneficial to filter using multiple criteria. The maximum number of filter entries can be controlled by defining STKTABLE_FILTER_LEN during build time. This patch can be backported to older releases.	2020-01-22 14:33:17 +01:00
Willy Tarreau	71f95fa20e	[RELEASE] Released version 2.2-dev1 Released version 2.2-dev1 with the following main changes : - DOC: this is development again - MINOR: version: this is development again, update the status - SCRIPTS: update create-release to fix the changelog on new branches - CLEANUP: ssl: Clean up error handling - BUG/MINOR: contrib/prometheus-exporter: decode parameter and value only - BUG/MINOR: h1: Don't test the host header during response parsing - BUILD/MINOR: trace: fix use of long type in a few printf format strings - DOC: Clarify behavior of server maxconn in HTTP mode - MINOR: ssl: deduplicate ca-file - MINOR: ssl: compute ca-list from deduplicate ca-file - MINOR: ssl: deduplicate crl-file - CLEANUP: dns: resolution can never be null - BUG/MINOR: http-htx: Don't make http_find_header() fail if the value is empty - DOC: ssl/cli: set/commit/abort ssl cert - BUG/MINOR: ssl: fix SSL_CTX_set1_chain compatibility for openssl < 1.0.2 - BUG/MINOR: fcgi-app: Make the directive pass-header case insensitive - BUG/MINOR: stats: Fix HTML output for the frontends heading - BUG/MINOR: ssl: fix X509 compatibility for openssl < 1.1.0 - DOC: clarify matching strings on binary fetches - DOC: Fix ordered list in summary - DOC: move the "group" keyword at the right place - MEDIUM: init: prevent process and thread creation at runtime - BUG/MINOR: ssl/cli: 'ssl cert' cmd only usable w/ admin rights - BUG/MEDIUM: stream-int: don't subscribed for recv when we're trying to flush data - BUG/MINOR: stream-int: avoid calling rcv_buf() when splicing is still possible - BUG/MINOR: ssl/cli: don't overwrite the filters variable - BUG/MEDIUM: listener/thread: fix a race when pausing a listener - BUG/MINOR: ssl: certificate choice can be unexpected with openssl >= 1.1.1 - BUG/MEDIUM: mux-h1: Never reuse H1 connection if a shutw is pending - BUG/MINOR: mux-h1: Don't rely on CO_FL_SOCK_RD_SH to set H1C_F_CS_SHUTDOWN - BUG/MINOR: mux-h1: Fix conditions to know whether or not we 
may receive data - BUG/MEDIUM: tasks: Make sure we switch wait queues in task_set_affinity(). - BUG/MEDIUM: checks: Make sure we set the task affinity just before connecting. - MINOR: debug: replace popen() with pipe+fork() in "debug dev exec" - MEDIUM: init: set NO_NEW_PRIVS by default when supported - BUG/MINOR: mux-h1: Be sure to set CS_FL_WANT_ROOM when EOM can't be added - BUG/MEDIUM: mux-fcgi: Handle cases where the HTX EOM block cannot be inserted - BUG/MINOR: proxy: make soft_stop() also close FDs in LI_PAUSED state - BUG/MINOR: listener/threads: always use atomic ops to clear the FD events - BUG/MINOR: listener: also clear the error flag on a paused listener - BUG/MEDIUM: listener/threads: fix a remaining race in the listener's accept() - MINOR: listener: make the wait paths cleaner and more reliable - MINOR: listener: split dequeue_all_listener() in two - REORG: listener: move the global listener queue code to listener.c - DOC: document the listener state transitions - BUG/MEDIUM: kqueue: Make sure we report read events even when no data. - BUG/MAJOR: dns: add minimalist error processing on the Rx path - BUG/MEDIUM: proto_udp/threads: recv() and send() must not be exclusive. 
- DOC: listeners: add a few missing transitions - BUG/MINOR: tasks: only requeue a task if it was already in the queue - MINOR: tasks: split wake_expired_tasks() in two parts to avoid useless wakeups - DOC: proxies: HAProxy only supports 3 connection modes - DOC: remove references to the outdated architecture.txt - BUG/MINOR: log: fix minor resource leaks on logformat error path - BUG/MINOR: mworker: properly pass SIGTTOU/SIGTTIN to workers - BUG/MINOR: listener: do not immediately resume on transient error - BUG/MINOR: server: make "agent-addr" work on default-server line - BUG/MINOR: listener: fix off-by-one in state name check - BUILD/MINOR: unix sockets: silence an absurd gcc warning about strncpy() - MEDIUM: h1-htx: Add HTX EOM block when the message is in H1_MSG_DONE state - MINOR: http-htx: Add some htx sample fetches for debugging purpose - REGTEST: Add an HTX reg-test to check an edge case - DOC: clarify the fact that replace-uri works on a full URI - BUG/MINOR: sample: fix the closing bracket and LF in the debug converter - BUG/MINOR: sample: always check converters' arguments - MINOR: sample: Validate the number of bits for the sha2 converter - BUG/MEDIUM: ssl: Don't set the max early data we can receive too early. - MINOR: ssl/cli: 'show ssl cert' give information on the certificates - BUG/MINOR: ssl/cli: fix build for openssl < 1.0.2 - MINOR: debug: support logging to various sinks - MINOR: http: add a new "replace-path" action - REGTEST: ssl: test the "set ssl cert" CLI command - REGTEST: run-regtests: implement #REQUIRE_BINARIES - MINOR: task: only check TASK_WOKEN_ANY to decide to requeue a task - BUG/MAJOR: task: add a new TASK_SHARED_WQ flag to fix foreing requeuing - BUG/MEDIUM: ssl: Revamp the way early data are handled. 
- MINOR: fd/threads: make _GET_NEXT()/_GET_PREV() use the volatile attribute - BUG/MEDIUM: fd/threads: fix a concurrency issue between add and rm on the same fd - REGTEST: make the "set ssl cert" require version 2.1 - BUG/MINOR: ssl: openssl-compat: Fix getm_ defines - BUG/MEDIUM: state-file: do not allocate a full buffer for each server entry - BUG/MINOR: state-file: do not store duplicates in the global tree - BUG/MINOR: state-file: do not leak memory on parse errors - BUG/MAJOR: mux-h1: Don't pretend the input channel's buffer is full if empty - BUG/MEDIUM: stream: Be sure to never assign a TCP backend to an HTX stream - BUILD: ssl: improve SSL_CTX_set_ecdh_auto compatibility - BUILD: travis-ci: link with ssl libraries using rpath instead of LD_LIBRARY_PATH/DYLD_LIBRARY_PATH - BUILD: travis-ci: reenable address sanitizer for clang builds - BUG/MINOR: checks: refine which errno values are really errors. - BUG/MINOR: connection: only wake send/recv callbacks if the FD is active - CLEANUP: connection: conn->xprt is never NULL - MINOR: pollers: add a new flag to indicate pollers reporting ERR & HUP - MEDIUM: tcp: make tcp_connect_probe() consider ERR/HUP - REORG: connection: move tcp_connect_probe() to conn_fd_check() - MINOR: connection: check for connection validation earlier - MINOR: connection: remove the double test on xprt_done_cb() - CLEANUP: connection: merge CO_FL_NOTIFY_DATA and CO_FL_NOTIFY_DONE - MINOR: poller: do not call the IO handler if the FD is not active - OPTIM: epoll: always poll for recv if neither active nor ready - OPTIM: polling: do not create update entries for FD removal - BUG/MEDIUM: checks: Only attempt to do handshakes if the connection is ready. - BUG/MEDIUM: connections: Hold the lock when wanting to kill a connection. - BUILD: CI: modernize cirrus-ci - MINOR: config: disable busy polling on old processes - MINOR: ssl: Remove unused variable "need_out". 
- BUG/MINOR: h1: Report the right error position when a header value is invalid - BUG/MINOR: proxy: Fix input data copy when an error is captured - BUG/MEDIUM: http-ana: Truncate the response when a redirect rule is applied - BUG/MINOR: channel: inject output data at the end of output - BUG/MEDIUM: session: do not report a failure when rejecting a session - MEDIUM: dns: implement synchronous send - MINOR: raw_sock: make sure to disable polling once everything is sent - MINOR: http: Add 410 to http-request deny - MINOR: http: Add 404 to http-request deny - CLEANUP: mux-h2: remove unused goto "out_free_h2s" - BUILD: cirrus-ci: choose proper openssl package name - BUG/MAJOR: listener: do not schedule a task-less proxy - CLEANUP: server: remove unused err section in server_finalize_init - REGTEST: set_ssl_cert.vtc: replace "echo" with "printf" - BUG/MINOR: stream-int: Don't trigger L7 retry if max retries is already reached - BUG/MEDIUM: tasks: Use the MT macros in tasklet_free(). - BUG/MINOR: mux-h2: use a safe list_for_each_entry in h2_send() - BUG/MEDIUM: mux-h2: fix missing test on sending_list in previous patch - CLEANUP: ssl: remove opendir call in ssl_sock_load_cert - MEDIUM: lua: don't call the GC as often when dealing with outgoing connections - BUG/MEDIUM: mux-h2: don't stop sending when crossing a buffer boundary - BUG/MINOR: cli/mworker: can't start haproxy with 2 programs - REGTEST: mcli/mcli_start_progs: start 2 programs - BUG/MEDIUM: mworker: remain in mworker mode during reload - DOC: clarify crt-base usage - CLEANUP: compression: remove unused deinit_comp_ctx section - BUG/MEDIUM: mux_h1: Don't call h1_send if we subscribed(). - BUG/MEDIUM: raw_sock: Make sur the fd and conn are sync. 
- CLEANUP: proxy: simplify proxy_parse_rate_limit proxy checks - BUG/MAJOR: hashes: fix the signedness of the hash inputs - REGTEST: add sample_fetches/hashes.vtc to validate hashes - BUG/MEDIUM: cli: _getsocks must send the peers sockets - CLEANUP: cli: deduplicate the code in _getsocks - BUG/MINOR: stream: don't mistake match rules for store-request rules - BUG/MEDIUM: connection: add a mux flag to indicate splice usability - BUG/MINOR: pattern: handle errors from fgets when trying to load patterns - MINOR: connection: move the CO_FL_WAIT_ROOM cleanup to the reader only - MINOR: stream-int: remove dependency on CO_FL_WAIT_ROOM for rcv_buf() - MEDIUM: connection: get rid of CO_FL_CURR_* flags - BUILD: pattern: include errno.h - MEDIUM: mux-h2: do not try to stop sending streams on blocked mux - MEDIUM: mux-fcgi: do not try to stop sending streams on blocked mux - MEDIUM: mux-h2: do not make an h2s subscribe to itself on deferred shut - MEDIUM: mux-fcgi: do not make an fstrm subscribe to itself on deferred shut - REORG: stream/backend: move backend-specific stuff to backend.c - MEDIUM: backend: move the connection finalization step to back_handle_st_con() - MEDIUM: connection: merge the send_wait and recv_wait entries - MEDIUM: xprt: merge recv_wait and send_wait in xprt_handshake - MEDIUM: ssl: merge recv_wait and send_wait in ssl_sock - MEDIUM: mux-h1: merge recv_wait and send_wait - MEDIUM: mux-h2: merge recv_wait and send_wait event notifications - MEDIUM: mux-fcgi: merge recv_wait and send_wait event notifications - MINOR: connection: make the last arg of subscribe() a struct wait_event* - MINOR: ssl: Add support for returning the dn samples from ssl_(c\|f)_(i\|s)_dn in LDAP v3 (RFC2253) format. 
- DOC: Fix copy and paste mistake in http-response replace-value doc - BUG/MINOR: cache: Fix leak of cache name in error path - BUG/MINOR: dns: Make dns_query_id_seed unsigned - BUG/MINOR: 51d: Fix bug when HTX is enabled - MINOR: http-htx: Move htx sample fetches in the scope "internal" - MINOR: http-htx: Rename 'internal.htx_blk.val' to 'internal.htx_blk.data' - MINOR: http-htx: Make 'internal.htx_blk_data' return a binary string - DOC: Add a section to document the internal sample fetches - MINOR: mux-h1: Inherit send flags from the upper layer - MINOR: contrib/prometheus-exporter: Add heathcheck status/code in server metrics - BUG/MINOR: http-ana/filters: Wait end of the http_end callback for all filters - BUG/MINOR: http-rules: Remove buggy deinit functions for HTTP rules - BUG/MINOR: stick-table: Use MAX_SESS_STKCTR as the max track ID during parsing - MEDIUM: http-rules: Register an action keyword for all http rules - MINOR: tcp-rules: Always set from which ruleset a rule comes from - MINOR: actions: Use ACT_RET_CONT code to ignore an error from a custom action - MINOR: tcp-rules: Kill connections when custom actions return ACT_RET_ERR - MINOR: http-rules: Return an error when custom actions return ACT_RET_ERR - MINOR: counters: Add a counter to report internal processing errors - MEDIUM: http-ana: Properly handle internal processing errors - MINOR: http-rules: Add a rule result to report internal error - MINOR: http-rules: Handle internal errors during HTTP rules evaluation - MINOR: http-rules: Add more return codes to let custom actions act as normal ones - MINOR: tcp-rules: Handle denied/aborted/invalid connections from TCP rules - MINOR: http-rules: Handle denied/aborted/invalid connections from HTTP rules - MINOR: stats: Report internal errors in the proxies/listeners/servers stats - MINOR: contrib/prometheus-exporter: Export internal errors per proxy/server - MINOR: counters: Remove failed_secu counter and use denied_resp instead - MINOR: counters: 
Review conditions to increment counters from analysers - MINOR: http-ana: Add a txn flag to support soft/strict message rewrites - MINOR: http-rules: Handle all message rewrites the same way - MINOR: http-rules: Add a rule to enable or disable the strict rewriting mode - MEDIUM: http-rules: Enable the strict rewriting mode by default - REGTEST: Fix format of set-uri HTTP request rule in h1or2_to_h1c.vtc - MINOR: actions: Add a function pointer to release args used by actions - MINOR: actions: Regroup some info about HTTP rules in the same struct - MINOR: http-rules/tcp-rules: Call the defined action function first if defined - MINOR: actions: Rename the act_flag enum into act_opt - MINOR: actions: Add flags to configure the action behaviour - MINOR: actions: Use an integer to set the action type - MINOR: http-rules: Use a specific action type for some custom HTTP actions - MINOR: http-rules: Make replace-header and replace-value custom actions - MINOR: http-rules: Make set-header and add-header custom actions - MINOR: http-rules: Make set/del-map and add/del-acl custom actions - MINOR: http-rules: Group all processing of early-hint rule in its case clause - MEDIUM: http-rules: Make early-hint custom actions - MINOR: http-rule/tcp-rules: Make track-sc* custom actions - MINOR: tcp-rules: Make tcp-request capture a custom action - MINOR: http-rules: Add release functions for existing HTTP actions - BUG/MINOR: http-rules: Fix memory releases on error path during action parsing - MINOR: tcp-rules: Add release functions for existing TCP actions - BUG/MINOR: tcp-rules: Fix memory releases on error path during action parsing - MINOR: http-htx: Add functions to read a raw error file and convert it in HTX - MINOR: http-htx: Add functions to create HTX redirect message - MINOR: config: Use dedicated function to parse proxy's errorfiles - MINOR: config: Use dedicated function to parse proxy's errorloc - MEDIUM: http-htx/proxy: Use a global and centralized storage for HTTP 
error messages - MINOR: proxy: Register keywords to parse errorfile and errorloc directives - MINOR: http-htx: Add a new section to create groups of custom HTTP errors - MEDIUM: proxy: Add a directive to reference an http-errors section in a proxy - MINOR: http-rules: Update txn flags and status when a deny rule is executed - MINOR: http-rules: Support an optional status on deny rules for http reponses - MINOR: http-rules: Use same function to parse request and response deny actions - MINOR: http-ana: Add an error message in the txn and send it when defined - MEDIUM: http-rules: Support an optional error message in http deny rules - REGTEST: Add a strict rewriting mode reg test - REGEST: Add reg tests about error files - MINOR: ssl: accept 'verify' bind option with 'set ssl cert' - BUG/MINOR: ssl: ssl_sock_load_ocsp_response_from_file memory leak - BUG/MINOR: ssl: ssl_sock_load_issuer_file_into_ckch memory leak - BUG/MINOR: ssl: ssl_sock_load_sctl_from_file memory leak - BUG/MINOR: http_htx: Fix some leaks on error path when error files are loaded - CLEANUP: http-ana: Remove useless test on txn when the error message is retrieved - BUILD: CI: introduce ARM64 builds - BUILD: ssl: more elegant anti-replay feature presence check - MINOR: proxy/http-ana: Add support of extra attributes for the cookie directive - MEDIUM: dns: use Additional records from SRV responses - CLEANUP: Consistently `unsigned int` for bitfields - CLEANUP: pattern: remove the pat_time definition - BUG/MINOR: http_act: don't check capture id in backend - BUG/MINOR: ssl: fix build on development versions of openssl-1.1.x	2020-01-22 10:34:58 +01:00
Baptiste Assmann	19a69b3740	BUG/MINOR: http_act: don't check capture id in backend A wrong behavior was introduced by `e9544935e8`, which prevented loading any configuration where a capture slot id is used in a backend. For example, the configuration below does not parse: frontend f bind *:80 declare capture request len 32 default_backend webserver backend webserver http-request capture req.hdr(Host) id 1 Such a configuration is valid and should run. This patch enforces the check of the capture slot id only if the action rule is configured in a frontend. At configuration parsing time, it is impossible to check which frontend could point to this backend (even more so if dynamic backend name resolution is used at runtime). The documentation has been updated to warn the user to ensure that relevant frontends have the required declaration when such a rule has to be used in a backend. If no capture slot can be found, then the action will just not be executed and HAProxy will process the next one in the list, as expected. This should be backported to all supported branches (bug created as part of a bug fix introduced into 1.7 and backported to 1.6).	2020-01-22 07:44:36 +01:00
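The configuration quoted in the commit above reads more easily one directive per line. This is only a re-layout of the text from the commit message; the indentation is an assumption.

```haproxy
frontend f
    bind *:80
    declare capture request len 32
    default_backend webserver

backend webserver
    # Accepted at parse time after this fix. Whether the slot exists is
    # only known at runtime, once a frontend routes traffic here.
    http-request capture req.hdr(Host) id 1
```

If no matching capture slot is found at runtime, the commit states that the action is skipped and the next rule is processed.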
Baptiste Assmann	13a9232ebc	MEDIUM: dns: use Additional records from SRV responses Most DNS servers provide A/AAAA records in the Additional section of a response, which correspond to the SRV records from the Answer section: ;; QUESTION SECTION: ;_http._tcp.be1.domain.tld. IN SRV ;; ANSWER SECTION: _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A1.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A8.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A5.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A6.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A4.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A3.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A2.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A7.domain.tld. ;; ADDITIONAL SECTION: A1.domain.tld. 3600 IN A 192.168.0.1 A8.domain.tld. 3600 IN A 192.168.0.8 A5.domain.tld. 3600 IN A 192.168.0.5 A6.domain.tld. 3600 IN A 192.168.0.6 A4.domain.tld. 3600 IN A 192.168.0.4 A3.domain.tld. 3600 IN A 192.168.0.3 A2.domain.tld. 3600 IN A 192.168.0.2 A7.domain.tld. 3600 IN A 192.168.0.7 SRV record support was introduced in HAProxy 1.8 and the first design did not take into account the records from the Additional section. Instead, a new resolution is associated to each server with its relevant FQDN. This behavior generates a lot of DNS requests (1 SRV + 1 per server associated). This patch aims at fixing this by: - when a DNS response is validated, we associate A/AAAA records to relevant SRV ones - set a flag on associated servers to prevent them from running a DNS resolution for said FQDN - update server IP address with information found in the Additional section If no relevant record can be found in the Additional section, then HAProxy will fall back to running a dedicated resolution for this server, as it used to do. This behavior is the one described in RFC 2782.	2020-01-22 07:19:54 +01:00
Christopher Faulet	2f5339079b	MINOR: proxy/http-ana: Add support of extra attributes for the cookie directive It is now possible to insert any attribute when a cookie is inserted by HAProxy. Any value may be set, no check is performed except the syntax validity (CTRL chars and ';' are forbidden). For instance, it may be used to add the SameSite attribute: cookie SRV insert attr "SameSite=Strict" The attr option may be repeated to add several attributes. This patch should fix the issue #361.	2020-01-22 07:18:31 +01:00
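A minimal sketch of the `attr` option described above, repeated to add two attributes. The backend name, server names and addresses are hypothetical, and the second attribute is only an illustration of repeating the option.

```haproxy
backend app                          # hypothetical backend name
    mode http
    # "attr" may be repeated; each value is appended to the inserted cookie.
    cookie SRV insert attr "SameSite=Strict" attr "Secure"
    server s1 192.0.2.10:80 cookie s1   # hypothetical servers
    server s2 192.0.2.11:80 cookie s2
```

Per the commit message, only the syntax of each value is checked (control characters and ';' are rejected), so the attribute semantics are left to the user.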
Ilya Shipitsin	e9ff8992a1	BUILD: ssl: more elegant anti-replay feature presence check Instead of tracking the version number to figure whether SSL_OP_NO_ANTI_REPLAY is defined, simply rely on its definition.	2020-01-22 06:50:21 +01:00
Christopher Faulet	53a87e134e	CLEANUP: http-ana: Remove useless test on txn when the error message is retrieved In http_error_message(), the HTTP txn is always defined. So, there is no reason to test its nullity. This patch partially fixes the issue #457.	2020-01-21 11:12:37 +01:00
Christopher Faulet	7cde96c829	BUG/MINOR: http_htx: Fix some leaks on error path when error files are loaded No backports needed. This patch partially fixes the issue #457.	2020-01-21 11:12:37 +01:00
Emmanuel Hocdet	224a087a27	BUG/MINOR: ssl: ssl_sock_load_sctl_from_file memory leak "set ssl cert <filename.sctl> <payload>" CLI command must free previous context. This patch should be backported to 2.1	2020-01-21 10:44:33 +01:00
Emmanuel Hocdet	eb73dc34bb	BUG/MINOR: ssl: ssl_sock_load_issuer_file_into_ckch memory leak "set ssl cert <filename.issuer> <payload>" CLI command must free previous context. This patch should be backported to 2.1	2020-01-21 10:44:33 +01:00
Emmanuel Hocdet	0667faebcf	BUG/MINOR: ssl: ssl_sock_load_ocsp_response_from_file memory leak "set ssl cert <filename.ocsp> <payload>" CLI command must free previous context. This patch should be backported to 2.1	2020-01-21 10:44:33 +01:00
Emmanuel Hocdet	ebf840bf37	MINOR: ssl: accept 'verify' bind option with 'set ssl cert' Since patches initiated with `d4f9a60e` "MINOR: ssl: deduplicate ca-file", no more file access is done for 'verify' bind options (crl/ca file). Remove conditional restriction for "set ssl cert" CLI commands.	2020-01-21 09:58:41 +01:00
Christopher Faulet	554c0ebffd	MEDIUM: http-rules: Support an optional error message in http deny rules It is now possible to set the error message to use when a deny rule is executed. It may be a specific error file, adding "errorfile <file>" : http-request deny deny_status 400 errorfile /etc/haproxy/errorfiles/400badreq.http It may also be an error file from an http-errors section, adding "errorfiles <name>" : http-request deny errorfiles my-errors # use 403 error from "my-errors" section When defined, this error message is set in the HTTP transaction. The tarpit rule is also concerned by this change.	2020-01-20 15:18:46 +01:00
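The two forms of deny rule described above can be sketched together as follows. The rules themselves come from the commit message; the frontend name, the `bind` line, the ACL conditions and the 403 file path are hypothetical, and the syntax shown is the one from this commit, which later releases may have extended.

```haproxy
http-errors my-errors
    errorfile 403 /etc/haproxy/errorfiles/403.http   # hypothetical path

frontend frt                                          # hypothetical name
    bind *:80
    # Deny with an explicit status and a specific error file.
    http-request deny deny_status 400 errorfile /etc/haproxy/errorfiles/400badreq.http if { path_beg /bad }
    # Deny using the 403 error from the "my-errors" section.
    http-request deny errorfiles my-errors if { path_beg /private }
```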
Christopher Faulet	473e880a25	MINOR: http-ana: Add an error message in the txn and send it when defined It is now possible to set the error message to return to client in the HTTP transaction. If it is defined, this error message is used instead of proxy's errors or default errors.	2020-01-20 15:18:46 +01:00
Christopher Faulet	e0fca297d5	MINOR: http-rules: Use same function to parse request and response deny actions Because there is no more difference between http-request and http-response rules, the same function is now used to parse them.	2020-01-20 15:18:46 +01:00
Christopher Faulet	040c8cdbbe	MINOR: http-rules: Support an optional status on deny rules for http reponses It is now possible to specify the status code returned by http-response deny rules. For instance : http-response deny deny_status 500	2020-01-20 15:18:46 +01:00
Christopher Faulet	b58f62b316	MINOR: http-rules: Update txn flags and status when a deny rule is executed When a deny rule is executed, the flag TX_CLDENY and the status code are set on the HTTP transaction. Now, these steps are handled by the code executing the deny rule. So into http_req_get_intercept_rule() for the request and http_res_get_intercept_rule() for the response.	2020-01-20 15:18:46 +01:00
Christopher Faulet	76edc0f29c	MEDIUM: proxy: Add a directive to reference an http-errors section in a proxy It is now possible to import in a proxy, fully or partially, error files declared in an http-errors section. It may be done using the "errorfiles" directive, followed by a name and optionally a list of status code. If there is no status code specified, all error files of the http-errors section are imported. Otherwise, only error files associated to the listed status code are imported. For instance : http-errors my-errors errorfile 400 ... errorfile 403 ... errorfile 404 ... frontend frt errorfiles my-errors 403 404 # ==> error 400 not imported	2020-01-20 15:18:46 +01:00
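The example from the commit above, laid out one directive per line. The `...` placeholders are kept exactly as in the commit message and stand for the error file paths; only the indentation is assumed.

```haproxy
http-errors my-errors
    errorfile 400 ...
    errorfile 403 ...
    errorfile 404 ...

frontend frt
    errorfiles my-errors 403 404  # ==> error 400 not imported
```

Omitting the status codes after the section name would import all three error files.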
Christopher Faulet	35cd81d363	MINOR: http-htx: Add a new section to create groups of custom HTTP errors A new section may now be declared in the configuration to create global groups of HTTP errors. These groups are not linked to a proxy and are referenced by name. The section must be declared using the keyword "http-errors" followed by the group name. This name must be unique. A list of "errorfile" directives may be declared in such section. For instance: http-errors website-1 errorfile 400 /path/to/site1/400.http errorfile 404 /path/to/site1/404.http http-errors website-2 errorfile 400 /path/to/site2/400.http errorfile 404 /path/to/site2/404.http For now, it is just possible to create "http-errors" sections. There is no documentation because these groups are not used yet.	2020-01-20 15:18:46 +01:00
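The two sections quoted in the commit above, laid out one directive per line. The names and paths are those of the commit message; only the indentation is assumed.

```haproxy
http-errors website-1
    errorfile 400 /path/to/site1/400.http
    errorfile 404 /path/to/site1/404.http

http-errors website-2
    errorfile 400 /path/to/site2/400.http
    errorfile 404 /path/to/site2/404.http
```

At the time of this commit the sections can be declared but are not yet referenced from a proxy.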
Christopher Faulet	07f41f79cb	MINOR: proxy: Register keywords to parse errorfile and errorloc directives errorfile and errorloc directives are now parsed in dedicated functions in http_htx.c.	2020-01-20 15:18:46 +01:00
Christopher Faulet	5885775de1	MEDIUM: http-htx/proxy: Use a global and centralized storage for HTTP error messages All custom HTTP errors are now stored in a global tree. Proxies use references to these messages. The key used for errorfile directives is the file name as specified in the configuration. For errorloc directives, a key is created using the redirect code and the url. This means that the same custom error message is now stored only once. It may be used in several proxies or for several status codes, but it is only parsed and stored once.	2020-01-20 15:18:46 +01:00
Christopher Faulet	ac2412fee8	MINOR: config: Use dedicated function to parse proxy's errorloc The parsing of the "errorloc" directive is now handled by the function http_parse_errorloc().	2020-01-20 15:18:45 +01:00
Christopher Faulet	13d297f3d6	MINOR: config: Use dedicated function to parse proxy's errorfiles The parsing of the "errorfile" directive is now handled by the function http_parse_errorfile().	2020-01-20 15:18:45 +01:00
Christopher Faulet	bdf6526e94	MINOR: http-htx: Add functions to create HTX redirect message http_parse_errorloc() may now be used to create an HTTP 302 or 303 redirect message with a specific url passed as parameter. A parameter is used to know if it is a 302 or a 303 redirect. A status code is passed as parameter. It must be one of the supported HTTP error codes to be valid. Otherwise an error is returned. It aims to be used to parse "errorloc" directives. It relies on http_load_errormsg() to do most of the job, i.e. converting it into HTX.	2020-01-20 15:18:45 +01:00
Christopher Faulet	5031ef58ca	MINOR: http-htx: Add functions to read a raw error file and convert it in HTX http_parse_errorfile() may now be used to parse a raw HTTP message from a file. A status code is passed as parameter. It must be one of the supported HTTP error codes to be valid. Otherwise an error is returned. It aims to be used to parse "errorfile" directives. It relies on http_load_errorfile() to do most of the job, ie reading the file content and converting it in HTX.	2020-01-20 15:18:45 +01:00
Christopher Faulet	fdb6fbfa9a	BUG/MINOR: tcp-rules: Fix memory releases on error path during action parsing When an error occurred during the parsing of a TCP action, if some memory was allocated, it should be released before exiting. Here, the fix consists in replacing a call to free() on a sample expression with a call to release_sample_expr(). This patch may be backported to all supported versions.	2020-01-20 15:18:45 +01:00
Christopher Faulet	adfc6e8e14	MINOR: tcp-rules: Add release functions for existing TCP actions TCP actions allocating memory during configuration parsing now use dedicated functions to release it.	2020-01-20 15:18:45 +01:00
Christopher Faulet	1337b328d9	BUG/MINOR: http-rules: Fix memory releases on error path during action parsing When an error occurred during the parsing of an HTTP action, if some memory was allocated, it should be released before exiting. Sometimes a call to free() is used on a sample expression instead of a call to release_sample_expr(). Other times, it is just a string or a regex that should be released. There is no real reason to backport this patch, especially because this part was heavily modified recently in 2.2-DEV.	2020-01-20 15:18:45 +01:00
Christopher Faulet	2eb539687e	MINOR: http-rules: Add release functions for existing HTTP actions HTTP actions allocating memory during configuration parsing now use dedicated functions to release it.	2020-01-20 15:18:45 +01:00
Christopher Faulet	d73b96d48c	MINOR: tcp-rules: Make tcp-request capture a custom action Now, this action uses its own dedicated function and is no longer handled "in place" during the TCP rules evaluation. Thus the action name ACT_TCP_CAPTURE is removed. The action type is set to ACT_CUSTOM and a check function is used to know if the rule depends on request contents while there is no inspect-delay.	2020-01-20 15:18:45 +01:00
Christopher Faulet	ac98d81f46	MINOR: http-rule/tcp-rules: Make track-sc* custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the TCP/HTTP rules evaluation. Thus the action names ACT_ACTION_TRK_SC0 and ACT_ACTION_TRK_SCMAX are removed. The action type is now the tracking index. Thus the function trk_idx() is no longer needed.	2020-01-20 15:18:45 +01:00
Christopher Faulet	91b3ec13c6	MEDIUM: http-rules: Make early-hint custom actions Now, the early-hint action uses its own dedicated action and is no longer handled "in place" during the HTTP rules evaluation. Thus the action name ACT_HTTP_EARLY_HINT is removed. In addition, http_add_early_hint_header() and http_reply_103_early_hints() are also removed. This part is now handled in the new action_ptr callback function.	2020-01-20 15:18:45 +01:00
Christopher Faulet	5275aa7540	MINOR: http-rules: Group all processing of early-hint rule in its case clause To know if the 103 response start-line must be added, we test if it is the first rule of the ruleset or if the previous rule is not an early-hint rule. And at the end, to know if the 103 response must be terminated, we test if it is the last rule of the ruleset or if the next rule is not an early-hint rule. This way, all the code dealing with early-hint rules is grouped in its case clause.	2020-01-20 15:18:45 +01:00
Christopher Faulet	046cf44f6c	MINOR: http-rules: Make set/del-map and add/del-acl custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the HTTP rules evaluation. Thus the action names ACT_HTTP_*_ACL and ACT_HTTP_*_MAP are removed. The action type is now mapped as follows: 0 = add-acl, 1 = set-map, 2 = del-acl and 3 = del-map.	2020-01-20 15:18:45 +01:00
Christopher Faulet	d1f27e3394	MINOR: http-rules: Make set-header and add-header custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the HTTP rules evaluation. Thus the action names ACT_HTTP_SET_HDR and ACT_HTTP_ADD_VAL are removed. The action type is now set to 0 to set a header (so remove existing ones if any and add a new one) or to 1 to add a header (add without remove).	2020-01-20 15:18:45 +01:00
Christopher Faulet	92d34fe38d	MINOR: http-rules: Make replace-header and replace-value custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the HTTP rules evaluation. Thus the action names ACT_HTTP_REPLACE_HDR and ACT_HTTP_REPLACE_VAL are removed. The action type is now set to 0 to evaluate the whole header or to 1 to evaluate every comma-delimited values. The function http_transform_header_str() is renamed to http_replace_hdrs() to be more explicit and the function http_transform_header() is removed. In fact, this last one is now more or less the new action function. The lua code has been updated accordingly to use http_replace_hdrs().	2020-01-20 15:18:45 +01:00
Christopher Faulet	2c22a6923a	MINOR: http-rules: Use a specific action type for some custom HTTP actions For set-method, set-path, set-query and set-uri, a specific action type is used. It is the same as before but is no longer stored in <arg.http.i>. The same is done for replace-path and replace-uri, which use the same types as the "set-" versions.	2020-01-20 15:18:45 +01:00
Christopher Faulet	245cf795c1	MINOR: actions: Add flags to configure the action behaviour Some flags can now be set on an action when it is registered. The flags are defined in the act_flag enum. For now, only ACT_FLAG_FINAL may be set on an action to specify if it stops the rules evaluation. It is set on ACT_ACTION_ALLOW, ACT_ACTION_DENY, ACT_HTTP_REQ_TARPIT, ACT_HTTP_REQ_AUTH, ACT_HTTP_REDIR and ACT_TCP_CLOSE actions. But, when required, it may also be set on custom actions. Consequently, this flag is checked instead of the action type during the configuration parsing to trigger a warning when a rule inhibits all the following ones.	2020-01-20 15:18:45 +01:00
Christopher Faulet	105ba6cc54	MINOR: actions: Rename the act_flag enum into act_opt The flags in the act_flag enum have been renamed act_opt. It means ACT_OPT prefix is used instead of ACT_FLAG. The purpose of this patch is to reserve the action flags for the actions configuration.	2020-01-20 15:18:45 +01:00
Christopher Faulet	cd26e8a2ec	MINOR: http-rules/tcp-rules: Call the defined action function first if defined When TCP and HTTP rules are evaluated, if an action function (action_ptr field in the act_rule structure) is defined for a given action, it is now always called in priority over the test on the action type. Concretely, for now, only custom actions define it. Thus there is no change. It simply leaves us free to extend the action type beyond the existing ones in the enum.	2020-01-20 15:18:45 +01:00
Christopher Faulet	96bff76087	MINOR: actions: Regroup some info about HTTP rules in the same struct Info used by HTTP rules manipulating the message itself is split across several structures in the arg union. But it is possible to group all of them in a unique struct. Now, <arg.http> is used by most of these rules, which contains: * <arg.http.i> : an integer used as status code, nice/tos/mark/loglevel or action id. * <arg.http.str> : an IST used as header name, reason string or auth realm. * <arg.http.fmt> : a log-format compatible expression * <arg.http.re> : a regular expression used by replace rules	2020-01-20 15:18:45 +01:00
Christopher Faulet	58b3564fde	MINOR: actions: Add a function pointer to release args used by actions Arguments used by actions are never released during HAProxy deinit. Now, it is possible to specify a function to do so. ".release_ptr" field in the act_rule structure may be set during the configuration parsing to a specific deinit function depending on the action type.	2020-01-20 15:18:45 +01:00
Christopher Faulet	1aea50e1ff	MEDIUM: http-rules: Enable the strict rewriting mode by default Now, by default, when a rule performing a rewrite on an HTTP message fails, an internal error is triggered. Before, the failure was ignored. But most users are not aware of this behavior. And it does not happen very often because the buffer reserve space is large enough. So it may be surprising. Returning an internal error makes the rewrite failure explicit. If it is acceptable to silently ignore it, the strict rewriting mode can be disabled.	2020-01-20 15:18:45 +01:00
Christopher Faulet	46f95543c5	MINOR: http-rules: Add a rule to enable or disable the strict rewriting mode It is now possible to explicitly instruct rewriting rules to be strict or not towards errors. It means that in this mode, an internal error is triggered if a rewrite rule fails. The HTTP action "strict-mode" can be used to enable or disable the strict rewriting mode. It can be used in an http-request and an http-response ruleset. For now, by default the strict rewriting mode is disabled, because it is the current behavior. But it will be changed in another patch.	2020-01-20 15:18:45 +01:00
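A minimal sketch of the "strict-mode" action described above, assuming it takes `on` or `off` and applies to the rules that follow it in the same ruleset. The frontend name, `bind` line and header name are hypothetical.

```haproxy
frontend frt                          # hypothetical name
    bind *:80
    # Tolerate a failed rewrite for the next rule only (soft rewrite).
    http-request strict-mode off
    http-request set-header X-Client-UA %[req.hdr(User-Agent)]
    # Back to strict rewrites: a failed rewrite raises an internal error.
    http-request strict-mode on
```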
Christopher Faulet	e00d06c99f	MINOR: http-rules: Handle all message rewrites the same way In HTTP rules, error handling during a rewrite is now handle the same way for all rules. First, allocation errors are reported as internal errors. Then, if soft rewrites are allowed, rewrite errors are ignored and only the failed_rewrites counter is incremented. Otherwise, when strict rewrites are mandatory, interanl errors are returned. For now, only soft rewrites are supported. Note also that the warning sent to notify a rewrite failure was removed. It will be useless once the strict rewrites will be possible.	2020-01-20 15:18:45 +01:00
Christopher Faulet	a00071e2e5	MINOR: http-ana: Add a txn flag to support soft/strict message rewrites the HTTP_MSGF_SOFT_RW flag must now be set on the HTTP transaction to ignore rewrite errors on a message, from HTTP rules. The mode is called the soft rewrites. If thes flag is not set, strict rewrites are performed. In this mode, if a rewrite error occurred, an internal error is reported. For now, HTTP_MSGF_SOFT_RW is always set and there is no way to switch a transaction in strict mode.	2020-01-20 15:18:45 +01:00
Christopher Faulet	cff0f739e5	MINOR: counters: Review conditions to increment counters from analysers Now, for these counters, the following rules are followed to know if it must be incremented or not: * if it exists for a frontend, the counter is incremented * if stats must be collected for the session's listener, if the counter exists for this listener, it is incremented * if the backend is already assigned, if the counter exists for this backend, it is incremented * if a server is attached to the stream, if the counter exists for this server, it is incremented These are not hardcoded rules. Some counters are still handled in a different way. But many counters are incremented this way now.	2020-01-20 15:18:45 +01:00
Christopher Faulet	a08546bb5a	MINOR: counters: Remove failed_secu counter and use denied_resp instead The failed_secu counter is only used for the servers stats. It is used to report the number of denied responses. On proxies, the same info is stored in the denied_resp counter. So, it is more consistent to use the same field for servers.	2020-01-20 15:18:45 +01:00
Christopher Faulet	0159ee4032	MINOR: stats: Report internal errors in the proxies/listeners/servers stats The stats field ST_F_EINT has been added to report internal errors encountered per proxy, per listener and per server. It appears in the CLI export and on the HTML stats page.	2020-01-20 15:18:45 +01:00
Christopher Faulet	74f67af8d4	MINOR: http-rules: Handle denied/aborted/invalid connections from HTTP rules The new possible results for a custom action (deny/abort/invalid) are now handled during HTTP rules evaluation. These codes are mapped on HTTP rules ones : * ACT_RET_DENY => HTTP_RULE_RES_DENY * ACT_RET_ABRT => HTTP_RULE_RES_ABRT * ACT_RET_INV => HTTP_RULE_RES_BADREQ For now, no custom action uses these new codes.	2020-01-20 15:18:45 +01:00
Christopher Faulet	282992e25f	MINOR: tcp-rules: Handle denied/aborted/invalid connections from TCP rules The new possible results for a custom action (deny/abort/invalid) are now handled during TCP rules evaluation. For L4/L5 rules, the session is rejected. For L7 rules, the right counter is incremented, then the connection is killed. For now, no custom action uses these new codes.	2020-01-20 15:18:45 +01:00
Christopher Faulet	3a26beea18	MINOR: http-rules: Handle internal errors during HTTP rules evaluation The HTTP_RULE_RES_ERROR code is now used by HTTP analyzers to handle internal errors during HTTP rules evaluation. It is used instead of HTTP_RULE_RES_BADREQ, used for invalid requests/responses. In addition, the SF_ERR_RESOURCE flag is set on the stream when an allocation failure happens. Note that the return value of http-response rules evaluation is now tested in the same way as the result of http-request rules evaluation.	2020-01-20 15:18:45 +01:00
Christopher Faulet	b8a5371a32	MEDIUM: http-ana: Properly handle internal processing errors Now, processing errors are properly handled. Instead of returning an error 400 or 502, depending where the error happens, an error 500 is now returned. And the processing_errors counter is incremented. By default, when such error is detected, the SF_ERR_INTERNAL stream error is used. When the error is caused by an allocation failure, and when it is reasonably possible, the SF_ERR_RESOURCE stream error is used. Thanks to this patch, bad requests and bad responses should be easier to detect.	2020-01-20 15:18:45 +01:00
Christopher Faulet	28160e73dd	MINOR: http-rules: Return an error when custom actions return ACT_RET_ERR Thanks to the commit "MINOR: actions: Use ACT_RET_CONT code to ignore an error from a custom action", it is now possible to trigger an error from a custom action in http rules. Now, when a custom action returns the ACT_RET_ERR code from an http-request rule, an error 400 is returned. And from an http-response rule, an error 502 is returned. Be careful if this patch is backported. The other mentioned patch must be backported first.	2020-01-20 15:18:45 +01:00
Christopher Faulet	491ab5e2e5	MINOR: tcp-rules: Kill connections when custom actions return ACT_RET_ERR Thanks to the commit "MINOR: actions: Use ACT_RET_CONT code to ignore an error from a custom action", it is now possible to trigger an error from a custom action in tcp-content rules. Now, when a custom action returns the ACT_RET_ERR code, it has the same behavior as a reject rule: the connection is killed. Be careful if this patch is backported. The other mentioned patch must be backported first.	2020-01-20 15:18:45 +01:00
Christopher Faulet	13403761d5	MINOR: actions: Use ACT_RET_CONT code to ignore an error from a custom action Some custom actions are just ignored and skipped when an error is encountered. In that case, we jump to the next rule. To do so, most of them use the return code ACT_RET_ERR. Currently, for http rules and tcp content rules, it is not a problem because this code is handled the same way as ACT_RET_CONT. But, it means there is no way to handle the error as other actions. The custom actions must handle the error and return ACT_RET_DONE. For instance, when http-request rules are processed, an error when we try to replace a header value leads to a bad request and an error 400 is returned to the client. But when we fail to replace the URI, the error is silently ignored. This difference between the custom actions and the others is an obstacle to write new custom actions. So, in this first patch, ACT_RET_CONT is now returned from custom actions instead of ACT_RET_ERR when an error is encountered if it should be ignored. The behavior remains the same but it is now possible to handle true errors using the return code ACT_RET_ERR. Some actions will probably be reviewed to determine if an error is fatal or not. Other patches will be pushed to trigger an error when a custom action returns the ACT_RET_ERR code. This patch is not tagged as a bug because it is just a design issue. But others will depend on it. So be careful during backports, if any.	2020-01-20 15:18:45 +01:00
Christopher Faulet	cb9106b3e3	MINOR: tcp-rules: Always set from which ruleset a rule comes from The ruleset from which a TCP rule comes from (the <from> field in the act_rule structure) is only set when a rule is created from a registered keyword and not for all TCP rules. But this information may be useful to check the configuration validity or during the rule evaluation. So now, we systematically set it.	2020-01-20 15:18:45 +01:00
Christopher Faulet	81e20177df	MEDIUM: http-rules: Register an action keyword for all http rules There are many specific http actions that don't use the action registration mechanism (allow, deny, set-header...). Instead, the parsing of these actions is inlined in the functions responsible to parse the http-request/http-response rules. There is no reason to not register an action keyword for all these actions. It is the purpose of this patch. The new functions responsible to parse these http actions are defined in http_act.c	2020-01-20 15:18:45 +01:00
Christopher Faulet	28436e23d3	BUG/MINOR: stick-table: Use MAX_SESS_STKCTR as the max track ID during parsing During the parsing of the sc-inc-gpc0, sc-inc-gpc1 and sc-inc-gpt1 actions, the maximum stick table track ID allowed is tested against ACT_ACTION_TRK_SCMAX. It is the action number and not the maximum number of stick counters. Instead, MAX_SESS_STKCTR must be used. This patch must be backported to all stable versions.	2020-01-20 15:18:45 +01:00
Christopher Faulet	cb5501327c	BUG/MINOR: http-rules: Remove buggy deinit functions for HTTP rules Functions to deinitialize the HTTP rules are buggy. These functions do not check the action name to release the right part in the arg union. Only some of the info is released. For auth rules, the realm is released and there is no problem here. But the regex <arg.hdr_add.re> is always unconditionally released. So it is easy to make these functions crash. For instance, with the following rule HAProxy crashes during the deinit : http-request set-map(/path/to/map) %[src] %[req.hdr(X-Value)] For now, these functions are simply removed and we rely on the deinit function used for TCP rules (renamed as deinit_act_rules()). This patch fixes the bug. But arguments used by actions are not released at all, this part will be addressed later. This patch must be backported to all stable versions.	2020-01-20 15:18:45 +01:00
Christopher Faulet	1a3e0279c6	BUG/MINOR: http-ana/filters: Wait end of the http_end callback for all filters Filters may define the "http_end" callback, called at the end of the analysis of any HTTP messages. It is called at the end of the payload forwarding and it can interrupt the stream processing. So we must be sure to not remove the XFER_BODY analyzers while there is still at least one filter in progress on this callback. Unfortunately, once the request and the response are both in the DONE or the TUNNEL mode, we consider the XFER_BODY analyzer has finished its processing on both sides. So it is possible to prematurely interrupt the execution of the filters "http_end" callback. To fix this bug, we switch a message in the ENDING state. It is then switched in DONE/TUNNEL mode only after the execution of the filters "http_end" callback. This patch must be backported (and adapted) to 2.1, 2.0 and 1.9. The legacy HTTP mode should probably be fixed too.	2020-01-20 15:18:45 +01:00
Christopher Faulet	46230363af	MINOR: mux-h1: Inherit send flags from the upper layer Send flags (CO_SFL_*) used when xprt->snd_buf() is called, in h1_send(), are now inherited from the upper layer, when h1_snd_buf() is called. First, the flag CO_SFL_MSG_MORE is no longer set if the output buffer is full, but only if the stream-interface decides to set it. It has more info to do it than the mux. Then, the flag CO_SFL_STREAMER is now also handled this way. It was just ignored till now.	2020-01-20 15:18:45 +01:00
Christopher Faulet	8178e4006c	MINOR: http-htx: Make 'internal.htx_blk_data' return a binary string This internal sample fetch now returns a binary string (SMP_T_BIN) instead of a character string.	2020-01-20 15:18:45 +01:00
Christopher Faulet	c5db14c5d4	MINOR: http-htx: Rename 'internal.htx_blk.val' to 'internal.htx_blk.data' Use a more explicit name for this internal sample fetch.	2020-01-20 15:18:45 +01:00
Christopher Faulet	01f44456e6	MINOR: http-htx: Move htx sample fetches in the scope "internal" HTX sample fetches are now prefixed by "internal." to explicitly reserve their uses for debugging or testing purposes.	2020-01-20 15:18:45 +01:00
Ben51Degrees	6bf0672711	BUG/MINOR: 51d: Fix bug when HTX is enabled When HTX is enabled, the sample flags were set too early. When matching for multiple HTTP headers, the sample is fetched more than once, meaning that the flags would need to be set again. Instead, the flags are now set last (just before the outermost function returns). This could be further improved by passing around the message without calling prefetch again. This patch must be backported as far as 1.9. It should fix bug #450.	2020-01-20 14:01:52 +01:00
Tim Duesterhus	fcac33d0c1	BUG/MINOR: dns: Make dns_query_id_seed unsigned Left shifting of large signed values and negative values is undefined. In a test script clang's ubsan rightfully complains: > runtime error: left shift of 1934242336581872173 by 13 places cannot be represented in type 'int64_t' (aka 'long') This bug was introduced in the initial version of the DNS resolver in `325137d603`. The fix must be backported to HAProxy 1.6+.	2020-01-18 06:45:54 +01:00
Tim Duesterhus	d34b1ce5a2	BUG/MINOR: cache: Fix leak of cache name in error path This issue was introduced in commit `99a17a2d91` which first appeared in tag v1.9-dev11. This bugfix should be backported to HAProxy 1.9+.	2020-01-18 06:45:54 +01:00
Elliot Otchet	71f829767d	MINOR: ssl: Add support for returning the dn samples from ssl_(c\|f)_(i\|s)_dn in LDAP v3 (RFC2253) format. Modifies the existing sample extraction methods (smp_fetch_ssl_x_i_dn, smp_fetch_ssl_x_s_dn) to accommodate a third argument that indicates the DN should be returned in LDAP v3 format. When the third argument is present, the new function (ssl_sock_get_dn_formatted) is called with three parameters including the X509_NAME, a buffer containing the format argument, and a buffer for the output. If the supplied format matches the supported format string (currently only "rfc2253" is supported), the formatted value is extracted into the supplied output buffer using OpenSSL's X509_NAME_print_ex and BIO_s_mem. 1 is returned when a dn value is retrieved. 0 is returned when a value is not retrieved. Argument validation is added to each of the related sample configurations to ensure the third argument passed is either blank or "rfc2253" using strcmp. An error is returned if the third argument is present with any other value. Documentation was updated in configuration.txt and it was noted during preliminary reviews that a CLEANUP patch should follow that adjusts the documentation. Currently, this patch and the existing documentation are copied with some minor revisions for each sample configuration. It might be better to have one entry for all of the samples or entries for each that reference back to a primary entry that explains the sample in detail. Special thanks to Chris, Willy, Tim and Aleks for the feedback. Author: Elliot Otchet <degroens@yahoo.com> Reviewed-by: Tim Duesterhus <tim@bastelstu.be>	2020-01-18 06:42:30 +01:00
Willy Tarreau	ee1a6fc943	MINOR: connection: make the last arg of subscribe() a struct wait_event* The subscriber used to be passed as a "void param" that was systematically cast to a struct wait_event. By now it appears clear that the subscribe() call at every layer is well defined and always takes a pointer to an event subscriber of type wait_event, so let's enforce this in the functions' prototypes, remove the intermediary variables used to cast it and clean up the comments to clarify what all these functions do in their context.	2020-01-17 18:30:37 +01:00
Willy Tarreau	8907e4ddb8	MEDIUM: mux-fcgi: merge recv_wait and send_wait event notifications This is the last of the "recv_wait+send_wait merge" patches and is functionally equivalent to previous commit "MEDIUM: mux-h2: merge recv_wait and send_wait event notifications" but for FCGI this time. The principle is pretty much the same, since the code is very similar. We use a single wait_event for both recv and send and rely on the subscribe flags to know the desired notifications.	2020-01-17 18:30:37 +01:00
Willy Tarreau	f96508aae6	MEDIUM: mux-h2: merge recv_wait and send_wait event notifications This is the continuation of the recv+send event notifications merge that was started. This patch is less trivial than the previous ones because the existence of a send event subscription is also used to decide to put a stream back into the send list.	2020-01-17 18:30:36 +01:00
Willy Tarreau	1b0d4d19fc	MEDIUM: mux-h1: merge recv_wait and send_wait This is the same principle as previous commit, but for the H1 mux this time. The checks in the subscribe()/unsubscribe() calls were factored and some BUG_ON() were added to detect unexpected cases. h1_wake_for_recv() and h1_wake_for_send() needed to be refined to consider the current subscription before deciding to wake up.	2020-01-17 18:30:36 +01:00
Willy Tarreau	113d52bfb4	MEDIUM: ssl: merge recv_wait and send_wait in ssl_sock This is the same principle as previous commit, but for ssl_sock.	2020-01-17 18:30:36 +01:00
Willy Tarreau	ac6febd3ae	MEDIUM: xprt: merge recv_wait and send_wait in xprt_handshake This is the same principle as previous commit, but for xprt_handshake.	2020-01-17 18:30:36 +01:00
Willy Tarreau	7872d1fc15	MEDIUM: connection: merge the send_wait and recv_wait entries In practice all callers use the same wait_event notification for any I/O so instead of keeping specific code to handle them separately, let's merge them and it will allow us to create new events later.	2020-01-17 18:30:36 +01:00
Willy Tarreau	062df2c23a	MEDIUM: backend: move the connection finalization step to back_handle_st_con() Currently there's still lots of code in conn_complete_server() that performs one half of the connection setup, which is then checked and finalized in back_handle_st_con(). There isn't a valid reason for this anymore, we can simplify this and make sure that conn_complete_server() only wakes the stream up to inform it about the fact the whole connection stack is set up so that back_handle_st_con() finishes its job at the stream-int level. It looks like this could even be further simplified, but for now it was moved straight out of conn_complete_server() with no modification.	2020-01-17 18:30:36 +01:00
Willy Tarreau	3a9312af8f	REORG: stream/backend: move backend-specific stuff to backend.c For more than a decade we've kept all the sess_update_st_*() functions in stream.c while they're only there to work in relation with what is currently being done in backend.c (srv_redispatch_connect, connect_server, etc). Let's move all this pollution over there and take this opportunity to try to find slightly less confusing names for these old functions whose role is only to handle transitions from one specific stream-int state: sess_update_st_rdy_tcp() -> back_handle_st_rdy() sess_update_st_con_tcp() -> back_handle_st_con() sess_update_st_cer() -> back_handle_st_cer() sess_update_stream_int() -> back_try_conn_req() sess_prepare_conn_req() -> back_handle_st_req() sess_establish() -> back_establish() The last one remained in stream.c because it's more or less a completion function which does all the initialization expected on a connection success or failure, can set analysers and emit logs. The other ones could possibly slightly benefit from being modified to take a stream-int instead since it's really what they're working with, but it's unimportant here.	2020-01-17 18:30:36 +01:00
Willy Tarreau	7aad7039e4	MEDIUM: mux-fcgi: do not make an fstrm subscribe to itself on deferred shut This is the port to FCGI of previous commit "MEDIUM: mux-h2: do not make an h2s subscribe to itself on deferred shut". The purpose is to avoid subscribing to the send_wait list when trying to close, because we'll soon have to merge both recv and send lists. Basic testing showed no difference (performance nor issues).	2020-01-17 18:30:36 +01:00
Willy Tarreau	5723f295d8	MEDIUM: mux-h2: do not make an h2s subscribe to itself on deferred shut The logic handling the deferred shutdown is a bit complex because it involves a wait_event struct in each h2s dedicated to subscribing to itself when shutdowns are not immediately possible. This implies that we will not be able to support a shutdown and a receive subscription in the future when we merge all wait events. Let's solely rely on the H2_SF_WANT_SHUT_{R,W} flags instead and have an autonomous tasklet for this. This requires to add a few controls in the code because now when waking up a stream we need to check if it is for I/O or just a shut, but since sending and shutting are exclusive it's not difficult. One point worth noting is that further resources could be shaved off by only allocating the tasklet when failing to shut, given that in the vast majority of streams it will never be used. In fact the sole purpose of the tasklet is to support calling this code from outside the H2 mux context. Looking at the code, it seems that not too many adaptations would be required to have the send_list walking code deal with sending the shut bits itself and further simplify all this.	2020-01-17 18:30:36 +01:00
Willy Tarreau	f11be0ea1e	MEDIUM: mux-fcgi: do not try to stop sending streams on blocked mux This is essentially the same change as applied to mux-h2 in previous commit "MEDIUM: mux-h2: do not try to stop sending streams on blocked mux". The goal is to make sure we don't need to keep the item in the send_wait list until it's executed so that we can later merge it with the recv_wait list. No performance changes were observed.	2020-01-17 18:30:36 +01:00
Willy Tarreau	d9464167fa	MEDIUM: mux-h2: do not try to stop sending streams on blocked mux This partially reverts commit `d846c267` ("MINOR: h2: Don't run tasks that are waiting to send if mux in full"). This commit was introduced to limit the start/stop overhead incurred by waking many streams to let only a few work. But since commit `9c218e7521` ("MAJOR: mux-h2: switch to next mux buffer on buffer full condition."), this situation occurs way less (typically 2000 to 4000 times less often) and the benefits of the patch above do not outweigh its shortcomings anymore. And commit `c7ce4e3e7f` ("BUG/MEDIUM: mux-h2: don't stop sending when crossing a buffer boundary") addressed a root cause of many unexpected sleeps and wakeups. The main problem it's causing is that it requires to keep the element in the send_wait list until it's executed, leaving the entry in an uncertain state, and significantly complicating the coexistence of this list and the wait list dedicated to shutdown. Also it happens that this call to tasklet_remove_from_task_list() will not be usable anymore once we start to support streams on different threads. And finally, some of the other streams that we remove might very well have managed to find their way to the h2_snd_buf() with an unblocked condition as well so it is possible that some of these removals were not welcome. So this patch now makes sure that send_wait is immediately nulled when the task is woken up, and that we don't have to play with it afterwards. Since we don't need to stop the tasklets anymore, we don't need the sending_list that we can remove. However one very useful benefit of the sending_list was that it used to provide the information about the fact that the stream already tried to send and failed. This was an important factor to improve fairness because late arrived streams should not be allowed to send if others are already scheduled. So this patch introduces a new per-stream flag H2_SF_NOTIFIED to distinguish such streams. 
With this patch the fairness is preserved, and the ratio of aborted h2_snd_buf() due to other streams already sending remains quite low (~0.3-2.1% measured depending on object size, this is within expectations for 100 independent streams). If the contention issue the patch above used to address comes up again in the future, a much better (though more complicated) solution would be to switch to per-connection buffer pools to distribute between the connection and the streams so that by default there are more buffers available for the mux and the streams only have some when the mux's are unused, i.e. it would push the memory pressure back to the data layer. One observation made while developing this patch is that when dealing with large objects we still spend a huge amount of time scanning the send_list with tasks that are already woken up every time a send() manages to purge a bit more data. Indeed, by removing the elements from the list when H2_SF_NOTIFIED is set, the network bandwidth on 1 MB objects fetched over 100 streams per connection increases by 38%. This was not done here to preserve fairness but is worth studying (e.g. by keeping a restart pointer on the list or just having a flag indicating if an entry was added since last scan).	2020-01-17 18:30:36 +01:00
Jerome Magnin	b8bd6d7efd	BUILD: pattern: include errno.h Commit `3c79d4bdc` introduced the use of errno in pattern.c without including errno.h. If we build haproxy without any option errno is not defined and the build fails.	2020-01-17 18:30:06 +01:00
Willy Tarreau	3381bf89e3	MEDIUM: connection: get rid of CO_FL_CURR_* flags These ones used to serve as a set of switches between CO_FL_SOCK_* and CO_FL_XPRT_, and now that the SOCK layer is gone, they're always a copy of the last known CO_FL_XPRT_ ones that is resynchronized before I/O events by calling conn_refresh_polling_flags(), and that are pushed back to FDs when detecting changes with conn_xprt_polling_changes(). While these functions are not particularly heavy, what they do is totally redundant by now because the fd_want_/fd_stop_() actions already perform test-and-set operations to decide to create an entry or not, so they do the exact same thing that is done by conn_xprt_polling_changes(). As such it is pointless to call that one, and given that the only reason to keep CO_FL_CURR_* is to detect changes there, we can now remove them. Even if this does only save very few cycles, this removes a significant complexity that has been responsible for many bugs in the past, including the last one affecting FreeBSD. All tests look good, and no performance regressions were observed.	2020-01-17 17:45:12 +01:00
Willy Tarreau	93c9f59a9c	MINOR: stream-int: remove dependency on CO_FL_WAIT_ROOM for rcv_buf() The only case where this made sense was with mux_h1 but since we introduced CS_FL_MAY_SPLICE, we don't need to rely on this anymore, thus we don't need to clear it either when we do not splice. There is a last check on this flag used to determine if the rx channel is full and that cannot go away unless it's changed to use the CS instead but for now this wouldn't add any benefit so better not do it yet.	2020-01-17 17:24:30 +01:00
Willy Tarreau	e2a0eeca77	MINOR: connection: move the CO_FL_WAIT_ROOM cleanup to the reader only CO_FL_WAIT_ROOM is set by the splicing function in raw_sock, and cleared by the stream-int when splicing is disabled, as well as in conn_refresh_polling_flags() so that a new call to ->rcv_pipe() could be attempted by the I/O callbacks called from conn_fd_handler(). This clearing in conn_refresh_polling_flags() makes no sense anymore and is in no way related to the polling at all. Since we don't call them from there anymore it's better to clear it before attempting to receive, and to set it again later. So let's move this operation where it should be, in raw_sock_to_pipe() so that it's now symmetric. It was also placed in raw_sock_to_buf() so that we're certain that it gets cleared if an attempt to splice is replaced with a subsequent attempt to recv(). And these were currently already achieved by the call to conn_refresh_polling_flags(). Now it could theoretically be removed from the stream-int.	2020-01-17 17:19:27 +01:00
Jerome Magnin	3c79d4bdc4	BUG/MINOR: pattern: handle errors from fgets when trying to load patterns We need to do some error handling after we call fgets to make sure everything went fine. If we don't, users can be fooled into thinking they can load patterns from a directory because cfgparse doesn't flinch. This applies to acl patterns map files. This should be backported to all supported versions.	2020-01-17 17:09:50 +01:00
Willy Tarreau	17ccd1a356	BUG/MEDIUM: connection: add a mux flag to indicate splice usability Commit `c640ef1a7d` ("BUG/MINOR: stream-int: avoid calling rcv_buf() when splicing is still possible") fixed splicing in TCP and legacy mode but broke it badly in HTX mode. What happens in HTX mode is that the channel's to_forward value remains set to CHN_INFINITE_FORWARD during the whole transfer, and as such it is not a reliable signal anymore to indicate whether more data are expected or not. Thus, when data are spliced out of the mux using rcv_pipe(), even when the end is reached (that only the mux knows about), the call to rcv_buf() to get the final HTX blocks completing the message were skipped and there was often no new event to wake this up, resulting in transfer timeouts at the end of large objects. All this goes down to the fact that the channel has no more information about whether it can splice or not despite being the one having to take the decision to call rcv_pipe() or not. And we cannot afford to call rcv_buf() unconditionally because, as the commit above showed, this reduces the forwarding performance by 2 to 3 in TCP and legacy modes due to data lying in the buffer preventing splicing from being used later. The approach taken by this patch consists in offering the muxes the ability to report a bit more information to the upper layers via the conn_stream. This information could simply be to indicate that more data are awaited but the real need being to distinguish splicing and receiving, here instead we clearly report the mux's willingness to be called for splicing or not. Hence the flag's name, CS_FL_MAY_SPLICE. The mux sets this flag when it knows that its buffer is empty and that data waiting past what is currently known may be spliced, and clears it when it knows there's no more data or that the caller must fall back to rcv_buf() instead. 
The stream-int code now uses this to determine if splicing may be used or not instead of looking at the rcv_pipe() callbacks through the whole chain. And after the rcv_pipe() call, it checks the flag again to decide whether it may safely skip rcv_buf() or not. All this bitfield dance remains a bit complex and it starts to appear obvious that splicing vs reading should be a decision of the mux based on permission granted by the data layer. This would however increase the API's complexity but definitely needs to be thought about, and should even significantly simplify the data processing layer. The way it was integrated in mux-h1 will also result in no more calls to rcv_pipe() on chunked encoded data, since these ones are currently disabled at the mux level. However once the issue with chunks+splice is fixed, it will be important to explicitly check for curr_len\|CHNK to set MAY_SPLICE, so that we don't call rcv_buf() after each chunk. This fix must be backported to 2.1 and 2.0.	2020-01-17 17:00:12 +01:00
Jerome Magnin	bee00ad080	BUG/MINOR: stream: don't mistake match rules for store-request rules In process_sticking_rules() we only want to apply the first store-request rule for a given table, but when doing so we need to make sure we only count actual store-request rules when we list the sticking rules. Failure to do so leads to not being able to write store-request and match sticking rules in any order as a match rule after a store-request rule will be ignored. The following configuration reproduces the issue: global stats socket /tmp/foobar defaults mode http frontend in bind :8080 default_backend bar backend bar server s1 127.0.0.1:21212 server s2 127.0.0.1:21211 stick store-request req.hdr(foo) stick match req.hdr(foo) stick-table type string size 10 listen foo bind :21212 bind *:21211 http-request deny deny_status 200 if { dst_port 21212 } http-request deny This patch fixes issue #448 and should be backported as far as 1.6.	2020-01-16 18:07:52 +01:00
William Lallemand	d308c5e0ce	CLEANUP: cli: deduplicate the code in _getsocks Since the fix `5fd3b28` ("BUG/MEDIUM: cli: _getsocks must send the peers sockets") for bug #443, the code which sends the sockets for the peers and the proxies is duplicated. This patch moves this code into a separate function.	2020-01-16 16:26:41 +01:00
William Lallemand	5fd3b28c9c	BUG/MEDIUM: cli: _getsocks must send the peers sockets This bug prevents reloading HAProxy when you have both the seamless reload (-x / expose-fd listeners) and the peers. Indeed the _getsocks command does not send the FDs of the peers listeners, so if no reuseport is possible during the bind, the new process will fail to bind and exit. With this feature, it is not possible to fall back on the SIGTTOU method if we didn't receive all the sockets, because you can't close() the sockets of the new process without closing those of the previous process, they are the same. Should fix bug #443. Must be backported as far as 1.8.	2020-01-16 16:25:22 +01:00
Willy Tarreau	340b07e868	BUG/MAJOR: hashes: fix the signedness of the hash inputs Wietse Venema reported in the thread below that we have a signedness issue with our hashes implementations: due to the use of const char* for the input key that's often text, the crc32, sdbm, djb2, and wt6 algorithms return a platform-dependent value for binary input keys containing bytes with bit 7 set. This means that an ARM or PPC platform will hash binary inputs differently from an x86 typically. Worse, some algorithms are well defined in the industry (like CRC32) and do not provide the expected result on x86, possibly causing interoperability issues (e.g. a user-agent would fail to compare the CRC32 of a message body against the one computed by haproxy). Fortunately, and contrary to the first impression, the CRC32c variant used in the PROXY protocol processing is not affected. Thus the impact remains very limited (the vast majority of input keys are text-based, such as user-agent headers for example). This patch addresses the issue by fixing all hash functions' prototypes (even those not affected, for API consistency). A reg test will follow in another patch. The vast majority of users do not use these hashes. And among those using them, very few will pass them on binary inputs. However, for the rare ones doing it, this fix MAY have an impact during the upgrade. For example if the package is upgraded on one LB then on another one, and the CRC32 of a binary input is used as a stick table key (why?) then these CRCs will not match between both nodes. Similarly, if "hash-type ... crc32" is used, LB inconsistency may appear during the transition. For this reason it is preferable to apply the patch on all nodes using such hashes at the same time. Systems upgraded via their distros will likely observe the least impact since they're expected to be upgraded within a short time frame. 
And it is important for distros NOT to skip this fix, in order to avoid distributing an incompatible implementation of a hash. This is the reason why this patch is tagged as MAJOR, even though it's extremely unlikely that anyone will ever notice a change at all. This patch must be backported to all supported branches since the hashes were introduced in 1.5-dev20 (commit `98634f0c`). Some parts may be dropped since implemented later. Link to Wietse's report: https://marc.info/?l=postfix-users&m=157879464518535&w=2	2020-01-16 08:23:42 +01:00
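The signedness issue described in the commit above can be illustrated with a minimal sketch. It uses a textbook djb2 (hash * 33 + byte), not HAProxy's actual implementation, and the function names `djb2_signed` and `djb2_unsigned` are hypothetical. The signed variant mimics what a `const char *` key does on platforms where plain `char` is signed (typically x86); the unsigned variant mimics platforms where it is unsigned (typically ARM or PPC), which is also the behaviour a fixed prototype would give everywhere.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Textbook djb2 reading the key as SIGNED bytes. Bytes with bit 7 set
 * are sign-extended before being added to the hash.
 */
static uint32_t djb2_signed(const void *key, size_t len)
{
	const signed char *p = key;
	uint32_t hash = 5381;

	assert(key != NULL || len == 0);
	while (len--)
		hash = hash * 33 + (uint32_t)(int32_t)*p++;
	return hash;
}

/* Same algorithm reading the key as UNSIGNED bytes, so the result no
 * longer depends on the platform's choice for plain char.
 */
static uint32_t djb2_unsigned(const void *key, size_t len)
{
	const unsigned char *p = key;
	uint32_t hash = 5381;

	assert(key != NULL || len == 0);
	while (len--)
		hash = hash * 33 + *p++;
	return hash;
}
```

With a pure ASCII key such as "abc" both variants agree, which matches the commit's remark that text keys are unaffected. With a key containing bytes such as 0x80 or 0xff the two results diverge.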
William Dauchy	bb9da0b8e2	CLEANUP: proxy: simplify proxy_parse_rate_limit proxy checks rate-limits are valid for both frontend and listen, but not backend; so we can simplify this check in a similar manner as it is done in e.g max-keep-alive-queue. this should fix github issue #449 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-16 07:04:05 +01:00
Olivier Houchard	ac8147446c	BUG/MEDIUM: raw_sock: Make sur the fd and conn are sync. Commit `08fa16e397` made sure we let the fd layer know we didn't want to poll anymore if we failed to send and sendto() returned EAGAIN. However, just disabling the polling with fd_stop_send() while not notifying the connection layer means the connection layer may believe the polling is activated and nothing needs to be done when it is wrong. A better fix is to revamp that whole code; for the time being, just make sure the fd and connection layer are properly synchronised. This should fix the problem recently reported on FreeBSD.	2020-01-15 19:16:23 +01:00
Olivier Houchard	68787ef70a	BUG/MEDIUM: mux_h1: Don't call h1_send if we subscribed(). In h1_snd_buf(), only attempt to call h1_send() if we haven't already subscribed. It makes no sense to do it if we subscribed, as we know we failed to send before, and will create a useless call to sendto(), and in 2.2, the call to raw_sock_from_buf() will disable polling if it is enabled. This should be backported to 2.2, 2.1, 2.0 and 1.9.	2020-01-15 19:13:32 +01:00
William Dauchy	8602394040	CLEANUP: compression: remove unused deinit_comp_ctx section Since commit `27d93c3f94` ("BUG/MAJOR: compression/cache: Make it really works with these both filters"), we no longer use section deinit_comp_ctx. This should fix github issue #441 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-15 10:58:17 +01:00
William Lallemand	24c928c8bd	BUG/MEDIUM: mworker: remain in mworker mode during reload If you reload an haproxy started in master-worker mode with "master-worker" in the configuration, and no "-W" argument, the new process lost the fact that it was in master-worker mode, resulting in weird behaviors. The biggest problem is that if it is reloaded with a bad configuration, the master will exit instead of remaining in waitpid mode. This problem was discovered in bug #443. Should be backported in every version using the master-worker mode. (as far as 1.8)	2020-01-14 18:10:29 +01:00
William Lallemand	a31b09e982	BUG/MINOR: cli/mworker: can't start haproxy with 2 programs When trying to start HAProxy with the master CLI and more than one program in the configuration, it refuses to start with: [ALERT] 013/132926 (1378) : parsing [cur--1:0] : proxy 'MASTER', another server named 'cur--1' was already defined at line 0, please use distinct names. [ALERT] 013/132926 (1378) : Fatal errors found in configuration. The problem is that haproxy tries to create a server for the MASTER proxy but only the worker are supposed to be in the server list. Fix issue #446. Must be backported as far as 2.0.	2020-01-14 15:42:38 +01:00
Willy Tarreau	c7ce4e3e7f	BUG/MEDIUM: mux-h2: don't stop sending when crossing a buffer boundary In version 2.0, after commit `9c218e7521` ("MAJOR: mux-h2: switch to next mux buffer on buffer full condition."), the H2 mux started to use a ring buffer for the output data in order to reduce competition between streams. However, one corner case was suboptimally covered: when crossing a buffer boundary, we have to shrink the outgoing frame size to the one left in the output buffer, but this shorter size is later used as a signal of incomplete send due to a buffer full condition (which used to be true when using a single buffer). As a result, function h2s_frt_make_resp_data() used to return less than requested, which in turn would cause h2_snd_buf() to stop sending and leave some unsent data in the buffer, and si_cs_send() to subscribe for sending more later. But it goes a bit further than this, because subscribing to send again causes the mux's send_list not to be empty anymore, hence extra streams can be denied the access to the mux till the first stream is woken again. This causes a nasty wakeup-sleep dance between streams that makes it totally impractical to try to remove the sending list. A test showed that it was possible to observe 3 million h2_snd_buf() giveups for only 100k requests when using 100 concurrent streams on 20kB objects. It doesn't seem likely that a stream could get blocked and time out due to this bug, though it's not possible either to demonstrate the opposite. One risk is that incompletely sent streams do not have any blocking flags so they may not be identified as blocked. However on first scan of the send_list they meet all conditions for a wakeup. This patch simply allows to continue on a new frame after a partial frame. With only this change, the number of failed h2_snd_buf() was divided by 800 (4% of calls). And by slightly increasing the H2C_MBUF_CNT size, it can go down to zero. This fix must be backported to 2.1 and 2.0.	2020-01-14 13:55:04 +01:00
Willy Tarreau	f31af9367e	MEDIUM: lua: don't call the GC as often when dealing with outgoing connections In order to properly close connections established from Lua in case a Lua context dies, the context currently automatically gets a flag HLUA_MUST_GC set whenever an outgoing connection is used. This causes the GC to be enforced on the context's death as well as on yield. First, it does not appear necessary to do it when yielding, since if the connections die they are already cleaned up. Second, the problem with the flag is that even if a connection gets properly closed, the flag is not removed and the GC continues to be called on the Lua context. The impact on performance looks quite significant, as noticed and diagnosed by Sadasiva Gujjarlapudi in the following thread: https://www.mail-archive.com/haproxy@formilux.org/msg35810.html This patch changes the flag for a counter so that each created connection increments it and each cleanly closed connection decrements it. That way we know we have to call the GC on the context's death only if the count is non-null. As reported in the thread above, the Lua performance gain is now over 20% by doing this. Thanks to Sada and Thierry for the design discussion and tests that led to this solution.	2020-01-14 10:12:31 +01:00
William Dauchy	9a8ef7f51d	CLEANUP: ssl: remove opendir call in ssl_sock_load_cert Since commit `3180f7b554` ("MINOR: ssl: load certificates in alphabetical order"), `readdir` was replaced by `scandir`. We can indeed replace it with a check on the previous `stat` call. This micro cleanup can be a good benefit when you have hundreds of bind lines which open TLS certificates directories in terms of syscall, especially in a case of frequent reloads. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-13 19:51:52 +01:00
Willy Tarreau	70c5b0e5fd	BUG/MEDIUM: mux-h2: fix missing test on sending_list in previous patch Previous commit `989539b048` ("BUG/MINOR: mux-h2: use a safe list_for_each_entry in h2_send()") accidentally lost its sending_list test, resulting in some elements being woken up again while already in the sending_list and h2_unsubscribe() crashing on integrity tests (only when built with DEBUG_DEV). If the fix above is backported this one must be as well.	2020-01-10 18:20:15 +01:00
Willy Tarreau	989539b048	BUG/MINOR: mux-h2: use a safe list_for_each_entry in h2_send() h2_send() uses list_for_each_entry() to scan paused streams and resume them, but happily deletes any leftover from a previous failed unsubscribe, which is obviously not safe and would corrupt the list. In practice this is a proof that this doesn't happen, but it's not the best way to prove it. In order to fix this and reduce the maintenance burden caused by code duplication (this list walk exists at 3 places), let's introduce a new function h2_resume_each_sending_h2s() doing exactly this and use it at all 3 places. This bug was introduced as a side effect of fix `998410a41b` ("BUG/MEDIUM: h2: Revamp the way send subscriptions works.") so it should be backported as far as 1.9.	2020-01-10 17:18:32 +01:00
Christopher Faulet	48726b78e5	BUG/MINOR: stream-int: Don't trigger L7 retry if max retries is already reached When an HTTP response is received, at the stream-interface level, if a L7 retry must be triggered because of the status code, the response is trashed and a read error is reported on the response channel. Then the stream handles this error and perform the retry. Except if the maximum connection retries is reached. In this case, an error is reported. Because the server response was already trashed by the stream-interface, a generic 502 error is returned to the client instead of the server's one. Now, the stream-interface triggers a L7 retry only if the maximum connection retries is not already reached. Thus, at the end, the last server's response is returned. This patch must be backported to 2.1 and 2.0. It should fix the issue #439.	2020-01-09 15:39:06 +01:00
William Dauchy	7675c720f8	CLEANUP: server: remove unused err section in server_finalize_init Since commit `980855bd95` ("BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end"), we no longer use err section. This should fix github issue #438 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-09 05:54:48 +01:00
Willy Tarreau	eeea8082a8	BUG/MAJOR: listener: do not schedule a task-less proxy The apparently harmless commit `0591bf7deb` ("MINOR: listener: make the wait paths cleaner and more reliable") caused a nasty regression and revealed a rare race that hits regtest stickiness/lb-services.vtc about 4% of the time for 8 threads. The problem is that when a multi-threaded listener wakes up on an incoming connection, several threads can receive the event, especially when idle. And all of them will race to accept the connections in parallel, adjusting the listener's nbconn and proxy's feconn until one reaches the proxy's limit and declines. At this step the changes are cancelled, the listener is marked "limited", and when the threads exit the function, one of them will unlimit the listener/proxy again so that it can accept incoming connections again. The problem happens when many threads connect to a small peers section because its maxconn is very limited (typically 6 for 2 peers), and it's sometimes possible for enough competing threads to hit the limit and one of them will limit the listener and queue the proxy's task... except that peers do not initialize their proxy task since they do not use rate limiting. Thus the process crashes when doing task_schedule(p->task). Prior to the cleanup patch above, this didn't happen because the error path that was dedicated to only limiting the listener did not call task_schedule(p->task). Given that the proxy's task is optional, and that the expire value passed there is always TICK_ETERNITY, it's sufficient and reasonable to avoid calling this task_schedule() when expire is not set. And for long term safety we can also avoid doing it when the task is not set. A first fix consisted in allocating a task for the peers proxies but it's never used and would eat resources for no reason. No backport is needed as this commit was only merged into 2.2.	2020-01-08 19:39:09 +01:00
William Dauchy	cd7fa3dcfc	CLEANUP: mux-h2: remove unused goto "out_free_h2s" Since commit `fa8aa867b9` ("MEDIUM: connections: Change struct wait_list to wait_event.") we no longer use this section. this should fix github issue #437 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-08 16:16:19 +01:00
Florian Tham	9205fea13a	MINOR: http: Add 404 to http-request deny This patch adds http status code 404 Not Found to http-request deny. See issue #80.	2020-01-08 16:15:23 +01:00
Florian Tham	272e29b5cc	MINOR: http: Add 410 to http-request deny This patch adds http status code 410 Gone to http-request deny. See issue #80.	2020-01-08 16:15:23 +01:00
Willy Tarreau	08fa16e397	MINOR: raw_sock: make sure to disable polling once everything is sent Analysing traces revealed a rare but surprising pattern : connect() = -1 EAGAIN send() = success epoll_ctl(ADD, EPOLLOUT) epoll_wait() recvfrom() = success close() What happens is that the failed connect() creates an FD update for pollout, but the successful synchronous send() doesn't disable it because polling was only disabled in the FD handler. But a successful synchronous connect() cancellation is a good opportunity to disable polling before it's effectively enabled in the next loop, so better disable it when reaching the end. The cost is very low if it was already disabled anyway (one atomic op). This only affects local connections but with this the typical number of epoll_ctl() calls per connection dropped from ~4.2 to ~3.8 for plain TCP and 10k transfers.	2020-01-08 09:59:40 +01:00
Willy Tarreau	0eae6323bf	MEDIUM: dns: implement synchronous send In dns_send_query(), there's no point in first waking up the FD, to get called back by the poller to send the request and sleep. Instead let's simply send the request as soon as it's known and only subscribe to the poller when the socket buffers are full and it's required to poll (i.e. almost never). This significantly reduces the number of calls to the poller. A large config sees the number of epoll_ctl() calls reduced from 577 to 7 over 10 seconds, the number of recvfrom() from 1533 to 582 and the number of sendto() from 369 to 162. It also has the extra benefit of building each request only once per resolution and sending it to multiple resolvers instead of rebuilding it for each and every resolver. This will reduce the risk of seeing situations similar to bug #416 in the future.	2020-01-08 06:10:38 +01:00
Willy Tarreau	e5891ca6c1	BUG/MEDIUM: session: do not report a failure when rejecting a session In session_accept_fd() we can perform a synchronous call to conn_complete_session() and if it succeeds the connection is accepted and turned into a session. If it fails we take it as an error while it is not, in this case, it's just that a tcp-request rule has decided to reject the incoming connection. The problem with reporting such an event as an error is that the failed status is passed down to the listener code which decides to disable accept() for 100ms in order to leave some time for transient issues to vanish, and that's not what we want to do here. This fix must be backported as far as 1.7. In 1.7 the code is a bit different as tcp_exec_l5_rules() is called directly from within session_new_fd() and ret=0 must be assigned there.	2020-01-07 18:15:32 +01:00
Christopher Faulet	584348be63	BUG/MINOR: channel: inject output data at the end of output In co_inject(), data must be inserted at the end of output, not the end of input. For the record, this function does not take care of input data, which are not supposed to exist. But the caller may reset input data after or before the call. It is its own choice. This bug, among other effects, is visible when a redirect is performed on the response path, on legacy HTTP mode (so for HAProxy < 2.1). The redirect response is appended after the server response when it should overwrite it. Thanks to Kevin Zhu <ip0tcp@gmail.com> for reporting the bug. It must be backported as far as 1.9.	2020-01-07 10:51:15 +01:00
Kevin Zhu	96b363963f	BUG/MEDIUM: http-ana: Truncate the response when a redirect rule is applied When a redirect rule is executed on the response path, we must truncate the received response. Otherwise, the redirect is appended after the response, which is sent to the client. So it is obviously a bug because the redirect is not performed. With bodyless responses, it is the "only" bug. But if the response has a body, the result may be invalid. If the payload is not fully received yet when the redirect is performed, an internal error is reported. It must be backported as far as 1.9.	2020-01-07 10:50:28 +01:00
Christopher Faulet	47a7210b9d	BUG/MINOR: proxy: Fix input data copy when an error is captured In proxy_capture_error(), input data are copied in the error snapshot. The copy must take care of the data wrapping. But the length of the first block is wrong. It should be the amount of contiguous input data that can be copied starting from the input's beginning. But the minimum between the input length and the buffer size minus the input length is used instead. So it is a problem if input data are wrapping or if more than half of the buffer is used by input data. This patch must be backported as far as 1.9.	2020-01-06 13:58:30 +01:00
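The first-block computation described in the commit above can be sketched as follows. This is a generic ring-buffer copy under stated assumptions, not the code of proxy_capture_error(): the helper names `ring_first_block` and `ring_copy_out`, and the representation of the buffer as an area, a size and a head offset, are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Length of the first contiguous block: what lies between the head
 * offset and the end of the storage area, capped by the data length.
 */
size_t ring_first_block(size_t size, size_t head, size_t len)
{
	size_t to_end = size - head;

	return len < to_end ? len : to_end;
}

/* Copy <len> bytes of possibly wrapping data that start at offset
 * <head> of a ring of <size> bytes into <dst>. Returns <dst>.
 */
char *ring_copy_out(char *dst, const char *area, size_t size,
                    size_t head, size_t len)
{
	size_t first;

	assert(size > 0 && head < size && len <= size);
	first = ring_first_block(size, head, len);
	memcpy(dst, area + head, first);        /* up to the end of the area */
	memcpy(dst + first, area, len - first); /* wrapped remainder, if any */
	return dst;
}
```

For an 8-byte area with 4 bytes of data starting at offset 6, the first block is 2 bytes long. The formula the commit describes as wrong, the minimum of the input length and the buffer size minus the input length, would give 4 here and read past the end of the area.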
Christopher Faulet	1703478e2d	BUG/MINOR: h1: Report the right error position when a header value is invalid During H1 messages parsing, when the parser has finished to parse a full header line, some tests are performed on its value, depending on its name, to be sure it is valid. The content-length is checked and converted in integer and the host header is also checked. If an error occurred during this step, the error position must point on the header value. But from the parser point of view, we are already on the start of the next header. Thus the effective reported position in the error capture is the beginning of the unparsed header line. It is a bit confusing when we try to figure out why a message is rejected. Now, the parser state is updated to point on the invalid value. This way, the error position really points on the right position. This patch must be backported as far as 1.9.	2020-01-06 13:58:21 +01:00
Olivier Houchard	7f4f7f140f	MINOR: ssl: Remove unused variable "need_out". The "need_out" variable was used to let the ssl code know we're done reading early data, and we should start the handshake. Now that the handshake function is responsible for taking care of reading early data, all that logic has been removed from ssl_sock_to_buf(), but need_out was forgotten, and left. Remove it now. This patch was submitted by William Dauchy <w.dauchy@criteo.com>, and should fix github issue #434. This should be backported to 2.0 and 2.1.	2020-01-05 16:45:14 +01:00
William Dauchy	3894d97fb8	MINOR: config: disable busy polling on old processes In the context of seamless reload and busy polling, older processes will create unnecessary CPU conflicts; we can assume there is no need for busy polling for old processes which are waiting to be terminated. This patch is not a bug fix itself but might be a good stability improvement when you are in the context of frequent seamless reloads with a high "hard-stop-after" value; for that reason I think this patch should be backported in all 2.x versions. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-01-02 10:29:49 +01:00
Olivier Houchard	140237471e	BUG/MEDIUM: connections: Hold the lock when wanting to kill a connection. In connect_server(), when we decide we want to kill the connection of another thread because there are too many idle connections, hold the toremove_lock of the corresponding thread, otherwise there's a small race condition where we could try to add the connection to the toremove_connections list while it has already been freed. This should be backported to 2.0 and 2.1.	2019-12-30 18:18:28 +01:00
Olivier Houchard	37d7897aaf	BUG/MEDIUM: checks: Only attempt to do handshakes if the connection is ready. When creating a new check connection, only attempt to add a handshake connection if the connection has fully been initialized. It may not be the case if a DNS resolution is still pending, and thus we don't yet have the address for the server, as the handshake code assumes the connection is fully initialized and would otherwise crash. This is not ideal, the check probably shouldn't run until we have an address, as it leads to check failures with "Socket error". While I'm there, also add an xprt handshake if we're using socks4, otherwise checks wouldn't be able to use socks4 properly. This should fix github issue #430. This should be backported to 2.0 and 2.1.	2019-12-30 15:18:16 +01:00
Willy Tarreau	5d7dcc2a8e	OPTIM: epoll: always poll for recv if neither active nor ready The cost of enabling polling in one direction with epoll is very high because it requires one syscall per FD and per direction change. In addition we don't know about input readiness until we either try to receive() or enable polling and watch the result. With HTTP keep-alive, both are equally expensive as it's very uncommon to see the server instantly respond (unless it's a second stage of the same process on localhost, which has become much less common with threads). But when a connection is established it's also quite usual to have to poll for sending (except on localhost or UNIX sockets where it almost always instantly works). So this cost of polling could be factored out with the second step if both were enabled together. This is the idea behind this patch. What it does is to always enable polling for Rx if it's not ready and at least one direction is active. This means that if it's not explicitly disabled, or if it was but in a state that causes the loss of the information (rx ready cannot be guessed), then let's take any opportunity for a polling change to enable it at the same time, and learn about rx readiness for free. In addition the FD never gets unregistered for Rx unless it's ready and was blocked (buffer full). This avoids a lot of the flip-flop behaviour at beginning and end of requests. 
On a test with 10k requests in keep-alive, the difference is quite noticeable:

Before:
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 83.67    0.010847           0     20078           epoll_ctl
 16.33    0.002117           0      2231           epoll_wait
  0.00    0.000000           0        20        20 connect
------ ----------- ----------- --------- --------- ----------------
100.00    0.012964                 22329        20 total

After:
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 96.35    0.003351           1      2644           epoll_wait
  2.36    0.000082           4        20        20 connect
  1.29    0.000045           0        66           epoll_ctl
------ ----------- ----------- --------- --------- ----------------
100.00    0.003478                  2730        20 total

It may also save a recvfrom() after connect() by changing the following sequence, effectively saving one epoll_ctl() and one recvfrom():

           before            |           after
-----------------------------+----------------------------
 - connect()                 | - connect()
 - epoll_ctl(add,out)        | - epoll_ctl(add, in|out)
 - sendto()                  | - epoll_wait() = out
 - epoll_ctl(mod,in|out)     | - send()
 - epoll_wait() = out        | - epoll_wait() = in|out
 - recvfrom() = EAGAIN       | - recvfrom() = OK
 - epoll_ctl(mod,in)         | - recvfrom() = EAGAIN
 - epoll_wait() = in         | - epoll_ctl(mod, in)
 - recvfrom() = OK           | - epoll_wait()
 - recvfrom() = EAGAIN       |
 - epoll_wait()              |
   (...)

Now on a 10M req test on 16 threads with 2k concurrent conns and 415kreq/s, we see 190k updates total and 14k epoll_ctl() only.	2019-12-27 16:38:47 +01:00
Willy Tarreau	0fbc318e24	CLEANUP: connection: merge CO_FL_NOTIFY_DATA and CO_FL_NOTIFY_DONE Both flags became equal in commit `82967bf9` ("MINOR: connection: adjust CO_FL_NOTIFY_DATA after removal of flags"), which already predicted the overlap between xprt_done_cb() and wake() after the removal of the DATA specific flags in 1.8. Let's simply remove CO_FL_NOTIFY_DATA since the "_DONE" version already covers everything and explains the intent well enough.	2019-12-27 16:38:47 +01:00
Willy Tarreau	cbcf77edb7	MINOR: connection: remove the double test on xprt_done_cb() The conn_fd_handler used to have one possible call to this function to notify about end of handshakes, and another one to notify about connection setup or error. But given that we're now only performing wakeup calls after connection validation, we don't need to keep two places to run this test since the conditions do not change in between. This patch merges the two tests into a single one and moves the CO_FL_CONNECTED test appropriately as well so that it's called even on the error path if needed.	2019-12-27 16:38:47 +01:00
Willy Tarreau	b2a7ab08a8	MINOR: connection: check for connection validation earlier In conn_fd_handler() we used to first give a chance to the send() callback to try to send data and validate the connection at the same time. But since 1.9 we do not call this callback anymore inline, it's scheduled. So let's validate the connection earlier so that all other decisions can be taken based on this confirmation. This may notably be useful to the xprt_done_cb() to know that the connection was properly validated.	2019-12-27 16:38:47 +01:00
Willy Tarreau	4970e5adb7	REORG: connection: move tcp_connect_probe() to conn_fd_check() The function is not TCP-specific at all, it covers all FD-based sockets so let's move this where other similar functions are, in connection.c, and rename it conn_fd_check().	2019-12-27 16:38:43 +01:00
Willy Tarreau	7deff246ce	MEDIUM: tcp: make tcp_connect_probe() consider ERR/HUP Now that we know what pollers can return ERR/HUP, we can take this into account to save one syscall: with such a poller, if neither are reported, then we know the connection succeeded and we don't need to go with getsockopt() nor connect() to validate this. In addition, for the remaining cases (select() or suspected errors), we'll always go through the extra connect() attempt and enumerate possible "in progress", "connected" or "failed" status codes and take action solely based on this. This results in one saved syscall on modern pollers, only a second connect() still being used on select() and the server's address never being needed anymore. Note that we cannot safely replace connect() with getsockopt() as the latter clears the error on the socket without saving it, and health checks rely on it for their reporting. This would be OK if the error was saved in the connection itself.	2019-12-27 16:38:04 +01:00
Willy Tarreau	11ef0837af	MINOR: pollers: add a new flag to indicate pollers reporting ERR & HUP In practice it's all pollers except select(). It turns out that we're keeping some legacy code only for select and enforcing it on all pollers, let's offer the pollers the ability to declare that they do not need that.	2019-12-27 14:04:33 +01:00
Willy Tarreau	8081abe26a	CLEANUP: connection: conn->xprt is never NULL Let's remove this outdated test that's been there since 1.5. For quite some time now xprt hasn't been NULL anymore on an initialized connection.	2019-12-27 14:04:33 +01:00
Willy Tarreau	70ccb2cddf	BUG/MINOR: connection: only wake send/recv callbacks if the FD is active Since commit `c3df4507fa` ("MEDIUM: connections: Wake the upper layer even if sending/receiving is disabled.") the send/recv callbacks are called on I/O if the FD is ready and not just if it's active. This means that in some situations (e.g. send ready but nothing to send) we may needlessly enter the if() block, notice we're not subscribed, set io_available=1 and call the wake() callback even if we're just called for read activity. Better make sure we only do this when the FD is active in that direction.. This may be backported as far as 2.0 though it should remain under observation for a few weeks first as the risk of harm by a mistake is higher than the trouble it should cause.	2019-12-27 14:04:33 +01:00
Willy Tarreau	c8dc20a825	BUG/MINOR: checks: refine which errno values are really errors. Two regtests regularly fail in a random fashion depending on the machine's load (one could really wonder if it's really worth keeping such unreproducible tests) : - tcp-check_multiple_ports.vtc - 4be_1srv_smtpchk_httpchk_layer47errors.vtc It happens that one of the reasons is the time it takes to connect to the local socket (hence the load-dependent aspect): if connect() on the loopback returns EINPROGRESS then this status is reported instead of a real error. Normally such a test is expected to see the error cleaned by tcp_connect_probe() but it really depends on the timing and instead we may very well send() first and see this error. The problem is that everything is collected based on errno, hoping it won't get molested in the way from the last unsuccessful syscall to wake_srv_chk(), which obviously is hard to guarantee. This patch at least makes sure that a few non-errors are reported as zero just like EAGAIN. It doesn't fix the root cause but makes it less likely to report incorrect failures. This fix could be backported as far as 1.9.	2019-12-27 14:04:33 +01:00
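The idea of reporting a few non-error errno values as zero can be sketched as below. This is only an illustration, not the patch itself: the function name `filter_check_errno` is hypothetical, and the commit message names only EAGAIN and EINPROGRESS, so treating EINTR the same way is an assumption made for the example.

```c
#include <assert.h>
#include <errno.h>

/* Return 0 for errno values that only mean "not done yet" or "try
 * again", and the original value for anything that is a real error.
 * EAGAIN and EINPROGRESS come from the commit message; EINTR is an
 * assumption added for illustration.
 */
int filter_check_errno(int err)
{
	assert(err >= 0); /* errno values are never negative */

	switch (err) {
	case EAGAIN:
	case EINPROGRESS:
	case EINTR:
		return 0;
	default:
		return err;
	}
}
```

A caller would pass the errno value saved right after the failed syscall, so that a connect() still in progress on the loopback is not mistaken for a check failure while ECONNREFUSED is still reported.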
Lukas Tribus	a26d1e1324	BUILD: ssl: improve SSL_CTX_set_ecdh_auto compatibility SSL_CTX_set_ecdh_auto() is not defined when OpenSSL 1.1.1 is compiled with the no-deprecated option. Remove existing, incomplete guards and add a compatibility macro in openssl-compat.h, just as OpenSSL does: `bf4006a6f9/include/openssl/ssl.h (L1486)` This should be backported as far as 2.0 and probably even 1.9.	2019-12-21 06:46:55 +01:00
Christopher Faulet	eec7f8ac01	BUG/MEDIUM: stream: Be sure to never assign a TCP backend to an HTX stream With a TCP frontend, it is possible to upgrade a connection to HTTP when the backend is in HTTP mode. Concretely, the upgrade installs a new mux. So, once it is done, the downgrade to TCP is no longer possible. So we must take care to never assign a TCP backend to a stream on this connection. Otherwise, HAProxy crashes because raw data from the server are handled as structured data on the client side. This patch fixes the issue #420. It must be backported to all versions supporting the HTX.	2019-12-20 18:09:49 +01:00
Christopher Faulet	6716cc2b93	BUG/MAJOR: mux-h1: Don't pretend the input channel's buffer is full if empty A regression was introduced by the commit `76014fd1` ("MEDIUM: h1-htx: Add HTX EOM block when the message is in H1_MSG_DONE state"). When nothing is copied in the channel's buffer when the input message is parsed, we erroneously pretend it is because there is not enough room by setting the CS_FL_WANT_ROOM flag on the conn-stream. This happens when a partial request is parsed. Because of this flag, we never try anymore to get input data from the mux because we first wait for more room in the channel's buffer, which is empty. Because of this bug, it is pretty easy to freeze a h1 connection. To fix the bug, we must obviously set the CS_FL_WANT_ROOM flag only when there are still data to transfer while the channel's buffer is not empty. This patch must be backported if the patch `76014fd1` is backported too. So for now, no backport needed.	2019-12-20 18:09:19 +01:00
Willy Tarreau	ca7a5af664	BUG/MINOR: state-file: do not leak memory on parse errors Issue #417 reports a possible memory leak in the state-file loading code. There's one such place in the loop which corresponds to parsing errors where the currently allocated line is not freed when dropped. In any case this is very minor in that no more than the file's length may be lost in the worst case, considering that the whole file is kept anyway in case of success. This fix addresses this. It should be backported to 2.1.	2019-12-20 17:33:05 +01:00
Willy Tarreau	fd1aa01f72	BUG/MINOR: state-file: do not store duplicates in the global tree The global state file tree isn't configured for unique keys, so if an entry appears multiple times, e.g. due to a bogus script that concatenates entries multiple times, this will needlessly eat memory. Let's just drop duplicates. This should be backported to 2.1.	2019-12-20 17:23:40 +01:00
Willy Tarreau	7d6a1fa311	BUG/MEDIUM: state-file: do not allocate a full buffer for each server entry Starting haproxy with a state file of 700k servers eats 11.2 GB of RAM due to a mistake in the function that loads the strings into a tree: it allocates a full buffer for each backend+server name instead of allocating just the required string. By just fixing this we're down to 80 MB. This should be backported to 2.1.	2019-12-20 17:18:13 +01:00
Olivier Houchard	fc51f0f588	BUG/MEDIUM: fd/threads: fix a concurrency issue between add and rm on the same fd There's a very hard-to-trigger bug in the FD list code where the fd_add_to_fd_list() function assumes that if the FD it's trying to add is already locked, it's in the process of being added. Unfortunately, it can also be in the process of being removed. It is very hard to trigger because it requires that one thread is removing the FD while another one is adding it. First very few FDs run on multiple threads (listeners and DNS), and second, it does not make sense to add and remove the FD at the same time. In practice the DNS code built on the older callback-only model does perform bursts of fd_want_send() for all resolvers at once when it wants to send a new query (dns_send_query()). And this is more likely to happen when there are lots of resolutions in parallel and many resolvers, because the dns_response_recv() callback can also trigger a series of queries on all resolvers for each invalid response it receives. This means that it really is perfectly possible to both stop and start in parallel during short periods of time there. This issue was not reported before 2.1, but 2.1 had the FD cache, built on the exact same code base. It's very possible that the issue caused exactly the opposite situation, where an event was occasionally lost, causing a DNS retry that worked, and nobody noticing the problem in the end. In 2.1 the lost entries are the updates asking for not polling for writes anymore, and the effect is that the poller continuously reports writability on the socket when the issue happens. This patch fixes bug #416 and must be backported as far as 1.8, and absolutely requires that previous commit "MINOR: fd/threads: make _GET_NEXT()/_GET_PREV() use the volatile attribute" is backported as well otherwise it will make the issue worse. Special thanks to Julien Pivotto for setting up a reliable reproducer for this difficult issue.	2019-12-20 08:09:28 +01:00
Willy Tarreau	337fb719ee	MINOR: fd/threads: make _GET_NEXT()/_GET_PREV() use the volatile attribute These macros are either used between atomic ops which cause the volatile to be implicit, or with an explicit volatile cast. However not having it in the macro causes some traps in the code because certain loop paths cannot safely be used without risking infinite loops if one isn't careful enough. Let's place the volatile attribute inside the macros and remove them from the explicit places to avoid this. It was verified that the output executable remains exactly the same byte-wise.	2019-12-20 08:09:28 +01:00
Olivier Houchard	54907bb848	BUG/MEDIUM: ssl: Revamp the way early data are handled. Instead of attempting to read the early data only when the upper layer asks for data, allocate a temporary buffer, stored in the ssl_sock_ctx, and put all the early data in there. Requiring that the upper layer takes care of it means that if for some reason the upper layer wants to emit data before it has totally read the early data, we will be stuck forever. This should be backported to 2.1 and 2.0. This may fix github issue #411.	2019-12-19 15:22:04 +01:00
Willy Tarreau	dd0e89a084	BUG/MAJOR: task: add a new TASK_SHARED_WQ flag to fix foreign requeuing Since 1.9 with commit `b20aa9eef3` ("MAJOR: tasks: create per-thread wait queues") a task bound to a single thread will not use locks when being queued or dequeued because the wait queue is assumed to be the owner thread's. But there exists a rare situation where this is not true: the health check tasks may be running on one thread waiting for a response, and may in parallel be requeued by another thread calling health_adjust() after detecting a response error in traffic when "observe l7" is set, and "fastinter" is lower than "inter", requiring to shorten the running check's timeout. In this case, the task being requeued was present in another thread's wait queue, thus opening a race during task_unlink_wq(), and gets requeued into the calling thread's wait queue instead of the running one's, opening a second race here. This patch aims at protecting against the risk of calling task_unlink_wq() from one thread while the task is queued on another thread, hence unlocked, by introducing a new TASK_SHARED_WQ flag. This new flag indicates that a task's position in the wait queue may be adjusted by other threads than the one currently executing it. This means that such WQ manipulations must be performed under a lock. There are two types of such tasks: - the global ones, using the global wait queue (technically speaking, those whose thread_mask has at least 2 bits set). - some local ones, which for now will be placed into the global wait queue as well in order to benefit from its lock. The flag is automatically set on initialization if the task's thread mask indicates more than one thread. The caller must also set it if it intends to let other threads update the task's expiration delay (e.g. delegated I/Os), or if it intends to change the task's affinity over time as this could lead to the same situation. 
Right now only the situation described above seems to be affected by this issue, and it is very difficult to trigger, and even then, will often have no visible effect beyond stopping the checks for example once the race is met. On my laptop it is feasible with the following config, chained to httpterm: global maxconn 400 # provoke FD errors, calling health_adjust() defaults mode http timeout client 10s timeout server 10s timeout connect 10s listen px bind :8001 option httpchk /?t=50 server sback 127.0.0.1:8000 backup server-template s 0-999 127.0.0.1:8000 check port 8001 inter 100 fastinter 10 observe layer7 This patch will automatically address the case for the checks because check tasks are created with multiple threads bound and will get the TASK_SHARED_WQ flag set. If in the future more tasks need to rely on this (multi-threaded muxes for example) and the use of the global wait queue becomes a bottleneck again, then it should not be too difficult to place locks on the local wait queues and queue the task on its bound thread. This patch needs to be backported to 2.1, 2.0 and 1.9. It depends on previous patch "MINOR: task: only check TASK_WOKEN_ANY to decide to requeue a task". Many thanks to William Dauchy for providing detailed traces allowing to spot the problem.	2019-12-19 14:42:22 +01:00
Willy Tarreau	8fe4253bf6	MINOR: task: only check TASK_WOKEN_ANY to decide to requeue a task After processing a task, its RUNNING bit is cleared and at the same time we check for other bits to decide whether to requeue the task or not. It happens that we only want to check the TASK_WOKEN_* bits, because : - TASK_RUNNING was just cleared - TASK_GLOBAL and TASK_QUEUE cannot be set yet as the task was running, preventing it from being requeued It's important not to catch yet undefined flags there because it would prevent addition of new task flags. This also shows more clearly that waking a task up with flags 0 is not something safe to do as the task will not be woken up if it's already running.	2019-12-19 14:42:22 +01:00
Willy Tarreau	262c3f1a00	MINOR: http: add a new "replace-path" action This action is very similar to "replace-uri" except that it only acts on the path component. This is assumed to better match users' expectations when they used to rely on "replace-uri" in HTTP/1 because mostly origin forms were used in H1 while mostly absolute URI form is used in H2, and their rules very often start with a '/', and as such do not match. It could help users to get this backported to 2.0 and 2.1.	2019-12-19 09:24:57 +01:00
Willy Tarreau	0851fd5eef	MINOR: debug: support logging to various sinks As discussed in the thread below [1], the debug converter is currently not of much use given that it's only built when DEBUG_EXPR is set, and it is limited to stderr only. This patch changes this to make it take an optional prefix and an optional target sink so that it can log to stdout, stderr or a ring buffer. The default output is the "buf0" ring buffer, that can be consulted from the CLI. [1] https://www.mail-archive.com/haproxy@formilux.org/msg35671.html Note: if this patch is backported, it also requires the following commit to work: `46dfd78cbf` ("BUG/MINOR: sample: always check converters' arguments").	2019-12-19 09:19:13 +01:00
William Lallemand	ba22e901b3	BUG/MINOR: ssl/cli: fix build for openssl < 1.0.2 Commit `d4f946c` ("MINOR: ssl/cli: 'show ssl cert' give information on the certificates") introduced a build issue with openssl version < 1.0.2 because it uses the certificate bundles.	2019-12-18 20:40:20 +01:00
William Lallemand	d4f946c469	MINOR: ssl/cli: 'show ssl cert' give information on the certificates Implement the 'show ssl cert' command on the CLI which lists the frontend certificates. With a certificate name as a parameter it will show more details.	2019-12-18 18:16:34 +01:00
Olivier Houchard	545989f37f	BUG/MEDIUM: ssl: Don't set the max early data we can receive too early. When accepting the max early data, don't set it on the SSL_CTX while parsing the configuration, as at this point global.tune.maxrewrite may still be -1, either because it was not set, or because it hasn't been set yet. Instead, set it for each connection, just after we created the new SSL. Not doing so meant that we could pretend to accept early data bigger than one of our buffers. This should be backported to 2.1, 2.0, 1.9 and 1.8.	2019-12-17 15:45:38 +01:00
Tim Duesterhus	cd3732456b	MINOR: sample: Validate the number of bits for the sha2 converter Instead of failing the conversion when an invalid number of bits is given the sha2 converter now fails with an appropriate error message during startup. The sha2 converter was introduced in `d437630237`, which is in 2.1 and higher.	2019-12-17 13:28:00 +01:00
Willy Tarreau	46dfd78cbf	BUG/MINOR: sample: always check converters' arguments In 1.5-dev20, sample-fetch arguments parsing was addressed by commit `689a1df0a1` ("BUG/MEDIUM: sample: simplify and fix the argument parsing"). The issue was that argument checks were not run for sample-fetches if parentheses were not present. Surprisingly, the fix was made only for sample-fetches and not for converters which suffer from the exact same problem. There are even a few comments in the code mentioning that some argument validation functions are not called when arguments are missing. This fix applies the exact same method as the one above. The impact of this bug is limited because over the years the code has learned to work around this issue instead of fixing it. This may be backported to all maintained versions.	2019-12-17 10:44:49 +01:00
Willy Tarreau	5060326798	BUG/MINOR: sample: fix the closing bracket and LF in the debug converter The closing bracket was emitted for the "debug" converter even when the opening one was not sent, and the new line was not always emitted. Let's fix this. This is harmless since this converter is not built by default.	2019-12-17 09:04:38 +01:00
Christopher Faulet	29f7284333	MINOR: http-htx: Add some htx sample fetches for debugging purpose These sample fetches are internal and must be used for debugging purpose. Idea is to have a way to add some checks on the HTX content from http rules. The main purpose is to ease reg-tests writing.	2019-12-11 16:46:16 +01:00
Christopher Faulet	76014fd118	MEDIUM: h1-htx: Add HTX EOM block when the message is in H1_MSG_DONE state During H1 parsing, the HTX EOM block is added before switching the message state to H1_MSG_DONE. It is an exception in the way to convert an H1 message to HTX. Except for this block, the message is first switched to the right state before starting to add the corresponding HTX blocks. For instance, the message is switched in H1_MSG_DATA state and then the HTX DATA blocks are added. With this patch, the message is switched to the H1_MSG_DONE state when all data blocks or trailers were processed. It is the caller responsibility to call h1_parse_msg_eom() when the H1_MSG_DONE state is reached. This way, it is far easier to catch failures when the HTX buffer is full. The H1 and FCGI muxes have been updated accordingly. This patch may eventually be backported to 2.1 if it helps other backports.	2019-12-11 16:46:16 +01:00
Willy Tarreau	719e07c989	BUILD/MINOR: unix sockets: silence an absurd gcc warning about strncpy() Apparently gcc developers decided that strncpy() semantics are no longer valid and now deserve a warning, especially if used exactly as designed. This results in issue #304. Let's just remove one to the target size to please her majesty gcc, the God of C Compilers, who tries hard to make users completely eliminate any use of string.h and reimplement it by themselves at much higher risks. Pfff.... This can be backported to stable version, the fix is harmless since it ignores the last zero that is already set on next line.	2019-12-11 16:29:10 +01:00
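The pattern described here, copying at most one byte less than the target size and setting the final zero separately, can be sketched as follows. `copy_path()` is a hypothetical helper written for this illustration and is not the function touched by the commit.

```c
#include <assert.h>
#include <string.h>

/* Copy src into a fixed-size field, always leaving it NUL-terminated. */
void copy_path(char *dst, size_t dst_size, const char *src)
{
    assert(dst != NULL && src != NULL && dst_size > 0);
    strncpy(dst, src, dst_size - 1); /* never writes to the last byte */
    dst[dst_size - 1] = '\0';        /* the terminator is set explicitly */
}
```

Since the last byte is overwritten on the next line anyway, leaving it out of the strncpy() length does not change the result, which matches the message's remark that the fix is harmless.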
Willy Tarreau	2444108f16	BUG/MINOR: server: make "agent-addr" work on default-server line As reported in issue #408, "agent-addr" doesn't work on default-server lines. This is due to the transcription of the old "addr" option in commit `6e5e0d8f9e` ("MINOR: server: Make 'default-server' support 'addr' keyword.") which correctly assigns it to the check.addr and agent.addr fields, but which also copies the default check.addr into both the check's and the agent's addr fields. Thus the default agent's address is never used. This fix makes sure to copy the check from the check and the agent from the agent. However it's worth noting that if "addr" is specified on the server line, it will still overwrite both the check and the agent's addresses. This must be backported as far as 1.8.	2019-12-11 15:43:45 +01:00
Willy Tarreau	cdcba115b8	BUG/MINOR: listener: do not immediately resume on transient error The listener supports a "transient error" situation, which corresponds to those situations where accept fails badly but poll() reports an event. This happens for example when a listener is paused, or on out of FD. The same mechanism is used when facing a maxconn or maxsessrate limitation. When this happens, the listener is disabled for up to 100ms and put back into the global listener queue so that it automatically wakes up again as soon as the conditions change from an existing connection releasing one resource, or the system recovers from a transient issue. The listener_accept() function has a bug in its exit path causing a freshly limited listener to be immediately enabled again because all the conditions are met (connection count < max). It doesn't take into account the fact that the listener might have been queued and must first wait for the timeout to expire before doing so. The impact is that upon certain errors, the faulty process will busy loop on the accept code without sleeping. This is the scenario reported and diagnosed by @hedong0411 in issue #382. This commit fixes it by verifying that the global queue's delay is at least expired before deciding to resume the listener. Another approach could consist in having an extra state like LI_DELAY for situations where only a delay is acceptable, but this would probably not bring anything except more complex code. This issue was introduced with the lock-free listener accept code (commits `3f0d02b` and `82c9789a`) that were backported to 1.8.20+ and 1.9.7+, so this fix must be backported to the relevant branches.	2019-12-11 15:06:30 +01:00
Willy Tarreau	d26c9f9465	BUG/MINOR: mworker: properly pass SIGTTOU/SIGTTIN to workers If a new process is started with -sf and it fails to bind, it may send a SIGTTOU to the master process in hope that it will temporarily unbind. Unfortunately this one doesn't catch it and stops to background instead of forwarding the signal to the workers. The same is true for SIGTTIN. This commit simply implements an extra signal handler for the master to deal with such signals that must be passed down to the workers. It must be backported as far as 1.8, though there the code differs in that it's entirely in haproxy.c and doesn't require an extra sig handler.	2019-12-11 14:26:53 +01:00
Willy Tarreau	51013e82d4	BUG/MINOR: log: fix minor resource leaks on logformat error path As reported by Ilya in issue #392, Coverity found that we're leaking allocated strings on error paths in parse_logformat(). Let's use a proper exit label for failures instead of seeding return 0 everywhere. This should be backported to all supported versions.	2019-12-11 12:05:39 +01:00
Willy Tarreau	c49ba52524	MINOR: tasks: split wake_expired_tasks() in two parts to avoid useless wakeups We used to have wake_expired_tasks() wake up tasks and return the next expiration delay. The problem this causes is that we have to call it just before poll() in order to consider latest timers, but this also means that we don't wake up all newly expired tasks upon return from poll(), which thus systematically requires a second poll() round. This is visible when running any scheduled task like a health check, as there are systematically two poll() calls, one with the interval, nothing is done after it, and another one with a zero delay, and the task is called: listen test bind *:8001 server s1 127.0.0.1:1111 check 09:37:38.200959 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8696843}) = 0 09:37:38.200967 epoll_wait(3, [], 200, 1000) = 0 09:37:39.202459 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8712467}) = 0 >> nothing run here, as the expired task was not woken up yet. 
09:37:39.202497 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8715766}) = 0 09:37:39.202505 epoll_wait(3, [], 200, 0) = 0 09:37:39.202513 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8719064}) = 0 >> now the expired task was woken up 09:37:39.202522 socket(AF_INET, SOCK_STREAM, IPPROTO_TCP) = 7 09:37:39.202537 fcntl(7, F_SETFL, O_RDONLY\|O_NONBLOCK) = 0 09:37:39.202565 setsockopt(7, SOL_TCP, TCP_NODELAY, [1], 4) = 0 09:37:39.202577 setsockopt(7, SOL_TCP, TCP_QUICKACK, [0], 4) = 0 09:37:39.202585 connect(7, {sa_family=AF_INET, sin_port=htons(1111), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 09:37:39.202659 epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLOUT, {u32=7, u64=7}}) = 0 09:37:39.202673 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8814713}) = 0 09:37:39.202683 epoll_wait(3, [{EPOLLOUT\|EPOLLERR\|EPOLLHUP, {u32=7, u64=7}}], 200, 1000) = 1 09:37:39.202693 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8818617}) = 0 09:37:39.202701 getsockopt(7, SOL_SOCKET, SO_ERROR, [111], [4]) = 0 09:37:39.202715 close(7) = 0 Let's instead split the function in two parts: - the first part, wake_expired_tasks(), called just before process_runnable_tasks(), wakes up all expired tasks; it doesn't compute any timeout. - the second part, next_timer_expiry(), called just before poll(), only computes the next timeout for the current thread. 
Thanks to this, all expired tasks are properly woken up when leaving poll, and each poll call's timeout remains up to date: 09:41:16.270449 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10223556}) = 0 09:41:16.270457 epoll_wait(3, [], 200, 999) = 0 09:41:17.270130 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10238572}) = 0 09:41:17.270157 socket(AF_INET, SOCK_STREAM, IPPROTO_TCP) = 7 09:41:17.270194 fcntl(7, F_SETFL, O_RDONLY\|O_NONBLOCK) = 0 09:41:17.270204 setsockopt(7, SOL_TCP, TCP_NODELAY, [1], 4) = 0 09:41:17.270216 setsockopt(7, SOL_TCP, TCP_QUICKACK, [0], 4) = 0 09:41:17.270224 connect(7, {sa_family=AF_INET, sin_port=htons(1111), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 09:41:17.270299 epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLOUT, {u32=7, u64=7}}) = 0 09:41:17.270314 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10337841}) = 0 09:41:17.270323 epoll_wait(3, [{EPOLLOUT\|EPOLLERR\|EPOLLHUP, {u32=7, u64=7}}], 200, 1000) = 1 09:41:17.270332 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10341860}) = 0 09:41:17.270340 getsockopt(7, SOL_SOCKET, SO_ERROR, [111], [4]) = 0 09:41:17.270367 close(7) = 0 This may be backported to 2.1 and 2.0 though it's unlikely to bring any user-visible improvement except to clarify debugging.	2019-12-11 09:42:58 +01:00
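The split described in this message can be illustrated with a toy timer table. This is only a sketch under simplifying assumptions: timers are plain integers in milliseconds, and `set_timer()`, `wake_expired()`, `next_expiry_delay()` and `NO_EXPIRY` are hypothetical stand-ins, not the scheduler's real functions or data structures.

```c
#include <assert.h>

#define NB_TIMERS 3
#define NO_EXPIRY (-1)

int expiry[NB_TIMERS]; /* expiration date of each timer, or NO_EXPIRY */
int woken[NB_TIMERS];  /* set to 1 once the timer's task was woken up */

void set_timer(int idx, int date)
{
    assert(idx >= 0 && idx < NB_TIMERS);
    expiry[idx] = date;
    woken[idx] = 0;
}

/* Part 1: wake every timer expired at <now>; computes no timeout. */
int wake_expired(int now)
{
    int i, count = 0;

    for (i = 0; i < NB_TIMERS; i++) {
        if (expiry[i] != NO_EXPIRY && expiry[i] <= now) {
            woken[i] = 1;
            expiry[i] = NO_EXPIRY;
            count++;
        }
    }
    return count;
}

/* Part 2: only compute the delay until the next expiration, to be used
 * as the poll timeout. Returns NO_EXPIRY when no timer is armed.
 */
int next_expiry_delay(int now)
{
    int i, best = NO_EXPIRY;

    for (i = 0; i < NB_TIMERS; i++) {
        int delay;

        if (expiry[i] == NO_EXPIRY)
            continue;
        delay = expiry[i] > now ? expiry[i] - now : 0;
        if (best == NO_EXPIRY || delay < best)
            best = delay;
    }
    return best;
}
```

In a loop built on this model, `wake_expired()` would run right after returning from the poller and before the runnable tasks are processed, and `next_expiry_delay()` would run just before polling again, so that a single poll round is enough per expiration.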
Willy Tarreau	d7f76a0a50	BUG/MEDIUM: proto_udp/threads: recv() and send() must not be exclusive. This is a complement to previous fix for bug #399. The exclusion between the recv() and send() calls prevents send handlers from being called if rx readiness is reported. The DNS code can trigger this situation with threads where the fd_recv_ready() flag disappears between the test in dgram_fd_handler() and the second test in dns_resolve_recv() while a thread calls fd_cant_recv(), and this situation can sustain itself for a while. With 8 threads and an error in the socket queue, placing a printf on the return statement in dns_resolve_recv() scrolls very fast. Simply removing the "else" in dgram_fd_handler() addresses the issue. This fix must be backported as far as 1.6.	2019-12-10 19:09:15 +01:00
Willy Tarreau	1c75995611	BUG/MAJOR: dns: add minimalist error processing on the Rx path It was reported in bug #399 that the DNS sometimes enters endless loops after hours working fine. The issue is caused by a lack of error processing in the DNS's recv() path combined with an exclusive recv OR send in the UDP layer, resulting in some errors causing CPU loops that will never stop until the process is restarted. The basic cause is that the FD_POLL_ERR and FD_POLL_HUP flags are sticky on the FD, and contrary to a stream socket, receiving an error on a datagram socket doesn't indicate that this socket cannot be used anymore. Thus the Rx code must at least handle this situation and flush the error otherwise it will constantly be reported. In theory this should not be a big issue but in practise it is due to another bug in the UDP datagram handler which prevents the send() callback from being called when Rx readiness was reported, so the situation cannot go away. It happens way more easily with threads enabled, so that there is no dead time between the moment the FD is disabled and another recv() is called, such as in the example below where the request was sent to a closed port on the loopback provoking an ICMP unreachable to be sent back: [pid 20888] 18:26:57.826408 sendto(29, ";\340\1\0\0\1\0\0\0\0\0\1\0031wt\2eu\0\0\34\0\1\0\0)\2\0\0\0\0\0\0\0", 35, 0, NULL, > [pid 20893] 18:26:57.826566 recvfrom(29, 0x7f97c54ef2f0, 513, 0, NULL, NULL) = -1 ECONNREFUSED (Connection refused) [pid 20889] 18:26:57.826601 recvfrom(29, 0x7f97c76182f0, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) [pid 20892] 18:26:57.826630 recvfrom(29, 0x7f97c5cf02f0, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) [pid 20891] 18:26:57.826684 recvfrom(29, 0x7f97c66162f0, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) [pid 20895] 18:26:57.826716 recvfrom(29, 0x7f97bffda2f0, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) [pid 
20894] 18:26:57.826747 recvfrom(29, 0x7f97c4cee2f0, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) [pid 20888] 18:26:58.419838 recvfrom(29, 0x7ffcc8712c20, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) [pid 20893] 18:26:58.419900 recvfrom(29, 0x7f97c54ef2f0, 513, 0, NULL, NULL) = -1 EAGAIN (Resource temporarily unavailable) (... hundreds before next sendto() ...) This situation was handled by clearing HUP and ERR when recv() returns <0. A second case was handled, there was a control for a missing dgram handler, but it does nothing, causing the FD to ring again if this situation ever happens. After looking at the rest of the code, it doesn't seem possible to face such a situation because these handlers are registered during startup, but at least we need to handle it properly. A third case was handled, that's mainly a small optimization. With threads and massive responses, due to the large lock around the loop, it's likely that some threads will have seen fd_recv_ready() and will wait at the lock(). But if they wait here, chances are that other threads will have eliminated pending data and issued fd_cant_recv(). In this case, better re-check fd_recv_ready() before performing the recv() call to avoid the huge amounts of syscalls that happen on massively threaded setups. This patch must be backported as far as 1.6 (the atomic AND just needs to be turned to a regular AND).	2019-12-10 19:09:15 +01:00
Olivier Houchard	eaefc3c503	BUG/MEDIUM: kqueue: Make sure we report read events even when no data. When we have a EVFILT_READ event, an optimization was made, and the FD was not reported as ready to receive if there were no data available. That way, if the socket was closed by our peer (the EV_EOF flag was set), and there were no remaining data to read, we would just close(), and avoid doing a recv(). However, it may be fine for a TCP socket, but it is not for UDP. If we send data via UDP, and we receive an error, the only way to detect it is to attempt a recv(). However, in this case, kevent() will report a read event, but with no data, so we'd just ignore that read event, nothing would be done about it, and the poller would be woken up by it over and over. To fix this, report read events if either we have data, or the EV_EOF flag is not set. This should be backported to 2.1, 2.0, 1.9 and 1.8.	2019-12-10 18:27:17 +01:00
Willy Tarreau	a1d97f88e0	REORG: listener: move the global listener queue code to listener.c The global listener queue code and declarations were still lying in haproxy.c while not needed there anymore at all. This complicates the code for no reason. As a result, the global_listener_queue_task and the global_listener_queue were made static.	2019-12-10 14:16:03 +01:00
Willy Tarreau	241797a3fc	MINOR: listener: split dequeue_all_listener() in two We use it half times for the global_listener_queue and half times for a proxy's queue and this requires the callers to take care of these. Let's split it in two versions, the current one working only on the global queue and another one dedicated to proxies for the per-proxy queues. This cleans up quite a bit of code.	2019-12-10 14:14:09 +01:00
Willy Tarreau	0591bf7deb	MINOR: listener: make the wait paths cleaner and more reliable In listener_accept() there are several situations where we have to wait for an event or a delay. These ones all implement their own call to limit_listener() and the associated task_schedule(). In addition to being ugly and confusing, one expire date computation is even wrong as it doesn't take into account the fact that we're using threads and that the value might change in the middle. Fortunately task_schedule() gets it right for us. This patch creates two jump locations, one for the global queue and one for the proxy queue, allowing the rest of the code to only compute the expire delay and jump to the right location.	2019-12-10 12:04:27 +01:00
Willy Tarreau	92079934a9	BUG/MEDIUM: listener/threads: fix a remaining race in the listener's accept() Recent fix `4c044e274c` ("BUG/MEDIUM: listener/thread: fix a race when pausing a listener") is insufficient and moves the race slightly farther. What now happens is that if we're limiting a listener due to a transient error such as an accept() error for example, or because the proxy's maxconn was reached, another thread might in the mean time have switched again to LI_READY and at the end of the function we'll disable polling on this FD, resulting in a listener that never accepts anything anymore. It can more easily happen when sending SIGTTOU/SIGTTIN to temporarily pause the listeners to let another process bind next to them. What this patch does instead is to move all enable/disable operations at the end of the function and condition them to the state. The listener's state is checked under the lock and the FD's polling state adjusted accordingly so that the listener's state and the FD always remain 100% synchronized. It was verified with 16 threads that the cost of taking that lock is not measurable so that's fine. This should be backported to the same branches the patch above is backported to.	2019-12-10 10:43:31 +01:00
Willy Tarreau	20aeb1c7cd	BUG/MINOR: listener: also clear the error flag on a paused listener When accept() fails because a listener is temporarily paused, the FD might have both FD_POLL_HUP and FD_POLL_ERR bits set. While we do not exploit FD_POLL_ERR here it's better to clear it because it is reported on "show fd" and is confusing. This may be backported to all versions.	2019-12-10 10:43:31 +01:00
Willy Tarreau	7cdeb61701	BUG/MINOR: listener/threads: always use atomic ops to clear the FD events There was a leftover of the single-threaded era when removing the FD_POLL_HUP flag from the listeners. By not using an atomic operation to clear the flag, another thread acting on the same listener might have lost some events, though this would have resulted in that thread to reprocess them immediately on the next loop pass. This should be backported as far as 1.8.	2019-12-10 10:43:31 +01:00
Willy Tarreau	67878d7bdc	BUG/MINOR: proxy: make soft_stop() also close FDs in LI_PAUSED state The proxies' soft_stop() function closes the FDs in all opened states except LI_PAUSED. This means that a transient error on a listener might cause it to turn back to the READY state if it happens exactly when a reload signal is received. This must be backported to all supported versions.	2019-12-10 10:43:31 +01:00
Christopher Faulet	f950c2e97e	BUG/MEDIUM: mux-fcgi: Handle cases where the HTX EOM block cannot be inserted During the HTTP response parsing, if there is not enough space in the channel's buffer, it is possible to fail to add the HTX EOM block while all data in the rxbuf were consumed. As for the h1 mux, we must notify the conn-stream the buffer is full to have a chance to add the HTX EOM block later. In this case, we must also be careful to not report a server abort by setting too early the CS_FL_EOS flag on the conn-stream. To do so, the FCGI_SF_APPEND_EOM flag must be set on the FCGI stream to know the HTX EOM block is missing. This patch must be backported to 2.1.	2019-12-09 09:30:50 +01:00
Christopher Faulet	7aae858001	BUG/MINOR: mux-h1: Be sure to set CS_FL_WANT_ROOM when EOM can't be added During the message parsing, when the HTX buffer is full and only the HTX EOM block cannot be added, it is important to notify the conn-stream that some processing must still be done but it is blocked because there is not enough room in the buffer. The way to do so is to set the CS_FL_WANT_ROOM flag on the conn-stream. Otherwise, because all data are received and consumed, the mux is not called anymore to add this last block, leaving the message unfinished from the HAProxy point of view. The only way to unblock it is to receive a shutdown for reads or to hit a timeout. This patch must be backported to 2.1 and 2.0. The 1.9 does not seem to be affected.	2019-12-09 09:30:50 +01:00
Willy Tarreau	a45a8b5171	MEDIUM: init: set NO_NEW_PRIVS by default when supported HAProxy doesn't need to call executables at run time (except when using external checks which are strongly recommended against), and is even expected to isolate itself into an empty chroot. As such, there basically is no valid reason to allow a setuid executable to be called without the user being fully aware of the risks. In a situation where haproxy would need to call external checks and/or disable chroot, exploiting a vulnerability in a library or in haproxy itself could lead to the execution of an external program. On Linux it is possible to lock the process so that any setuid bit present on such an executable is ignored. This significantly reduces the risk of privilege escalation in such a situation. This is what haproxy does by default. In case this causes a problem to an external check (for example one which would need the "ping" command), then it is possible to disable this protection by explicitly adding this directive in the global section. If enabled, it is possible to turn it back off by prefixing it with the "no" keyword. Before the option: $ socat - /tmp/sock1 <<< "expert-mode on; debug dev exec sudo /bin/id" uid=0(root) gid=0(root) groups=0(root) After the option: $ socat - /tmp/sock1 <<< "expert-mode on; debug dev exec sudo /bin/id" sudo: effective uid is not 0, is /usr/bin/sudo on a file system with the 'nosuid' option set or an NFS file system without root privileges?	2019-12-06 17:20:26 +01:00
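On Linux, the lock mentioned in this message corresponds to the `PR_SET_NO_NEW_PRIVS` operation of prctl(2). The sketch below shows that call in isolation; `lock_privileges()` and `privileges_locked()` are hypothetical wrapper names, and the way HAProxy actually wires this into its startup sequence is not shown here.

```c
#include <sys/prctl.h>

/* Ask the kernel to ignore setuid/setgid bits for anything this process
 * or its children execute from now on. Returns 0 on success, -1 on error.
 * The setting cannot be reverted once applied.
 */
int lock_privileges(void)
{
#ifdef PR_SET_NO_NEW_PRIVS
    return prctl(PR_SET_NO_NEW_PRIVS, 1, 0, 0, 0);
#else
    return -1; /* not supported by these kernel headers */
#endif
}

/* Returns 1 if the lock is active, 0 if not, -1 if unsupported. */
int privileges_locked(void)
{
#ifdef PR_GET_NO_NEW_PRIVS
    return prctl(PR_GET_NO_NEW_PRIVS, 0, 0, 0, 0);
#else
    return -1;
#endif
}
```

Once the attribute is set, a setuid helper such as sudo runs without elevated privileges, which is consistent with the "effective uid is not 0" error quoted above.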
Willy Tarreau	368bff40ce	MINOR: debug: replace popen() with pipe+fork() in "debug dev exec" popen() is annoying because it doesn't catch stderr. The command was implemented using it just by pure laziness, let's just redo it a bit cleaner using normal syscalls. Note that this command is only enabled when built with -DDEBUG_DEV.	2019-12-06 17:20:26 +01:00
Olivier Houchard	aebeff74fc	BUG/MEDIUM: checks: Make sure we set the task affinity just before connecting. In process_chk_conn(), make sure we set the task affinity to the current thread as soon as we're attempting a connection (and reset the affinity to "any thread" if we detect a failure). We used to only set the task affinity if connect_conn_chk() returned SF_ERR_NONE, however for TCP checks, SF_ERR_UP is returned, so for those checks, the task could still run on any thread, and this could lead to a race condition where the connection runs on one thread, while the task runs on another one, which could create random memory corruption and/or crashes. This may fix github issue #369. This should be backported to 2.1, 2.0 and 1.9.	2019-12-05 15:31:44 +01:00
Christopher Faulet	2545a0b352	BUG/MINOR: mux-h1: Fix conditions to know whether or not we may receive data The h1_recv_allowed() function is inherited from the h2 multiplexer. But for the h1, conditions to know if we may receive data are less complex because there is no multiplexing and because data are not parsed when received. So now, following rules are respected : * if an error or a shutdown for reads was detected on the connection we must not attempt to receive * if the input buffer failed to be allocated or is full, we must not try to receive * if the input processing is busy waiting for the output side, we may attempt to receive * otherwise must may not attempt to receive This patch must be backported as far as 1.9.	2019-12-05 13:36:03 +01:00
Christopher Faulet	7b109f2f8b	BUG/MINOR: mux-h1: Don't rely on CO_FL_SOCK_RD_SH to set H1C_F_CS_SHUTDOWN The CO_FL_SOCK_RD_SH flag is only set when a read0 is received. So we must not rely on it to set the H1 connection in shutdown state (H1C_F_CS_SHUTDOWN). In fact, it is sufficient to set the connection in shutdown state when the shutdown for writes is forwarded to the sock layer. This patch must be backported as far as 1.9.	2019-12-05 13:36:03 +01:00
Christopher Faulet	aaa67bcef2	BUG/MEDIUM: mux-h1: Never reuse H1 connection if a shutw is pending On the server side, when a H1 stream is detached from the connection, if the connection is not reusable but some outgoing data remain, the connection is not immediatly released. In this case, the connection is not inserted in any idle connection list. But it is still attached to the session. Because of that, it can be erroneously reused. h1_avail_streams() always report a free slot if no stream is attached to the connection, independently on the connection's state. It is obviously a bug. If a second request is handled by the same session (it happens with H2 connections on the client side), this connection is reused before we close it. There is small window to hit the bug, but it may lead to very strange behaviors. For instance, if a first h2 request is quickly aborted by the client while it is blocked in the mux on the server side (so before any response is received), a second request can be processed and sent to the server. Because the connection was not closed, the possible reply to the first request will be interpreted as a reply to the second one. It is probably the bug described by Peter Fröhlich in the issue #290. To fix the bug, a new flag has been added to know if an H1 connection is idle or not. So now, H1C_F_CS_IDLE is set when a connection is idle and useable to handle a new request. If it is set, we try to add the connection in an idle connection list. And h1_avail_streams() only relies on this flag now. Concretely, this flag is set when a K/A stream is detached and both the request and the response are in DONE state. It is exclusive to other H1C_F_CS flags. This patch must be backported as far as 1.9.	2019-12-05 13:31:16 +01:00
Emmanuel Hocdet	3777e3ad14	BUG/MINOR: ssl: certificate choice can be unexpected with openssl >= 1.1.1 It's a regression from `9f9b0c6` "BUG/MEDIUM: ECC cert should work with TLS < v1.2 and openssl >= 1.1.1". A wildcard EC certificate could be selected at the expense of a specific RSA certificate. In any case, a specific certificate should always be selected first, then a wildcard. Reflect this rule in a loop to avoid any bug in certificate selection changes. Fix issue #394. It should be backported as far as 1.8.	2019-12-05 10:49:24 +01:00
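The "specific first, then wildcard" rule from the commit above can be sketched as a two-level loop. This is only an illustration in Python, not the code from ssl_sock.c: the function name `pick_certificate`, the `certs` mapping and the `key_prefs` tuple are all hypothetical, and the real selection also takes into account what the client advertises.

```python
def pick_certificate(servername, certs, key_prefs=("ecdsa", "rsa")):
    """Pick a certificate label for servername.

    certs is a hypothetical mapping of (pattern, key_type) -> label, where
    pattern is either an exact name or a one-level wildcard like "*.example.com".
    """
    name = servername.lower()
    _, dot, parent = name.partition(".")

    # Candidate patterns in priority order: the exact name always comes
    # before the wildcard, whatever the key type.
    patterns = [name]
    if dot:
        patterns.append("*." + parent)

    for pattern in patterns:
        # The key type preference only applies within one pattern level,
        # so a wildcard EC entry can never beat a specific RSA entry.
        for key_type in key_prefs:
            cert = certs.get((pattern, key_type))
            if cert is not None:
                return cert
    return None
```

With a specific RSA entry and a wildcard EC entry loaded, a request for the specific name gets the RSA certificate even though EC is preferred, which is the behavior the commit message asks for.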
Willy Tarreau	4c044e274c	BUG/MEDIUM: listener/thread: fix a race when pausing a listener There exists a race in the listener code where a thread might disable receipt on a listener's FD then turn it to LI_PAUSED while at the same time another one faces EAGAIN on accept() and enables it again via fd_cant_recv(). The result is that the FD is in LI_PAUSED state with its polling still enabled. listener_accept() does not do anything then and doesn't disable the FD either, resulting in a thread eating all the CPU as reported in issue #358. A solution would be to take the listener's lock to perform the fd_cant_recv() call and do it only if the FD is still in LI_READY state, but this would be totally overkill while in practice the issue only happens during shutdown. Instead what is done here is that when leaving we recheck the state and disable polling if the listener is not in LI_READY state, which never happens except when being limited. In the worst case there could be one extra check per thread for the time required to converge, which is absolutely nothing. This fix was successfully tested, and should be backported to all versions using the lock-free listeners, which means all those containing commit `3f0d02bb` ("MAJOR: listener: do not hold the listener lock in listener_accept()"), hence 2.1, 2.0, 1.9.7+, 1.8.20+.	2019-12-05 07:40:32 +01:00
William Lallemand	920b035238	BUG/MINOR: ssl/cli: don't overwrite the filters variable When a crt-list line using an already used ckch_store does not contain filters, it will overwrite the ckchs->filters variable with 0. This problem will generate all sni_ctx of this ckch_store without filters. Filters generation mustn't be allowed in any case. Must be backported in 2.1.	2019-12-05 00:00:04 +01:00
Willy Tarreau	c640ef1a7d	BUG/MINOR: stream-int: avoid calling rcv_buf() when splicing is still possible In si_cs_recv(), we can end up with a partial splice() call that will be followed by an attempt to use rcv_buf(). Sometimes this works and places data into the buffer, which then prevents splicing from being used, and this causes splice() and recvfrom() calls to alternate. Better simply refrain from calling rcv_buf() when there are data in the pipe and still data to be forwarded. Usually this indicates that we've eaten everything available and that we still want to use splice() on subsequent calls. This should be backported to 2.1 and 2.0.	2019-12-04 11:55:49 +01:00
Willy Tarreau	1ac5f20804	BUG/MEDIUM: stream-int: don't subscribed for recv when we're trying to flush data If we cannot splice incoming data using rcv_pipe() due to remaining data in the buffer, we must not subscribe to the mux but instead tag the stream-int as blocked on missing Rx room. Otherwise when data are flushed, calling si_chk_rcv() will have no effect because the WAIT_EP flag remains present, and we'll end in an rx timeout. This case is very hard to reproduce, and requires an inversion of the polling side in the middle of a transfer. This can only happen when the client and the server are using similar links and when splicing is enabled. It typically takes hundreds of MB to GB for the problem to happen, and tends to be magnified by the use of option contstats which causes process_stream() to be called every 5s and to try again to recv. This fix must be backported to 2.1, 2.0, and possibly 1.9.	2019-12-04 11:55:49 +01:00
William Lallemand	230662a0dd	BUG/MINOR: ssl/cli: 'ssl cert' cmd only usable w/ admin rights The 3 commands 'set ssl cert', 'abort ssl cert' and 'commit ssl cert' must be only usable with admin rights over the CLI. Must be backported in 2.1.	2019-12-03 15:10:46 +01:00
Willy Tarreau	d96f1126fe	MEDIUM: init: prevent process and thread creation at runtime Some concerns are regularly raised about the risk to inherit some Lua files which make use of a fork (e.g. via os.execute()) as well as whether or not some of bugs we fix might or not be exploitable to run some code. Given that haproxy is event-driven, any foreground activity completely stops processing and is easy to detect, but background activity is a different story. A Lua script could very well discretely fork a sub-process connecting to a remote location and taking commands, and some injected code could also try to hide its activity by creating a process or a thread without blocking the rest of the processing. While such activities should be extremely limited when run in an empty chroot without any permission, it would be better to get a higher assurance they cannot happen. This patch introduces something very simple: it limits the number of processes and threads to zero in the workers after the last thread was created. By doing so, it effectively instructs the system to fail on any fork() or clone() syscall. Thus any undesired activity has to happen in the foreground and is way easier to detect. This will obviously break external checks (whose concept is already totally insecure), and for this reason a new option "insecure-fork-wanted" was added to disable this protection, and it is suggested in the fork() error report from the checks. It is obviously recommended not to use it and to reconsider the reasons leading to it being enabled in the first place. If for any reason we fail to disable forks, we still start because it could be imaginable that some operating systems refuse to set this limit to zero, but in this case we emit a warning, that may or may not be reported since we're after the fork point. Ideally over the long term it should be conditionned by strict-limits and cause a hard fail.	2019-12-03 11:49:00 +01:00
Christopher Faulet	bc271ec113	BUG/MINOR: stats: Fix HTML output for the frontends heading Since the flag STAT_SHOWADMIN was removed, the frontends heading in the HTML output appears unaligned because the space reserved for the checkbox (not displayed for frontends) is not inserted. This patch fixes the issue #390. It must be backported to 2.1.	2019-12-02 11:40:04 +01:00
Christopher Faulet	bc96c90614	BUG/MINOR: fcgi-app: Make the directive pass-header case insensitive The header name configured by the directive "pass-header", in the "fcgi-app" section, must be case insensitive. For now, it must be in lowercase to match an header. Internally, header names are in lowercase but there is no reason to impose this syntax in the configuration. This patch must be backported to 2.1.	2019-12-02 10:38:52 +01:00
Emmanuel Hocdet	140b64fb56	BUG/MINOR: ssl: fix SSL_CTX_set1_chain compatibility for openssl < 1.0.2 Commit `1c65fdd5` "MINOR: ssl: add extra chain compatibility" really implement SSL_CTX_set0_chain. Since ckch can be used to init more than one ctx with openssl < 1.0.2 (commit `89f58073` for X509_chain_up_ref compatibility), SSL_CTX_set1_chain compatibility is required. This patch must be backported to 2.1.	2019-11-29 17:02:30 +01:00
Christopher Faulet	f3ad62996f	BUG/MINOR: http-htx: Don't make http_find_header() fail if the value is empty http_find_header() is used to find the next occurrence of a header matching on its name. When found, the matching header is returned with the corresponding value. This value may be empty. Unfortunately, because of a bug, an empty value makes the function fail. This patch must be backported to 2.1, 2.0 and 1.9.	2019-11-29 11:48:15 +01:00
William Dauchy	be8a387e93	CLEANUP: dns: resolution can never be null `eb` being tested above, `res` cannot be null, so the condition is not needed and introduces potential dead code. also fix a typo in associated comment This should fix issue #349 Reported-by: Илья Шипицин <chipitsine@gmail.com> Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-11-28 20:41:46 +01:00
Emmanuel Hocdet	b270e8166c	MINOR: ssl: deduplicate crl-file Loading a file for crl or ca-cert is really done with the same function in OpenSSL, via X509_STORE_load_locations. Accordingly, the deduplicated crl-file and ca-file can share the same function.	2019-11-28 11:11:20 +01:00
Emmanuel Hocdet	129d3285a5	MINOR: ssl: compute ca-list from deduplicate ca-file ca-list can be extracted from ca-file already loaded in memory. This patch set ca-list from deduplicated ca-file when needed and share it in ca-file tree. As a corollary, this will prevent file access for ca-list when updating a certificate via CLI.	2019-11-28 11:11:20 +01:00
Emmanuel Hocdet	d4f9a60ee2	MINOR: ssl: deduplicate ca-file Typically server line like: 'server-template srv 1-1000 *:443 ssl ca-file ca-certificates.crt' load ca-certificates.crt 1000 times and stay duplicated in memory. Same case for bind line: ca-file is loaded for each certificate. Same 'ca-file' can be load one time only and stay deduplicated in memory. As a corollary, this will prevent file access for ca-file when updating a certificate via CLI.	2019-11-28 11:11:20 +01:00
Willy Tarreau	e18f53e01c	BUILD/MINOR: trace: fix use of long type in a few printf format strings Building on a 32-bit platform produces these warnings in trace code: src/stream.c: In function 'strm_trace': src/stream.c:226:29: warning: format '%lu' expects argument of type 'long unsigned int', but argument 9 has type 'size_t {aka const unsigned int}' [-Wformat=] chunk_appendf(&trace_buf, " req=(%p .fl=0x%08x .ana=0x%08x .exp(r,w,a)=(%u,%u,%u) .o=%lu .tot=%llu .to_fwd=%u)", ^ src/stream.c:229:29: warning: format '%lu' expects argument of type 'long unsigned int', but argument 9 has type 'size_t {aka const unsigned int}' [-Wformat=] chunk_appendf(&trace_buf, " res=(%p .fl=0x%08x .ana=0x%08x .exp(r,w,a)=(%u,%u,%u) .o=%lu .tot=%llu .to_fwd=%u)", ^ src/mux_fcgi.c: In function 'fcgi_trace': src/mux_fcgi.c:443:29: warning: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t {aka const unsigned int}' [-Wformat=] chunk_appendf(&trace_buf, " - VAL=%lu", val); ^ src/mux_h1.c: In function 'h1_trace': src/mux_h1.c:290:29: warning: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t {aka const unsigned int}' [-Wformat=] chunk_appendf(&trace_buf, " - VAL=%lu", val); ^ Let's just cast the type to long. This should be backported to 2.1.	2019-11-27 15:45:11 +01:00
Christopher Faulet	bc7c03eba3	BUG/MINOR: h1: Don't test the host header during response parsing During the H1 message parsing, the host header is tested to be sure it matches the request's authority, if defined. When there are multiple host headers, we also take care they are all the same. Of course, these tests must only be performed on the requests. A host header in a response has no special meaning. This patch must be backported to 2.1.	2019-11-27 14:01:17 +01:00
Tim Duesterhus	9312853530	CLEANUP: ssl: Clean up error handling This commit removes the explicit checks for `if (err)` before passing `err` to `memprintf`. `memprintf` already checks itself whether the `*out` parameter is `NULL` before doing anything. This reduces the indentation depth and makes the code more readable, before there is less boilerplate code. Instead move the check into the ternary conditional when the error message should be appended to a previous message. This is consistent with the rest of ssl_sock.c and with the rest of HAProxy. Thus this patch is the arguably cleaner fix for issue #374 and builds upon `5f1fa7db86` and `8b453912ce` Additionally it fixes a few places where the check still was missing.	2019-11-26 04:16:56 +01:00
Willy Tarreau	2e7fdfc9a1	BUG/MEDIUM: trace: fix a typo causing an incorrect startup error Since commit `88ebd40` ("MINOR: trace: add allocation of buffer-sized trace buffers") we have a trace buffer allocated at boot time. But there was a copy-paste error there making the test verify that the trash was allocated instead of the trace buffer. The result is that depending on the link order either the test will succeed or fail, preventing haproxy from starting at all. No backport is needed.	2019-11-25 19:47:22 +01:00
Willy Tarreau	f3ce0418aa	MINOR: mux-h2/trace: report the connection and/or stream error code We were missing the error code when tracing a call to h2s_error() or h2c_error(), let's report it when it's set.	2019-11-25 11:34:26 +01:00
Willy Tarreau	57a1816fae	BUG/MAJOR: mux-h2: don't try to decode a response HEADERS frame in idle state Christopher found another issue in the H2 backend implementation that results from a miss in the H2 spec: the processing of a HEADERS frame is always permitted in IDLE state, but this doesn't make sense on the response path! And here when facing such a frame, we try to decode it while we didn't allocate any stream, so we end up trying to fill the idle stream's buffer (read-only) and crash. What we're doing here is that if we get a HEADERS frame in IDLE state from a server, we terminate the connection with a PROTOCOL_ERROR. No such transition seems to be permitted by the spec but it seems to be the only sane solution. This fix must be backported as far as 1.9. Note that in 2.0 and earlier there's no h2_frame_check_vs_state() function, instead the check is inlined in h2_process_demux().	2019-11-25 11:34:20 +01:00
Willy Tarreau	146f53ae7e	BUG/MAJOR: h2: make header field name filtering stronger Tim Düsterhus found that the amount of sanitization we perform on HTTP header field names received in H2 is insufficient. Currently we reject upper case letters as mandated by RFC7540#8.1.2, but section 10.3 also requires that intermediaries translating streams to HTTP/1 further refine the filtering to also reject invalid names (which means any name that doesn't match a token). There is a small trick here which is that the colon character used to start pseudo-header names doesn't match a token, so pseudo-header names fall into that category, thus we have to swap the pseudo-header name lookup with this check so that we only check from the second character (past the ':') in case of pseudo-header names. Another possibility could have been to perform this check only in the HTX-to-H1 transcoder but doing so would still expose the configured rules and logs to such header names. This fix must be backported as far as 1.8 since this bug could be exploited and serve as the base for an attack. In 2.0 and earlier, functions h2_make_h1_request() and h2_make_h1_trailers() must also be adapted to sanitize requests coming in legacy mode.	2019-11-25 11:11:32 +01:00
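As a rough illustration of the name check described in the commit above, here is a minimal Python sketch. It is not the H2 mux code: `TOKEN_CHARS` and `is_valid_h2_header_name` are hypothetical names, and the character set is the `tchar` rule from RFC 7230 section 3.2.6 with upper case letters left out, since RFC 7540 section 8.1.2 forbids them in HTTP/2 field names.

```python
import string

# tchar characters from RFC 7230 section 3.2.6, minus upper case letters,
# which HTTP/2 does not allow in field names.
TOKEN_CHARS = frozenset("!#$%&'*+-.^_`|~" + string.digits + string.ascii_lowercase)


def is_valid_h2_header_name(name: str) -> bool:
    """Return True if name is a lower case token, optionally prefixed by ':'."""
    # Pseudo-header names start with ':', which is not a token character,
    # so the token check only starts from the second character for them.
    body = name[1:] if name.startswith(":") else name
    return len(body) > 0 and all(ch in TOKEN_CHARS for ch in body)
```

A colon anywhere other than the first position is still rejected, as are spaces, control characters and empty names.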
Willy Tarreau	54f53ef7ce	BUG/MAJOR: h2: reject header values containing invalid chars Tim Düsterhus reported an annoying problem in the H2 decoder related to an ambiguity in the H2 spec. The spec says in section 10.3 that HTTP/2 allows header field values that are not valid (since they're binary) and at the same time that an H2 to H1 gateway must be careful to reject headers whose values contain \0, \r or \n. Till now, and for the sake of the ability to maintain end-to-end binary transparency in H2-to-H2, the H2 mux wouldn't reject this since it does not know what version will be used on the other side. In theory we should in fact perform such a check when converting an HTX header to H1. But this causes a problem as it means that all our rule sets, sample fetches, captures, logs or redirects may still find an LF in a header coming from H2. Also in 2.0 and older in legacy mode, the frames are instantly converted to H1 and HTX couldn't help there. So this means that in practice we must refrain from delivering such a header upwards, regardless of any outgoing protocol consideration. Applying such a lookup on all headers leaving the mux comes with a significant performance hit, especially for large ones. A first attempt was made at placing this into the HPACK decoder to refrain from learning invalid literals but error reporting becomes more complicated. Additional tests show that doing this within the HTX transcoding loop benefits from the hot L1 cache, and that by skipping up to 8 bytes per iteration the CPU cost remains within noise margin, around ~0.5%. This patch must be backported as far as 1.8 since this bug could be exploited and serve as the base for an attack. In 2.0 and earlier the fix must also be added to functions h2_make_h1_request() and h2_make_h1_trailers() to handle legacy mode. It relies on previous patch "MINOR: ist: add ist_find_ctl()" to speed up the control bytes lookup. All credits go to Tim for his detailed bug report and his initial patch.	2019-11-25 11:06:19 +01:00
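The value check described in the commit above boils down to refusing any header value that contains NUL, CR or LF. The Python sketch below shows only that rule: the names `FORBIDDEN_BYTES` and `has_forbidden_ctl` are hypothetical, and it scans byte by byte, so it does not reproduce the 8-bytes-per-iteration optimization that the commit attributes to ist_find_ctl().

```python
# Bytes that must never be delivered upwards in a header value:
# NUL (\0), LF (\n) and CR (\r).
FORBIDDEN_BYTES = frozenset((0x00, 0x0A, 0x0D))


def has_forbidden_ctl(value: bytes) -> bool:
    """Return True if the header value contains NUL, CR or LF."""
    # Iterating over a bytes object yields integers, one per byte.
    return any(byte in FORBIDDEN_BYTES for byte in value)
```

Other binary content, including high bytes and tabs, passes this particular check; only the three bytes listed above cause a rejection.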
William Lallemand	2e945c8ee7	BUG/MINOR: cli: fix out of bounds in -S parser Out of bounds when the number or arguments is greater than MAX_LINE_ARGS. Fix issue #377. Must be backported in 2.0 and 1.9.	2019-11-25 10:04:34 +01:00
William Dauchy	c8bb1539cb	CLEANUP: ssl: check if a transaction exists once before setting it trivial patch to fix issue #351 Fixes: `bc6ca7ccaa` ("MINOR: ssl/cli: rework 'set ssl cert' as 'set/commit'") Reported-by: Илья Шипицин <chipitsine@gmail.com> Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-11-25 08:58:44 +01:00
Tim Duesterhus	c0e820c352	BUG/MINOR: ssl: Stop passing dynamic strings as format arguments gcc complains rightfully: src/ssl_sock.c: In function ‘ssl_sock_prepare_all_ctx’: src/ssl_sock.c:5507:3: warning: format not a string literal and no format arguments [-Wformat-security] ha_warning(errmsg); ^ src/ssl_sock.c:5509:3: warning: format not a string literal and no format arguments [-Wformat-security] ha_alert(errmsg); ^ src/ssl_sock.c: In function ‘cli_io_handler_commit_cert’: src/ssl_sock.c:10208:3: warning: format not a string literal and no format arguments [-Wformat-security] chunk_appendf(trash, err); Introduced in `8b453912ce`.	2019-11-25 08:55:34 +01:00
Lukas Tribus	d14b49c128	BUG/MINOR: ssl: fix curve setup with LibreSSL Since commit `9a1ab08` ("CLEANUP: ssl-sock: use HA_OPENSSL_VERSION_NUMBER instead of OPENSSL_VERSION_NUMBER") we restrict LibreSSL to the OpenSSL 1.0.1 API, to avoid breaking LibreSSL every minute. We set HA_OPENSSL_VERSION_NUMBER to 0x1000107fL if LibreSSL is detected and only allow curves to be configured if HA_OPENSSL_VERSION_NUMBER is at least 0x1000200fL. However all relevant LibreSSL releases actually support settings curves, which is now broken. Fix this by always allowing curve configuration when using LibreSSL. Reported on GitHub in issue #366. Fixes: `9a1ab08` ("CLEANUP: ssl-sock: use HA_OPENSSL_VERSION_NUMBER instead of OPENSSL_VERSION_NUMBER").	2019-11-24 18:24:20 +01:00
William Dauchy	5f1fa7db86	MINOR: ssl: fix possible null dereference in error handling recent commit `8b453912ce` ("MINOR: ssl: ssl_sock_prepare_ctx() return an error code") converted all errors handling; in this patch we always test `err`, but three of them are missing. I did not found a plausible explanation about it. this should fix issue #374 Fixes: `8b453912ce` ("MINOR: ssl: ssl_sock_prepare_ctx() return an error code") Reported-by: Илья Шипицин <chipitsine@gmail.com> Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-11-23 21:38:15 +01:00
Willy Tarreau	47479eb0e7	MINOR: version: emit the link to the known bugs in output of "haproxy -v" The link to the known bugs page for the current version is built and reported there. When it is a development version (less than 2 dots), instead a link to github open issues is reported as there's no way to be sure about the current situation in this case and it's better that users report their trouble there.	2019-11-21 18:48:20 +01:00
Willy Tarreau	08dd202d73	MINOR: version: report the version status in "haproxy -v" As discussed on Discourse here: https://discourse.haproxy.org/t/haproxy-branch-support-lifetime/4466 it's not always easy for end users to know the lifecycle of the version they are using. This patch introduces a "Status" line in the output of "haproxy -vv" indicating whether it's a development, stable, long-term supported version, possibly with an estimated end of life for the branch when it can be anticipated (e.g. for stable versions). This field should be adjusted when creating a major release to reflect the new status. It may make sense to backport this to other branches to clarify the situation.	2019-11-21 18:47:54 +01:00
William Lallemand	ed44243de7	MINOR: ssl/cli: display warning during 'commit ssl cert' Display the warnings on the CLI during a commit of the certificates.	2019-11-21 17:48:11 +01:00
William Lallemand	8ef0c2a569	MEDIUM: ssl/cli: apply SSL configuration on SSL_CTX during commit Apply the configuration of the ssl_bind_conf on the generated SSL_CTX. It's a little bit hacky at the moment because the ssl_sock_prepare_ctx() function was made for the configuration parsing, not for being using at runtime. Only the 'verify' bind keyword seems to cause a file access so we prevent it before calling the function.	2019-11-21 17:48:11 +01:00
William Lallemand	8b453912ce	MINOR: ssl: ssl_sock_prepare_ctx() return an error code Rework ssl_sock_prepare_ctx() so it fills a buffer with the error messages instead of using ha_alert()/ha_warning(). Also returns an error code (ERR_*) instead of the number of errors.	2019-11-21 17:48:11 +01:00
Daniel Corbett	f8716914c7	MEDIUM: dns: Add resolve-opts "ignore-weight" It was noted in #48 that there are times when a configuration may use the server-template directive with SRV records and simultaneously want to control weights using an agent-check or through the runtime api. This patch adds a new option "ignore-weight" to the "resolve-opts" directive. When specified, any weight indicated within an SRV record will be ignored. This is for both initial resolution and ongoing resolution.	2019-11-21 17:25:31 +01:00
Christopher Faulet	e6d8cb1e91	BUG/MINOR: stream-int: Fix si_cs_recv() return value The previous patch on this function (`36b536d6c` "BUG/MEDIUM: stream-int: Don't loose events on the CS when an EOS is reported") contains a bug. The return value is based on the conn-stream's flags. But it may be reset if the CS is closed. Ironically it was exactly the purpose of this patch... This patch must be backported to 2.0 and 1.9.	2019-11-20 16:48:01 +01:00
Christopher Faulet	145719a722	BUG/MINOR: http-ana: Properly catch aborts during the payload forwarding When no data filter are registered on a channel, if the message length is known, the HTTP payload is infinitely forwarded to save calls to process_stream(). When we finally fall back again in XFER_BODY analyzers, we detect the end of the message by checking channel flags. If CF_EOI or CF_SHUTR is set, we switch the message in DONE state. For CF_EOI, it is relevant. But not for CF_SHUTR. a shutdown for reads without the end of input must be interpreted as an abort for messages with a known length. Because of this bug, some aborts are not properly handled and reported. Instead, we interpret it as a legitimate shutdown. This patch must be backported to 2.0.	2019-11-20 14:11:47 +01:00
Christopher Faulet	f3158e94ee	BUG/MINOR: mux-h1: Fix tunnel mode detection on the response path There are two issues with the way tunnel mode is detected on the response path. First, when a response with an unknown content length is handled, the request is also switched in tunnel mode. It is obviously wrong. Because it was done on the server side only (so not during the request parsing), it is no noticeable effects. The second issue is about the way protocol upgrades are handled. The request is switched in tunnel mode from the time the 101 response is processed. So an unfinished request may be switched in tunnel mode too early. It is not a common use, but a protocol upgrade on a POST is allowed. Thus, parsing of the payload may be hijacked. It is especially bad for chunked payloads. Now, conditions to switch the request in tunnel mode reflect what should be done. Especially for the second issue. We wait the end of the request to switch it in tunnel mode. This patch must be backported to 2.0 and 1.9. Note that these versions are only affected by the second issue but the patch cannot be easily splitted.	2019-11-20 14:11:47 +01:00
Christopher Faulet	ea009736d8	BUILD: debug: Avoid warnings in dev mode with -02 because of some BUG_ON tests Some BUG_ON() tests emit a warning because of a potential null pointer dereference on an HTX block. In fact, it should never happen, but now, GCC is happy. This patch must be backported to 2.0.	2019-11-20 14:11:47 +01:00
Christopher Faulet	36b536d6c8	BUG/MEDIUM: stream-int: Don't loose events on the CS when an EOS is reported In si_cs_recv(), when a shutdown for reads is handled, the conn-stream may be closed. It happens when the ouput channel is closed for writes or if SI_FL_NOHALF is set on the stream-interface. In this case, conn-stream's flags are reset. Thus, if an error (CS_FL_ERROR) or an end of input (CS_FL_EOI) is reported by the mux, the event is lost. si_cs_recv() does not report these events by itself. It relies on si_cs_process() to report them to the stream-interface and/or the channel. For instance, if CS_FL_EOS and CS_FL_EOI are set by the H1 multiplexer during a call to si_cs_recv() on the server side, if the conn-stream is closed (read0 + SI_FL_NOHALF), the CS_FL_EOI flag is lost. Thus, this may lead the stream to interpret it as a server abort. Now, conn-stream's flags are processed at the end of si_cs_recv(). The function is responsible to set the right flags on the stream-interface and/or the channel. Due to this patch, the function is now almost linear. Except some early checks at the beginning, there is only one return statement. It also fixes a potential bug because of an inconsistency between the splicing and the buffered receipt. On the first case, CS_FL_EOS if handled before errors on the connection or the conn-stream. On the second one, it is the opposite. This patch must be backported to 2.0 and 1.9.	2019-11-20 14:11:47 +01:00
Eric Salama	3c8bde88ca	BUILD/MINOR: ssl: fix compiler warning about useless statement There is a compiler warning after commit `a9363eb6` ("BUG/MEDIUM: ssl: 'tune.ssl.default-dh-param' value ignored with openssl > 1.1.1"): src/ssl_sock.c: In function 'ssl_sock_prepare_ctx': src/ssl_sock.c:4481:4: error: statement with no effect [-Werror=unused-value] Fix it by adding a (void)	2019-11-20 13:49:21 +01:00
Frédéric Lécaille	3585cab221	BUG/MINOR: peers: "peer alive" flag not reset when deconnecting. The peer flags (->flags member of peer struct) are reset by __peer_session_deinit() function. PEER_F_ALIVE flag which is used by the heartbeat part of the peer protocol to mark a peer as being alive was not reset by this function. This simple patch adds the statement to this. Note that, at this time, there was no identified issue due to this missing reset. Must be backported to 2.0.	2019-11-20 13:38:13 +01:00
William Lallemand	677e2f2c35	BUG/MEDIUM: mworker: don't fill the -sf argument with -1 during the reexec Upon a reexec_on_failure, if the process tried to exit after the initialization of the process structure but before it was filled with a PID, the PID in the mworker_proc structure is set to -1. In this particular case the -sf argument is filled with -1 and haproxy will exit with the usage message because of that argument. Should be backported in 2.0.	2019-11-19 17:30:34 +01:00
William Lallemand	0bc9c8a243	MINOR: ssl/cli: 'abort ssl cert' deletes an on-going transaction This patch introduces the new CLI command 'abort ssl cert' which abort an on-going transaction and free its content. This command takes the name of the filename of the transaction as an argument.	2019-11-19 16:21:24 +01:00
Frédéric Lécaille	af9990f035	BUG/MINOR: peers: Wrong null "server_name" data field handling. As the peers protocol expects to parse at least one encoded integer value for each stick-table data field even when not configured on the local side, about the "server_name" data field we must emit something even if it has not been set (no server was configured for instance). As this data field is made of first one encoded integer which is the length of the remaining data (the dictionary cache entry), we encode the length 0 when emitting such an absent dictionary cache entry. On the remote side, when we decode such an integer with 0 as value, we stop parsing the data field and that's it. Must be backported to 2.0.	2019-11-19 14:48:33 +01:00
Frédéric Lécaille	ec1c10b839	MINOR: peers: Add debugging information to "show peers". This patch adds three counters to help in debugging peers protocol issues to "peer" struct: ->no_hbt counts the number of reconnection periods without receiving heartbeat ->new_conn counts the number of reconnections after ->reconnect timeout expirations. ->proto_err counts the number of protocol errors.	2019-11-19 14:48:28 +01:00
Frédéric Lécaille	33cab3c0eb	MINOR: peers: Add TX/RX heartbeat counters. Add RX/TX heartbeat counters to "peer" struct to have an idea about which peer is alive or not. Dump these counters values on the CLI via "show peers" command.	2019-11-19 14:48:25 +01:00
Frédéric Lécaille	470502b420	MINOR: peers: Always show the table info for disconnected peers. This patch enables us to dump the stick-table information of remote or local peers without already opened peer session. This may be the case also for the local peer during synchronizations with an old process (reload).	2019-11-19 14:48:21 +01:00
Emmanuel Hocdet	c5fdf0f3dc	BUG/MINOR: ssl: fix crt-list neg filter for openssl < 1.1.1 Certificate selection in client_hello_cb (openssl >= 1.1.1) correctly handles crt-list neg filter. Certificate selection for openssl < 1.1.1 has not been touched for a while: crt-list neg filter is not the same as its counterpart and is wrong. Fix it to mimic the same behavior as its counterpart. It should be backported as far as 1.6.	2019-11-18 14:58:27 +01:00
Emmanuel Hocdet	c3775d28f9	BUG/MINOR: ssl: ssl_pkey_info_index ex_data can store a dereferenced pointer With CLI cert update, sni_ctx can be removed at runtime. ssl_pkey_info_index ex_data is filled with one of sni_ctx.kinfo pointer but SSL_CTX can be shared between sni_ctx. Removing and freeing a sni_ctx can lead to a segfault when ssl_pkey_info_index ex_data is used (in ssl_sock_get_pkey_algo). Removing the dependency on ssl_pkey_info_index ex_data is the easiest way to fix the issue.	2019-11-18 14:55:32 +01:00
William Dauchy	f9af9d7f3c	MINOR: init: avoid code duplication while setting identity Since the introduction of mworker, the setuid/setgid was duplicated in two places; try to improve that by creating a dedicated function. This patch does not introduce any functional change. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-11-17 16:55:50 +01:00
William Dauchy	e039f26ba4	BUG/MINOR: init: fix set-dumpable when using uid/gid in mworker mode used with uid/gid settings, it was not possible to get a coredump despite the set-dumpable option. indeed prctl(2) manual page specifies the dumpable attribute is reverted to `/proc/sys/fs/suid_dumpable` in a few conditions such as process effective user and group are changed. this patch moves the whole set-dumpable logic before the polling code in order to catch all possible cases where we could have changed the uid/gid. It however does not cover the possible segfault at startup. this patch should be backported in 2.0. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-11-17 16:55:24 +01:00
Cédric Dufour	0d7712dff0	MINOR: stick-table: allow sc-set-gpt0 to set value from an expression Allow the sc-set-gpt0 action to set GPT0 to a value dynamically evaluated from its <expr> argument (in addition to the existing static <int> alternative).	2019-11-15 18:24:19 +01:00
Willy Tarreau	869efd5eeb	BUG/MINOR: log: make "show startup-log" use a ring buffer instead The copy of the startup logs used to rely on a re-allocated memory area on the fly, that would attempt to be delivered at once over the CLI. But if it's too large (too many warnings) it will take time to start up, and may not even show up on the CLI as it doesn't fit in a buffer. The ring buffer infrastructure solves all this with no more code, let's switch to this instead. It simply requires a parsing function to attach the ring via ring_attach_cli() and all the rest is automatically handled. Initially this was imagined as a code cleanup, until a test with a config involving 100k backends and just one occurrence of "load-server-state-from-file global" in the defaults section took approx 20 minutes to parse due to the O(N^2) cost of concatenating the warnings resulting in ~1 TB of data to be copied, while it took only 0.57s with the ring. Ideally this patch should be backported to 2.0 and 1.9, though it relies on the ring infrastructure which will then also need to be backported. Configs able to trigger the bug are uncommon, so another workaround for older versions without backporting the rings would consist in simply limiting the size of the error message in print_message() to something always printable, which will only return the first errors.	2019-11-15 15:50:16 +01:00
Willy Tarreau	fcf94981e4	MINOR: ring: make the parse function automatically set the handler/release ring_attach_cli() is called by the keyword parsing function to dump a ring to the CLI. It can only work with a specific handler and release function. Let's make it set them appropriately instead of having the caller know these functions. This way adding a command to dump a ring is as simple as declaring a parsing function calling ring_attach_cli().	2019-11-15 15:48:12 +01:00
Christopher Faulet	a63a5c2c65	MINOR: sink: Set the default max length for a message to BUFSIZE It was set to MAX_SYSLOG_LEN (1K). It is a bit short to print debug traces, especially when part of a buffer is dumped. Now, the maximum length is set to BUFSIZE (16K).	2019-11-15 15:10:19 +01:00
Christopher Faulet	466080da0e	MINOR: mux-h1: Set EOI on the conn-stream when EOS is reported in TUNNEL state It could help to distinguish client/server aborts from legitimate shutdowns for reads.	2019-11-15 14:24:06 +01:00
Christopher Faulet	3f21611bdd	BUG/MINOR: mux-h1: Don't set CS_FL_EOS on a read0 when receiving data to pipe This is mandatory to process input one more time to add the EOM in the HTX message and to set CS_FL_EOI on the conn-stream. Otherwise, in the stream, a SHUTR will be reported on the corresponding channel without the EOI. It may be erroneously interpreted as an abort. This patch must be backported to 2.0 and 1.9.	2019-11-15 14:24:06 +01:00
Christopher Faulet	02a0253888	BUG/MINOR: mux-h1: Properly catch parsing errors on payload and trailers Errors during the payload or the trailers parsing are reported with the HTX_FL_PARSING_ERROR flag on the HTX message and not a negative return value. This change was introduced when the functions to convert an H1 message to HTX one were moved to a dedicated file. But the h1 mux was not fully updated accordingly. No backport needed except if the commits about file h1_htx.c are backported.	2019-11-15 14:24:06 +01:00
Christopher Faulet	0d1c2a65e8	MINOR: stats: Report max times in addition to the averages for sessions Now, for the sessions, the maximum times (queue, connect, response, total) are reported in addition to the averages over the last 1024 connections. These values are called qtime_max, ctime_max, rtime_max and ttime_max. This patch is related to #272.	2019-11-15 14:23:54 +01:00
Christopher Faulet	efb41f0d8d	MINOR: counters: Add fields to store the max observed for {q,c,d,t}_time For backends and servers, some average times for last 1024 connections are already calculated. For the moment, the averages for the time passed in the queue, the connect time, the response time (for HTTP session only) and the total time are calculated. Now, in addition, the maximum times observed for these values are also stored. These new counters are cleared as all other max values with the CLI command "clear counters". This patch is related to #272.	2019-11-15 14:23:21 +01:00
Christopher Faulet	b927a9d866	MINOR: stream: Remove the lock on the proxy to update time stats swrate_add() is now thread-safe. So the lock on the proxy is no longer needed to update q_time, c_time, d_time and t_time.	2019-11-15 13:43:08 +01:00
Christopher Faulet	b2e58492b1	MEDIUM: filters: Adapt filters API to allow again TCP filtering on HTX streams This change makes the payload filtering uniform between TCP and HTTP filters. Now, in TCP, like in HTTP, there is only one callback responsible to forward data. Thus, old callbacks, tcp_data() and tcp_forward_data(), are replaced by a single callback function, tcp_payload(). This new callback gets the offset in the payload to (re)start the filtering and the maximum amount of data it can forward. It is the filter's responsibility to be compatible with HTX streams. If not, it must not set the flag FLT_CFG_FL_HTX. Because of this change, nxt and fwd offsets are no longer needed. Thus they are removed from the filter structure with their update functions, flt_change_next_size() and flt_change_forward_size(). Moreover, the trace filter has been updated accordingly. This patch breaks the compatibility with the old API. Thus it should probably not be backported. But, AFAIK, there is no TCP filter, thus the breakage is very limited.	2019-11-15 13:43:08 +01:00
Christopher Faulet	bb9a7e04bd	BUG/MEDIUM: filters: Don't call TCP callbacks for HTX streams For now, TCP callbacks are incompatible with the HTX streams because they are designed to manipulate raw buffers. A new callback will probably be added to be used in both modes, raw and HTX. So, for HTX streams, these callbacks are ignored. This should not be a real problem because there are no known filters, except the trace filter, implementing these callbacks. This patch must be backported to 2.0 and 1.9.	2019-11-15 13:43:08 +01:00
Willy Tarreau	93604edb65	BUG/MEDIUM: listeners: always pause a listener on out-of-resource condition A corner case was opened in the listener_accept() code by commit `3f0d02bbc2` ("MAJOR: listener: do not hold the listener lock in listener_accept()"). The issue is when one listener (or a group of them) managed to eat all the proxy's or all the process's maxconn, and another listener tries to accept a new socket. This results in the atomic increment to detect the excess connection count and immediately abort, without pausing the listener, thus the call is immediately performed again. This doesn't happen when the test is run on a single listener because this listener got limited when crossing the limit. But with 2 or more listeners, we don't have this luxury. The solution consists in limiting the listener as soon as we have to decline accepting an incoming connection. This means that the listener will not be marked full yet if it gets the exact connection count but this is not a problem in practice since all other listeners will only be marked full after their first attempt. Thus from now on, a listener is only full once it has already failed taking an incoming connection. This bug was definitely responsible for the unreproducible occasional reports of high CPU usage showing epoll_wait() returning immediately without accepting an incoming connection, like in bug #129. This fix must be backported to 1.9 and 1.8.	2019-11-15 10:34:51 +01:00
Willy Tarreau	af7ea814f9	CLEANUP: stats: use srv_shutdown_streams() instead of open-coding it The "shutdown sessions" admin-mode command used to open-code the list traversal while there's already a function for this: srv_shutdown_streams(). Better use it.	2019-11-15 07:06:46 +01:00
Willy Tarreau	d9e26a7dd5	CLEANUP: cli: use srv_shutdown_streams() instead of open-coding it The "shutdown session server" command used to open-code the list traversal while there's already a function for this: srv_shutdown_streams(). Better use it.	2019-11-15 07:06:46 +01:00
Willy Tarreau	5de7817ae8	CLEANUP: session: slightly simplify idle connection cleanup logic Since previous commit `a132e5efa9` ("BUG/MEDIUM: Make sure we leave the session list in session_free().") it's pointless to delete the conn element inside "if" blocks given that the second test is always true as well. Let's simplify this with a single LIST_DEL_INIT() before the test.	2019-11-15 07:06:46 +01:00
Olivier Houchard	a132e5efa9	BUG/MEDIUM: Make sure we leave the session list in session_free(). In session_free(), if we're about to destroy a connection that had no mux, make sure we leave the session_list before calling conn_free(). Otherwise, conn_free() would call session_unown_conn(), which would potentially free the associated srv_list, but session_free() also frees it, so that would lead to a double free, and random memory corruption. This should be backported to 1.9 and 2.0.	2019-11-14 19:25:49 +01:00
Willy Tarreau	9ada030697	BUG/MINOR: queue/threads: make the queue unlinking atomic There is a very short race in the queues which happens in the following situation: - stream A on thread 1 is being processed by a server - stream B on thread 2 waits in the backend queue for a server - stream B on thread 2 is fed up with waiting and expires, calls stream_free() which calls pendconn_free(), which sees the stream attached - at the exact same instant, stream A finishes on thread 1, sees one stream is waiting (B), detaches it and wakes it up - stream B continues pendconn_free() and calls pendconn_unlink() - pendconn_unlink() now detaches the node again and performs a second deletion (harmless since idempotent), and decrements srv/px->nbpend again => the number of connections on the proxy or server may reach -1 if/when this race occurs. It is extremely tight as it can only occur during the test on p->leaf_p though it has been witnessed at least once. The solution consists in testing leaf_p again once the lock is held to make sure the element was not removed in the mean time. This should be backported to 2.0 and 1.9, probably even 1.8.	2019-11-14 14:58:39 +01:00
Jerome Magnin	2f44e8843a	BUG/MINOR: stream: init variables when the list is empty We need to call vars_init() when the list is empty otherwise we can't use variables in the response scope. This regression was introduced by `cda7f3f5` (MINOR: stream: don't prune variables if the list is empty). The following config reproduces the issue: defaults mode http frontend in bind :11223 http-request set-var(req.foo) str("foo") if { path /bar } http-request set-header bar %[var(req.foo)] if { var(req.foo) -m found } http-response set-var(res.bar) str("bar") http-response set-header foo %[var(res.bar)] if { var(res.bar) -m found } use_backend out backend out server s1 127.0.0.1:11224 listen back bind :11224 http-request deny deny_status 200 > GET /ba HTTP/1.1 > Host: localhost:11223 > User-Agent: curl/7.66.0 > Accept: / > < HTTP/1.0 200 OK < Cache-Control: no-cache < Content-Type: text/html > GET /bar HTTP/1.1 > Host: localhost:11223 > User-Agent: curl/7.66.0 > Accept: / > < HTTP/1.0 200 OK < Cache-Control: no-cache < Content-Type: text/html < foo: bar This must be backported as far as 1.9.	2019-11-09 18:25:41 +01:00
Baptiste Assmann	f50e1ac444	BUG: dns: timeout resolve not applied for valid resolutions Documentation states that the interval between 2 DNS resolutions is driven by "timeout resolve <time>" directive. From a code point of view, this was applied unless the latest status of the resolution was VALID. In such case, "hold valid" was enforced. This is a bug, because "hold" timers are not here to drive how often we want to trigger a DNS resolution, but more how long we want to keep an information if the status of the resolution itself has changed. This avoids flapping and prevents shutting down an entire backend when a DNS server is not answering. This issue was reported by hamshiva in github issue #345. Backport status: 1.8	2019-11-07 18:50:07 +01:00
Baptiste Assmann	7264dfe949	BUG/MINOR: action: do-resolve now uses cached response As reported by David Birdsong on the ML, the HTTP action do-resolve does not use the DNS cache. Actually, the action is "registered" to the resolution for said name to be resolved and waits until another requester triggers it. Once the resolution is finished, then the action is updated with the result. To trigger this, you must have a server with runtime DNS resolution enabled and run a do-resolve action with the same fqdn AND they use the same resolvers section. This patch fixes this behavior by ensuring the resolution associated to the action has a valid answer which is not considered as expired. If those conditions are valid, then we can use it (it's the "cache"). Backport status: 2.0	2019-11-07 18:46:55 +01:00
Christopher Faulet	fee726ffa7	MINOR: http-ana: Remove the unused function http_reset_txn() Since the legacy HTTP mode was removed, the stream is always released at the end of each HTTP transaction and a new one is created to handle the next request for keep-alive connections. So the HTTP transaction is no longer reset and the function http_reset_txn() can be removed.	2019-11-07 15:32:52 +01:00
Christopher Faulet	5939925a38	BUG/MEDIUM: stream: Be sure to release allocated captures for TCP streams All TCP and HTTP captures are stored in 2 arrays, one for the request and another for the response. In HAProxy 1.5, these arrays are part of the HTTP transaction and thus are released during its cleanup. Because in this version, the transaction is part of the stream (in 1.5, streams are still called sessions), the cleanup is always performed, for HTTP and TCP streams. In HAProxy 1.6, the HTTP transaction was moved out from the stream and is now dynamically allocated only when required (because of an HTTP proxy or an HTTP sample fetch). In addition, still in 1.6, the captures arrays were moved from the HTTP transaction to the stream. This way, it is still possible to capture elements from TCP rules for a full TCP stream. Unfortunately, the release is still exclusively performed during the HTTP transaction cleanup. Thus, for a TCP stream where the HTTP transaction is not required, the TCP captures, if any, are never released. Now, all captures are released when the stream is freed. This fixes the memory leak for TCP streams. For streams with an HTTP transaction, the captures are now released when the transaction is reset and not systematically during its cleanup. This patch must be backported as far as 1.6.	2019-11-07 15:32:52 +01:00
Christopher Faulet	eea8fc737b	MEDIUM: stream/trace: Register a new trace source with its events Runtime traces are now supported for the streams, only if compiled with debug. process_stream() is covered as well as TCP/HTTP analyzers and filters. In traces, the first argument is always a stream. So it is easy to get the info about the channels and the stream-interfaces. The second argument, when defined, is always a HTTP transaction. And the third one is an HTTP message. The trace message is adapted to report HTTP info when possible.	2019-11-06 10:14:32 +01:00
Christopher Faulet	a3ed271ed4	MINOR: flt_trace: Rename macros to print trace messages Names of these macros may enter in conflict with the macros of the runtime tracing mechanism. So the prefix "FLT_" has been added to avoid any ambiguities.	2019-11-06 10:14:32 +01:00
Christopher Faulet	276c1e0533	BUG/MEDIUM: stream: Be sure to support splicing at the mux level to enable it Despite the addition of the mux layer, no change has been made on how to enable the TCP splicing on process_stream(). We still check if transport layer on both sides support the splicing, but we don't check the muxes support. So it is possible to start to splice data with an unencrypted H2 connection on a side and an H1 connection on the other. This leads to a freeze of the stream until a client or server timeout is reached. This patch fixes a part of the issue #356. It must be backported as far as 1.8.	2019-11-06 10:14:32 +01:00
Christopher Faulet	9fa40c46df	BUG/MEDIUM: mux-h1: Disable splicing for chunked messages The mux H1 announces the support of the TCP splicing. It only works for payload data. It works for messages with an explicit content-length or for tunnelled data. For chunked messages, the mux H1 should normally not try to xfer more than the current chunk through the pipe. Unfortunately, this works on the read side but the send is completely bogus. During the output formatting, the announced size of chunks does not handle the size that will be spliced. Because there is no formatting when spliced data are sent, the produced message is malformed and rejected by the peer. For now, because it is quick and simple, the TCP splicing is disabled for chunked messages. I will try to enable it again in a proper way. I don't know for now if it will be backportable in previous versions. This will depend on the amount of changes required to handle it. This patch fixes a part of the issue #356. It must be backported to 2.0 and 1.9.	2019-11-06 10:14:27 +01:00
Frédéric Lécaille	b6f759b43d	MINOR: peers: Add "log" directive to "peers" section. This patch is easy to review: let's call parse_logsrv() function to parse "log" directive as is already done for other sections for proxies. This enables us to log incoming TCP connections for the listeners for "peers" sections. Update the documentation for "peers" section.	2019-11-06 04:49:56 +01:00
William Lallemand	21724f0807	MINOR: ssl/cli: replace the default_ctx during 'commit ssl cert' If the SSL_CTX of a previous instance (ckch_inst) was used as a default_ctx, replace the default_ctx of the bind_conf by the first SSL_CTX inserted in the SNI tree. Use the RWLOCK of the sni tree to handle the change of the default_ctx.	2019-11-04 18:16:53 +01:00
William Lallemand	3246d9466a	BUG/MINOR: ssl/cli: fix an error when a file is not found When trying to update a certificate <file>.{rsa,ecdsa,dsa}, but this one does not exist and if <file> was used as a regular file in the configuration, the error was ambiguous. Correct it so we can return a certificate not found error.	2019-11-04 14:11:41 +01:00
William Lallemand	37031b85ca	BUG/MINOR: ssl/cli: unable to update a certificate without bundle extension Commit `bc6ca7c` ("MINOR: ssl/cli: rework 'set ssl cert' as 'set/commit'") broke the ability to commit a unique certificate which does not use a bundle extension .{rsa,ecdsa,dsa}.	2019-11-04 14:11:41 +01:00
William Lallemand	8a7fdf036b	BUG/MEDIUM: ssl/cli: don't alloc path when cert not found When doing an 'ssl set cert' with a certificate which does not exist in configuration, the appctx->ctx.ssl.old_ckchs->path was duplicated while app->ctx.ssl.old_ckchs was NULL, resulting in a NULL dereference. Move the code so the 'not referenced' error is done before this.	2019-11-04 11:22:33 +01:00
vkill	1dfd16536f	MINOR: backend: Add srv_name sample fetch The sample fetch can get srv_name without iterating over `core.backends["bk"].servers`. Then we can get Server class quickly via `core.backends[txn.f:be_name()].servers[txn.f:srv_name()]`. Issue#342	2019-11-01 05:40:24 +01:00
Emmanuel Hocdet	40f2f1e341	BUG/MEDIUM: ssl/cli: fix dot search in cli_parse_set_cert During a 'set ssl cert', the result of the strrchr was wrongly tested and can lead to a segfault when the certificate path did not contain a dot.	2019-10-31 17:32:06 +01:00
Emmanuel Hocdet	eaad5cc2d8	MINOR: ssl: BoringSSL ocsp_response does not need issuer HAProxy can fail when issuer is not found, it must not with BoringSSL.	2019-10-31 17:24:16 +01:00
Emmanuel Hocdet	83cbd3c89f	BUG/MINOR: ssl: double free on error for ckch->{key,cert} On last error in ssl_sock_load_pem_into_ckch, key/cert are released and ckch->{key,cert} are released in ssl_sock_free_cert_key_and_chain_contents.	2019-10-31 16:56:51 +01:00
Emmanuel Hocdet	ed17f47c71	BUG/MINOR: ssl: ckch->chain must be initialized It's a regression from `96a9c973` "MINOR: ssl: split ssl_sock_load_crt_file_into_ckch()".	2019-10-31 16:53:28 +01:00
Emmanuel Hocdet	f6ac4fa745	BUG/MINOR: ssl: segfault in cli_parse_set_cert with old openssl/boringssl Fix `541a534` ("BUG/MINOR: ssl/cli: fix build of SCTL and OCSP") was not enough. [wla: It will probably be better later to put the #ifdef in the functions so they can return an error if they are not implemented]	2019-10-31 16:21:06 +01:00
Willy Tarreau	1eb3b4828e	BUG/MINOR: stats: properly check the path and not the whole URI Since we now have full URIs with h2, stats may fail to work over H2 so we must carefully only check the path there if the stats URI was passed with a path only. This way it remains possible to intercept proxy requests to report stats on explicit domains but it continues to work as expected on origin requests. No backport needed.	2019-10-31 15:52:14 +01:00
Willy Tarreau	cab2295ae7	BUG/MEDIUM: mux-h2: immediately report connection errors on streams In case a stream tries to send on a connection error, we must report the error so that the stream interface keeps the data available and may safely retry on another connection. Till now this would happen only before the connection was established, not in case of a failed handshake or an early GOAWAY for example. This should be backported to 2.0 and 1.9.	2019-10-31 15:48:18 +01:00
Willy Tarreau	4481e26e5d	BUG/MEDIUM: mux-h2: immediately remove a failed connection from the idle list If a connection faces an error or a timeout, it must be removed from its idle list ASAP. We certainly don't want to risk sending new streams on it. This should be backported to 2.0 (replacing MT_LIST_DEL with LIST_DEL_LOCKED) and 1.9 (there's no lock there, the idle lists are per-thread and per-server however a LIST_DEL_INIT will be needed).	2019-10-31 15:39:27 +01:00
Willy Tarreau	c61966f9b4	BUG/MEDIUM: mux-h2: report no available stream on a connection having errors If an H2 mux has met an error, we must not report available streams anymore, or it risks to accumulate new streams while not being able to process them. This should be backported to 2.0 and 1.9.	2019-10-31 15:10:03 +01:00
William Lallemand	33cc76f918	BUG/MINOR: ssl/cli: check trash allocation in cli_io_handler_commit_cert() Possible NULL pointer dereference found by coverity. Fix #350 #340.	2019-10-31 11:48:01 +01:00
Damien Claisse	ae6f125c7b	MINOR: sample: add us/ms support to date/http_date It can be sometimes interesting to have a timestamp with a resolution of less than a second. It is currently painful to obtain this, because concatenation of date and date_us leads to a shorter timestamp during first 100ms of a second, which is not parseable and needs ugly ACLs in configuration to prepend 0s when needed. To improve this, add an optional <unit> parameter to date sample to report an integer with desired unit. Also support this unit in http_date converter to report a date string with sub-second precision.	2019-10-31 08:47:31 +01:00
Joao Morais	e1583751b6	BUG/MINOR: config: Update cookie domain warn to RFC6265 The domain option of the cookie keyword allows to define which domain or domains should use the cookie value of a cookie-based server affinity. If the domain does not start with a dot, the user agent should only use the cookie on hosts that match the provided domains. If the configured domain starts with a dot, the user agent can use the cookie with any host ending with the configured domain. haproxy config parser helps the admin warning about a potentially buggy config: defining a domain without an embedded dot which does not start with a dot, which is forbidden by the RFC. The current condition to issue the warning implements RFC2109. This change updates the implementation to RFC6265 which allows domain without a leading dot. Should be backported to all supported versions. The feature exists at least since 1.5.	2019-10-31 06:06:52 +01:00
William Lallemand	beea2a476e	CLEANUP: ssl/cli: remove leftovers of bundle/certs (it < 2) Remove the leftovers of the certificate + bundle updating in 'ssl set cert' and 'commit ssl cert'. * Remove the it variable in appctx.ctx.ssl. * Stop doing everything twice. * Indent	2019-10-30 17:52:34 +01:00
William Lallemand	bc6ca7ccaa	MINOR: ssl/cli: rework 'set ssl cert' as 'set/commit' This patch splits the 'set ssl cert' CLI command into 2 commands. The previous way of updating the certificate on the CLI was limited with the bundles. It was only able to apply one of the three parts of the certificate during an update, which meant that we needed 3 updates to update a full 3 certs bundle. It was also not possible to apply atomically several parts of a certificate with the ability to rollback on error. (For example applying a .pem, then a .ocsp, then a .sctl) The command 'set ssl cert' will now duplicate the certificate (or bundle) and update it in a temporary transaction. The second command 'commit ssl cert' will commit all the changes made during the transaction for the certificate. This commit breaks the ability to update a certificate which was used as a unique file and as a bundle in the HAProxy configuration. This way of using the certificates wasn't making any sense. Example: // For a bundle: $ echo -e "set ssl cert localhost.pem.rsa <<\n$(cat kikyo.pem.rsa)\n" | socat /tmp/sock1 - Transaction created for certificate localhost.pem! $ echo -e "set ssl cert localhost.pem.dsa <<\n$(cat kikyo.pem.dsa)\n" | socat /tmp/sock1 - Transaction updated for certificate localhost.pem! $ echo -e "set ssl cert localhost.pem.ecdsa <<\n$(cat kikyo.pem.ecdsa)\n" | socat /tmp/sock1 - Transaction updated for certificate localhost.pem! $ echo "commit ssl cert localhost.pem" | socat /tmp/sock1 - Committing localhost.pem. Success!	2019-10-30 17:01:07 +01:00
William Dauchy	0fec3ab7bf	MINOR: init: always fail when setrlimit fails This patch introduces a strict-limits parameter which enforces the setrlimit setting instead of a warning. This option can be forcibly disabled with the "no" keyword. The general aim of this patch is to avoid bad surprises on a production environment where you change the maxconn for example, a new fd limit is calculated, but cannot be set because of sysfs setting. In that case you might want to have an explicit failure to be aware of it before seeing your traffic going down. During a global rollout it is also useful to explicitly fail as most progressive rollout would simply check the general health check of the process. As discussed, plan to use the strict by default mode starting from v2.3. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-10-29 17:42:27 +01:00
William Dauchy	ec73098171	MINOR: config: allow no set-dumpable config option in global config parsing, we currently expect to have a possible no keyword (KWN_NO), but we never allow it in config parsing. another patch could have been to simply remove the code handling a possible KWN_NO. take this opportunity to update documentation of set-dumpable. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-10-29 17:42:27 +01:00
Olivier Houchard	e8f5f5d8b2	BUG/MEDIUM: servers: Only set SF_SRV_REUSED if the connection is fully ready. In connect_server(), if we're reusing a connection, only use SF_SRV_REUSED if the connection is fully ready. We may be using a multiplexed connection created by another stream that is not yet ready, and may fail. If we set SF_SRV_REUSED, process_stream() will then not wait for the timeout to expire, and retry to connect immediately. This should be backported to 1.9 and 2.0. This commit depends on 55234e33708c5a584fb9efea81d71ac47235d518.	2019-10-29 14:15:20 +01:00
Olivier Houchard	9b8e11e691	MINOR: mux: Add a new method to get information about a mux. Add a new method, ctl(), to muxes. It uses a "enum mux_ctl_type" to let it know which information we're asking for, and can output it either directly by returning the expected value, or by using an optional "output" argument. Right now, the only known mux_ctl_type is MUX_STATUS, that will return 0 if the mux is not ready, or MUX_STATUS_READY if the mux is ready. We probably want to backport this to 1.9 and 2.0.	2019-10-29 14:15:20 +01:00
Willy Tarreau	20020ae804	MINOR: chunk: add chunk_istcat() to concatenate an ist after a chunk We previously relied on chunk_cat(dst, b_fromist(src)) for this but it is not reliable as the allocated buffer is inside the expression and may be on a temporary stack. While it's possible to allocate stack space for a struct and return a pointer to it, it's not possible to initialize it from a temporary variable to prevent arguments from being evaluated multiple times. Since this is only used to append an ist after a chunk, let's instead have a chunk_istcat() function to perform exactly this from a native ist. The only call place (URI computation in the cache) was updated.	2019-10-29 13:09:14 +01:00
Willy Tarreau	0580052bb6	BUILD/MINOR: ssl: shut up a build warning about format truncation Actually gcc believes it has detected a possible truncation but it cannot since the output string is necessarily at least one char shorter than what it expects. However addressing it is easy and removes the need for an intermediate copy so let's do it.	2019-10-29 10:50:22 +01:00
Willy Tarreau	4fd6d671b2	BUG/MINOR: spoe: fix off-by-one length in UUID format string The per-thread UUID string produced by generate_pseudo_uuid() could be off by one character due to too small of size limit in snprintf(). In practice the UUID remains large enough to avoid any collision though. This should be backported to 2.0 and 1.9.	2019-10-29 10:33:13 +01:00
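The off-by-one described in the entry above can be illustrated with a minimal sketch. It is not HAProxy's generate_pseudo_uuid(): the helper name format_uuid(), its parameters and the UUID_TEXT_LEN macro are hypothetical, and the sketch only assumes the usual 8-4-4-4-12 textual layout (36 characters). Since snprintf() counts the trailing NUL in its size limit, a limit of 36 silently drops the last character while 37 keeps it.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>

/* Textual UUID: 8-4-4-4-12 hex digits plus 4 dashes = 36 characters. */
#define UUID_TEXT_LEN 36

/*
 * Hypothetical helper (not HAProxy code): formats the given fields as a
 * UUID string into <dst>, writing at most <size> bytes including the
 * trailing NUL. Returns the length the complete string requires, as
 * snprintf() does, so a return value >= <size> means truncation.
 */
int format_uuid(char *dst, size_t size, uint32_t a, uint16_t b,
                uint16_t c, uint16_t d, uint64_t e)
{
    assert(dst != NULL);
    return snprintf(dst, size, "%8.8x-%4.4x-%4.4x-%4.4x-%12.12llx",
                    (unsigned int)a, (unsigned int)b, (unsigned int)c,
                    (unsigned int)d,
                    /* keep only 48 bits so that 12 hex digits suffice */
                    (unsigned long long)(e & 0xFFFFFFFFFFFFULL));
}
```

A caller would size its buffer as UUID_TEXT_LEN + 1 and pass sizeof(buf) as the limit; passing UUID_TEXT_LEN instead loses the final hex digit.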
Willy Tarreau	e112c8a64b	BUILD/MINOR: tools: shut up the format truncation warning in get_gmt_offset() The gcc warning about format truncation in get_gmt_offset() is annoying since we always call it with a valid time thus it cannot fail. However it's true that nothing guarantees that future code will not reuse this function incorrectly, so better enforce the modulus on one day and shut the warning.	2019-10-29 10:19:34 +01:00
William Lallemand	430413e285	MINOR: ssl/cli: rework the 'set ssl cert' IO handler Rework the 'set ssl cert' IO handler so it is clearer. Use its own SETCERT_ST_* states instead of the STAT_ST ones. Use an inner loop in SETCERT_ST_GEN and SETCERT_ST_INSERT to do the work for both the certificate and the bundle. The io_release() is now called only when the CKCH spinlock is taken so we can unlock during a release without any condition.	2019-10-28 14:57:37 +01:00
William Lallemand	1212db417b	BUG/MINOR: ssl/cli: cleanup on cli_parse_set_cert error Since commit `90b098c` ("BUG/MINOR: cli: don't call the kw->io_release if kw->parse failed"), the io_release() callback is not called anymore when the parse() failed. Call it directly on the error path of the cli_parse_set_cert() function.	2019-10-28 14:57:37 +01:00
Christopher Faulet	04400bc787	BUG/MAJOR: stream-int: Don't receive data from mux until SI_ST_EST is reached This bug is pretty pernicious and has serious consequences: in 2.1, an infinite loop in process_stream() because the backend stream-interface remains in the ready state (SI_ST_RDY); in 2.0, repeated calls to process_stream() because the stream-interface remains blocked in the connect state (SI_ST_CON). In both cases, it happens after a connection retry attempt. In 1.9, it does not seem to happen, but that may be just by chance or because it is harder to get the right conditions to trigger the bug. However, reading the code, the bug seems to exist there too. Here is how the bug happens in 2.1. When we try to establish a new connection to a server, the corresponding stream-interface is first set to the connect state (SI_ST_CON). When the underlying connection is known to be connected (the flag CO_FL_CONNECTED set), the stream-interface is switched to the ready state (SI_ST_RDY). It is a transient state between the connect state (SI_ST_CON) and the established state (SI_ST_EST). It must be handled on the next call to process_stream(), which is responsible for operating the transition. During all this time, errors can occur: a connection error or a client abort. The transient state SI_ST_RDY was introduced to give process_stream() a chance to catch these errors before considering the connection as fully established. Unfortunately, if a read0 is caught in states SI_ST_CON or SI_ST_RDY, it is possible to have a shutdown without transition to SI_ST_DIS (in fact, here, SI_ST_CON is switched to SI_ST_RDY). This happens if the request was fully received and analyzed. In this case, the flag SI_FL_NOHALF is set on the backend stream-interface. If an error is also reported during the connect, the behavior is undefined because an error is returned to the client and a connection retry is performed.
So on the next connection attempt to the server, if another error is reported, a client abort is detected. But the shutdown for writes was already done. So the transition to the state SI_ST_DIS is impossible. We stay in the state SI_ST_RDY. Because it is a transient state, we loop in process_stream() to perform the transition. It is hard to understand how the bug happens reading the code and even harder to explain. But there is a trivial way to hit the bug by sending h2 requests to a server only speaking h1. For instance, with the following config : listen tst bind *:80 server www 127.0.0.1:8000 proto h2 # in reality, it is a HTTP/1.1 server It is a configuration error, but it is an easy way to observe the bug. Note it may happen with a valid configuration. So, after a careful analysis, it appears that si_cs_recv() should never be called for a not fully established stream-interface. This way the connection retries will be performed before reporting an error to the client. Thus, if a shutdown is performed because a read0 is handled, the stream-interface is unconditionally set to the transient state SI_ST_DIS. This patch must be backported to 2.0 and 1.9. However on these versions, this patch reveals a design flaw about connections and a bad way to perform the connection retries. We are working on it.	2019-10-26 08:24:45 +02:00
Christopher Faulet	69fe5cea21	BUG/MINOR: mux-h2: Don't pretend mux buffers aren't full anymore if nothing sent In h2_send(), when something is sent, we remove the flags (H2_CF_MUX_MFULL\|H2_CF_DEM_MROOM) on the h2 connection. This way, we are able to wake up all streams waiting to send data. Unfortunately, these flags are unconditionally removed, even when nothing was sent. So if the h2c is blocked because the mux buffers are full and we are unable to send anything, all streams in the send_list are woken up for nothing. Now, we only remove these flags if at least a send succeeds. This patch must be backported to 2.0.	2019-10-26 08:24:45 +02:00
William Lallemand	90b098c921	BUG/MINOR: cli: don't call the kw->io_release if kw->parse failed The io_release() callback of the cli_kw is supposed to be used to clean what an io_handler() has made. It is called once the work in the IO handler is finished, or when the connection was aborted by the client. This patch fixes a bug where the io_release callback was called even when the parse() callback failed, which means that the io_release() could be called even if the io_handler() was not called. Should be backported to every version that has a cli_kw->release(). (as far as 1.7)	2019-10-25 22:00:49 +02:00
Willy Tarreau	b2fee0406d	BUG/MEDIUM: debug: address a possible null pointer dereference in "debug dev stream" As reported in issue #343, there is one case where a NULL stream can still be dereferenced, when getting &s->txn->flags. Let's protect all assignments to stay on the safe side for future additions. No backport is needed.	2019-10-25 10:10:07 +02:00
Willy Tarreau	9b013701f1	MINOR: stats/debug: maintain a counter of debug commands issued Debug commands will usually mark the fate of the process. We'd rather have them counted and visible in a core or in stats output than trying to guess how a flag combination could happen. The counter is only incremented when the command is about to be issued however, so that failed attempts are ignored.	2019-10-24 18:38:00 +02:00
Willy Tarreau	b24ab22ac0	MINOR: debug: make most debug CLI commands accessible in expert mode Instead of relying on DEBUG_DEV for most debugging commands, which is limiting, let's condition them on expert mode. Only one ("debug dev exec") remains conditioned on DEBUG_DEV because it can have a security implication on the system. The commands are not listed unless "expert-mode on" was first entered on the CLI : > expert-mode on > help debug dev close <fd> : close this file descriptor debug dev delay [ms] : sleep this long debug dev exec [cmd] ... : show this command's output debug dev exit [code] : immediately exit the process debug dev hex <addr> [len]: dump a memory area debug dev log [msg] ... : send this msg to global logs debug dev loop [ms] : loop this long debug dev panic : immediately trigger a panic debug dev stream ... : show/manipulate stream flags debug dev tkill [thr] [sig] : send signal to thread > debug dev stream Usage: debug dev stream { <obj> <op> <value> \| wake }* <obj> = {strm \| strm.f \| sif.f \| sif.s \| sif.x \| sib.f \| sib.s \| sib.x \| txn.f \| req.f \| req.r \| req.w \| res.f \| res.r \| res.w} <op> = {'' (show) \| '=' (assign) \| '^' (xor) \| '+' (or) \| '-' (andnot)} <value> = 'now' \| 64-bit dec/hex integer (0x prefix supported) 'wake' wakes the stream assigned to 'strm' (default: current)	2019-10-24 18:38:00 +02:00
Willy Tarreau	abb9f9b057	MINOR: cli: add an expert mode to hide dangerous commands Some commands like the debug ones are not enabled by default but can be useful on some production environments. In order to avoid the temptation of using them incorrectly, let's introduce an "expert" mode for a CLI connection, which allows some commands to appear and be used. It is enabled by command "expert-mode on" which is not listed by default.	2019-10-24 18:38:00 +02:00
Willy Tarreau	2b5520da47	MINOR: cli/debug: validate addresses using may_access() in "debug dev stream" This function adds some control by verifying that the target address is really readable. It will not protect against writing to wrong places, but will at least protect against a large number of mistakes such as incorrectly copy-pasted addresses.	2019-10-24 18:38:00 +02:00
Willy Tarreau	68680bb14e	MINOR: debug: add a new "debug dev stream" command This new "debug dev stream" command allows to manipulate flags, timeouts, states for streams, channels and stream interfaces, as well as waking a stream up. These may be used to help reproduce certain bugs during development. The operations are performed to the stream assigned by "strm" which defaults to the CLI's stream. This stream pointer can be chosen from one of those reported in "show sess". Example: socat - /tmp/sock1 <<< "debug dev stream strm=0x1555b80 req.f=-1 req.r=now wake"	2019-10-24 10:43:04 +02:00
William Dauchy	b705b4d7d3	MINOR: tcp: avoid confusion in time parsing init We never enter val_fc_time_value when an associated fetcher such as `fc_rtt` is called without argument, meaning `type == ARGT_STOP` will never be true and so the default `data.sint = TIME_UNIT_MS` will never be set. Remove this part to avoid thinking default data.sint is set to ms while reading the code. Signed-off-by: William Dauchy <w.dauchy@criteo.com> [Cf: This patch may safely be backported as far as 1.7. But no matter if not.]	2019-10-24 10:25:00 +02:00
William Lallemand	f29cdefccd	BUG/MINOR: ssl/cli: out of bounds when built without ocsp/sctl Commit `541a534` ("BUG/MINOR: ssl/cli: fix build of SCTL and OCSP") introduced a bug in which we iterate outside the array during a 'set ssl cert' if we didn't build with the ocsp or sctl.	2019-10-23 15:05:00 +02:00
William Lallemand	541a534c9f	BUG/MINOR: ssl/cli: fix build of SCTL and OCSP Fix the build issue of SCTL and OCSP for boring/libressl introduced by `44b3532` ("MINOR: ssl/cli: update ocsp/issuer/sctl file from the CLI")	2019-10-23 14:47:16 +02:00
William Lallemand	8f840d7e55	MEDIUM: cli/ssl: handle the creation of SSL_CTX in an IO handler To avoid affecting the traffic too much during a certificate update, create the SNIs in an IO handler which yields every 10 ckch instances. This way haproxy continues to respond even if we try to update a certificate which has 50 000 instances.	2019-10-23 11:54:51 +02:00
William Lallemand	0c3b7d9e1c	MINOR: ssl/cli: assign a new ckch_store When updating a certificate from the CLI, it is not possible to revert some of the changes if part of the certificate update failed. We now create a copy of the ckch_store for the changes so we can revert back if something goes wrong. Even if the ckch_store was affected before this change, it wasn't affecting the SSL_CTXs used for the traffic. It was only a problem if we try to update a certificate after we failed to do it the first time. The new ckch_store is also linked to the new sni_ctxs so it's easy to insert the sni_ctxs before removing the old ones.	2019-10-23 11:54:51 +02:00
William Lallemand	8c1cddef6d	MINOR: ssl: new functions duplicate and free a ckch_store ckchs_dup() allocates a new ckch_store and copies the content of its source. ckchs_free() frees a ckch_store and its content.	2019-10-23 11:54:51 +02:00
William Lallemand	8d0f893222	MINOR: ssl: copy a ckch from src to dst ssl_sock_copy_cert_key_and_chain() copies the content of a <src> cert_key_and_chain to a <dst>. It increases the refcount of every SSL structure (X509, DH, private key..) and allocates new buffers for the other fields.	2019-10-23 11:54:51 +02:00
William Lallemand	455af50fac	MINOR: ssl: update ssl_sock_free_cert_key_and_chain_contents The struct cert_key_and_chain now contains the DH, the sctl and the ocsp_response. Free them.	2019-10-23 11:54:51 +02:00
William Lallemand	44b3532250	MINOR: ssl/cli: update ocsp/issuer/sctl file from the CLI It is now possible to update new parts of a CKCH from the CLI. Currently you will be able to update a PEM (by default), an OCSP response in base64, an issuer file, and a SCTL file. Each update will create a new CKCH and new sni_ctx structure so we will need a "commit" command later to apply several changes and create the sni_ctx only once.	2019-10-23 11:54:51 +02:00
William Lallemand	849eed6b25	BUG/MINOR: ssl/cli: fix looking up for a bundle If we want a bundle but we didn't find a bundle, we shouldn't try to apply the changes.	2019-10-23 11:54:51 +02:00
William Lallemand	96a9c97369	MINOR: ssl: split ssl_sock_load_crt_file_into_ckch() Split the ssl_sock_load_crt_file_into_ckch() in two functions: - ssl_sock_load_files_into_ckch() which is dedicated to opening every file related to a filename during the configuration parsing (PEM, sctl, ocsp, issuer etc) - ssl_sock_load_pem_into_ckch() which is dedicated to opening a PEM, either in a file or a buffer	2019-10-23 11:54:51 +02:00
William Lallemand	f9568fcd79	MINOR: ssl: load issuer from file or from buffer ssl_sock_load_issuer_file_into_ckch() is a new function which is able to load an issuer from a buffer or from a file to a CKCH. Use this function directly in ssl_sock_load_crt_file_into_ckch()	2019-10-23 11:54:51 +02:00
William Lallemand	0dfae6c315	MINOR: ssl: load sctl from buf OR from a file The ssl_sock_load_sctl_from_file() function was modified to fill directly a struct cert_key_and_chain. The function prototype was normalized in order to be used with the CLI payload parser. This function either reads text from a buffer or reads a file on the filesystem. It fills the ocsp_response buffer of the struct cert_key_and_chain.	2019-10-23 11:54:51 +02:00
William Lallemand	3b5f360744	MINOR: ssl: OCSP functions can load from file or buffer The ssl_sock_load_ocsp_response_from_file() function was modified to fill directly a struct cert_key_and_chain. The function prototype was normalized in order to be used with the CLI payload parser. This function either reads a base64 from a buffer or reads a binary file on the filesystem. It fills the ocsp_response buffer of the struct cert_key_and_chain.	2019-10-23 11:54:51 +02:00
William Lallemand	02010478e9	CLEANUP: ssl: fix SNI/CKCH lock labels The CKCH and the SNI locks originally used the same label, we split them but we forgot to change some of them.	2019-10-23 11:54:51 +02:00
William Lallemand	34779c34fc	CLEANUP: ssl: remove old TODO commentary Remove an old commentary above ckch_inst_new_load_multi_store(). This function does not do filesystem syscalls anymore.	2019-10-23 11:54:51 +02:00
Willy Tarreau	9364a5fda3	BUG/MINOR: mux-h2: do not emit logs on backend connections The logs were added to the H2 mux so that we can report logs in case of errors that prevent a stream from being created, but as a side effect these logs are emitted twice for backend connections: once by the H2 mux itself and another time by the upper layer stream. It can even happen more with connection retries. This patch makes sure we do not emit logs for backend connections. It should be backported to 2.0 and 1.9.	2019-10-23 11:12:22 +02:00
Willy Tarreau	403bfbb130	BUG/MEDIUM: pattern: make the pattern LRU cache thread-local and lockless As reported in issue #335, a lot of contention happens on the PATLRU lock when performing expensive regex lookups. This is absurd since the purpose of the LRU cache was to have a fast cache for expressions, thus the cache must not be shared between threads and must remain lockless. This commit makes the LRU cache thread-local and gets rid of the PATLRU lock. A test with 7 threads on 4 cores climbed from 67kH/s to 369kH/s, or a scalability factor of 5.5. Given the huge performance difference and the regression caused to users migrating from processes to threads, this should be backported at least to 2.0. Thanks to Brian Diekelman for his detailed report about this regression.	2019-10-23 07:27:25 +02:00
Willy Tarreau	28c63c15f5	BUG/MINOR: stick-table: fix an incorrect 32 to 64 bit key conversion As reported in issue #331, the code used to cast a 32-bit to a 64-bit stick-table key is wrong. It only copies the 32 lower bits in place on little endian machines or overwrites the 32 higher ones on big endian machines. It ought to simply remove the wrong cast dereference. This bug was introduced when changing stick table keys to samples in 1.6-dev4 by commit `bc8c404449` ("MAJOR: stick-tables: use sample types in place of dedicated types") so the fix must be backported as far as 1.6.	2019-10-23 06:24:58 +02:00
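The endianness effect described in the entry above can be reproduced with a small sketch. This is not the HAProxy code: both function names are hypothetical, and the faulty store through a 32-bit pointer is simulated with memcpy() so that the example stays well defined. The first function widens a key by plain assignment, the second writes the 32 low bits over the first four bytes of the 64-bit slot, which leaves the value untouched on little endian and clobbers the upper half on big endian.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Correct widening: keep the 32 low bits and zero-extend them to 64 bits. */
uint64_t widen_key(uint64_t slot)
{
    return (uint32_t)slot;
}

/*
 * Simulation of the faulty pattern (hypothetical, for illustration only):
 * the 32-bit value is stored at the start of the 64-bit slot instead of
 * being assigned to the whole slot.
 */
uint64_t widen_key_in_place(uint64_t slot)
{
    uint32_t low = (uint32_t)slot;

    /* the illustration assumes a 64-bit slot holding a 32-bit key */
    assert(sizeof(low) * 2 == sizeof(slot));
    memcpy(&slot, &low, sizeof(low)); /* only touches the first 4 bytes */
    return slot;
}
```

With a slot whose upper half is not zero, widen_key() returns the clean 32-bit key, while widen_key_in_place() returns a value that still carries stale or misplaced upper bits on either byte order.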
Emeric Brun	eb46965bbb	BUG/MINOR: ssl: fix memcpy overlap without consequences. A trick is used to set SESSION_ID, and SESSION_ID_CONTEXT lengths to 0 and avoid ASN1 encoding of these values. There is no specific function to set the length of those parameters to 0 so we fake this by calling these functions with a different value, using the same buffer but a length of zero. But those functions don't seem to check for a zero length before performing a memcpy of length zero with src and dst buf on the same pointer, causing valgrind to bark. So the code was reworked to pass them different pointers even if the buffer content is unused. Second, when resetting the value, a memcpy overlap happened on the SESSION_ID_CONTEXT. It was reworked and this is now reset using the constant global value SHCTX_APPNAME which is a different pointer with the same content. This patch should be backported in every version since ssl support was added to haproxy if we want valgrind to shut up. This is tracked in github issue #56.	2019-10-22 18:57:45 +02:00
Baptiste Assmann	25e6fc2030	BUG/MINOR: dns: allow srv record weight set to 0 Processing of SRV record weight was inaccurate and when a SRV record's weight was set to 0, HAProxy enforced it to '1'. This patch aims at fixing this without breaking compatibility with previous behavior. Backport status: 1.8 to 2.0	2019-10-22 13:44:12 +02:00
Vedran Furac	5d48627aba	BUG/MINOR: server: check return value of fopen() in apply_server_state() fopen() can return NULL when state file is missing. This patch adds a check of fopen() return value so we can skip processing in such case. No backport needed.	2019-10-21 16:00:24 +02:00
Tim Duesterhus	4381d26edc	BUG/MINOR: sample: Make the `field` converter compatible with `-m found` Previously an expression like: path,field(2,/) -m found always returned `true`. Bug exists since the `field` converter exists. That is: `f399b0debf` The fix should be backported to 1.6+.	2019-10-21 15:49:42 +02:00
William Lallemand	d1d1e22945	BUG/MINOR: cache: alloc shctx after check config When running haproxy -c, the cache parser is trying to allocate the size of the cache. This can be a problem in an environment where the RAM is limited. This patch moves the cache allocation in the post_check callback which is not executed during a -c. This patch may be backported at least to 2.0 and 1.9. In 1.9, the callbacks registration mechanism is not the same. So the patch will have to be adapted. No need to backport it to 1.8, the code is probably too different.	2019-10-21 15:05:46 +02:00
Christopher Faulet	a9fa88a1ea	BUG/MINOR: stick-table: Never exceed (MAX_SESS_STKCTR-1) when fetching a stkctr When a stick counter is fetched, it is important that the requested counter does not exceed (MAX_SESS_STKCTR -1). Actually, there is no bug with a default build because, by construction, MAX_SESS_STKCTR is defined to 3 and we know that we never exceed the max value. scN_* sample fetches are numbered from 0 to 2. For other sample fetches, the value is tested. But there is a bug if MAX_SESS_STKCTR is set to a lower value. For instance 1. In this case the counters sc1_* and sc2_* may be undefined. This patch fixes the issue #330. It must be backported as far as 1.7.	2019-10-21 11:17:04 +02:00
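The bound described in the entry above amounts to rejecting any counter index that is not strictly below MAX_SESS_STKCTR before it is used. A minimal sketch follows, assuming a hypothetical slot type and accessor (struct stkctr_slot and get_stkctr() are illustrative names, not HAProxy's definitions); the default of 3 is the one quoted in the commit message.

```c
#include <assert.h>
#include <stddef.h>

#ifndef MAX_SESS_STKCTR
#define MAX_SESS_STKCTR 3 /* default quoted in the commit message */
#endif

/* Hypothetical stand-in for one tracked counter slot. */
struct stkctr_slot {
    int tracked;
};

/*
 * Returns the slot for counter <num>, or NULL when <num> exceeds
 * (MAX_SESS_STKCTR - 1), so that a build with a smaller limit never
 * reads past the end of the array.
 */
struct stkctr_slot *get_stkctr(struct stkctr_slot *slots, unsigned int num)
{
    assert(slots != NULL);
    if (num >= MAX_SESS_STKCTR)
        return NULL;
    return &slots[num];
}
```

Building the same code with -DMAX_SESS_STKCTR=1 makes indexes 1 and 2 return NULL instead of pointing outside the array.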
Christopher Faulet	e566f3db11	BUG/MINOR: ssl: Fix fd leak on error path when a TLS ticket keys file is parsed When an error occurred in the function bind_parse_tls_ticket_keys(), during the configuration parsing, the opened file is not always closed. To fix the bug, all errors are caught at the same place, where all resources are released. This patch fixes the bug #325. It must be backported as far as 1.7.	2019-10-21 10:04:51 +02:00
William Lallemand	f7f488d8e9	BUG/MINOR: mworker/cli: reload fail with inherited FD When using the master CLI with 'fd@', during a reload, the master CLI proxy is stopped. Unfortunately if this is an inherited FD it is closed too, and the master CLI won't be able to bind again during the re-execution. This leads the master to fall back to waitpid mode. This patch forbids the inherited FDs in the master's listeners to be closed during a proxy_stop(). This patch is mandatory to use the -W option in VTest versions that contain the -mcli feature. (`86e65f1024`) Should be backported as far as 1.9.	2019-10-18 21:45:42 +02:00
Emeric Brun	a9363eb6a5	BUG/MEDIUM: ssl: 'tune.ssl.default-dh-param' value ignored with openssl > 1.1.1 If openssl 1.1.1 is used, `c2aae74f0` commit mistakenly enables DH automatic feature from openssl instead of ECDH automatic feature. There is no impact for the ECDH one because the feature is always enabled for that version. But doing this, the 'tune.ssl.default-dh-param' was completely ignored for DH parameters. This patch fixes the bug calling 'SSL_CTX_set_ecdh_auto' instead of 'SSL_CTX_set_dh_auto'. Currently some users may use a 2048 DH bits parameter, thinking they're using a 1024 bits one. Doing this, they may experience performance issues on light hardware. This patch warns the user if haproxy fails to configure the given DH parameter. In this case and if openssl version is > 1.1.0, haproxy will let openssl automatically choose a default DH parameter. For other openssl versions, the DH ciphers won't be usable. A common case of failure is due to the security level of openssl.cnf which could refuse a 1024 bits DH parameter for a 2048 bits key: $ cat /etc/ssl/openssl.cnf ... [system_default_sect] MinProtocol = TLSv1 CipherString = DEFAULT@SECLEVEL=2 This should be backported into any branch containing the commit `c2aae74f0`. It requires all or part of the previous CLEANUP series. This addresses github issue #324.	2019-10-18 15:18:52 +02:00
Emeric Brun	0655c9b222	CLEANUP: bind: handle warning label on bind keywords parsing. All bind keyword parsing messages were shown as alerts. With this patch if the message is flagged only with ERR_WARN and not ERR_ALERT it will show a label [WARNING] and not [ALERT].	2019-10-18 15:18:52 +02:00
Emeric Brun	7a88336cf8	CLEANUP: ssl: make ssl_sock_load_dh_params handle errcode/warn ssl_sock_load_dh_params used to return >0 or -1 to indicate success or failure. Make it return a set of ERR_* instead so that its callers can transparently report its status. Given that its callers only used to know about ERR_ALERT \| ERR_FATAL, this is the only code returned for now. An error message was added in the case of failure and the comment was updated.	2019-10-18 15:18:52 +02:00
Emeric Brun	a96b582d0e	CLEANUP: ssl: make ssl_sock_put_ckch_into_ctx handle errcode/warn ssl_sock_put_ckch_into_ctx used to return 0 or >0 to indicate success or failure. Make it return a set of ERR_* instead so that its callers can transparently report its status. Given that its callers only used to know about ERR_ALERT \| ERR_FATAL, this is the only code returned for now. And a comment was updated.	2019-10-18 15:18:52 +02:00
Emeric Brun	054563de13	CLEANUP: ssl: make ckch_inst_new_load_(multi_)store handle errcode/warn ckch_inst_new_load_store() and ckch_inst_new_load_multi_store used to return 0 or >0 to indicate success or failure. Make them return a set of ERR_* instead so that their callers can transparently report their status. Given that their callers only used to know about ERR_ALERT \| ERR_FATAL, this is the only code returned for now. And the comment was updated.	2019-10-18 15:18:52 +02:00
Emeric Brun	f69ed1d21c	CLEANUP: ssl: make cli_parse_set_cert handle errcode and warnings. cli_parse_set_cert was reworked to show errors and warnings depending on the ERR_* bitfield value.	2019-10-18 15:18:52 +02:00
Willy Tarreau	8c5414a546	CLEANUP: ssl: make ssl_sock_load_ckchs() return a set of ERR_* ssl_sock_load_ckchs() used to return 0 or >0 to indicate success or failure even though this was not documented. Make it return a set of ERR_* instead so that its callers can transparently report its status. Given that its callers only used to know about ERR_ALERT \| ERR_FATAL, this is the only code returned for now. And a comment was added.	2019-10-18 15:18:52 +02:00
Willy Tarreau	bbc91965bf	CLEANUP: ssl: make ssl_sock_load_cert() return real error codes These functions were returning only 0 or 1 to mention success or error, and made it impossible to return a warning. Let's make them return error codes from ERR_ and map all errors to ERR_ALERT\|ERR_FATAL for now since this is the only code that was set on non-zero return value. In addition some missing comments were added or adjusted around the functions' return values.	2019-10-18 15:18:52 +02:00
Olivier Houchard	2ed389dc6e	BUG/MEDIUM: mux_pt: Only call the wake method if nobody subscribed to receive. In mux_pt_io_cb(), instead of always calling the wake method, only do so if nobody subscribed for receive. If we have a subscription, just wake the associated tasklet up. This should be backported to 1.9 and 2.0.	2019-10-18 14:18:29 +02:00
Olivier Houchard	ea510fc5e7	BUG/MEDIUM: mux_pt: Don't destroy the connection if we have a stream attached. There's a small window where the mux_pt tasklet may be woken up, and thus mux_pt_io_cb() get scheduled, and then the connection is attached to a new stream. If this happens, don't do anything, and just let the stream know by calling its wake method. If the connection had an error, the stream should take care of destroying it by calling the detach method. This should be backported to 2.0 and 1.9.	2019-10-18 14:07:22 +02:00
Olivier Houchard	9dce2c53a8	Revert `e8826ded5f`. This reverts commit "BUG/MEDIUM: mux_pt: Make sure we don't have a conn_stream before freeing.". mux_pt_io_cb() is only used if we have no associated stream, so we will never have a cs, so there's no need to check that, and we of course have to destroy the mux in mux_pt_detach() if we have no associated session, or if there's an error on the connection. This should be backported to 2.0 and 1.9.	2019-10-18 11:24:04 +02:00
Willy Tarreau	bbb5f1d6d2	BUG/MAJOR: idle conns: schedule the cleanup task on the correct threads The idle cleanup tasks' masks are wrong for threads 32 to 64, which causes the wrong thread to wake up and clean the connections that it does not own, with a risk of crash or infinite loop depending on concurrent accesses. For thread 32, any thread between 32 and 64 will be woken up, but for threads 33 to 64, in fact threads 1 to 32 will run the task instead. This issue only affects deployments enabling more than 32 threads. While it is not common in 1.9 where this has to be explicit, and can easily be dealt with by lowering the number of threads, it can be more common in 2.0 since by default the thread count is determined based on the number of available processors, hence the MAJOR tag which is mostly relevant to 2.x. The problem was first introduced into 1.9-dev9 by commit `0c18a6fe3` ("MEDIUM: servers: Add a way to keep idle connections alive.") and was later moved to cfgparse.c by commit `980855bd9` ("BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end"). This patch needs to be backported as far as 1.9, with care as 1.9 is slightly different there (uses idle_task[] instead of idle_conn_cleanup[] like in 2.x).	2019-10-18 09:04:02 +02:00
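A per-thread mask that stays correct above 32 threads has to be computed in a 64-bit type. The sketch below shows one way to do it; thread_bit() and the local MAX_THREADS value are hypothetical and the index is 0-based. The commit message does not quote the faulty expression, so treating it as a shift performed in a 32-bit int (such as 1 << thr) is an assumption made for this illustration, although such a shift would be consistent with the symptoms described.

```c
#include <assert.h>
#include <stdint.h>

#define MAX_THREADS 64 /* assumption for this sketch */

/*
 * Returns the mask with only the bit of thread <thr> set (0-based).
 * The constant is forced to 64 bits so that the shift remains defined
 * and distinct for every thread up to MAX_THREADS - 1.
 */
uint64_t thread_bit(unsigned int thr)
{
    assert(thr < MAX_THREADS);
    return UINT64_C(1) << thr;
}
```

Each of the 64 possible threads then gets its own bit, so a task scheduled with thread_bit(thr) can only be picked up by the thread that owns it.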
Olivier Houchard	e8826ded5f	BUG/MEDIUM: mux_pt: Make sure we don't have a conn_stream before freeing. On error, make sure we don't have a conn_stream before freeing the connection and the associated mux context. Otherwise a stream will still reference the connection, and attempt to use it. If we still have a conn_stream, it will properly be free'd when the detach method is called, anyway. This should be backported to 2.0 and 1.9.	2019-10-17 18:02:57 +02:00
Christopher Faulet	ba0c53ef71	BUG/MINOR: tcp: Don't alter counters returned by tcp info fetchers There are 2 kinds of tcp info fetchers. Those returning a time value (fc_rtt and fc_rttval) and those returning a counter (fc_unacked, fc_sacked, fc_retrans, fc_fackets, fc_lost, fc_reordering). Because of a bug, the counters were handled as time values, and by default, were divided by 1000 (because of an invalid conversion from us to ms). To work around this bug and have the right value, the argument "us" had to be specified. So now, tcp info fetchers returning a counter don't support any argument anymore. To not break old configurations, if an argument is provided, it is ignored and a warning is emitted during the configuration parsing. In addition, parameter validation is now performed during the configuration parsing. This patch must be backported as far as 1.7.	2019-10-17 15:20:06 +02:00
William Lallemand	5fdb5b36e1	BUG/MINOR: mworker/ssl: close openssl FDs unconditionally Patch `56996da` ("BUG/MINOR: mworker/ssl: close OpenSSL FDs on reload") fixes an issue where the /dev/random FD was leaked by OpenSSL upon a reload in master worker mode. Indeed the FD was not flagged with CLOEXEC. The fix was checking if ssl_used_frontend or ssl_used_backend were set to close the FD. This is wrong, indeed the lua init code creates an SSL server without increasing the backend value, so the deinit is never done when you don't use SSL in your configuration. To reproduce the problem you just need to build haproxy with openssl and lua with an openssl which does not use the getrandom() syscall. Neither openssl nor lua configuration is required for haproxy. This patch must be backported as far as 1.8. Fix issue #314.	2019-10-17 11:36:22 +02:00
Willy Tarreau	ccc61d87ae	BUG/MINOR: cache: also cache absolute URIs The recent changes to address URI issues mixed with the recent fix to stop caching absolute URIs have caused the cache not to cache H2 requests anymore since these ones come with a scheme and authority. Let's unbreak this by using absolute URIs all the time, now that we keep host and authority in sync. So what is done now is that if we have an authority, we take the whole URI as it is as the cache key. This covers H2 and H1 absolute requests. If no authority is present (most H1 origin requests), then we prepend "https://" and the Host header. The reason for https:// is that most of the time we don't care about the scheme, but since about all H2 clients use this scheme, at least we can share the cache between H1 and H2. No backport is needed since the breakage only affects 2.1-dev.	2019-10-17 10:40:47 +02:00
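The key selection rule described in the entry above can be sketched as follows. build_cache_key() and its parameters are hypothetical and only mirror the rule as worded in the commit message: when the request carries an authority, the whole URI is the key, otherwise "https://" and the Host header are prepended to the origin-form URI. It says nothing about how HAProxy actually stores or hashes the key.

```c
#include <assert.h>
#include <stdio.h>

/*
 * Hypothetical helper: writes the cache key for a request into <dst>.
 * <has_authority> is non-zero when <uri> is already absolute (H2 and
 * H1 absolute-form requests). Returns the key length, or -1 if the
 * key does not fit in <size> bytes.
 */
int build_cache_key(char *dst, size_t size, const char *uri,
                    const char *host, int has_authority)
{
    int len;

    assert(dst != NULL && uri != NULL);
    if (has_authority)
        len = snprintf(dst, size, "%s", uri);
    else
        len = snprintf(dst, size, "https://%s%s", host ? host : "", uri);

    /* snprintf() reports the needed length: detect truncation */
    return (len < 0 || (size_t)len >= size) ? -1 : len;
}
```

With this rule, an H2 request for https://example.com/a and an H1 request for /a with "Host: example.com" produce the same key, which is what lets both protocols share cached objects.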
David Carlier	5e4c8e2a67	BUILD/MEDIUM: threads: enable cpu_affinity on osx Enable it but on a per thread basis only using Darwin native API.	2019-10-17 07:20:58 +02:00
David Carlier	a92c5cec2d	BUILD/MEDIUM: threads: rename thread_info struct to ha_thread_info On Darwin, the thread_info name exists as a standard function thus we need to rename our array to ha_thread_info to fix this conflict.	2019-10-17 07:15:17 +02:00
Christopher Faulet	04f8919a78	MINOR: mux-h1: Force close mode for proxy responses with an unfinished request When a response generated by HAProxy is handled by the mux H1, if the corresponding request has not fully been received, the close mode is forced. Thus, the client is notified the connection will certainly be closed abruptly, without waiting for the end of the request.	2019-10-16 10:03:12 +02:00
Christopher Faulet	065118166c	MINOR: htx: Add a flag on HTX to know when a response was generated by HAProxy The flag HTX_FL_PROXY_RESP is now set on responses generated by HAProxy, excluding responses returned by applets and services. It is an informative flag set by the applicative layer.	2019-10-16 10:03:12 +02:00
Christopher Faulet	0d4ce93fcf	BUG/MINOR: http-htx: Properly set htx flags on error files to support keep-alive When an error file was loaded, the flag HTX_SL_F_XFER_LEN was never set on the HTX start line because of a bug. During the headers parsing, the flag H1_MF_XFER_LEN is never set on the h1m. But it was the condition to set HTX_SL_F_XFER_LEN on the HTX start-line. Instead, we must only rely on the flags H1_MF_CLEN or H1_MF_CHNK. Because of this bug, it was impossible to keep a connection alive for a response generated by HAProxy. Now the flag HTX_SL_F_XFER_LEN is set when an error file has a content length (chunked responses are unsupported at this stage) and the connection may be kept alive if there is no connection header specified to explicitly close it. This patch must be backported to 2.0 and 1.9.	2019-10-16 10:03:12 +02:00
Willy Tarreau	abefa34c34	MINOR: version: make the version strings variables, not constants It currently is not possible to figure the exact haproxy version from a core file for the sole reason that the version is stored into a const string and as such ends up in the .text section that is not part of a core file. By turning them into variables we move them to the data section and they appear in core files. In order to help finding them, we just prepend an extra variable in front of them and we're able to immediately spot the version strings from a core file: $ strings core \| fgrep -A2 'HAProxy version' HAProxy version follows 2.1-dev2-e0f48a-88 2019/10/15 (These are haproxy_version and haproxy_date respectively). This may be backported to 2.0 since this part is not supposed to impact anything but the developer's time spent debugging.	2019-10-16 09:56:57 +02:00
William Lallemand	e0f48ae976	BUG/MINOR: ssl: can't load ocsp files `246c024` ("MINOR: ssl: load the ocsp in/from the ckch") broke the loading of OCSP files. The function ssl_sock_load_ocsp_response_from_file() was not returning 0 upon success which led to an error after the .ocsp was read.	2019-10-15 13:50:20 +02:00
William Lallemand	786188f6bf	BUG/MINOR: ssl: fix error messages for OCSP loading The error messages for OCSP in ssl_sock_load_crt_file_into_ckch() add a double extension to the filename, that can be confusing. The messages reference a .issuer.issuer file.	2019-10-15 13:50:20 +02:00
Miroslav Zagorac	f0eb3739ac	BUG/MINOR: WURFL: fix send_log() function arguments If the user agent data contains text that has special characters that are used to format the output from the vfprintf() function, haproxy crashes. String "%s %s %s" may be used as an example. % curl -A "%s %s %s" localhost:10080/index.html curl: (52) Empty reply from server haproxy log: 00000000:WURFL-test.clireq[00c7:ffffffff]: GET /index.html HTTP/1.1 00000000:WURFL-test.clihdr[00c7:ffffffff]: host: localhost:10080 00000000:WURFL-test.clihdr[00c7:ffffffff]: user-agent: %s %s %s 00000000:WURFL-test.clihdr[00c7:ffffffff]: accept: / segmentation fault (core dumped) gdb 'where' output: #0 strlen () at ../sysdeps/x86_64/strlen.S:106 #1 0x00007f7c014a8da8 in _IO_vfprintf_internal (s=s@entry=0x7ffc808fe750, format=<optimized out>, format@entry=0x7ffc808fe9c0 "WURFL: retrieve header request returns [%s %s %s]\n", ap=ap@entry=0x7ffc808fe8b8) at vfprintf.c:1637 #2 0x00007f7c014cfe89 in _IO_vsnprintf ( string=0x55cb772c34e0 "WURFL: retrieve header request returns [(null) %s %s %s B,w\313U", maxlen=<optimized out>, format=format@entry=0x7ffc808fe9c0 "WURFL: retrieve header request returns [%s %s %s]\n", args=args@entry=0x7ffc808fe8b8) at vsnprintf.c:114 #3 0x000055cb758f898f in send_log (p=p@entry=0x0, level=level@entry=5, format=format@entry=0x7ffc808fe9c0 "WURFL: retrieve header request returns [%s %s %s]\n") at src/log.c:1477 #4 0x000055cb75845e0b in ha_wurfl_log ( message=message@entry=0x55cb75989460 "WURFL: retrieve header request returns [%s]\n") at src/wurfl.c:47 #5 0x000055cb7584614a in ha_wurfl_retrieve_header (header_name=<optimized out>, wh=0x7ffc808fec70) at src/wurfl.c:763 In case WURFL (actually HAProxy) is not compiled with debug option enabled (-DWURFL_DEBUG), this bug does not come to light. This patch could be backported in every version supporting the ScientiaMobile's WURFL. (as far as 1.7)	2019-10-15 10:47:31 +02:00
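The class of bug reported in the commit above (untrusted text reaching a printf-style function as its format argument) can be illustrated with a generic sketch. `log_fmt()` below is a hypothetical stand-in for a variadic logger such as send_log(), and `log_untrusted()` is an assumed wrapper name. Neither is taken from src/wurfl.c.

```c
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical printf-style logger writing into <out>; stands in for
 * a variadic logging function such as send_log().
 */
int log_fmt(char *out, size_t size, const char *fmt, ...)
{
	va_list ap;
	int ret;

	va_start(ap, fmt);
	ret = vsnprintf(out, size, fmt, ap);
	va_end(ap);
	return ret;
}

/* Unsafe would be: log_fmt(out, size, msg), since any "%s" found in
 * <msg> is then interpreted and reads arguments that were never passed.
 * Safe: keep the format constant and pass the untrusted text as data.
 */
int log_untrusted(char *out, size_t size, const char *msg)
{
	return log_fmt(out, size, "%s", msg);
}
```

Called with the user agent string from the report, "%s %s %s", the safe variant copies the text verbatim instead of dereferencing missing arguments.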
Christopher Faulet	531b83e039	MINOR: h1: Reject requests if the authority does not match the header host As stated in the RFC7230#5.4, a client must send a field-value for the header host that is identical to the authority if the target URI includes one. So, now, by default, if the authority, when provided, does not match the value of the header host, an error is triggered. To mitigate this behavior, it is possible to set the option "accept-invalid-http-request". In that case, an http error is captured without interrupting the request parsing.	2019-10-14 22:28:50 +02:00
Christopher Faulet	497ab4f519	MINOR: h1: Reject requests with different occurrences of the header host There is no reason for a client to send several headers host. It even may be considered as a bug. However, it is totally invalid to have different values for those. So now, in such case, an error is triggered during the request parsing. In addition, when several headers host are found with the same value, only the first instance is kept and others are skipped.	2019-10-14 22:28:50 +02:00
Christopher Faulet	486498c630	BUG/MINOR: mux-h1: Capture ignored parsing errors When the option "accept-invalid-http-request" is enabled, some parsing errors are ignored. But the position of the error is reported. In legacy HTTP mode, such errors were captured. So, we now do the same in the H1 multiplexer. If required, this patch may be backported to 2.0 and 1.9.	2019-10-14 22:28:50 +02:00
Christopher Faulet	53a899b946	CLEANUP: h1-htx: Move htx-to-h1 formatting functions from htx.c to h1_htx.c The functions "htx_*_to_h1()" have been renamed into "h1_format_htx_*()" and moved in the file h1_htx.c. It is the right place for such functions.	2019-10-14 22:28:50 +02:00
Christopher Faulet	d9233f091a	MINOR: mux-h1: Xfer as much payload data as possible during output processing When an outgoing HTX message is formatted to a raw message, DATA blocks may be split so as not to transfer more data than expected. But if the buffer is almost full, the formatting is interrupted, leaving some unused free space in the buffer, because data are too large to be copied at once. Now, we transfer as much data as possible. When the message is chunked, we also count the size used to encode the data.	2019-10-14 22:28:44 +02:00
Christopher Faulet	a61aa544b4	BUG/MINOR: mux-h1: Mark the output buffer as full when the xfer is interrupted When an outgoing HTX message is formatted to a raw message, if we fail to copy data of an HTX block into the output buffer, we mark it as full. Before it was only done calling the function buf_room_for_htx_data(). But this function is designed to optimize input processing. This patch must be backported to 2.0 and 1.9.	2019-10-14 22:09:33 +02:00
Christopher Faulet	e0f8dc576f	BUG/MEDIUM: htx: Catch chunk_memcat() failures when HTX data are formatted to h1 In functions htx_*_to_h1(), most of the time several calls to chunk_memcat() are chained. The expected size is always compared to available room in the buffer to be sure the full copy will succeed. But it is a bit risky because it relies on the fact the function chunk_memcat() evaluates the available room in the buffer in the same way as htx ones. And, unfortunately, it does not. A bug in chunk_memcat() will always leave a byte unused in the buffer. So, for instance, when a chunk is copied in an almost full buffer, the last CRLF may be skipped. To fix the issue, we now rely on the result of chunk_memcat() only. This patch must be backported to 2.0 and 1.9.	2019-10-14 16:42:46 +02:00
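The approach described in the commit above (trusting only the result of each append rather than a size computed beforehand) can be sketched as follows. The `struct outbuf` type and the `ob_cat()` / `ob_add_line()` functions are hypothetical and only model the behaviour described in the message. They are not HAProxy's buffer API.

```c
#include <string.h>

/* Hypothetical fixed-size output buffer (not HAProxy's struct buffer) */
struct outbuf {
	char data[16];
	size_t len;
};

/* Appends <n> bytes; returns 1 on success, 0 if they do not fit.
 * To model the behaviour described in the commit message, one byte
 * of the buffer is always left unused.
 */
int ob_cat(struct outbuf *b, const char *src, size_t n)
{
	if (b->len + n >= sizeof(b->data))
		return 0;
	memcpy(b->data + b->len, src, n);
	b->len += n;
	return 1;
}

/* Emits "<payload>\r\n" relying only on ob_cat()'s return value.
 * On failure the previous length is restored so that no truncated
 * line (payload without its CRLF) is left in the buffer.
 */
int ob_add_line(struct outbuf *b, const char *payload, size_t n)
{
	size_t saved = b->len;

	if (!ob_cat(b, payload, n) || !ob_cat(b, "\r\n", 2)) {
		b->len = saved;
		return 0;
	}
	return 1;
}
```

A caller that instead compared `len + n + 2` against the full 16 bytes would conclude that a 2-byte payload fits after 12 bytes of data, and would then silently lose the trailing CRLF.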
William Lallemand	4a66013069	BUG/MINOR: ssl: fix OCSP build with BoringSSL `246c024` broke the build of the OCSP code with BoringSSL. Rework it a little so it could load the OCSP buffer of the ckch. Issue #322.	2019-10-14 15:07:44 +02:00
William Lallemand	104a7a6c14	BUILD: ssl: wrong #ifdef for SSL engines code The SSL engines code was written below the OCSP #ifdef, which means you can't build the engines code if the OCSP is deactivated in the SSL lib. Could be backported in every version since 1.8.	2019-10-14 15:07:44 +02:00
William Lallemand	963b2e70ba	BUG/MINOR: ssl: fix build without multi-cert bundles Commit `150bfa8` broke the build with ssl libs that do not support multi certificate bundles. Issue #322.	2019-10-14 11:41:18 +02:00
William Lallemand	e15029bea9	BUG/MEDIUM: ssl: NULL dereference in ssl_sock_load_cert_sni() A NULL dereference can occur when inserting SNIs. This happens when checking for duplicates, if there are already several sni_ctx with the same key. Fix issue #321.	2019-10-14 10:57:16 +02:00
William Lallemand	246c0246d3	MINOR: ssl: load the ocsp in/from the ckch Don't try to load the files containing the issuer and the OCSP response each time we generate a SSL_CTX. The .ocsp and the .issuer are now loaded in the struct cert_key_and_chain only once and then loaded from this structure when creating a SSL_CTX.	2019-10-11 17:32:03 +02:00
William Lallemand	a17f4116d5	MINOR: ssl: load the sctl in/from the ckch Don't try to load the file containing the sctl each time we generate a SSL_CTX. The .sctl is now loaded in the struct cert_key_and_chain only once and then loaded from this structure when creating a SSL_CTX. Note that this now makes it possible to use sctl with multi-cert bundles.	2019-10-11 17:32:03 +02:00
William Lallemand	150bfa84e3	MEDIUM: ssl/cli: 'set ssl cert' updates a certificate from the CLI $ echo -e "set ssl cert certificate.pem <<\n$(cat certificate2.pem)\n" \| \ socat stdio /var/run/haproxy.stat Certificate updated! The operation is locked at the ckch level with a HA_SPINLOCK_T which prevents the ckch architecture (ckch_store, ckch_inst..) from being modified at the same time. So you can't do a certificate update at the same time from multiple CLI connections. SNI trees are also locked with a HA_RWLOCK_T so reading operations are locked only during a certificate update. Bundles are supported but you need to update each file (.rsa\|ecdsa\|.dsa) independently. If a file is used in the configuration as a bundle AND as a unique certificate, both will be updated. Bundles, directories and crt-list are supported, however filters in crt-list are currently unsupported. The code tries to allocate every SNIs and certificate instances first, so it can rollback the operation if that was unsuccessful. If you have too many instances of the certificate (at least 20000 in my tests on my laptop), the function can take too much time and be killed by the watchdog. This will be fixed later. Also with too many certificates it's possible that socat exits before the end of the generation without displaying a message, consider changing the socat timeout in this case (-t2 for example). The size of the certificate is currently limited by the maximum size of a payload, that must fit in a buffer.	2019-10-11 17:32:03 +02:00
William Lallemand	f11365b26a	MINOR: ssl: ssl_sock_load_crt_file_into_ckch() is filling from a BIO The function ssl_sock_load_crt_file_into_ckch() is now able to fill a ckch using a BIO in input.	2019-10-11 17:32:03 +02:00
William Lallemand	614ca0d370	MEDIUM: ssl: ssl_sock_load_ckchs() alloc a ckch_inst The ssl_sock_load_{multi}_ckchs() functions were renamed and modified: - allocate a ckch_inst and loads the sni in it - return a ckch_inst or NULL - the sni_ctx are not added anymore in the sni trees from there - renamed in ckch_inst_new_load_{multi}_store() - new ssl_sock_load_ckchs() function calls ckch_inst_new_load_{multi}_store() and add the sni_ctx to the sni trees.	2019-10-11 17:32:03 +02:00
William Lallemand	0c6d12fb66	MINOR: ssl: ssl_sock_load_multi_ckchs() can properly fail ssl_sock_load_multi_ckchs() is now able to fail without polluting the bind_conf trees and leaking memory. It is a prerequisite to load certificate on-the-fly with the CLI. The insertion of the sni_ctxs in the trees is done once everything has been allocated correctly.	2019-10-11 17:32:03 +02:00
William Lallemand	d919937991	MINOR: ssl: ssl_sock_load_ckchn() can properly fail ssl_sock_load_ckchn() is now able to fail without polluting the bind_conf trees and leaking memory. It is a prerequisite to load certificate on-the-fly with the CLI. The insertion of the sni_ctxs in the trees is done once everything has been allocated correctly.	2019-10-11 17:32:03 +02:00
William Lallemand	1d29c7438e	MEDIUM: ssl: split ssl_sock_add_cert_sni() In order to allow the creation of sni_ctx in runtime, we need to split the function to allow rollback. We need to be able to allocate all sni_ctxs required before inserting them in case we need to rollback if we didn't succeed the allocation. The function was split in 2 parts. The first one ckch_inst_add_cert_sni() allocates a struct sni_ctx, fills it with the right data and inserts it in the ckch_inst's list of sni_ctx. The second will take every sni_ctx in the ckch_inst and insert them in the bind_conf's sni tree.	2019-10-11 17:32:03 +02:00
William Lallemand	9117de9e37	MEDIUM: ssl: introduce the ckch instance structure struct ckch_inst represents an instance of a certificate (ckch_node) used in a bind_conf. All sni_ctx created for 1 ckch_node in a bind_conf are linked in this structure. This patch allocates the ckch_inst for each bind_conf and inserts the sni_ctx in its linked list.	2019-10-11 17:32:03 +02:00
William Lallemand	28a8fce485	BUG/MINOR: ssl: abort on sni_keytypes allocation failure The ssl_sock_populate_sni_keytypes_hplr() function does not return an error upon an allocation failure. The process would probably crash during the configuration parsing if the allocation fails since it tries to copy some data in the allocated memory. This patch could be backported as far as 1.5.	2019-10-11 17:32:02 +02:00
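The two-phase pattern described in the commit above (allocate everything into a private list first, then insert into the shared structure in a step that cannot fail) can be sketched generically. The names `struct item`, `prepare_items()` and `commit_items()` are hypothetical, and a simple linked list stands in for the bind_conf's sni tree. This is not the ssl_sock.c code.

```c
#include <stdlib.h>

/* Hypothetical element; plays the role of an sni_ctx in this sketch */
struct item {
	struct item *next;
	int key;
};

/* Phase 1: allocate all items into a private list. On any allocation
 * failure everything allocated so far is freed and NULL is returned,
 * so the shared structure is never touched. Note that NULL is also
 * returned for n == 0; a real API would report errors separately.
 */
struct item *prepare_items(const int *keys, size_t n)
{
	struct item *head = NULL;
	size_t i;

	for (i = 0; i < n; i++) {
		struct item *it = malloc(sizeof(*it));

		if (!it) {
			/* rollback: release the partial private list */
			while (head) {
				struct item *next = head->next;

				free(head);
				head = next;
			}
			return NULL;
		}
		it->key = keys[i];
		it->next = head;
		head = it;
	}
	return head;
}

/* Phase 2: move every prepared item into the shared list. This step
 * performs no allocation and therefore cannot fail.
 */
void commit_items(struct item **shared, struct item *pending)
{
	while (pending) {
		struct item *next = pending->next;

		pending->next = *shared;
		*shared = pending;
		pending = next;
	}
}
```

Because only the second phase touches the shared list, a failed update leaves it exactly as it was, which is the property a runtime certificate update relies on.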
William Lallemand	8ed5b96587	BUG/MINOR: ssl: free the sni_keytype nodes This patch frees the sni_keytype nodes once the sni_ctxs have been allocated in ssl_sock_load_multi_ckchn(); Could be backported in every version using the multi-cert SSL bundles.	2019-10-11 17:32:02 +02:00
William Lallemand	fe49bb3d0c	BUG/MINOR: ssl: abort on sni allocation failure The ssl_sock_add_cert_sni() function never returns an error when a sni_ctx allocation fails. It silently ignores the problem and continues to try to allocate other snis. It is unlikely that a sni allocation will succeed after one failure and start a configuration without all the snis. But to avoid any problem we return a -1 upon an sni allocation error and stop the configuration parsing. This patch must be backported in every version supporting the crt-list sni filters. (as far as 1.5)	2019-10-11 17:32:02 +02:00
William Lallemand	4b989f2fac	MINOR: ssl: initialize the sni_keytypes_map as EB_ROOT The sni_keytypes_map was initialized to {0}, it's better to initialize it explicitly to EB_ROOT	2019-10-11 17:32:02 +02:00
William Lallemand	f6adbe9f28	REORG: ssl: move structures to ssl_sock.h	2019-10-11 17:32:02 +02:00
William Lallemand	e3af8fbad3	REORG: ssl: rename ckch_node to ckch_store A ckch_store is a storage which contains a pointer to one or several cert_key_and_chain structures. This patch renames ckch_node to ckch_store, and ckch_n, ckchn to ckchs.	2019-10-11 17:32:02 +02:00
William Lallemand	eed4bf234e	MINOR: ssl: crt-list do ckchn_lookup	2019-10-11 17:32:02 +02:00
Willy Tarreau	572d9f5847	MINOR: mux-h2: also support emitting CONTINUATION on trailers Trailers were forgotten by commit `cb985a4da6` ("MEDIUM: mux-h2: support emitting CONTINUATION frames after HEADERS"), this one just fixes this miss.	2019-10-11 17:00:04 +02:00
Olivier Houchard	5a3671d8b1	MINOR: h2: Document traps to be avoided on multithread. Document a few traps to avoid if we ever attempt to allow the upper layer of the mux h2 to be run by multiple threads.	2019-10-11 16:37:41 +02:00
Olivier Houchard	06910464dd	MEDIUM: task: Split the tasklet list into two lists. As using an mt_list for the tasklet list is costly, instead use a regular list, but add an mt_list for tasklet woken up by other threads, to be run on the current thread. At the beginning of process_runnable_tasks(), we just take the new list, and merge it into the task_list. This should give us performances comparable to before we started using a mt_list, but allow us to use tasklet_wakeup() from other threads.	2019-10-11 16:37:41 +02:00
Willy Tarreau	6d4897eec0	BUILD: stats: fix missing '=' sign in array declaration I introduced this mistake when adding the description for the stats metrics, it's even amazing it built and worked at all! This was reported by Travis CI on non-GNU platforms : src/stats.c:92:39: warning: use of GNU 'missing =' extension in designator [-Wgnu-designator] [INF_NAME] { .name = "Name", .desc = "Product name" }, ^ = No backport is needed.	2019-10-11 16:39:00 +02:00
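For reference, the corrected construct from the commit above can be illustrated with a minimal, standard C99 array declaration. `name_desc`, `INF_NAME` and the "Name" entry appear in this log. The other enum members and the array name `info_fields` are assumptions made for this sketch, not copied from src/stats.c.

```c
/* Field indexes; INF_VERSION and INF_TOTAL_FIELDS are assumed names */
enum info_field {
	INF_NAME,
	INF_VERSION,
	INF_TOTAL_FIELDS
};

/* A name with room for a description */
struct name_desc {
	const char *name;
	const char *desc;
};

/* Standard C99 designated initializers require '=' after the
 * designator. Writing "[INF_NAME] { ... }" without it is a GNU
 * extension, which is what the warning quoted above points out.
 */
const struct name_desc info_fields[INF_TOTAL_FIELDS] = {
	[INF_NAME]    = { .name = "Name",    .desc = "Product name" },
	[INF_VERSION] = { .name = "Version", .desc = "Product version" },
};
```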
Willy Tarreau	19920d6fc9	BUG/MEDIUM: applet: always check a fast running applet's activity before killing In issue #277 is reported a strange problem related to a fast-spinning applet which seems to show valid progress being made. It's uncertain how this can happen, maybe some very specific timing patterns manage to place just a few bytes in each buffer and result in the peers applet being called a lot. But it appears possible to artificially cross the spinning threshold by asking for monster stats page (500 MB) and limiting the send() size to 1 MSS (1460 bytes), causing the stats page to be called for very small blocks which most often do not leave enough room to place a new chunk. The idea developed in this patch consists in not crashing for an applet which reaches a very high call rate if it shows some indication of progress. Detecting progress on applets is not trivial but in our case we know that they must at least not claim to wait for a buffer allocation if this buffer is present, wait for room if the buffer is empty, ask for more data without polling if such data are still present, nor leave with an empty input buffer without having written anything nor read anything from the other side while a shutw is pending. Doing so doesn't affect normal behaviors nor abuses of our existing applets and does at least protect against an applet performing an early return without processing events, or one causing an endless loop by asking for impossible conditions. This must be backported to 2.0.	2019-10-11 16:05:57 +02:00
Willy Tarreau	d89331ecb5	MINOR: stats: fill all the descriptions for "show info" and "show stat" Now "show info desc", "show info typed desc" and "show stat typed desc" will report (hopefully) accurate descriptions of each field. These ones were verified in the code. When some metrics are specific to the process or the thread, they are indicated. Sometimes a config option is known for a setting and it is reported as well. The purpose mainly is to help sysadmins in field more easily sort out issues vs non-issues. In part inspired by this very informative talk : https://kernel-recipes.org/en/2019/metrics-are-money/ Example: $ socat - /var/run/haproxy.sock <<< "show info desc" Name: HAProxy:"Product name" Version: 2.1-dev2-991035-31:"Product version" Release_date: 2019/10/09:"Date of latest source code update" Nbthread: 1:"Number of started threads (global.nbthread)" Nbproc: 1:"Number of started worker processes (global.nbproc)" Process_num: 1:"Relative process number (1..Nbproc)" Pid: 11975:"This worker process identifier for the system" Uptime: 0d 0h00m10s:"How long ago this worker process was started (days+hours+minutes+seconds)" Uptime_sec: 10:"How long ago this worker process was started (seconds)" Memmax_MB: 0:"Worker process's hard limit on memory usage in MB (-m on command line)" PoolAlloc_MB: 0:"Amount of memory allocated in pools (in MB)" PoolUsed_MB: 0:"Amount of pool memory currently used (in MB)" PoolFailed: 0:"Number of failed pool allocations since this worker was started" Ulimit-n: 300000:"Hard limit on the number of per-process file descriptors" Maxsock: 300000:"Hard limit on the number of per-process sockets" Maxconn: 149982:"Hard limit on the number of per-process connections (configured or imposed by Ulimit-n)" Hard_maxconn: 149982:"Hard limit on the number of per-process connections (imposed by Memmax_MB or Ulimit-n)" CurrConns: 0:"Current number of connections on this worker process" CumConns: 1:"Total number of connections on this worker process 
since started" CumReq: 1:"Total number of requests on this worker process since started" MaxSslConns: 0:"Hard limit on the number of per-process SSL endpoints (front+back), 0=unlimited" CurrSslConns: 0:"Current number of SSL endpoints on this worker process (front+back)" CumSslConns: 0:"Total number of SSL endpoints on this worker process since started (front+back)" Maxpipes: 0:"Hard limit on the number of pipes for splicing, 0=unlimited" PipesUsed: 0:"Current number of pipes in use in this worker process" PipesFree: 0:"Current number of allocated and available pipes in this worker process" ConnRate: 0:"Number of front connections created on this worker process over the last second" ConnRateLimit: 0:"Hard limit for ConnRate (global.maxconnrate)" MaxConnRate: 0:"Highest ConnRate reached on this worker process since started (in connections per second)" SessRate: 0:"Number of sessions created on this worker process over the last second" SessRateLimit: 0:"Hard limit for SessRate (global.maxsessrate)" MaxSessRate: 0:"Highest SessRate reached on this worker process since started (in sessions per second)" SslRate: 0:"Number of SSL connections created on this worker process over the last second" SslRateLimit: 0:"Hard limit for SslRate (global.maxsslrate)" MaxSslRate: 0:"Highest SslRate reached on this worker process since started (in connections per second)" SslFrontendKeyRate: 0:"Number of SSL keys created on frontends in this worker process over the last second" SslFrontendMaxKeyRate: 0:"Highest SslFrontendKeyRate reached on this worker process since started (in SSL keys per second)" SslFrontendSessionReuse_pct: 0:"Percent of frontend SSL connections which did not require a new key" SslBackendKeyRate: 0:"Number of SSL keys created on backends in this worker process over the last second" SslBackendMaxKeyRate: 0:"Highest SslBackendKeyRate reached on this worker process since started (in SSL keys per second)" SslCacheLookups: 0:"Total number of SSL session ID lookups in the 
SSL session cache on this worker since started" SslCacheMisses: 0:"Total number of SSL session ID lookups that didn't find a session in the SSL session cache on this worker since started" CompressBpsIn: 0:"Number of bytes submitted to HTTP compression in this worker process over the last second" CompressBpsOut: 0:"Number of bytes out of HTTP compression in this worker process over the last second" CompressBpsRateLim: 0:"Limit of CompressBpsOut beyond which HTTP compression is automatically disabled" Tasks: 10:"Total number of tasks in the current worker process (active + sleeping)" Run_queue: 1:"Total number of active tasks+tasklets in the current worker process" Idle_pct: 100:"Percentage of last second spent waiting in the current worker thread" node: wtap.local:"Node name (global.node)" Stopping: 0:"1 if the worker process is currently stopping, otherwise zero" Jobs: 14:"Current number of active jobs on the current worker process (frontend connections, master connections, listeners)" Unstoppable Jobs: 0:"Current number of unstoppable jobs on the current worker process (master connections)" Listeners: 13:"Current number of active listeners on the current worker process" ActivePeers: 0:"Current number of verified active peers connections on the current worker process" ConnectedPeers: 0:"Current number of peers having passed the connection step on the current worker process" DroppedLogs: 0:"Total number of dropped logs for current worker process since started" BusyPolling: 0:"1 if busy-polling is currently in use on the worker process, otherwise zero (config.busy-polling)" FailedResolutions: 0:"Total number of failed DNS resolutions in current worker process since started" TotalBytesOut: 0:"Total number of bytes emitted by current worker process since started" BytesOutRate: 0:"Number of bytes emitted by current worker process over the last second"	2019-10-10 11:30:07 +02:00
Willy Tarreau	6b19b142e8	MINOR: stats: make "show stat" and "show info" Now "show info" supports "desc" after the default and "typed" formats, and "show stat" supports this after the typed format. In both cases this appends the description for the represented metric between double quotes. The same could be done for JSON output but would possibly require to update the schema first.	2019-10-10 11:30:07 +02:00
Willy Tarreau	eaa55370c3	MINOR: stats: prepare to add a description with each stat/info field Several times some users have expressed the non-intuitive aspect of some of our stat/info metrics and suggested to add some help. This patch replaces the char* arrays with an array of name_desc so that we now have some reserved room to store a description with each stat or info field. These descriptions are currently empty and not reported yet.	2019-10-10 11:30:07 +02:00
Willy Tarreau	2f39738750	MINOR: stats: support the "desc" output format modifier for info and stat Now "show info" and "show stat" can parse "desc" as an output format modifier that will be passed down the chain to add some descriptions to the fields depending on the format in use. For now it is not exploited.	2019-10-10 11:30:07 +02:00
Willy Tarreau	43241ffb6c	MINOR: stats: uniformize the calling convention of the dump functions Some functions used to take flags + appctx with flags==appctx.flags, others neither, others just one of them. Some functions used to have the flags before the object being dumped (server) while others had it after (listener). This patch aims at cleaning this up a little bit by following this principle: - low-level functions which do not need the appctx take flags only - medium-level functions which already use the appctx for other reasons do not keep the flags - top-level functions which already have the stream-int don't need the flags nor the appctx.	2019-10-10 11:30:07 +02:00
Willy Tarreau	b0ce3ad9ff	MINOR: stats: make stats_dump_fields_json() directly take flags It used to take an inverted flag for STAT_STARTED, let's make it take the raw flags instead.	2019-10-10 11:30:07 +02:00
Willy Tarreau	ab02b3f345	MINOR: stats: get rid of the STAT_SHOWADMIN flag This flag is used to decide to show the check box in front of a proxy on the HTML stat page. It is always equal to STAT_ADMIN except when the proxy has no backend capability (i.e. a pure frontend) or has no server, in which case it's only used to avoid leaving an empty column at the beginning of the table. Not only this is pretty useless, but it also causes the columns not to align well when mixing multiple proxies with or without servers. Let's simply always use STAT_ADMIN and get rid of this flag.	2019-10-10 11:30:07 +02:00
Willy Tarreau	578d6e4360	MINOR: stats: set the appctx flags when initializing the applet only When "show stat" is emitted on the CLI, we need to set the relevant flags on the appctx. We must not re-adjust them while dumping a proxy.	2019-10-10 11:30:07 +02:00
Willy Tarreau	676c29e3ae	MINOR: stats: always merge the uri_auth flags into the appctx flags Now we only use the appctx flags everywhere in the code, and the uri_auth flags are read only by the HTTP analyser which presets the appctx ones. This will allow to simplify access to the flags everywhere.	2019-10-10 11:30:07 +02:00
Willy Tarreau	708c41602b	MINOR: stats: replace the ST_* uri_auth flags with STAT_* We used to rely on some config flags defined in uri_auth.h set during parsing, and another set of STAT_* flags defined in stats.h set at run time, with a somewhat gray area between the two sets. This is confusing in the stats code as both are called "flags" in various functions and it's quite hard to know which one describes what. This patch cleans this up by replacing all ST_* by a newly assigned value from the STAT_* set so that we can now use unified flags to describe both the configuration and the current state. There is no functional change at all.	2019-10-10 11:30:07 +02:00
Willy Tarreau	ee4f5f83d3	MINOR: stats: get rid of the ST_CONVDONE flag This flag was added in 1.4-rc1 by commit `329f74d463` ("[BUG] uri_auth: do not attemp to convert uri_auth -> http-request more than once") to address the case where two proxies inherit the stats settings from the defaults instance, and the first one compiles the expression while the second one uses it. In this case since they use the exact same uri_auth pointer, only the first one should compile and the second one must not fail the check. This was addressed by adding an ST_CONVDONE flag indicating that the expression conversion was completed and didn't need to be done again. But this is a hack and it becomes cumbersome in the middle of the other flags which are all relevant to the stats applet. Let's instead fix it by checking if we're dealing with an alias of the defaults instance and refrain from compiling this twice. This allows us to remove the ST_CONVDONE flag. A typical config requiring this check is : defaults mode http stats auth foo:bar listen l1 bind :8080 listen l2 bind :8181 Without this (or previous) check it would complain when checking l2's validity since the rule was already built.	2019-10-10 11:30:07 +02:00
Willy Tarreau	6103836315	MINOR: stats: mention in the help message support for "json" and "typed" Both "show info" and "show stat" support the "typed" output format and the "json" output format. I can never remember them, which is an indication that some help is missing.	2019-10-10 11:30:07 +02:00
Willy Tarreau	30ee1efe67	MEDIUM: h2: use the normalized URI encoding for absolute form requests H2 strongly recommends that clients exclusively use the absolute form for requests, which contains a scheme, an authority and a path, instead of the old format involving the Host header and a path. Thus there is no way to distinguish between a request intended for a proxy and an origin request, and as such proxied requests are lost. This patch makes sure to keep the encoding of all absolute form requests so that the URI is kept end-to-end. If the scheme is http or https, there is an uncertainty so the request is tagged as a normalized URI so that the other end (H1) can decide to emit it in origin form as this is by far the most commonly expected one, and it's certain that quite a number of H1 setups are not ready to cope with absolute URIs. There is a direct visible impact of this change, which is that the uri sample fetch will now return absolute URIs (as they really come on the wire) whenever these are used. It also means that default http logs will report absolute URIs. If a situation is once met where a client uses H2 to join an H1 proxy with haproxy in the middle, then it will be trivial to add an option to ask the H1 output to use absolute encoding for such requests. Later we may be able to consider that the normalized URI is the default output format and stop sending them in origin form unless an option is set. 
Now chaining multiple instances keeps the semantics as far as possible along the whole chain : 1) H1 to H1 H1:"GET /" --> H1:"GET /" # log: / H1:"GET http://" --> H1:"GET http://" # log: http:// H1:"GET ftp://" --> H1:"GET ftp://" # log: ftp:// 2) H2 to H1 H2:"GET /" --> H1:"GET /" # log: / H2:"GET http://" --> H1:"GET /" # log: http:// H2:"GET ftp://" --> H1:"GET ftp://" # log: ftp:// 3) H1 to H2 to H2 to H1 H1:"GET /" --> H2:"GET /" --> H2:"GET /" --> H1:"GET /" H1:"GET http://" --> H2:"GET http://" --> H2:"GET http://" --> H1:"GET /" H1:"GET ftp://" --> H2:"GET ftp://" --> H2:"GET ftp://" --> H1:"GET ftp://" Thus there is zero loss on H1->H1, H1->H2 nor H2->H2, and H2->H1 is normalized in origin format if ambiguous.	2019-10-09 11:10:19 +02:00
Willy Tarreau	b8ce8905cf	MEDIUM: mux-h2: do not map Host to :authority on output Instead of mapping the Host header field to :authority, we now act differently if the request is in origin form or in absolute form. If it's absolute, we extract the scheme and the authority from the request, fix the path if it's empty, and drop the Host header. Otherwise we take the scheme from the http/https flags in the HTX layer, make the URI be the path only, and emit the Host header, as indicated in RFC7540#8.1.2.3. This makes it possible to distinguish between absolute and origin requests for H1 to H2 conversions.	2019-10-09 11:10:19 +02:00
Willy Tarreau	1440fe8b4b	MINOR: h2: report in the HTX flags when the request has an authority The other side will need to know when to emit an authority or not. We need to pass this information in the HTX flags.	2019-10-09 11:10:19 +02:00
Willy Tarreau	92919f7fd5	MEDIUM: h2: make the request parser rebuild a complete URI Till now we've been producing path components of the URI and using the :authority header only to be placed into the host part. But this practice is not correct, as if we're used to convey H1 proxy requests over H2 then over H1, the absolute URI is presented as a path on output, which is not valid. In addition the scheme on output is not updated from the absolute URI either. Now the request parser will continue to deliver origin-form for request received using the http/https schemes, but will use the absolute-form when dealing with other schemes, by concatenating the scheme, the authority and the path if it's not '*'.	2019-10-09 11:10:19 +02:00
Christopher Faulet	92916d343c	MINOR: h1-htx: Only use the path of a normalized URI to format a request line When a request start-line is converted to its raw representation, if its URI is normalized, only the path part is used. Most H2 clients send requests using the absolute form (:scheme + :authority + :path), regardless of whether the request is sent to a proxy or not. But, when the request is relayed to an H1 origin server, it is unusual to send it using the absolute form. And, even if the servers must support this form, some old servers may reject it. So, for such requests, we only get the path of the absolute URI. Most of the time, it will be the right choice. However, an option will probably be added to customize this behavior.	2019-10-09 11:10:16 +02:00
Christopher Faulet	d7b7a1ce50	MEDIUM: http-htx: Keep the Host header and the request start-line synchronized In HTTP, the request authority, if any, and the Host header must be identical (excluding any userinfo subcomponent and its "@" delimiter). So now, during the request analysis, when the Host header is updated, the start-line is also updated. The authority of an absolute URI is changed accordingly. Symmetrically, if the URI is changed and it contains an authority, then the Host header is also changed. In this latter case, the flags of the start-line are also updated to reflect the changes on the URI.	2019-10-09 11:05:31 +02:00
Christopher Faulet	fe451fb9ef	MINOR: h1-htx: Set the flag HTX_SL_F_HAS_AUTHORITY during the request parsing When an h1 request is received and parsed, this flag is set if it is a CONNECT request or if an absolute URI is detected.	2019-10-09 11:05:31 +02:00
Christopher Faulet	16fdc55f79	MINOR: http: Add a function to get the authority into a URI The function http_get_authority() may be used to parse a URI and look for the authority, between the scheme and the path. An option may be used to skip the user info (part before the '@'). Most of the time, the user info will be ignored.	2019-10-09 11:05:31 +02:00
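To illustrate the kind of parsing described for `16fdc55f79`, here is a minimal sketch in C. It is not HAProxy's `http_get_authority()`: the name `uri_authority()`, its signature and its simplified handling of schemes are assumptions made for this example only.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (NOT HAProxy's http_get_authority()): returns a
 * pointer to the authority found in <uri> and stores its length in <len>.
 * Returns NULL for URIs without an authority, such as origin-form paths.
 * When <skip_userinfo> is non-zero, the "user:pass@" part is skipped.
 */
static const char *uri_authority(const char *uri, size_t *len, int skip_userinfo)
{
    const char *start, *end, *at;

    *len = 0;
    if (uri[0] == '/' || uri[0] == '*')
        return NULL;                    /* origin form or asterisk form */

    start = strstr(uri, "://");
    if (!start)
        return NULL;                    /* no scheme delimiter found */
    start += 3;

    /* the authority stops at the first '/', '?' or '#', or at the end */
    end = start + strcspn(start, "/?#");

    if (skip_userinfo) {
        /* the userinfo ends at the last '@' located inside the authority */
        for (at = end; at > start; at--) {
            if (at[-1] == '@') {
                start = at;
                break;
            }
        }
    }
    *len = (size_t)(end - start);
    return start;
}
```

With `"http://user:pw@example.com:8080/path"` as input, the sketch points at `example.com:8080` (16 bytes) when the user info is skipped, and at `user:pw@example.com:8080` (24 bytes) otherwise.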
Willy Tarreau	2be362c937	MINOR: h2: clarify the rules for how to convert an H2 request to HTX The H2 request parsing is not trivial given that we have multiple possible syntaxes. Mainly we can have :authority or not, and when a CONNECT method is seen, :scheme and :path are missing. This mostly updates the functions' comments and header index assignments to make them less confusing. Functionally there is no change.	2019-10-09 11:05:31 +02:00
Christopher Faulet	08618a733d	BUG/MINOR: mux-h1/mux-fcgi/trace: Fix position of the 4th arg in some traces In these muxes, when an integer value is provided in a trace, it must be the 4th argument. The 3rd one, if defined, is always an HTX message. Unfortunately, some traces are buggy and the 4th argument is erroneously passed in 3rd position. No backport needed.	2019-10-08 16:28:30 +02:00
Willy Tarreau	cb985a4da6	MEDIUM: mux-h2: support emitting CONTINUATION frames after HEADERS There are some reports of users not being able to pass "enterprise" traffic through haproxy when using H2 because it doesn't emit CONTINUATION frames and as such is limited to headers no longer than the negotiated max-frame-size which usually is 16 kB. This patch implements support for emitting CONTINUATION when a HEADERS frame cannot fit within a limit of mfs. It does this by first filling a buffer-wise frame, then truncating it starting from the tail to append CONTINUATION frames. This makes sure that we can truncate on any byte without being forced to stop on a header boundary, and ensures that the common case (no fragmentation) doesn't add any extra cost. By moving the tail first we make sure that each byte is moved only once, thus the performance impact remains negligible. This addresses github issue #249.	2019-10-07 18:18:32 +02:00
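The tail-first fragmentation described for `cb985a4da6` can be sketched as follows. This is an illustration under stated assumptions, not the code from mux_h2.c: `split_headers()` and `put_frame_hdr()` are hypothetical names, and only the 9-byte frame header layout, the frame types and the END_HEADERS flag come from RFC 7540.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define H2_FRAME_HDR_LEN   9    /* fixed frame header size (RFC 7540, 4.1) */
#define H2_FT_HEADERS      0x1
#define H2_FT_CONTINUATION 0x9
#define H2_F_END_HEADERS   0x4

/* Write a 9-byte HTTP/2 frame header at <p>. */
static void put_frame_hdr(uint8_t *p, size_t len, uint8_t type,
                          uint8_t flags, uint32_t sid)
{
    p[0] = (uint8_t)(len >> 16);
    p[1] = (uint8_t)(len >> 8);
    p[2] = (uint8_t)len;
    p[3] = type;
    p[4] = flags;
    p[5] = (uint8_t)((sid >> 24) & 0x7f);   /* reserved bit cleared */
    p[6] = (uint8_t)(sid >> 16);
    p[7] = (uint8_t)(sid >> 8);
    p[8] = (uint8_t)sid;
}

/* Hypothetical helper: on entry, <buf> holds room for one frame header
 * followed by <len> bytes of encoded header block. The block is split in
 * place into one HEADERS frame plus as many CONTINUATION frames as needed
 * so that no payload exceeds <mfs>. Fragments are moved starting from the
 * tail, so each byte is moved at most once. Returns the total number of
 * bytes to send, or 0 if the result does not fit in <cap>.
 */
static size_t split_headers(uint8_t *buf, size_t cap, size_t len,
                            size_t mfs, uint32_t sid)
{
    size_t nb, i, total;

    if (!mfs)
        return 0;
    nb = len ? (len + mfs - 1) / mfs : 1;
    total = len + nb * H2_FRAME_HDR_LEN;
    if (total > cap)
        return 0;

    for (i = nb; i-- > 0; ) {
        size_t frag = (i == nb - 1) ? len - i * mfs : mfs;
        uint8_t *dst = buf + i * (mfs + H2_FRAME_HDR_LEN);

        /* fragment <i> shifts right by 9 bytes per preceding frame */
        memmove(dst + H2_FRAME_HDR_LEN,
                buf + H2_FRAME_HDR_LEN + i * mfs, frag);
        put_frame_hdr(dst, frag,
                      i ? H2_FT_CONTINUATION : H2_FT_HEADERS,
                      (i == nb - 1) ? H2_F_END_HEADERS : 0, sid);
    }
    return total;
}
```

When the block fits in a single frame, the loop runs once, the `memmove()` has identical source and destination, and only the header is written, which matches the "no extra cost in the common case" goal stated in the commit message.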
Willy Tarreau	22c6107dba	BUG/MEDIUM: cache: make sure not to cache requests with absolute-uri If a request contains an absolute URI and gets its Host header field rewritten, or just the request's URI without touching the Host header field, it can lead to different Host and authority parts. The cache will always concatenate the Host and the path while a server behind would instead ignore the Host and use the authority found in the URI, leading to incorrect content possibly being cached. Let's simply refrain from caching absolute requests for now, which also matches what the comment at the top of the function says. Later we can improve this by having a special handling of the authority. This should be backported as far as 1.8.	2019-10-07 14:21:30 +02:00
Christopher Faulet	5c0f859c27	MINOR: mux-fcgi/trace: Register a new trace source with its events As for the mux h1 and h2, traces are now supported in the mux fcgi. All parts of the multiplexer are covered by these traces. Events are split into categories (fconn, fstrm, stream, rx, tx and rsp) for a total of ~40 different events with 5 verbosity levels. In traces, the first argument is always a connection. So it is easy to get the fconn (conn->ctx). The second argument is always a fstrm. The third one is an HTX message. Depending on the context it is the request or the response. In all cases it is owned by a channel. Finally, the fourth argument is an integer value. Its meaning depends on the calling context.	2019-10-04 16:12:02 +02:00
Christopher Faulet	660f6f34d7	MINOR: mux-h1: Try to wakeup the stream on output buffer allocation When the output buffer allocation failed, we block stream processing. When finally a buffer is available and we succeed in allocating the output buffer, it seems fair to wake up the stream.	2019-10-04 16:12:02 +02:00
Christopher Faulet	7a991a9b83	BUG/MINOR: mux-h1: Adjust header case when chunked encoding is add to a message When an outgoing h1 message is formatted, if it is considered as chunked but the corresponding header is missing, we add it. And as all other h1 headers, if configured so, the case of this header must be adjusted. No backport needed.	2019-10-04 16:12:02 +02:00
Christopher Faulet	5cef2a6d84	BUG/MINOR: mux-h1: Adjust header case when the server name is add to a request As all other h1 headers, if configured so, the case of this header must be adjusted. No backport needed.	2019-10-04 16:12:02 +02:00
Christopher Faulet	67d580994e	MINOR: http: Remove headers matching the name of http-send-name-header option It is not explicitly stated in the documentation, but some users rely on this behavior. When the server name is inserted in a request, headers with the same name are first removed. This patch is not tagged as a bug, because it is not explicitly documented. We choose to keep the same implicit behavior to not break existing configuration. Because this option is used very little, it is not a big deal.	2019-10-04 16:12:02 +02:00
Christopher Faulet	dabcc8eb47	MINOR: proxy: Store http-send-name-header in lower case All HTTP header names are now handled in lower case. So this one is now stored in lower case. It will simplify some processing in HTTP muxes.	2019-10-04 16:12:02 +02:00
Christopher Faulet	6b81df7276	MINOR: mux-h1/trace: register a new trace source with its events As for the mux h2, traces are now supported in the mux h1. All parts of the multiplexer are covered by these traces. Events are split into categories (h1c, h1s, stream, rx and tx) for a total of ~30 different events with 5 verbosity levels. In traces, the first argument is always a connection. So it is easy to get the h1c (conn->ctx). The second argument is always a h1s. The third one is an HTX message. Depending on the context it is the request or the response. In all cases it is owned by a channel. Finally, the fourth argument is an integer value. Its meaning depends on the calling context.	2019-10-04 16:11:57 +02:00
Christopher Faulet	af542635f7	MINOR: h1-htx: Update h1_copy_msg_data() to ease the traces in the mux-h1 This function now uses the address of the pointer to the htx message where the copy must be performed. This way, when a zero-copy is performed, there is no need to refresh the caller's htx message. It is a bit easier to do that way, especially to add traces in the mux-h1.	2019-10-04 15:46:59 +02:00
Christopher Faulet	f81ef0344e	BUG/MINOR: mux-h2/trace: Fix traces on h2c initialization When a new H2 connection is initialized, the connection context is not changed before the end. So, traces emitted during this initialization are buggy, except the last one when no error occurred, because the connection context is not an h2c. To fix the bug, the connection context is saved and set as soon as possible. So, the connection can always safely be used in all traces, except for the very first one. And on error, the connection context is restored. No need to backport.	2019-10-04 15:46:59 +02:00
Frédéric Lécaille	5a4fe5a35d	BUG/MINOR: peers: crash on reload without local peer. When we configure a "peers" section without local peer, the old haproxy process crashes on reload. The following configuration file reproduces the issue: global stats socket /tmp/sock1 mode 666 level admin stats timeout 10s peers peers peer localhost 127.0.0.1:1024 This bug was introduced by this commit: "MINOR: cfgparse: Make "peer" lines be parsed as "server" lines" This commit introduced a new condition to detect a "peers" section without local peer. This is a "peers" section with a frontend struct which has no ->id initialized member. Such a "peers" section must be removed. This patch adds this new condition to remove such peers sections without local peer as this was always done before. Must be backported to 2.0.	2019-10-04 10:21:04 +02:00
Olivier Houchard	07308677dd	BUG/MEDIUM: tasks: Don't forget to decrement tasks_run_queue. When executing tasks, don't forget to decrement tasks_run_queue once we popped one task from the task_list. tasks_run_queue used to be decremented by __tasklet_remove_from_tasklet_list(), but we now call MT_LIST_POP().	2019-10-03 14:55:40 +02:00
Willy Tarreau	c2ea47fb18	BUG/MEDIUM: mux-h2: do not enforce timeout on long connections Alexandre Derumier reported issue #308 in which the client timeout will strike on an H2 mux when it's shorter than the server's response time. What happens in practice is that there is no activity on the connection and there's no data pending on output so we can expire it. But this does not take into account the possibility that some streams are in fact waiting for the data layer above. So what we do now is that we enforce the timeout when: - there are no more streams - some data are pending in the output buffer - some streams are blocked on the connection's flow control - some streams are blocked on their own flow control - some streams are in the send/sending list In all other cases the connection will not timeout as it means that some streams are actively used by the data layer. This fix must be backported to 2.0, 1.9 and probably 1.8 as well. It depends on the new "blocked_list" field introduced by "MINOR: mux-h2: add a per-connection list of blocked streams". It would be nice to also backport "ebtree: make eb_is_empty() and eb_is_dup() take a const" to avoid a build warning.	2019-10-02 15:27:03 +02:00
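The conditions listed in `c2ea47fb18` amount to a single predicate. The sketch below is only an illustration: `struct h2_conn_view` and its field names are hypothetical and do not match HAProxy's `struct h2c`.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified view of an H2 connection. The field names are
 * assumptions for this example and are not HAProxy's.
 */
struct h2_conn_view {
    size_t nb_streams;          /* streams still attached to the connection */
    size_t out_pending;         /* bytes pending in the output buffer */
    size_t blocked_on_conn_fc;  /* streams blocked on connection flow control */
    size_t blocked_on_strm_fc;  /* streams blocked on their own flow control */
    size_t in_send_list;        /* streams in the send/sending list */
};

/* Returns true when the timeout should be enforced, i.e. when the
 * connection is idle or the mux itself is what is holding streams back.
 * In all other cases the streams are waiting for the data layer above.
 */
static bool h2_timeout_applies(const struct h2_conn_view *c)
{
    return c->nb_streams == 0 ||
           c->out_pending > 0 ||
           c->blocked_on_conn_fc > 0 ||
           c->blocked_on_strm_fc > 0 ||
           c->in_send_list > 0;
}
```

A connection with one stream and nothing pending or blocked yields `false`, which corresponds to the case from issue #308 where the server is simply slow to respond.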
Willy Tarreau	9edf6dbecc	MINOR: mux-h2: add a per-connection list of blocked streams Currently the H2 mux doesn't have a list of all the streams blocking on the H2 side. It only knows about those trying to send or waiting for a connection window update. It is problematic to enforce timeouts because we never know if a stream has to live as long as the data layer wants or has to be timed out because it's waiting for a stream window update. This patch adds a new list, "blocked_list", to store streams blocking on stream flow control, or later, dependencies. Streams blocked on sfctl are now added there. It doesn't modify the rest of the logic.	2019-10-02 14:16:14 +02:00
Willy Tarreau	35fb846333	MINOR: mux-h2/trace: missing conn pointer in demux full message One trace was missing the connection's pointer, reporting "demux buffer full" without indicating for what connection it was.	2019-10-02 14:16:14 +02:00
Willy Tarreau	6905d18495	Revert "MINOR: cache: allow caching of OPTIONS request" This reverts commit `1263540fe8`. As discussed in issues #214 and #251, this is not the correct way to cache CORS responses, since it relies on hacking the cache to cache the OPTIONS method which is explicitly non-cacheable and for which we cannot rely on any standard caching semantics (cache headers etc are not expected there). Let's roll this back for now and keep that for a more reliable and flexible CORS-specific solution later.	2019-10-01 17:59:17 +02:00
Baptiste Assmann	4c52e4b560	BUG/MINOR: action: do-resolve does not yield on requests with body @davidmogar reported a github issue (#227) about problems with the do-resolve action when the request contains a body. The variable was never populated in such a case, even though tcpdump shows a valid DNS response coming back. The do-resolve action is a task in HAProxy and so it is woken up by the scheduler each time the scheduler thinks such a task may have some work to do. When a simple HTTP request is sent, the task is called, it sends the DNS request, then the scheduler wakes up the task again later once the DNS response is there. Now, when the client sends a PUT or a POST request (or any other type) with a body, the do-resolve action is first woken up once the headers are processed. It sends the DNS request. Then, when the bytes for the body are processed by HAProxy and the DNS response has not yet been received, the action simply terminates and cleans up all the data associated with this resolution... This patch detects such behavior: if the action is woken up while a DNS resolution is in RUNNING state, the action tells the scheduler to wake it up again later. Backport status: 2.0 and above	2019-10-01 15:50:50 +02:00
William Lallemand	1633e39d91	BUILD: ssl: fix a warning when built with openssl < 1.0.2 src/ssl_sock.c:2928:12: warning: ‘ssl_sock_is_ckch_valid’ defined but not used [-Wunused-function] static int ssl_sock_is_ckch_valid(struct cert_key_and_chain *ckch) This function is only used with openssl >= 1.0.2, this patch adds a condition to build the function.	2019-09-30 13:40:53 +02:00
Tim Duesterhus	9fe7c6376a	BUG/MEDIUM: lua: Store stick tables into the sample's `t` field This patch fixes issue #306. This bug was introduced in the stick table refactoring in `1b8e68e89a`. This fix must be backported to 2.0.	2019-09-30 04:11:36 +02:00
Tim Duesterhus	2e89dec513	CLEANUP: lua: Get rid of obsolete (size_t *) cast in hlua_lua2(smp\|arg) This was required for the `chunk` API (`data` was an int), but is not required with the `buffer` API.	2019-09-30 04:11:36 +02:00
Tim Duesterhus	29d2e8aa9a	BUG/MINOR: lua: Properly initialize the buffer's fields for string samples in hlua_lua2(smp\|arg) `size` is used in conditional jumps and valgrind complains: ==24145== Conditional jump or move depends on uninitialised value(s) ==24145== at 0x4B3028: smp_is_safe (sample.h:98) ==24145== by 0x4B3028: smp_make_safe (sample.h:125) ==24145== by 0x4B3028: smp_to_stkey (stick_table.c:936) ==24145== by 0x4B3F2A: sample_conv_in_table (stick_table.c:1113) ==24145== by 0x420AD4: hlua_run_sample_conv (hlua.c:3418) ==24145== by 0x54A308F: ??? (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==24145== by 0x54AFEFC: ??? (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==24145== by 0x54A29F1: ??? (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==24145== by 0x54A3523: lua_resume (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==24145== by 0x426433: hlua_ctx_resume (hlua.c:1097) ==24145== by 0x42D7F6: hlua_action (hlua.c:6218) ==24145== by 0x43A414: http_req_get_intercept_rule (http_ana.c:3044) ==24145== by 0x43D946: http_process_req_common (http_ana.c:500) ==24145== by 0x457892: process_stream (stream.c:2084) Found while investigating issue #306. A variant of this issue exists since `55da165301`, which was using the old `chunk` API instead of the `buffer` API thus this patch must be backported to HAProxy 1.6 and higher.	2019-09-30 04:11:36 +02:00
Christopher Faulet	52c91bb72c	BUG/MINOR: stats: Add a missing break in a switch statement A break is missing in the switch statement in the function stats_emit_json_data_field(). This bug was introduced in the commit `88a0db28a` ("MINOR: stats: Add the support of float fields in stats"). This patch fixes the issue #302 and #303. It must be backported to 2.0.	2019-09-28 10:41:09 +02:00
Willy Tarreau	fc41e25c2e	BUG/MEDIUM: fcgi: fix missing list tail in sample fetch registration Ilya reported in bug #300 that ASAN found a read overflow during startup in the fcgi code due to a missing empty element at the end of the list of sample fetches. The effect is that it will randomly either work or crash on startup. No backport is needed, this is solely for 2.1-dev.	2019-09-27 22:48:27 +02:00
Christopher Faulet	88a0db28ae	MINOR: stats: Add the support of float fields in stats It is now possible to format stats counters as floats. But the stats applet does not use it. This patch is required by the Prometheus exporter to send the time averages in seconds. If the promex change is backported, this patch must be backported first.	2019-09-27 08:49:09 +02:00
Christopher Faulet	d72665b425	CLEANUP: http-ana: Remove the unused function http_send_name_header() Because the HTTP multiplexers are now responsible to handle the option "http-send-name-header", the function http_send_name_header() can be removed.	2019-09-27 08:48:53 +02:00
Christopher Faulet	72ba6cd8c0	MINOR: http: Add server name header from HTTP multiplexers The option "http-send-name-header" is an eyesore. It was responsible for several bugs because it is handled after the message analysis. With the HTX representation, the situation is cleaner because no rewind on forwarded data is required. But it remains ugly. With recent changes in HAProxy, we have the opportunity to make it fairly better. The message formatting is now done in the HTTP multiplexers. So it seems to be the right place to handle this option. Now, the server name is added by the HTTP multiplexers (h1, h2 and fcgi).	2019-09-27 08:48:21 +02:00
Christopher Faulet	b1bb1afa47	MINOR: spoe: Support the async mode with several threads A different engine-id is now generated for each thread. So, it is possible to enable the async mode with several threads. This patch may be backported to older versions.	2019-09-26 16:51:02 +02:00
Christopher Faulet	09bd9aa412	MINOR: spoe: Improve generation of the engine-id Use the same algorithm as the sample fetch uuid(). This one was added recently. So it is better to use the same way to generate UUIDs. This patch may be backported to older versions.	2019-09-26 16:51:02 +02:00
Kevin Zhu	d87b1a56d5	BUG/MEDIUM: spoe: Use a different engine-id per process SPOE engine-id is the same for all processes when nbproc is more than 1. So, in async mode, an agent receiving a NOTIFY frame from a process may send the ACK to another process. It is obviously wrong. A different engine-id must be generated for each process. This patch must be backported to 2.0, 1.9 and 1.8.	2019-09-26 16:51:02 +02:00
Christopher Faulet	eec96b5381	BUG/MINOR: mux-h1: Do h2 upgrade only on the first request When a request is received, if the h2 preface is matched, an implicit upgrade from h1 to h2 is performed. This must only be done for the first request on a connection. But a test was missing to ensure it is really the first request. This patch must be backported to 2.0.	2019-09-26 16:51:02 +02:00
Christopher Faulet	5112a603d9	BUG/MAJOR: mux_h2: Don't consume more payload than received for skipped frames When a frame is received for an unknown or already closed stream, it must be skipped. This also happens when a stream error is reported. But we must be sure to only skip received data. In the loop in h2_process_demux(), when such frames are handled, the whole frame length is systematically skipped. If the frame payload is partially received, it leaves the demux buffer in an undefined state. Because of this bug, all sorts of errors may be observed, like crashes or intermittent freezes. This patch must be backported to 2.0, 1.9 and 1.8.	2019-09-26 16:51:02 +02:00
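The rule stated in `5112a603d9`, never skip more than what was actually received, can be sketched as below. `struct demux_state` and `skip_frame_payload()` are hypothetical names for this example and are not taken from mux_h2.c.

```c
#include <stddef.h>

/* Hypothetical demux bookkeeping, NOT HAProxy's struct h2c. */
struct demux_state {
    size_t buffered;    /* bytes currently available in the demux buffer */
    size_t frame_left;  /* payload bytes of the current frame still to skip */
};

/* Skip as much of the current frame as is present in the buffer, and no
 * more. Returns the number of bytes skipped. The caller must come back
 * once more data has arrived while <frame_left> is non-zero.
 */
static size_t skip_frame_payload(struct demux_state *st)
{
    size_t n = st->frame_left < st->buffered ? st->frame_left : st->buffered;

    st->buffered   -= n;
    st->frame_left -= n;
    return n;
}
```

With a 300-byte frame and only 100 bytes buffered, the first call skips 100 bytes and leaves 200 to skip later, instead of advancing the buffer past data that has not been received yet.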
Christopher Faulet	ea7a7781a9	BUG/MINOR: mux-h2: Use the dummy error when decoding headers for a closed stream Since the commit `6884aa3e` ("BUG/MAJOR: mux-h2: Handle HEADERS frames received after a RST_STREAM frame"), HEADERS frames received for an unknown or already closed stream are decoded. Once decoded, an error is reported for the stream. But because it is a dummy stream (h2_closed_stream), its state cannot be changed. So instead, we must return the dummy error stream (h2_error_stream). This patch must be backported to 2.0 and 1.9.	2019-09-26 16:51:02 +02:00
Christopher Faulet	b2d930ebe6	BUG/MINOR: mux-h2: Fix missing braces because of traces in h2_detach() Braces were missing around an "if" statement in the function h2_detach(), leaving an unconditional return. No backport needed.	2019-09-26 16:51:02 +02:00
William Lallemand	13ed9faecd	BUG/MINOR: mux-fcgi: silence a gcc warning about null dereference Silence an impossible warning that gcc reports about a NULL dereference.	2019-09-26 11:07:39 +02:00
Willy Tarreau	4c08f12dd8	BUG/MEDIUM: mux-h2: don't reject valid frames on closed streams Consecutive to commit `6884aa3eb0` ("BUG/MAJOR: mux-h2: Handle HEADERS frames received after a RST_STREAM frame") some valid frames on closed streams (RST_STREAM, PRIORITY, WINDOW_UPDATE) were now rejected. It turns out that the previous condition was in fact intentional to catch only sensitive frames, which was indeed a mistake since these ones needed to be decoded to keep HPACK synchronized. But we must absolutely accept WINDOW_UPDATES or we risk stalling some transfers. And RST/PRIO definitely are valid. Let's adjust the condition to reflect that and update the comment to explain the reason for this unobvious condition. This must be backported to 2.0 and 1.9 after the commit above is brought there.	2019-09-26 08:47:15 +02:00
Willy Tarreau	f8340e38bf	MINOR: sink: change ring buffer "buf0"'s format to "timed" This way we now always have the events date which were really missing, especially when used with traces : <0>2019-09-26T07:57:25.183845 [00\|h2\|1\|mux_h2.c:3024] receiving H2 HEADERS frame : h2c=0x1ddcad0(B,FRP) h2s=0x1dde9e0(3,HCL) <0>2019-09-26T07:57:25.183845 [00\|h2\|4\|mux_h2.c:2505] h2c_bck_handle_headers(): entering : h2c=0x1ddcad0(B,FRP) h2s=0x1dde9> <0>2019-09-26T07:57:25.183846 [00\|h2\|4\|mux_h2.c:4096] h2c_decode_headers(): entering : h2c=0x1ddcad0(B,FRP) <0>2019-09-26T07:57:25.183847 [00\|h2\|4\|mux_h2.c:4298] h2c_decode_headers(): leaving : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183848 [00\|h2\|0\|mux_h2.c:2559] rcvd H2 response : h2c=0x1ddcad0(B,FRH) : [3] H2 RES: HTTP/2.0 200 <0>2019-09-26T07:57:25.183849 [00\|h2\|4\|mux_h2.c:2560] h2c_bck_handle_headers(): leaving : h2c=0x1ddcad0(B,FRH) h2s=0x1dde9e> <0>2019-09-26T07:57:25.183849 [00\|h2\|4\|mux_h2.c:2866] h2_process_demux(): no more Rx data : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183849 [00\|h2\|4\|mux_h2.c:3123] h2_process_demux(): notifying stream before switching SID : h2c=0x1dd> <0>2019-09-26T07:57:25.183850 [00\|h2\|4\|mux_h2.c:1014] h2s_notify_recv(): in : h2c=0x1ddcad0(B,FRH) h2s=0x1dde9e0(3,HCL) <0>2019-09-26T07:57:25.183850 [00\|h2\|4\|mux_h2.c:3135] h2_process_demux(): leaving : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183851 [00\|h2\|4\|mux_h2.c:3319] h2_send(): entering : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183851 [00\|h2\|4\|mux_h2.c:3145] h2_process_mux(): entering : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183851 [00\|h2\|4\|mux_h2.c:3234] h2_process_mux(): leaving : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183852 [00\|h2\|4\|mux_h2.c:3428] h2_send(): leaving with everything sent : h2c=0x1ddcad0(B,FRH) <0>2019-09-26T07:57:25.183852 [00\|h2\|4\|mux_h2.c:3319] h2_send(): entering : h2c=0x1ddcad0(B,FRH) It looks like some format options could finally be separate from the 
sink, or maybe enforced. For example we could imagine making the date optional or its resolution configurable within a same buffer. Similarly, maybe trace events would like to always emit the date even on stdout, while traffic logs would prefer not to emit the date in the ring buffer given that there's already one in the message.	2019-09-26 08:13:38 +02:00
Willy Tarreau	53ba9d9bcf	MINOR: sink: finally implement support for SINK_FMT_{TIMED,ISO} These formats add the date with a resolution of the microsecond before the message fields.	2019-09-26 08:13:38 +02:00
Willy Tarreau	93acfa2263	MINOR: time: add timeofday_as_iso_us() to return instant time as ISO We often need ISO time + microseconds in traces and ring buffers, thus this function does this by calling gettimeofday() and keeping a cached value of the part representing the tv_sec value, and only rewrites the microsecond part. The cache is per-thread so it's lockless and safe to use as-is. Some tests already show that it's easy to see 3-4 events in a single microsecond, thus it's likely that the nanosecond version will have to be implemented as well. But certain comments on the net suggest that some parsers are having trouble beyond microsecond, thus for now let's stick to the microsecond only.	2019-09-26 08:13:38 +02:00
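The caching approach described for `93acfa2263` could look roughly like the sketch below. It is not the HAProxy implementation: the name `iso_time_us()` is hypothetical, and the use of C11 `_Thread_local`, `localtime_r()` and `strftime()` is an assumption about one reasonable way to do it on a POSIX system.

```c
#include <stdio.h>
#include <sys/time.h>
#include <time.h>

/* Hypothetical sketch (NOT HAProxy's timeofday_as_iso_us()): returns the
 * current local time as "YYYY-MM-DDTHH:MM:SS.uuuuuu". The part derived
 * from tv_sec is formatted once per second and cached per thread, so only
 * the 6 microsecond digits are rewritten on most calls.
 */
static const char *iso_time_us(void)
{
    static _Thread_local char buf[32];      /* 26 chars + NUL fit easily */
    static _Thread_local time_t cached_sec;
    static _Thread_local int cache_valid;
    struct timeval tv;
    struct tm tm;
    time_t sec;

    gettimeofday(&tv, NULL);
    sec = tv.tv_sec;

    if (!cache_valid || sec != cached_sec) {
        /* slow path: rebuild the 20-byte "date T time ." prefix */
        localtime_r(&sec, &tm);
        strftime(buf, sizeof(buf), "%Y-%m-%dT%H:%M:%S.", &tm);
        cached_sec = sec;
        cache_valid = 1;
    }

    /* fast path: only the microseconds change within the same second */
    snprintf(buf + 20, sizeof(buf) - 20, "%06ld", (long)tv.tv_usec);
    return buf;
}
```

Because the buffer is thread-local, the returned pointer stays valid only until the next call made from the same thread, which is the usual trade-off of this kind of lockless cache.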
Krisztian Kovacs	710d987cd6	BUG/MEDIUM: namespace: close open namespaces during soft shutdown When doing a soft shutdown, we won't be making new connections anymore so there's no point in keeping the namespace file descriptors open anymore. Keeping these open effectively makes it impossible to properly clean up namespaces which are no longer used in the new configuration until all previously opened connections are closed in the old worker process. This change introduces a cleanup function that is called during soft shutdown that closes all namespace file descriptors by iterating over the namespace ebtree.	2019-09-25 23:33:52 +02:00
Willy Tarreau	cec60056e4	BUG/MINOR: mux-h2: do not wake up blocked streams before the mux is ready In h2_send() we used to scan pending streams and wake them up when it's possible to send, without considering the connection's state. Thus caused some excess failed calls to h2_snd_buf() during the preface on backend connections : [01\|h2\|4\|mux_h2.c:3562] h2_wake(): entering : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:3475] h2_process(): entering : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:3326] h2_send(): entering : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:3152] h2_process_mux(): entering : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:1508] h2c_bck_send_preface(): entering : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:1379] h2c_send_settings(): entering : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:1464] h2c_send_settings(): leaving : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:1543] h2c_bck_send_preface(): leaving : h2c=0x7f1430032ed0(B,PRF) [01\|h2\|4\|mux_h2.c:3241] h2_process_mux(): leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|3\|mux_h2.c:3384] sent data : h2c=0x7f1430032ed0(B,STG) >>> streams woken up here [01\|h2\|4\|mux_h2.c:3428] h2_send(): waking up pending stream : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3435] h2_send(): leaving with everything sent : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3326] h2_send(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3152] h2_process_mux(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3241] h2_process_mux(): leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3435] h2_send(): leaving with everything sent : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3552] h2_process(): leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3564] h2_wake(): leaving >>> I/O callback was already scheduled and called despite having nothing left to do [01\|h2\|4\|mux_h2.c:3454] h2_io_cb(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3326] h2_send(): 
entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3152] h2_process_mux(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3241] h2_process_mux(): leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3435] h2_send(): leaving with everything sent : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:3463] h2_io_cb(): leaving >>> stream tries and fails again here! [01\|h2\|4\|mux_h2.c:5568] h2_snd_buf(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5587] h2_snd_buf(): connection not ready, leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5398] h2_subscribe(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5408] h2_subscribe(): subscribe(send) : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5422] h2_subscribe(): leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5475] h2_rcv_buf(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5535] h2_rcv_buf(): leaving : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5398] h2_subscribe(): entering : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5400] h2_subscribe(): subscribe(recv) : h2c=0x7f1430032ed0(B,STG) [01\|h2\|4\|mux_h2.c:5422] h2_subscribe(): leaving : h2c=0x7f1430032ed0(B,STG) This can happen when sending the preface, the settings, and the settings ACK. Let's simply condition the wake up on st0 >= FRAME_H as is done at other places.	2019-09-25 08:34:15 +02:00
Willy Tarreau	73db434f7f	MINOR: h2/trace: report the frame type when known In state match error cases, we don't know what frame type was received because we don't reach the frame parsers. Let's add the demuxed frame type and flags in the trace when it's known. For this we make sure to always reset h2c->dsi when switching back to FRAME_H. Only one location was missing. The state transitions were not always clear (sometimes reported before, sometimes after), these were clarified by being reported only before switching.	2019-09-25 08:34:15 +02:00
Willy Tarreau	2d22144559	MINOR: h2/trace: indicate 'F' or 'B' to locate the side of an h2c in traces It was difficult in traces showing h2-to-h2 communications to figure the connection side solely based on the pointer. With this patch we prepend 'F' or 'B' before the state to make this more explicit: [06\|h2\|4\|mux_h2.c:5487] h2_rcv_buf(): entering : h2c=0x7f6acc026440(F,FRH) h2s=0x7f6acc021720(1,CLO) [06\|h2\|4\|mux_h2.c:5547] h2_rcv_buf(): leaving : h2c=0x7f6acc026440(F,FRH) h2s=0x7f6acc021720(1,CLO) [06\|h2\|4\|mux_h2.c:4040] h2_shutw(): entering : h2c=0x7f6acc026440(F,FRH) h2s=0x7f6acc021720(1,CLO)	2019-09-25 07:30:59 +02:00
Olivier Houchard	bba1a263c5	BUG/MEDIUM: tasklets: Make sure we're waking the target thread if it sleeps. Now that we can wake tasklet for other threads, make sure that if the thread is sleeping, we wake it up, or the tasklet won't be executed until it's done sleeping. That also means that, before going to sleep, and after we put our bit in sleeping_thread_mask, we have to check that nobody added a tasklet for us, just checking for global_tasks_mask isn't enough anymore.	2019-09-24 14:58:45 +02:00
Christopher Faulet	c45791aa52	BUG/MINOR: mux-fcgi: Use a literal string as format in app_log() This avoids any crash if stderr messages contain format specifiers. This patch partially fixes issue #295. No backport needed.	2019-09-24 14:30:49 +02:00
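
The fix above follows the usual C rule that untrusted text is passed as an argument to a literal "%s" format, never as the format itself. A minimal sketch of that rule, assuming a hypothetical `log_line()` helper (this is not HAProxy's `app_log()`, whose signature is not shown in this log):

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical helper, for illustration only: copies an untrusted message
 * into out. The message is only ever an argument to the literal "%s"
 * format, so any '%' it contains is copied verbatim instead of being
 * interpreted as a conversion specifier. */
static int log_line(char *out, size_t size, const char *untrusted)
{
    assert(out != NULL && size > 0 && untrusted != NULL);
    /* The unsafe variant would be: snprintf(out, size, untrusted); */
    return snprintf(out, size, "%s", untrusted);
}
```

With this shape, a message such as "100%s done %n" is logged as-is rather than making the formatter read or write through missing arguments.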
Christopher Faulet	82c798a082	CLEANUP: mux-fcgi: Remove the unused function fcgi_strm_id() This patch partially fixes the issue #295.	2019-09-24 14:11:01 +02:00
Willy Tarreau	d022e9c98b	MINOR: task: introduce a thread-local "sched" variable for local scheduler stuff The aim is to gather all scheduler information related to the current thread. It simply points to task_per_thread[tid] without having to perform the operation each time. We save around 1.2 kB of code on performance sensitive paths and increase the request rate by almost 1%.	2019-09-24 11:23:30 +02:00
Willy Tarreau	d66d75656e	MINOR: task: split the tasklet vs task code in process_runnable_tasks() There are a number of tests there which are enforced on tasklets while they will never apply (various handlers, destroyed task or not, arguments, results, ...). Instead let's have a single TASK_IS_TASKLET() test and call the tasklet processing function directly, skipping all the rest. It now appears visible that the only unneeded code is the update to curr_task that is never used for tasklets, except for opportunistic reporting in the debug handler, which can only catch si_cs_io_cb, which in practice doesn't appear in any report so the extra cost incurred there is pointless. This change alone removes 700 bytes of code, mostly in process_runnable_tasks() and increases the performance by about 1%.	2019-09-24 11:23:30 +02:00
Willy Tarreau	4c1e1ad6a8	CLEANUP: task: cache the task_per_thread pointer In process_runnable_tasks() we perform a lot of dereferences to task_per_thread[tid] but tid is thread_local and the compiler cannot know that it doesn't change so this results in making lots of thread local accesses and array dereferences. By just keeping a copy pointer of this, we let the compiler optimize the code. Just doing this has reduced process_runnable_tasks() by 124 bytes in the fast path. Doing the same in wake_expired_tasks() results in 16 extra bytes saved.	2019-09-24 11:23:30 +02:00
Willy Tarreau	9b48c629f2	CLEANUP: task: remove impossible test In process_runnable_task(), after the task's process() function returns, we used to check if the return is not NULL and is not a tasklet, to update profiling measurements. This is useless since only tasks can return non-null here. Let's remove this useless test.	2019-09-24 11:23:30 +02:00
Willy Tarreau	0f0393fc0d	BUG/MEDIUM: checks: make sure the connection is ready before trying to recv As identified in issue #278, the backport of commit `c594039225` ("BUG/MINOR: checks: do not uselessly poll for reads before the connection is up") introduced a regression in 2.0 when default checks are enabled (not "option tcp-check"), but it did not affect 2.1. What happens is that in 2.0 and earlier we have the fd cache which makes a speculative call to the I/O functions after an attempt to connect, and the __event_srv_chk_r() function was absolutely not designed to be called while a connection attempt is still pending. Thus what happens is that the test for success/failure expects the verdict to be final before waking up the check task, and since the connection is not yet validated, it fails. It will usually work over the loopback depending on scheduling, which is why it doesn't fail in reg tests. In 2.1 after the failed connect(), we subscribe to polling and usually come back with a validated connection, so the function is not expected to be called before it completes, except if it happens as a side effect of some spurious wake calls, which should not have any effect on such a check. The other check types are not impacted by this issue because they all check for a minimum data length in the buffer, and wait for more data until they are satisfied. This patch fixes the issue by explicitly checking that the connection is established before trying to read or to give a verdict. This way the function becomes safe to call regardless of the connection status (even if it's still totally ugly). This fix must be backported to 2.0.	2019-09-24 10:59:55 +02:00
Christopher Faulet	e55a5a4171	BUG/MEDIUM: stream-int: Process connection/CS errors during synchronous sends If an error occurred on the connection or the conn-stream, no synchronous send is performed. If the error was not already processed and there is no more I/O, it will never be processed and the stream will never be notified of this error. This may block the stream until a timeout is reached, or indefinitely if there is no timeout. Concretely, this bug can be triggered from time to time with h2spec, running the test "http2/5.1.1/2". This patch depends on the commit `328ed220a` "BUG/MINOR: stream-int: Process connection/CS errors first in si_cs_send()". Both must be backported to 2.0 and probably to 1.9. In 1.9, the code is totally different, so this patch would have to be adapted.	2019-09-24 10:04:19 +02:00
Christopher Faulet	328ed220a8	BUG/MINOR: stream-int: Process connection/CS errors first in si_cs_send() Errors on the connections or the conn-stream must always be processed in si_cs_send(), even if the stream-interface is already subscribed on sending. This patch does not fix any concrete bug per-se. But it is required by the following one to handle those errors during synchronous sends. This patch must be backported with the following one to 2.0 and probably to 1.9 too, but with caution because the code is really different.	2019-09-24 10:04:05 +02:00
Willy Tarreau	2bd65a781e	OPTIM: listeners: use tasklets for the multi-queue rings Now that we can wake up a remote thread's tasklet, it's way more interesting to use a tasklet than a task in the accept queue, as it will avoid passing through all the scheduler. Just doing this increases the accept rate by about 4%, overall recovering the slight loss introduced by the tasklet change. In addition it makes sure that even a heavily loaded scheduler (e.g. many very fast checks) will not delay a connection accept.	2019-09-24 06:57:32 +02:00
Krisztián Kovács (kkovacs)	538aa7168f	BUG/MEDIUM: namespace: fix fd leak in master-worker mode When namespaces are used in the configuration, the respective namespace handles are opened during config parsing and stored in an ebtree for lookup later. Unfortunately, when the master process re-execs itself these file descriptors were not closed, effectively leaking the fds and preventing destruction of namespaces no longer present in the configuration. This change fixes this issue by opening the namespace file handles as close-on-exec, making sure that they will be closed during re-exec.	2019-09-23 19:08:39 +02:00
Emmanuel Hocdet	7ceb96be72	BUG/MINOR: build: fix event ports (Solaris) Patch `6b308985` "MEDIUM: fd: do not use the FD_POLL_* flags in the pollers anymore" breaks the ev_evports.c build. Restore the variable name to fix it.	2019-09-23 19:08:39 +02:00
Olivier Houchard	ff1e9f39b9	MEDIUM: tasklets: Make the tasklet list a struct mt_list. Change the tasklet code so that the tasklet list is now a mt_list. That means that each tasklet now has an associated tid, for the thread it is expected to run on, and any thread can now call tasklet_wakeup() for that tasklet. One can change the associated tid with tasklet_set_tid().	2019-09-23 18:16:08 +02:00
Olivier Houchard	859dc80f94	MEDIUM: list: Separate "locked" list from regular list. Instead of using the same type for regular linked lists and "autolocked" linked lists, use a separate type, "struct mt_list", for the autolocked one, and introduce a set of macros, similar to the LIST_* macros, with the MT_ prefix. When we use the same entry for both regular list and autolocked list, as is done for the "list" field in struct connection, we now have to explicitly cast it to struct mt_list when using MT_ macros.	2019-09-23 18:16:08 +02:00
Willy Tarreau	6dd4ac890b	BUG/MEDIUM: check/threads: make external checks run exclusively on thread 1 See GH issue #141 for all the context. In short, registered signal handlers are not inherited by other threads during startup, which is normally not a problem, except that the thread doing the fork() must also be the one that cleans up the old process using waitpid() once its death is reported via SIGCHLD, as happens in external checks. The only simple solution to this at the moment is to make sure that external checks are exclusively run on the first thread, the one which registered the signal handlers on startup. It will be far more than enough anyway given that external checks must not require to be load balanced on multiple threads! A more complex solution could be designed over the long term to let each thread deal with all signals but it sounds overkill. This must be backported as far as 1.8.	2019-09-23 18:14:37 +02:00
Christopher Faulet	6884aa3eb0	BUG/MAJOR: mux-h2: Handle HEADERS frames received after a RST_STREAM frame As stated in RFC7540#5.1, an endpoint that receives any frame other than PRIORITY after receiving a RST_STREAM MUST treat that as a stream error of type STREAM_CLOSED. However, frames carrying compression state must still be processed before being dropped to keep the HPACK decoder synchronized. This had to be the purpose of the commit `8d9ac3ed8b` ("BUG/MEDIUM: mux-h2: do not abort HEADERS frame before decoding them"). But, the test on the frame type was inverted. This bug is major because desynchronizing the HPACK decoder leads to mixed-up indexed headers in messages. From the time a HEADERS frame is received and ignored for a closed stream, wrong headers may be sent to the following streams. This patch may fix several bugs reported on github (#116, #290, #292). It must be backported to 2.0 and 1.9.	2019-09-23 15:28:23 +02:00
Christopher Faulet	0ce57b05de	BUG/MINOR: mux-fcgi: Don't compare the filter name in its parsing callback The function parse_fcgi_flt() is called when the keyword "fcgi-app" is found on a filter line. We don't need to compare it again in the function. This patch fixes the issue #284. No backport needed.	2019-09-18 11:20:55 +02:00
Christopher Faulet	d432b3e5c8	CLEANUP: fcgi-app: Remove useless test on fcgi_conf pointer fcgi_conf was already tested after allocation. No need to test it again. This patch fixes the issue #285.	2019-09-18 11:20:55 +02:00
Christopher Faulet	a99db937c5	BUG/MINOR: mux-fcgi: Be sure to have a connection to unsubscribe When the mux is released, it must own the connection to unsubscribe. This patch fixes the issue #283. No backport needed.	2019-09-18 11:20:55 +02:00
Christopher Faulet	21d849f52f	BUG/MINOR: mux-h2: Be sure to have a connection to unsubscribe When the mux is released, it must own the connection to unsubscribe. This patch must be backported to 2.0.	2019-09-18 11:20:55 +02:00
Christopher Faulet	d66700a91c	BUG/MINOR: build: Fix compilation of mux_fcgi.c when compiled without SSL The function ssl_sock_is_ssl is only available when HAProxy is compiled with SSL support. This patch fixes the issue #279. No need to backport.	2019-09-17 13:50:20 +02:00
Christopher Faulet	99eff65f4f	MEDIUM: mux-fcgi: Add the FCGI multiplexer This multiplexer is only available on the backend side. It may handle multiplexed connections if the FCGI application supports it. A FCGI application must be configured on the backend to be used. If not redefined during the request processing by the FCGI filter, this mux handles all mandatory parameters. There is a limitation on the way the requests are processed. The parameters must be encoded into a single PARAMS record. It means that, once encoded, all HTTP headers and FCGI parameters must be small enough to be stored in a buffer. Otherwise, an internal processing error is returned.	2019-09-17 10:18:54 +02:00
Christopher Faulet	78fbb9f991	MEDIUM: fcgi-app: Add FCGI application and filter The FCGI application handles all the configuration parameters used to format requests sent to an application. The configuration of an application is grouped in a dedicated section (fcgi-app <name>) and referenced in a backend to be used (use-fcgi-app <name>). To be valid, a FCGI application must at least define a document root. But it is also possible to set the default index, a regex to split the script name and the path-info from the request URI, parameters to set or unset... In addition, this patch also adds a FCGI filter, responsible for all processing on a stream.	2019-09-17 10:18:54 +02:00
Christopher Faulet	63bbf284a1	MINOR: fcgi: Add code related to FCGI protocol This code is independent and is only responsible for encoding and decoding part of the FCGI protocol.	2019-09-17 10:18:54 +02:00
Christopher Faulet	86d144c74b	MINOR: muxes/htx: Ignore pseudo header during message formatting When an HTX message is formatted to an H1 or H2 message, pseudo-headers (with header names starting with a colon (':')) are now ignored. In fact, for now, only H2 messages have such headers, and the H2 mux already skips them when it creates the HTX message. But in the future, it may be useful to keep these headers in the HTX message to help the message analysis or to do some processing during the HTTP formatting. It would also be a good idea to have scopes for pseudo-headers (:h1-, :h2-, :fcgi-...) to limit their usage to a specific mux.	2019-09-17 10:18:54 +02:00
Christopher Faulet	cc3124cf44	MINOR: h1-htx: Use the same function to copy message payload in all cases This function will try to do a zero-copy transfer. Otherwise, it adds a data block. The same is used for messages with a content-length, chunked messages and messages with unknown body length.	2019-09-17 10:18:54 +02:00
Christopher Faulet	4f0f88a9d0	MEDIUM: mux-h1/h1-htx: move HTX conversion of H1 messages in dedicated file To avoid code duplication in the future mux FCGI, functions parsing H1 messages and converting them into HTX have been moved to the file h1_htx.c. Some specific parts remain in the mux H1. But most of the parsing is now generic.	2019-09-17 10:18:54 +02:00
Christopher Faulet	341fac1eb2	MINOR: http: Add function to parse value of the header Status It will be used by the mux FCGI to get the status of a response.	2019-09-17 10:18:54 +02:00
Christopher Faulet	5c6fefc8eb	MINOR: log: Provide a function to emit a log for an application Application is a generic term here. It is a module which handles its own log server list, with no dependency on a proxy. Such applications can now call the function app_log() to log messages, passing a log server list and a tag as parameters. Internally, the function __send_log() has been adapted accordingly.	2019-09-17 10:18:54 +02:00
Christopher Faulet	a406356255	MINOR: http_fetch: Add sample fetches to get auth method/user/pass Now, following sample fetches may be used to get information about authentication: * http_auth_type : returns the auth method as supplied in Authorization header * http_auth_user : returns the auth user as supplied in Authorization header * http_auth_pass : returns the auth pass as supplied in Authorization header Only Basic authentication is supported.	2019-09-17 10:18:54 +02:00
Christopher Faulet	c16929658f	MINOR: config: Support per-proxy and per-server post-check functions callbacks Most of the time, when a keyword is added in proxy section or on the server line, we need to have a post-parser callback to check the config validity for the proxy or the server which uses this keyword. It is possible to register a global post-parser callback. But all these callbacks need to loop on the proxies and servers to do their job. It is neither handy nor efficient. Instead, it is now possible to register per-proxy and per-server post-check callbacks.	2019-09-17 10:18:54 +02:00
Christopher Faulet	3ea5cbe6a4	MINOR: config: Support per-proxy and per-server deinit functions callbacks Most of the time, when any allocation is done during configuration parsing because of a new keyword in proxy section or on the server line, we must add a call in the deinit() function to release allocated resources. A post-deinit callback cannot be used for this because, at that stage, the proxies and the servers are already released. Now, it is possible to register deinit callbacks per-proxy or per-server. These callbacks will be called for each proxy and server before releasing them.	2019-09-17 10:18:54 +02:00
Christopher Faulet	e3d2a877fb	MINOR: http-ana: Remove err_state field from http_msg This field is not used anymore. In addition, the state HTTP_MSG_ERROR is now only used when an error occurred during the body forward.	2019-09-17 10:18:54 +02:00
Christopher Faulet	b9a92f308a	MINOR: http-ana: Handle HTX errors first during message analysis When an error occurred in a mux, most of the time, an error is also reported on the conn-stream, leading to an error (read and/or write) on the channel. When a parsing or a processing error is reported for the HTX message, it is better to handle it first.	2019-09-17 10:18:54 +02:00
Christopher Faulet	69b482180c	MINOR: mux-h1: Report a processing error during output processing During output processing, it is unexpected to have a malformed HTX message. Instead of reporting a parsing error, we now report a processing error.	2019-09-17 10:18:54 +02:00
Christopher Faulet	4e9a83349a	BUG/MEDIUM: stick-table: Properly handle "show table" with a data type argument Since the commit `1b8e68e8` ("MEDIUM: stick-table: Stop handling stick-tables as proxies."), the target field into the table context of the CLI applet was no longer a pointer to a proxy. It was replaced by a pointer to a stktable. But some parts of the code were not updated accordingly. The function table_prepare_data_request() still tries to cast it to a pointer to a proxy. The result is totally undefined. With a bit of luck, when the "show table" command is used with a data type, we fail to find a table and the error "Data type not stored in this table" is returned. But crashes may also be experienced. This patch fixes the issue #262. It must be backported to 2.0.	2019-09-13 15:46:46 +02:00
Adis Nezirovic	a46b142e88	BUG/MINOR: Missing stat_field_names (since `f21d17bb`) Recently Lua code which uses Proxy class (get_stats method) stopped working ("table index is nil from [C] method 'get_stats'"). It probably affects other codepaths too. This should be backported to 2.0 and 1.9.	2019-09-13 12:40:50 +02:00
Christopher Faulet	1dbc4676c6	BUG/MINOR: backend: Fix a possible null pointer dereference In the function connect_server(), when we are not able to reuse a connection and too many FDs are opened, the variable srv must be defined to kill an idle connection. This patch fixes the issue #257. It must be backported to 2.0.	2019-09-13 10:08:44 +02:00
Christopher Faulet	361935aa1e	BUG/MINOR: acl: Fix memory leaks when an ACL expression is parsed This only happens during the configuration parsing. First leak is the string representing the last converter parsed, if any. The second one is on the error path, when the allocation of the ACL expression failed. In this case, the sample was not released. This patch fixes the issue #256. It must be backported to all stable versions.	2019-09-13 10:08:44 +02:00
Christopher Faulet	3e395632bf	CLEANUP: mux-h2: Remove unused flag H2_SF_DATA_CHNK Since the legacy HTTP mode has been removed, this flag is not necessary anymore. Removing this flag, a test on the HTX message at the end of the function h2c_decode_headers() has also been removed fixing the github issue #244. No backport needed.	2019-09-13 10:08:28 +02:00
Luca Schimweg	8a694b859c	MINOR: sample: Add UUID-fetch Adds the fetch uuid(int). It returns a UUID following the format of version 4 in the RFC4122 standard. New feature, but could be backported.	2019-09-13 04:43:33 +02:00
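
For readers unfamiliar with the version 4 layout mentioned above, the sketch below shows how 16 random bytes become an RFC 4122 version 4 UUID string. It is not the code of the `uuid` fetch: the helper names `uuid4_format()` and `uuid4_random()` are hypothetical, and `rand()` is used only to keep the example short (it is not a strong random source).

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: formats 16 input bytes as a version 4 UUID string
 * (36 characters plus the terminating NUL). Only the version and variant
 * bits are forced; every other bit is taken from the input. */
static void uuid4_format(const uint8_t in[16], char out[37])
{
    uint8_t b[16];

    assert(in != NULL && out != NULL);
    memcpy(b, in, sizeof(b));
    b[6] = (b[6] & 0x0f) | 0x40; /* high nibble of byte 6: version 4 */
    b[8] = (b[8] & 0x3f) | 0x80; /* top two bits of byte 8: RFC 4122 variant */
    snprintf(out, 37,
             "%02x%02x%02x%02x-%02x%02x-%02x%02x-%02x%02x-"
             "%02x%02x%02x%02x%02x%02x",
             b[0], b[1], b[2], b[3], b[4], b[5], b[6], b[7],
             b[8], b[9], b[10], b[11], b[12], b[13], b[14], b[15]);
}

/* Hypothetical helper: fills the 16 bytes from rand(), for illustration. */
static void uuid4_random(char out[37])
{
    uint8_t b[16];

    for (int i = 0; i < 16; i++)
        b[i] = (uint8_t)(rand() & 0xff);
    uuid4_format(b, out);
}
```

Whatever the random input, the 15th character of the result is always `4` and the 20th is one of `8`, `9`, `a` or `b`.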
Christopher Faulet	e058f7359f	BUG/MINOR: filters: Properly set the HTTP status code on analysis error When a filter returns an error during the HTTP analysis, an error must be returned if the status code is not already set. On the request path, an error 400 is returned. On the response path, an error 502 is returned. The status is considered as unset if its value is not strictly positive. If needed, this patch may be backported to all versions having filters (as far as 1.7). Because nobody has ever reported any bug, the backport to 2.0 is probably enough.	2019-09-10 10:29:54 +02:00
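
The rule stated in the commit above fits in a few lines. The sketch below only restates that rule; the `enum flt_dir` type and the `status_on_filter_error()` function are hypothetical names, not identifiers from the HAProxy sources.

```c
#include <assert.h>

/* Hypothetical direction type, for illustration only. */
enum flt_dir { FLT_DIR_REQ, FLT_DIR_RES };

/* Returns the status to report when a filter fails during HTTP analysis.
 * A status that is not strictly positive counts as unset and is replaced
 * by 400 on the request path or 502 on the response path; a status that
 * is already set is left untouched. */
static int status_on_filter_error(int status, enum flt_dir dir)
{
    assert(dir == FLT_DIR_REQ || dir == FLT_DIR_RES);
    if (status > 0)
        return status;
    return (dir == FLT_DIR_REQ) ? 400 : 502;
}
```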
Christopher Faulet	6338a08c34	MINOR: stats: Add JSON export from the stats page It is now possible to export stats using the JSON format from the HTTP stats page. As with the CSV export, to export stats in JSON, you must add the option ";json" on the stats URL. It is also possible to dump the JSON schema with the option ";json-schema". Corresponding links have been added on the HTML page. This patch fixes the issue #263.	2019-09-10 10:29:54 +02:00
Christopher Faulet	82004145d4	BUG/MINOR: ssl: always check for ssl connection before getting its XPRT context In several SSL functions, the XPRT context is retrieved before any check on the connection. In the function ssl_sock_is_ssl(), a test suggests the connection may be null. So, it is safer to test the ssl connection before retrieving its XPRT context. It removes any ambiguities and prevents possible null pointer dereferences. This patch fixes the issue #265. It must be backported to 2.0.	2019-09-10 10:29:54 +02:00
Christopher Faulet	ad6c2eac28	BUG/MINOR: listener: Fix a possible null pointer dereference It seems to be possible to have no frontend for a listener. A test was missing before dereferencing it at the end of the function listener_accept(). This patch fixes the issue #264. It must be backported to 2.0 and 1.9.	2019-09-10 10:29:54 +02:00
David Carlier	6c00eba63b	BUILD/MINOR: auth: enabling for osx macOS supports this but as part of libc. Little typo fix while here.	2019-09-08 12:20:13 +02:00
Willy Tarreau	f21d17bbe8	MINOR: stats: report the number of idle connections for each server This adds two extra fields to the stats, one for the current number of idle connections and one for the configured limit. A tooltip link now appears on the HTML page to show these values in front of the active connection values. This should be backported to 2.0 and 1.9 as it's the only way to monitor the idle connections behaviour.	2019-09-08 09:30:50 +02:00
Willy Tarreau	6b3089856f	MEDIUM: fd: do not use the FD_POLL_* flags in the pollers anymore As mentioned in previous commit, these flags do not map well to modern poller capabilities. Let's use the FD_EV_*_{R,W} flags instead. This first patch only performs a 1-to-1 mapping making sure that the previously reported flags are still reported identically while using the closest possible semantics in the pollers. It's worth noting that kqueue will now support improvements such as returning distinctions between shut and errors on each direction, though this is not exploited for now.	2019-09-06 19:09:56 +02:00
Willy Tarreau	ccf3f6d1d6	MEDIUM: connection: enable reading only once the connection is confirmed In order to address the absurd polling sequence described in issue #253, let's make sure we disable receiving on a connection until it's established. Previously with bottom-top I/Os, we were almost certain that a connection was ready when the first I/O was confirmed. Now we can enter various functions, including process_stream(), which will attempt to read something, will fail, and will then subscribe. But we don't want them to try to receive if we know the connection didn't complete. The first prerequisite for this is to mark the connection as not ready for receiving until it's validated. But we don't want to mark it as not ready for sending because we know that attempting I/Os later is extremely likely to work without polling. Once the connection is confirmed we re-enable recv readiness. In order for this event to be taken into account, the call to tcp_connect_probe() was moved earlier, between the attempt to send() and the attempt to recv(). This way if tcp_connect_probe() enables reading, we have a chance to immediately fall back to this and read the possibly pending data. Now the trace looks like the following. 
It's far from being perfect but we've already saved one recvfrom() and one epollctl(): epoll_wait(3, [], 200, 0) = 0 socket(AF_INET, SOCK_STREAM, IPPROTO_TCP) = 7 fcntl(7, F_SETFL, O_RDONLY\|O_NONBLOCK) = 0 setsockopt(7, SOL_TCP, TCP_NODELAY, [1], 4) = 0 connect(7, {sa_family=AF_INET, sin_port=htons(8000), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLIN\|EPOLLOUT\|EPOLLRDHUP, {u32=7, u64=7}}) = 0 epoll_wait(3, [{EPOLLOUT, {u32=7, u64=7}}], 200, 1000) = 1 connect(7, {sa_family=AF_INET, sin_port=htons(8000), sin_addr=inet_addr("127.0.0.1")}, 16) = 0 getsockopt(7, SOL_SOCKET, SO_ERROR, [0], [4]) = 0 sendto(7, "OPTIONS / HTTP/1.0\r\n\r\n", 22, MSG_DONTWAIT\|MSG_NOSIGNAL, NULL, 0) = 22 epoll_ctl(3, EPOLL_CTL_MOD, 7, {EPOLLIN\|EPOLLRDHUP, {u32=7, u64=7}}) = 0 epoll_wait(3, [{EPOLLIN\|EPOLLRDHUP, {u32=7, u64=7}}], 200, 1000) = 1 getsockopt(7, SOL_SOCKET, SO_ERROR, [0], [4]) = 0 getsockopt(7, SOL_SOCKET, SO_ERROR, [0], [4]) = 0 recvfrom(7, "HTTP/1.0 200\r\nContent-length: 0\r\nX-req: size=22, time=0 ms\r\nX-rsp: id=dummy, code=200, cache=1, size=0, time=0 ms (0 real)\r\n\r\n", 16384, 0, NULL, NULL) = 126 close(7) = 0	2019-09-06 17:50:36 +02:00
Emeric Brun	5762a0db0a	BUG/MAJOR: ssl: ssl_sock was not fully initialized. 'ssl_sock' wasn't fully initialized so a new session can inherit some flags from an old one. This causes some fetches, related to client's certificate presence or its verify status and errors, returning erroneous values. This issue could generate other unexpected behaviors because a new session could also inherit other flags such as SSL_SOCK_ST_FL_16K_WBFSIZE, SSL_SOCK_SEND_UNLIMITED, or SSL_SOCK_RECV_HEARTBEAT from an old session. This must be backported to 2.0 but it's useless for previous versions.	2019-09-06 17:33:33 +02:00
Willy Tarreau	ed5ac9c786	BUG/MINOR: lb/leastconn: ignore the server weights for empty servers As discussed in issue #178, the change brought around 1.9-dev11 by commit `1eb6c55808` ("MINOR: lb: make the leastconn algorithm more accurate") causes some harm in the situation it tried to improve. By always applying the server's weight even for no connection, we end up always picking the same servers for the first connections, so under a low load, if servers only have either 0 or 1 connections, in practice the same servers will always be picked. This patch partially restores the original behaviour but still keeping the spirit of the aforementioned patch. Now what is done is that servers with no connections will always be picked first, regardless of their weight, so they will effectively follow round-robin. Only servers with one connection or more will see an accurate weight applied. This patch was developed and tested by @malsumis and @jaroslawr who reported the initial issue. It should be backported to 2.0 and 1.9.	2019-09-06 17:13:44 +02:00
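
The selection rule described in the commit above can be sketched as a sort key: zero for a server with no connection, and a weight-scaled connection count otherwise. This is a simplified illustration, not the HAProxy implementation: `struct srv`, `WEIGHT_SCALE`, `lc_key()` and `lc_pick()` are hypothetical, and ties are resolved by taking the first server in the array instead of rotating among equal candidates.

```c
#include <assert.h>
#include <stddef.h>

#define WEIGHT_SCALE 256u /* hypothetical scaling factor */

/* Hypothetical, minimal view of a server for this example. */
struct srv {
    unsigned conns;  /* connections currently served */
    unsigned weight; /* configured weight, must be > 0 */
};

/* Lower key means picked first. Servers with no connection always get key
 * 0, regardless of weight; the weight only matters once a server has at
 * least one connection. */
static unsigned lc_key(const struct srv *s)
{
    assert(s != NULL && s->weight > 0);
    if (s->conns == 0)
        return 0;
    return s->conns * WEIGHT_SCALE / s->weight;
}

/* Returns the index of the server with the lowest key (first one on tie). */
static size_t lc_pick(const struct srv *srvs, size_t n)
{
    size_t best = 0;

    assert(srvs != NULL && n > 0);
    for (size_t i = 1; i < n; i++)
        if (lc_key(&srvs[i]) < lc_key(&srvs[best]))
            best = i;
    return best;
}
```

With servers `{1 conn, weight 100}`, `{0 conns, weight 1}` and `{3 conns, weight 10}`, the idle low-weight server is chosen first, which is the behaviour the commit restores under low load.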
Christopher Faulet	cac5c094d1	BUG/MINOR: mux-h1: Fix a UAF in cfg_h1_headers_case_adjust_postparser() When an error occurs in the post-parser callback which checks configuration validity of the option outgoing-headers-case-adjust-file, the error message is freed too early, before being used. No backport needed. It fixes the github issue #258.	2019-09-06 08:59:23 +02:00
Willy Tarreau	c594039225	BUG/MINOR: checks: do not uselessly poll for reads before the connection is up It's pointless to start to perform a recv() call on a connection that is not yet established. The only purpose used to be to subscribe but that causes many extra syscalls when we know we can do it later. This patch only attempts a read if the connection is established or if there is no write planned, since we want to be certain to be called. And in wake_srv_chk() we continue to attempt to read if the reader was not subscribed, so as to perform the first read attempt. In case a first result is provided, __event_srv_chk_r() will not do anything anyway so this is totally harmless in this case. This fix requires that commit "BUG/MINOR: checks: make __event_chk_srv_r() report success before closing" is applied before, otherwise it will break some checks (notably SSL) by doing them again after the connection is shut down. This completes the fixes on the checks described in issue #253 by roughly cutting the number of syscalls in half. It must be backported to 2.0.	2019-09-06 08:13:15 +02:00
Willy Tarreau	4c1a2b30a3	BUG/MINOR: checks: make __event_chk_srv_r() report success before closing On a plain TCP check, this function will do nothing except shutting the connection down and will not even update the status. This prevents it from being called again, which is the reason why we attempt to do it once too early. Let's first fix this function to make it report success on plain TCP checks before closing, as it does for all other ones. This must be backported to 2.0. It should be safe to backport to older versions but it doesn't seem it would fix anything there.	2019-09-06 08:13:15 +02:00
Willy Tarreau	cc705a6b61	BUG/MINOR: checks: start sending the request right after connect() Since the change of I/O direction, we must not wait for an empty connect callback before sending the request, we must attempt to send it as soon as possible so that we don't uselessly poll. This is what this patch does. This reduces the total check duration by a complete poll loop compared to what is described in issue #253. This must be backported to 2.0.	2019-09-06 08:13:15 +02:00

9436 Commits