haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-14 07:21:01 +01:00

Author	SHA1	Message	Date
Tim Duesterhus	70f58997f4	BUG/MINOR: cfgparse: Support configurations without newline at EOF Fix parsing of configurations if the configuration file does not end with an LF. This patch fixes GitHub issue #704. It's a regression in 9e1758efbd68c8b1d27e17e2abe4444e110f3ebe which is 2.2 specific. No backport needed.	2020-06-23 05:14:36 +02:00
Miroslav Zagorac	88403266e5	BUG/MINOR: spoe: correction of setting bits for analyzer When a SPOE filter starts the response analyze, the wrong flag is tested on the pre_analyzers bit field. AN_RES_INSPECT must be tested instead of SPOE_EV_ON_TCP_RSP. This patch must be backported to all versions with the SPOE support, i.e as far as 1.7.	2020-06-22 11:52:04 +02:00
Christopher Faulet	c97406f790	BUG/MEDIUM: fcgi-app: Resolve the sink if a fcgi-app logs in a ring buffer If a fcgi application is configured to send its logs to a ring buffer, the corresponding sink must be resolved during the configuration post parsing. Otherwise, the sink is undefined when a log message is emitted, crashing HAProxy. No need to backport.	2020-06-22 11:35:55 +02:00
Christopher Faulet	60837d340c	REGTEST: Add a simple script to tests errorfile directives in proxy sections This script is compatible with all HAProxy versions. It does not depend on 2.2 features.	2020-06-22 10:35:38 +02:00
Willy Tarreau	dc0936c255	[RELEASE] Released version 2.2-dev10 Released version 2.2-dev10 with the following main changes : - BUILD: include: add sys/types before netinet/tcp.h - BUG/MEDIUM: log: don't hold the log lock during writev() on a file descriptor - BUILD: Remove nowarn for warnings that do not trigger - BUG/MEDIUM: pattern: fix thread safety of pattern matching - BUILD: Re-enable -Wimplicit-fallthrough - BUG/MINOR: ssl: fix ssl-{min,max}-ver with openssl < 1.1.0 - BUILD: thread: add parenthesis around values of locking macros - BUILD: proto_uxst: shut up yet another gcc's absurd warning - BUG/MEDIUM: checks: Fix off-by-one in allocation of SMTP greeting cmd - CI: travis-ci: use "-O1" for clang builds - MINOR: haproxy: Add void deinit_and_exit(int) - MINOR: haproxy: Make use of deinit_and_exit() for clean exits - BUG/MINOR: haproxy: Free rule->arg.vars.expr during deinit_act_rules - BUILD: compression: make gcc 10 happy with free_zlib() - BUILD: atomic: add string.h for memcpy() on ARM64 - BUG/MINOR: http: make smp_fetch_body() report that the contents may change - BUG/MINOR: tcp-rules: tcp-response must check the buffer's fullness - BUILD: haproxy: mark deinit_and_exit() as noreturn - BUG/MAJOR: vars: Fix bogus free() during deinit() for http-request rules - BUG/MEDIUM: ebtree: use a byte-per-byte memcmp() to compare memory blocks - MINOR: tools: add a new configurable line parse, parse_line() - BUG/MEDIUM: cfgparse: use parse_line() to expand/unquote/unescape config lines - BUG/MEDIUM: cfgparse: stop after a reasonable amount of fatal error - MINOR: http: do not close connections anymore after internal responses - BUG/MINOR: cfgparse: Add missing fatal++ in PARSE_ERR_HEX case - BUG/MINOR: spoe: add missing key length check before checking key names - MINOR: version: put the compiler version output into version.c not haproxy.c - MINOR: compiler: always define __has_feature() - MINOR: version: report the presence of the compiler's address sanitizer - BUILD: Fix build by including haproxy/global.h - BUG/MAJOR: connection: always disable ready events once reported - CLEANUP: activity: remove unused counter fd_lock - DOC: fd: make it clear that some fields ordering must absolutely be respected - MINOR: activity: report the number of times poll() reports I/O - MINOR: activity: rename confusing poll_* fields in the output - MINOR: fd: Fix a typo in a coment. - BUG/MEDIUM: fd: Don't fd_stop_recv() a fd we don't own. - BUG/MEDIUM: fd: Call fd_stop_recv() when we just got a fd. - MINOR: activity: group the per-loop counters at the top - MINOR: activity: rename the "stream" field to "stream_calls" - MEDIUM: fd: refine the fd_takeover() migration lock - MINOR: fd: slightly optimize the fd_takeover double-CAS loop - MINOR: fd: factorize the fd_takeover() exit path to make it safer - MINOR: peers: do not use localpeer as an array anymore - MEDIUM: peers: add the "localpeer" global option - MEDIUM: fd: add experimental support for edge-triggered polling - CONTRIB: debug: add the missing flags CO_FL_SAFE_LIST and CO_FL_IDLE_LIST - MINOR: haproxy: process signals before runnable tasks - MEDIUM: tasks: clean up the front side of the wait queue in wake_expired_tasks() - MEDIUM: tasks: also process late wakeups in process_runnable_tasks() - BUG/MINOR: cli: allow space escaping on the CLI - BUG/MINOR: mworker/cli: fix the escaping in the master CLI - BUG/MINOR: mworker/cli: fix semicolon escaping in master CLI - REGTEST: http-rules: test spaces in ACLs - REGTEST: http-rules: test spaces in ACLs with master CLI - BUG/MAJOR: init: properly compute the default global.maxpipes value - MEDIUM: map: make the "clear map" operation yield - BUG/MEDIUM: stream-int: fix loss of CO_SFL_MSG_MORE flag in forwarding - MINOR: mux_h1: Set H1_F_CO_MSG_MORE if we know we have more to send. - BUG/MINOR: systemd: Wait for network to be online - DOC: configuration: Unindent non-code sentences in the protobuf example - DOC: configuration: http-check send was missing from matrix v2.2-dev10	2020-06-19 21:43:26 +02:00
Peter Gervai	8912ae6987	DOC: configuration: http-check send was missing from matrix The new directive and its doc were added by commit 8acb1284b ("MINOR: checks: Add a way to send custom headers and payload during http chekcs") but the index was not updated.	2020-06-19 21:38:05 +02:00
Peter Gervai	df4c9d2a28	DOC: configuration: Unindent non-code sentences in the protobuf example Unindent to make the explanation go back to text from code formatted example in tyhe HTMLized version. Still it's not perfect since these are not haproxy examples but protobuf config, but... way better.	2020-06-19 21:33:37 +02:00
Ryan O'Hara	f49a6049b8	BUG/MINOR: systemd: Wait for network to be online Change systemd service file to wait for network to be completely online. This solves two problems: If haproxy is configured to bind to IP address(es) that are not yet assigned, haproxy would previously fail. The workaround is to use "option transparent". If haproxy us configured to use a resolver to resolve servers via DNS, haproxy would previously fail due to the fact that the network is not fully online yet. This is the most compelling reason for this patch. Signed-off-by: Ryan O'Hara <rohara@redhat.com> Acked-by: Lukas Tribus <lukas@ltri.eu>	2020-06-19 21:31:10 +02:00
Olivier Houchard	c89a42feba	MINOR: mux_h1: Set H1_F_CO_MSG_MORE if we know we have more to send. In h1_snd_buf(), also set H1_F_CO_MSG_MORE if we know we still have more to send, not just if the stream-interface told us to do so. This may happen if the last block of a transfer doesn't fit in the buffer, it remains useful for the transport layer to know that more data follows what's already in the buffer.	2020-06-19 17:42:42 +02:00
Willy Tarreau	8945bb6c05	BUG/MEDIUM: stream-int: fix loss of CO_SFL_MSG_MORE flag in forwarding In 2.2-dev1, a change was made by commit 46230363a ("MINOR: mux-h1: Inherit send flags from the upper layer"). The purpose was to accurately set the CO_SFL_MSG_MORE flag on the transport layer because previously it as only set based on the buffer full condition, which does not accurately indicate that there are more data to follow. The problem is that the stream-interface never sets this flag anymore in HTX mode due to the channel's to_forward always being set to infinity. Because of this, HTX transfers are always performed without the MSG_MORE flag and experience a severe performance degradation on large transfers. This patch addresses this by making the stream-interface aware of HTX and having it check for CF_EOI to check if more contents are expected or not. With this change, the single-threaded forwarding performance on 10 MB objects jumped from 29 to 40 Gbps. No backport is needed.	2020-06-19 17:42:42 +02:00
Willy Tarreau	d1d005d7f6	MEDIUM: map: make the "clear map" operation yield As reported in issue #419, a "clear map" operation on a very large map can take a lot of time and freeze the entire process for several seconds. This patch makes sure that pat_ref_prune() can regularly yield after clearing some entries so that the rest of the process continues to work. The first part, the removal of the patterns, can take quite some time by itself in one run but it's still relatively fast. It may block for up to 100ms for 16M IP addresses in a tree typically. This change needed to declare an I/O handler for the clear operation so that we can get back to it after yielding. The second part can be much slower because it deconstructs the elements and its users, but it iterates progressively so we can yield less often here. The patch was tested with traffic in parallel sollicitating the map being released and showed no problem. Some traffic will definitely notice an incomplete map but the filling is already not atomic anyway thus this is not different. It may be backported to stable versions once sufficiently tested for side effects, at least as far as 2.0 in order to avoid the watchdog triggering when the process is frozen there. For a better behaviour, all these prune_* functions should support yielding so that the callers have a chance to continue also yield in turn.	2020-06-19 16:57:51 +02:00
Willy Tarreau	a4818db0a9	BUG/MAJOR: init: properly compute the default global.maxpipes value Initial default settings for maxconn/maxsock/maxpipes were rearranged in commit a409f30d0 ("MINOR: init: move the maxsock calculation code to compute_ideal_maxsock()") but as a side effect, the calculated maxpipes value was not stored anymore into global.maxpipes. This resulted in splicing being disabled unless there is an explicit maxpipes setting in the global section. This patch just stores the calculated ideal value as planned in the computation and as was done before the patch above. This is strictly 2.2, no backport is needed.	2020-06-19 16:23:36 +02:00
William Lallemand	5bb21b1d29	REGTEST: http-rules: test spaces in ACLs with master CLI Do the tests for spaces on the CLI with the master CLI. Could be backported as far as 2.0 once the required patches are applied.	2020-06-19 14:32:55 +02:00
William Lallemand	398c5f39ee	REGTEST: http-rules: test spaces in ACLs This reg-test tests the spaces in an ACL file, it tries to add new entries with spaces from the CLI This reg-test could backported in all stable branches if the fix for spaces on the CLI was backported.	2020-06-19 14:32:55 +02:00
William Lallemand	02c255e64b	BUG/MINOR: mworker/cli: fix semicolon escaping in master CLI Fix the semicolon escaping which must be handled in the master CLI, the commands were wrongly splitted and could be forwarded partially to the target CLI.	2020-06-19 14:32:55 +02:00
William Lallemand	fe249c3df5	BUG/MINOR: mworker/cli: fix the escaping in the master CLI The master CLI must not do the escaping since it forwards the commands to another CLI. It should be able to split into words by taking care of the escaping, but must not remove the forwarded backslashes. This fix do the same thing as the previous patch applied to the cli_parse_request() function, by taking care of the escaping during the word split, but it also remove the part which was removing the backslashes from the forwarded command.	2020-06-19 14:32:55 +02:00
Yves Lafon	b08c6d06e7	BUG/MINOR: cli: allow space escaping on the CLI It was not possible to escape spaces over the CLI, making impossible the insertion of new ACL entries with spaces from the CLI. This patch fixes the escaping of spaces over the CLI. It is now possible to launch "add acl agents.acl My\ User\ Agent" over the CLI. Could be backported in all stable branches. Should fix issue #400.	2020-06-19 14:32:55 +02:00
Willy Tarreau	5c8be272c7	MEDIUM: tasks: also process late wakeups in process_runnable_tasks() Since version 1.8, we've started to use tasks and tasklets more extensively to defer I/O processing. Originally with the simple scheduler, a task waking another one up using task_wakeup() would have caused it to be processed right after the list of runnable ones. With the introduction of tasklets, we've started to spill running tasks from the run queues to the tasklet queues, so if a task wakes another one up, it will only be executed on the next call to process_runnable_task(), which means after yet another round of polling loop. This is particularly visible with I/Os hitting muxes: poll() reports a read event, the connection layer performs a tasklet_wakeup() on the mux subscribed to this I/O, and this mux in turn signals the upper layer stream using task_wakeup(). The process goes back to poll() with a null timeout since there's one active task, then back to checking all possibly expired events, and finally back to process_runnable_tasks() again. Worse, when there is high I/O activity, doing so will make the task's execution further apart from the tasklet and will both increase the total processing latency and reduce the cache hit ratio. This patch brings back to the original spirit of process_runnable_tasks() which is to execute runnable tasks as long as the execution budget is not exhausted. By doing so, we're immediately cutting in half the number of calls to all functions called by run_poll_loop(), and halving the number of calls to poll(). Furthermore, calling poll() less often also means purging FD updates less often and offering more chances to merge them. This also has the nice effect of making tune.runqueue-depth effective again, as in the past it used to be quickly bounded by this artificial event horizon which was preventing from executing remaining tasks. On certain workloads we can see a 2-3% performance increase.	2020-06-19 14:21:46 +02:00
Willy Tarreau	77015abe0b	MEDIUM: tasks: clean up the front side of the wait queue in wake_expired_tasks() Due to the way the wait queue works, some tasks might be postponed but not requeued. However when we exit wake_expired_tasks() on a not-yet-expired task and leave it in this situation, the next call to next_timer_expiry() will use this first task's key in the tree as an expiration date, but this date might be totally off and cause needless wakeups just to reposition it. This patch makes sure that we leave wake_expired_tasks with a clean state of frontside tasks and that their tree's key matches their expiration date. Doing so we can already observe a ~15% reduction of the number of wakeups when dealing with large numbers of health checks. The patch looks large because the code was rearranged but the real change is to take the wakeup/requeue decision on the task's expiration date instead of the tree node's key, the rest is unchanged.	2020-06-19 14:21:46 +02:00
Willy Tarreau	a7ad4aed60	MINOR: haproxy: process signals before runnable tasks Nowadays signals cause tasks to be woken up. The historic code still processes signals after tasks, which forces a second round in the loop before they can effectively be processed. Let's move the signal queue handling between wake_expired_tasks() and process_runnable_tasks() where it makes much more sense.	2020-06-19 14:21:46 +02:00
Willy Tarreau	54067e9d38	CONTRIB: debug: add the missing flags CO_FL_SAFE_LIST and CO_FL_IDLE_LIST As often when flags are added they're not updated here. These ones were missing. They're 2.2 only so no backport is needed.	2020-06-19 14:21:46 +02:00
Willy Tarreau	bc52bec163	MEDIUM: fd: add experimental support for edge-triggered polling Some of the recent optimizations around the polling to save a few epoll_ctl() calls have shown that they could also cause some trouble. However, over time our code base has become totally asynchronous with I/Os always attempted from the upper layers and only retried at the bottom, making it look like we're getting closer to EPOLLET support. There are showstoppers there such as the listeners which cannot support this. But given that most of the epoll_ctl() dance comes from the connections, we can try to enable edge-triggered polling on connections. What this patch does is to add a new global tunable "tune.fd.edge-triggered", that makes fd_insert() automatically set an et_possible bit on the fd if the I/O callback is conn_fd_handler. When the epoll code sees an update for such an FD, it immediately registers it in both directions the first time and doesn't update it anymore. On a few tests it proved quite useful with a 14% request rate increase in a H2->H1 scenario, reducing the epoll_ctl() calls from 2 per request to 2 per connection. The option is obviously disabled by default as bugs are still expected, particularly around the subscribe() code where it is possible that some layers do not always re-attempt reading data after being woken up.	2020-06-19 14:21:46 +02:00
Dragan Dosen	13cd54c08b	MEDIUM: peers: add the "localpeer" global option localpeer <name> Sets the local instance's peer name. It will be ignored if the "-L" command line argument is specified or if used after "peers" section definitions. In such cases, a warning message will be emitted during the configuration parsing. This option will also set the HAPROXY_LOCALPEER environment variable. See also "-L" in the management guide and "peers" section in the configuration manual.	2020-06-19 11:37:30 +02:00
Dragan Dosen	4f01415d3b	MINOR: peers: do not use localpeer as an array anymore It is now dynamically allocated by using strdup().	2020-06-19 11:37:11 +02:00
Willy Tarreau	f1cad38281	MINOR: fd: factorize the fd_takeover() exit path to make it safer Since there was a risk of leaving fd_takeover() without properly stopping the fd, let's take this opportunity for factoring the code around a commont exit point that's common to both double-cas and locked modes. This means using the "ret" variable inside the double-CAS code, and inverting the loop to first test the old values. Doing do also produces cleaner code because the compiler cannot factorize common exit paths using asm statements that are present in some atomic ops.	2020-06-18 08:25:42 +02:00
Willy Tarreau	4297363de3	MINOR: fd: slightly optimize the fd_takeover double-CAS loop The loop in fd_takeover() around the double-CAS is conditionned on a previous value of old_masks[0] that always matches tid_bit on the first iteration because it does not result from the atomic op but from a pre-loaded value. Let's set the result of the atomic op there instead so that the conflict between threads can be detected earlier and before performing the double-word CAS.	2020-06-18 08:08:50 +02:00
Willy Tarreau	c460c91633	MEDIUM: fd: refine the fd_takeover() migration lock When haproxy is compiled without double-word CAS, we use a migration lock in fd_takeover(). This lock was covering the atomic OR on the running_mask before checking its value, while it is not needed since this atomic op already returns the result. Let's just refine the code to avoid grabbing the lock in the event another thread has already stolen the FD, this may reduce contention in high reuse rate scenarios.	2020-06-18 07:28:09 +02:00
Willy Tarreau	7af4fa9a48	MINOR: activity: rename the "stream" field to "stream_calls" This one was confusingly called, I thought it was the cumulated number of streams but it's the number of calls to process_stream(). Let's make this clearer.	2020-06-17 20:52:29 +02:00
Willy Tarreau	a00cf9bbaf	MINOR: activity: group the per-loop counters at the top empty_rq and long_rq are per-loop so it makes sense to group them together with the loop count. In addition since ctxsw and tasksw apply in the context of these counters, let's move them as well. More precisely the difference between wake_tasks and long_rq should roughly correspond to the number of inter-task messages. Visually it's much easier to spot ratios of wakeup causes now.	2020-06-17 20:52:29 +02:00
Olivier Houchard	ddc874c46c	BUG/MEDIUM: fd: Call fd_stop_recv() when we just got a fd. In fd_takeover(), when a double-width compare-and-swap is implemented, make sure, if we managed to get the fd, to call fd_stop_recv() on it, so that the thread that used to own it will know it has to stop polling it.	2020-06-17 20:36:28 +02:00
Olivier Houchard	8d7b517824	BUG/MEDIUM: fd: Don't fd_stop_recv() a fd we don't own. In fd_takeover(), if we failed to grab the fd, when a double-width compare-and-swap is not implemented, do not call fd_stop_recv() on the fd, it is not ours and may be used by another thread.	2020-06-17 20:36:28 +02:00
Olivier Houchard	f86a106f68	MINOR: fd: Fix a typo in a coment. The function si called fd_takeover, not fd_takeother.	2020-06-17 20:36:28 +02:00
Willy Tarreau	e406386542	MINOR: activity: rename confusing poll_* fields in the output We have poll_drop, poll_dead and poll_skip which are confusingly named like their poll_io and poll_exp counterparts except that they are not per poll() call but per-fd. This patch renames them to poll_drop_fd(), poll_dead_fd() and poll_skip_fd() for this reason.	2020-06-17 20:35:33 +02:00
Willy Tarreau	e545153c50	MINOR: activity: report the number of times poll() reports I/O The "show activity" output mentions a number of indicators to explain wake up reasons but doesn't have the number of times poll() sees some I/O. And given that multiple events can happen simultaneously, it's not always possible to deduce this metric by subtracting. This patch adds a new "poll_io" counter that allows one to see how often poll() returns with at least one active FD. This should help detect stuck events and measure various ratios of poll sub-metrics.	2020-06-17 20:25:18 +02:00
Willy Tarreau	c208a54ab2	DOC: fd: make it clear that some fields ordering must absolutely be respected fd_set_running() and fd_takeover() may both use a double-word CAS on the (running_mask, thread_mask) couple and as such they expect the fields to be exactly arranged like this. It's critical not to reorder them, so add a comment to avoid such a potential mistake later.	2020-06-17 19:58:37 +02:00
Willy Tarreau	4f72ec851c	CLEANUP: activity: remove unused counter fd_lock Since 2.1-dev2, with commit 305d5ab46 ("MAJOR: fd: Get rid of the fd cache.") we don't have the fd_lock anymore and as such its acitvity counter is always zero. Let's remove it from the struct and from "show activity" output, as there are already plenty of indicators to look at. The cache line comment in the struct activity was updated to reflect reality as it looks like another one already got removed in the past.	2020-06-17 19:15:51 +02:00
Willy Tarreau	4cabfc18a3	BUG/MAJOR: connection: always disable ready events once reported This effectively reverts the two following commits: 6f95f6e11 ("OPTIM: connection: disable receiving on disabled events when the run queue is too high") 065a02561 ("MEDIUM: connection: don't stop receiving events in the FD handler") The problem as reported in issue #662 is that when the events signals the readiness of input data that has to be forwarded over a congested stream, the mux will read data and wake the stream up to forward them, but the buffer full condition makes this impossible immediately, then nobody in the chain will be able to disable the event after it was first reported. And given we don't know at the connection level whether an event was already reported or not, we can't decide anymore to forcefully stop it if for any reason its processing gets delayed. The problem is magnified in issue #662 by the fact that a shutdown is reported with pending data occupying the buffer. The shutdown will strike in loops and cause the upper layer stream to be notified until it's handled, but with a buffer full it's not possible to call cs_recv() hence to purge the event. All this can only be handled optimally by implementing a lower layer, direct mux-to-mux forwarding that will not require any scheduling. This was no wake up will be needed and the event will be instantly handled or paused for a long time. For now let's simply revert these optimizations. Running a 1 MB transfer test over H2 using 8 connections having each 32 streams with a limited link of 320 Mbps shows the following profile before this fix: calls syscall (100% CPU) ------ ------- 259878 epoll_wait 519759 clock_gettime 17277 sendto 17129 recvfrom 672 epoll_ctl And the following one after the fix: calls syscall (2-3% CPU) ------ ------- 17201 sendto 17357 recvfrom 2304 epoll_wait 4609 clock_gettime 1200 epoll_ctl Thus the behavior is much better. No backport is needed as these patches were only in 2.2-dev. Many thanks to William Dauchy for reporting a lot of details around this difficult issue.	2020-06-17 17:00:51 +02:00
Olivier Houchard	b25970f896	BUILD: Fix build by including haproxy/global.h In srv/version.c, fix build by including haproxy/global.h, so that REGISTER_BUILD_OPTS is properly defined.	2020-06-16 23:36:04 +02:00
Willy Tarreau	7bf484ac91	MINOR: version: report the presence of the compiler's address sanitizer Since we've seen clang emit bad code when the address sanitizer is enabled at -O2, better clearly report it in the version output. It is detected both for clang and gcc (both tested with and without).	2020-06-16 19:14:19 +02:00
Willy Tarreau	6d4c81db96	MINOR: compiler: always define __has_feature() This macro is provided by clang but gcc lacks it. Not having it makes it painful to test features on both compilers. Better define it to zero when not available so that __has_feature(foo) never errors.	2020-06-16 19:13:24 +02:00
Willy Tarreau	88bd9ee6a3	MINOR: version: put the compiler version output into version.c not haproxy.c For an unknown reason in commit bb1b63c079 I placed the compiler version output in haproxy.c instead of version.c. Better have it in version.c which is more suitable to this sort of things.	2020-06-16 19:11:11 +02:00
Willy Tarreau	da21ed1662	BUG/MINOR: spoe: add missing key length check before checking key names The spoe parser fails to check that the decoded key length is large enough to match a given key but it uses the returned length in memcmp(). So returning "ver" could match "version" for example. In addition this makes clang 10's ASAN complain because the second argument to memcmp() is the static key which is shorter than the decoded buffer size, which in practice has no impact. I'm still not 100% sure the parser is entirely correct because even with this fix it cannot parse a key whose name matches the beginning of another one, but in practice this does not happen. Ideally a preliminary length check before the comparison would be safer. This needs to be backported as far as 1.7.	2020-06-16 18:25:40 +02:00
Tim Duesterhus	9f658a554f	BUG/MINOR: cfgparse: Add missing fatal++ in PARSE_ERR_HEX case This fixes up commit 32234e751320b60a3879f274d4a4753d7570e757. This patch should be backported whereever that commit is backported.	2020-06-16 18:25:40 +02:00
Willy Tarreau	2c4dfaeff6	MINOR: http: do not close connections anymore after internal responses Since we dropped support for legacy mode, it's not the stream which deals with the connection but the mux, and there's no point in closing the client connection after most internal status codes. For example if the client gets a 401 or a 503 because a server doesn't respond, it makes no sense forcing the connection to close after reporting this status, because it's already done by the mux if the client asks for it or is not compatible with keep-alive. This current state was inherited from the early days but is still limiting the amount of client-side connection reuse in a number of circumstances (typically server-side errors). This change was planned for 2.1 but forgotten. The status codes for which the connection is not closed anymore are those that do not depend on the client side connection itself, which are all except 400 and 408. This could be backported to 2.1 but not further, in order to make sure legacy and HTX behave strictly similarly.	2020-06-16 17:41:32 +02:00
Willy Tarreau	32234e7513	BUG/MEDIUM: cfgparse: stop after a reasonable amount of fatal error One issue with the config parser is that while it tries to report as many errors as possible at once, it's actually unbounded. Thus, when calling haproxy on a wrong file, it can take ages to process, such as here on half a gigabyte of map file instead of config file: $ time ./haproxy -c -f large.map 2>&1 \|wc -l 16777220 real 0m31.324s user 0m22.595s sys 0m28.909s This patch modifies readcfgfile() to stop reading the config file after a reasonable amount of fatal errors. This threshold is set to 50, which seems more than enough to spot a recurrent issue with a bit of context in a terminal to address several issues at once, without filling logs nor taking time to parse the file. The difference is clear now: $ time ./haproxy -c -f large.map 2>&1 \|wc -l 55 real 0m0.005s user 0m0.004s sys 0m0.003s This may be backported to older versions without causing too many difficulties. However the patch will not apply as-is, it will require to increment the "fatal" count for each place where ERR_FATAL is set in the parsing loop.	2020-06-16 17:19:01 +02:00
Willy Tarreau	9e1758efbd	BUG/MEDIUM: cfgparse: use parse_line() to expand/unquote/unescape config lines Issue 22689 in oss-fuzz shows that specially crafted config files can take a long time to process. This happens when variable expansion, backslash escaping or unquoting causes calls to memmove() and possibly to realloc() resulting in O(N^2) complexity with N following the line size. By using parse_line() we now have a safe parser that remains in O(N) regardless of the type of operation. Error reporting changed a little bit since the errors are not reported anymore from the deepest parsing level. As such we now report the beginning of the error. One benefit is that for many invalid character sequences, the original line is shown and the first bad char or sequence is designated with a caret ('^'), which tends to be visually easier to spot, for example: [ALERT] 167/170507 (14633) : parsing [mini5.cfg:19]: unmatched brace in environment variable name below: "${VAR"} ^ or: [ALERT] 167/170645 (14640) : parsing [mini5.cfg:18]: unmatched quote below: timeout client 10s' ^ In case the target buffer is too short for the new line, the output buffer is grown in 1kB chunks and kept till the end, so that it should not happen too often. Before this patch a test like below involving a 4 MB long line would take 138s to process, 98% of which were spent in __memmove_avx_unaligned_erms(), and now it takes only 65 milliseconds: $ perl -e 'print "\"\$A\""x1000000,"\n"' \| ./haproxy -c -f /dev/stdin 2>/dev/null This may be backported to stable versions after a long period of observation to be sure nothing broke. It relies on patch "MINOR: tools: add a new configurable line parse, parse_line()".	2020-06-16 17:07:02 +02:00
Willy Tarreau	c8d167bcfb	MINOR: tools: add a new configurable line parse, parse_line() This function takes on input a string to tokenize, an output storage (which may be the same) and a number of options indicating how to handle certain characters (single & double quote support, backslash support, end of line on '#', environment variables etc). On output it will provide a list of pointers to individual words after having possibly unescaped some character sequences, handled quotes and resolved environment variables, and it will also indicate a status made of: - a list of failures (overlap between src/dst, wrong quote etc) - the pointer to the first sequence in error - the required output length (a-la snprintf()). This allows a caller to freely unescape/unquote a string by using a pre-allocated temporary buffer and expand it as necessary. It takes extreme care at avoiding expensive operations and intentionally does not use memmove() when removing escapes, hence the reason for the different input and output buffers. The goal is to use it as the basis for the config parser.	2020-06-16 16:27:26 +02:00
Willy Tarreau	853926a9ac	BUG/MEDIUM: ebtree: use a byte-per-byte memcmp() to compare memory blocks As reported in issue #689, there is a subtle bug in the ebtree code used to compared memory blocks. It stems from the platform-dependent memcmp() implementation. Original implementations used to perform a byte-per-byte comparison and to stop at the first non-matching byte, as in this old example: https://www.retro11.de/ouxr/211bsd/usr/src/lib/libc/compat-sys5/memcmp.c.html The ebtree code has been relying on this to detect the first non-matching byte when comparing keys. This is made so that a zero-terminated string can fail to match against a longer string. Over time, especially with large busses and SIMD instruction sets, multi-byte comparisons have appeared, making the processor fetch bytes past the first different byte, which could possibly be a trailing zero. This means that it's possible to read past the allocated area for a string if it was allocated by strdup(). This is not correct and definitely confuses address sanitizers. In real life the problem doesn't have visible consequences. Indeed, multi-byte comparisons are implemented so that aligned words are loaded (e.g. 512 bits at once to process a cache line at a time). So there is no way such a multi-byte access will cross a page boundary and end up reading from an unallocated zone. This is why it was never noticed before. This patch addresses this by implementing a one-byte-at-a-time memcmp() variant for ebtree, called eb_memcmp(). It's optimized for both small and long strings and guarantees to stop after the first non-matching byte. It only needs 5 instructions in the loop and was measured to be 3.2 times faster than the glibc's AVX2-optimized memcmp() on short strings (1 to 257 bytes), since that latter one comes with a significant setup cost. The break-even seems to be at 512 bytes where both version perform equally, which is way longer than what's used in general here. This fix should be backported to stable versions and reintegrated into the ebtree code.	2020-06-16 11:30:33 +02:00
Tim Duesterhus	01a0ce39e2	BUG/MAJOR: vars: Fix bogus free() during deinit() for http-request rules We cannot simply `release_sample_expr(rule->arg.vars.expr)` for a `struct act_rule`, because `rule->arg` is a union that might not contain valid `vars`. This leads to a crash on a configuration using `http-request redirect` and possibly others: frontend http mode http bind 127.0.0.1:80 http-request redirect scheme https Instead a `struct act_rule` has a `release_ptr` that must be used to properly free any additional storage allocated. This patch fixes a regression in commit ff78fcdd7f15c8626c7e70add7a935221ee2920c. It must be backported to whereever that patch is backported. It has be verified that the configuration above no longer crashes. It has also been verified that the configuration in ff78fcdd7f15c8626c7e70add7a935221ee2920c does not leak.	2020-06-15 18:51:11 +02:00
Willy Tarreau	f3ca5a0273	BUILD: haproxy: mark deinit_and_exit() as noreturn Commit 0a3b43d9c ("MINOR: haproxy: Make use of deinit_and_exit() for clean exits") introduced this build warning: src/haproxy.c: In function 'main': src/haproxy.c:3775:1: warning: control reaches end of non-void function [-Wreturn-type] } ^ This is because the new deinit_and_exit() is not marked as "noreturn" so depending on the optimizations, the noreturn attribute of exit() will either leak through it and silence the warning or not and confuse the compiler. Let's just add the attribute to fix this. No backport is needed, this is purely 2.2.	2020-06-15 18:43:46 +02:00

1 2 3 4 5 ...

12292 Commits