haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-14 02:57:01 +02:00

Author	SHA1	Message	Date
William Lallemand	74f0ec3894	BUG/MINOR: mworker: ensure that we still quits with SIGINT Since the fix "BUG/MINOR: mworker: don't exit with an ambiguous value" we are leaving with a EXIT_SUCCESS upon a SIGINT. We still need to quit with a SIGINT when a worker leaves with a SIGINT. This is done this way because vtest expect a 130 during the process stop, haproxy without mworker returns a 130, so it should be the same in mworker mode. This should be backported in 1.9, with the previous patch ("BUG/MINOR: mworker: don't exit with an ambiguous value"). Code has moved, mworker_catch_sigchld() is in haproxy.c.	2019-04-16 18:14:29 +02:00
William Lallemand	4cf4b33744	BUG/MINOR: mworker: don't exit with an ambiguous value When the sigchld handler is called and waitpid() returns -1, the behavior of waitpid() with the status variable is undefined. It is not a good idea to exit with the value contained in it. Since this exit path does not use the exitcode variable, it means that this is an expected and successful exit. This should be backported in 1.9, code has moved, mworker_catch_sigchld() is in haproxy.c.	2019-04-16 18:14:29 +02:00
William Lallemand	32b6901550	BUG/MINOR: mworker: mworker_kill should apply on every children Commit `3f12887` ("MINOR: mworker: don't use children variable anymore") introduced a regression. The previous behavior was to send a signal to every children, whether or not they are former children. Instead of this, we only send a signal to the current children, so we don't try to kill -INT or -TERM all processes during a reload. No backport needed.	2019-04-16 18:14:29 +02:00
Willy Tarreau	85d0424b20	BUG/MINOR: listener/mq: correctly scan all bound threads under low load When iterating on the CLI using "show activity" and no other load, it was visible that the last thread was always skipped. This was caused by the way the thread bits were walking : t1 was updated after t2 to make sure it never equals t2 (thus it skips t2), and in case of a tie we choose t1. This results in the chosen thread never to equal t2 unless the other ones already have one connection. In addition to this, t2 was recalulated upon each pass due to the fact that only the 31th bit was looked at instead of looking at the t2'th bit. This patch fixes this by updating t2 after t1 so that t1 is free to walk over all positions under equal load. No measurable performance gains are expected from this though, but it at least removes one strange indicator which could lead to some suspicion. No backport is needed.	2019-04-16 18:09:13 +02:00
Willy Tarreau	636848aa86	MINOR: init: add a "set-dumpable" global directive to enable core dumps It's always a pain to get a core dump when enabling user/group setting (which disables the dumpable flag on Linux), when using a chroot and/or when haproxy is started by a service management tool which requires complex operations to just raise the core dump limit. This patch introduces a new "set-dumpable" global directive to work around these troubles by doing the following : - remove file size limits (equivalent of ulimit -f unlimited) - remove core size limits (equivalent of ulimit -c unlimited) - mark the process dumpable again (equivalent of suid_dumpable=1) Some of these will depend on the operating system. This way it becomes much easier to retrieve a core file. Temporarily moving the chroot to a user-writable place generally enough.	2019-04-16 14:31:23 +02:00
William Lallemand	482f9a9a2f	MINOR: mworker: export HAPROXY_MWORKER=1 when running in mworker mode Export HAPROXY_MWORKER=1 in an environment variable when running in mworker mode.	2019-04-16 13:26:43 +02:00
William Lallemand	620072bc0d	MINOR: cli: don't add a semicolon at the end of HAPROXY_CLI Only add the semicolon when there is several CLI in HAPROXY_CLI and HAPROXY_MASTER_CLI.	2019-04-16 13:26:43 +02:00
William Lallemand	9a37fd0f19	MEDIUM: mworker/cli: export the HAPROXY_MASTER_CLI variable It works the same way as the HAPROXY_CLI variable, it exports the listeners addresses separated by semicolons.	2019-04-16 13:26:43 +02:00
William Lallemand	8f7069a389	CLEANUP: mworker: remove the type field in mworker_proc Since the introduction of the options field, we can use it to store the type of process. type = 'm' is replaced by PROC_O_TYPE_MASTER type = 'w' is replaced by PROC_O_TYPE_WORKER type = 'e' is replaced by PROC_O_TYPE_PROG The old values are still used in the HAPROXY_PROCESSES environment variable to pass the information during a reload.	2019-04-16 13:26:43 +02:00
William Lallemand	bd3de3efb7	MEDIUM: mworker-prog: implements 'option start-on-reload' This option is already the default, but its opposite 'no option start-on-reload' allows the master to keep a previous instance of a program and don't start a new one upon a reload. The old program will then appear as a current one in "show proc" and could also trigger an exit-on-failure upon a segfault.	2019-04-16 13:26:43 +02:00
William Lallemand	4528611ed6	MEDIUM: mworker: store the leaving state of a process Previously we were assuming than a process was in a leaving state when its number of reload was greater than 0. With mworker programs it's not the case anymore so we need to store a leaving state.	2019-04-16 13:26:43 +02:00
Willy Tarreau	9df86f997e	BUG/MAJOR: lb/threads: fix insufficient locking on round-robin LB Maksim Kupriianov reported very strange crashes in fwrr_update_position() which didn't make sense because of an apparent divide overflow except that the value was not null in the core. It happens that while the locking is correct in all the functions' call graph, the uppermost one (fwrr_get_next_server()) incorrectly expected that its target server was already locked when called. This stupid assumption causd the server lock not to be held when calling the other ones, explaining how it was possible to change the server's eweight by calling srv_lb_commit_status() under the server lock yet collide with its unprotected usage. This commit makes sure that fwrr_get_server_from_group() retrieves a locked server and that fwrr_get_next_server() is responsible for unlocking the server before returning it. There is one subtlety in this function which is that it builds a list of avoided servers that were full while scanning the tree, and all of them are queued in a full state so they must be unlocked upon return. Many thanks to Maksim for providing detailed info allowing to narrow down this bug. This fix must be backported to 1.9. In 1.8 the lock seems much wider and changes to the server's state are performed under the rendez-vous point so this it doesn't seem possible that it happens there.	2019-04-16 11:21:14 +02:00
Fr�d�ric L�caille	95679dc096	MINOR: peers: Add a new command to the CLI for peers. Implements "show peers [peers section]" new CLI command to dump information about the peers and their stick-tables to be synchronized and others internal. May be backported as far as 1.5.	2019-04-16 09:58:40 +02:00
Willy Tarreau	6f7a02a381	BUILD: htx: fix a used uninitialized warning on is_cookie2 gcc-3.4 reports this which actually looks like a valid warning when looking at the code, it's unsure why others didn't notice it : src/proto_htx.c: In function `htx_manage_server_side_cookies': src/proto_htx.c:4266: warning: 'is_cookie2' might be used uninitialized in this function	2019-04-15 21:55:48 +02:00
Willy Tarreau	8de1df92a3	BUILD: do not specify "const" on functions returning structs or scalars Older compilers (like gcc-3.4) warn about the use of "const" on functions returning a struct, which makes sense since the return may only be copied : include/common/htx.h:233: warning: type qualifiers ignored on function return type Let's simply drop "const" here.	2019-04-15 21:55:48 +02:00
Willy Tarreau	0e492e2ad0	BUILD: address a few cases of "static <type> inline foo()" Older compilers don't like to see "inline" placed after the type in a function declaration, it must be "static inline <type>" only. This patch touches various areas. The warnings were seen with gcc-3.4.	2019-04-15 21:55:48 +02:00
Olivier Houchard	998410a41b	BUG/MEDIUM: h2: Revamp the way send subscriptions works. Instead of abusing the SUB_CALL_UNSUBSCRIBE flag, revamp the H2 code a bit so that it just checks if h2s->sending_list is empty to know if the tasklet of the stream_interface has been waken up or not. send_wait is now set to NULL in h2_snd_buf() (ideally we'd set it to NULL as soon as we're waking the tasklet, but it can't be done, because we still need it in case we have to remove the tasklet from the task list).	2019-04-15 19:27:57 +02:00
Olivier Houchard	9a0f559676	BUG/MEDIUM: h2: Make sure we're not already in the send_list in h2_subscribe(). In h2_subscribe(), don't add ourself to the send_list if we're already in it. That may happen if we try to send and fail twice, as we're only removed from the send_list if we managed to send data, to promote fairness. Failing to do so can lead to either an infinite loop, or some random crashes, as we'd get the same h2s in the send_list twice. This should be backported to 1.9.	2019-04-15 19:27:57 +02:00
Olivier Houchard	0e0793715c	BUG/MEDIUM: muxes: Make sure we unsubcribed when destroying mux ctx. In the h1 and h2 muxes, make sure we unsubscribed before destroying the mux context. Failing to do so will lead in a segfault later, as the connection will attempt to dereference its conn->send_wait or conn->recv_wait, which pointed to the now-free'd mux context. This was introduced by commit `39a96ee16e`, so should only be backported if that commit gets backported.	2019-04-15 19:27:57 +02:00
Willy Tarreau	e61828449c	BUILD: cli/threads: fix build in single-threaded mode Commit `a8f57d51a` ("MINOR: cli/activity: report the accept queue sizes in "show activity"") broke the single-threaded build because the accept-rings are not implemented there. Let's ifdef this out. Ideally we should start to think about always having such elements initialized even without threads to improve the test coverage.	2019-04-15 18:55:31 +02:00
Willy Tarreau	3466e3cdcb	BUILD: task/thread: fix single-threaded build of task.c As expected, commit `cde7902ac` ("MEDIUM: tasks: improve fairness between the local and global queues") broke the build with threads disabled, and I forgot to rerun this test before committing. No backport is needed.	2019-04-15 18:52:40 +02:00
Nenad Merdanovic	646b7741bc	BUG/MEDIUM: map: Fix memory leak in the map converter The allocated trash chunk is not freed properly and causes a memory leak exhibited as the growth in the trash pool allocations. Bug was introduced in commit 271022 (BUG/MINOR: map: fix map_regm with backref). This should be backported to all branches where the above commit was backported.	2019-04-15 09:53:46 +02:00
Willy Tarreau	c8da044b41	MINOR: tasks: restore the lower latency scheduling when niced tasks are present In the past we used to reduce the number of tasks consulted at once when some niced tasks were present in the run queue. This was dropped in 1.8 when the scheduler started to take batches. With the recent fixes it now becomes possible to restore this behaviour which guarantees a better latency between tasks when niced tasks are present. Thanks to this, with the default number of 200 for tune.runqueue-depth, with a parasitic load of 14000 requests per second, nice 0 gives 14000 rps, nice 1024 gives 12000 rps and nice -1024 gives 16000 rps. The amplitude widens if the runqueue depth is lowered.	2019-04-15 09:50:56 +02:00
Willy Tarreau	2d1fd0a0d2	MEDIUM: tasks: only base the nice offset on the run queue depth The offset calculated for the nice value used to be wrong for a long time and got even worse when the improved multi-thread sheduler was implemented because it continued to rely on the run queue size, which become irrelevant given that we extract tasks in batches, so the run queue size moves following a sawtooth form. However the offsets much better reflects insertion positions in the queue, so it's worth dropping this rq_size component of the equation. Last point, due to the batches made of runqueue-depth entries at once, the higher the depth, the lower the effect of the nice setting since values are picked together in batches and placed into a list. An intuitive approach consists in multiplying the nice value with the batch size to allow tasks to participate to a different batch. And experimentation shows that this works pretty well. With a runqueue-depth of 16 and a parasitic load of 16000 requests per second on 100 streams, a default nice of 0 shows 16000 requests per second for nice 0, 22000 for nice -1024 and 10000 for nice 1024. The difference is even bigger with a runqueue depth of 5. At 200 however it's much smoother (16000-22000).	2019-04-15 09:50:56 +02:00
Willy Tarreau	cde7902ac9	MEDIUM: tasks: improve fairness between the local and global queues Tasks allowed to run on multiple threads, as well as those scheduled by one thread to run on another one pass through the global queue. The local queues only see tasks scheduled by one thread to run on itself. The tasks extracted from the global queue are transferred to the local queue when they're picked by one thread. This causes a priority issue because the global tasks experience a priority contest twice while the local ones experience it only once. Thus if a tasks returns still running, it's immediately reinserted into the local run queue and runs much faster than the ones coming from the global queue. Till 1.9 the tasks going through the global queue were mostly : - health checks initialization - queue management - listener dequeue/requeue These ones are moderately sensitive to unfairness so it was not that big an issue. Since 2.0-dev2 with the multi-queue accept, tasks are scheduled to remote threads on most accept() and it becomes fairly visible under load that the accept slows down, even for the CLI. This patch remedies this by consulting both the local and the global run queues in parallel and by always picking the task whose deadline is the earliest. This guarantees to maintain an excellent fairness between the two queues and removes the cascade effect experienced by the global tasks. Now the CLI always continues to respond quickly even in presence of expensive tasks running for a long time. This patch may possibly be backported to 1.9 if some scheduling issues are reported but at this time it doesn't seem necessary.	2019-04-15 09:50:56 +02:00
Willy Tarreau	24f382f555	CLEANUP: task: do not export rq_next anymore This one hasn't been used anymore since the scheduler changes after 1.8 but it kept being exported and maintained up to date while it's always reset when scanning the trees. Let's stop exporting it and updating it.	2019-04-15 09:50:56 +02:00
Christopher Faulet	61840e715f	BUG/MEDIUM: muxes: Don't dereference mux context if null in release functions When a mux context is released, we must be sure it exists before dereferencing it. The bug was introduced in the commit `39a96ee16` ("MEDIUM: muxes: Be prepared to don't own connection during the release"). No need to backport this patch, expect if the commit `39a96ee16` is backported too.	2019-04-15 09:47:10 +02:00
Christopher Faulet	1d2b586cdd	MAJOR: htx: Enable the HTX mode by default for all proxies The legacy HTTP mode is no more the default one. So now, by default, without any option in your configuration, all proxies will use the HTX mode. The line "option http-use-htx" in proxy sections are now useless, except to cancel the legacy HTTP mode. To fallback on legacy HTTP mode, you should use the line "no option http-use-htx" explicitly. Note that the reg-tests still work by default on legacy HTTP mode. The HTX will be enabled by default in a futur commit.	2019-04-12 22:06:53 +02:00
Christopher Faulet	0ef372a390	MAJOR: muxes/htx: Handle inplicit upgrades from h1 to h2 The upgrade is performed when an H2 preface is detected when the first request on a connection is parsed. The CS is destroyed by setting EOS flag on it. A special flag is added on the HTX message to warn the HTX analyzers the stream will be closed because of an upgrade. This way, no error and no log are emitted. When the mux h1 is released, we create a mux h2, without any CS and passing the buffer with the unparsed H2 preface.	2019-04-12 22:06:53 +02:00
Christopher Faulet	bbe685452f	MAJOR: proxy/htx: Handle mux upgrades from TCP to HTTP in HTX mode It is now possible to upgrade TCP streams to HTX when an HTTP backend is set for a TCP frontend (both with the HTX enabled). So concretely, in such case, an upgrade is performed from the mux pt to the mux h1. The current CS and the channel's buffer are used to initialize the mux h1.	2019-04-12 22:06:53 +02:00
Christopher Faulet	eb7098035c	MEDIUM: htx: Allow the option http-use-htx to be used on TCP proxies too This will be mandatory to allow upgrades from TCP to HTTP in HTX. Of course, raw buffers will still be used by default on TCP proxies, this option sets or not. But if you want to handle mux upgrades from a TCP proxy, you must enable the HTX on it and on all its backends. There is only a small change in the lua code. Because TCP proxies can be HTX aware, to exclude TCP services only for HTTP proxies, we must also check the mode (TCP/HTTP) now.	2019-04-12 22:06:53 +02:00
Christopher Faulet	39a96ee16e	MEDIUM: muxes: Be prepared to don't own connection during the release This happens during mux upgrades. In such case, when the destroy() callback is called, the connection points to a different mux's context than the one passed to the callback. It means the connection is owned by another mux. The old mux is then released but the connection is not closed.	2019-04-12 22:06:53 +02:00
Christopher Faulet	73c1207c71	MINOR: muxes: Pass the context of the mux to destroy() instead of the connection It is mandatory to handle mux upgrades, because during a mux upgrade, the connection will be reassigned to another multiplexer. So when the old one is destroyed, it does not own the connection anymore. Or in other words, conn->ctx does not point to the old mux's context when its destroy() callback is called. So we now rely on the multiplexer context do destroy it instead of the connection. In addition, h1_release() and h2_release() have also been updated in the same way.	2019-04-12 22:06:53 +02:00
Christopher Faulet	51f73eb11a	MEDIUM: muxes: Add an optional input buffer during mux initialization The mux's callback init() now take a pointer to a buffer as extra argument. It must be used by the multiplexer as its input buffer. This buffer is always NULL when a multiplexer is initialized with a fresh connection. But if a mux upgrade is performed, it may be filled with existing data. Note that, for now, mux upgrades are not supported. But this commit is mandatory to do so.	2019-04-12 22:06:53 +02:00
Christopher Faulet	e9b7072e9e	MINOR: muxes: Rely on conn_is_back() during init to handle front/back conn Instead of using the connection context to make the difference between a frontend connection and a backend connection, we now rely on the function conn_is_back().	2019-04-12 22:06:53 +02:00
Christopher Faulet	0f17a9b510	MINOR: filters/htx: Use stream flags instead of px mode to instanciate a filter In the function flt_stream_add_filter(), if the HTX is enabled, before attaching a filter to a stream, we test if the filter can handle it or not. If not, the filter is ignored. Before the proxy mode was tested. Now we test if the stream is an HTX stream or not.	2019-04-12 22:06:53 +02:00
Christopher Faulet	eca8854555	MINOR: http_fetch/htx: Use stream flags instead of px mode in smp_prefetch_htx In the function smp_prefetch_htx(), we must know if data in the channel's buffer are structured or not. Before the proxy mode was tested. Now we test if the stream is an HTX stream or not. If yes, we know the HTX is used to structure data in the channel's buffer.	2019-04-12 22:06:53 +02:00
Christopher Faulet	0e160ff5bb	MINOR: stream: Set a flag when the stream uses the HTX The flag SF_HTX has been added to know when a stream uses the HTX or not. It is set when an HTX stream is created. There are 2 conditions to set it. The first one is when the HTTP frontend enables the HTX. The second one is when the attached conn_stream uses an HTX multiplexer.	2019-04-12 22:06:53 +02:00
Christopher Faulet	9f38f5aa80	MINOR: muxes: Add a flag to specify a multiplexer uses the HTX A multiplexer must now set the flag MX_FL_HTX when it uses the HTX to structured the data exchanged with channels. the muxes h1 and h2 set this flag. Of course, for the mux h2, it is set on h2_htx_ops only.	2019-04-12 22:06:53 +02:00
Christopher Faulet	9b579106fe	MINOR: mux-h2: Add a mux_ops dedicated to the HTX mode Instead of using the same mux_ops structure for the legacy HTTP mode and the HTX mode, a dedicated mux_ops is now used for the HTX mode. Same callbacks are used for both. But the flags may be different depending on the mode used.	2019-04-12 22:06:53 +02:00
Christopher Faulet	7f36636c21	BUG/MINOR: mux-h1: Handle the flag CS_FL_KILL_CONN during a shutdown read/write This flag is used to explicitly kill the connection when the CS is closed. It may be set by tcp rules. It must be respect by the mux-h1. This patch must be backported to 1.9.	2019-04-12 22:06:53 +02:00
Christopher Faulet	14c91cfdf8	MINOR: mux-h1: Don't release the conn_stream anymore when h1s is destroyed An H1 stream is destroyed when the conn_stream is detached or when the H1 connection is destroyed. In the first case, the CS is released by the caller. In the second one, because the connection is closed, no CS is attached anymore. In both, there is no reason to release the conn_stream in h1s_destroy().	2019-04-12 22:06:53 +02:00
Christopher Faulet	b992af00b6	MEDIUM: mux-h1: Simplify the connection mode management by sanitizing headers Connection headers are now sanitized during the parsing and the formatting. This means "close" and "keep-alive" values are always removed but right flags are set. This way, client side and server side are independent of each other. On the input side, after the parsing, neither "close" nor "keep-alive" values remain. So on the output side, if we found one of these values in a connection headers, it means it was explicitly added by HAProxy. So it overwrites the other rules, if applicable. Always sanitizing the output is also a way to simplifiy conditions to update the connection header. Concretly, only additions of "close" or "keep-alive" values remain, depending the case. No need to backport this patch.	2019-04-12 22:06:53 +02:00
Christopher Faulet	a51ebb7f56	MEDIUM: h1: Add an option to sanitize connection headers during parsing The flag H1_MF_CLEAN_CONN_HDR has been added to let the H1 parser sanitize connection headers. It means it will remove all "close" and "keep-alive" values during the parsing. One noticeable effect is that connection headers may be unfolded. In practice, this is not a problem because it is not frequent to have multiple values for the connection headers. If this flag is set, during the parsing The function h1_parse_next_connection_header() is called in a loop instead of h1_parse_conection_header(). No need to backport this patch	2019-04-12 22:06:53 +02:00
Christopher Faulet	b829f4c726	MINOR: stats/htx: Don't add "Connection: close" header anymore in stats responses On the client side, as far as possible, we will try to keep connection alive. So, in most of cases, this header will be removed. So it is better to not add it at all. If finally the connection must be closed, the header will be added by the mux h1. No need to backport this patch.	2019-04-12 22:06:53 +02:00
Christopher Faulet	cdc90e9175	MINOR: mux-h1: Simplify handling of 1xx responses Because of previous changes on http tunneling, the synchronization of the transaction can be simplified. Only the check on intermediate messages remains and it only concerns the response path. This patch must be backported to 1.9. It is not strictly speaking required but it will ease futur backports.	2019-04-12 22:06:53 +02:00
Christopher Faulet	c62c2b9d92	BUG/MEDIUM: htx: Fix the process of HTTP CONNECT with h2 connections In HTX, the HTTP tunneling does not work if h1 and h2 are mixed (an h1 client sending requests to an h2 server or this opposite) because the h1 multiplexer always adds an EOM before switching it to tunnel mode. The h2 multiplexer interprets it as an end of stream, closing the stream as for any other transaction. To make it works again, we need to swith to the tunnel mode without emitting any EOM blocks. Because of that, HTX analyzers have been updated to switch the transaction to tunnel mode before end of the message (because there is no end of message...). To be consistent, the protocol switching is also handled the same way even though the 101 responses are not supported in h2. This patch must be backported to 1.9.	2019-04-12 22:06:53 +02:00
Christopher Faulet	03b9d8ba4a	MINOR: proto_htx: Don't adjust transaction mode anymore in HTX analyzers Because the option http-tunnel is now ignored in HTX, there is no longer any need to adjust the transaction mode in HTX analyzers. A channel can still be switch to the tunnel mode for legitimate cases (HTTP CONNECT or switching protocols). So the function htx_adjust_conn_mode() is now useless. This patch must be backported to 1.9. It is not strictly speaking required but it will ease futur backports.	2019-04-12 22:06:53 +02:00
Christopher Faulet	6c9bbb2265	MEDIUM: htx: Deprecate the option 'http-tunnel' and ignore it in HTX The option http-tunnel disables any HTTP processing past the first transaction. In HTX, it works for full h1 transactions. As for the legacy HTTP, it is a workaround, but it works. But it is impossible to make it works with an h2 connection. In such case, it has no effect, the stream is closed at the end of the transaction. So to avoid any inconsistancies between h1 and h2 connections, this option is now always ignored when the HTX is enabled. It is also a good opportinity to deprecate an old and ugly option. A warning is emitted during HAProxy startup to encourage users to remove this option. Note that in legacy HTTP, this option only works with full h1 transactions too. If an h2 connection is established on a frontend with this option enabled, it will have no effect at all. But we keep it for the legacy HTTP for compatibility purpose. It will be removed with the legacy HTTP. So to be short, if you have to really (REALLY) use it, it will only work for legacy HTTP frontends with H1 clients. The documentation has been updated accordingly. This patch must be backported to 1.9. It is not strictly speaking required but it will ease futur backports.	2019-04-12 22:06:53 +02:00
Christopher Faulet	f1449b785e	BUG/MEDIUM: htx: Don't crush blocks payload when append is done on a data block If there is a data block when a header block is added in a HTX message, its payload will be inserted after the data block payload. But its index will be moved before the EOH block. So at this stage, if a new data block is added, we will try to append its payload to the last data block (because it is also the tail). Thus the payload of the further header block will be crushed. This cannot happens if the payloads wrap thanks to the previous fix. But it happens when the tail is not the front too. So now, in this case, we add a new block instead of appending. This patch must be backported in 1.9.	2019-04-12 22:06:45 +02:00
Christopher Faulet	05aab64b06	BUG/MEDIUM: htx: Defrag if blocks position is changed and the payloads wrap When a header is added or when a data block is added before another one, the blocks position may be changed (but not their payloads position). For instance, when a header is added, we move the block just before the EOH, if any. When the payloads wraps, it is pretty annoying because we loose the last inserted block. It is neither the tail nor the head. And it is not the front either. It is a design problem. Waiting for fixing this problem, we force a defragmentation in such case. Anyway, it should be pretty rare, so it's not really critical. This patch must be backported to 1.9.	2019-04-12 21:34:30 +02:00
Christopher Faulet	63263e50ed	BUG/MINOR: spoe: Be sure to set tv_request when each message fragment is encoded When a message or a fragment is encoded, the date the frame processing starts must be set if it is undefined. The test on tv_request field was wrong. This patch must be backported to 1.9.	2019-04-12 21:33:52 +02:00
Christopher Faulet	a715ea82ea	BUG/MEDIUM: spoe: Return an error if nothing is encoded for fragmented messages If the maximum frame size is very small with a large message or argument name, it is possible to be unable to encode anything. In such case, it is important to stop processing returning an error otherwise we will retry in loop to encode the message, failing each time because of the too small frame size. This patch must be backported to 1.9 and 1.8.	2019-04-12 16:38:54 +02:00
Christopher Faulet	3e86cec05e	BUG/MEDIUM: spoe: Queue message only if no SPOE applet is attached to the stream If a SPOE applet is already attached to a stream to handle its messages, we must not queue them. Otherwise it could be handled by another applet leading to errors. This happens with fragmented messages only. When the first framgnent is sent, the SPOE applet sending it is attached to the stream. It should be used to send all other fragments. This patch must be backported to 1.9 and 1.8.	2019-04-12 16:38:54 +02:00
Willy Tarreau	a8f57d51a0	MINOR: cli/activity: report the accept queue sizes in "show activity" Seeing the size of each ring helps understand which threads are overloaded and why some of them are less often elected than others by the multi-queue load balancer.	2019-04-12 15:54:15 +02:00
Willy Tarreau	64a9c05f37	MINOR: cli/listener: report the number of accepts on "show activity" The "show activity" command reports the number of incoming connections dispatched per thread but doesn't report the number of connections received by each thread. It is important to be able to monitor this value as it can show that for whatever reason a smaller set of threads is receiving the connections and dispatching them to all other ones.	2019-04-12 15:54:15 +02:00
Willy Tarreau	0d858446b6	BUG/MINOR: listener: renice the accept ring processing task It is not acceptable that the accept queues are handled with a normal priority since they are supposed to quickly dispatch the incoming traffic, resulting in tasks which will have their respective nice values and place in the queue. Let's renice the accept ring tasks to -1024. No backport is needed, this is strictly 2.0.	2019-04-12 15:54:03 +02:00
Willy Tarreau	587a8130b1	BUG/MINOR: tasks: make sure the first task to be queued keeps its nice value The run queue offset computed from the nice value depends on the run queue size, but for the first task to enter the run queue, this size is zero and the task gets queued just as if its nice value was zero as well. This is problematic for example for the CLI socket if another higher priority task gets queued immediately after as it can steal its place. This patch simply adds one to the rq_size value to make sure the nice is never multiplied by zero. The way the offset is calculated is questionable anyway these days, since with the newer scheduler it seems that just using the nice value as an offset should work (possibly damped by the task's number of calls). This fix must be backported to 1.9. It may possibly be backported to older versions if it proves to make the CLI more interactive.	2019-04-12 15:54:02 +02:00
Willy Tarreau	f8bce3125e	BUG/MEDIUM: task/threads: address a fairness issue between local and global tasks It is possible to hit a fairness issue in the scheduler when a local task runs for a long time (i.e. process_stream() returns running), and a global task wants to run on the same thread and remains in the global queue. What happens in this case is that the condition to extract tasks from the global queue will rarely be satisfied for very low task counts since whatever non-null queue size multiplied by a thread count >1 is always greater than the small remaining number of tasks in the queue. In theory another thread should pick the task but we do have some mono threaded tasks in the global queue as well during inter-thread wakeups. Note that this can only happen with task counts lower than the thread counts, typically one task in each queue for more than two threads. This patch works around the problem by allowing a very small unfairness, making sure that we can always pick at least one task from the global queue even if there is already one in the local queue. A better approach will consist in scanning the two trees in parallel and always pick the best task. This will be more complex and will constitute a separate patch. This fix must be backported to 1.9.	2019-04-12 15:53:43 +02:00
Olivier Houchard	b2fc04ebef	BUG/MEDIUM: stream_interface: Don't bother doing chk_rcv/snd if not connected. If the interface is not in state SI_ST_CON or SI_ST_EST, don't bother trying to send/recv data, we can't do it anyway, and if we're in SI_ST_TAR, that may lead to adding the SI_FL_ERR flag back on the stream_interface, while we don't want it. This should be backported to 1.9.	2019-04-12 13:14:55 +02:00
Olivier Houchard	56897e20a3	BUG/MEDIUM: streams: Only re-run process_stream if we're in a connected state. In process_stream(), only try again when there's the SI_FL_ERR flag and we're in a connected state, otherwise we can loop forever. It used to work because si_update_both() bogusly removed the SI_FL_ERR flag, and it would never be set at this point. Now it does, so take that into account. Many, many thanks to Maciej Zdeb for reporting the problem, and helping investigating it. This should be backported to 1.9.	2019-04-12 13:14:48 +02:00
Emmanuel Hocdet	2b4edfb0bd	MINOR: ssl: Activate aes_gcm_dec converter for BoringSSL BoringSSL can support it, no need to disable.	2019-04-11 15:00:13 +02:00
Robin H. Johnson	543d4507ca	MINOR: skip get_gmtime where tm is unused For LOG_FMT_TS (%Ts), the tm variable is not used, so save some cycles on the call to get_gmtime. Backport: 1.9 1.8 Signed-off-by: Robin H. Johnson <rjohnson@digitalocean.com>	2019-04-11 14:58:32 +02:00
Willy Tarreau	0f93672dfe	BUG/MEDIUM: pattern: assign pattern IDs after checking the config validity Pavlos Parissis reported an interesting case where some map identifiers were not assigned (appearing as -1 in show map). It turns out that it only happens for log-format expressions parsed in check_config_validity() that involve maps (log-format, use_backend, unique-id-header), as in the sample configuration below : frontend foo bind :8001 unique-id-format %[src,map(addr.lst)] log-format %[src,map(addr.lst)] use_backend %[src,map(addr.lst)] The reason stems from the initial introduction of unique IDs in 1.5 via commit `af5a29d5f` ("MINOR: pattern: Each pattern is identified by unique id.") : the unique_id assignment was done before calling check_config_validity() so all maps loaded after this call are not properly configured. From what the function does, it seems they will not be able to use a cache, will not have a unique_id assigned and will not be updatable from the CLI. This fix must be backported to all supported versions.	2019-04-11 14:52:25 +02:00
Olivier Houchard	46453d3f7d	MINOR: threads: Implement thread_cpus_enabled() for FreeBSD. Use cpuset_getaffinity() to implement thread_cpus_enabled() on FreeBSD, so that we can know the number of CPUs available, and automatically launch as much threads if nbthread isn't specified.	2019-04-11 00:09:22 +02:00
Olivier Houchard	86dcad6c62	BUG/MEDIUM: stream: Don't clear the stream_interface flags in si_update_both. In commit `d7704b534`, we introduced and expiration flag on the stream interface, which is used for the connect, the queue and the turn around. Because the turn around state isn't an error, the flag was reset in process_stream(), and later in commit `cff6411f9` when introducing the SI_FL_ERR flag, the cleanup of the flag at this place was erroneously generalized. To fix this, the SI_FL_EXP flag is only cleared at the end of the turn around state, and nobody should clear the stream interface flags anymore. This should be backported to 1.9, it has no known impact on older versions.	2019-04-09 19:31:22 +02:00
Olivier Houchard	120f64a8c4	BUG/MEDIUM: streams: Store prev_state before calling si_update_both(). As si_update_both() sets prev_state to state for each stream_interface, if we want to check it changed, copy it before calling si_update_both(). This should be backported to 1.9.	2019-04-09 19:31:22 +02:00
Olivier Houchard	39cc020af1	BUG/MEDIUM: streams: Don't remove the SI_FL_ERR flag in si_update_both(). Don't inconditionally remove the SI_FL_ERR code in si_update_both(), which is called at the end of process_stream(). Doing so was a bug that was there since the flag was introduced, because we were always setting si->flags to SI_FL_NONE, however we don't want to lose that one, except if we will retry connecting, so only remove it in sess_update_st_cer(). This should be backported to 1.9.	2019-04-09 19:31:22 +02:00
Willy Tarreau	90caa07935	BUG/MEDIUM: htx: fix random premature abort of data transfers It can happen in some cases that the last block of an H2 transfer over HTX is truncated. This was tracked down to a leftover of an earlier implementation of htx_xfer_blks() causing the computed size of a block to be incorrectly calculated if a data block doesn't completely fit into the target buffer. In practice it causes the EOM block to be attempted to be emitted with a wrong size and the message to be truncated. One way to reproduce this is to chain two haproxy instances in h1->h2->h1 with httpterm as the server and h2load as the client, making many requests between 8 and 10kB over a single connection. Usually one of the very first requests will fail. This fix must be backported to 1.9.	2019-04-09 16:30:20 +02:00
Olivier Houchard	3ca18bf0bd	BUG/MEDIUM: h2: Don't attempt to recv from h2_process_demux if we subscribed. Modify h2c_restart_reading() to add a new parameter, to let it know if it should consider if the buffer isn't empty when retrying to read or not, and call h2c_restart_reading() using 0 as a parameter from h2_process_demux(). If we're leaving h2_process_demux() with a non-empty buffer, it means the frame is incomplete, and we're waiting for more data, and if we already subscribed, we'll be waken when more data are available. Failing to do so means we'll be waken up in a loop until more data are available. This should be backported to 1.9.	2019-04-05 16:03:54 +02:00
Emeric Brun	9ef2ad7844	BUG/MEDIUM: peers: fix a case where peer session is not cleanly reset on release. The deinit took place in only peer_session_release, but in the a case of a previous call to peer_session_forceshutdown, the session cursors won't be reset, resulting in a bad state for new session of the same peer. For instance, a table definition message could be dropped and so all update messages will be dropped by the remote peer. This patch move the deinit processing directly in the force shutdown funtion. Killed session remains in "ST_END" state but ref on peer was reset to NULL and deinit will be skipped on session release function. The session release continue to assure the deinit for "active" sessions. This patch should be backported on all stable version since proto peers v2.	2019-04-03 14:42:10 +02:00
Christopher Faulet	aed68d4390	BUG/MINOR: proto_htx: Reset to_forward value when a message is set to DONE Because we try to forward infinitly message body, when its state is set to DONE, we must be sure to reset to_foward value of the corresponding channel. Otherwise, some errors can be errornously triggered. No need to backport this patch.	2019-04-01 15:43:40 +02:00
William Lallemand	33d29e2a11	MINOR: cli: export HAPROXY_CLI environment variable Export the HAPROXY_CLI environment variable which contains the list of all stats sockets (including the sockpair@) separated by semicolons.	2019-04-01 14:45:37 +02:00
William Lallemand	e58915f07f	MINOR: cli: start addresses by a prefix in 'show cli sockets' Displays a prefix for every addresses in 'show cli sockets'. It could be 'unix@', 'ipv4@', 'ipv6@', 'abns@' or 'sockpair@'. Could be backported in 1.9 and 1.8.	2019-04-01 14:45:37 +02:00
William Lallemand	75812a7a3c	BUG/MINOR: cli: correctly handle abns in 'show cli sockets' The 'show cli sockets' was not handling the abns sockets. This is a problem since it uses the AF_UNIX family, it displays nothing in the path column because the path starts by \0. Should be backported to 1.9 and 1.8.	2019-04-01 14:45:37 +02:00
William Lallemand	ad53d6dd75	MINOR: mworker/cli: show programs in 'show proc' Show the programs in 'show proc' Example: # programs 2285 dataplane-api - 0 0d 00h00m12s # old programs 2261 dataplane-api - 1 0d 00h00m53s	2019-04-01 14:45:37 +02:00
William Lallemand	9a1ee7ac31	MEDIUM: mworker-prog: implement program for master-worker This patch implements the external binary support in the master worker. To configure an external process, you need to use the program section, for example: program dataplane-api command ./dataplane_api Those processes are launched at the same time as the workers. During a reload of HAProxy, those processes are dealing with the same sequence as a worker: - the master is re-executed - the master sends a USR1 signal to the program - the master launches a new instance of the program During a stop, or restart, a SIGTERM is sent to the program.	2019-04-01 14:45:37 +02:00
William Lallemand	88dc7c5de9	REORG: mworker/cli: move CLI functions to mworker.c Move the CLI functions of the master worker to mworker.c	2019-04-01 14:45:37 +02:00
William Lallemand	3f12887ffa	MINOR: mworker: don't use children variable anymore The children variable is still used in haproxy, it is not required anymore since we have the information about the current workers in the mworker_proc linked list. The oldpids array is also replaced by this linked list when we generated the arguments for the master reexec.	2019-04-01 14:45:37 +02:00
William Lallemand	f3a86831ae	MINOR: mworker: calloc mworker_proc structures Initialize mworker_proc structures to 0 with calloc instead of just doing a malloc.	2019-04-01 14:45:37 +02:00
William Lallemand	9001ce8c2f	REORG: mworker: move mworker_cleanlisteners to mworker.c	2019-04-01 14:45:37 +02:00
William Lallemand	e25473c846	REORG: mworker: move signal handlers and related functions Move the following functions to mworker.c: void mworker_catch_sighup(struct sig_handler sh); void mworker_catch_sigterm(struct sig_handler sh); void mworker_catch_sigchld(struct sig_handler *sh); static void mworker_kill(int sig); int current_child(int pid);	2019-04-01 14:45:37 +02:00
William Lallemand	3fa724db87	REORG: mworker: move IPC functions to mworker.c Move the following functions to mworker.c: void mworker_accept_wrapper(int fd); void mworker_pipe_register();	2019-04-01 14:45:37 +02:00
William Lallemand	3cd95d2f1b	REORG: mworker: move signals functions to mworker.c Move the following functions to mworker.c: void mworker_block_signals(); void mworker_unblock_signals();	2019-04-01 14:45:37 +02:00
William Lallemand	48dfbbdea9	REORG: mworker: move serializing functions to mworker.c Move the 2 following functions to mworker.c: void mworker_proc_list_to_env() void mworker_env_to_proc_list()	2019-04-01 14:45:37 +02:00
Nenad Merdanovic	c31499d747	MINOR: ssl: Add aes_gcm_dec converter The converter can be used to decrypt the raw byte input using the AES-GCM algorithm, using provided nonce, key and AEAD tag. This can be useful to decrypt encrypted cookies for example and make decisions based on the content.	2019-04-01 13:33:31 +02:00
Willy Tarreau	0ca24aa028	BUILD: connection: fix naming of ip_v field AIX defines ip_v as ip_ff.ip_fv in netinet/ip.h using a macro, and unfortunately we do have a local variable with such a name and which uses the same header file. Let's rename the variable to ip_ver to fix this.	2019-04-01 07:44:56 +02:00
Willy Tarreau	a1bd1faeeb	BUILD: use inttypes.h instead of stdint.h I found on an (old) AIX 5.1 machine that stdint.h didn't exist while inttypes.h which is expected to include it does exist and provides the desired functionalities. As explained here, stdint being just a subset of inttypes for use in freestanding environments, it's probably always OK to switch to inttypes instead: https://pubs.opengroup.org/onlinepubs/009696799/basedefs/stdint.h.html Also it's even clearer here in the autoconf doc : https://www.gnu.org/software/autoconf/manual/autoconf-2.61/html_node/Header-Portability.html "The C99 standard says that inttypes.h includes stdint.h, so there's no need to include stdint.h separately in a standard environment. Some implementations have inttypes.h but not stdint.h (e.g., Solaris 7), but we don't know of any implementation that has stdint.h but not inttypes.h"	2019-04-01 07:44:56 +02:00
Willy Tarreau	7b5654f54a	BUILD: re-implement an initcall variant without using executable sections The current initcall implementation relies on dedicated sections (one section per init stage) to store the initcall descriptors. Then upon startup, these sections are scanned from beginning to end and all items found there are called in sequence. On platforms like AIX or Cygwin it seems difficult to figure the beginning and end of sections as the linker doesn't seem to provide the corresponding symbols. In order to replace this, this patch simply implements an array of single linked (one per init stage) which are fed using constructors for each register call. These constructors are declared static, with a name depending on their line number in the file, in order to avoid name clashes. The final effect is the same, except that the method is slightly more expensive in that it explicitly produces code to register these initcalls : $ size haproxy.sections haproxy.constructor text data bss dec hex filename 4060312 249176 1457652 5767140 57ffe4 haproxy.sections 4062862 260408 1457652 5780922 5835ba haproxy.constructor This mechanism is enabled as an alternative to the default one when build option USE_OBSOLETE_LINKER is set. This option is currently enabled by default only on AIX and Cygwin, and may be attempted for any target which fails to build complaining about missing symbols __start_init_* and/or __stop_init_*. Once confirmed as a reliable fix, this will likely have to be backported to 1.9 where AIX and Cygwin do not build anymore.	2019-04-01 07:43:07 +02:00
Willy Tarreau	9d22e56178	MINOR: tools: add an unsetenv() implementation Older Solaris and AIX versions do not have unsetenv(). This adds a fairly simple implementation which scans the environment, for use with those systems. It will simply require to pass the define in the "DEFINE" macro at build time like this : DEFINE="-Dunsetenv=my_unsetenv"	2019-03-29 21:05:37 +01:00
Willy Tarreau	e0609f5f49	MINOR: tools: make memvprintf() never pass a NULL target to vsnprintf() Most modern platforms don't touch the output buffer when the size argument is null, but there exist a few old ones (like AIX 5 and possibly Tru64) where the output will be dereferenced anyway, probably to write the trailing null, crashing the process. memprintf() uses this to measure the desired length. There is a very simple workaround to this consisting in passing a pointer to a character instead of a NULL pointer. It was confirmed to fix the issue on AIX 5.1.	2019-03-29 21:03:39 +01:00
Willy Tarreau	2231b63887	BUILD: cache: avoid a build warning with some compilers/linkers The struct http_cache_applet was fully declared at the beginning instead of just doing a forward declaration using an extern modifier. Some linkers report warnings about a redefined symbol since these really are two complete declarations. The proper way to do this is to use extern on the first one and to have a full declaration later. However it's not permitted to have both static and extern so the change done in commit `0f2229943` ("CLEANUP: cache: don't export http_cache_applet anymore") has to be partially undone. This should be backported to 1.9 for sanity but has no effet on most platforms. However on 1.9 the extern keyword must also be added to include/types/cache.h.	2019-03-29 21:03:24 +01:00
Ricardo Nabinger Sanchez	4bccea9891	BUG/MAJOR: checks: segfault during tcpcheck_main When using TCP health checks (tcp-check connect), it is possible to crash with a segfault when, for reasons yet to be understood, the protocol family is unknown. In the function tcpcheck_main(), proto is dereferenced without a prior test in case it is NULL, leading to the segfault during proto->connect dereference. The line has been unmodified since it was introduced, in commit `69e273f3fc`. This was the only use of proto (or more specifically, the return of protocol_by_family()) that was unprotected; all other callsites perform the test for a NULL pointer. This patch should be backported to 1.9, 1.8, 1.7, and 1.6.	2019-03-29 11:12:35 +01:00
Olivier Houchard	06f6811d9f	BUG/MEDIUM: checks: Don't bother subscribing if we have a connection error. In __event_srv_chk_r() and __event_srv_chk_w(), don't bother subscribing if we're waiting for a handshake, but we had a connection error. We will never be able to send/receive anything on that connection anyway, and the conn_stream is probably about to be destroyed, and we will crash if the tasklet is waken up. I'm not convinced we need to subscribe here at all anyway, but I'd rather modify the check code as little as possible. This should be backported to 1.9.	2019-03-28 17:32:42 +01:00
William Lallemand	f94afebb94	BUG/MEDIUM: mworker: don't free the wrong child when not found A bug occurs when the sigchld handler is called and a child which is not in the process list just left, or with an empty process list. The child variable won't be set and left as an uninitialized variable or set to the wrong child entry, which can lead to a free of this uninitialized variable or of the wrong child. This can lead to a crash of the master during a stop or a reload. It is not supposed to happen with a worker which was created by the master. A cause could be a fork made by a dependency. (openssl, lua ?) This patch strengthens the case of the missing child by doing the free only if the child was found. This patch must be backported to 1.9.	2019-03-28 11:36:18 +01:00
Christopher Faulet	5220ef25e3	BUG/MINOR: mux-h1: Only skip invalid C-L headers on output When an HTTP request with an empty body is received, the flag HTX_SL_F_BODYLESS is set on the HTX start-line block. It is true if the header content-length is explicitly set to 0 or if it is omitted for a non chunked request. On the server side, when the request is reformatted, because HTX_SL_F_BODYLESS is set, the flag H1_MF_CLEN is added on the request parser. It is done to not add an header transfer-encoding on bodyless requests. But if an header content-length is explicitly set to 0, when it is parsed, because H1_MF_CLEN is set, the function h1_parse_cont_len_header() returns 0, meaning the header can be dropped. So in such case, a request without any header content-length is sent to the server. Some servers seems to reject empty POST requests with an error 411 when there is no header content-length. So to fix this issue, on the output side, only headers with an invalid content length are skipped, ie only when the function h1_parse_cont_len_header() returns a negative value. This patch must be backported to 1.9.	2019-03-28 10:00:36 +01:00
David Carlier	5671662f08	BUILD/MINOR: listener: Silent a few signedness warnings. Silenting couple of warnings related to signedness, due to a mismatch of signed and unsigned ints with l->nbconn, actconn and p->feconn.	2019-03-27 17:37:44 +01:00
Fr�d�ric L�caille	b7405c1c50	BUG/MINOR: peers: Missing initializations after peer session shutdown. This patch fixes a bug introduced by `045e0d4` commit where it was really a bad idea to reset the peer applet context before shutting down the underlying session. This had as side effect to cancel the re-initializations done by peer_session_release(), especially prevented this function from re-initializing the current table pointer which is there to force annoucement of stick-table definitions on when reconnecting. Consequently the peers could send stick-table update messages without a first stick-table definition message. As this is forbidden, this leaded the remote peers to close the sessions.	2019-03-27 15:16:25 +01:00
Willy Tarreau	7728ed3565	BUILD: report the whole feature set with their status in haproxy -vv It's not convenient not to know the status of default options, and requires the user to know what option is enabled by default in each target. With this patch, a new "Features list" line is added to the output of "haproxy -vv" to report the whole list of known features with their respective status. They're prefixed with a "+" when enabled or a "-" when disabled. The "USE_" prefix is removed for clarity.	2019-03-27 14:32:58 +01:00
Fr�d�ric L�caille	54bff83f43	CLEANUP: peers: replace timeout constants by macros. This adds two macros PEER_RESYNC_TIMEOUT and PEER_RECONNECT_TIMEOUT both set to 5 seconds in order to remove magic timeouts which appear in the code.	2019-03-26 10:54:06 +01:00
Fr�d�ric L�caille	aba44a2abc	CLEANUP: peers: remove useless annoying tabulations. There were tabs in between macro names and their values in their definition, forcing everyone to do the same, and causing some mangling in patches. Let's fix all this.	2019-03-26 10:53:09 +01:00
Fr�d�ric L�caille	045e0d4b3b	BUG/MINOR: peers: Really close the sessions with no heartbeat. `645635d` commit was not sufficient to implement the heartbeat feature. When no heartbeat was received before its timeout has expired the session was not closed due to the fact that process_peer_sync() which is the task responsible of handling the heartbeat and session expirations only checked the heartbeat timeout, and sent a heartbeat message if it has expired. This has as side effect to leave the session opened. On the remote side, a peer which receives a heartbeat message, even if not supported, does not close the session. Furthermore it not sufficient to update ->reconnect peer member field to schedule a peer session release. With this patch, a peer is flagged as alive as soon as it received peer protocol messages (and not only heartbeat messages). When no updates must be sent, we first check the reconnection timeout (->reconnect peer member field). If expired, we really shutdown the session if the peer is not alive, but if the peer seen as alive, we reset this flag and update the ->reconnect for the next period. If the reconnection timeout has not expired, then we check the heartbeat timeout which is there only to emit heartbeat messages upon expirations. If expired, as before this patch we increment the heartbeat timeout by 3s to schedule the next heartbeat message then we emit a heartbeat message waking up the peer I/O handler. In every cases we update the task expiration to the earlier time between the reconnection time and the heartbeat timeout time so that to be sure to check again these two ->reconnect and ->heartbeat timers.	2019-03-26 10:51:12 +01:00
Willy Tarreau	65e04eb2bb	MINOR: channel: don't unset CF_SHUTR_NOW after shutting down. This flag is set by the stream layer to request an abort, and results in CF_SHUTR being set once the abort is performed. However by analogy with the send side, the flag was removed once the CF_SHUTR flag was set, thus we lose the information about the cause of the shutr. This is what creates the confusion that sometimes arises between client and server aborts. This patch makes sure we don't remove this flag anymore in this case. All call places only use it to perform the shutr and already check it against CF_SHUTR. So no condition needs to be updated to take this into account. Some later, more careful changes may consist in refining the conditions where we report a client reset or a server reset to ignore SHUTR when SHUTR_NOW is set so that we don't report such misleading information anymore.	2019-03-25 18:35:05 +01:00
Willy Tarreau	a27db38f12	BUG/MEDIUM: mux-h2: make sure to always notify streams of EOS condition Recent commit `63768a63d` ("MEDIUM: mux-h2: Don't mix the end of the message with the end of stream") introduced a race which may manifest itself with small connection counts on large objects and large server timeouts in legacy mode. Sometimes h2s_close() is called while the data layer is subscribed to read events but nothing in the chain can cause this wake-up to happen and some streams stall for a while at the end of a transfer until the server timeout strikes and ends the stream completes. We need to wake the stream up if it's subscribed to rx events there, which is what this patch does. When the patch above is backported to 1.9, this patch will also have to be backported.	2019-03-25 18:13:16 +01:00
Willy Tarreau	e73256fd2a	BUG/MEDIUM: task/h2: add an idempotent task removal fucntion Previous commit `3ea351368` ("BUG/MEDIUM: h2: Remove the tasklet from the task list if unsubscribing.") uncovered an issue which needs to be addressed in the scheduler's API. The function task_remove_from_task_list() was initially designed to remove a task from the running tasklet list from within the scheduler, and had to be used in h2 to abort pending I/O events. However this function was not designed to be idempotent, occasionally causing a double removal from the tasklet list, with the second doing nothing but affecting the apparent tasks count and making haproxy use 100% CPU on some tests consisting in stopping the client during some transfers. The h2_unsubscribe() function can sometimes be called upon stream exit after an error where the tasklet was possibly already removed, so it. This patch does 2 things : - it renames task_remove_from_task_list() to __task_remove_from_tasklet_list() to discourage users from calling it. Also note the fix in the naming since it's a tasklet list and not a task list. This function is still uesd from the scheduler. - it adds a new, idempotent, task_remove_from_tasklet_list() function which does nothing if the task is already not in the tasklet list. This patch will need to be backported where the commit above is backported.	2019-03-25 18:02:54 +01:00
Olivier Houchard	3ea3513689	BUG/MEDIUM: h2: Remove the tasklet from the task list if unsubscribing. In h2_unsubscribe(), if we unsubscribe on SUB_CALL_UNSUBSCRIBE, then remove ourself from the sending_list, and remove the tasklet from the task list. We're probably about to destroy the stream anyway, so we don't want the tasklet to run, or to stay in the sending_list, or it could lead to a crash. This should be backpored to 1.9.	2019-03-25 14:34:26 +01:00
Olivier Houchard	afc7cb85c4	BUG/MEDIUM: h2: Follow the same logic in h2_deferred_shut than in h2_snd_buf. In h2_deferred_shut(), don't just set h2s->send_wait to NULL, instead, use the same logic as in h2_snd_buf() and only do so if we successfully sent data (or if we don't want to send them anymore). Setting it to NULL can lead to crashes. This should be backported to 1.9.	2019-03-25 14:34:26 +01:00
Olivier Houchard	fd1e96d2fb	BUG/MEDIUM: h2: Use the new sending_list in h2s_notify_send(). In h2s_notify_send(), use the new sending_list instead of using the old way of setting hs->send_wait to NULL, failing to do so may lead to crashes. This should be backported to 1.9.	2019-03-25 14:34:26 +01:00
Olivier Houchard	01d4cb5339	BUG/MEDIUM: h2: only destroy the h2s if h2s->cs is NULL. In h2_deferred_shut(), only attempt to destroy the h2s if h2s->cs is NULL. h2s->cs being non-NULL means it's still referenced by the stream interface, so it may try to use it later, and that could lead to a crash. This should be backported to 1.9.	2019-03-25 13:35:02 +01:00
Christopher Faulet	66af0b2b99	MEDIUM: proto_htx: Reintroduce the infinite forwarding on data This commit was reverted because of bugs. Now it should be ok. Difference with the commit `f52170d2f` ("MEDIUM: proto_htx: Switch to infinite forwarding if there is no data filte") is that when the infinite forwarding is enabled, the message is switched to the state HTTP_MSG_DONE if the flag CF_EOI is set.	2019-03-25 06:55:23 +01:00
Christopher Faulet	87a8f353f1	CLEANUP: muxes/stream-int: Remove flags CS_FL_READ_NULL and SI_FL_READ_NULL Since the flag CF_SHUTR is no more set to mark the end of the message, these flags become useless. This patch should be backported to 1.9.	2019-03-25 06:55:23 +01:00
Christopher Faulet	769d0e98b8	BUG/MEDIUM: http/htx: Fix handling of the option abortonclose Because the flag CF_SHUTR is no more set to mark the end of the message by the H2 multiplexer, we can rely on it again to detect aborts. there is no more need to make a check on the flag SI_FL_CLEAN_ABRT when the option abortonclose is enabled. So, this option should work as before for h2 clients. This patch must be backported to 1.9 with the previous EOI patches.	2019-03-25 06:55:13 +01:00
Christopher Faulet	dbe2cb4ee5	MINOR: mux-h1: Set CS_FL_EOI the end of the message is reached As for the H2 multiplexer, When the end of a message is detected, the flag CS_FL_EOI is set on the conn_stream. This patch should be backported to 1.9.	2019-03-25 06:33:53 +01:00
Christopher Faulet	63768a63d7	MEDIUM: mux-h2: Don't mix the end of the message with the end of stream The H2 multiplexer now sets CS_FL_EOI when it receives a frame with the ES flag. And when the H2 streams is closed, it set the flag CS_FL_REOS. This patch should be backported to 1.9.	2019-03-25 06:26:30 +01:00
Christopher Faulet	297d3e2e0f	MINOR: channel: Report EOI on the input channel if it was reached in the mux The flag CF_EOI is now set on the input channel when the flag CS_FL_EOI is set on the corresponding conn_stream. In addition, if a read activity is reported when this flag is set, the stream is woken up. This patch should be backported to 1.9.	2019-03-25 06:24:43 +01:00
Christopher Faulet	3ab07c35b4	MINOR: mux-h2: Remove useless test on ES flag in h2_frt_transfer_data() Same test is already performed in the caller function, h2c_frt_handle_data(). This patch should be backported to 1.9.	2019-03-22 18:06:17 +01:00
Christopher Faulet	2f5c784864	BUG/MINOR: proto-http: Don't forward request body anymore on error In the commit `93e02d8b7` ("MINOR: proto-http/proto-htx: Make error handling clearer during data forwarding"), a return clause was removed by error in the function http_request_forward_body(). This bug seems not having any visible impact. This patch must be backported to 1.9.	2019-03-22 18:05:50 +01:00
Olivier Houchard	d360ac60f4	BUG/MEDIUM: h2: Try to be fair when sending data. On the send path, try to be fair, and make sure the first to attempt to send data will actually be the first to send data when it's possible (ie when the mux' buffer is not full anymore). To do so, use a separate list element for the sending_list, and only remove the h2s from the send_list/fctl_list if we successfully sent data. If we did not, we'll keep our place in the list, and will be able to try again next time. This should be backported to 1.9.	2019-03-22 18:05:03 +01:00
Radek Zajic	594c456d14	BUG/MINOR: log: properly format IPv6 address when LOG_OPT_HEXA modifier is used. In lf_ip(), when LOG_OPT_HEXA modifier is used, there is a code to format the IP address as a hexadecimal string. This code does not properly handle cases when the IP address is IPv6. In such case, the code only prints `00000000`. This patch adds support for IPv6. For legacy IPv4, the format remains unchanged. If IPv6 socket is used to accept IPv6 connection, the full IPv6 address is returned. For example, IPv6 localhost, ::1, is printed as 00000000000000000000000000000001. If IPv6 socket accepts IPv4 connection, the IPv4 address is mapped by the kernel into the IPv4-mapped-IPv6 address space (RFC4291, section 2.5.5.2) and is formatted as such. For example, 127.0.0.1 becomes ::ffff:127.0.0.1, which is printed as 00000000000000000000FFFF7F000001. This should be backported to 1.9.	2019-03-22 17:31:18 +01:00
Pierre Cheynier	bc34cd1de2	BUG/MEDIUM: ssl: ability to set TLS 1.3 ciphers using ssl-default-server-ciphersuites Any attempt to put TLS 1.3 ciphers on servers failed with output 'unable to set TLS 1.3 cipher suites'. This was due to usage of SSL_CTX_set_cipher_list instead of SSL_CTX_set_ciphersuites in the TLS 1.3 block (protected by OPENSSL_VERSION_NUMBER >= 0x10101000L & so). This should be backported to 1.9 and 1.8. Signed-off-by: Pierre Cheynier <p.cheynier@criteo.com> Reported-by: Damien Claisse <d.claisse@criteo.com> Cc: Emeric Brun <ebrun@haproxy.com>	2019-03-22 17:24:14 +01:00
Willy Tarreau	749f5cab83	CLEANUP: mux-h2: add some comments to help understand the code Some functions' roles and usage are far from being obvious, and diving into this part each time requires deep concentration before starting to understand who does what. Let's add a few comments which help figure some of the useful pieces.	2019-03-21 19:19:36 +01:00
Willy Tarreau	8ab128c06a	MINOR: mux-h2: copy small data blocks more often and reduce the number of pauses We tend to refrain from sending data a bit too much in the H2 mux : whenever there are pending data in the buffer and we try to copy something larger than 1/4 of the buffer we prefer to pause. This is suboptimal for medium-sized objects which have to send their headers and later their data. This patch slightly changes this by allowing a copy of a large block if it fits at once and if the realign cost is small, i.e. the pending data are small or the block fits in the contiguous area. Depending on the object size this measurably improves the download performance by between 1 and 10%, and possibly lowers the transfer latency for medium objects.	2019-03-21 18:28:31 +01:00
Olivier Houchard	fd8bd4521a	BUG/MEDIUM: mux-h2: Use the right list in h2_stop_senders(). In h2_stop_senders(), when we're about to move the h2s about to send back to the send_list, because we know the mux is full, instead of putting them all in the send_list, put them back either in the fctl_list or the send_list depending on if they are waiting for the flow control or not. This also makes sure they're inserted in their arrival order and not reversed. This should be backported to 1.9.	2019-03-21 18:28:31 +01:00
Olivier Houchard	16ff261633	BUG/MEDIUM: mux-h2: Don't bother keeping the h2s if detaching and nothing to send. In h2_detach(), don't bother keeping the h2s even if it was waiting for flow control if we no longer are subscribed for receiving or sending, as nobody will do anything once we can write in the mux, anyway. Failing to do so may lead to h2s being kept opened forever. This should be backported to 1.9.	2019-03-21 18:28:31 +01:00
Olivier Houchard	7a977431ca	BUG/MEDIUM: mux-h2: Make sure we destroyed the h2s once shutr/shutw is done. If we're waiting until we can send a shutr and/or a shutw, once we're done and not considering sending anything, destroy the h2s, and eventually the h2c if we're done with the whole connection, or it will never be done. This should be backported to 1.9.	2019-03-21 18:28:31 +01:00
Willy Tarreau	6e8d6a9163	Revert "MEDIUM: proto_htx: Switch to infinite forwarding if there is no data filter" This reverts commit `f52170d2f4`. This commit was merged too early, some areas are not ready and transfers from H1 to H2 often stall. Christopher suggested to wait for the other parts to be ready before reintroducing it.	2019-03-21 18:28:31 +01:00
Christopher Faulet	18c2e8dc0f	MINOR: lua: Don't handle the header Expect in lua HTTP applets anymore This header is now handled in HTTP analyzers the same way for all HTTP applets.	2019-03-19 09:58:35 +01:00
Willy Tarreau	0f22299435	CLEANUP: cache: don't export http_cache_applet anymore This one can become static since it's not used by http/htx anymore.	2019-03-19 09:58:35 +01:00
Christopher Faulet	2571bc6410	MINOR: http/applets: Handle all applets intercepting HTTP requests the same way In addition to stats and cache applets, there are also HTTP applet services declared in an http-request rule. All these applets are now handled the same way. Among other things, the header Expect is handled at the same place for all these applets.	2019-03-19 09:54:20 +01:00
Christopher Faulet	bcf242a1d5	MINOR: stats/cache: Handle the header Expect when applets are registered First of all, it is a way to handle 100-Continue for the cache without duplicating code. Then, for the stats, it is no longer necessary to wait for the request body.	2019-03-19 09:53:14 +01:00
Christopher Faulet	4a28a536a3	MINOR: proto_htx: Add function to handle the header "Expect: 100-continue" The function htx_handle_expect_hdr() is now responsible to search the header "Expect" and send the corresponding response if necessary.	2019-03-19 09:51:38 +01:00
Christopher Faulet	87451fd0bf	MINOR: proto_http: Add function to handle the header "Expect: 100-continue" The function http_handle_expect_hdr() is now responsible to search the header "Expect" and send the corresponding response if necessary.	2019-03-19 09:50:54 +01:00
Christopher Faulet	56a3d6e1f1	BUG/MEDIUM: lua: Fully consume large requests when an HTTP applet ends In Lua, when an HTTP applet ends (in HTX and legacy HTTP), we must flush remaining outgoing data on the request. But only outgoing data at time the applet is called are consumed. If a request with a huge body is sent, an error is triggerred because a SHUTW is catched for an unfinisehd request. Now, we consume request data until the end. In fact, we don't try to shutdown the request's channel for write anymore. This patch must be backported to 1.9 after some observation period. It should probably be backported in prior versions too. But honnestly, with refactoring on the connection layer and the stream interface in 1.9, it is probably safer to not do so.	2019-03-19 09:49:50 +01:00
Christopher Faulet	3a78aa6e95	BUG/MINOR: stats: Fully consume large requests in the stats applet In the stats applet (in HTX and legacy HTTP), after a response is fully sent to a client, the request is consumed. It is done at the end, after all the response was copied into the channel's buffer. But only outgoing data at time the applet is called are consumed. Then the applet is closed. If a request with a huge body is sent, an error is triggerred because a SHUTW is catched for an unfinisehd request. Now, we consume request data until the end. In fact, we don't try to shutdown the request's channel for write anymore. This patch must be backported to 1.9 after some observation period. It should probably be backported in prior versions too. But honnestly, with refactoring on the connection layer and the stream interface in 1.9, it is probably safer to not do so.	2019-03-19 09:49:29 +01:00
Christopher Faulet	adb363135c	BUG/MINOR: cache: Fully consume large requests in the cache applet In the cache applet (in HTX and legacy HTTP), when an cached object is sent to a client, the request must be consumed. It is done at the end, after all the response was copied into the channel's buffer. But only outgoing data at time the applet is called are consumed. Then the applet is closed. If a request with a huge body is sent, an error is triggerred because a SHUTW is catched on an unfinished request. Now, we consume request data as soon as possible and we do it until the end. In fact, we don't try to shutdown the request's channel for write anymore. This patch must be backported to 1.9 after some observation period.	2019-03-19 09:49:08 +01:00
Christopher Faulet	f52170d2f4	MEDIUM: proto_htx: Switch to infinite forwarding if there is no data filter Because in HTX the parsing is done by the multiplexers, there is no reason to limit the amount of data fast-forwarded. Of course, it is only true when there is no data filter registered on the corresponding channel. So now, we enable the infinite forwarding when possible. However, the HTTP message state remains HTTP_MSG_DATA. Then, when infinite forwarding is enabled, if the flag CF_SHUTR is set, the state is switched to HTTP_MSG_DONE.	2019-03-19 09:48:05 +01:00
Willy Tarreau	679bba13f7	MINOR: init: report the list of optionally available services It's never easy to guess what services are built in. We currently have the prometheus exporter in contrib/ which is the only extension for now. Let's enumerate all available ones just like we do for filterr and pollers.	2019-03-19 08:08:10 +01:00
Willy Tarreau	9b6be3bbeb	BUILD: tools: fix a build warning on some 32-bit archs Some recent versions of gcc apparently can detect that x >> 32 will not work on a 32-bit architecture, but are failing to see that the code will not be built since it's enclosed in "if (sizeof(LONG) > 4)" or equivalent. Just shift right twice by 16 bits in this case, the compiler correctly replaces it by a single 32-bit shift. No backport is needed.	2019-03-18 16:33:15 +01:00
Christopher Faulet	93e02d8b73	MINOR: proto-http/proto-htx: Make error handling clearer during data forwarding It is just a cleanup. Error handling is grouped at the end HTTP data analysers. This patch must be backported to 1.9 because it is used by another patch to fix a bug.	2019-03-18 15:50:23 +01:00
Christopher Faulet	203b2b0a5a	MINOR: muxes: Report the Last read with a dedicated flag For conveniance, in HTTP muxes (h1 and h2), the end of the stream and the end of the message are reported the same way to the stream, by setting the flag CS_FL_EOS. In the stream-interface, when CS_FL_EOS is detected, a shutdown for read is reported on the channel side. This is historical. With the legacy HTTP layer, because the parsing is done by the stream in HTTP analyzers, the EOS really means a shutdown for read. Most of time, for muxes h1 and h2, it works pretty well, especially because the keep-alive is handled by the muxes. The stream is only used for one transaction. So mixing EOS and EOM is good enough. But not everytime. For now, client aborts are only reported if it happens before the end of the request. It is an error and it is properly handled. But because the EOS was already reported, client aborts after the end of the request are silently ignored. Eventually an error can be reported when the response is sent to the client, if the sending fails. Otherwise, if the server does not reply fast enough, an error is reported when the server timeout is reached. It is the expected behaviour, excpect when the option abortonclose is set. In this case, we must report an error when the client aborts. But as said before, this event can be ignored. So to be short, for now, the abortonclose is broken. In fact, it is a design problem and we have to rethink all channel's flags and probably the conn-stream ones too. It is important to split EOS and EOM to not loose information anymore. But it is not a small job and the refactoring will be far from straightforward. So for now, temporary flags are introduced. When the last read is received, the flag CS_FL_READ_NULL is set on the conn-stream. This way, we can set the flag SI_FL_READ_NULL on the stream interface. Both flags are persistant. And to be sure to wake the stream, the event CF_READ_NULL is reported. So the stream will always have the chance to handle the last read. This patch must be backported to 1.9 because it will be used by another patch to fix the option abortonclose.	2019-03-18 15:50:23 +01:00
Christopher Faulet	35757d38ce	MINOR: mux-h2: Set REFUSED_STREAM error to reset a stream if no data was never sent According to the H2 spec (see #8.1.4), setting the REFUSED_STREAM error code is a way to indicate that the stream is being closed prior to any processing having occurred, such as when a server-side H1 keepalive connection is closed without sending anything (which differs from the regular error case since haproxy doesn't even generate an error message). Any request that was sent on the reset stream can be safely retried. So, when a stream is closed, if no data was ever sent back (ie. the flag H2_SF_HEADERS_SENT is not set), we can set the REFUSED_STREAM error code on the RST_STREAM frame. This patch may be backported to 1.9.	2019-03-18 15:50:23 +01:00
Christopher Faulet	f02ca00a36	BUG/MEDIUM: mux-h2: Always wakeup streams with no id to avoid frozen streams This only happens for server streams because their id is assigned when the first message is sent. If these streams are not woken up, some events can be lost leading to frozen streams. For instance, it happens when a server closes its connection before sending its preface. This patch must be backported to 1.9.	2019-03-18 15:50:23 +01:00
Willy Tarreau	d1fd6f5f64	BUG/MINOR: http/counters: fix missing increment of fe->srv_aborts When a server aborts a transfer, we used to increment the backend's counter but not the frontend's during the forwarding phase. This fixes it. It might be backported to all supported versions (possibly removing the htx part) though it is of very low importance.	2019-03-18 15:50:23 +01:00
Christopher Faulet	2f9a41d52b	BUG/MAJOR: stats: Fix how huge POST data are read from the channel When the body length is greater than a chunk size (so if length of POST data exceeds the buffer size), the requests is rejected with the status code STAT_STATUS_EXCD. Otherwise the stats applet will wait to have all the data to copy and parse them. But there is a problem when the total request size (including the headers) is just lower than the buffer size but greater the buffer size less the reserve. In such case, the body length is considered as enough small to be processed but not entierly received. So the stats applet waits for more data. But because outgoing data are still there, the channel's buffer is considered as full and nothing more can be read, leading to a freeze of the session. Note this bug is pretty easy to reproduce with the legacy HTTP. It is harder with the HTX but still possible. To fix the bug, in the stats applet, when the request is not fully received, we check if at least the reserve remains available the channel's buffer. This patch must be backported as far as 1.5. But because the HTX does not exist in 1.8 and lower, it will have to be adapted for these versions.	2019-03-18 15:50:23 +01:00
Christopher Faulet	fe261551b9	BUG/MAJOR: spoe: Fix initialization of thread-dependent fields A bug was introduced in the commit b0769b ("BUG/MEDIUM: spoe: initialization depending on nbthread must be done last"). The code depending on global.nbthread was moved from cfg_parse_spoe_agent() to spoe_check() but the pointer on the agent configuration was not updated to use the filter's one. The variable curagent is a global variable only valid during the configuration parsing. In spoe_check(), conf->agent must be used instead. This patch must be backported to 1.9 and 1.8.	2019-03-18 14:07:38 +01:00
Willy Tarreau	57cb506df8	BUILD: listener: shut up a build warning when threads are disabled We get this with __decl_hathreads due to the lone semi-colon, let's move it at the end of the innermost declaration : src/listener.c: In function 'listener_accept': src/listener.c:601:2: warning: ISO C90 forbids mixed declarations and code [-Wdeclaration-after-statement]	2019-03-15 17:17:33 +01:00
Christopher Faulet	5d45e381b4	BUG/MINOR: stats: Be more strict on what is a valid request to the stats applet First of all, only GET, HEAD and POST methods are now allowed. Others will be rejected with the status code STAT_STATUS_IVAL (invalid request). Then, for the legacy HTTP, only POST requests with a content-length are allowed. Now, chunked encoded requests are also considered as invalid because the chunk formatting will interfere with the parsing of POST parameters. In HTX, It is not a problem because data are unchunked. This patch must be backported to 1.9. For prior versions too, but HTX part must be removed. The patch introducing the status code STAT_STATUS_IVAL must also be backported.	2019-03-15 14:35:11 +01:00
Christopher Faulet	2b9b6784b9	MINOR: stats: Move stuff about the stats status codes in stats files The status codes definition (STAT_STATUS_*) and their string representation stat_status_codes) have been moved in stats files. There is no reason to keep them in proto_http files.	2019-03-15 14:34:59 +01:00
Christopher Faulet	3c2ecf75c8	MINOR: stats: Add the status code STAT_STATUS_IVAL to handle invalid requests This patch must be backported to 1.9 because a bug fix depends on it.	2019-03-15 14:34:52 +01:00
Christopher Faulet	0ae79d0b0e	BUG/MINOR: lua/htx: Don't forget to call htx_to_buf() when appropriate When htx_from_buf() is used to get an HTX message from a buffer, htx_to_buf() must always be called when finish. Some calls to htx_to_buf() were missing. This patch must be backported to 1.9.	2019-03-15 14:34:36 +01:00
Christopher Faulet	f6cce3f0ef	BUG/MINOR: lua/htx: Use channel_add_input() when response data are added This patch must be backported to 1.9.	2019-03-15 14:33:50 +01:00
Christopher Faulet	1e2d636413	BUG/MINOR: stats/htx: Call channel_add_input() when response headers are sent This function will only increment the total amount of bytes read by a channel because at this stage there is no fast forwarding. So the bug is pretty limited. This patch must be backported to 1.9.	2019-03-15 14:33:38 +01:00
Christopher Faulet	269223886d	BUG/MINOR: mux-h1: Don't report an error on EOS if no message was received An error is reported if the EOS is detected before the end of the message. But we must be carefull to not report an error if there is no message at all. This patch must be backported to 1.9.	2019-03-15 14:33:02 +01:00
Olivier Houchard	1b32790324	BUG/MEDIUM: tasks: Make sure we wake sleeping threads if needed. When waking a task on a remote thread, we currently check 1) if this thread was sleeping, and 2) if it was already marked as active before writing to its pipe. Unfortunately this doesn't always work as desired because only one thread from the mask is woken up, while the active_tasks_mask indicates all eligible threads for this task. As a result, if one multi-thread task (e.g. a health check) wakes up to run on any thread, then an accept() dispatches an incoming connection on thread 2, this thread will already have its bit set in active_tasks_mask because of the previous wakeup and will not be woken up. This is easily noticeable on 2.0-dev by injecting on a multi-threaded listener with a single connection at a time while health checks are running quickly in the background : the injection runs slowly with random response times (the poll timeouts). In 1.9 it affects the dequeing of server connections, which occasionally experience pauses if multiple threads share the same queue. The correct solution consists in adjusting the sleeping_thread_mask when waking another thread up. This mask reflects threads that are sleeping, hence that need to be signaled to wake up. Threads with a bit in active_tasks_mask already don't have their sleeping_thread_mask bit set before polling so the principle remains consistent. And by doing so we can remove the old_active_mask field. This should be backported to 1.9.	2019-03-15 14:09:39 +01:00
Willy Tarreau	3f20085617	BUG/MEDIUM: init/threads: consider epoll_fd/pipes for automatic maxconn calculation This is the equivalent of the previous patch for the automatic maxconn calculation. This doesn't need any backport.	2019-03-14 20:02:37 +01:00
Willy Tarreau	2c58b41c96	BUG/MEDIUM: threads/fd: do not forget to take into account epoll_fd/pipes Each thread uses one epoll_fd or kqueue_fd, and a pipe (thus two FDs). These ones have to be accounted for in the maxsock calculation, otherwise we can reach maxsock before maxconn. This is difficult to observe but it in fact happens when a server connects back to the frontend and has checks enabled : the check uses its FD and serves to fill the loop. In this case all FDs planed for the datapath are used for this. This needs to be backported to 1.9 and 1.8.	2019-03-14 20:02:37 +01:00
Willy Tarreau	897e2c58e6	BUG/MEDIUM: listener: make sure we don't pick stopped threads Dragan Dosen reported that after the multi-queue changes, appending "process 1/even" on a bind line can make the process immediately crash when delivering a first connection. This is due to the fact that I believed that thread_mask(mask) applied the all_threads_mask value, but it doesn't. And in case of even/odd the bits cover more than the available threads, resulting in too high a thread number being selected and a non-existing task to be woken up. No backport is needed.	2019-03-13 15:03:53 +01:00
Willy Tarreau	df23c0ce45	MINOR: config: continue to rely on DEFAULT_MAXCONN to set the minimum maxconn Some packages used to rely on DEFAULT_MAXCONN to set the default global maxconn value to use regardless of the initial ulimit. The recent changes made the lowest bound set to 100 so that it is compatible with almost any environment. Now that DEFAULT_MAXCONN is not needed for anything else, we can use it for the lowest bound set when maxconn is not configured. This way it retains its original purpose of setting the default maxconn value eventhough most of the time the effective value will be higher thanks to the automatic computation based on "ulimit -n".	2019-03-13 10:10:49 +01:00
Willy Tarreau	ca783d4ee6	MINOR: config: remove obsolete use of DEFAULT_MAXCONN at various places This entry was still set to 2000 but never used anymore. The only places where it appeared was as an alias to SYSTEM_MAXCONN which forces it, so let's turn these ones to SYSTEM_MAXCONN and remove the default value for DEFAULT_MAXCONN. SYSTEM_MAXCONN still defines the upper bound however.	2019-03-13 10:10:25 +01:00
Olivier Houchard	25ad13f9a0	MEDIUM: vars: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	cab0f0b418	MEDIUM: time: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	64dbb2df23	MEDIUM: tcp_rules: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	dc6111e864	MEDIUM: stream: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	2be5a4c627	MEDIUM: ssl: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	d5b3d30b60	MEDIUM: sessions: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	b4df492d01	MEDIUM: queues: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	4051410fef	MEDIUM: proto_tcp: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	ed87989ab5	MEDIUM: peers: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	20872763dd	MEDIUM: memory: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	d2ee3e7227	MEDIUM: logs: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	64213e910d	MEDIUM: listeners: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	36a8e6f970	MEDIUM: lb/threads: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	a798bf56e2	MEDIUM: http: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	b23a61f78a	MEDIUM: threads: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	9e7ae28a16	MEDIUM: spoe: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	43da3430f1	MEDIUM: compression: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	cb6c9274ae	MEDIUM: pollers: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	7059c55463	MEDIUM: checks: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	aa090d46fe	MEDIUM: cache: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	237f781f2d	MEDIUM: backend: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	0823ca8b96	MEDIUM: activity: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	4c28328572	MEDIUM: task: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	d360879fb5	MEDIUM: fd: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	d2b5d16187	MEDIUM: various: Use __ha_barrier_atomic* when relevant. When protecting data modified by atomic operations, use __ha_barrier_atomic* to avoid unneeded barriers on x86.	2019-03-11 17:02:37 +01:00
Olivier Houchard	a51885621d	BUG/MEDIUM: listeners: Don't call fd_stop_recv() if fd_updt is NULL. In do_unbind_listener, don't bother calling fd_stop_recv() if fd_updt is NULL. It means it has already been free'd, and it would crash.	2019-03-08 16:05:31 +01:00
Dragan Dosen	bc6218e1b0	BUG/MEDIUM: 51d: fix possible segfault on deinit_51degrees() When haproxy is built with 51Degrees support, but not configured to use 51Degrees database, a segfault can occur when deinit_51degrees() function is called, eg. during soft-stop on SIGUSR1 signal. Only builds that use Pattern algorithm are affected. This fix must be backported to all stable branches where 51Degrees support is available. Additional adjustments are required for some branches due to API and naming changes.	2019-03-07 17:16:27 +01:00
Fr�d�ric L�caille	2365fb0c97	BUG/MAJOR: config: Wrong maxconn adjustment. Before `c8d5b95` the "maxconn" of the backend of dynamic "use_backend" rules was not modified (this does not make sense and this is correct). When implementing proxy_adjust_all_maxconn(), `c8d5b95` commit missed this case. With this patch we adjust the "maxconn" of the backend of such rules only if they are not dynamic. Without this patch reg-tests/http-rules/h00003.vtc could make haproxy crash.	2019-03-07 17:07:23 +01:00
Olivier Houchard	7c49711d60	BUG/MEDIUM: logs: Only attempt to free startup_logs once. deinit_log_buffers() can be called once per thread, however startup_logs is common to all threads. So only attempt to free it once. This should be backported to 1.9 and 1.8.	2019-03-07 14:59:34 +01:00
Willy Tarreau	0cf33176bd	MINOR: listener: move thr_idx from the bind_conf to the listener Tests show that it's slightly faster to have this field in the listener. The cache walk patterns are under heavy stress and having only this field written to in the bind_conf was wasting a cache line that was heavily read. Let's move this close to the other entries already written to in the listener. Warning, the position does have an impact on peak performance.	2019-03-07 14:08:26 +01:00
Willy Tarreau	9f1d4e7f7f	CLEANUP: listener: remove old thread bit mapping Now that the P2C algorithm for the accept queue is removed, we don't need to map a number to a thread bit anymore, so let's remove all these fields which are taking quite some space for no reason.	2019-03-07 13:59:04 +01:00
Willy Tarreau	0fe703bd50	MEDIUM: listener: change the LB algorithm again to use two round robins instead At this point, the random used in the hybrid queue distribution algorithm provides little benefit over a periodic scan, can even have a slightly worse worst case, and it requires to establish a mapping between a discrete number and a thread ID among a mask. This patch introduces a different approach using two indexes. One scans the thread mask from the left, the other one from the right. The related threads' loads are compared, and the least loaded one receives the new connection. Then one index is adjusted depending on the load resulting from this election, so that we start the next election from two known lightly loaded threads. This approach provides an extra 1% peak performance boost over the previous one, which likely corresponds to the removal of the extra work on the random and the previously required two mappings of index to thread. A test was attempted with two indexes going in the same direction but it was much less interesting because the same thread pairs were compared most of the time with the load climbing in a ladder-like model. With the reverse directions this cannot happen.	2019-03-07 13:57:33 +01:00
Willy Tarreau	fc630bd373	MINOR: listener: improve incoming traffic distribution By picking two randoms following the P2C algorithm, we seldom observe asymmetric loads on bursts of small session counts. This is typically what makes h2load take a bit of time to complete the last 100% because if a thread gets two connections while the other ones only have one, it takes twice the time to complete its work. This patch proposes a modification of the p2c algorithm which seems more suitable to this case : it mixes a rotating index with a random. This way, we're certain that all threads are consulted in turn and at the same time we're not forced to use the ones we're giving a chance. This significantly increases the traffic rate. Now h2load shows faster completion and the average request rates on H2 and the TLS resume rate increases by a bit more than 5% compared to pure p2c. The index was placed into the struct bind_conf because 1) it's faster there and it's the best place to optimally distribute traffic among a group of listeners. It's the only runtime-modified element there and it will be quite cache-hot.	2019-03-07 13:48:04 +01:00
Fr�d�ric L�caille	bfe6138150	MINOR: sample: Add a protocol buffers specific converter. This patch adds "protobuf" protocol buffers specific converter wich may used in combination with "ungrpc" as first converter to extract a protocol buffers field value. It is simply implemented reusing protobuf_field_lookup() which is the protocol buffers specific parser already used by "ungrpc" converter which only parse a gRPC header in addition of parsing protocol buffers message. Update the documentation for this new "protobuf" converter.	2019-03-06 15:36:02 +01:00
Fr�d�ric L�caille	5f33f85ce8	MINOR: sample: Extract some protocol buffers specific code. We move the code responsible of parsing protocol buffers messages inside gRPC messages from sample.c to include/proto/protocol_buffers.h so that to reuse it to cascade "ungrpc" converter.	2019-03-06 15:36:02 +01:00
Lukas Tribus	1aabc93978	BUG/MINOR: ssl: fix warning about ssl-min/max-ver support In `84e417d8` ("MINOR: ssl: support Openssl 1.1.1 early callback for switchctx") the code was extended to also support OpenSSL 1.1.1 (code already supported BoringSSL). A configuration check warning was updated but with the wrong logic, the #ifdef needs a && instead of an \|\|. Reported in #54. Should be backported to 1.8.	2019-03-05 23:56:58 +01:00
Willy Tarreau	5799e9cd37	MINOR: config: relax the range checks on cpu-map Emeric reports that when MAX_THREADS and/or MAX_PROCS are set to lower values, referencing thread or process numbers higher than these limits in cpu-map returns errors. This is annoying because these typically are silent settings that are expected to be used only when set. Let's switch back to LONGBITS for this limit.	2019-03-05 18:14:03 +01:00
Willy Tarreau	8e5e1e7bf0	CLEANUP: wurfl: remove dead, broken and unmaintained code Since the "wurfl" device detection engine was merged slightly more than two years ago (2016-11-04), it never received a single fix nor update. For almost two years it didn't receive even the minimal review or changes needed to be compatible with threads, and it's remained build-broken for about the last 9 months, consecutive to the last buffer API changes, without anyone ever noticing! When asked on the list, nobody confirmed using it : https://www.mail-archive.com/haproxy@formilux.org/msg32516.html And obviously nobody even cared to verify that it did still build. So we are left with this broken code with no user and no maintainer. It might even suffer from remotely exploitable vulnerabilities without anyone being able to check if it presents any risk. It's a pain to update each time there is an API change because it doesn't build as it depends on external libraries that are not publicly accessible, leading to careful blind changes. It slows down the whole project. This situation is not acceptable at all. It's time to cure the problem where it is. This patch removes all this dead, non-buildable, non-working code. If anyone ever decides to use it, which I seriously doubt based on history, it could be reintegrated, but this time the following guarantees will be required : - someone has to step up as a maintainer and have his name listed in the MAINTAINERS file (I should have been more careful last time). This person will take the sole blame for all issues and will be responsible for fixing the bugs and incompatibilities affecting this code, and for making it evolve to follow regular internal API updates. - support building on a standard distro with automated tools (i.e. no more "click on this site, register your e-mail and download an archive then figure how to place this into your build system"). Dummy libs are OK though as long as they allow the mainline code to build and start. - multi-threaded support must be fixed. I mean seriously, not worked around with a check saying "please disable threads, we've been busy fishing for the last two years". This may be backported to 1.9 given that the code has never worked there either, thus at least we're certain nobody will miss it.	2019-03-05 13:46:12 +01:00
Fr�d�ric L�caille	756d97f205	MINOR: sample: Rework gRPC converter code. For now on, "ungrpc" may take a second optional argument to provide the protocol buffers types used to encode the field value to be extracted. When absent the field value is extracted as a binary sample which may then followed by others converters like "hex" which takes binary as input sample. When this second argument is a type which does not match the one found by "ungrpc", this field is considered as not found even if present. With this patch we also remove the useless "varint" and "svarint" converters. Update the documentation about "ungrpc" converters.	2019-03-05 11:04:23 +01:00
Fr�d�ric L�caille	7c93e88d0c	MINOR: sample: Code factorization "ungrpc" converter. Parsing protocol buffer fields always consists in skip the field if the field is not found or store the field value if found. So, with this patch we factorize a little bit the code for "ungrpc" converter.	2019-03-05 11:03:53 +01:00
Willy Tarreau	9255e7e971	BUG/MEDIUM: h2/htx: verify that :path doesn't contain invalid chars While the legacy code converts h2 to h1 and provides some control over what is passed, in htx mode there is no such control and it is possible to pass control chars and linear white spaces in the path, which are possibly reencoded differently once passed to the H1 side. HTX supports parse error reporting using a special flag. Let's check the correctness of the :path pseudo header and report any anomaly in the HTX flag. Thanks to J�r�me Magnin for reporting this bug with a working reproducer. This fix must be backported to 1.9 along with the two previous patches ("MINOR: htx: unconditionally handle parsing errors in requests or responses" and "MINOR: mux-h2: always pass HTX_FL_PARSING_ERROR between h2s and buf on RX").	2019-03-05 10:58:28 +01:00
Willy Tarreau	7196dd6071	MINOR: mux-h2: always pass HTX_FL_PARSING_ERROR between h2s and buf on RX In order to allow the H2 parser to report parsing errors, we must make sure to always pass the HTX_FL_PARSING_ERROR flag from the h2s htx to the conn_stream's htx.	2019-03-05 10:56:34 +01:00
Willy Tarreau	4236f035fe	MINOR: htx: unconditionally handle parsing errors in requests or responses The htx request and response processing functions currently only check for HTX_FL_PARSING_ERROR on incomplete messages because that's how mux_h1 delivers these. However with H2 we have to detect some parsing errors in the format of certain pseudo-headers (e.g. :path), so we do have a complete message but we want to report an error. Let's move the parse error check earlier so that it always triggers when the flag is present. It was also moved for htx_wait_for_request_body() since we definitely want to be able to abort processing such an invalid request even if it appears complete, but it was not changed in the forward functions so as not to truncate contents before the position of the first error.	2019-03-05 10:56:34 +01:00
Fr�d�ric L�caille	50290fbb42	MINOR: sample: Replace "req.ungrpc" smp fetch by a "ungrpc" converter. This patch simply extracts the code of smp_fetch_req_ungrpc() for "req.ungrpc" from http_fetch.c to move it to sample.c with very few modifications. Furthermore smp_fetch_body_buf() used to fetch the body contents is no more needed. Update the documentation for gRPC.	2019-03-04 08:28:42 +01:00
Willy Tarreau	927b88ba00	BUG/MAJOR: mux-h2: fix race condition between close on both ends A crash in H2 was reported in issue #52. It turns out that there is a small but existing race by which a conn_stream could detach itself using h2_detach(), not being able to destroy the h2s due to pending output data blocked by flow control, then upon next h2s activity (transfer_data or trailers parsing), an ES flag may need to be turned into a CS_FL_REOS bit, causing a dereference of a NULL stream. This is a side effect of the fact that we still have a few places which incorrectly depend on the CS flags, while these flags should only be set by h2_rcv_buf() and h2_snd_buf(). All candidate locations along this path have been secured against this risk, but the code should really evolve to stop depending on CS anymore. This fix must be backported to 1.9 and possibly partially to 1.8.	2019-03-04 08:17:12 +01:00
Willy Tarreau	ac35093a19	MEDIUM: init: make the global maxconn default to what rlim_fd_cur permits The global maxconn value is often a pain to configure : - in development the user never has the permissions to increase the rlim_cur value too high and gets warnings all the time ; - in some production environments, users may have limited actions on it or may only be able to act on rlim_fd_cur using ulimit -n. This is sometimes particularly true in containers or whatever environment where the user has no privilege to upgrade the limits. - keeping config homogenous between machines is even less easy. We already had the ability to automatically compute maxconn from the memory limits when they were set. This patch goes a bit further by also computing the limit permitted by the configured limit on the number of FDs. For this it simply reverses the rlim_fd_cur calculation to determine maxconn based on the number of reserved sockets for listeners & checks, the number of SSL engines and the number of pipes (absolute or relative). This way it becomes possible to make maxconn always be the highest possible value resulting in maxsock matching what was set using "ulimit -n", without ever setting it. Note that we adjust to the soft limit, not the hard one, since it's what is configured with ulimit -n. This allows users to also limit to low values if needed. Just like before, the calculated value is reported in verbose mode.	2019-03-01 15:54:16 +01:00
Willy Tarreau	8d687d8464	MINOR: init: move some maxsock updates earlier We'll need to know the global maxsock before the maxconn calculation. Actually only two components were calculated too late, the peers FD and the stats FD. Let's move them a few lines upward.	2019-03-01 15:53:14 +01:00
Willy Tarreau	5a023f0d7a	MINOR: init: make the maxpipe computation more accurate The default number of pipes is adjusted based on the sum of frontends and backends maxconn/fullconn settings. Now that it is possible to have a null maxconn on a frontend to indicate "unlimited" with commit `c8d5b95e6` ("MEDIUM: config: don't enforce a low frontend maxconn value anymore"), the sum of maxconn may remain low and limited to the only frontends/backends where this limit is set. This patch considers this new unlimited case when doing the check, and automatically switches to the default value which is maxconn/4 in this case. All the calculation was moved to a distinct function for ease of use. This function also supports returning unlimited (-1) when the value depends on global.maxconn and this latter is not yet set.	2019-03-01 15:53:14 +01:00
Willy Tarreau	8dca19549a	BUG/MINOR: mworker: be careful to restore the original rlim_fd_cur/max on reload When the master re-execs itself on reload, it doesn't restore the initial rlim_fd_cur/rlim_fd_max values, which have been modified by the ulimit-n or global maxconn directives. This is a problem, because if these values were set really low it could prevent the process from restarting, and if they were set very high, this could have some implications on the restart time, or later on the computed maxconn. Let's simply reset these values to the ones we had at boot to maintain the system in a consistent state. A backport could be performed to 1.9 and maybe 1.8. This patch depends on the two previous ones.	2019-03-01 11:26:08 +01:00
Willy Tarreau	9f6dc72477	BUG/MINOR: checks: make external-checks restore the original rlim_fd_cur/max It's not normal that external processes are run with high FD limits, as quite often such processes (especially shell scripts) will iterate over all FDs to close them. Ideally we should even provide a tunable with the external-check directive to adjust this value, but at least we need to restore it to the value that was active when starting haproxy (before it was adjusted for maxconn). Additionally with very low maxconn values causing rlim_fd_cur to be low, some heavy checks could possibly fail. This was also mentioned in issue #45. Currently the following config and scripts report this : $ cat rlim.cfg global maxconn 500000 external-check listen www bind :8001 timeout client 5s timeout server 5s timeout connect 5s option external-check external-check command "$PWD/sleep1.sh" server local 127.0.0.1:80 check inter 1s $ cat sleep1.sh #!/bin/sh /bin/sleep 0.1 echo -n "soft: ";ulimit -S -n echo -n "hard: ";ulimit -H -n # ./haproxy -db -f rlim.cfg soft: 1000012 hard: 1000012 soft: 1000012 hard: 1000012 Now with the fix : # ./haproxy -db -f rlim.cfg soft: 1024 hard: 4096 soft: 1024 hard: 4096 This fix should be backported to stable versions but it depends on "MINOR: global: keep a copy of the initial rlim_fd_cur and rlim_fd_max values" and "BUG/MINOR: init: never lower rlim_fd_max".	2019-03-01 11:23:45 +01:00
Willy Tarreau	e5cfdacb83	BUG/MINOR: init: never lower rlim_fd_max If a ulimit-n value is set, we must not lower the rlim_max value if the new value is lower, we must only adjust the rlim_cur one. The effect is that on very low values, this could prevent a master-worker reload, or make an external check fail by lack of FDs. This may be backported to 1.9 and earlier, but it depends on this patch "MINOR: global: keep a copy of the initial rlim_fd_cur and rlim_fd_max values".	2019-03-01 10:40:30 +01:00
Willy Tarreau	bf6964007a	MINOR: global: keep a copy of the initial rlim_fd_cur and rlim_fd_max values Let's keep a copy of these initial values. They will be useful to compute automatic maxconn, as well as to restore proper limits when doing an execve() on external checks.	2019-03-01 10:40:30 +01:00
Fr�d�ric L�caille	645635da84	MINOR: peers: Add a message for heartbeat. This patch implements peer heartbeat feature to prevent any haproxy peer from reconnecting too often, consuming sockets for nothing. To do so, we add PEER_MSG_CTRL_HEARTBEAT new message to PEER_MSG_CLASS_CONTROL peers control class of messages. A ->heartbeat field is added to peer structs to store the heatbeat timeout value which is handled by the same function as for ->reconnect to control the session timeouts. A 2-bytes heartbeat message is sent every 3s when no updates have to be sent. This way, the peer which receives such a message is sure the remote peer is still alive. So, it resets the ->reconnect peer session timeout to its initial value (5s). This prevents any reconnection to an already connected alive peer.	2019-03-01 09:33:26 +01:00
Willy Tarreau	c8d5b95e6d	MEDIUM: config: don't enforce a low frontend maxconn value anymore Historically the default frontend's maxconn used to be quite low (2000), which was sufficient two decades ago but often proved to be a problem when users had purposely set the global maxconn value but forgot to set the frontend's. There is no point in keeping this arbitrary limit for frontends : when the global maxconn is lower, it's already too high and when the global maxconn is much higher, it becomes a limiting factor which causes trouble in production. This commit allows the value to be set to zero, which becomes the new default value, to mean it's not directly limited, or in fact it's set to the global maxconn. Since this operation used to be performed before computing a possibly automatic global maxconn based on memory limits, the calculation of the maxconn value and its propagation to the backends' fullconn has now moved to a dedicated function, proxy_adjust_all_maxconn(), which is called once the global maxconn is stabilized. This comes with two benefits : 1) a configuration missing "maxconn" in the defaults section will not limit itself to a magically hardcoded value but will scale up to the global maxconn ; 2) when the global maxconn is not set and memory limits are used instead, the frontends' maxconn automatically adapts, and the backends' fullconn as well.	2019-02-28 17:05:32 +01:00
Willy Tarreau	d89cc8bfc0	MINOR: proxy: do not change the listeners' maxconn when updating the frontend's It is possible to update a frontend's maxconn from the CLI. Unfortunately when doing this it scratches all listeners' maxconn values and sets them all to the new frontend's value. This can be problematic when mixing different traffic classes (bind to interface or private networks, etc). Now that the listener's maxconn is allowed to remain unset, let's not change these values when setting the frontend's maxconn. This way the overall frontend's limit can be raised but if certain specific listeners had their own value forced in the config, they will be preserved. This makes more sense and is more in line with the principle of defaults propagation.	2019-02-28 17:05:32 +01:00
Willy Tarreau	a8cf66bcab	MINOR: listener: do not needlessly set l->maxconn It's pointless to always set and maintain l->maxconn because the accept loop already enforces the frontend's limit anyway. Thus let's stop setting this value by default and keep it to zero meaning "no limit". This way the frontend's maxconn will be used by default. Of course if a value is set, it will be enforced.	2019-02-28 17:05:32 +01:00
Willy Tarreau	e2711c7bd6	MINOR: listener: introduce listener_backlog() to report the backlog value In an attempt to try to provide automatic maxconn settings, we need to decorrelate a listner's backlog and maxconn so that these values can be independent. This introduces a listener_backlog() function which retrieves the backlog value from the listener's backlog, the frontend's, the listener's maxconn, the frontend's or falls back to 1024. This corresponds to what was done in cfgparse.c to force a value there except the last fallback which was not set since the frontend's maxconn is always known.	2019-02-28 17:05:29 +01:00
Willy Tarreau	82c9789ac4	BUG/MEDIUM: listener: make sure the listener never accepts too many conns We were not checking p->feconn nor the global actconn soon enough. In older versions this could result in a frontend accepting more connections than allowed by its maxconn or the global maxconn, exactly N-1 extra connections where N is the number of threads, provided each of these threads were running a different listener. But with the lock removal, it became worse, the excess could be the listener's maxconn multiplied by the number of threads. Among the nasty side effect was that LI_FULL could be removed while the limit was still over and in some cases the polling on the socket was no re-enabled. This commit takes care of updating and checking p->feconn and the global actconn before processing the connection, so that the listener can be turned off before accepting the socket if needed. This requires to move some of the bookkeeping operations form session to listen, which totally makes sense in this context. Now the limits are properly respected, even if a listener's maxconn is over a frontend's. This only applies on top of the listener lock removal series and doesn't have to be backported.	2019-02-28 16:08:54 +01:00
Willy Tarreau	01abd02508	BUG/MEDIUM: listener: use a self-locked list for the dequeue lists There is a very difficult to reproduce race in the listener's accept code, which is much easier to reproduce once connection limits are properly enforced. It's an ABBA lock issue : - the following functions take l->lock then lq_lock : disable_listener, pause_listener, listener_full, limit_listener, do_unbind_listener - the following ones take lq_lock then l->lock : resume_listener, dequeue_all_listener This is because __resume_listener() only takes the listener's lock and expects to be called with lq_lock held. The problem can easily happen when listener_full() and limit_listener() are called a lot while in parallel another thread releases sessions for the same listener using listener_release() which in turn calls resume_listener(). This scenario is more prevalent in 2.0-dev since the removal of the accept lock in listener_accept(). However in 1.9 and before, a different but extremely unlikely scenario can happen : thread1 thread2 ............................ enter listener_accept() limit_listener() ............................ long pause before taking the lock session_free() dequeue_all_listeners() lock(lq_lock) [1] ............................ try_lock(l->lock) [2] __resume_listener() spin_lock(l->lock) =>WAIT[2] ............................ accept() l->accept() nbconn==maxconn => listener_full() state==LI_LIMITED => lock(lq_lock) =>DEADLOCK[1]! In practice it is almost impossible to trigger it because it requires to limit both on the listener's maxconn and the frontend's rate limit, at the same time, and to release the listener when the connection rate goes below the limit between poll() returns the FD and the lock is taken (a few nanoseconds). But maybe with threads competing on the same core it has more chances to appear. This patch removes the lq_lock and replaces it with a lockless queue for the listener's wait queue (well, technically speaking a self-locked queue) brought by commit `a8434ec14` ("MINOR: lists: Implement locked variations.") and its few subsequent fixes. This relieves us from the need of the lq_lock and removes the deadlock. It also gets rid of the distinction between __resume_listener() and resume_listener() since the only difference was the lq_lock. All listener removals from the list are now unconditional to avoid races on the state. It's worth noting that the list used to never be initialized and that it used to work only thanks to the state tests, so the initialization has now been added. This patch must carefully be backported to 1.9 and very likely 1.8. It is mandatory to be careful about replacing all manipulations of l->wait_queue, global.listener_queue and p->listener_queue.	2019-02-28 16:08:54 +01:00
Willy Tarreau	c912f94b57	MINOR: server: remove a few unneeded LIST_INIT calls after LIST_DEL_LOCKED Since LIST_DEL_LOCKED() and LIST_POP_LOCKED() now automatically reinitialize the removed element, there's no need for keeping this LIST_INIT() call in the idle connection code.	2019-02-28 16:08:54 +01:00
Willy Tarreau	18215cba6a	BUG/MINOR: config: don't over-count the global maxsock value global.maxsock used to be augmented by the frontend's maxconn value for each frontend listener, which is absurd when there are many listeners in a frontend because the frontend's maxconn fixes an upper limit to how many connections will be accepted on all of its listeners anyway. What is needed instead is to add one to count the listening socket. In addition, the CLI's and peers' value was incremented twice, the first time when creating the listener and the second time in the main init code. Let's now make sure we only increment global.maxsock by the required amount of sockets. This means not adding maxconn for each listener, and relying on the global values when they are correct.	2019-02-27 19:35:37 +01:00
Willy Tarreau	149ab779cc	MAJOR: threads: enable one thread per CPU by default Threads have long matured by now, still for most users their usage is not trivial. It's about time to enable them by default on platforms where we know the number of CPUs bound. This patch does this, it counts the number of CPUs the process is bound to upon startup, and enables as many threads by default. Of course, "nbthread" still overrides this, but if it's not set the default behaviour is to start one thread per CPU. The default number of threads is reported in "haproxy -vv". Simply using "taskset -c" is now enough to adjust this number of threads so that there is no more need for playing with cpu-map. And thanks to the previous patches on the listener, the vast majority of configurations will not need to duplicate "bind" lines with the "process x/y" statement anymore either, so a simple config will automatically adapt to the number of processors available.	2019-02-27 14:51:50 +01:00
Willy Tarreau	7ac908bf8c	MINOR: config: add global tune.listener.multi-queue setting tune.listener.multi-queue { on \| off } Enables ('on') or disables ('off') the listener's multi-queue accept which spreads the incoming traffic to all threads a "bind" line is allowed to run on instead of taking them for itself. This provides a smoother traffic distribution and scales much better, especially in environments where threads may be unevenly loaded due to external activity (network interrupts colliding with one thread for example). This option is enabled by default, but it may be forcefully disabled for troubleshooting or for situations where it is estimated that the operating system already provides a good enough distribution and connections are extremely short-lived.	2019-02-27 14:27:07 +01:00
Willy Tarreau	8a03408d81	MINOR: activity: add accept queue counters for pushed and overflows It's important to monitor the accept queues to know if some incoming connections had to be handled by their originating thread due to an overflow. It's also important to be able to confirm thread fairness. This patch adds "accq_pushed" to activity reporting, which reports the number of connections that were successfully pushed into each thread's queue, and "accq_full", which indicates the number of connections that couldn't be pushed because the thread's queue was full.	2019-02-27 14:27:07 +01:00
Willy Tarreau	e0e9c48ab2	MAJOR: listener: use the multi-queue for multi-thread listeners The idea is to redistribute an incoming connection to one of the threads a bind_conf is bound to when there is more than one. We do this using a random improved by the p2c algorithm : a random() call returns two different thread numbers. We then compare their respective connection count and the length of their accept queues, and pick the least loaded one. We even use this deferred accept mechanism if the target thread ends up being the local thread, because this maintains fairness between all connections and tests show that it's about 1% faster this way, likely due to cache locality. If the target thread's accept queue is full, the connection is accepted synchronously by the current thread.	2019-02-27 14:27:07 +01:00
Willy Tarreau	1efafce61f	MINOR: listener: implement multi-queue accept for threads There is one point where we can migrate a connection to another thread without taking risk, it's when we accept it : the new FD is not yet in the fd cache and no task was created yet. It's still possible to assign it a different thread than the one which accepted the connection. The only requirement for this is to have one accept queue per thread and their respective processing tasks that have to be woken up each time an entry is added to the queue. This is a multiple-producer, single-consumer model. Entries are added at the queue's tail and the processing task is woken up. The consumer picks entries at the head and processes them in order. The accept queue contains the fd, the source address, and the listener. Each entry of the accept queue was rounded up to 64 bytes (one cache line) to avoid cache aliasing because tests have shown that otherwise performance suffers a lot (5%). A test has shown that it's important to have at least 256 entries for the rings, as at 128 it's still possible to fill them often at high loads on small thread counts. The processing task does almost nothing except calling the listener's accept() function and updating the global session and SSL rate counters just like listener_accept() does on synchronous calls. At this point the accept queue is implemented but not used.	2019-02-27 14:27:07 +01:00
Willy Tarreau	b2b50a7784	MINOR: listener: pre-compute some thread counts per bind_conf In order to quickly pick a thread ID when accepting a connection, we'll need to know certain pre-computed values derived from the thread mask, which are counts of bits per position multiples of 1, 2, 4, 8, 16 and 32. In practice it is sufficient to compute only the 4 first ones and store them in the bind_conf. We update the count every time the bind_thread value is adjusted. The fields in the bind_conf struct have been moved around a little bit to make it easier to group all thread bit values into the same cache line. The function used to return a thread number is bind_map_thread_id(), and it maps a number between 0 and 31/63 to a thread ID between 0 and 31/63, starting from the left.	2019-02-27 14:27:07 +01:00
Willy Tarreau	f3241115e7	MINOR: tools: implement functions to look up the nth bit set in a mask Function mask_find_rank_bit() returns the bit position in mask <m> of the nth bit set of rank <r>, between 0 and LONGBITS-1 included, starting from the left. For example ranks 0,1,2,3 for mask 0x55 will be 6, 4, 2 and 0 respectively. This algorithm is based on a popcount variant and is described here : https://graphics.stanford.edu/~seander/bithacks.html.	2019-02-27 14:27:07 +01:00
Willy Tarreau	9e85318417	MINOR: listener: maintain a per-thread count of the number of connections on a listener Having this information will help us improve thread-level distribution of incoming traffic.	2019-02-27 14:27:07 +01:00
Willy Tarreau	3f0d02bbc2	MAJOR: listener: do not hold the listener lock in listener_accept() This function used to hold the listener's lock as a way to stay safe against concurrent manipulations, but it turns out this is wrong. First, the lock is held during l->accept(), which itself might indirectly call listener_release(), which, if the listener is marked full, could result in __resume_listener() to be called and the lock being taken twice. In practice it doesn't happen right now because the listener's FULL state cannot change while we're doing this. Second, all the code does is now protected against concurrent accesses. It used not to be the case in the early days of threads : the frequency counters are thread-safe. The rate limiting doesn't require extreme precision. Only the nbconn check is not thread safe. Third, the parts called here will have to be called from different threads without holding this lock, and this becomes a bigger issue if we need to keep this one. This patch does 3 things which need to be addressed at once : 1) it moves the lock to the only 2 functions that were not protected since called form listener_accept() : - limit_listener() - listener_full() 2) it makes sure delete_listener() properly checks its state within the lock. 3) it updates the l->nbconn tracking to make sure that it is always properly reported and accounted for. There is a point of particular care around the situation where the listener's maxconn is reached because the listener has to be marked full before accepting the connection, then resumed if the connection finally gets dropped. It is not possible to perform this change without removing the lock due to the deadlock issue explained above. This patch almost doubles the accept rate in multi-thread on a shared port between 8 threads, and multiplies by 4 the connection rate on a tcp-request connection reject rule.	2019-02-27 14:27:07 +01:00
Willy Tarreau	a36b324777	MEDIUM: listener: keep a single thread-mask and warn on "process" misuse Now that nbproc and nbthread are exclusive, we can still provide more detailed explanations about what we've found in the config when a bind line appears on multiple threads and processes at the same time, then ignore the setting. This patch reduces the listener's thread mask to a single mask instead of an array of masks per process. Now we have only one thread mask and one process mask per bind-conf. This removes ~504 bytes of RAM per bind-conf and will simplify handling of thread masks. If a "bind" line only refers to process numbers not found by its parent frontend or not covered by the global nbproc directive, or to a thread not covered by the global nbthread directive, a warning is emitted saying what will be used instead.	2019-02-27 14:27:07 +01:00
Willy Tarreau	26f6ae12c0	MAJOR: config: disable support for nbproc and nbthread in parallel When 1.8 was released, we wanted to support both nbthread and nbproc to observe how things would go. Since then it appeared obvious that the two are never used together because of the pain to configure affinity in this case, and instead of bringing benefits, it brings the limitations of both models, and causes multiple threads to compete for the same CPU. In addition, it costs a lot to support both in parallel, so let's get rid of this once for all.	2019-02-27 14:27:04 +01:00
Willy Tarreau	741b4d6b7a	BUG/MINOR: listener: keep accept rate counters accurate under saturation The test on l->nbconn forces to exit the loop before updating the freq counters, so the last session which reaches a listener's limit will not be accounted for in the session rate measurement. Let's move the test at the beginning of the loop and mark the listener as saturated on exit. This may be backported to 1.9 and 1.8.	2019-02-27 08:03:41 +01:00
Fr�d�ric L�caille	12a718488a	BUG/MEDIUM: standard: Wrong reallocation size. The number of bytes to use with "my_realloc2()" in parse_dotted_nums() was wrong: missing multiplication by the size of an element of an array when reallocating it.	2019-02-26 19:07:44 +01:00
Olivier Houchard	dd1c8f1f72	MINOR: cfgparse: Add a cast to make gcc happier. When calling calloc(), cast global.nbthread to unsigned int, so that gcc doesn't freak out, as it has no way of knowing global.nbthread can't be negative.	2019-02-26 18:47:59 +01:00
Olivier Houchard	9ea5d361ae	MEDIUM: servers: Reorganize the way idle connections are cleaned. Instead of having one task per thread and per server that does clean the idling connections, have only one global task for every servers. That tasks parses all the servers that currently have idling connections, and remove half of them, to put them in a per-thread list of connections to kill. For each thread that does have connections to kill, wake a task to do so, so that the cleaning will be done in the context of said thread.	2019-02-26 18:17:32 +01:00
Olivier Houchard	7f1bc31fee	MEDIUM: servers: Used a locked list for idle_orphan_conns. Use the locked macros when manipulating idle_orphan_conns, so that other threads can remove elements from it. It will be useful later to avoid having a task per server and per thread to cleanup the orphan list.	2019-02-26 18:17:32 +01:00
Tim Duesterhus	36839dc39f	CLEANUP: stream: Remove bogus loop in conn_si_send_proxy The if-statement was converted into a while-loop in `7fe45698f5` to handle EINTR. This special handling was later replaced in `0a03c0f022` by conn_sock_send. The while-loop was not changed back and is not unconditionally exited after one iteration, with no `continue` inside the body. Replace by an if-statement.	2019-02-26 17:27:04 +01:00
Tim Duesterhus	c7f880ee3b	CLEANUP: http: Remove unreachable code in parse_http_req_capture `len` has already been checked to be strictly positive a few lines above. This unreachable code was introduced in `82bf70dff4`.	2019-02-26 17:27:04 +01:00
Willy Tarreau	6c1b667e57	[RELEASE] Released version 2.0-dev1 Released version 2.0-dev1 with the following main changes : - MINOR: mux-h2: only increase the connection window with the first update - REGTESTS: remove the expected window updates from H2 handshakes - BUG/MINOR: mux-h2: make empty HEADERS frame return a connection error - BUG/MEDIUM: mux-h2: mark that we have too many CS once we have more than the max - MEDIUM: mux-h2: remove padlen during headers phase - MINOR: h2: add a bit-based frame type representation - MINOR: mux-h2: remove useless check for empty frame length in h2s_decode_headers() - MEDIUM: mux-h2: decode HEADERS frames before allocating the stream - MINOR: mux-h2: make h2c_send_rst_stream() use the dummy stream's error code - MINOR: mux-h2: add a new dummy stream for the REFUSED_STREAM error code - MINOR: mux-h2: fail stream creation more cleanly using RST_STREAM - MINOR: buffers: add a new b_move() function - MINOR: mux-h2: make h2_peek_frame_hdr() support an offset - MEDIUM: mux-h2: handle decoding of CONTINUATION frames - CLEANUP: mux-h2: remove misleading comments about CONTINUATION - BUG/MEDIUM: servers: Don't try to reuse connection if we switched server. - BUG/MEDIUM: tasks: Decrement tasks_run_queue in tasklet_free(). - BUG/MINOR: htx: send the proper authenticate header when using http-request auth - BUG/MEDIUM: mux_h2: Don't add to the idle list if we're full. - BUG/MEDIUM: servers: Fail if we fail to allocate a conn_stream. - BUG/MAJOR: servers: Use the list api correctly to avoid crashes. - BUG/MAJOR: servers: Correctly use LIST_ELEM(). - BUG/MAJOR: sessions: Use an unlimited number of servers for the conn list. - BUG/MEDIUM: servers: Flag the stream_interface on handshake error. - MEDIUM: servers: Be smarter when switching connections. - MEDIUM: sessions: Keep track of which connections are idle. - MINOR: payload: add sample fetch for TLS ALPN - BUG/MEDIUM: log: don't mark log FDs as non-blocking on terminals - MINOR: channel: Add the function channel_add_input - MINOR: stats/htx: Call channel_add_input instead of updating channel state by hand - BUG/MEDIUM: cache: Be sure to end the forwarding when XFER length is unknown - BUG/MAJOR: htx: Return the good block address after a defrag - MINOR: lb: allow redispatch when using consistent hash - CLEANUP: mux-h2: fix end-of-stream flag name when processing headers - BUG/MEDIUM: mux-h2: always restart reading if data are available - BUG/MINOR: mux-h2: set the stream-full flag when leaving h2c_decode_headers() - BUG/MINOR: mux-h2: don't check the CS count in h2c_bck_handle_headers() - BUG/MINOR: mux-h2: mark end-of-stream after processing response HEADERS, not before - BUG/MINOR: mux-h2: only update rxbuf's length for H1 headers - BUG/MEDIUM: mux-h1: use per-direction flags to indicate transitions - BUG/MEDIUM: mux-h1: make HTX chunking consistent with H2 - BUG/MAJOR: stream-int: Update the stream expiration date in stream_int_notify() - BUG/MEDIUM: proto-htx: Set SI_FL_NOHALF on server side when request is done - BUG/MEDIUM: mux-h1: Add a task to handle connection timeouts - MINOR: mux-h2: make h2c_decode_headers() return a status, not a count - MINOR: mux-h2: add a new dummy stream : h2_error_stream - MEDIUM: mux-h2: make h2c_decode_headers() support recoverable errors - BUG/MINOR: mux-h2: detect when the HTX EOM block cannot be added after headers - MINOR: mux-h2: remove a misleading and impossible test - CLEANUP: mux-h2: clean the stream error path on HEADERS frame processing - MINOR: mux-h2: check for too many streams only for idle streams - MINOR: mux-h2: set H2_SF_HEADERS_RCVD when a HEADERS frame was decoded - BUG/MEDIUM: mux-h2: decode trailers in HEADERS frames - MINOR: h2: add h2_make_h1_trailers to turn H2 headers to H1 trailers - MEDIUM: mux-h2: pass trailers to H1 (legacy mode) - MINOR: htx: add a new function to add a block without filling it - MINOR: h2: add h2_make_htx_trailers to turn H2 headers to HTX trailers - MEDIUM: mux-h2: pass trailers to HTX - MINOR: mux-h1: parse the content-length header on output and set H1_MF_CLEN - BUG/MEDIUM: mux-h1: don't enforce chunked encoding on requests - MINOR: mux-h2: make HTX_BLK_EOM processing idempotent - MINOR: h1: make the H1 headers block parser able to parse headers only - MEDIUM: mux-h2: emit HEADERS frames when facing HTX trailers blocks - MINOR: stream/htx: Add info about the HTX structs in "show sess all" command - MINOR: stream: Add the subscription events of SIs in "show sess all" command - MINOR: mux-h1: Add the subscription events in "show fd" command - BUG/MEDIUM: h1: Get the h1m state when restarting the headers parsing - BUG/MINOR: cache/htx: Be sure to count partial trailers - BUG/MEDIUM: h1: In h1_init(), wake the tasklet instead of calling h1_recv(). - BUG/MEDIUM: server: Defer the mux init until after xprt has been initialized. - MINOR: connections: Remove a stall comment. - BUG/MEDIUM: cli: make "show sess" really thread-safe - BUILD: add a new file "version.c" to carry version updates - MINOR: stream/htx: add the HTX flags output in "show sess all" - MINOR: stream/cli: fix the location of the waiting flag in "show sess all" - MINOR: stream/cli: report more info about the HTTP messages on "show sess all" - BUG/MINOR: lua: bad args are returned for Lua actions - BUG/MEDIUM: lua: dead lock when Lua tasks are trigerred - MINOR: htx: Add an helper function to get the max space usable for a block - MINOR: channel/htx: Add HTX version for some helper functions - BUG/MEDIUM: cache/htx: Respect the reserve when cached objects are served - BUG/MINOR: stats/htx: Respect the reserve when the stats page is dumped - DOC: regtest: make it clearer what the purpose of the "broken" series is - REGTEST: mailers: add new test for 'mailers' section - REGTEST: Add a reg test for health-checks over SSL/TLS. - BUG/MINOR: mux-h1: Close connection on shutr only when shutw was really done - MEDIUM: mux-h1: Clarify how shutr/shutw are handled - BUG/MINOR: compression: Disable it if another one is already in progress - BUG/MINOR: filters: Detect cache+compression config on legacy HTTP streams - BUG/MINOR: cache: Disable the cache if any compression filter precedes it - REGTEST: Add some informatoin to test results. - MINOR: htx: Add a function to truncate all blocks after a specific offset - MINOR: channel/htx: Add the HTX version of channel_truncate/erase - BUG/MINOR: proto_htx: Use HTX versions to truncate or erase a buffer - BUG/CRITICAL: mux-h2: re-check the frame length when PRIORITY is used - DOC: Fix typo in req.ssl_alpn example (commit 4afdd138424ab...) - DOC: http-request cache-use / http-response cache-store expects cache name - REGTEST: "capture (request\|response)" regtest. - BUG/MINOR: lua/htx: Respect the reserve when data are send from an HTX applet - REGTEST: filters: add compression test - BUG/MEDIUM: init: Initialize idle_orphan_conns for first server in server-template - BUG/MEDIUM: ssl: Disable anti-replay protection and set max data with 0RTT. - DOC: Be a bit more explicit about allow-0rtt security implications. - MINOR: mux-h1: make the mux_h1_ops struct static - BUILD: makefile: add an EXTRA_OBJS variable to help build optional code - BUG/MEDIUM: connection: properly unregister the mux on failed initialization - BUG/MAJOR: cache: fix confusion between zero and uninitialized cache key - REGTESTS: test case for map_regm commit `271022150d` - REGTESTS: Basic tests for concat,strcmp,word,field,ipmask converters - REGTESTS: Basic tests for using maps to redirect requests / select backend - DOC: REGTESTS README varnishtest -Dno-htx= define. - MINOR: spoe: Make the SPOE filter compatible with HTX proxies - MINOR: checks: Store the proxy in checks. - BUG/MEDIUM: checks: Avoid having an associated server for email checks. - REGTEST: Switch to vtest. - REGTEST: Adapt reg test doc files to vtest. - BUG/MEDIUM: h1: Make sure we destroy an inactive connectin that did shutw. - BUG/MINOR: base64: dec func ignores padding for output size checking - BUG/MEDIUM: ssl: missing allocation failure checks loading tls key file - MINOR: ssl: add support of aes256 bits ticket keys on file and cli. - BUG/MINOR: backend: don't use url_param_name as a hint for BE_LB_ALGO_PH - BUG/MINOR: backend: balance uri specific options were lost across defaults - BUG/MINOR: backend: BE_LB_LKUP_CHTREE is a value, not a bit - MINOR: backend: move url_param_name/len to lbprm.arg_str/len - MINOR: backend: make headers and RDP cookie also use arg_str/len - MINOR: backend: add new fields in lbprm to store more LB options - MINOR: backend: make the header hash use arg_opt1 for use_domain_only - MINOR: backend: remap the balance uri settings to lbprm.arg_opt{1,2,3} - MINOR: backend: move hash_balance_factor out of chash - MEDIUM: backend: move all LB algo parameters into an union - MINOR: backend: make the random algorithm support a number of draws - BUILD/MEDIUM: da: Necessary code changes for new buffer API. - BUG/MINOR: stick_table: Prevent conn_cur from underflowing - BUG: 51d: Changes to the buffer API in 1.9 were not applied to the 51Degrees code. - BUG/MEDIUM: stats: Get the right scope pointer depending on HTX is used or not - DOC: add a missing space in the documentation for bc_http_major - REGTEST: checks basic stats webpage functionality - BUG/MEDIUM: servers: Make assign_tproxy_address work when ALPN is set. - BUG/MEDIUM: connections: Add the CO_FL_CONNECTED flag if a send succeeded. - DOC: add github issue templates - MINOR: cfgparse: Extract some code to be re-used. - CLEANUP: cfgparse: Return asap from cfg_parse_peers(). - CLEANUP: cfgparse: Code reindentation. - MINOR: cfgparse: Useless frontend initialization in "peers" sections. - MINOR: cfgparse: Rework peers frontend init. - MINOR: cfgparse: Simplication. - MINOR: cfgparse: Make "peer" lines be parsed as "server" lines. - MINOR: peers: Make outgoing connection to SSL/TLS peers work. - MINOR: cfgparse: SSL/TLS binding in "peers" sections. - DOC: peers: SSL/TLS documentation for "peers" - BUG/MINOR: startup: certain goto paths in init_pollers fail to free - BUG/MEDIUM: checks: fix recent regression on agent-check making it crash - BUG/MINOR: server: don't always trust srv_check_health when loading a server state - BUG/MINOR: check: Wake the check task if the check is finished in wake_srv_chk() - BUG/MEDIUM: ssl: Fix handling of TLS 1.3 KeyUpdate messages - DOC: mention the effect of nf_conntrack_tcp_loose on src/dst - BUG/MINOR: proto-htx: Return an error if all headers cannot be received at once - BUG/MEDIUM: mux-h2/htx: Respect the channel's reserve - BUG/MINOR: mux-h1: Apply the reserve on the channel's buffer only - BUG/MINOR: mux-h1: avoid copying output over itself in zero-copy - BUG/MAJOR: mux-h2: don't destroy the stream on failed allocation in h2_snd_buf() - BUG/MEDIUM: backend: also remove from idle list muxes that have no more room - BUG/MEDIUM: mux-h2: properly abort on trailers decoding errors - MINOR: h2: declare new sets of frame types - BUG/MINOR: mux-h2: CONTINUATION in closed state must always return GOAWAY - BUG/MINOR: mux-h2: headers-type frames in HREM are always a connection error - BUG/MINOR: mux-h2: make it possible to set the error code on an already closed stream - BUG/MINOR: hpack: return a compression error on invalid table size updates - MINOR: server: make sure pool-max-conn is >= -1 - BUG/MINOR: stream: take care of synchronous errors when trying to send - CLEANUP: server: fix indentation mess on idle connections - BUG/MINOR: mux-h2: always check the stream ID limit in h2_avail_streams() - BUG/MINOR: mux-h2: refuse to allocate a stream with too high an ID - BUG/MEDIUM: backend: never try to attach to a mux having no more stream available - MINOR: server: add a max-reuse parameter - MINOR: mux-h2: always consider a server's max-reuse parameter - MEDIUM: stream-int: always mark pending outgoing SI_ST_CON - MINOR: stream: don't wait before retrying after a failed connection reuse - MEDIUM: h2: always parse and deduplicate the content-length header - BUG/MINOR: mux-h2: always compare content-length to the sum of DATA frames - CLEANUP: h2: Remove debug printf in mux_h2.c - MINOR: cfgparse: make the process/thread parser support a maximum value - MINOR: threads: make MAX_THREADS configurable at build time - DOC: nbthread is no longer experimental. - BUG/MINOR: listener: always fill the source address for accepted socketpairs - BUG/MINOR: mux-h2: do not report available outgoing streams after GOAWAY - BUG/MINOR: spoe: corrected fragmentation string size - BUG/MINOR: task: fix possibly missed event in inter-thread wakeups - BUG/MEDIUM: servers: Attempt to reuse an unfinished connection on retry. - BUG/MEDIUM: backend: always call si_detach_endpoint() on async connection failure - SCRIPTS: add the issue tracker URL to the announce script - MINOR: peers: Extract some code to be reused. - CLEANUP: peers: Indentation fixes. - MINOR: peers: send code factorization. - MINOR: peers: Add new functions to send code and reduce the I/O handler. - MEDIUM: peers: synchronizaiton code factorization to reduce the size of the I/O handler. - MINOR: peers: Move update receive code to reduce the size of the I/O handler. - MINOR: peers: Move ack, switch and definition receive code to reduce the size of the I/O handler. - MINOR: peers: Move high level receive code to reduce the size of I/O handler. - CLEANUP: peers: Be more generic. - MINOR: peers: move error handling to reduce the size of the I/O handler. - MINOR: peers: move messages treatment code to reduce the size of the I/O handler. - MINOR: peers: move send code to reduce the size of the I/O handler. - CLEANUP: peers: Remove useless statements. - MINOR: peers: move "hello" message treatment code to reduce the size of the I/O handler. - MINOR: peers: move peer initializations code to reduce the size of the I/O handler. - CLEANUP: peers: factor the error handling code in peer_treet_updatemsg() - CLEANUP: peers: factor error handling in peer_treat_definedmsg() - BUILD/MINOR: peers: shut up a build warning introduced during last cleanup - BUG/MEDIUM: mux-h2: only close connection on request frames on closed streams - CLEANUP: mux-h2: remove two useless but misleading assignments - BUG/MEDIUM: checks: Check that conn_install_mux succeeded. - BUG/MEDIUM: servers: Only destroy a conn_stream we just allocated. - BUG/MEDIUM: servers: Don't add an incomplete conn to the server idle list. - BUG/MEDIUM: checks: Don't try to set ALPN if connection failed. - BUG/MEDIUM: h2: In h2_send(), stop the loop if we failed to alloc a buf. - BUG/MEDIUM: peers: Handle mux creation failure. - BUG/MEDIUM: servers: Close the connection if we failed to install the mux. - BUG/MEDIUM: compression: Rewrite strong ETags - BUG/MINOR: deinit: tcp_rep.inspect_rules not deinit, add to deinit - CLEANUP: mux-h2: remove misleading leftover test on h2s' nullity - BUG/MEDIUM: mux-h2: wake up flow-controlled streams on initial window update - BUG/MEDIUM: mux-h2: fix two half-closed to closed transitions - BUG/MEDIUM: mux-h2: make sure never to send GOAWAY on too old streams - BUG/MEDIUM: mux-h2: do not abort HEADERS frame before decoding them - BUG/MINOR: mux-h2: make sure response HEADERS are not received in other states than OPEN and HLOC - MINOR: h2: add a generic frame checker - MEDIUM: mux-h2: check the frame validity before considering the stream state - CLEANUP: mux-h2: remove stream ID and frame length checks from the frame parsers - BUG/MINOR: mux-h2: make sure request trailers on aborted streams don't break the connection - DOC: compression: Update the reasons for disabled compression - BUG/MEDIUM: buffer: Make sure b_is_null handles buffers waiting for allocation. - DOC: htx: make it clear that htxbuf() and htx_from_buf() always return valid pointers - MINOR: htx: never check for null htx pointer in htx_is_{,not_}empty() - MINOR: mux-h2: consistently rely on the htx variable to detect the mode - BUG/MEDIUM: peers: Peer addresses parsing broken. - BUG/MEDIUM: mux-h1: Don't add "transfer-encoding" if message-body is forbidden - BUG/MEDIUM: connections: Don't forget to remove CO_FL_SESS_IDLE. - BUG/MINOR: stream: don't close the front connection when facing a backend error - BUG/MEDIUM: mux-h2: wait for the mux buffer to be empty before closing the connection - MINOR: stream-int: add a new flag to mention that we want the connection to be killed - MINOR: connstream: have a new flag CS_FL_KILL_CONN to kill a connection - BUG/MEDIUM: mux-h2: do not close the connection on aborted streams - BUG/MINOR: server: fix logic flaw in idle connection list management - MINOR: mux-h2: max-concurrent-streams should be unsigned - MINOR: mux-h2: make sure to only check concurrency limit on the frontend - MINOR: mux-h2: learn and store the peer's advertised MAX_CONCURRENT_STREAMS setting - BUG/MEDIUM: mux-h2: properly consider the peer's advertised max-concurrent-streams - MINOR: xref: Add missing barriers. - MINOR: muxes: Don't bother to LIST_DEL(&conn->list) before calling conn_free(). - MINOR: debug: Add an option that causes random allocation failures. - BUG/MEDIUM: backend: always release the previous connection into its own target srv_list - BUG/MEDIUM: htx: check the HTX compatibility in dynamic use-backend rules - BUG/MINOR: tune.fail-alloc: Don't forget to initialize ret. - BUG/MINOR: backend: check srv_conn before dereferencing it - BUG/MEDIUM: mux-h2: always omit :scheme and :path for the CONNECT method - BUG/MEDIUM: mux-h2: always set :authority on request output - BUG/MEDIUM: stream: Don't forget to free s->unique_id in stream_free(). - BUG/MINOR: threads: fix the process range of thread masks - BUG/MINOR: config: fix bind line thread mask validation - CLEANUP: threads: fix misleading comment about all_threads_mask - CLEANUP: threads: use nbits to calculate the thread mask - OPTIM: listener: optimize cache-line packing for struct listener - MINOR: tools: improve the popcount() operation - MINOR: config: keep an all_proc_mask like we have all_threads_mask - MINOR: global: add proc_mask() and thread_mask() - MINOR: config: simplify bind_proc processing using proc_mask() - MINOR: threads: make use of thread_mask() to simplify some thread calculations - BUG/MINOR: compression: properly report compression stats in HTX mode - BUG/MINOR: task: close a tiny race in the inter-thread wakeup - BUG/MAJOR: config: verify that targets of track-sc and stick rules are present - BUG/MAJOR: spoe: verify that backends used by SPOE cover all their callers' processes - BUG/MAJOR: htx/backend: Make all tests on HTTP messages compatible with HTX - BUG/MINOR: config: make sure to count the error on incorrect track-sc/stick rules - DOC: ssl: Clarify when pre TLSv1.3 cipher can be used - DOC: ssl: Stop documenting ciphers example to use - BUG/MINOR: spoe: do not assume agent->rt is valid on exit - BUG/MINOR: lua: initialize the correct idle conn lists for the SSL sockets - BUG/MEDIUM: spoe: initialization depending on nbthread must be done last - BUG/MEDIUM: server: initialize the idle conns list after parsing the config - BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end - MINOR: config: make MAX_PROCS configurable at build time - BUG/MAJOR: spoe: Don't try to get agent config during SPOP healthcheck - BUG/MINOR: config: Reinforce validity check when a process number is parsed - BUG/MEDIUM: peers: check that p->srv actually exists before using p->srv->use_ssl - CONTRIB: contrib/prometheus-exporter: Add a Prometheus exporter for HAProxy - BUG/MINOR: mux-h1: verify the request's version before dropping connection: keep-alive - BUG: 51d: In Hash Trie, multi header matching was affected by the header names stored globaly. - MEDIUM: 51d: Enabled multi threaded operation in the 51Degrees module. - BUG/MAJOR: stream: avoid double free on unique_id - BUILD/MINOR: stream: avoid a build warning with threads disabled - BUILD/MINOR: tools: fix build warning in the date conversion functions - BUILD/MINOR: peers: remove an impossible null test in intencode() - BUILD/MINOR: htx: fix some potential null-deref warnings with http_find_stline - BUG/MEDIUM: peers: Missing peer initializations. - BUG/MEDIUM: http_fetch: fix the "base" and "base32" fetch methods in HTX mode - BUG/MEDIUM: proto_htx: Fix data size update if end of the cookie is removed - BUG/MEDIUM: http_fetch: fix "req.body_len" and "req.body_size" fetch methods in HTX mode - BUILD/MEDIUM: initcall: Fix build on MacOS. - BUG/MEDIUM: mux-h2/htx: Always set CS flags before exiting h2_rcv_buf() - MINOR: h2/htx: Set the flag HTX_SL_F_BODYLESS for messages without body - BUG/MINOR: mux-h1: Add "transfer-encoding" header on outgoing requests if needed - BUG/MINOR: mux-h2: Don't add ":status" pseudo-header on trailers - BUG/MINOR: proto-htx: Consider a XFER_LEN message as chunked by default - BUG/MEDIUM: h2/htx: Correctly handle interim responses when HTX is enabled - MINOR: mux-h2: Set HTX extra value when possible - BUG/MEDIUM: htx: count the amount of copied data towards the final count - MINOR: mux-h2: make the H2 MAX_FRAME_SIZE setting configurable - BUG/MEDIUM: mux-h2/htx: send an empty DATA frame on empty HTX trailers - BUG/MEDIUM: servers: Use atomic operations when handling curr_idle_conns. - BUG/MEDIUM: servers: Add a per-thread counter of idle connections. - MINOR: fd: add a new my_closefrom() function to close all FDs - MINOR: checks: use my_closefrom() to close all FDs - MINOR: fd: implement an optimised my_closefrom() function - BUG/MINOR: fd: make sure my_closefrom() doesn't miss some FDs - BUG/MAJOR: fd/threads, task/threads: ensure all spin locks are unlocked - BUG/MAJOR: listener: Make sure the listener exist before using it. - MINOR: fd: Use closefrom() as my_closefrom() if supported. - BUG/MEDIUM: mux-h1: Report the right amount of data xferred in h1_rcv_buf() - BUG/MINOR: channel: Set CF_WROTE_DATA when outgoing data are skipped - MINOR: htx: Add function to drain data from an HTX message - MINOR: channel/htx: Add function to skips output bytes from an HTX channel - BUG/MAJOR: cache/htx: Set the start-line offset when a cached object is served - BUG/MEDIUM: cache: Get objects from the cache only for GET and HEAD requests - BUG/MINOR: cache/htx: Return only the headers of cached objects to HEAD requests - BUG/MINOR: mux-h1: Always initilize h1m variable in h1_process_input() - BUG/MEDIUM: proto_htx: Fix functions applying regex filters on HTX messages - BUG/MEDIUM: h2: advertise to servers that we don't support push - MINOR: standard: Add a function to parse uints (dotted notation). - MINOR: arg: Add support for ARGT_PBUF_FNUM arg type. - MINOR: http_fetch: add "req.ungrpc" sample fetch for gRPC. - MINOR: sample: Add two sample converters for protocol buffers. - DOC: sample: Add gRPC related documentation.	2019-02-26 16:43:49 +01:00
Fr�d�ric L�caille	fd95c62f1b	MINOR: sample: Add two sample converters for protocol buffers. Add "varint" to convert all the protocol buffers binary varints excepted the signed ones ("sint32" and "sint64") to an integer. The binary signed varints may be converted to an integer with "svarint" converter implemented by this patch. These two new converters do not take any argument.	2019-02-26 16:27:05 +01:00
Fr�d�ric L�caille	1fceee8316	MINOR: http_fetch: add "req.ungrpc" sample fetch for gRPC. This patch implements "req.ungrpc" sample fetch method to decode and parse a gRPC request. It takes only one argument: a protocol buffers field number to identify the protocol buffers message number to be looked up. This argument is a sort of path in dotted notation to the terminal field number to be retrieved. ex: req.ungrpc(1.2.3.4) This sample fetch catch the data in raw mode, without interpreting them. Some protocol buffers specific converters may be used to convert the data to the correct type.	2019-02-26 16:27:05 +01:00
Fr�d�ric L�caille	3a463c92cf	MINOR: arg: Add support for ARGT_PBUF_FNUM arg type. This new argument type is used to parse Protocol Buffers field number with dotted notation (e.g: 1.2.3.4).	2019-02-26 16:27:05 +01:00
Fr�d�ric L�caille	3b71716685	MINOR: standard: Add a function to parse uints (dotted notation). This function is useful to parse strings made of unsigned integers and to allocate a C array of unsigned integers from there. For instance this function allocates this array { 1, 2, 3, 4, } from this string: "1.2.3.4".	2019-02-26 16:27:05 +01:00
Willy Tarreau	0bbad6bb06	BUG/MEDIUM: h2: advertise to servers that we don't support push The h2c_send_settings() function was initially made to serve on the frontend. Here we don't need to advertise that we don't support PUSH since we don't do that ourselves. But on the backend side it's different because PUSH is enabled by default so we must announce that we don't want the server to use it. This must be backported to 1.9.	2019-02-26 16:07:27 +01:00
Christopher Faulet	02e771a9e0	BUG/MEDIUM: proto_htx: Fix functions applying regex filters on HTX messages The HTX functions htx_apply_filter_to_req_headers() and htx_apply_filter_to_resp_headers() contain 2 bugs. The first one is about the matching on each header. The chunk 'hdr' used to format a full header line was never reset. The second bug appears when we try to replace or remove a header. The variable ctx was not fully initialized, leading to sefaults. This patch must be backported to 1.9.	2019-02-26 15:45:02 +01:00
Christopher Faulet	7402776c52	BUG/MINOR: mux-h1: Always initilize h1m variable in h1_process_input() It is used at the end of the function to know if the end of the message was reached. So we must be sure to always initialize it. This patch must be backported to 1.9.	2019-02-26 14:51:17 +01:00
Christopher Faulet	f0dd037456	BUG/MINOR: cache/htx: Return only the headers of cached objects to HEAD requests The body of a cached object must not be sent in response to a HEAD request. This works for the legacy HTTP because the parsing is performed by HTTP analyzers _AND_ because the connection is closed at the end of the transaction. So the body is ignored. But the applet send it. For the HTX, the applet must skip the body explicitly. This patch must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	b3d4bca415	BUG/MEDIUM: cache: Get objects from the cache only for GET and HEAD requests Only responses for GET requests are stored in the cache. But there is no check on the method during the lookup. So it is possible to retrieve an object from the cache independently of the method, from the time the key of the object matches. Now, lookups are performed only for GET and HEAD requests. This patch must be backportedi in 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	a0df957471	BUG/MAJOR: cache/htx: Set the start-line offset when a cached object is served When the function htx_add_stline() is used, this offset is automatically set when necessary. But the HTX cache applet adds all header blocks of the responses manually, including the start-line. So its offset must be explicitly set by the applet. When everything goes well, the HTTP analyzer http_wait_for_response() looks for the start-line in the HTX messages, calling http_find_stline(). If necessary, the start-line offet will also be automatically set during this stage. So the bug of the HTX cache applet does not hurt most of the time. But, when an error occurred, HTTP responses analyzers can be bypassed. In such caese, the start-line offset of cached responses remains unset. Some part of the code relies on the start-line offset to process the HTX messages. Among others, when H2 responses are sent to clients, the H2 multiplexer read the start-line without any check, because it _MUST_ always be there. if its offset is not set, a NULL pointer is dereferenced leading to a segfault. The patch must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	549822f0a1	MINOR: htx: Add function to drain data from an HTX message The function htx_drain() can now be used to drain data from an HTX message. It will be used by other commits to fix bugs, so it must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	b8d2ee0406	BUG/MEDIUM: mux-h1: Report the right amount of data xferred in h1_rcv_buf() h1_rcv_buf() must return the amount of data copied in the channel's buffer and not the number of bytes parsed. Because this value is used during the fast forwarding to decrement to_forward value, returning the wrong value leads to undefined behaviours. This patch must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Olivier Houchard	2292edf67c	MINOR: fd: Use closefrom() as my_closefrom() if supported. Add a new option, USE_CLOSEFROM. If set, it is assumed the system provides a closefrom() function, so use it. It is only implicitely used on FreeBSD for now, it should work on OpenBSD/NetBSD/DragonflyBSD/Solaris too, but as I have no such system to test it, I'd rather leave it disabled by default. Users can add USE_CLOSEFROM explicitely on their make command line to activate it.	2019-02-25 16:51:03 +01:00
Olivier Houchard	d16a9dfed8	BUG/MAJOR: listener: Make sure the listener exist before using it. In listener_accept(), make sure we have a listener before attempting to use it. An another thread may have closed the FD meanwhile, and set fdtab[fd].owner to NULL. As the listener is not free'd, it is ok to attempt to accept() a new connection even if the listener was closed. At worst the fd has been reassigned to another connection, and accept() will fail anyway. Many thanks to Richard Russo for reporting the problem, and suggesting the fix. This should be backported to 1.9 and 1.8.	2019-02-25 16:30:13 +01:00
Richard Russo	bc9d9844d5	BUG/MAJOR: fd/threads, task/threads: ensure all spin locks are unlocked Calculate if the fd or task should be locked once, before locking, and reuse the calculation when determing when to unlock. Fixes a race condition added in `87d54a9a` for fds, and `b20aa9ee` for tasks, released in 1.9-dev4. When one thread modifies thread_mask to be a single thread for a task or fd while a second thread has locked or is waiting on a lock for that task or fd, the second thread will not unlock it. For FDs, this is observable when a listener is polled by multiple threads, and is closed while those threads have events pending. For tasks, this seems possible, where task_set_affinity is called, but I did not observe it. This must be backported to 1.9.	2019-02-25 16:16:36 +01:00
Willy Tarreau	b8e602cb1b	BUG/MINOR: fd: make sure my_closefrom() doesn't miss some FDs The optimized my_closefrom() implementation introduced with previous commit `9188ac60e` ("MINOR: fd: implement an optimised my_closefrom() function") has a small bug causing it to miss some FDs at the end of each batch. The reason is that poll() returns the number of non-zero events, so it contains the size of the batch minus the FDs to close. Thus if the FDs to close are at the beginning they'll be seen but if they're at the end after all other closed ones, the returned count will not cover them. No backport is needed.	2019-02-22 09:07:42 +01:00
Willy Tarreau	9188ac60eb	MINOR: fd: implement an optimised my_closefrom() function The idea is that poll() can set the POLLNVAL flag for each invalid FD in a pollfd list. Thus this function makes use of poll() when compiled in, and builds lists of up to 1024 FDs at once, checks the output and only closes those which do not have this flag set. Tests show that this is about twice as fast as blindly calling close() for each closed fd.	2019-02-21 23:07:24 +01:00
Willy Tarreau	2555ccf4d0	MINOR: checks: use my_closefrom() to close all FDs Instead of looping on all FDs, let's use my_closefrom() which does it respecting the current process' limits and possibly doing it more efficiently.	2019-02-21 22:22:06 +01:00
Willy Tarreau	2d7f81b809	MINOR: fd: add a new my_closefrom() function to close all FDs This is a naive implementation of closefrom() which closes all FDs starting from the one passed in argument. closefrom() is not provided on all operating systems, and other versions will follow.	2019-02-21 22:19:17 +01:00
Olivier Houchard	f131481a0a	BUG/MEDIUM: servers: Add a per-thread counter of idle connections. Add a per-thread counter of idling connections, and use it to determine how many connections we should kill after the timeout, instead of using the global counter, or we're likely to just kill most of the connections. This should be backported to 1.9.	2019-02-21 19:07:45 +01:00
Olivier Houchard	e737103173	BUG/MEDIUM: servers: Use atomic operations when handling curr_idle_conns. Use atomic operations when dealing with srv->curr_idle_conns, as it's shared between threads, otherwise we could get inconsistencies. This should be backported to 1.9.	2019-02-21 19:07:19 +01:00
Willy Tarreau	67b8caefc9	BUG/MEDIUM: mux-h2/htx: send an empty DATA frame on empty HTX trailers When chunked-encoding is used in HTX mode, a trailers HTX block is always made due to the way trailers are currently implemented (verbatim copy of the H1 representation). Because of this it's not possible to know when processing data that we've reached the end of the stream, and it's up to the function encoding the trailers (h2s_htx_make_trailers) to put the end of stream. But when there are no trailers and only an empty HTX block, this one cannot produce a HEADERS frame, thus it cannot send the END_STREAM flag either, leaving the other end with an incomplete message, waiting for either more data or some trailers. This is particularly visible with POST requests where the server continues to wait. What this patch does is transform the HEADERS frame into an empty DATA frame when meeting an empty trailers block. It is possible to do this because we've not sent any trailers so the other end is still waiting for DATA frames. The check is made after attempting to encode the list of headers, so as to minimize the specific code paths. Thanks to Dragan Dosen for reporting the issue with a reproducer. This fix must be backported to 1.9.	2019-02-21 18:22:26 +01:00
Willy Tarreau	a24b35ca18	MINOR: mux-h2: make the H2 MAX_FRAME_SIZE setting configurable This creates a new tunable "tune.h2.max-frame-size" to adjust the advertised max frame size. When not set it still defaults to the buffer size. It is convenient to advertise sizes lower than the buffer size, for example when using very large buffers.	2019-02-21 17:30:59 +01:00
Willy Tarreau	2bf0c13261	BUG/MEDIUM: htx: count the amount of copied data towards the final count Currently htx_xfer_blks() respects the <count> limit for each block instead of for the sum of the transfered blocks. This causes it to return slightly more than requested when both headers and data are present in the source buffer, which happens early in the transfer when the reserve is still active. Thus with large enough headers, the reserve will not be respected. Note that this function is only called from h2_rcv_buf() thus this only affects data entering over H2 (H2 requests or H2 responses). This fix must be backported to 1.9.	2019-02-21 17:13:07 +01:00
Christopher Faulet	eaf0d2a936	MINOR: mux-h2: Set HTX extra value when possible For now, this can be only done when a content-length is specified. In fact, it is the same value than h2s->body_len, the remaining body length according to content-length. Setting this field allows the fast forwarding at the channel layer, improving significantly data transfer for big objects. This patch may be backported to 1.9.	2019-02-19 16:26:14 +01:00
Christopher Faulet	0b46548a68	BUG/MEDIUM: h2/htx: Correctly handle interim responses when HTX is enabled 1xx responses does not work in HTTP2 when the HTX is enabled. First of all, when a response is parsed, only one HEADERS frame is expected. So when an interim response is received, the flag H2_SF_HEADERS_RCVD is set and the next HEADERS frame (for another interim repsonse or the final one) is parsed as a trailers one. Then when the response is sent, because an EOM block is found at the end of the interim HTX response, the ES flag is added on the frame, closing too early the stream. Here, it is a design problem of the HTX. Iterim responses are considered as full messages, leading to some ambiguities when HTX messages are processed. This will not be fixed now, but we need to keep it in mind for future improvements. To fix the parsing bug, the flag H2_MSGF_RSP_1XX is added when the response headers are decoded. When this flag is set, an EOM block is added into the HTX message, despite the fact that there is no ES flag on the frame. And we don't set the flag H2_SF_HEADERS_RCVD on the corresponding H2S. So the next HEADERS frame will not be parsed as a trailers one. To fix the sending bug, the ES flag is not set on the frame when an interim response is processed and the flag H2_SF_HEADERS_SENT is not set on the corresponding H2S. This patch must be backported to 1.9.	2019-02-19 16:26:14 +01:00
Christopher Faulet	834eee7928	BUG/MINOR: proto-htx: Consider a XFER_LEN message as chunked by default An HTX message with a known body length is now considered by default as chunked. It means the header "content-length" must be found to consider it as a non-chunked message. Before, it was the reverse, the message was considered with a content length by default. But it is a bug for HTTP/2 messages. There is no chunked transfer encoding in HTTP/2 but internally messages without content length are considered as chunked. It eases HTTP/1 <-> HTTP/2 conversions. This patch must be backported to 1.9.	2019-02-18 16:25:06 +01:00
Christopher Faulet	fd74267264	BUG/MINOR: mux-h2: Don't add ":status" pseudo-header on trailers It is a cut-paste bug. Pseudo-header fields MUST NOT appear in trailers. This patch must be backported to 1.9.	2019-02-18 16:25:06 +01:00
Christopher Faulet	1f890ddbe2	BUG/MINOR: mux-h1: Add "transfer-encoding" header on outgoing requests if needed As for outgoing response, if an HTTP/1.1 or above request is sent to a server with neither the headers "content-length" nor "transfer-encoding", it is considered as a chunked request and the header "transfer-encoding: chunked" is automatically added. Of course, it is only true for requests with a body. Concretely, it only happens for incoming HTTP/2 requests sent to an HTTP/1.1 server. This patch must be backported to 1.9.	2019-02-18 16:25:06 +01:00
Christopher Faulet	44af3cfca3	MINOR: h2/htx: Set the flag HTX_SL_F_BODYLESS for messages without body This information is usefull to know if a body is expected or not, regardless the presence or not of the header "Content-Length" and its value. Once the ES flag is set on the header frame or when the content length is 0, we can safely add the flag HTX_SL_F_BODYLESS on the HTX start-line. Among other things, it will help the mux-h1 to know if it should add TE header or not. It will also help the HTTP compression filter. This patch must be backported to 1.9 because a bug fix depends on it.	2019-02-18 16:25:06 +01:00
Christopher Faulet	37070b2b15	BUG/MEDIUM: mux-h2/htx: Always set CS flags before exiting h2_rcv_buf() It is especially important when some data are blocked in the RX buf and the channel buffer is already full. In such case, instead of exiting the function directly, we need to set right flags on the conn_stream. CS_FL_RCV_MORE and CS_FL_WANT_ROOM must be set, otherwise, the stream-interface will subscribe to receive events, thinking it is not blocked. This bug leads to connection freeze when everything was received with some data blocked in the RX buf and a channel full. This patch must be backported to 1.9.	2019-02-18 16:25:06 +01:00
Dragan Dosen	5a606685f1	BUG/MEDIUM: http_fetch: fix "req.body_len" and "req.body_size" fetch methods in HTX mode When in HTX mode, in functions smp_fetch_body_len() and smp_fetch_body_size() we were subtracting the size of each header block from the total size htx->data to calculate the size of body, and that could result in wrong calculated value. To avoid this, we now loop on blocks to sum up the size of only those that are of type HTX_BLK_DATA. This patch must be backported to 1.9.	2019-02-14 15:41:17 +01:00
Christopher Faulet	6cdaf2ad9a	BUG/MEDIUM: proto_htx: Fix data size update if end of the cookie is removed When client-side or server-side cookies are parsed, if the end of the cookie line is removed, the HTX message must be updated. The length of the HTX block is decreased and the data size of the HTX message is modified accordingly. The update of the HTX block was ok but the update of the HTX message was wrong, leading to undefined behaviours during the data forwarding. One of possible effect was a freeze of the connection and no data forward. This patch must be backported in 1.9.	2019-02-13 09:56:54 +01:00
Dragan Dosen	8861e1c082	BUG/MEDIUM: http_fetch: fix the "base" and "base32" fetch methods in HTX mode The resulting value produced in functions smp_fetch_base() and smp_fetch_base32() was wrong when in HTX mode. This patch also adds the semicolon at the end of the for-loop line, used in function smp_fetch_path(), since it's actually with no body. This must be backported to 1.9.	2019-02-12 20:43:03 +01:00
Fr�d�ric L�caille	76d2cef0c2	BUG/MEDIUM: peers: Missing peer initializations. Initialize ->srv peer field for all the peers, the local peer included. Indeed, a haproxy process needs to connect to the local peer of a remote process. Furthermore, when a "peer" or "server" line is parsed by parse_server() the address must be copied to ->addr field of the peer object only if this address has been also parsed by parse_server(). This is not the case if this address belongs to the local peer and is provided on a "server" line. After having parsed the "peer" or "server" lines of a peer sections, the ->srv part of all the peer must be initialized for SSL, if enabled. Same thing for the binding part. Revert `1417f0b` commit which is no more required. No backport is needed, this is purely 2.0.	2019-02-12 19:49:22 +01:00
Willy Tarreau	cdce54c2b7	BUILD/MINOR: htx: fix some potential null-deref warnings with http_find_stline http_find_stline() carefully verifies that it finds a start line otherwise returns NULL when not found. But a few calling functions ignore this NULL in return and dereference this pointer without checking. Let's add the test where needed in the callers. If it turns out that over the long term a start line is mandatory, then the test will be removed and the faulty function will have to be simplified. This must be backported to 1.9.	2019-02-12 12:02:27 +01:00
Willy Tarreau	9bdd7bc63d	BUILD/MINOR: peers: remove an impossible null test in intencode() intencode() tests for the nullity of the target pointer passed in argument, but the code calling intencode() never does so and happily dereferences it. gcc at -O3 detects this as a potential null deref. Let's remove this incorrect and misleading test. If this pointer was null, the code would already crash in the calling functions. This must be backported to stable versions.	2019-02-12 11:59:35 +01:00
Willy Tarreau	4eee38aa57	BUILD/MINOR: tools: fix build warning in the date conversion functions Some gcc versions emit potential null deref warnings at -O3 in date2str_log(), gmt2str_log() and localdate2str_log() after utoa_pad() because this function may return NULL if its size argument is too small for the integer value. And it's true that we can't guarantee that the input number is always valid. This must be backported to all stable versions.	2019-02-12 11:30:04 +01:00
Willy Tarreau	1ef724e216	BUILD/MINOR: stream: avoid a build warning with threads disabled gcc 6+ complains about a possible null-deref here due to the test in objt_server() : if (objt_server(s->target)) HA_ATOMIC_ADD(&objt_server(s->target)->counters.retries, 1); Let's simply change it to __objt_server(). This can be backported to 1.9 and 1.8.	2019-02-12 10:59:32 +01:00
Willy Tarreau	09c4bab411	BUG/MAJOR: stream: avoid double free on unique_id Commit `32211a1` ("BUG/MEDIUM: stream: Don't forget to free s->unique_id in stream_free().") addressed a memory leak but in exchange may cause double-free due to the fact that after freeing s->unique_id it doesn't null it and then calls http_end_txn() which frees it again. Thus the process quickly crashes at runtime. This fix must be backported to all stable branches where the aforementioned patch was backported.	2019-02-10 18:49:37 +01:00
Ben51Degrees	4ddf59d070	MEDIUM: 51d: Enabled multi threaded operation in the 51Degrees module. The existing threading flag in the 51Degrees API (FIFTYONEDEGREES_NO_THREADING) has now been mapped to the HAProxy threading flag (USE_THREAD), and the 51Degrees module code has been made thread safe. In Pattern, the cache is now locked with a spin lock from hathreads.h using a new lable 'OTHER_LOCK'. The workset pool is now created with the same size as the number of threads to avoid any time waiting on a worket. In Hash Trie, the global device offsets structure is only used in single threaded operation. Multi threaded operation creates a new offsets structure in each thread.	2019-02-08 21:29:23 +01:00
Ben51Degrees	e0f6a2a2aa	BUG: 51d: In Hash Trie, multi header matching was affected by the header names stored globaly. Some logic around mutli header matching in Hash Trie has been improved where only the name of the most important header was stored in the global heade_names structure. Now all headers are stored, so are used in the mutli header matching correctly.	2019-02-08 21:29:08 +01:00
Willy Tarreau	7701cad444	BUG/MINOR: mux-h1: verify the request's version before dropping connection: keep-alive The mux h1 properly avoid to set "connection: keep-alive" when the response is in HTTP/1.1 but it forgot to check the request's version. Thus when the client requests using HTTP/1.0 and connection: keep-alive (like ab does), the response is in 1.1 with no keep-alive and ab waits for the close without checking for the content-length. Response headers actually depend on the recipient, thus on both request and response's version. This patch must be backported to 1.9.	2019-02-08 15:38:22 +01:00
Christopher Faulet	18cca781f5	BUG/MINOR: config: Reinforce validity check when a process number is parsed Now, in the function parse_process_number(), when a process number or a set of processes is parsed, an error is triggered if an invalid character is found. It means following syntaxes are not forbidden and will emit an alert during the HAProxy startup: 1a 1/2 1-2-3 This bug was reported on Github. See issue #36. This patch may be backported to 1.9 and 1.8.	2019-02-07 21:23:58 +01:00
Christopher Faulet	11389018bc	BUG/MAJOR: spoe: Don't try to get agent config during SPOP healthcheck During SPOP healthchecks, a dummy appctx is used to create the HAPROXY-HELLO frame and then to parse the AGENT-HELLO frame. No agent are attached to it. So it is important to not rely on an agent during these stages. When HAPROXY-HELLO frame is created, there is no problem, all accesses to an agent are guarded. This is not true during the parsing of the AGENT-HELLO frame. Thus, it is possible to crash HAProxy with a SPOA declaring the async or the pipelining capability during a healthcheck. This patch must be backported to 1.9 and 1.8.	2019-02-07 21:23:58 +01:00
Willy Tarreau	ff9c9140f4	MINOR: config: make MAX_PROCS configurable at build time For some embedded systems, it's pointless to have 32- or even 64- large arrays of processes when it's known that much fewer processes will be used in the worst case. Let's introduce this MAX_PROCS define which contains the highest number of processes allowed to run at once. It still defaults to LONGBITS but may be lowered.	2019-02-07 15:10:19 +01:00
Willy Tarreau	980855bd95	BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end This also depends on the nbthread count, so it must only be performed after parsing the whole config file. As a side effect, this removes some code duplication between servers and server-templates. This must be backported to 1.9.	2019-02-07 15:08:13 +01:00
Willy Tarreau	835daa119e	BUG/MEDIUM: server: initialize the idle conns list after parsing the config The idle conns lists are sized according to the number of threads. As such they cannot be initialized during the parsing since nbthread can be set later, as revealed by this simple config which randomly crashes when used. Let's do this at the end instead. listen proxy bind :4445 mode http timeout client 10s timeout server 10s timeout connect 10s http-reuse always server s1 127.0.0.1:8000 global nbthread 8 This fix must be backported to 1.9 and 1.8.	2019-02-07 15:08:13 +01:00
Willy Tarreau	b0769b2ed6	BUG/MEDIUM: spoe: initialization depending on nbthread must be done last The agent used to be configured depending on global.nbthread while nbthread may be set after the agent is parsed. Let's move this part to the spoe_check() function to make sure nbthread is always correct and arrays are appropriately sized. This fix must be backported to 1.9 and 1.8.	2019-02-07 15:08:13 +01:00
Willy Tarreau	b784b35ce8	BUG/MINOR: lua: initialize the correct idle conn lists for the SSL sockets Commit `40a007cf2` ("MEDIUM: threads/server: Make connection list (priv/idle/safe) thread-safe") made a copy-paste error when initializing the Lua sockets, as the TCP one was initialized twice. Fortunately it has no impact because the pointers are set to NULL after a memset(0) and are not changed in between. This must be backported to 1.9 and 1.8.	2019-02-07 15:08:13 +01:00
Willy Tarreau	3ddcf7643c	BUG/MINOR: spoe: do not assume agent->rt is valid on exit As reported by Christopher, we may call spoe_release_agent() when leaving after an allocation failure or a config parse error. We must not assume agent->rt is valid there as the allocation could have failed. This should be backported to 1.9 and 1.8.	2019-02-07 15:08:13 +01:00
Willy Tarreau	1a0fe3becd	BUG/MINOR: config: make sure to count the error on incorrect track-sc/stick rules When commit `151e1ca98` ("BUG/MAJOR: config: verify that targets of track-sc and stick rules are present") added a check for some process inconsistencies between rules and their stick tables, some errors resulted in a "return 0" statement, which is taken as "no error" in some cases. Let's fix this. This must be backported to all versions using the above commit.	2019-02-06 10:25:07 +01:00
Christopher Faulet	f7679ad4db	BUG/MAJOR: htx/backend: Make all tests on HTTP messages compatible with HTX A piece of code about the HTX was lost this summer, after the "big merge" (htx/http2/connection layer refactoring). I forgot to keep HTX changes in the functions connect_server() and assign_server(). So, this patch fixes "uri", "url_param" and "hdr" LB algorithms when the HTX is enabled. It also fixes evaluation of the "sni" expression on server lines. This issue was reported on github. See issue #32. This patch must be backported in 1.9.	2019-02-06 10:20:01 +01:00
Willy Tarreau	2bdcfde426	BUG/MAJOR: spoe: verify that backends used by SPOE cover all their callers' processes When a filter is installed on a proxy and references spoe, we must be absolutely certain that the whole chain is valid on a given process when running in multi-process mode. The problem here is that if a proxy 1 runs on process 1, referencing an SPOE agent itself based on a backend running on process 2, this last one will be completely deinited on process 1, and will thus cause random crashes when it gets messages from this proess. This patch makes sure that the whole chain is valid on all of the caller's processes. This fix must be backported to all spoe-enabled maintained versions. It may potentially disrupt configurations which already randomly crash. There hardly is any intermediary solution though, such configurations need to be fixed.	2019-02-05 13:37:19 +01:00
Willy Tarreau	151e1ca989	BUG/MAJOR: config: verify that targets of track-sc and stick rules are present Stick and track-sc rules may optionally designate a table in a different proxy. In this case, a number of verifications are made such as validating that this proxy actually exists. However, in multi-process mode, the target table might indeed exist but not be bound to the set of processes the rules will execute on. This will definitely result in a random behaviour especially if these tables do require peer synchronization, because some tasks will be started to try to synchronize form uninitialized areas. The typical issue looks like this : peers my-peers peer foo ... listen proxy bind-process 1 stick on src table ip ... backend ip bind-process 2 stick-table type ip size 1k peers my-peers While it appears obvious that the example above will not work, there are less obvious situations, such as having bind-process in a defaults section and having a larger set of processes for the referencing proxy than the referenced one. The present patch adds checks for such situations by verifying that all processes from the referencing proxy are present on the other one in all track-sc* and stick-* rules, and in sample fetch / converters referencing another table so that sc_inc_gpc0() and similar are safe as well. This fix must be backported to all maintained versions. It may potentially disrupt configurations which already randomly crash. There hardly is any intermediary solution though, such configurations need to be fixed.	2019-02-05 11:54:49 +01:00
Willy Tarreau	155acffc13	BUG/MINOR: task: close a tiny race in the inter-thread wakeup __task_wakeup() takes care of a small race that exists between threads, but it uses a store barrier that is not sufficient since apparently the state read after clearing the leaf_p pointer sometimes is incorrect. This results in missed wakeups between threads competing at a high rate. Let's use a full barrier instead to serialize the operations. This may be backported to 1.9 though it's extremely unlikely that this bug will ever manifest itself there.	2019-02-04 14:21:35 +01:00
Willy Tarreau	ef6fd85623	BUG/MINOR: compression: properly report compression stats in HTX mode When HTX support was added to HTTP compression, a set of counters was missed, namely comp_in and comp_byp, resulting in no stats being available for compression. This must be backported to 1.9.	2019-02-04 11:48:03 +01:00
Willy Tarreau	3d95717b58	MINOR: threads: make use of thread_mask() to simplify some thread calculations By doing so it's visible that some fd_insert() calls were relying on MAX_THREADS while all_threads_mask should have been more suitable.	2019-02-04 05:09:16 +01:00
Willy Tarreau	6daac19b3f	MINOR: config: simplify bind_proc processing using proc_mask() At a number of places we used to have null tests on bind_proc for listeners and proxies. Let's simplify all these tests by always having the proper bits reported via proc_mask().	2019-02-04 05:09:16 +01:00
Willy Tarreau	a38a7175b1	MINOR: config: keep an all_proc_mask like we have all_threads_mask This simplifies some mask comparisons at various places where nbits(global.nbproc) was used.	2019-02-04 05:09:15 +01:00
Willy Tarreau	fc647360e0	CLEANUP: threads: use nbits to calculate the thread mask It's pointless to do arithmetics by hand, we have a function for this.	2019-02-02 17:48:39 +01:00
Willy Tarreau	6b4a39adc4	BUG/MINOR: config: fix bind line thread mask validation When no nbproc is specified, a computation leads to reading bind_thread[-1] before checking if the thread mask is valid for a bind conf. It may either report a false warning and compute a wrong mask, or miss some incorrect configs. This must be backported to 1.9 and possibly 1.8.	2019-02-02 17:46:24 +01:00
Willy Tarreau	bbcf2b9e0d	BUG/MINOR: threads: fix the process range of thread masks Commit `421f02e` ("MINOR: threads: add a MAX_THREADS define instead of LONGBITS") used a MAX_THREADS macros to fix threads limits. However, one change was wrong as it affected the upper bound of the process loop when setting threads masks. No backport is needed.	2019-02-02 13:18:01 +01:00
Olivier Houchard	32211a17eb	BUG/MEDIUM: stream: Don't forget to free s->unique_id in stream_free(). In stream_free(), free s->unique_id. We may still have one, because it's allocated in log.c::strm_log() no matter what, even if it's a TCP connection and thus it won't get free'd by http_end_txn(). Failure to do so leads to a memory leak. This should probably be backported to all maintained branches.	2019-02-01 18:17:36 +01:00
Willy Tarreau	053c15750b	BUG/MEDIUM: mux-h2: always set :authority on request output PiBa-NL reported that some servers don't fall back to the Host header when :authority is absent. After studying all the combinations of Host and :authority, it appears that we always have to send the latter, hence we never need the former. In case of CONNECT method, the authority is retrieved from the URI part, otherwise it's extracted from the Host field. The tricky part is that we have to scan all headers for the Host header before dumping other headers. This is due to the fact that we must emit pseudo headers before other ones. One improvement could possibly be made later in the request parser to search and emit the Host header immediately if authority was not found. This would cost nothing on the vast marjority of requests and make the lookup faster on output since Host would appear first. This fix must be backported to 1.9.	2019-02-01 16:47:46 +01:00
Willy Tarreau	5be92ff23f	BUG/MEDIUM: mux-h2: always omit :scheme and :path for the CONNECT method This is mandated by RFC7540 #8.3, these pseudo-headers must not be emitted with the CONNECT method. This must be backported to 1.9.	2019-02-01 16:47:46 +01:00
Willy Tarreau	1da41ecf5b	BUG/MINOR: backend: check srv_conn before dereferencing it Commit `3c4e19f42` ("BUG/MEDIUM: backend: always release the previous connection into its own target srv_list") introduced a valid warning about a null-deref risk since we didn't check conn_new()'s return value. This patch must be backported to 1.9 with the patch above.	2019-02-01 16:47:46 +01:00
Olivier Houchard	9c4f08ae39	BUG/MINOR: tune.fail-alloc: Don't forget to initialize ret. In mem_should_fail(), if we don't want to fail the allocation, either because mem_fail_rate is 0, or because we're still initializing, don't forget to initialize ret, or we may return a non-zero value, and fail an allocation we didn't want to fail. This should only affect users that compile with DEBUG_FAIL_ALLOC.	2019-02-01 16:30:08 +01:00
Willy Tarreau	3e451842dc	BUG/MEDIUM: htx: check the HTX compatibility in dynamic use-backend rules I would have sworn it was done, probably we lost it during the refactoring. If a frontend is in HTX and the backend not (and conersely), this is normally detected at config parsing time unless the rule is dynamic. In this case we must abort with an error 500. The logs will report "RR" (resource issue while processing request) with the frontend and the backend assigned, so that it's possible to figure what was attempted. This must be backported to 1.9.	2019-02-01 15:09:54 +01:00
Willy Tarreau	3c4e19f429	BUG/MEDIUM: backend: always release the previous connection into its own target srv_list There was a bug reported in issue #19 regarding the fact that haproxy could mis-route requests to the wrong server. It turns out that when switching to another server, the old connection was put back into the srv_list corresponding to the stream's target instead of this connection's target. Thus if this connection was later picked, it was pointing to the wrong server. The patch fixes this and also clarifies the assignment to srv_conn->target so that it's clear we don't change it when picking it from the srv_list. This must be backported to 1.9 only.	2019-02-01 11:58:33 +01:00
Olivier Houchard	dc21ff778b	MINOR: debug: Add an option that causes random allocation failures. When compiling with DEBUG_FAIL_ALLOC, add a new option, tune.fail-alloc, that gives the percentage of chances an allocation fails. This is useful to check that allocation failures are always handled gracefully.	2019-01-31 19:38:25 +01:00
Olivier Houchard	9c9da5ee89	MINOR: muxes: Don't bother to LIST_DEL(&conn->list) before calling conn_free(). conn_free() already removes the connection from any idle list, so there's no need to do it in the mux code, just before calling conn_free().	2019-01-31 19:38:25 +01:00
Willy Tarreau	8694978892	BUG/MEDIUM: mux-h2: properly consider the peer's advertised max-concurrent-streams Till now we used to only rely on tune.h2.max-concurrent-streams but if a peer advertises a lower limit this can cause streams to be reset or even the conection to be killed. Let's respect the peer's value for outgoing streams. This patch should be backported to 1.9, though it depends on the following ones : BUG/MINOR: server: fix logic flaw in idle connection list management MINOR: mux-h2: max-concurrent-streams should be unsigned MINOR: mux-h2: make sure to only check concurrency limit on the frontend MINOR: mux-h2: learn and store the peer's advertised MAX_CONCURRENT_STREAMS setting	2019-01-31 19:38:25 +01:00
Willy Tarreau	2e2083ae5b	MINOR: mux-h2: learn and store the peer's advertised MAX_CONCURRENT_STREAMS setting We used not to take it into account because we only used the configured parameter everywhere. This patch makes sure we can actually learn the value advertised by the peer. We still enforce our own limit on top of it however, to make sure we can actually limit resources or stream concurrency in case of suboptimal server settings.	2019-01-31 19:38:25 +01:00
Willy Tarreau	fa1d357f05	MINOR: mux-h2: make sure to only check concurrency limit on the frontend h2_has_too_many_cs() was renamed to h2_frt_has_too_many_cs() to make it clear it's only used to throttle the frontend connection, and the call places were adjusted to only call this code on a front connection. In practice it was already the case since the H2_CF_DEM_TOOMANY flag is only set there. But now the ambiguity is removed.	2019-01-31 19:38:25 +01:00
Willy Tarreau	5a490b669e	MINOR: mux-h2: max-concurrent-streams should be unsigned We compare it to other unsigned values, let's make it unsigned as well.	2019-01-31 19:38:25 +01:00
Willy Tarreau	00f18a36b6	BUG/MINOR: server: fix logic flaw in idle connection list management With variable connection limits, it's not possible to accurately determine whether the mux is still in use by comparing usage and max to be equal due to the fact that one determines the capacity and the other one takes care of the context. This can cause some connections to be dropped before they reach their stream ID limit. It seems it could also cause some connections to be terminated with streams still alive if the limit was reduced to match the newly computed avail_streams() value, though this cannot yet happen with existing muxes. Instead let's switch to usage reports and simply check whether connections are both unused and available before adding them to the idle list. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Willy Tarreau	180590409f	BUG/MEDIUM: mux-h2: do not close the connection on aborted streams We used to rely on a hint that a shutw() or shutr() without data is an indication that the upper layer had performed a tcp-request content reject and really wanted to kill the connection, but sadly there is another situation where this happens, which is failed keep-alive request to a server. In this case the upper layer stream silently closes to let the client retry. In our case this had the side effect of killing all the connection. Instead of relying on such hints, let's address the problem differently and rely on information passed by the upper layers about the intent to kill the connection. During shutr/shutw, this is detected because the flag CS_FL_KILL_CONN is set on the connstream. Then only in this case we send a GOAWAY(ENHANCE_YOUR_CALM), otherwise we only send the reset. This makes sure that failed backend requests only fail frontend requests and not the whole connections anymore. This fix relies on the two previous patches adding SI_FL_KILL_CONN and CS_FL_KILL_CONN as well as the fix for the connection close, and it must be backported to 1.9 and 1.8, though the code in 1.8 could slightly differ (cs is always valid) : BUG/MEDIUM: mux-h2: wait for the mux buffer to be empty before closing the connection MINOR: stream-int: add a new flag to mention that we want the connection to be killed MINOR: connstream: have a new flag CS_FL_KILL_CONN to kill a connection	2019-01-31 19:38:25 +01:00
Willy Tarreau	51d0a7e54c	MINOR: connstream: have a new flag CS_FL_KILL_CONN to kill a connection This is the equivalent of SI_FL_KILL_CONN but for the connstreams. It will be set by the stream-interface during the various shutdown operations.	2019-01-31 19:38:25 +01:00
Willy Tarreau	0f9cd7b196	MINOR: stream-int: add a new flag to mention that we want the connection to be killed The new flag SI_FL_KILL_CONN is now set by the rare actions which deliberately want the whole connection (and not just the stream) to be killed. This is only used for "tcp-request content reject", "tcp-response content reject", "tcp-response content close" and "http-request reject". The purpose is to desambiguate the close from a regular shutdown. This will be used by the next patches.	2019-01-31 19:38:25 +01:00
Willy Tarreau	4dbda620f2	BUG/MEDIUM: mux-h2: wait for the mux buffer to be empty before closing the connection When finishing to respond on a stream, a shutw() is called (resulting in either an end of stream or RST), then h2_detach() is called, and may decide to kill the connection is a number of conditions are satisfied. Actually one of these conditions is that a GOAWAY frame was already sent or attempted to be sent. This one is wrong, because it can happen in at least these two situations : - a shutw() sends a GOAWAY to obey tcp-request content reject - a graceful shutdown is pending In both cases, the connection will be aborted with the mux buffer holding some data. In case of a strong abort the client will not see the GOAWAY or RST and might want to try again, which is counter-productive. In case of the graceful shutdown, it could result in truncated data. It looks like a valid candidate for the issue reported here : https://www.mail-archive.com/haproxy@formilux.org/msg32433.html A backport to 1.9 and 1.8 is necessary.	2019-01-31 19:38:25 +01:00
Willy Tarreau	28e581b21c	BUG/MINOR: stream: don't close the front connection when facing a backend error In 1.5-dev13, a bug was introduced by commit `e3224e870` ("BUG/MINOR: session: ensure that we don't retry connection if some data were sent"). If a connection error is reported after some data were sent (and lost), we used to accidently mark the front connection as being in error instead of only the back one because the two direction flags were applied to the same channel. This case is extremely rare with raw connections but can happen a bit more often with multiplexed streams. This will result in the error not being correctly reported to the client. This patch can be backported to all supported versions.	2019-01-31 19:38:25 +01:00
Olivier Houchard	8788b4111c	BUG/MEDIUM: connections: Don't forget to remove CO_FL_SESS_IDLE. If we're adding a connection to the server orphan idle list, don't forget to remove the CO_FL_SESS_IDLE flag, or we will assume later it's still attached to a session. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Christopher Faulet	3949c9d90d	BUG/MEDIUM: mux-h1: Don't add "transfer-encoding" if message-body is forbidden When a HTTP/1.1 or above response is emitted to a client, if the flag H1_MF_XFER_LEN is set whereas H1_MF_CLEN and H1_MF_CHNK are not, the header "transfer-encoding" is added. It is a way to make HTX chunking consistent with H2. But we must exclude all cases where the message-body is explicitly forbidden by the RFC: * for all 1XX, 204 and 304 responses * for any responses to HEAD requests * for 200 responses to CONNECT requests For these 3 cases, the flag H1_MF_XFER_LEN is set but H1_MF_CLEN and H1_MF_CHNK not. And the header "transfer-encoding" must not be added. See issue #27 on github for details about the bug. This patch must be backported in 1.9.	2019-01-31 11:07:17 +01:00
Fr�d�ric L�caille	04636b7bac	BUG/MEDIUM: peers: Peer addresses parsing broken. This bug was introduced by `355b203` commit which prevented the peer addresses to be parsed for the local peer of a "peers" section. When adding "parse_addr" boolean parameter to parse_server(), this commit missed the case where the syntax with "peer" keyword should still be supported in addition to the new syntax with "server"+"bind" keyword. May be backported as fas as 1.5.	2019-01-31 09:56:39 +01:00
Willy Tarreau	a9b7796862	MINOR: mux-h2: consistently rely on the htx variable to detect the mode In h2_frt_transfer_data(), we support both HTX and legacy modes. The HTX mode is detected from the proxy option and sets a valid pointer into the htx variable. Better rely on this variable in all the function rather than testing the option again. This way the code is clearer and even the compiler knows this pointer is valid when it's dereferenced. This should be backported to 1.9 if the b_is_null() patch is backported.	2019-01-31 08:07:17 +01:00
Willy Tarreau	1f035507af	BUG/MINOR: mux-h2: make sure request trailers on aborted streams don't break the connection We used to respond a connection error in case we received a trailers frame on a closed stream, but it's a problem to do this if the error was caused by a reset because the sender has not yet received it and is just a victim of the timing. Thus we must not close the connection in this case. This patch may be backported to 1.9 but then it requires the following previous ones : MINOR: h2: add a generic frame checker MEDIUM: mux-h2: check the frame validity before considering the stream state CLEANUP: mux-h2: remove stream ID and frame length checks from the frame parsers	2019-01-30 19:37:20 +01:00
Willy Tarreau	b860c73756	CLEANUP: mux-h2: remove stream ID and frame length checks from the frame parsers It's not convenient to have such structural checks mixed with the ones related to the stream state. Let's remove all these basic tests that are already covered once for all when reading the frame header.	2019-01-30 19:37:20 +01:00
Willy Tarreau	54f46e53dd	MEDIUM: mux-h2: check the frame validity before considering the stream state There are some uneasy situation where it's difficult to validate a frame's format without being in an appropriate state. This patch makes sure that each frame passes through h2_frame_check() before being checked in the context of the stream's state. This makes sure we can always return a GOAWAY for protocol violations even if we can't process the frame.	2019-01-30 19:37:20 +01:00
Willy Tarreau	9c84d8299a	MINOR: h2: add a generic frame checker The new function h2_frame_check() checks the protocol limits for the received frame (length, ID, direction) and returns a verdict made of a connection error code. The purpose is to be able to validate any frame regardless of the state and the ability to call the frame handler, and to emit a GOAWAY early in this case.	2019-01-30 19:37:20 +01:00
Willy Tarreau	08bb1d6109	BUG/MINOR: mux-h2: make sure response HEADERS are not received in other states than OPEN and HLOC RFC7540#5.1 states that these are the only states allowing any frame type. For response HEADERS frames, we cannot accept that they are delivered on idle streams of course, so we're left with these two states only. It is important to test this so that we can remove the generic CLOSE_STREAM test for such frames in the main loop. This must be backported to 1.9 (1.8 doesn't have response HEADERS).	2019-01-30 19:37:14 +01:00
Willy Tarreau	8d9ac3ed8b	BUG/MEDIUM: mux-h2: do not abort HEADERS frame before decoding them If a response HEADERS frame arrives on a closed connection (due to a client abort sending an RST_STREAM), it's currently immediately rejected with an RST_STREAM, like any other frame. This is incorrect, as HEADERS frames must first be decoded to keep the HPACK decoder synchronized, possibly breaking subsequent responses. This patch excludes HEADERS/CONTINUATION/PUSH_PROMISE frames from the central closed state test and leaves to the respective frame parsers the responsibility to decode the frame then send RST_STREAM. This fix must be backported to 1.9. 1.8 is not directly impacted since it doesn't have response HEADERS nor trailers thus cannot recover from such situations anyway.	2019-01-30 19:36:21 +01:00
Willy Tarreau	24ff1f8341	BUG/MEDIUM: mux-h2: make sure never to send GOAWAY on too old streams The H2 spec requires to send GOAWAY when the client sends a frame after it has already closed using END_STREAM. Here the corresponding case was the fallback of a series of tests on the stream state, but it unfortunately also catches old closed streams which we don't know anymore. Thus any late packet after we've sent an RST_STREAM will trigger this GOAWAY and break other streams on the connection. This can happen when launching two tabs in a browser targetting the same slow page through an H2-to-H2 proxy, and pressing Escape to stop one of them. The other one gets an error when the page finally responds (and it generally retries), and the logs in the middle indicate SD-- flags since the late response was cancelled. This patch takes care to only send GOAWAY on streams we still know. It must be backported to 1.9 and 1.8.	2019-01-30 19:35:42 +01:00
Willy Tarreau	fc10f599cc	BUG/MEDIUM: mux-h2: fix two half-closed to closed transitions When receiving a HEADERS or DATA frame with END_STREAM set, we would inconditionally switch to half-closed(remote). This is wrong because we could already have been in half-closed(local) and need to switch to closed. This happens in the following situations : - receipt of the end of a client upload after we've already responded (e.g. redirects to POST requests) - receipt of a response on the backend side after we've already finished sending the request (most common case). This may possibly have caused some streams to stay longer than needed at the end of a transfer, though this is not apparent in tests. This must be backported to 1.9 and 1.8.	2019-01-30 19:34:40 +01:00
Willy Tarreau	b1c9edc579	BUG/MEDIUM: mux-h2: wake up flow-controlled streams on initial window update When a settings frame updates the initial window, all affected streams's window is updated as well. However the streams are not put back into the send list if they were already blocked on flow control. The effect is that such a stream will only be woken up by a WINDOW_UPDATE message but not by a SETTINGS changing the initial window size. This can be verified with h2spec's test http2/6.9.2/1 which occasionally fails without this patch. It is unclear whether this situation is really met in field, but the fix is trivial, it consists in adding each unblocked streams to the wait list as is done for the window updates. This fix must be backported to 1.9. For 1.8 the patch needs quite a few adaptations. It's better to copy-paste the code block from h2c_handle_window_update() adding the stream to the send_list when its mws is > 0.	2019-01-30 16:21:39 +01:00
Willy Tarreau	6432dc8783	CLEANUP: mux-h2: remove misleading leftover test on h2s' nullity The WINDOW_UPDATE and DATA frame handlers used to still have a check on h2s to return either h2s_error() or h2c_error(). This is a leftover from the early code. The h2s cannot be null there anymore as it has already been dereferenced before reaching these locations.	2019-01-30 15:45:02 +01:00
Kevin Zhu	13ebef7ecb	BUG/MINOR: deinit: tcp_rep.inspect_rules not deinit, add to deinit It seems like this can be backported as far as 1.5.	2019-01-30 10:22:34 +01:00
Tim Duesterhus	b229f018ee	BUG/MEDIUM: compression: Rewrite strong ETags RFC 7232 section 2.3.3 states: > Note: Content codings are a property of the representation data, > so a strong entity-tag for a content-encoded representation has to > be distinct from the entity tag of an unencoded representation to > prevent potential conflicts during cache updates and range > requests. In contrast, transfer codings (Section 4 of [RFC7230]) > apply only during message transfer and do not result in distinct > entity-tags. Thus a strong ETag must be changed when compressing. Usually this is done by converting it into a weak ETag, which represents a semantically, but not byte-by-byte identical response. A conversion to a weak ETag still allows If-None-Match to work. This should be backported to 1.9 and might be backported to every supported branch with compression.	2019-01-29 20:26:06 +01:00
Olivier Houchard	7493114729	BUG/MEDIUM: servers: Close the connection if we failed to install the mux. If we failed to install the mux, just close the connection, or conn_fd_handler() will be called for the FD, and crash as soon as it attempts to access the mux' wake method. This should be backported to 1.9.	2019-01-29 19:53:09 +01:00
Olivier Houchard	ef60ff38fb	BUG/MEDIUM: peers: Handle mux creation failure. If the mux fails to properly be created by conn_install_mux, fail, instead of silently ignoring it. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Olivier Houchard	2b09443e04	BUG/MEDIUM: h2: In h2_send(), stop the loop if we failed to alloc a buf. In h2_send(), make sure we break the loop if we failed to alloc a buffer, or we'd end up looping endlessly. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Olivier Houchard	a48437bb5e	BUG/MEDIUM: checks: Don't try to set ALPN if connection failed. If we failed to connect, don't attempt to set the ALPN, as we don't have a SSL context, anyway. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Olivier Houchard	26da323cb9	BUG/MEDIUM: servers: Don't add an incomplete conn to the server idle list. If we failed to add the connection to the session, only try to add it back to the server idle list if it has a mux, otherwise the connection is incomplete and unusable. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Olivier Houchard	4dc85538ba	BUG/MEDIUM: servers: Only destroy a conn_stream we just allocated. In connect_server(), if we failed to add the connection to the session, only destroy the conn_stream if we just allocated it, otherwise it may have been allocated outside connect_server(), along with a connection which has its destination address set. Also use si_release_endpoint() instead of cs_destroy(), to make sure the stream_interface doesn't reference it anymore. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Olivier Houchard	f67be93ae0	BUG/MEDIUM: checks: Check that conn_install_mux succeeded. If conn_install_mux failed, then the connection has no mux and won't be usable, so just give up is on failure instead of ignoring it. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Willy Tarreau	f1e6fa35de	CLEANUP: mux-h2: remove two useless but misleading assignments h2c->st0 was assigned to H2_CS_ERROR right after returning from h2c_error(), which had already done it. It's useless and confusing, let's remove this.	2019-01-29 18:51:41 +01:00
Willy Tarreau	3ad5d31bdf	BUG/MEDIUM: mux-h2: only close connection on request frames on closed streams A subtle bug was introduced with H2 on the backend. RFC7540 states that an attempt to create a stream on an ID not higher than the max known is a connection error. This was translated into rejecting HEADERS frames for closed streams. But with H2 on the backend, if the client aborts and causes an RST_STREAM to be emitted, the stream is effectively closed, and if/once the server responds, it starts by emitting a HEADERS frame with this ID thus it is interpreted as a connection error. This test must of course consider the side the mux is installed on and not take this for a connection error on responses. The effect is that an aborted stream on an outgoing H2 connection, for example due to a client stopping a transfer with option abortonclose set, would lead to an abort of all other streams. In the logs, this appears as one or several CD-- line(s) followed by one or several SD-- lines which are victims. Thanks to Luke Seelenbinder for reporting this problem and providing enough elements to help understanding how to reproduce it. This fix must be backported to 1.9.	2019-01-29 18:49:27 +01:00
Willy Tarreau	6254a9257e	BUILD/MINOR: peers: shut up a build warning introduced during last cleanup A new warning appears when building at -O0 since commit `3f0fb9df6` ("MINOR: peers: move "hello" message treatment code to reduce the size of the I/O handler."), it is related to the fact that proto_len is initialized from strlen() which is not a constant. Let's replace it with sizeof-1 instead and also mark the variable as static since it's useless outside of the file.	2019-01-29 17:45:23 +01:00
Willy Tarreau	6f731f33ac	CLEANUP: peers: factor error handling in peer_treat_definedmsg() This is a trivial code refactoring of similar parsing error code under a single label.	2019-01-29 11:11:23 +01:00
Willy Tarreau	1e82a14c34	CLEANUP: peers: factor the error handling code in peer_treet_updatemsg() The error handling code was extremely repetitive and error-prone due to the numerous copy-pastes, some involving unlocks or free. Let's factor this out. The code could be further simplified, but 12 locations were already cleaned without taking risks.	2019-01-29 11:08:06 +01:00
Fr�d�ric L�caille	4b2fd9bf71	MINOR: peers: move peer initializations code to reduce the size of the I/O handler. Implements two new functions to init peer flags and other stuff after having accepted or connected them with the peer I/O handler so that to reduce its size. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	3f0fb9df6c	MINOR: peers: move "hello" message treatment code to reduce the size of the I/O handler. This patch implements three functions to read and parse the three line of a "hello" peer protocol message so that to call them from the peer I/O handler and reduce its size. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	be825e5c05	CLEANUP: peers: Remove useless statements. When implementing peer_recv_msg() we added the statements reached with a "goto imcomplete" at the end of this function. This statements are executed only when co_getblk() returns something <0. So they are useless for now on, and may be safely removed. The following section wich was responsible of sending any peer protocol messages were reached only when co_getblk() returned 0 (no more message to read). In this case we replace the "goto impcomplete" statement by a "goto send_msgs" to reach this only when peer_recv_msg() returns 0. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	25e1d5e435	MINOR: peers: move send code to reduce the size of the I/O handler. This patch extracts the code responsible of sending peer protocol messages from the peer I/O handler to create a new function and to reduce the size of this handler. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	444243c62c	MINOR: peers: move messages treatment code to reduce the size of the I/O handler. Extract the code of the peer I/O handler responsible of treating any peer protocol message to create peer_treat_awaited_msg() function. Also rename peer_recv_updatemsg() to peer_treat_updatemsg() as this function only parse a stick-table update message already received by peer_recv_msg(). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	7d0ceeec80	MINOR: peers: move error handling to reduce the size of the I/O handler. Implement new functions to send error and control class stick-table messages. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	d5fe14bb96	CLEANUP: peers: Be more generic. Make usage of a C union to pass parameters to all the peer_prepare_*() functions (more readable). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	95203f2185	MINOR: peers: Move high level receive code to reduce the size of I/O handler. Implement a new function to read incoming stick-table messages. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	d27b09400c	MINOR: peers: Move ack, switch and definition receive code to reduce the size of the I/O handler. Implement three new functions to treat peer acks, switch and definition messages extracting the code from the big swich-case of the peer I/O handler to give more chances to this latter to be readable. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	168a34b45f	MINOR: peers: Move update receive code to reduce the size of the I/O handler. This patch implements a new function to treat the stick-table update messages so that to reduce the size of the peer I/O handler by ~200 lines. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	6a8303d49e	MEDIUM: peers: synchronizaiton code factorization to reduce the size of the I/O handler. Factorize the code responsible of synchronizing the peers upon startup. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	87f554c9fb	MINOR: peers: Add new functions to send code and reduce the I/O handler. This patch reduces the size of the peer I/O handler implementing a new function named peer_send_updatemsg() which uses the already implement peer_prepare_updatemsg(), then ci_putblk(). Reuse the code used to implement peer_send_(ack\|swith)msg() function especially the more generic function peer_send_msg(). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	ec44ea8692	MINOR: peers: send code factorization. Implements peer_send_msg() functions for switch and ack messages which call the already defined peer_prepare_msg() before calling ci_putblk(). These two new functions are used at three places in the peer_io_handler(). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	a8725ec372	CLEANUP: peers: Indentation fixes. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	ce02557aad	MINOR: peers: Extract some code to be reused. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Willy Tarreau	d822013f45	BUG/MEDIUM: backend: always call si_detach_endpoint() on async connection failure In case an asynchronous connection (ALPN) succeeds but the mux fails to attach, we must release the stream interface's endpoint, otherwise we leave the stream interface with an endpoint pointing to a freed connection with si_ops == si_conn_ops, and sess_update_st_cer() calls si_shutw() on it, causing it to crash. This must be backported to 1.9 only.	2019-01-28 16:33:35 +01:00
Olivier Houchard	9ef5155ba6	BUG/MEDIUM: servers: Attempt to reuse an unfinished connection on retry. In connect_server(), if the previous connection failed, but had an alpn, no mux was created, and thus the stream_interface's endpoint would be the connection. In this case, instead of forgetting about it, and overriding the stream_interface's endpoint later, try to reuse the connection, or the connection will still be in the session's connection list, and will reference to a stream that was probably destroyed. This should be backported to 1.9.	2019-01-28 16:33:31 +01:00
Miroslav Zagorac	6b3690bc6a	BUG/MINOR: spoe: corrected fragmentation string size This patch must be backported to 1.9 and 1.8.	2019-01-28 13:45:09 +01:00
Willy Tarreau	6afec46ba3	BUG/MINOR: mux-h2: do not report available outgoing streams after GOAWAY The calculation of available outgoing H2 streams was improved by commit `d64a3ebe6` ("BUG/MINOR: mux-h2: always check the stream ID limit in h2_avail_streams()"), but it still is incorrect because RFC7540#6.8 specifically forbids the creation of new streams after a GOAWAY frame was received. Thus we must not mark the connection as available anymore in order to be able to handle a graceful shutdown. This needs to be backported to 1.9.	2019-01-28 06:44:53 +01:00
Willy Tarreau	888d5678f7	BUG/MINOR: listener: always fill the source address for accepted socketpairs The source address was not set but passed down the chain to the upper layer's accept() calls. Let's initialize it like other UNIX sockets in this case. At the moment it should not have any impact since socketpairs are only usable for the master CLI. This should be backported to 1.9.	2019-01-27 21:48:29 +01:00
Willy Tarreau	f5809cde7a	MINOR: threads: make MAX_THREADS configurable at build time There's some value in being able to limit MAX_THREADS, either to save precious resources in embedded environments, or to protect certain deployments against accidently incorrect settings. With this patch, if MAX_THREADS is defined at build time, it will be used. However, given that LONGBITS is not a macro but is defined according to sizeof(long), we can't check the value range at build time and instead we need to perform the check at early boot time. However, the compiler is able to optimize away the constant comparisons and doesn't even emit the check code when values are correct. The output message regarding threading support was improved to report the number of threads.	2019-01-26 13:37:48 +01:00
Willy Tarreau	c9a82e48bf	MINOR: cfgparse: make the process/thread parser support a maximum value It was hard-wired to LONGBITS, let's make it configurable depending on the context (threads, processes).	2019-01-26 13:25:14 +01:00
Tim Duesterhus	4707033932	CLEANUP: h2: Remove debug printf in mux_h2.c It was introduced by `1915ca2738` and should be backported to 1.9.	2019-01-25 05:22:07 +01:00
Willy Tarreau	1915ca2738	BUG/MINOR: mux-h2: always compare content-length to the sum of DATA frames This is mandated by RFC7541#8.1.2.6. Till now we didn't have a copy of the content-length header field. But now that it's already parsed, it's easy to add the check. The reg-test was updated to match the new behaviour as the previous one expected unadvertised data to be silently discarded. This should be backported to 1.9 along with previous patch (MEDIUM: h2: always parse and deduplicate the content-length header) after it has got a bit more exposure.	2019-01-24 19:45:27 +01:00
Willy Tarreau	4790f7c907	MEDIUM: h2: always parse and deduplicate the content-length header The header used to be parsed only in HTX but not in legacy. And even in HTX mode, the value was dropped. Let's always parse it and report the parsed value back so that we'll be able to store it in the streams.	2019-01-24 19:07:26 +01:00
Willy Tarreau	f7a259d46f	MINOR: stream: don't wait before retrying after a failed connection reuse When a connection reuse fails, we must not wait before retrying, as most likely the issue is related to the reused connection and not to the server itself. This should be backported to 1.9, though it depends on previous patches dealing with SI_ST_CON for connection reuse.	2019-01-24 19:06:43 +01:00
Willy Tarreau	bf66bd1b8b	MEDIUM: stream-int: always mark pending outgoing SI_ST_CON Before the first send() attempt, we should be in SI_ST_CON, not SI_ST_EST, since we have not yet attempted to send and we are allowed to retry. This is particularly important with complex outgoing muxes which can fail during the first send attempt (e.g. failed stream ID allocation). It only requires that sess_update_st_con_tcp() knows about this possibility, as we must not forcefully close a reused connection when facing an error in this case, this will be handled later. This may be backported to 1.9 with care after some observation period.	2019-01-24 19:06:43 +01:00
Willy Tarreau	e9634bdc22	MINOR: mux-h2: always consider a server's max-reuse parameter This parameter allows to limit the number of successive requests sent on a connection. Let's compare it to the number of streams already sent on the connection to decide if the connection may still appear in the idle list or not. This may be used to help certain servers work around resource leaks, and also helps dealing with the issue of the GOAWAY in flight which requires to set a usage limit on the client to be reliable. This must be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	9c538e01c2	MINOR: server: add a max-reuse parameter Some servers may wish to limit the total number of requests they execute over a connection because some of their components might leak resources. In HTTP/1 it was easy, they just had to emit a "connection: close" header field with the last response. In HTTP/2, it's less easy because the info is not always shared with the component dealing with the H2 protocol and it could be harder to advertise a GOAWAY with a stream limit. This patch provides a solution to this by adding a new "max-reuse" parameter to the server keyword. This parameter indicates how many times an idle connection may be reused for new requests. The information is made available and the underlying muxes will be able to use it at will. This patch should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	2c7deddc06	BUG/MEDIUM: backend: never try to attach to a mux having no more stream available The code dealing with idle connections used to check the number of streams available on the connection only to unlink the connection from the idle list. But this still resulted in too many streams reusing the same connection when they were already attached to it. We must detect that there is no more room and refrain from using this connection at all, and instead fall back to the no-reuse case. Ideally we should try to search among other idle connections, but for a backport let's stay safe. This must be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	a80dca8535	BUG/MINOR: mux-h2: refuse to allocate a stream with too high an ID One of the reasons for the excessive number of aborted requests when a server sets a limit on the highest stream ID is that we don't check this limit while allocating a new stream. This patch does this at two locations : - when a backend stream is allocated, we verify that there are still IDs left ; - when the ID is assigned, we verify that it's not higher than the advertised limit. This should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	d64a3ebe64	BUG/MINOR: mux-h2: always check the stream ID limit in h2_avail_streams() This function is used to decide whether to put an idle connection back into the idle pool. While it considers the limit in number of concurrent requests, it does not consider the limit in number of streams, so if a server announces a low limit in a GOAWAY frame, it will be ignored. However there is a caveat : since we assign the stream IDs when sending them, we have a number of allocated streams which max_id doesn't take care of. This can be addressed by adding a new nb_reserved count on each connection to keep track of the ID-less streams. This patch makes sure we take care of the remaining number of streams if such a limit was announced, or of the number of streams before the highest ID. Now it is possible to accurately know how many streams can be allocated, and the number of failed outgoing streams has dropped in half. This must be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	15c120d251	CLEANUP: server: fix indentation mess on idle connections Apparently some code was moved around leaving the inner block incorrectly indented and with the closing brace in the middle of nowhere.	2019-01-24 19:06:43 +01:00
Willy Tarreau	64f6945fec	BUG/MINOR: stream: take care of synchronous errors when trying to send We currently detect a number of situations where we have to immediately deal with a state change, but we failed to consider the case of the synchronous error reported on the stream-interface. We definitely do not want to have to wait for a timeout to handle this one, especially at the beginning of the connection when it can lead to an immediate retry. This should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	cb923d5001	MINOR: server: make sure pool-max-conn is >= -1 The keyword parser doesn't check the value range, but supported values are -1 and positive values, thus we should check it. This can be backported to 1.9.	2019-01-24 16:31:56 +01:00
Willy Tarreau	1e7d444eec	BUG/MINOR: hpack: return a compression error on invalid table size updates RFC7541#6.3 mandates that an error is reported when a dynamic table size update announces a size larger than the one configured with settings. This is tested by h2spec using test "hpack/6.3/1". This must be backported to 1.9 and possibly 1.8 as well.	2019-01-24 15:27:06 +01:00
Willy Tarreau	175cebb38a	BUG/MINOR: mux-h2: make it possible to set the error code on an already closed stream When sending RST_STREAM in response to a frame delivered on an already closed stream, we used not to be able to update the error code and deliver an RST_STREAM with a wrong code (e.g. H2_ERR_CANCEL). Let's always allow to update the code so that RST_STREAM is always sent with the appropriate error code (most often H2_ERR_STREAM_CLOSED). This should be backported to 1.9 and possibly to 1.8.	2019-01-24 15:27:06 +01:00
Willy Tarreau	5b4eae33de	BUG/MINOR: mux-h2: headers-type frames in HREM are always a connection error There are incompatible MUST statements in the HTTP/2 specification. Some require a stream error and others a connection error for the same situation. As discussed in the thread below, let's always apply the connection error when relevant (headers-like frame in half-closed(remote)) : https://mailarchive.ietf.org/arch/msg/httpbisa/pOIWRBRBdQrw5TDHODZXp8iblcE This must be backported to 1.9, possibly to 1.8 as well.	2019-01-24 15:27:06 +01:00
Willy Tarreau	113c7a2794	BUG/MINOR: mux-h2: CONTINUATION in closed state must always return GOAWAY Since we now support CONTINUATION frames, we must take care of properly aborting the connection when they are sent on a closed stream. By default we'd get a stream error which is not sufficient since the compression context is modified and unrecoverable. More info in this discussion : https://mailarchive.ietf.org/arch/msg/httpbisa/azZ1jiOkvM3xrpH4jX-Q72KoH00 This needs to be backported to 1.9 and possibly to 1.8 (less important there).	2019-01-24 15:27:06 +01:00
Willy Tarreau	31e846a071	BUG/MEDIUM: mux-h2: properly abort on trailers decoding errors There was an incomplete test in h2c_frt_handle_headers() resulting in negative return values from h2c_decode_headers() not being taken as errors. The effect is that the stream is then aborted on timeout only. This fix must be backported to 1.9.	2019-01-24 15:27:06 +01:00
Willy Tarreau	5ce6337254	BUG/MEDIUM: backend: also remove from idle list muxes that have no more room The current test consists in removing muxes which report that they're going to assign their last available stream, but a mux may already be saturated without having passed in this situation at all. This is what happens with mux_h2 when receiving a GOAWAY frame informing the mux about the ID of the last stream the other end is willing to process. The limit suddenly changes from near infinite to 0. Currently what happens is that such a mux remains in the idle list for a long time and refuses all new streams. Now at least it will only fail a single stream in a retryable way. A future improvement should consist in trying to pick another connection from the idle list. This fix must be backported to 1.9.	2019-01-24 13:53:06 +01:00
Willy Tarreau	759ca1eacc	BUG/MAJOR: mux-h2: don't destroy the stream on failed allocation in h2_snd_buf() In case we cannot allocate a stream ID for an outgoing stream, the stream will be aborted. The problem is that we also release it and it will be destroyed again by the application detecting the error, leading to a NULL dereference in h2_shutr() and h2_shutw(). Let's only mark the error on the CS and let the rest of the code handle the close. This should be backported to 1.9.	2019-01-24 13:52:10 +01:00
Willy Tarreau	b57af617c0	BUG/MINOR: mux-h1: avoid copying output over itself in zero-copy It's almost funny but one side effect of the latest zero-copy changes made to mux-h1 resulted in the temporary buffer being copied over itself at the exact same location. This has no impact except slowing down operations and irritating valgrind. The cause is an incorrect pointer check after the alignment optimizations were made. This needs to be backported to 1.9. Reported-by: Tim Duesterhus <tim@bastelstu.be>	2019-01-23 20:43:53 +01:00
Christopher Faulet	afe57846bf	BUG/MINOR: mux-h1: Apply the reserve on the channel's buffer only There is no reason to use the reserve to limit the amount of data of the input buffer that we can parse at a time. The only important thing is to keep the reserve free in the channel's buffer. This patch should be backported to 1.9.	2019-01-23 11:27:34 +01:00
Christopher Faulet	a413e958fd	BUG/MEDIUM: mux-h2/htx: Respect the channel's reserve When data are pushed in the channel's buffer, in h2_rcv_buf(), the mux-h2 must respect the reserve if the flag CO_RFL_KEEP_RSV is set. In HTX, because the stream-interface always sees the buffer as full, there is no other way to know the reserve must be respected. This patch must be backported to 1.9.	2019-01-23 11:27:34 +01:00
Christopher Faulet	dcd8c5eed4	BUG/MINOR: proto-htx: Return an error if all headers cannot be received at once When an HTX stream is waiting for a request or a response, it reports an error (400 for the request or 502 for the response) if a parsing error is reported by the mux (HTX_FL_PARSING_ERROR). The mux-h1 uses this error, among other things, when the headers are too big to be analyzed at once. But the mux-h2 doesn't. So the stream must also report an error if the multiplexer is unable to emit all headers at once. The multiplexers must always emit all the headers at once otherwise it is an error. There are 2 ways to detect this error: * The end-of-headers marker was not received yet _AND_ the HTX message is not empty. * The end-of-headers marker was not received yet _AND_ the multiplexer have some data to emit but it is waiting for more space in the channel's buffer. Note the mux-h2 is buggy for now when HTX is enabled. It does not respect the reserve. So there is no way to hit this bug. This patch must be backported to 1.9.	2019-01-23 11:27:34 +01:00
Dirkjan Bussink	526894ff39	BUG/MEDIUM: ssl: Fix handling of TLS 1.3 KeyUpdate messages In OpenSSL 1.1.1 TLS 1.3 KeyUpdate messages will trigger the callback that is used to verify renegotiation is disabled. This means that these KeyUpdate messages fail. In OpenSSL 1.1.1 a better mechanism is available with the SSL_OP_NO_RENEGOTIATION flag that disables any TLS 1.2 and earlier negotiation. So if this SSL_OP_NO_RENEGOTIATION flag is available, instead of having a manual check, trust OpenSSL and disable the check. This means that TLS 1.3 KeyUpdate messages will work properly. Reported-By: Adam Langley <agl@imperialviolet.org>	2019-01-23 09:51:22 +01:00
Christopher Faulet	774c486cec	BUG/MINOR: check: Wake the check task if the check is finished in wake_srv_chk() With tcp-check, the result of the check is set by the function tcpcheck_main() from the I/O layer. So it is important to wake up the check task to handle the result and finish the check. Otherwise, we will wait the task timeout to handle the result of a tcp-check, delaying the next check by as much. This patch also fixes a problem about email alerts reported by PiBa-NL (Pieter) on the ML [1] on all versions since the 1.6. So this patch must be backported from 1.9 to 1.6. [1] https://www.mail-archive.com/haproxy@formilux.org/msg32190.html	2019-01-22 07:01:15 +01:00
J�r�me Magnin	f57afa453a	BUG/MINOR: server: don't always trust srv_check_health when loading a server state When we load health values from a server state file, make sure what we assign to srv->check.health actually matches the state we restore. This should be backported as far as 1.6.	2019-01-21 11:09:03 +01:00
Willy Tarreau	1ba32032ef	BUG/MEDIUM: checks: fix recent regression on agent-check making it crash In order to address the mailers issues, we needed to store the proxy into the checks struct, which was done by commit `c98aa1f18` ("MINOR: checks: Store the proxy in checks."). However this one did it only for the health checks and not for the agent checks, resulting in an immediate crash when the agent is enabled on a random config like this one : listen agent bind :8000 server s1 255.255.255.255:1 agent-check agent-port 1 Thanks to Seri Kim for reporting it and providing a reproducer in issue #20. This fix must be backported to 1.9.	2019-01-21 07:48:26 +01:00
Uman Shahzad	da7eeedf38	BUG/MINOR: startup: certain goto paths in init_pollers fail to free If we fail to initialize pollers due to fdtab/fdinfo/polled_mask not getting allocated, we free any of those that were allocated and exit. However the ordering was incorrect, and there was an old unused and unreachable "fail_cache" path as well, which needs to be taken when no poller works. This was introduced with this commit during 1.9-dev : `cb92f5c` ("MINOR: pollers: move polled_mask outside of struct fdtab.") It needs to be backported to 1.9 only.	2019-01-21 04:48:48 +01:00
Fr�d�ric L�caille	355b2033ec	MINOR: cfgparse: SSL/TLS binding in "peers" sections. Make "bind" keywork be supported in "peers" sections. All "bind" settings are supported on this line. Add "default-bind" option to parse the binding options excepted the bind address. Do not parse anymore the bind address for local peers on "server" lines. Do not use anymore list_for_each_entry() to set the "peers" section listener parameters because there is only one listener by "peers" section. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	1055e687a2	MINOR: peers: Make outgoing connection to SSL/TLS peers work. This patch adds pointer to a struct server to peer structure which is initialized after having parsed a remote "peer" line. After having parsed all peers section we run ->prepare_srv to initialize all SSL/TLS stuff of remote perr (or server). Remaining thing to do to completely support peer protocol over SSL/TLS: make "bind" keyword be supported in "peers" sections to make SSL/TLS incoming connections to local peers work. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	c06b5d4f74	MINOR: cfgparse: Make "peer" lines be parsed as "server" lines. With this patch "default-server" lines are supported in "peers" sections to setup the default settings of peers which are from now setup when parsing both "peer" and "server" lines. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	9492c4ecdb	MINOR: cfgparse: Simplication. Make init_peers_frontend() be callable without having to check if there is something to do or not. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	91694d51f7	MINOR: cfgparse: Rework peers frontend init. Even if not already the case, we suppose that the frontend "peers" section may have been already initialized outside of "peer" line, we seperate their initializations from their binding initializations. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	4ba5198899	MINOR: cfgparse: Useless frontend initialization in "peers" sections. Use ->local "peers" struct member to flag a "peers" section frontend has being initialized. This is to be able to initialize the frontend of "peers" sections on lines different from "peer" lines. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	16e491004b	CLEANUP: cfgparse: Code reindentation. May help the series of patches to be reviewed. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	6617e769bf	CLEANUP: cfgparse: Return asap from cfg_parse_peers(). Avoid useless code indentation. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	1825103fbe	MINOR: cfgparse: Extract some code to be re-used. Create init_peers_frontend() function to allocate and initialize the frontend of "peers" sections (->peers_fe) so that to reuse it later. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Olivier Houchard	f24502ba46	BUG/MEDIUM: connections: Add the CO_FL_CONNECTED flag if a send succeeded. If a send succeeded, add the CO_FL_CONNECTED flag, the send may have been called by the upper layers before we even realized we were connected, and we may even read the response before we get the information, and si_cs_recv() has to know if we were connected or not. This should be backported to 1.9.	2019-01-17 19:18:20 +01:00
Olivier Houchard	09a0f03994	BUG/MEDIUM: servers: Make assign_tproxy_address work when ALPN is set. If an ALPN is set on the server line, then when we reach assign_tproxy_address, the stream_interface's endpoint will be a connection, not a conn_stream, so make sure assign_tproxy_address() handles both cases. This should be backported to 1.9.	2019-01-17 19:18:20 +01:00
Christopher Faulet	ed7a066b45	BUG/MEDIUM: stats: Get the right scope pointer depending on HTX is used or not For HTX streams, the scope pointer is relative to the URI in the start-line. But for streams using the legacy HTTP representation, the scope pointer is relative to the beginning of output data in the channel's buffer. So we must be careful to use the right one depending on the HTX is used or not. Because the start-line is used to get de scope pointer, it is important to keep it after the parsing of post paramters. So now, instead of removing blocks when read in the function stats_process_http_post(), we just move on next, leaving it in the HTX message. Thanks to Pieter (PiBa-NL) to report this bug. This patch must be backported to 1.9.	2019-01-16 17:27:49 +01:00
Ben51Degrees	daa356bd7d	BUG: 51d: Changes to the buffer API in 1.9 were not applied to the 51Degrees code. The code using the deprecated 'buf->p' has been updated to use 'ci_head(buf)' as described in section 5 of 'doc/internals/buffer-api.txt'. A compile time switch on 'BUF_NULL' has also been added to enable the same source code to be used with pre and post API change HAProxy. This should be backported to 1.9, and is compatible with previous versions.	2019-01-16 17:26:14 +01:00
David Carlier	f8f8ddf3af	BUILD/MEDIUM: da: Necessary code changes for new buffer API. The most significant change from 1.8 to >=1.9 is the buffer data structure, using the new field and fixing along side a little hidden compilation warning. This must be backported to 1.9.	2019-01-15 15:07:30 +01:00
Willy Tarreau	21c741a665	MINOR: backend: make the random algorithm support a number of draws When an argument <draws> is present, it must be an integer value one or greater, indicating the number of draws before selecting the least loaded of these servers. It was indeed demonstrated that picking the least loaded of two servers is enough to significantly improve the fairness of the algorithm, by always avoiding to pick the most loaded server within a farm and getting rid of any bias that could be induced by the unfair distribution of the consistent list. Higher values N will take away N-1 of the highest loaded servers at the expense of performance. With very high values, the algorithm will converge towards the leastconn's result but much slower. The default value is 2, which generally shows very good distribution and performance. This algorithm is also known as the Power of Two Random Choices and is described here : http://www.eecs.harvard.edu/~michaelm/postscripts/handbook2001.pdf	2019-01-14 19:33:17 +01:00
Willy Tarreau	0cac26cd88	MEDIUM: backend: move all LB algo parameters into an union Since all of them are exclusive, let's move them to an union instead of eating memory with the sum of all of them. We're using a transparent union to limit the code changes. Doing so reduces the struct lbprm from 392 bytes to 372, and thanks to these changes, the struct proxy is now down to 6480 bytes vs 6624 before the changes (144 bytes saved per proxy).	2019-01-14 19:33:17 +01:00
Willy Tarreau	76e84f5091	MINOR: backend: move hash_balance_factor out of chash This one is a proxy option which can be inherited from defaults even if the LB algo changes. Move it out of the lb_chash struct so that we don't need to keep anything separate between these structs. This will allow us to merge them into an union later. It even takes less room now as it fills a hole and removes another one.	2019-01-14 19:33:17 +01:00
Willy Tarreau	a9a7249966	MINOR: backend: remap the balance uri settings to lbprm.arg_opt{1,2,3} The algo-specific settings move from the proxy to the LB algo this way : - uri_whole => arg_opt1 - uri_len_limit => arg_opt2 - uri_dirs_depth1 => arg_opt3	2019-01-14 19:33:17 +01:00
Willy Tarreau	9fed8586b5	MINOR: backend: make the header hash use arg_opt1 for use_domain_only This is only a boolean extra arg. Let's map it to arg_opt1 and remove hh_match_domain from struct proxy.	2019-01-14 19:33:17 +01:00
Willy Tarreau	20e68378f1	MINOR: backend: add new fields in lbprm to store more LB options Some algorithms require a few extra options (up to 3). Let's provide some room in lbprm to store them, and make sure they're passed from defaults to backends.	2019-01-14 19:33:17 +01:00
Willy Tarreau	484ff07691	MINOR: backend: make headers and RDP cookie also use arg_str/len These ones used to rely on separate variables called hh_name/hh_len but they are exclusive with the former. Let's use the same variable which becomes a generic argument name and length for the LB algorithm.	2019-01-14 19:33:17 +01:00
Willy Tarreau	4c03d1c9b6	MINOR: backend: move url_param_name/len to lbprm.arg_str/len This one is exclusively used by LB parameters, when using URL param hashing. Let's move it to the lbprm struct under a more generic name.	2019-01-14 19:33:17 +01:00
Willy Tarreau	6c30be52da	BUG/MINOR: backend: BE_LB_LKUP_CHTREE is a value, not a bit There are a few instances where the lookup algo is tested against BE_LB_LKUP_CHTREE using a binary "AND" operation while this macro is a value among a set, and not a bit. The test happens to work because the value is exactly 4 and no bit overlaps with the other possible values but this is a latent bug waiting for a new LB algo to appear to strike. At the moment the only other algo sharing a bit with it is the "first" algo which is never supported in the same code places. This fix should be backported to maintained versions for safety if it passes easily, otherwise it's not important as it will not fix any visible issue.	2019-01-14 19:33:17 +01:00
Willy Tarreau	602a499da5	BUG/MINOR: backend: balance uri specific options were lost across defaults The "balance uri" options "whole", "len" and "depth" were not properly inherited from the defaults sections. In addition, "whole" and "len" were not even reset when parsing "uri", meaning that 2 subsequent "balance uri" statements would not have the expected effect as the options from the first one would remain for the second one. This may be backported to all maintained versions.	2019-01-14 19:33:17 +01:00
Willy Tarreau	089eaa0ba7	BUG/MINOR: backend: don't use url_param_name as a hint for BE_LB_ALGO_PH At a few places in the code we used to rely on this variable to guess what LB algo was in place. This is wrong because if the defaults section presets "balance url_param foo" and a backend uses "balance roundrobin", these locations will still see this url_param_name set and consider it. The harm is limited, as this only causes the beginning of the request body to be buffered. And in general this is a bad practice which prevents us from cleaning the lbprm stuff. Let's explicitly check the LB algo instead. This may be backported to all currently maintained versions.	2019-01-14 19:33:17 +01:00
Emeric Brun	9e7547740c	MINOR: ssl: add support of aes256 bits ticket keys on file and cli. Openssl switched from aes128 to aes256 since may 2016 to compute tls ticket secrets used by default. But Haproxy still handled only 128 bits keys for both tls key file and CLI. This patch permit the user to set aes256 keys throught CLI or the key file (80 bytes encoded in base64) in the same way that aes128 keys were handled (48 bytes encoded in base64): - first 16 bytes for the key name - next 16/32 bytes for aes 128/256 key bits key - last 16/32 bytes for hmac 128/256 bits Both sizes are now supported (but keys from same file must be of the same size and can but updated via CLI only using a key of the same size). Note: This feature need the fix "dec func ignores padding for output size checking."	2019-01-14 19:32:58 +01:00
Emeric Brun	09852f70e0	BUG/MEDIUM: ssl: missing allocation failure checks loading tls key file This patch fixes missing allocation checks loading tls key file and avoid memory leak in some error cases. This patch should be backport on branches 1.9 and 1.8	2019-01-14 19:32:45 +01:00
Emeric Brun	ed697e4856	BUG/MINOR: base64: dec func ignores padding for output size checking Decode function returns an error even if the ouptut buffer is large enought because the padding was not considered. This case was never met with current code base.	2019-01-14 19:32:15 +01:00
Olivier Houchard	32d75ed300	BUG/MEDIUM: h1: Make sure we destroy an inactive connectin that did shutw. In h1_process(), if we have no associated stream, and the connection got a shutw, then destroy it, it is unusable and it may be our last chance to do so. This should be backported to 1.9.	2019-01-14 18:14:52 +01:00
Olivier Houchard	0923fa4200	BUG/MEDIUM: checks: Avoid having an associated server for email checks. When using a check to send email, avoid having an associated server, so that we don't modify the server state if we fail to send an email. Also revert back to initialize the check status to HCHK_STATUS_INI, now that set_server_check_status() stops early if there's no server, we shouldn't get in a mail loop anymore. This should be backported to 1.9.	2019-01-14 11:15:11 +01:00
Olivier Houchard	c98aa1f182	MINOR: checks: Store the proxy in checks. Instead of assuming we have a server, store the proxy directly in struct check, and use it instead of s->server. This should be a no-op for now, but will be useful later when we change mail checks to avoid having a server. This should be backported to 1.9.	2019-01-14 11:15:11 +01:00
Christopher Faulet	00292353a1	MINOR: spoe: Make the SPOE filter compatible with HTX proxies There is any specific HTTP processing in the SPOE. So there is no reason to not use it on HTX proxies. This patch may be backported to 1.9.	2019-01-14 10:52:28 +01:00
Willy Tarreau	c9036c0004	BUG/MAJOR: cache: fix confusion between zero and uninitialized cache key The cache uses the first 32 bits of the uri's hash as the key to reference the object in the cache. It makes a special case of the value zero to mean that the object is not in the cache anymore. The problem is that when an object hashes as zero, it's still inserted but the eb32_delete() call is skipped, resulting in the object still being chained in the memory area while the block has been reclaimed and used for something else. Then when objects which were chained below it (techically any object since zero is at the root) are deleted, the walk through the upper object may encounter corrupted values where valid pointers were expected. But while this should only happen statically once on 4 billion, the problem gets worse when the cache-use conditions don't match the cache-store ones, because cache-store runs with an uninitialized key, which can create objects that will never be found by the lookup code, or worse, entries with a zero key preventing eviction of the tree node and resulting in a crash. It's easy to accidently end up on such a config because the request rules generally can't be used to decide on the response : http-request cache-use cache if { path_beg /images } http-response cache-store cache In this test, mixing traffic with /images/$RANDOM and /foo/$RANDOM will result in random keys being inserted, some of them possibly being zero, and crashes will quickly happen. The fix consists in 1) always initializing the transaction's cache_hash to zero, and 2) never storing a response for which the hash has not been calculated, as indicated by the value zero. It is worth noting that objects hashing as value zero will never be cached, but given that there's only one chance among 4 billion that this happens, this is totally harmless. This fix must be backported to 1.9 and 1.8.	2019-01-14 10:31:31 +01:00
Willy Tarreau	f77a158c87	MINOR: mux-h1: make the mux_h1_ops struct static It was needlessly exported while it's only used inside the mux.	2019-01-10 10:00:08 +01:00
Olivier Houchard	51088ce68f	BUG/MEDIUM: ssl: Disable anti-replay protection and set max data with 0RTT. When using early data, disable the OpenSSL anti-replay protection, and set the max amount of early data we're ready to accept, based on the size of buffers, or early data won't work with the released OpenSSL 1.1.1. This should be backported to 1.8.	2019-01-09 16:26:28 +01:00
Daniel Corbett	43bb842a08	BUG/MEDIUM: init: Initialize idle_orphan_conns for first server in server-template When initializing server-template all of the servers after the first have srv->idle_orphan_conns initialized within server_template_init() The first server does not have this initialized and when http-reuse is active this causes a segmentation fault when accessed from srv_add_to_idle_list(). This patch removes the check for srv->tmpl_info.prefix within server_finalize_init() and allows the first server within a server-template to have srv->idle_orphan_conns properly initialized. This should be backported to 1.9.	2019-01-09 14:45:21 +01:00
Christopher Faulet	4b0e9b2870	BUG/MINOR: lua/htx: Respect the reserve when data are send from an HTX applet In the function hlua_applet_htx_send_yield(), there already was a test to respect the reserve but the wrong function was used to get the available space for data in the HTX buffer. Instead of calling htx_free_space(), the function htx_free_data_space() must be used. But in fact, there is no reason to bother with that anymore because the function channel_htx_recv_max() has been added for this purpose. The result of this bug is that the call to htx_add_data() failed unexpectedly while the amount of written data was incremented, leading the applet to think all data was sent. To prevent any futher bugs, a test has been added to yield if we are not able to write data into the channel buffer. This patch must be backported to 1.9.	2019-01-09 14:36:22 +01:00
Willy Tarreau	a01f45e3ce	BUG/CRITICAL: mux-h2: re-check the frame length when PRIORITY is used Tim D�sterhus reported a possible crash in the H2 HEADERS frame decoder when the PRIORITY flag is present. A check is missing to ensure the 5 extra bytes needed with this flag are actually part of the frame. As per RFC7540#4.2, let's return a connection error with code FRAME_SIZE_ERROR. Many thanks to Tim for responsibly reporting this issue with a working config and reproducer. This issue was assigned CVE-2018-20615. This fix must be backported to 1.9 and 1.8.	2019-01-08 13:20:59 +01:00
Christopher Faulet	202c6ce1a2	BUG/MINOR: proto_htx: Use HTX versions to truncate or erase a buffer channel_truncate() is not aware of the underlying format of the messages. So if there are some outgoing data in the channel when called, it does some unexpected operations on the channel's buffer. So the HTX version, channel_htx_truncate(), must be used. The same is true for channel_erase(). It resets the buffer but not the HTX message. So channel_htx_erase() must be used instead. This patch is flagged as a bug, but as far as we know, it was never hitted. This patch should be backported to 1.9. If so, following patch must be backported too: * MINOR: channel/htx: Add the HTX version of channel_truncate/erase	2019-01-08 12:06:55 +01:00
Christopher Faulet	00cf697215	MINOR: htx: Add a function to truncate all blocks after a specific offset This function will be used to truncate all incoming data in a channel, keeping outgoing ones. This may be backported to 1.9.	2019-01-08 12:06:55 +01:00
Christopher Faulet	839791af0d	BUG/MINOR: cache: Disable the cache if any compression filter precedes it We need to check if any compression filter precedes the cache filter. This is only possible when the compression is configured in the frontend while the cache filter is configured on the backend (via a cache-store action or explicitly). This case cannot be detected during HAProxy startup. So in such cases, the cache is disabled. The patch must be backported to 1.9.	2019-01-08 11:32:23 +01:00
Christopher Faulet	ff17b183fe	BUG/MINOR: filters: Detect cache+compression config on legacy HTTP streams On legacy HTTP streams, it is forbidden to use the compression with the cache. When the compression filter is explicitly specified, the detection works as expected and such configuration are rejected at startup. But it does not work when the compression filter is implicitly defined. To fix the bug, the implicit declaration of the compression filter is checked first, before calling .check() callback of each filters. This patch should be backported to 1.9.	2019-01-08 11:32:23 +01:00
Christopher Faulet	1d3613a031	BUG/MINOR: compression: Disable it if another one is already in progress Since the commit `9666720c8` ("BUG/MEDIUM: compression: Use the right buffer pointers to compress input data"), the compression can be done twice. The first time on the frontend and the second time on the backend. This may happen by configuring the compression in a default section. To fix the bug, when the response is checked to know if it should be compressed or not, if the flag HTTP_MSGF_COMPRESSING is set, the compression is not performed. It means it is already handled by a previous compression filter. Thanks to Pieter (PiBa-NL) to report this bug. This patch must be backported to 1.9.	2019-01-08 11:31:56 +01:00
Christopher Faulet	666a0c4d82	MEDIUM: mux-h1: Clarify how shutr/shutw are handled Now, h1_shutr() only do a shutdown read and try to set the flag H1C_F_CS_SHUTDOWN if shutdown write was already performed. On its side, h1_shutw(), if all conditions are met, do the same for the shutdown write. The real connection close is done when the mux h1 is released, in h1_release(). The flag H1C_F_CS_SHUTW was renamed to H1C_F_CS_SHUTDOWN to be less ambiguous. This patch may be backported to 1.9.	2019-01-08 11:31:16 +01:00
Christopher Faulet	f3eb2b1c24	BUG/MINOR: mux-h1: Close connection on shutr only when shutw was really done In h1_shutr(), to fully close the connection, we must be sure the shutdown write was already performed on the connection. So we know rely on connection flags instead of conn_stream flags. If CO_FL_SOCK_WR_SH is already set when h1_shutr() is called, we can do a full connection close. Otherwise, we just do the shutdown read. Without this patch, it is possible to close the connection too early with some outgoing data in the output buf. This patch must be backported to 1.9.	2019-01-08 11:31:16 +01:00
Christopher Faulet	69fc88c605	BUG/MINOR: stats/htx: Respect the reserve when the stats page is dumped As for the cache applet, this one must respect the reserve on HTX streams. This patch is tagged as MINOR because it is unlikely to fully fill the channel's buffer. Some tests are already done to not process almost full buffer. This patch must be backported to 1.9.	2019-01-07 16:32:10 +01:00
Christopher Faulet	cc156623b2	BUG/MEDIUM: cache/htx: Respect the reserve when cached objects are served It is only true for HTX streams. The legacy code relies on ci_putblk() which is already aware of the reserve. It is mandatory to not fill the reserve to let other filters analysing data. It is especially true for the compression filter. It needs at least 20 bytes of free space, plus at most 5 bytes per 32kB block. So if the cache fully fills the channel's buffer, the compression will not have enough space to do its job and it will block the data forwarding, waiting for more free space. But if the buffer fully filled with input data (ie no outgoing data), the stream will be frozen infinitely. This patch must be backported to 1.9. It depends on the following patches: * BUG/MEDIUM: cache/htx: Respect the reserve when cached objects are served from the cache * MINOR: channel/htx: Add HTX version for some helper functions	2019-01-07 16:32:07 +01:00
Thierry FOURNIER	bf90ce12aa	BUG/MEDIUM: lua: dead lock when Lua tasks are trigerred When a task is created from Lua context out of initialisation, the hlua_ctx_init() function can be called from safe environement, so we must not initialise it. While the support of threads appear, the safe environment set a lock to ensure only one Lua execution at a time. If we initialize safe environment in another safe environmenet, we have a dead lock. this patch adds the support of the idicator "already_safe" whoch indicates if the context is initialized form safe Lua fonction. thank to Flakebi for the report This patch must be backported to haproxy-1.9 and haproxy-1.8	2019-01-07 10:54:19 +01:00
Thierry FOURNIER	1725c2e395	BUG/MINOR: lua: bad args are returned for Lua actions In tcp actions case, the argument n - 1 is returned. For example: http-request lua.script stuff display "stuff" as first arg tcp-request content lua.script stuff display "lua.script" as first arg The action parser doesn't use the *cur_arg value. Thanks to Andy Franks for the bug report. This patch mist be backported in haproxy-1.8 and haproxy-1.9	2019-01-07 10:52:46 +01:00
Willy Tarreau	7778b59be1	MINOR: stream/cli: report more info about the HTTP messages on "show sess all" The "show sess all" command didn't allow to detect whether compression is in use for a given stream, which is sometimes annoying. Let's add a few more info about the HTTP messages, namely the flags, body len, chunk len and the "next" pointer.	2019-01-07 10:38:10 +01:00

... 7 8 9 10 11 ...

7942 Commits