haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 09:07:02 +02:00

Author	SHA1	Message	Date
Willy Tarreau	4c747e86cd	MINOR: list: make the delete and pop operations idempotent These operations previously used to return a "locked" element, which is a constraint when multiple threads try to delete the same element, because the second one will block indefinitely. Instead, let's make sure that both LIST_DEL_LOCKED() and LIST_POP_LOCKED() always reinitialize the element after deleting it. This ensures that the second thread will immediately unblock and succeed with the removal. It also secures the pop vs delete competition that may happen when trying to remove an element that's about to be dequeued.	2019-02-28 16:03:29 +01:00
Willy Tarreau	690d2ad4d2	BUG/MEDIUM: list: add missing store barriers when updating elements and head Commit `a8434ec14` ("MINOR: lists: Implement locked variations.") introduced locked lists which use the elements pointers as locks for concurrent operations. Under heavy stress the lists occasionally fail. The cause is a missing barrier at some points when updating the list element and the head : nothing prevents the compiler (or CPU) from updating the list head first before updating the element, making another thread jump to a wrong location. This patch simply adds the missing barriers before these two opeations. This will have to be backported if the commit above is backported.	2019-02-28 15:59:31 +01:00
Willy Tarreau	285192564d	BUG/MEDIUM: list: fix LIST_POP_LOCKED's removal of the last pointer There was a typo making the last updated pointer be the pre-last element's prev instead of the last's prev element. It didn't show up during early tests because the contention is very rare on this one and it's implicitly recovered when updating the pointers to go to the next element, but it was clearly visible in the listener_accept() tests by having all threads block on LIST_POP_LOCKED() with n==p==LLIST_BUSY. This will have to be backported if commit `a8434ec14` ("MINOR: lists: Implement locked variations.") is backported.	2019-02-28 15:59:31 +01:00
Willy Tarreau	bd20ad5874	BUG/MEDIUM: list: fix the rollback on addq in the locked liss Commit `a8434ec14` ("MINOR: lists: Implement locked variations.") introduced locked lists which use the elements pointers as locks for concurrent operations. A copy-paste typo in LIST_ADDQ_LOCKED() causes corruption in the list in case the next pointer is already held, as it restores the previous pointer into the next one. It may impact the server pools. This will have to be backported if the commit above is backported.	2019-02-28 15:10:15 +01:00
Willy Tarreau	149ab779cc	MAJOR: threads: enable one thread per CPU by default Threads have long matured by now, still for most users their usage is not trivial. It's about time to enable them by default on platforms where we know the number of CPUs bound. This patch does this, it counts the number of CPUs the process is bound to upon startup, and enables as many threads by default. Of course, "nbthread" still overrides this, but if it's not set the default behaviour is to start one thread per CPU. The default number of threads is reported in "haproxy -vv". Simply using "taskset -c" is now enough to adjust this number of threads so that there is no more need for playing with cpu-map. And thanks to the previous patches on the listener, the vast majority of configurations will not need to duplicate "bind" lines with the "process x/y" statement anymore either, so a simple config will automatically adapt to the number of processors available.	2019-02-27 14:51:50 +01:00
Willy Tarreau	7ac908bf8c	MINOR: config: add global tune.listener.multi-queue setting tune.listener.multi-queue { on \| off } Enables ('on') or disables ('off') the listener's multi-queue accept which spreads the incoming traffic to all threads a "bind" line is allowed to run on instead of taking them for itself. This provides a smoother traffic distribution and scales much better, especially in environments where threads may be unevenly loaded due to external activity (network interrupts colliding with one thread for example). This option is enabled by default, but it may be forcefully disabled for troubleshooting or for situations where it is estimated that the operating system already provides a good enough distribution and connections are extremely short-lived.	2019-02-27 14:27:07 +01:00
Willy Tarreau	8a03408d81	MINOR: activity: add accept queue counters for pushed and overflows It's important to monitor the accept queues to know if some incoming connections had to be handled by their originating thread due to an overflow. It's also important to be able to confirm thread fairness. This patch adds "accq_pushed" to activity reporting, which reports the number of connections that were successfully pushed into each thread's queue, and "accq_full", which indicates the number of connections that couldn't be pushed because the thread's queue was full.	2019-02-27 14:27:07 +01:00
Willy Tarreau	1efafce61f	MINOR: listener: implement multi-queue accept for threads There is one point where we can migrate a connection to another thread without taking risk, it's when we accept it : the new FD is not yet in the fd cache and no task was created yet. It's still possible to assign it a different thread than the one which accepted the connection. The only requirement for this is to have one accept queue per thread and their respective processing tasks that have to be woken up each time an entry is added to the queue. This is a multiple-producer, single-consumer model. Entries are added at the queue's tail and the processing task is woken up. The consumer picks entries at the head and processes them in order. The accept queue contains the fd, the source address, and the listener. Each entry of the accept queue was rounded up to 64 bytes (one cache line) to avoid cache aliasing because tests have shown that otherwise performance suffers a lot (5%). A test has shown that it's important to have at least 256 entries for the rings, as at 128 it's still possible to fill them often at high loads on small thread counts. The processing task does almost nothing except calling the listener's accept() function and updating the global session and SSL rate counters just like listener_accept() does on synchronous calls. At this point the accept queue is implemented but not used.	2019-02-27 14:27:07 +01:00
Willy Tarreau	b2b50a7784	MINOR: listener: pre-compute some thread counts per bind_conf In order to quickly pick a thread ID when accepting a connection, we'll need to know certain pre-computed values derived from the thread mask, which are counts of bits per position multiples of 1, 2, 4, 8, 16 and 32. In practice it is sufficient to compute only the 4 first ones and store them in the bind_conf. We update the count every time the bind_thread value is adjusted. The fields in the bind_conf struct have been moved around a little bit to make it easier to group all thread bit values into the same cache line. The function used to return a thread number is bind_map_thread_id(), and it maps a number between 0 and 31/63 to a thread ID between 0 and 31/63, starting from the left.	2019-02-27 14:27:07 +01:00
Willy Tarreau	f3241115e7	MINOR: tools: implement functions to look up the nth bit set in a mask Function mask_find_rank_bit() returns the bit position in mask <m> of the nth bit set of rank <r>, between 0 and LONGBITS-1 included, starting from the left. For example ranks 0,1,2,3 for mask 0x55 will be 6, 4, 2 and 0 respectively. This algorithm is based on a popcount variant and is described here : https://graphics.stanford.edu/~seander/bithacks.html.	2019-02-27 14:27:07 +01:00
Willy Tarreau	9e85318417	MINOR: listener: maintain a per-thread count of the number of connections on a listener Having this information will help us improve thread-level distribution of incoming traffic.	2019-02-27 14:27:07 +01:00
Willy Tarreau	a36b324777	MEDIUM: listener: keep a single thread-mask and warn on "process" misuse Now that nbproc and nbthread are exclusive, we can still provide more detailed explanations about what we've found in the config when a bind line appears on multiple threads and processes at the same time, then ignore the setting. This patch reduces the listener's thread mask to a single mask instead of an array of masks per process. Now we have only one thread mask and one process mask per bind-conf. This removes ~504 bytes of RAM per bind-conf and will simplify handling of thread masks. If a "bind" line only refers to process numbers not found by its parent frontend or not covered by the global nbproc directive, or to a thread not covered by the global nbthread directive, a warning is emitted saying what will be used instead.	2019-02-27 14:27:07 +01:00
Olivier Houchard	db64489aac	BUG/MEDIUM: lists: Properly handle the case we're removing the first elt. In LIST_DEL_LOCKED(), initialize p2 to NULL, and only attempt to set it back to its previous value if we had a previous element, and thus p2 is non-NULL.	2019-02-26 18:47:59 +01:00
Olivier Houchard	9ea5d361ae	MEDIUM: servers: Reorganize the way idle connections are cleaned. Instead of having one task per thread and per server that does clean the idling connections, have only one global task for every servers. That tasks parses all the servers that currently have idling connections, and remove half of them, to put them in a per-thread list of connections to kill. For each thread that does have connections to kill, wake a task to do so, so that the cleaning will be done in the context of said thread.	2019-02-26 18:17:32 +01:00
Olivier Houchard	7f1bc31fee	MEDIUM: servers: Used a locked list for idle_orphan_conns. Use the locked macros when manipulating idle_orphan_conns, so that other threads can remove elements from it. It will be useful later to avoid having a task per server and per thread to cleanup the orphan list.	2019-02-26 18:17:32 +01:00
Olivier Houchard	a8434ec146	MINOR: lists: Implement locked variations. Implement LIST_ADD_LOCKED(), LIST_ADDQ_LOCKED(), LIST_DEL_LOCKED() and LIST_POP_LOCKED(). LIST_ADD_LOCKED, LIST_ADDQ_LOCKED and LIST_DEL_LOCKED work the same as LIST_ADD, LIST_ADDQ and LIST_DEL, except before any manipulation it locks the relevant elements of the list, so it's safe to manipulate the list with multiple threads. LIST_POP_LOCKED() removes the first element from the list, and returns its data.	2019-02-26 18:17:32 +01:00
Fr�d�ric L�caille	1fceee8316	MINOR: http_fetch: add "req.ungrpc" sample fetch for gRPC. This patch implements "req.ungrpc" sample fetch method to decode and parse a gRPC request. It takes only one argument: a protocol buffers field number to identify the protocol buffers message number to be looked up. This argument is a sort of path in dotted notation to the terminal field number to be retrieved. ex: req.ungrpc(1.2.3.4) This sample fetch catch the data in raw mode, without interpreting them. Some protocol buffers specific converters may be used to convert the data to the correct type.	2019-02-26 16:27:05 +01:00
Fr�d�ric L�caille	3a463c92cf	MINOR: arg: Add support for ARGT_PBUF_FNUM arg type. This new argument type is used to parse Protocol Buffers field number with dotted notation (e.g: 1.2.3.4).	2019-02-26 16:27:05 +01:00
Fr�d�ric L�caille	3b71716685	MINOR: standard: Add a function to parse uints (dotted notation). This function is useful to parse strings made of unsigned integers and to allocate a C array of unsigned integers from there. For instance this function allocates this array { 1, 2, 3, 4, } from this string: "1.2.3.4".	2019-02-26 16:27:05 +01:00
Christopher Faulet	c6827d52c1	MINOR: channel/htx: Add function to skips output bytes from an HTX channel It is the HTX version of co_skip(). Internally, It uses the function htx_drain(). It will be used by other commits to fix bugs, so it must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	549822f0a1	MINOR: htx: Add function to drain data from an HTX message The function htx_drain() can now be used to drain data from an HTX message. It will be used by other commits to fix bugs, so it must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	729b5b308c	BUG/MINOR: channel: Set CF_WROTE_DATA when outgoing data are skipped in co_skip(), the flag CF_WRITE_PARTIAL is set on the channel. The flag CF_WROTE_DATA must also be set to notify the channel some data were sent. This patch must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Richard Russo	bc9d9844d5	BUG/MAJOR: fd/threads, task/threads: ensure all spin locks are unlocked Calculate if the fd or task should be locked once, before locking, and reuse the calculation when determing when to unlock. Fixes a race condition added in `87d54a9a` for fds, and `b20aa9ee` for tasks, released in 1.9-dev4. When one thread modifies thread_mask to be a single thread for a task or fd while a second thread has locked or is waiting on a lock for that task or fd, the second thread will not unlock it. For FDs, this is observable when a listener is polled by multiple threads, and is closed while those threads have events pending. For tasks, this seems possible, where task_set_affinity is called, but I did not observe it. This must be backported to 1.9.	2019-02-25 16:16:36 +01:00
Willy Tarreau	2d7f81b809	MINOR: fd: add a new my_closefrom() function to close all FDs This is a naive implementation of closefrom() which closes all FDs starting from the one passed in argument. closefrom() is not provided on all operating systems, and other versions will follow.	2019-02-21 22:19:17 +01:00
Olivier Houchard	f131481a0a	BUG/MEDIUM: servers: Add a per-thread counter of idle connections. Add a per-thread counter of idling connections, and use it to determine how many connections we should kill after the timeout, instead of using the global counter, or we're likely to just kill most of the connections. This should be backported to 1.9.	2019-02-21 19:07:45 +01:00
Olivier Houchard	e737103173	BUG/MEDIUM: servers: Use atomic operations when handling curr_idle_conns. Use atomic operations when dealing with srv->curr_idle_conns, as it's shared between threads, otherwise we could get inconsistencies. This should be backported to 1.9.	2019-02-21 19:07:19 +01:00
Christopher Faulet	0b46548a68	BUG/MEDIUM: h2/htx: Correctly handle interim responses when HTX is enabled 1xx responses does not work in HTTP2 when the HTX is enabled. First of all, when a response is parsed, only one HEADERS frame is expected. So when an interim response is received, the flag H2_SF_HEADERS_RCVD is set and the next HEADERS frame (for another interim repsonse or the final one) is parsed as a trailers one. Then when the response is sent, because an EOM block is found at the end of the interim HTX response, the ES flag is added on the frame, closing too early the stream. Here, it is a design problem of the HTX. Iterim responses are considered as full messages, leading to some ambiguities when HTX messages are processed. This will not be fixed now, but we need to keep it in mind for future improvements. To fix the parsing bug, the flag H2_MSGF_RSP_1XX is added when the response headers are decoded. When this flag is set, an EOM block is added into the HTX message, despite the fact that there is no ES flag on the frame. And we don't set the flag H2_SF_HEADERS_RCVD on the corresponding H2S. So the next HEADERS frame will not be parsed as a trailers one. To fix the sending bug, the ES flag is not set on the frame when an interim response is processed and the flag H2_SF_HEADERS_SENT is not set on the corresponding H2S. This patch must be backported to 1.9.	2019-02-19 16:26:14 +01:00
Olivier Houchard	9efa7b8ba8	BUILD/MEDIUM: initcall: Fix build on MacOS. MacOS syntax for sections is a bit different, so implement it. (see issue #42). This should be backported to 1.9.	2019-02-15 14:32:35 +01:00
Fr�d�ric L�caille	76d2cef0c2	BUG/MEDIUM: peers: Missing peer initializations. Initialize ->srv peer field for all the peers, the local peer included. Indeed, a haproxy process needs to connect to the local peer of a remote process. Furthermore, when a "peer" or "server" line is parsed by parse_server() the address must be copied to ->addr field of the peer object only if this address has been also parsed by parse_server(). This is not the case if this address belongs to the local peer and is provided on a "server" line. After having parsed the "peer" or "server" lines of a peer sections, the ->srv part of all the peer must be initialized for SSL, if enabled. Same thing for the binding part. Revert `1417f0b` commit which is no more required. No backport is needed, this is purely 2.0.	2019-02-12 19:49:22 +01:00
Ben51Degrees	4ddf59d070	MEDIUM: 51d: Enabled multi threaded operation in the 51Degrees module. The existing threading flag in the 51Degrees API (FIFTYONEDEGREES_NO_THREADING) has now been mapped to the HAProxy threading flag (USE_THREAD), and the 51Degrees module code has been made thread safe. In Pattern, the cache is now locked with a spin lock from hathreads.h using a new lable 'OTHER_LOCK'. The workset pool is now created with the same size as the number of threads to avoid any time waiting on a worket. In Hash Trie, the global device offsets structure is only used in single threaded operation. Multi threaded operation creates a new offsets structure in each thread.	2019-02-08 21:29:23 +01:00
Willy Tarreau	1417f0b5dc	BUG/MEDIUM: peers: check that p->srv actually exists before using p->srv->use_ssl Commit `1055e687a` ("MINOR: peers: Make outgoing connection to SSL/TLS peers work.") introduced an "srv" field in the peers, which points to the equivalent server to hold SSL settings. This one is not set when the peer is local so we must always test it before testing p->srv->use_ssl otherwise haproxy dies during reloads. No backport is needed, this is purely 2.0.	2019-02-08 10:22:31 +01:00
Willy Tarreau	ff9c9140f4	MINOR: config: make MAX_PROCS configurable at build time For some embedded systems, it's pointless to have 32- or even 64- large arrays of processes when it's known that much fewer processes will be used in the worst case. Let's introduce this MAX_PROCS define which contains the highest number of processes allowed to run at once. It still defaults to LONGBITS but may be lowered.	2019-02-07 15:10:19 +01:00
Willy Tarreau	980855bd95	BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end This also depends on the nbthread count, so it must only be performed after parsing the whole config file. As a side effect, this removes some code duplication between servers and server-templates. This must be backported to 1.9.	2019-02-07 15:08:13 +01:00
Willy Tarreau	2415727a00	MINOR: global: add proc_mask() and thread_mask() These two functions return either all_{proc,threads}_mask, or the argument. This is used to default to all_proc_mask or all_threads_mask when not set on bind_conf or proxies.	2019-02-04 05:09:15 +01:00
Willy Tarreau	a38a7175b1	MINOR: config: keep an all_proc_mask like we have all_threads_mask This simplifies some mask comparisons at various places where nbits(global.nbproc) was used.	2019-02-04 05:09:15 +01:00
Willy Tarreau	cafa56ecd6	MINOR: tools: improve the popcount() operation We'll call popcount() more often so better use a parallel method than an iterative one. One optimal design is proposed at the site below. It requires a fast multiplication though, but even without it will still be faster than the iterative one, and all relevant 64 bit platforms do have a multiply unit. https://graphics.stanford.edu/~seander/bithacks.html	2019-02-04 05:09:15 +01:00
Willy Tarreau	4ed84c96cf	OPTIM: listener: optimize cache-line packing for struct listener Some unused fields were placed early and some important ones were on the second cache line. Let's move the proto_list and name closer to the end of the structure to bring accept() and default_target() into the first cache line.	2019-02-04 05:09:14 +01:00
Willy Tarreau	da9e939f3c	CLEANUP: threads: fix misleading comment about all_threads_mask This variable changed a bit after 1.8, it's never zero anymore.	2019-02-02 17:48:39 +01:00
Olivier Houchard	dc21ff778b	MINOR: debug: Add an option that causes random allocation failures. When compiling with DEBUG_FAIL_ALLOC, add a new option, tune.fail-alloc, that gives the percentage of chances an allocation fails. This is useful to check that allocation failures are always handled gracefully.	2019-01-31 19:38:25 +01:00
Olivier Houchard	ff5dd74e25	MINOR: xref: Add missing barriers. Add a few missing barriers in the xref code, it's unlikely to be a problem for x86, but may be on architectures with weak memory ordering.	2019-01-31 19:38:25 +01:00
Willy Tarreau	00f18a36b6	BUG/MINOR: server: fix logic flaw in idle connection list management With variable connection limits, it's not possible to accurately determine whether the mux is still in use by comparing usage and max to be equal due to the fact that one determines the capacity and the other one takes care of the context. This can cause some connections to be dropped before they reach their stream ID limit. It seems it could also cause some connections to be terminated with streams still alive if the limit was reduced to match the newly computed avail_streams() value, though this cannot yet happen with existing muxes. Instead let's switch to usage reports and simply check whether connections are both unused and available before adding them to the idle list. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Willy Tarreau	51d0a7e54c	MINOR: connstream: have a new flag CS_FL_KILL_CONN to kill a connection This is the equivalent of SI_FL_KILL_CONN but for the connstreams. It will be set by the stream-interface during the various shutdown operations.	2019-01-31 19:38:25 +01:00
Willy Tarreau	0f9cd7b196	MINOR: stream-int: add a new flag to mention that we want the connection to be killed The new flag SI_FL_KILL_CONN is now set by the rare actions which deliberately want the whole connection (and not just the stream) to be killed. This is only used for "tcp-request content reject", "tcp-response content reject", "tcp-response content close" and "http-request reject". The purpose is to desambiguate the close from a regular shutdown. This will be used by the next patches.	2019-01-31 19:38:25 +01:00
Olivier Houchard	8788b4111c	BUG/MEDIUM: connections: Don't forget to remove CO_FL_SESS_IDLE. If we're adding a connection to the server orphan idle list, don't forget to remove the CO_FL_SESS_IDLE flag, or we will assume later it's still attached to a session. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Willy Tarreau	e5fcfbed5c	MINOR: htx: never check for null htx pointer in htx_is_{,not_}empty() The previous patch clarifies the fact that the htx pointer is never null along all the code. This test for a null will never match, didn't catch the pointer 1 before the fix for b_is_null(), but it confuses the compiler letting it think that any dereferences made to this pointer after this test could actually mean we're dereferencing a null. Let's now drop this test. This saves us from having to add impossible tests everywhere to avoid the warning. This should be backported to 1.9 if the b_is_null() patch is backported.	2019-01-31 08:07:17 +01:00
Willy Tarreau	245d189cce	DOC: htx: make it clear that htxbuf() and htx_from_buf() always return valid pointers Update the comments above htxbuf() and htx_from_buf() to make it clear that they always return valid htx pointers so that callers know they do not have to test them. This is only true after the fix on b_is_null() which was the only known corner case. This should be backported to 1.9 if the b_is_null() patch is backported.	2019-01-31 08:07:17 +01:00
Olivier Houchard	203d735cac	BUG/MEDIUM: buffer: Make sure b_is_null handles buffers waiting for allocation. In b_is_null(), make sure we return 1 if the buffer is waiting for its allocation, as users assume there's memory allocated if b_is_null() returns 0. The indirect impact of not having this was that htxbuf() would not match b_is_null() for a buffer waiting for an allocation, and would thus return the value 1 for the htx pointer, causing various crashes under low memory condition. Note that this patch makes gcc versions 6 and above report two null-deref warnings in proto_htx.c since htx_is_empty() continues to check for a null pointer without knowing that this is protected by the test on b_is_null(). This is addressed by the following patches. This should be backported to 1.9.	2019-01-31 08:07:17 +01:00
Willy Tarreau	9c84d8299a	MINOR: h2: add a generic frame checker The new function h2_frame_check() checks the protocol limits for the received frame (length, ID, direction) and returns a verdict made of a connection error code. The purpose is to be able to validate any frame regardless of the state and the ability to call the frame handler, and to emit a GOAWAY early in this case.	2019-01-30 19:37:20 +01:00
Willy Tarreau	13afcb7ab3	BUG/MINOR: task: fix possibly missed event in inter-thread wakeups There's a very small but existing uncertainty window when waking another thread up where it is possible for task_wakeup() not to wake the other task up because it's still running while this once is in the process of finishing and loses its TASK_RUNNING flag. In this case the wakeup will be missed. The problem is that we have a single flag to store 3 states, since the transition from running to sleeping isn't atomic. Thus we need to have another flag to cover this part. This patch introduces TASK_QUEUED to mention that the task is already in the run queue, running or not. This bit will be removed while TASK_RUNNING is kept once dequeued, and will be used when removing TASK_RUNNING to check if the task has been requeued. It might be possible to slightly improve this but the occurrence rate is quite low and we don't really need to complexify the scheduler to optimize for a rare case. The impact with the current code is very low since we have few inter- thread wakeups. Most of them are caused by checks killing sessions. This must be backported to 1.9.	2019-01-28 15:03:04 +01:00
Willy Tarreau	f5809cde7a	MINOR: threads: make MAX_THREADS configurable at build time There's some value in being able to limit MAX_THREADS, either to save precious resources in embedded environments, or to protect certain deployments against accidently incorrect settings. With this patch, if MAX_THREADS is defined at build time, it will be used. However, given that LONGBITS is not a macro but is defined according to sizeof(long), we can't check the value range at build time and instead we need to perform the check at early boot time. However, the compiler is able to optimize away the constant comparisons and doesn't even emit the check code when values are correct. The output message regarding threading support was improved to report the number of threads.	2019-01-26 13:37:48 +01:00
Willy Tarreau	c9a82e48bf	MINOR: cfgparse: make the process/thread parser support a maximum value It was hard-wired to LONGBITS, let's make it configurable depending on the context (threads, processes).	2019-01-26 13:25:14 +01:00
Willy Tarreau	4790f7c907	MEDIUM: h2: always parse and deduplicate the content-length header The header used to be parsed only in HTX but not in legacy. And even in HTX mode, the value was dropped. Let's always parse it and report the parsed value back so that we'll be able to store it in the streams.	2019-01-24 19:07:26 +01:00
Willy Tarreau	bf66bd1b8b	MEDIUM: stream-int: always mark pending outgoing SI_ST_CON Before the first send() attempt, we should be in SI_ST_CON, not SI_ST_EST, since we have not yet attempted to send and we are allowed to retry. This is particularly important with complex outgoing muxes which can fail during the first send attempt (e.g. failed stream ID allocation). It only requires that sess_update_st_con_tcp() knows about this possibility, as we must not forcefully close a reused connection when facing an error in this case, this will be handled later. This may be backported to 1.9 with care after some observation period.	2019-01-24 19:06:43 +01:00
Willy Tarreau	9c538e01c2	MINOR: server: add a max-reuse parameter Some servers may wish to limit the total number of requests they execute over a connection because some of their components might leak resources. In HTTP/1 it was easy, they just had to emit a "connection: close" header field with the last response. In HTTP/2, it's less easy because the info is not always shared with the component dealing with the H2 protocol and it could be harder to advertise a GOAWAY with a stream limit. This patch provides a solution to this by adding a new "max-reuse" parameter to the server keyword. This parameter indicates how many times an idle connection may be reused for new requests. The information is made available and the underlying muxes will be able to use it at will. This patch should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	1e7d444eec	BUG/MINOR: hpack: return a compression error on invalid table size updates RFC7541#6.3 mandates that an error is reported when a dynamic table size update announces a size larger than the one configured with settings. This is tested by h2spec using test "hpack/6.3/1". This must be backported to 1.9 and possibly 1.8 as well.	2019-01-24 15:27:06 +01:00
Willy Tarreau	71c3811589	MINOR: h2: declare new sets of frame types This patch adds H2_FT_HDR_MASK to group all frame types carrying headers information, and H2_FT_LATE_MASK to group frame types allowed to arrive after a stream was closed.	2019-01-24 15:27:06 +01:00
Fr�d�ric L�caille	355b2033ec	MINOR: cfgparse: SSL/TLS binding in "peers" sections. Make "bind" keywork be supported in "peers" sections. All "bind" settings are supported on this line. Add "default-bind" option to parse the binding options excepted the bind address. Do not parse anymore the bind address for local peers on "server" lines. Do not use anymore list_for_each_entry() to set the "peers" section listener parameters because there is only one listener by "peers" section. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	1055e687a2	MINOR: peers: Make outgoing connection to SSL/TLS peers work. This patch adds pointer to a struct server to peer structure which is initialized after having parsed a remote "peer" line. After having parsed all peers section we run ->prepare_srv to initialize all SSL/TLS stuff of remote perr (or server). Remaining thing to do to completely support peer protocol over SSL/TLS: make "bind" keyword be supported in "peers" sections to make SSL/TLS incoming connections to local peers work. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Tim Duesterhus	8b87c01c4d	BUG/MINOR: stick_table: Prevent conn_cur from underflowing When using the peers feature a race condition could prevent a connection from being properly counted. When this connection exits it is being "uncounted" nonetheless, leading to a possible underflow (-1) of the conn_curr stick table entry in the following scenario : - Connect to peer A (A=1, B=0) - Peer A sends 1 to B (A=1, B=1) - Kill connection to A (A=0, B=1) - Connect to peer B (A=0, B=2) - Peer A sends 0 to B (A=0, B=0) - Peer B sends 0/2 to A (A=?, B=0) - Kill connection to B (A=?, B=-1) - Peer B sends -1 to A (A=-1, B=-1) This fix may be backported to all supported branches.	2019-01-15 15:34:49 +01:00
Willy Tarreau	0cac26cd88	MEDIUM: backend: move all LB algo parameters into an union Since all of them are exclusive, let's move them to an union instead of eating memory with the sum of all of them. We're using a transparent union to limit the code changes. Doing so reduces the struct lbprm from 392 bytes to 372, and thanks to these changes, the struct proxy is now down to 6480 bytes vs 6624 before the changes (144 bytes saved per proxy).	2019-01-14 19:33:17 +01:00
Willy Tarreau	76e84f5091	MINOR: backend: move hash_balance_factor out of chash This one is a proxy option which can be inherited from defaults even if the LB algo changes. Move it out of the lb_chash struct so that we don't need to keep anything separate between these structs. This will allow us to merge them into an union later. It even takes less room now as it fills a hole and removes another one.	2019-01-14 19:33:17 +01:00
Willy Tarreau	a9a7249966	MINOR: backend: remap the balance uri settings to lbprm.arg_opt{1,2,3} The algo-specific settings move from the proxy to the LB algo this way : - uri_whole => arg_opt1 - uri_len_limit => arg_opt2 - uri_dirs_depth1 => arg_opt3	2019-01-14 19:33:17 +01:00
Willy Tarreau	9fed8586b5	MINOR: backend: make the header hash use arg_opt1 for use_domain_only This is only a boolean extra arg. Let's map it to arg_opt1 and remove hh_match_domain from struct proxy.	2019-01-14 19:33:17 +01:00
Willy Tarreau	20e68378f1	MINOR: backend: add new fields in lbprm to store more LB options Some algorithms require a few extra options (up to 3). Let's provide some room in lbprm to store them, and make sure they're passed from defaults to backends.	2019-01-14 19:33:17 +01:00
Willy Tarreau	484ff07691	MINOR: backend: make headers and RDP cookie also use arg_str/len These ones used to rely on separate variables called hh_name/hh_len but they are exclusive with the former. Let's use the same variable which becomes a generic argument name and length for the LB algorithm.	2019-01-14 19:33:17 +01:00
Willy Tarreau	4c03d1c9b6	MINOR: backend: move url_param_name/len to lbprm.arg_str/len This one is exclusively used by LB parameters, when using URL param hashing. Let's move it to the lbprm struct under a more generic name.	2019-01-14 19:33:17 +01:00
Emeric Brun	9e7547740c	MINOR: ssl: add support of aes256 bits ticket keys on file and cli. Openssl switched from aes128 to aes256 since may 2016 to compute tls ticket secrets used by default. But Haproxy still handled only 128 bits keys for both tls key file and CLI. This patch permit the user to set aes256 keys throught CLI or the key file (80 bytes encoded in base64) in the same way that aes128 keys were handled (48 bytes encoded in base64): - first 16 bytes for the key name - next 16/32 bytes for aes 128/256 key bits key - last 16/32 bytes for hmac 128/256 bits Both sizes are now supported (but keys from same file must be of the same size and can but updated via CLI only using a key of the same size). Note: This feature need the fix "dec func ignores padding for output size checking."	2019-01-14 19:32:58 +01:00
Olivier Houchard	c98aa1f182	MINOR: checks: Store the proxy in checks. Instead of assuming we have a server, store the proxy directly in struct check, and use it instead of s->server. This should be a no-op for now, but will be useful later when we change mail checks to avoid having a server. This should be backported to 1.9.	2019-01-14 11:15:11 +01:00
Willy Tarreau	762475e1f9	BUG/MEDIUM: connection: properly unregister the mux on failed initialization When mux->init() fails, session_free() will call it again to unregister it while it was already done, resulting in null derefs or use-after-free. This typically happens on out-of-memory conditions during H1 or H2 connection or stream allocation. This fix must be backported to 1.9.	2019-01-10 19:47:43 +01:00
Christopher Faulet	f7ed195ac8	MINOR: channel/htx: Add the HTX version of channel_truncate/erase The function channel_htx_truncate() can now be used on HTX buffer to truncate all incoming data, keeping outgoing one intact. This function relies on the function channel_htx_erase() and htx_truncate(). This patch may be backported to 1.9. If so, the patch "MINOR: channel/htx: Add the HTX version of channel_truncate()" must also be backported.	2019-01-08 12:06:55 +01:00
Christopher Faulet	00cf697215	MINOR: htx: Add a function to truncate all blocks after a specific offset This function will be used to truncate all incoming data in a channel, keeping outgoing ones. This may be backported to 1.9.	2019-01-08 12:06:55 +01:00
Christopher Faulet	5811db0043	MINOR: channel/htx: Add HTX version for some helper functions HTX versions for functions to test the free space in input against the reserve have been added. Now, on HTX streams, following functions can be used: * channel_htx_may_recv * channel_htx_recv_limit * channel_htx_recv_max * channel_htx_full This patch must be backported in 1.9 because it will be used by a futher patch to fix a bug.	2019-01-07 16:32:05 +01:00
Christopher Faulet	8564c1f04b	MINOR: htx: Add an helper function to get the max space usable for a block This patch must be backported in 1.9 because it will be used by a futher patch to fix a bug.	2019-01-07 16:32:02 +01:00
Willy Tarreau	909b9d852b	BUILD: add a new file "version.c" to carry version updates While testing fixes, it's sometimes confusing to rebuild only one C file (e.g. a mux) and not to have the correct commit ID reported in "haproxy -v" nor on the stats page. This patch adds a new "version.c" file which is always rebuilt. It's very small and contains only 3 variables derived from the various version strings. These variables are used instead of the macros at the few places showing the version. This way the output version of the running code is always correct for the parts that were rebuilt.	2019-01-04 18:20:32 +01:00
Olivier Houchard	f1b11e2d16	MINOR: connections: Remove a stall comment. Remove the comment that pretends 0x40000000 is unused, it's not true anymore.	2019-01-04 17:26:47 +01:00
Willy Tarreau	0f8fb6b7f9	MINOR: h1: make the H1 headers block parser able to parse headers only Currently the H1 headers parser works for either a request or a response because it starts from the start line. It is also able to resume its processing when it was interrupted, but in this case it doesn't update the list. Make it support a new flag, H1_MF_HDRS_ONLY so that the caller can indicate it's only interested in the headers list and not the start line. This will be convenient to parse H1 trailers.	2019-01-04 10:48:03 +01:00
Willy Tarreau	1e1f27c5c1	MINOR: h2: add h2_make_htx_trailers to turn H2 headers to HTX trailers This function is usable to transform a list of H2 header fields to a HTX trailers block. It takes care of rejecting forbidden headers and pseudo-headers when performing the conversion. It also emits the trailing CRLF that is currently needed in the HTX trailers block.	2019-01-03 18:45:38 +01:00
Willy Tarreau	52610e905d	MINOR: htx: add a new function to add a block without filling it htx_add_blk_type_size() creates a block of a specified type and size and returns it. The caller can then fill it.	2019-01-03 18:45:38 +01:00
Willy Tarreau	9d953e7572	MINOR: h2: add h2_make_h1_trailers to turn H2 headers to H1 trailers This function is usable to transform a list of H2 header fields to a H1 trailers block. It takes care of rejecting forbidden headers and pseudo-headers when performing the conversion.	2019-01-03 18:45:38 +01:00
Willy Tarreau	59884a646c	MINOR: lb: allow redispatch when using consistent hash Redispatch traditionally only worked for cookie based persistence. Adding redispatch support for consistent hash based persistence - also update docs. Reported by Oskar Stenman on discourse: https://discourse.haproxy.org/t/balance-uri-consistent-hashing-redispatch-3-not-redispatching/3344 Should be backported to 1.8. Cc: Lukas Tribus <lukas@ltri.eu>	2019-01-02 20:22:17 +01:00
Christopher Faulet	e64582929f	MINOR: channel: Add the function channel_add_input This function must be called when new incoming data are pushed in the channel's buffer. It updates the channel state and take care of the fast forwarding by consuming right amount of data and decrementing "->to_forward" accordingly when necessary. In fact, this patch just moves a part of ci_putblk in a dedicated function. This patch must be backported to 1.9.	2019-01-02 20:12:44 +01:00
Olivier Houchard	a2dbeb22fc	MEDIUM: sessions: Keep track of which connections are idle. Instead of keeping track of the number of connections we're responsible for, keep track of the number of connections we're responsible for that we are currently considering idling (ie that we are not using, they may be in use by other sessions), that way we can actually reuse connections when we have more connections than the max configured.	2018-12-28 19:16:03 +01:00
Olivier Houchard	351411facd	BUG/MAJOR: sessions: Use an unlimited number of servers for the conn list. When a session adds a connection to its connection list, we used to remove connections for an another server if there were not enough room for our server. This can't work, because those lists are now the list of connections we're responsible for, not just the idle connections. To fix this, allow for an unlimited number of servers, instead of using an array, we're now using a linked list.	2018-12-28 16:33:13 +01:00
Olivier Houchard	09e498f1a1	BUG/MEDIUM: tasks: Decrement tasks_run_queue in tasklet_free(). If the tasklet is in the list, don't forget to decrement tasks_run_queue in tasklet_free(). This should be backported to 1.9.	2018-12-24 14:04:55 +01:00
Willy Tarreau	f48919aafb	MINOR: buffers: add a new b_move() function This function will be used to move parts of a buffer to another place in the same buffer, even if the parts overlap. In order to keep things under reasonable control, it only uses a length and absolute offsets for the source and destination, and doesn't consider head nor data.	2018-12-24 11:45:00 +01:00
Willy Tarreau	deab244dc1	MINOR: h2: add a bit-based frame type representation This will ease checks among sets of frames.	2018-12-24 11:45:00 +01:00
Willy Tarreau	fba74ea7b0	[RELEASE] Released version 2.0-dev0 Released version 2.0-dev0 with the following main changes : - BUG/MAJOR: connections: Close the connection before freeing it. - REGTEST: Require the option LUA to run lua tests - REGTEST: script: Process script arguments before everything else - REGTEST: script: Evaluate the varnishtest command to allow quoted parameters - REGTEST: script: Add the option --clean to remove previous log direcotries - REGTEST: script: Add the option --debug to show logs on standard ouput - REGTEST: script: Add the option --keep-logs to keep all log directories - REGTEST: script: Add the option --use-htx to enable the HTX in regtests - REGTEST: script: Print only errors in the results report - REGTEST: Add option to use HTX prefixed by the macro 'no-htx' - REGTEST: Make reg-tests target support argument. - REGTEST: Fix a typo about barrier type. - REGTEST: Be less Linux specific with a syslog regex. - REGTEST: Missing enclosing quotes for ${tmpdir} macro. - REGTEST: Exclude freebsd target for some reg tests. - BUG/MEDIUM: h2: Don't forget to quit the sending_list if SUB_CALL_UNSUBSCRIBE. - BUG/MEDIUM: mux-h2: Don't forget to quit the send list on error reports - BUG/MEDIUM: dns: Don't prevent reading the last byte of the payload in dns_validate_response() - BUG/MEDIUM: dns: overflowed dns name start position causing invalid dns error - BUG/MINOR: compression/htx: Don't compress responses with unknown body length - BUG/MINOR: compression/htx: Don't add the last block of data if it is empty - MEDIUM: mux_h1: Implement h1_show_fd. - REGTEST: script: Add support of alternatives in requited options list - REGTEST: Add a basic test for the compression - BUG/MEDIUM: mux-h2: don't needlessly wake up the demux on short frames - REGTEST: A basic test for "http-buffer-request" - BUG/MEDIUM: server: Also copy "check-sni" for server templates. - MINOR: ssl: Add ssl_sock_set_alpn(). - MEDIUM: checks: Add check-alpn.	2018-12-22 11:20:35 +01:00
Olivier Houchard	921501443b	MEDIUM: checks: Add check-alpn. Add a way to configure the ALPN used by check, with a new "check-alpn" keyword. By default, the checks will use the server ALPN, but it may not be convenient, for instance because the server may use HTTP/2, while checks are unable to do HTTP/2 yet.	2018-12-21 19:54:16 +01:00
Olivier Houchard	ab28a320aa	MINOR: ssl: Add ssl_sock_set_alpn(). Add a new function, ssl_sock_set_alpn(), to be able to change the ALPN for a connection, instead of relying of the one defined in the SSL_CTX.	2018-12-21 19:53:30 +01:00
Olivier Houchard	8ab8a6eee5	BUG/MAJOR: connections: Close the connection before freeing it. In si_release_endpoint(), if the end point is a connection, because we don't know which mux to use it, make sure we close the connection before freeing it, or else, we'd have a fd left for polling, which would point to a now free'd connection. This should be backported to 1.9.	2018-12-20 06:03:14 +01:00
Willy Tarreau	e9f4301f0f	MINOR: connection: add cs_set_error() to set the error bits Depending on the CS_FL_EOS status, we either set CS_FL_ERR_PENDING or CS_FL_ERROR at various places. Let's have a generic function to do this.	2018-12-19 18:13:52 +01:00
Willy Tarreau	14bfe9af12	CLEANUP: stream-int: consistently call the si/stream_int functions As long-time changes have accumulated over time, the exported functions of the stream-interface were almost all prefixed "si_<something>" while most private ones (mostly callbacks) were called "stream_int_<something>". There were still a few confusing exceptions, which were addressed to follow this shcme : - stream_sock_read0(), only used internally, was renamed stream_int_read0() and made static - stream_int_notify() is only private and was made static - stream_int_{check_timeouts,report_error,retnclose,register_handler,update} were renamed si_<something>. Now it is clearer when checking one of these if it risks to be used outside or not.	2018-12-19 15:25:43 +01:00
Willy Tarreau	94031d30d7	MINOR: connection: remove an unwelcome dependency on struct stream There was a reference to struct stream in conn_free() for the case where we're freeing a connection that doesn't have a mux attached. For now we know it's always a stream, and we only need to do it to put a NULL in s->si[1].end. Let's do it better by storing the pointer to si[1].end in the context and specifying that this pointer is always nulled if the mux is null. This way it allows a connection to detach itself from wherever it's being used. Maybe we could even get rid of the condition on the mux.	2018-12-19 14:36:29 +01:00
Willy Tarreau	3d2ee55ebd	CLEANUP: connection: rename conn->mux_ctx to conn->ctx We most often store the mux context there but it can also be something else while setting up the connection. Better call it "ctx" and know that it's the owner's context than misleadingly call it mux_ctx and get caught doing suspicious tricks.	2018-12-19 14:13:07 +01:00
Willy Tarreau	4f6516d677	CLEANUP: connection: rename subscription events values and event field The SUB_CAN_SEND/SUB_CAN_RECV enum values have been confusing a few times, especially when checking them on reading. After some discussion, it appears that calling them SUB_RETRY_SEND/SUB_RETRY_RECV more accurately reflects their purpose since these events may only appear after a first attempt to perform the I/O operation has failed or was not completed. In addition the wait_reason field in struct wait_event which carries them makes one think that a single reason may happen at once while it is in fact a set of events. Since the struct is called wait_event it makes sense that this field is called "events" to indicate it's the list of events we're subscribed to. Last, the values for SUB_RETRY_RECV/SEND were swapped so that value 1 corresponds to recv and 2 to send, as is done almost everywhere else in the code an in the shutdown() call.	2018-12-19 14:09:21 +01:00
Willy Tarreau	beefaee4f5	MEDIUM: h2: properly check and deduplicate the content-length header in HTX When producing an HTX message, we can't rely on the next-level H1 parser to check and deduplicate the content-length header, so we have to do it while parsing a message. The algorithm is the exact same as used for H1 messages.	2018-12-19 13:08:08 +01:00
Willy Tarreau	d5e3c71208	MINOR: objtype: report a few missing types in names and base pointers Types DNS_SRVRQ and CS were not referenced in the type to string conversions, causing possibly misleading outputs in session dumps. Now instead of showing "NONE" for unknown invalid types names, we display "!INVAL!" to clear the confusion that may exist in case of memory corruption for example.	2018-12-18 16:31:10 +01:00
Olivier Houchard	71748cb91b	BUG/MEDIUM: connection: Add a new CS_FL_ERR_PENDING flag to conn_streams. Add a new flag to conn_streams, CS_FL_ERR_PENDING. This is to be set instead of CS_FL_ERR in case there's still more data to be read, so that we read all the data before closing.	2018-12-17 21:54:14 +01:00
Willy Tarreau	bce4d8a37d	MINOR: debug: make the ABORT_NOW macro use a volatile int Similar to previous commit, let's make the macro use a volatile when dereferencing NULL so that clang doesn't optimize it away.	2018-12-16 08:17:23 +01:00
Olivier Houchard	51e474136b	MINOR: pools: Cast to volatile int * instead of int . When using DEBUG_MEMORY_POOLS, when we want to crash, instead of using (int )0 = 0, use (volatile int *)0 = 0, or clang will just translate it to a nop, instead of dereferencing 0.	2018-12-16 08:15:16 +01:00
Olivier Houchard	a4d4fdfaa3	MEDIUM: sessions: Don't keep an infinite number of idling connections. In session, don't keep an infinite number of connection that can idle. Add a new frontend parameter, "max-session-srv-conns" to set a max number, with a default value of 5.	2018-12-15 23:50:10 +01:00
Olivier Houchard	f502aca5c2	MEDIUM: mux: provide the session to the init() and attach() method. Instead of trying to get the session from the connection, which is not always there, and of course there could be multiple sessions per connection, provide it with the init() and attach() methods, so that we know the session for each outgoing stream.	2018-12-15 23:50:09 +01:00
Olivier Houchard	b7b3faa79c	MEDIUM: servers: Replace idle-timeout with pool-purge-delay. Instead of the old "idle-timeout" mechanism, add a new option, "pool-purge-delay", that sets the delay before purging idle connections. Each time the delay happens, we destroy half of the idle connections.	2018-12-15 23:50:09 +01:00
Olivier Houchard	006e3101f9	MEDIUM: servers: Add a command to limit the number of idling connections. Add a new command, "pool-max-conn" that sets the maximum number of connections waiting in the orphan idling connections list (as activated with idle-timeout). Using "-1" means unlimited. Using pools is now dependant on this.	2018-12-15 23:50:08 +01:00
William Lallemand	a57b7e33ef	MINOR: cli: implements 'reload' on master CLI The reload command reload the haproxy master like it is done with a kill -USR2 on the master process.	2018-12-15 13:33:49 +01:00
Christopher Faulet	f0216dae0c	MINOR: payload/htx: Adapt smp_fetch_len to be HTX aware	2018-12-14 16:03:34 +01:00
Willy Tarreau	a1214a501f	MINOR: cache: report the number of cache lookups and cache hits The cache lookups and hits is now accounted per frontend and per backend, and reported on the stats page.	2018-12-14 14:00:25 +01:00
Willy Tarreau	59caa3b872	MINOR: tools: increase the number of ITOA strings to 16 It's currently 10 and is too little to extend some tooltips on the stats page.	2018-12-14 13:59:42 +01:00
Willy Tarreau	f157384803	MINOR: backend: count the number of connect and reuse per server and per backend Sadly we didn't have the cumulated number of connections established to servers till now, so let's now update it per backend and per-server and report it in the stats. On the stats page it appears in the tooltip when hovering over the total sessions count field.	2018-12-14 11:35:36 +01:00
Olivier Houchard	9a86fcbd47	MEDIUM: mux: Add an optional "reset" method. Add a new method to mux, "reset", that is used to let the mux know the connection attempt failed, and we're about to retry, so it just have to reinit itself. Currently only the H1 mux needs it.	2018-12-13 17:32:15 +01:00
William Lallemand	b7ea141cbb	MEDIUM: cli: handle CLI level from the master CLI Handle the CLI level in the master CLI. In order to do this, the master CLI stores the level in the stream. Each command are prefixed by a "user" or "operator" command before they are forwarded to the target CLI. The level can be configured in the haproxy program arguments with the level keyword: -S /tmp/sock,level,admin -S /tmp/sock2,level,user.	2018-12-13 09:45:16 +01:00
William Lallemand	dc12c2e56c	CLEANUP: cli: use dedicated define instead of appctx ones Replace APPCTX_CLI_ST1_PAYLOAD and APPCTX_CLI_ST1_PROMPT by PCLI_F_PAYLOAD and PCLI_F_PROMPT in the master CLI code.	2018-12-13 09:45:16 +01:00
William Lallemand	f630d01c9f	MEDIUM: cli: store CLI level in the appctx Store and check the level in the appctx in order to allow dynamic permission changes over the CLI.	2018-12-13 09:45:16 +01:00
Remi Gacogne	00488ddef5	BUG: dns: Fix off-by-one write in dns_validate_dns_response() The maximum number of bytes in a DNS name is indeed 255, but we need to allocate one more byte for the NULL-terminating byte. Otherwise dns_read_name() might return 255 for a very long name, causing dns_validate_dns_response() to write a NULL value one byte after the end of the buffer: dns_answer_record->name[len] = 0; The next fields in the struct being filled from the content of the query, it might have been possible to fill them with non-0 values, causing for example a strlen() of the name to read past the end of the struct and access unintended parts of the memory, possibly leading to a crash. To be backported to 1.8, probably also 1.7.	2018-12-12 14:44:52 +01:00
Remi Gacogne	bc552102ad	BUG: dns: Fix out-of-bounds read via signedness error in dns_validate_dns_response() Since the data_len field of the dns_answer_item struct was an int16_t, record length values larger than 2^15-1 were causing an integer overflow and thus may have been interpreted as negative, making us read well before the beginning of the buffer. This might have led to information disclosure or a crash. To be backported to 1.8, probably also 1.7.	2018-12-12 14:44:38 +01:00
Willy Tarreau	0007d0afbc	CLEANUP: stream: remove SF_TUNNEL, SF_INITIALIZED, SF_CONN_TAR These flags haven't been used for a while. SF_TUNNEL was reintroduced by commit `d62b98c6e` ("MINOR: stream: don't set backend's nor response analysers on SF_TUNNEL") to handle the two-level streams needed to deal with the first model for H2, and was not removed after this model was abandonned. SF_INITIALIZED was only set. SF_CONN_TAR was never referenced at all.	2018-12-11 18:01:38 +01:00
Willy Tarreau	afba57ae80	REORG: h1: merge types+proto into common/h1.h These two files are self-contained and do not depend on other layers, so let's remerge them together for easier manipulation.	2018-12-11 17:15:13 +01:00
Willy Tarreau	30925659ef	CLEANUP: h1: remove some occurrences of unneeded h1.h inclusions Several places where h1.h was included didn't need it at all since they in fact relied on the legacy HTTP definitions.	2018-12-11 17:15:13 +01:00
Willy Tarreau	326e27ed08	REORG: h1: move the h1_state definition to proto_http This is the legacy HTTP/1 state, it's never used from within h1 users, let's move it to proto_http with the rest of the legacy code.	2018-12-11 17:15:13 +01:00
Willy Tarreau	538746ad38	REORG: h1: move legacy http functions to http_msg.c Now that h1 and legacy HTTP are two distinct things, there's no need to keep the legacy HTTP parsers in h1.c since they're only used by the legacy code in proto_http.c, and h1.h doesn't need to include hdr_idx anymore. This concerns the following functions : - http_parse_reqline(); - http_parse_stsline(); - http_msg_analyzer(); - http_forward_trailers(); All of these were moved to http_msg.c.	2018-12-11 17:15:13 +01:00
Willy Tarreau	c5a4fd5c30	REORG: http: create http_msg.c to place there some legacy HTTP parts Lots of HTTP code still uses struct http_msg. Not only this code is still huge, but it's part of the legacy interface. Let's move most of these functions to a separate file http_msg.c to make it more visible which file relies on what. It's mostly symmetrical with what is present in http_htx.c. The function http_transform_header_str() which used to rely on two function pointers to look up a header was simplified to rely on two variants http_legacy_replace_{,full_}header(), making both sides of the function much simpler. No code was changed beyond these moves.	2018-12-11 17:15:13 +01:00
Willy Tarreau	b96b77ed6e	REORG: htx: merge types+proto into common/htx.h All the HTX definition is self-contained and doesn't really depend on anything external since it's a mostly protocol. In addition, some external similar files (like h2) also placed in common used to rely on it, making it a bit awkward. This patch moves the two htx.h files into a single self-contained one. The historical dependency on sample.h could be also removed since it used to be there only for http_meth_t which is now in http.h.	2018-12-11 17:15:04 +01:00
Christopher Faulet	f4a4ef7d7c	MINOR: filters: Export the name of known filters It could be useful to know if some filter is declared on a proxy or if it is enabled on a stream.	2018-12-11 17:09:31 +01:00
Christopher Faulet	54a8d5a4a0	MEDIUM: cache/htx: Add the HTX support into the cache The cache is now able to store and resend HTX messages. When an HTX message is stored in the cache, the headers are prefixed with their block's info (an uint32_t), containing its type and its length. Data, on their side, are stored without any prefix. Only the value is copied in the cache. 2 fields have been added in the structure cache_entry, hdrs_len and data_len, to known the size, in the cache, of the headers part and the data part. If the message is chunked, the trailers are also copied, the same way as data. When the HTX message is recreated in the cache applet, the trailers size is known removing the headers length and the data lenght from the total object length.	2018-12-11 17:09:31 +01:00
Christopher Faulet	c9df7f728f	MINOR: compression: Rename the function check_legacy_http_comp_flt() To not mix it up with the legacy HTTP representation, this function has been rename check_implicit_http_comp_flt().	2018-12-11 17:09:31 +01:00
William Lallemand	459e18e9e7	MINOR: cli: use pcli_flags for prompt activation Instead of using a variable to activate the prompt, we just use a flag.	2018-12-11 17:05:40 +01:00
William Lallemand	ebf61804ef	MEDIUM: cli: handle payload in CLI proxy The CLI proxy was not handling payload. To do that, we needed to keep a connection active on a server and to transfer each new line over that connection until we receive a empty line. The CLI proxy handles the payload in the same way that the CLI do it. Examples: $ echo -e "@1;add map #-1 <<\n$(cat data)\n" \| socat /tmp/master-socket - $ socat /tmp/master-socket readline prompt master> @1 25130> add map #-1 << + test test + test2 test2 + test3 test3 + 25130>	2018-12-11 17:05:36 +01:00
William Lallemand	5b80fa2864	MINOR: cli: parse prompt command in the CLI proxy Handle the prompt command. Works the same way as the CLI.	2018-12-11 16:54:18 +01:00
Willy Tarreau	1a18b54142	REORG: connection: centralize the conn_set_{tos,mark,quickack} functions There were a number of ugly setsockopt() calls spread all over proto_http.c, proto_htx.c and hlua.c just to manipulate the front connection's TOS, mark or TCP quick-ack. These ones entirely relied on the connection, its existence, its control layer's presence, and its addresses. Worse, inet_set_tos() was placed in proto_http.c, exported and used from the two other ones, surrounded in #ifdefs. This patch moves this code to connection.h and makes the other ones rely on it without ifdefs.	2018-12-11 16:41:51 +01:00
Willy Tarreau	eaeeb68f23	MINOR: hpack: provide a function to encode an HTTP path The new function hpack_encode_path() supports encoding a path into the ":path" header. It knows about "/" and "/index.html" which use a single byte, and falls back to literal encoding for other ones, with a fast path for short paths < 127 bytes.	2018-12-11 09:07:02 +01:00
Willy Tarreau	820b391260	MINOR: hpack: provide a function to encode an HTTP scheme The new function hpack_encode_scheme() supports encoding a scheme into the ":scheme" header. It knows about "https" and "http" which use a single byte, and falls back to literal encoding for other ones.	2018-12-11 09:07:02 +01:00
Willy Tarreau	39c80ebff0	MINOR: hpack: provide a function to encode an HTTP method The new function hpack_encode_method() supports encoding a method. It knows about GET and POST which use a single byte, and falls back to literal encoding for other ones.	2018-12-11 09:07:02 +01:00
Willy Tarreau	8895367fb1	MINOR: hpack: provide new functions to encode the ":status" header This header exists with 7 different values, it's worth taking them into account for the encoding, hence these functions. One of them makes use of an integer only and computes the 3 output bytes in case of literal. The other one benefits from the knowledge of an existing string, which for example exists in the case of H1 to H2 encoding.	2018-12-11 09:07:02 +01:00
Willy Tarreau	bd5659bbe1	MINOR: hpack: provide a function to encode a long indexed header For long header values whose index is known, hpack_encodde_long_idx() may now be used. This function emits the short index and follows with the header's value.	2018-12-11 09:07:01 +01:00
Willy Tarreau	30eb809fdb	MINOR: hpack: provide a function to encode a short indexed header Most direct calls to HPACK functions are made to encode short header fields like methods, schemes or statuses, whose lengths and indexes are known. Let's have a small function to do this.	2018-12-11 09:06:46 +01:00
Willy Tarreau	bad0a381d3	MINOR: hpack: move the length computation and encoding functions to .h We'll need these functions from other inline functions, let's make them accessible. len_to_bytes() was renamed to hpack_len_to_bytes() since it's now exposed.	2018-12-11 09:06:46 +01:00
Willy Tarreau	2df026fbce	CLEANUP: hpack: no need to include chunk.h, only include buf.h Chunk.h used to be needed to declare the struct chunk which we don't use anymore, let's fall back to the lighter buf.h	2018-12-11 09:06:06 +01:00
Willy Tarreau	071d4b31ff	MINOR: compiler: add a new macro ALREADY_CHECKED() This macro may be used to block constant propagation that lets the compiler detect a possible NULL dereference on a variable resulting from an explicit assignment in an impossible check. Sometimes a function is called which does safety checks and returns NULL if safe conditions are not met. The place where it's called cannot hit this condition and dereferencing the pointer without first checking it will make the compiler emit a warning about a "potential null pointer dereference" which is hard to work around. This macro "washes" the pointer and prevents the compiler from emitting tests branching to undefined instructions. It may only be used when the developer is absolutely certain that the conditions are guaranteed and that the pointer passed in argument cannot be NULL by design. A typical use case is a top-level function doing this : if (frame->type == HEADERS) parse_frame(frame); Then parse_frame() does this : void parse_frame(struct frame frame) { const char frame_hdr; frame_hdr = frame_hdr_start(frame); if (frame_hdr == FRAME_HDR_BEGIN) process_frame(frame); } and : const char frame_hdr_start(const struct frame frame) { if (frame->type == HEADERS) return frame->data; else return NULL; } Above parse_frame() is only called for frame->type == HEADERS so it will never get a NULL in return from frame_hdr_start(). Thus it's always safe to dereference frame_hdr since the check was already performed above. It's then safe to address it this way instead of inventing dummy error code paths that may create real bugs : void parse_frame(struct frame frame) { const char frame_hdr; frame_hdr = frame_hdr_start(frame); ALREADY_CHECKED(frame_hdr); if (*frame_hdr == FRAME_HDR_BEGIN) process_frame(frame); }	2018-12-08 15:27:03 +01:00
Willy Tarreau	d6735d611e	MEDIUM: ist: use local conversion arrays to case conversion Calling tolower/toupper for each character is slow, a lookup into a 256-byte table is cheaper, especially for common characters used in header field names which all fit into a cache line. Let's create these two variables marked weak so that they're included only once.	2018-12-07 13:25:59 +01:00
Willy Tarreau	3f2d696d72	MINOR: ist: add functions to copy/uppercase/lowercase into a buffer or string The ist functions were missing functions to copy an IST into a target buffer, making some code have to resort to memcpy(), which tends to be overkill for small strings, that the compiler cannot guess. In addition sometimes there is a need to turn a string to lower or upper case so it had to be overwritten after the operation. This patch adds 6 functions to copy an ist to a buffer, as binary or as a string (i.e. a zero is or is not appended), and optionally to apply a lower case or upper case transformation on the fly. A number of tests were performed to optimize the processing for small strings. The loops are marked unlikely to dissuade the compilers from over-optimizing them and switching to SIMD instructions. The lower case or upper case transformations used to rely on external functions for each character and to crappify the code due to clobbered registers, which is not acceptable when we know that only a certain class of chars has to be transformed, so the test was open-coded.	2018-12-07 13:25:59 +01:00
Olivier Houchard	d247be0620	BUG/MEDIUM: connections: Split CS_FL_RCV_MORE into 2 flags. CS_FL_RCV_MORE is used in two cases, to let the conn_stream know there may be more data available, and to let it know that it needs more room. We can't easily differentiate between the two, and that may leads to hangs, so split it into two flags, CS_FL_RCV_MORE, that means there may be more data, and CS_FL_WANT_ROOM, that means we need more room. This should not be backported.	2018-12-06 16:36:05 +01:00
Willy Tarreau	adc7f3edd2	BUG/MEDIUM: stream-int: don't attempt to receive if the connection is not established If we try to receive before the connection is established, we lose the send event and are not woken up anymore once the connection is established. This was diagnosed by Olivier. No backport is needed.	2018-12-06 15:25:58 +01:00
Willy Tarreau	a3b62d374a	MINOR: stream-int: add a new blocking condition on the remote connection There are some situations where we need to wait for the other side to be connected. None of the current blocking flags support this. It used to work more or less by accident using the old flags. Let's add a new flag to mention we're blocking on this, it's removed by si_chk_rcv() when a connection is established. It should be enough for now.	2018-12-06 15:24:01 +01:00
William Lallemand	27f3fa56f5	BUG/MEDIUM: mworker: stop every tasks in the master The master is not supposed to run (at the moment) any task before the polling loop, the created tasks should be run only in the workers but in the master they should be disabled or removed. No backport needed.	2018-12-06 14:12:58 +01:00
Christopher Faulet	aa75b3d2d5	CLEANUP: htx: Fix indentation here and there in HTX files	2018-12-05 17:33:14 +01:00
Christopher Faulet	b2aedea142	MEDIUM: channel/htx: Add functions for forward HTX data To ease the fast forwarding and the infinte forwarding on HTX proxies, 2 functions have been added to let the channel be almost aware of the way data are stored in its buffer. By calling these functions instead of legacy ones, we are sure to forward the right amount of data.	2018-12-05 17:29:30 +01:00
Christopher Faulet	27ba2dc6d6	MEDIUM: htx: Rework conversion from a buffer to an htx structure Now, the function htx_from_buf() will set the buffer's length to its size automatically. In return, the caller should call htx_to_buf() at the end to be sure to leave the buffer hosting the HTX message in the right state. When the caller can use the function htxbuf() to get the HTX message without any update on the underlying buffer.	2018-12-05 17:10:16 +01:00
Willy Tarreau	3906e22f6f	MINOR: htx: add buf_room_for_htx_data() to help optimize buffer transfers The small HTX overhead is enough to make the system perform multiple reads and unaligned memory copies. Here we provide a function whose purpose is to reduce the apparent room in a buffer by the size of the overhead for DATA blocks, which is the struct htx plus 2 blocks (one for DATA, one for the end of message so that small blocks can fit at once). The muxes using HTX will be encouraged to use this one instead of b_room() to compute the available buffer room and avoid filling their demux buf with more data than can fit at once into the HTX buffer.	2018-12-05 10:57:42 +01:00
Willy Tarreau	8ae4235f94	MINOR: htx: make htx_from_buf() adjust the size only on new buffers This one is used a lot during transfers, let's avoid resetting its size when there are already data in the buffer since it implies the size is correct.	2018-12-05 10:57:42 +01:00
Christopher Faulet	c59ff23804	MINOR: htx: Rename functions htx_*_to_str() to be H1 specific "_to_h1" suffix is now used because these function produce H1 strings. It avoids any ambiguity on the output format.	2018-12-04 05:51:37 +01:00
Joseph Herlant	75a323f04e	CLEANUP: Fix a typo in the listener subsystem Fixes a typo in the code comment of the listener subsystem.	2018-12-02 18:43:28 +01:00
Joseph Herlant	f69b807fa4	CLEANUP: Fix typos in the file descriptor subsystem Fixes 2 typos in the code comment of the file descriptor subsystem.	2018-12-02 18:43:25 +01:00
Joseph Herlant	0b75e63dc5	CLEANUP: Fix a typo in the checks header file Fixes a typo in the code comments of the checks header file.	2018-12-02 18:43:21 +01:00
Joseph Herlant	eeac3c722f	CLEANUP: Fix a typo in the protocol header file Fixes a typo in the code comments of the header file holding the general protocol primitives.	2018-12-02 18:42:49 +01:00
Joseph Herlant	8a95a6e5ed	CLEANUP: Fix a typo in the connection subsystem Fixes a typo in the code comments of the connection subsystem.	2018-12-02 18:42:12 +01:00
Joseph Herlant	41abef77cb	CLEANUP: Fix a typo in the mini-clist header Fixes a typo in the code comments of the mini-clist header.	2018-12-02 18:38:15 +01:00
Joseph Herlant	30bc509c40	CLEANUP: Fix typos in the h1 subsystem Fixes typos in the code comments of the h1 subsystem.	2018-12-02 18:38:02 +01:00
Joseph Herlant	be7619aaca	CLEANUP: Fix typo in the chunk headers file Fix a typo detected in the chunk.h header file's code comments.	2018-12-02 18:37:56 +01:00
Joseph Herlant	c42c0e9969	CLEANUP: fix typos in the htx subsystem Fix typos detected in the code comments of the htx subsystem.	2018-12-02 18:37:50 +01:00
Olivier Houchard	0c18a6fe34	MEDIUM: servers: Add a way to keep idle connections alive. Add a new keyword for servers, "idle-timeout". If set, unused connections are kept alive until the timeout happens, and will be picked for reuse if no other connection is available.	2018-12-02 18:16:53 +01:00
Olivier Houchard	8defe4b51a	MINOR: mux: add a "max_streams" method. Add a new method to muxes, "max_streams", that returns the max number of streams the mux can handle. This will be used to know if a mux is in use or not.	2018-12-02 17:48:32 +01:00
Olivier Houchard	f3e65b086d	MINOR: connection: Fix a comment. Connections can now have an owner for outgoing connections, so update the comment tu reflect that.	2018-12-02 17:48:28 +01:00
Willy Tarreau	1329b5be71	MINOR: h2: add new functions to produce an HTX message from an H2 response The new function h2_prepare_htx_stsline() produces an HTX response message from an H2 response presented as a list of header fields.	2018-12-02 13:30:17 +01:00
Willy Tarreau	3fbea1d8d0	MINOR: server: the mux_proto entry in the server is const Same as previous commit. We'll have to update this one soon, let's avoid any cast and mark it const as it really is.	2018-12-02 13:12:16 +01:00
Willy Tarreau	5fc311c001	MINOR: connection: create conn_get_best_mux_entry() We currently have conn_get_best_mux() to return the best mux for a given protocol name, side and proxy mode. But we need the mux entry as well in order to fix the bind_conf and servers at the end of the config parsing. Let's split the function in two parts. It's worth noting that the <conn> argument is never used anymore so this part is eligible to some cleanup.	2018-12-02 13:12:16 +01:00
Willy Tarreau	a004ae3e66	MINOR: listener: the mux_proto entry in the bind_conf is const We'll have to update this one soon, let's avoid any cast and mark it const as it really is.	2018-12-02 13:12:15 +01:00
Willy Tarreau	6deb4129de	MINOR: h2: implement H2->HTX request header frame transcoding Till now we could only produce an HTTP/1 request from a list of H2 request headers. Now the new function h2_make_htx_request() does the same but using the HTX encoding instead, while respecting the H2 semantics. The code is not much different from the first version, only the encoding differs. For now it's not used.	2018-12-01 17:38:32 +01:00
Christopher Faulet	75bc913d23	MAJOR: filters: Adapt filters API to be compatible with the HTX represenation First, to be called on HTX streams, a filter must explicitly be declared as compatible by setting the flag STRM_FLT_FL_HAS_FILTERS on the filter's config at HAProxy startup. This flag is checked when a filter implementation is attached to a stream. Then, some changes have been made on HTTP callbacks. The callback http_payload has been added to filter HTX data. It will be called on HTX streams only. It replaces the callbacks http_data, http_chunk_trailers and http_forward_data, called on legacy HTTP streams only and marked as deprecated. The documention (once updated)) will give all information to implement this new callback. Other HTTP callbacks will be called for HTX and HTTP legacy streams. So it is the filter's responsibility to known which kind of data it handles. The macro IS_HTX_STRM should be used in such cases. There is at least a noticeable changes in the way data are forwarded. In HTX, after the call to the callback http_headers, all the headers are considered as forwarded. So, in http_payload, only the body and eventually the trailers will be filtered.	2018-12-01 17:37:27 +01:00
Christopher Faulet	e44769b4fa	MINOR: mux-h1: Capture bad H1 messages First of all, an dedicated error snapshot, h1_snapshot, has been added. It contains more or less the some info than http_snapshot but adapted for H1 messages. Then, the function h1_capture_bad_message() has been added to capture bad H1 messages. And finally, the function h1_show_error_snapshot() is used to dump these errors. Only Headers or data parsing are captured.	2018-12-01 17:37:27 +01:00
Christopher Faulet	a7b677cd0d	MEDIUM: proto_htx: Convert all HTTP error messages into HTX During startup, after the configuration parsing, all HTTP error messages (errorloc, errorfile or default messages) are converted into HTX messages and stored in dedicated buffers. We use it to return errors in the HTX analyzers instead of using ugly OOB blocks.	2018-12-01 17:37:27 +01:00
Christopher Faulet	b2db4fa016	MINOR: htx: Add BODYLESS flags on the HTX start-line and the HTTP message the flags HTX_SL_F_BODYLESS and HTTP_MSGF_BODYLESS have been added. These flags are set when the corresponding HTTP message has no body at all.	2018-12-01 17:37:27 +01:00
Christopher Faulet	f1ba18d7b3	MEDIUM: htx: Don't rely on h1_sl anymore except during H1 header parsing Instead, we now use the htx_sl coming from the HTX message. It avoids to have too H1 specific code in version-agnostic parts. Of course, the concept of the start-line is higly influenced by the H1, but the structure htx_sl can be adapted, if necessary. And many things depend on a start-line during HTTP analyzis. Using the structure htx_sl also avoid boring conversions between HTX version and H1 version.	2018-12-01 17:37:27 +01:00
Christopher Faulet	54483df5ba	MINOR: htx: Add the start-line offset for the HTX message in the HTX structure If there is no start-line, this offset is set to -1. Otherwise, it is the relative address where the start-line is stored in the data block. When the start-line is added, replaced or removed, this offset is updated accordingly. On remove, if the start-line is no set and if the next block is a start-line, the offset is updated. Finally, when an HTX structure is defragmented, the offset is also updated accordingly.	2018-12-01 17:37:27 +01:00
Christopher Faulet	570d1614fa	MEDIUM: htx: Change htx_sl to be a struct instead of an union The HTX start-line is now a struct. It will be easier to extend, if needed. Same info can be found, of course. In addition it is now possible to set flags on it. It will be used to set some infos about the message. Some macros and functions have been added in proto/htx.h to help accessing different parts of the start-line.	2018-12-01 17:37:27 +01:00
Christopher Faulet	14e88252f2	MINOR: htx: Add a function to find the HTX block corresponding to a data offset The function htx_find_blk() returns the HTX block containing data with a given offset, relatively to the beginning of the HTX message. It is a good way to skip outgoing data and find the first HTX block not already processed.	2018-12-01 17:37:27 +01:00
Christopher Faulet	d16b0a7b2d	MINOR: htx: Add function to iterate on an HTX message using HTX blocks the functions htx_get_next() and htx_get_prev() are used to iterate on an HTX message using blocks position. With htx_get_next_blk() and htx_get_prev_blk(), it is possible to do the same, but with HTX blocks. Of course, internally, we rely on position's versions to do so. But it is handy for callers to not take care of the blocks position.	2018-12-01 17:37:27 +01:00
Christopher Faulet	24ed835129	MINOR: htx: Add function to add an HTX block just before another one The function htx_add_data_before() can be used to add an HTX block before another one. For instance, it could be used to add some data before the end-of-message marker.	2018-12-01 17:37:27 +01:00
Christopher Faulet	3bc1b11dae	MEDIUM: conn_stream: Add a way to get mux's info on a CS from the upper layer Time to time, the need arises to get some info owned by the multiplexer about a connection stream from the upper layer. Today we really need to get some dates and durations specific to the conn_stream. It is only true for the mux H1 and H2. Otherwise it will be impossible to have correct times reported in the logs. To do so, the structure cs_info has been defined to provide all info we ever need on a conn_stream from the upper layer. Of course, it is the first step. So this structure will certainly envloved. But for now, only the bare minimum is referenced. On the mux side, the callback get_cs_info() has been added in the structure mux_ops. Multiplexers can now implement it, if necessary, to return a pointer on a structure cs_info. And finally, the function si_get_cs_info() should be used from the upper layer. If the stream interface is not attached to a connection stream, this function returns NULL, likewise if the callback get_cs_info() is not defined for the corresponding mux.	2018-12-01 17:37:27 +01:00
Willy Tarreau	c01ed9ff20	MINOR: htx: add a function to cut the beginning of a DATA block htx_cut_data_blk() is used to cut the beginning of a DATA block after a part of it was tranferred. It simply advances the address, reduces the advertised length and updates the htx's total data count.	2018-12-01 17:36:59 +01:00
Willy Tarreau	d3c49d17dc	BUG/MINOR: connection: report mux modes when HTX is supported It looks like we forgot to report HTX when listing the muxes and their respective protocols, leading to "NONE" being displayed. Let's report "HTX" and "HTTP\|HTX" since both will exist. Also fix a minor typo in the output message.	2018-12-01 17:33:35 +01:00
Olivier Houchard	00cf70f28b	MAJOR: sessions: Store multiple outgoing connections in the session. Instead of just storing the last connection in the session, store all of the connections, for at most MAX_SRV_LIST (currently 5) targets. That way we can do keepalive on more than 1 outgoing connection when the client uses HTTP/2.	2018-12-01 10:47:18 +01:00
William Lallemand	4b58c80ee2	REORG: mworker: declare master variable in global.h This variable is used at several places, better declare it in global.h.	2018-11-27 19:34:00 +01:00
Willy Tarreau	7f0165e399	MEDIUM: memory: make the pool cache an array and not a thread_local Having a thread_local for the pool cache is messy as we need to initialize all elements upon startup, but we can't until the threads are created, and once created it's too late. For this reason, the allocation code used to check for the pool's initialization, and it was the release code which used to detect the first call and to initialize the cache on the fly, which is not exactly optimal. Now that we have initcalls, let's turn this into a per-thread array. This array is initialized very early in the boot process (STG_PREPARE) so that pools are always safe to use. This allows to remove the tests from the alloc/free calls. Doing just this has removed 2.5 kB of code on all cumulated pool_alloc() and pool_free() paths.	2018-11-26 19:50:32 +01:00
Willy Tarreau	b6b3df3ed3	MEDIUM: initcall: use initcalls for a few initialization functions signal_init(), init_log(), init_stream(), and init_task() all used to only preset some values and lists. This needs to be done very early to provide a reliable interface to all other users. The calls used to be explicit in haproxy.c:init(). Now they're placed in initcalls at the STG_PREPARE stage. The functions are not exported anymore.	2018-11-26 19:50:32 +01:00
Willy Tarreau	2455cebe00	MEDIUM: memory: use pool_destroy_all() to destroy all pools on deinit() Instead of exporting a number of pools and having to manually delete them in deinit() or to have dedicated destructors to remove them, let's simply kill all pools on deinit(). For this a new function pool_destroy_all() was introduced. As its name implies, it destroys and frees all pools (provided they don't have any user anymore of course). This allowed to remove 4 implicit destructors, 2 explicit ones, and 11 individual calls to pool_destroy(). In addition it properly removes the mux_pt_ctx pool which was not cleared on exit (no backport needed here since it's 1.9 only). The sig_handler pool doesn't need to be exported anymore and became static now.	2018-11-26 19:50:32 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	7107c8b494	MINOR: memory: add a callback function to create a pool The new function create_pool_callback() takes 3 args including the return pointer, and creates a pool with the specified name and size. In case of allocation error, it emits an error message and returns. The new macro REGISTER_POOL() registers a callback using this function and will be usable to request some pools creation and guarantee that the allocation will be checked. An even simpler approach is to use DECLARE_POOL() and DECLARE_STATIC_POOL() which declare and register the pool.	2018-11-26 19:50:32 +01:00
Willy Tarreau	e655251e80	MINOR: initcall: use initcalls for section parsers The two calls to cfg_register_section() and cfg_register_postparser() are now supported by initcalls. This allowed to remove two other constructors.	2018-11-26 19:50:32 +01:00
Willy Tarreau	172f5ce948	MINOR: initcall: use initcalls for most post_{check,deinit} and per_thread* Most calls to hap_register_post_check(), hap_register_post_deinit(), hap_register_per_thread_init(), hap_register_per_thread_deinit() can be done using initcalls and will not require a constructor anymore. Let's create a set of simplified macros for this, called respectively REGISTER_POST_CHECK, REGISTER_POST_DEINIT, REGISTER_PER_THREAD_INIT, and REGISTER_PER_THREAD_DEINIT. Some files were not modified because they wouldn't benefit from this or because they conditionally register (e.g. the pollers).	2018-11-26 19:50:32 +01:00
Willy Tarreau	8071338c78	MINOR: initcall: apply initcall to all register_build_opts() calls Most register_build_opts() calls use static strings. These ones were replaced with a trivial REGISTER_BUILD_OPTS() statement adding the string and its call to the STG_REGISTER section. A dedicated section could be made for this if needed, but there are very few such calls for this to be worth it. The calls made with computed strings however, like those which retrieve OpenSSL's version or zlib's version, were moved to a dedicated function to guarantee they are called late in the process. For example, the SSL call probably requires that SSL_library_init() has been called first.	2018-11-26 19:50:32 +01:00
Willy Tarreau	90fa97b65e	MINOR: threads: add new macros to declare self-initializing locks Using __decl_spinlock(), __decl_rwlock(), __decl_aligned_spinlock() and __decl_aligned_rwlock(), one can now simply declare a spinlock or an rwlock which will automatically be initialized at boot time by calling the ha_spin_init() or ha_rwlock_init() callback. The "aligned" variants enforce a 64-byte alignment on the lock.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a8ae77da61	MINOR: thread: provide a set of lock initialisers This patch adds ha_spin_init() and ha_rwlock_init() which are used as a callback to initialise locks at boot time. They perform exactly the same as HA_SPIN_INIT() or HA_RWLOCK_INIT() but from within a real function.	2018-11-26 19:50:32 +01:00
Willy Tarreau	d13a9281bd	MINOR: initcall: introduce a way to register init functions to call at boot We currently have to deal with multiple initialization stages in a way that can be confusing, because certain parts rely on others having been properly initialized. Most calls consist in adding lists to existing lists, whose heads are initialized in the declaration so this is easy. But some calls create new pools and require pools to be properly initialized. Pools currently are thread-local and as such cannot be pre-initialized, requiring run-time checks. All this could be simplified by using multiple boot stages and allowing functions to be registered at various stages. One approach might be to use gcc's constructor priorities, but this requires gcc >= 4.3 which eliminates a wide spectrum of working compilers, and some versions of certain compilers (like clang 3.0) are known for silently ignore these priorities. Instead we can use our own init function registration mechanism. A first attempt was made using register_function() calls in all constructors but this made the code more painful. This patch's approach is different. It creates sections containing arrays of pointers to "initcall" descriptors. An initcall contains a pointer to a function and an argument. Each section corresponds to a specific initialization stage. Each module creates such descriptors for various calls it requires. The main() function starts by scanning each of these sections in turn to process these initcalls. This will make it possible to remove many constructors from various modules, by simply placing initcalls for the requested functions next to the keyword lists that need to be called. A first attempt was made by placing the initcalls directly into the sections instead of creating an array of pointers, but it becomes sensitive to the array's alignment which depends on the compiler and the linker, so it seems too fragile. For now we support 6 init stages : - STG_PREPARE : preset variables, tables and list heads - STG_LOCK : initialize spinlocks and rwlocks - STG_ALLOC : allocate the required structures - STG_POOL : create pools - STG_REGISTER : register static lists (keywords etc) - STG_INIT : subsystems normal initialization These ones are declared directly in the files where they are needed using one of the INITCALL* macros, passing 0 to 3 pointers as arguments. The API should possibly be extended to support a return value to give a status to the caller, and to support a unified API, possibly a bit more flexibility in the arguments. In this case it might make sense to support a set of macros to register functions having a different API and to pass the function type in the initcall itself. Special thanks to Olivier for showing how to scan sections as this is not something particularly well documented and exactly what I've been missing to achieve this.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a7280a1ec2	BUILD: buffers: buf.h requires unistd to get ssize_t on libmusl Building with musl and gcc-5.3 for MIPS returns this : include/common/buf.h: In function 'b_dist': include/common/buf.h:252:2: error: unknown type name 'ssize_t' ssize_t dist = to - from; ^ Including stdint or stddef is not sufficient there to get ssize_t, unistd is needed as well. It's likely that other platforms will have the same issue. This patch also addresses it in ist.h and memory.h.	2018-11-26 19:49:21 +01:00
Willy Tarreau	6689609090	BUILD: htx: fix fprintf format inconsistency on 32-bit platforms Building on 32 bits gives this : include/proto/htx.h: In function 'htx_dump': include/proto/htx.h:443:25: warning: format '%lu' expects argument of type 'long unsigned int', but argument 8 has type 'uint64_t {aka long long unsigned int}' [-Wformat=] fprintf(stderr, "htx:%p [ size=%u - data=%u - used=%u - wrap=%s - extra=%lu]\n", ^ In htx_dump(), fprintf() uses %lu but the value is an uint64_t so it doesn't match on 32-bit. Let's cast this to unsigned long long and use %llu instead.	2018-11-26 19:37:32 +01:00
Olivier Houchard	ee23b2a1e3	MEDIUM: servers: Store the connection in the SI until we have a mux. When we create a connection, if we have to defer the conn_stream and the mux creation until we can decide it (ie until the SSL handshake is done, and the ALPN is decided), store the connection in the stream_interface, so that we're sure we can destroy it if needed.	2018-11-23 19:11:14 +01:00
Olivier Houchard	201b9f4eb5	MAJOR: connections: Defer mux creation for outgoing connection if alpn is set. If an ALPN (or a NPN) was chosen for a server, defer choosing the mux until after the SSL handshake is done, and the ALPN/NPN has been negociated, so that we know which mux to pick.	2018-11-22 19:52:23 +01:00
Olivier Houchard	c756600103	MINOR: server: Add "alpn" and "npn" keywords. Add new keywords to "server" lines, alpn and npn. If set, when connecting through SSL, those alpn/npn will be negociated during the SSL handshake.	2018-11-22 19:50:08 +01:00
Willy Tarreau	beb859abce	MINOR: polling: add an option to support busy polling In some situations, especially when dealing with low latency on processors supporting a variable frequency or when running inside virtual machines, each time the process waits for an I/O using the poller, the processor goes back to sleep or is offered to another VM for a long time, and it causes excessively high latencies. A solution to this provided by this patch is to enable busy polling using a global option. When busy polling is enabled, the pollers never sleep and loop over themselves waiting for an I/O event to happen or for a timeout to occur. On multi-processor machines it can significantly overheat the processor but it usually results in much lower latencies. A typical test consisting in injecting traffic over a single connection at a time over the loopback shows a bump from 4640 to 8540 connections per second on forwarded connections, indicating a latency reduction of 98 microseconds for each connection, and a bump from 12500 to 21250 for locally terminated connections (redirects), indicating a reduction of 33 microseconds. It is only usable with epoll and kqueue because select() and poll()'s API is not convenient for such usages, and the level of performance they are used in doesn't benefit from this anyway. The option, which obviously remains disabled by default, can be turned on using "busy-polling" in the global section, and turned off later using "no busy-polling". Its status is reported in "show info" to help troubleshooting suspicious CPU spikes.	2018-11-22 19:47:30 +01:00
Willy Tarreau	48f8bc1368	MINOR: poller: move the call of tv_update_date() back to the pollers The reason behind this will be to be able to compute a timeout when busy polling.	2018-11-22 18:57:37 +01:00

... 2 3 4 5 6 ...

3452 Commits