pendconn_get_next_strm() is called from process_srv_queue() under the
server lock, and calls stream_add_srv_conn() with this lock held, while
the latter tries to take it again. This results in a deadlock when
a server's maxconn is reached and haproxy is built with thread support.
There are just a few pools, and they're stressed a lot, so it makes
sense to dedicate them a cache line to avoid contention and to place
the lock at the beginning.
The struct is not cache-line aligned, but at least every time the lock
appears in the same cache line as the fd, it benefits from being
accessed first. This improves performance by about 2% on fd-intensive
workloads with 4 threads.
Commit 9dcf9b6 ("MINOR: threads: Use __decl_hathreads to declare locks")
accidentally lost a few "extern" in certain lock declarations, possibly
causing certain entries to be declared at multiple places. Apparently
it hasn't caused any harm though.
The offending ones were :
- fdtab_lock
- fdcache_lock
- poll_lock
- buffer_wq_lock
This patch changes the behavior of the master during the exit of a
worker.
When a worker exits with an error code, for example in the case of a
segfault, all workers are now killed and the master leaves.
If you don't want this behavior you can use the option
"master-worker no-exit-on-failure".
During the migration to the second version of the pools, the new
functions and pool pointers were all called "pool_something2()" and
"pool2_something". Now there's no more pool v1 code and it's a real
pain to still have to deal with this. Let's clean this up now by
removing the "2" everywhere, and by renaming the pool heads
"pool_head_something".
Rename the global variable "proxy" to "proxies_list".
There have been multiple proxies in haproxy for quite some time, and "proxy"
is a potential source of bugs: a number of functions have a "proxy" argument,
and some code used "proxy" when it really meant "px" or "curproxy". It worked
by pure luck, because it usually happened while parsing the config, and thus
"proxy" pointed to the currently parsed proxy, but we should probably not
rely on this.
[wt: some of these are definitely fixes that are worth backporting]
It is now possible on a "bind" line (or a "stats socket" line) to specify the
thread set allowed to process listener's connections. For instance:
# HTTPS connections will be processed by all threads but the first, and HTTP
# connections will be processed on the first thread only.
bind *:80 process 1/1
bind *:443 ssl crt mycert.pem process 1/2-
Now, it is possible to bind CPU at the thread level instead of the process level
by defining a thread set in "cpu-map" directives. Thus, its format is now:
cpu-map [auto:]<process-set>[/<thread-set>] <cpu-set>...
where <process-set> and <thread-set> must follow the format:
all | odd | even | number[-[number]]
Having both a process range and a thread range with the "auto:" prefix is not
supported: only one of them may be a range, the other one must be a fixed
number. Both may be ranges when there is no "auto:" prefix.
Because it is possible to define a mapping for a process and another for a
thread on this process, threads will be bound on the intersection of their
mapping and the one of the process on which they are attached. If the
intersection is null, no specific binding will be set for the threads.
The prefix "auto:" can be added before the process set to let HAProxy
automatically bind a process to a CPU by incrementing process and CPU sets. To
be valid, both sets must have the same size. No matter the declaration order of
the CPU sets, they will be bound from the lowest to the highest bound.
Examples:
# all these lines bind the process 1 to the cpu 0, the process 2 to cpu 1
# and so on.
cpu-map auto:1-4 0-3
cpu-map auto:1-4 0-1 2-3
cpu-map auto:1-4 3 2 1 0
# bind each process to exactly one CPU using all/odd/even keyword
cpu-map auto:all 0-63
cpu-map auto:even 0-31
cpu-map auto:odd 32-63
# invalid cpu-map because process and CPU sets have different sizes.
cpu-map auto:1-4 0 # invalid
cpu-map auto:1 0-3 # invalid
The cache was relying on the txn->uri for creating its key, which was a
big problem when no log was activated.
This patch does a sha1 of the host + uri, and stores it in the txn.
When an object is stored, the eb32node uses the first 32 bits of the hash
as a key, and the whole hash is stored in the cache entry.
During a lookup, the truncated hash is used, and when it matches an
entry we check the real sha1.
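For illustration, the key computation looks roughly like this (a sketch using
the imported git SHA1 API; "entry" and "node" are hypothetical names):

    blk_SHA_CTX ctx;
    unsigned char hash[20];

    blk_SHA1_Init(&ctx);
    blk_SHA1_Update(&ctx, host, host_len);
    blk_SHA1_Update(&ctx, uri, uri_len);
    blk_SHA1_Final(hash, &ctx);

    memcpy(entry->hash, hash, sizeof(hash)); /* full hash, for exact matches */
    node->key = *(unsigned int *)hash;       /* truncated key for the eb32 tree */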
It can happen that we want to read early data, write some, and then continue
reading them.
To do so, we can't reuse tmp_early_data to store the amount of data sent,
so introduce a new member.
If we read early data, ssl_sock_to_buf() is now solely responsible for
getting back to the handshake, to make sure we don't miss any early data.
This code has been used successfully a few times in the past to detect
that a pool was used after being freed. Its main goal is to allocate a
full page for each object so that they are always released individually
and unmapped from memory. This way if any part of the code references the
object after it was freed and before it is reallocated, a segv occurs at
the exact offending location. It does a few extra things such as writing
to the memory area before freeing to detect double-frees and free of
read-only areas, and placing the data at the end of the page instead of
the beginning so that out of bounds accesses are easier to spot. The
amount of memory used with this is huge (about 10 times the regular
usage) but it can be useful sometimes.
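A minimal sketch of the technique (assuming 4kB pages and objects no larger
than one page; not the actual code):

    #include <stdint.h>
    #include <string.h>
    #include <sys/mman.h>

    #define PAGE_SIZE 4096UL

    /* one page per object, data placed at the end of the page so that
     * out-of-bounds accesses are easier to spot */
    static void *dbg_alloc(size_t size)
    {
        char *page = mmap(NULL, PAGE_SIZE, PROT_READ | PROT_WRITE,
                          MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
        return page + PAGE_SIZE - size;
    }

    /* writing before unmapping faults on double-frees and read-only areas;
     * once unmapped, any later access to the object causes a segv */
    static void dbg_free(void *ptr, size_t size)
    {
        memset(ptr, 0x5e, size);
        munmap((void *)((uintptr_t)ptr & ~(PAGE_SIZE - 1)), PAGE_SIZE);
    }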
Allow bigger objects to be cached in the shctx: the first implementation
only stored small SSL sessions, but we want to store bigger HTTP
responses.
The current H2 to H1 protocol conversion presents some issues which will
require to perform some processing on certain headers before writing them
so it's not possible to convert HPACK to H1 on the fly.
This commit modifies the headers decoding so that it now works in two
phases : hpack_decode_headers() only decodes the HPACK stream in the
HEADERS frame and puts the result into a list. Headers which require
storage (huffman-compressed or from the dynamic table) are stored in
a chunk allocated by the H2 demuxer. Then once the headers are properly
decoded into this list, h2_make_h1_request() is called with this list
to produce the HTTP/1.1 request into the destination buffer. The list
necessarily enforces a limit. Here we use 2*MAX_HTTP_HDR, which means
that we can have as many individual cookies as we have regular headers
if a client decides to break their cookies into multiple values. This
seems reasonable and will allow the H1 parser to decide whether it's
too much or not.
Thus the output stream is not produced on the fly anymore and this will
permit dealing with certain corner cases like repairing the Cookie header
(which for now is not done).
In order to limit header duplication and parsing, the known pseudo headers
continue to be passed by their index : the name element in the list then
has a NULL pointer and the value is the pseudo header's index. Given that
these ones represent about half of the incoming requests and need to be
found quickly, it maintains an acceptable level of performance.
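Schematically, a list entry could look like this (a sketch based on haproxy's
ist string type; not the exact layout):

    /* one decoded header field; a NULL name pointer marks one of the
     * known pseudo-headers, identified by its index rather than a name */
    struct http_hdr {
        struct ist n;    /* name; n.ptr == NULL => pseudo-header */
        struct ist v;    /* value (or the pseudo-header's index) */
    };

    struct http_hdr list[2 * MAX_HTTP_HDR];  /* filled by hpack_decode_headers() */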
The code was significantly reduced by doing this because the original code
had to deal with HPACK and H1 combinations (eg: index vs not indexed, etc)
and now the HPACK decoding is totally focused on the decompression, and
the H1 encoding doesn't have to deal with the issue of wrapping input for
example.
One bug was addressed here (though it couldn't happen at the moment). The
H2 demuxer used to detect a failure to write the request into the H1 buffer
and would then detect if the output buffer wraps, realign it and try again.
The problem by doing so was that the HPACK context was already modified and
not rewindable. Thus the size check is now performed first and a failure is
reported if it doesn't fit.
The current H2 to H1 protocol conversion presents some issues which will
require to perform some processing on certain headers before writing them
so it's not possible to convert HPACK to H1 on the fly.
Here we introduce a function which performs half of what hpack_decode_header()
used to do, which is to take a list of headers on input and emit the
corresponding request in HTTP/1.1 format. The code is the same and functions
were renamed to be prefixed with "h2" instead of "hpack", though it ends
up being simpler as the various HPACK-specific cases could be fused into
a single one (ie: add header).
Moving this part here makes a lot of sense as now this code is specific to
what is documented in HTTP/2 RFC 7540 and will be able to deal with special
cases related to H2 to H1 conversion enumerated in section 8.1.
Various error codes which were previously assigned to HPACK were never
used (aside being negative) and were all replaced by -1 with a comment
indicating what error was detected. The code could be further factored
thanks to this but this commit focuses on compatibility first.
This code is not yet used but builds fine.
While gcc only emits warnings about unused static functions, Clang also
emits such a warning when the functions are inlined. This is a bit
annoying at certain places where functions are provided to manipulate
multiple data types and are not yet used. Let's have a type modifier
"__maybe_unused" which sets the "unused" attribute like the Linux kernel
does. It's elegant as it allows the code author to indicate that it knows
that this element might be unused. It works on variables as well, which
is convenient to remove ifdefs around local variables in certain functions,
but doesn't work on labels.
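A sketch of the definition, following the kernel's convention:

    #define __maybe_unused __attribute__((unused))

    /* clang no longer warns even if this inline is never referenced */
    static inline __maybe_unused int twice(int x)
    {
        return x * 2;
    }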
[ plock commit 4c53fd3a0b2b1892817cebd0db012a52f4087850 ]
Pieter Baauw reported a build issue affecting haproxy after plock was
included. It happens that expressions of the form :
if ((const) ? (expr1) : (expr2))
do_something()
always produce code for both expr1 and expr2 on Clang when building
without optimization. The resulting asm code is even funny, basically
doing :
mov reg, 1
cmp reg, 1
...
This causes our sizeof() tests to fail to build because we purposely
dereference a fake function that reports the location and nature of the
inconsistency, but this fake function appears in the object code despite
all conditions being there to avoid it.
However the compiler is still smart enough to optimize away code doing
if (const)
do_something()
So we simply repeat the condition before do_something(), and the dummy
function is not referenced anymore unless really required.
[ plock commit 61e255286ae32e83e1a3174dd7c49eda99880a8b]
There are a few inlines such as pl_barrier() and pl_cpu_relax() which
are used a lot. Unfortunately, while building test code at -O0, inlining
is disabled and these ones are called a lot and show up a lot in any
profile, are traced into when single-stepping with a debugger, etc, thus
they are polluting the landscape. Since they're single-asm statements,
there is no reason for not turning them into macros.
The result becomes fairly visible here at -O0 :
$ size treelock.inline treelock.macro
text data bss dec hex filename
11431 692 656 12779 31eb treelock.inline
10967 692 656 12315 301b treelock.macro
And it was verified that regularly optimized code remains strictly identical.
[ plock commit 44081ea493dd78dab48076980e881748e9b33db5 ]
Older compilers (eg: gcc 3.4) don't provide __sync_synchronize() so let's
do it by hand on this platform.
[ plock commit b155d5c762fb9a9793911881f80e61faa6b0e889 ]
Local variables "l", "i" and "ret" were renamed "__pl_l", "__pl_i" and
"__pl_r" respectively, to limit the risk of conflicts with existing
variables in application code.
[ plock commit bfac5887ebabb8ef753b0351f162265767eb219b ]
Local variable "t" was renamed "__pl_t" to limit the risk of conflicts
with existing variables in application code.
This patch adds support for `Type=notify` to the systemd unit.
Supporting `Type=notify` improves both starting as well as reloading
of the unit, because systemd is informed when the action has completed.
See this quote from `systemd.service(5)`:
> Note however that reloading a daemon by sending a signal (as with the
> example line above) is usually not a good choice, because this is an
> asynchronous operation and hence not suitable to order reloads of
> multiple services against each other. It is strongly recommended to
> set ExecReload= to a command that not only triggers a configuration
> reload of the daemon, but also synchronously waits for it to complete.
By making systemd aware of a reload in progress it is able to wait until
the reload actually succeeded.
This patch introduces both a new `USE_SYSTEMD` build option which controls
including the sd-daemon library as well as a `-Ws` runtime option which
runs haproxy in master-worker mode with systemd support.
When haproxy is running in master-worker mode with systemd support it will
send status messages to systemd using `sd_notify(3)` in the following cases:
- The master process forked off the worker processes (READY=1)
- The master process entered the `mworker_reload()` function (RELOADING=1)
- The master process received the SIGUSR1 or SIGTERM signal (STOPPING=1)
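For illustration, the notifications boil down to calls like these (a sketch,
not the exact haproxy code):

    #include <systemd/sd-daemon.h>

    sd_notify(0, "READY=1");      /* master: all workers were forked */
    sd_notify(0, "RELOADING=1");  /* master: entering mworker_reload() */
    sd_notify(0, "STOPPING=1");   /* master: SIGUSR1 or SIGTERM received */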
Change the unit file to specify `Type=notify` and replace master-worker
mode (`-W`) with master-worker mode with systemd support (`-Ws`).
Future evolutions of this feature could include making use of the `STATUS`
feature of `sd_notify()` to send information about the number of active
connections to systemd. This would require bidirectional communication
between the master and the workers and thus is left for future work.
Instead of storing the SSL_SESSION pointer directly in the struct server,
store the ASN1 representation; otherwise session resumption is broken with
TLS 1.3, when multiple outgoing connections want to use the same session.
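A sketch of the idea using OpenSSL's standard i2d/d2i API (error handling
omitted):

    #include <stdlib.h>
    #include <openssl/ssl.h>

    /* serialize the session to its ASN.1/DER form for storage */
    int len = i2d_SSL_SESSION(sess, NULL);       /* compute the encoded size */
    unsigned char *der = malloc(len), *p = der;
    i2d_SSL_SESSION(sess, &p);

    /* each outgoing connection later rebuilds its own SSL_SESSION copy */
    const unsigned char *q = der;
    SSL_SESSION *copy = d2i_SSL_SESSION(NULL, &q, len);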
A bitfield has been added to know if there are runnable applets for a
thread. When an applet is woken up, the bits corresponding to its thread_mask
are set. When all active applets for a thread have been processed, the thread
is removed from the active ones by unsetting its tid_bit from the bitfield.
A bitfield has been added to know if there are runnable tasks for a thread. When
a task is woken up, the bits corresponding to its thread_mask are set. When all
tasks for a thread have been evaluated without any wakeup, the thread is removed
from active ones by unsetting its tid_bit from the bitfield.
At the end of the master initialisation, a call to protocol_unbind_all()
was made, in order to close all the FDs.
Unfortunately, this function closes the inherited FDs (fd@), so upon reload
the master wasn't able to reload a configuration with those FDs.
The create_listeners() function now stores a flag specifying whether the fd
was inherited or not.
Replace protocol_unbind_all() with mworker_cleanlisteners() +
deinit_pollers().
Now we can show in dotted red the node being removed or surrounded in red
a node having been inserted, and add a description on the graph related to
the operation in progress for example.
b_alloc_margin is, strictly speaking, thread-safe. It will not crash
HAProxy. But its contract is no longer respected in a multithreaded
environment. In this function, we need to be sure to have <margin> buffers
available in the pool after the allocation. So to have this guarantee, we must
lock the memory pool during the whole operation. This also means we must call
internal and lockless memory functions (prefixed with '__').
For the record, this patch fixes a pernicious bug that happens after a soft
reload, where some streams can be blocked indefinitely, waiting for a buffer in the
buffer_wq list. This happens because, during a soft reload, pool_gc2 is called,
making some calls to b_alloc_fast fail.
This is specific to threads, no backport is needed.
This macro should be used to declare variables or struct members depending on
the USE_THREAD compile option. It avoids the encapsulation of such declarations
between #ifdef/#endif. It is used to declare all lock variables.
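A sketch of how such a macro can be defined:

    #ifdef USE_THREAD
    #define __decl_hathreads(decl) decl
    #else
    #define __decl_hathreads(decl)
    #endif

    /* no #ifdef needed at the declaration site */
    __decl_hathreads(HA_SPINLOCK_T lock);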
At a number of places, bitmasks are used for process affinity and to map
listeners to processes. Every time 1UL<<(relative_pid-1) is used. Let's
create a "pid_bit" variable corresponding to this value to clean this up.
In commit 53a4766 ("MEDIUM: connection: start to introduce a mux layer
between xprt and data") we introduced a release() function which ends
up never being used. Let's get rid of it now.
This small inline function causes some pain to the compiler when used
inside other functions due to its use of the unlikely() hint for non-digits.
It causes the letters to be processed far away in the calling function and
makes the code less efficient. Removing these unlikely() hints has sped up
chunk size parsing by around 5%.
The HTTP/1 code always has the reserve left available so the buffer is
never full there. But with HTTP/2 we have to deal with full buffers,
and it happens that the chunk size parser cannot tell the difference
between a full buffer and an empty one since it compares the start and
the stop pointer.
Let's change this to instead deal with the number of bytes left to process.
As a side effect, this code ends up being about 10% faster than the previous
one, even on HTTP/1.
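A minimal sketch of the bytes-left approach (hypothetical helper, not the
actual parser):

    /* parse a hex chunk size from <ptr> with <bytes> bytes left to process.
     * Returns the number of bytes consumed, 0 if the chunk size is
     * incomplete, or -1 on parse error. */
    static int parse_chunk_size(const char *ptr, int bytes, unsigned int *res)
    {
        unsigned int chunk = 0;
        int i = 0;

        while (i < bytes) {
            char c = ptr[i];
            int v;

            if (c >= '0' && c <= '9')
                v = c - '0';
            else if ((c | 0x20) >= 'a' && (c | 0x20) <= 'f')
                v = (c | 0x20) - 'a' + 10;
            else
                break;                /* CR or ';' ends the hex number */
            chunk = (chunk << 4) + v;
            i++;
        }
        if (i == bytes)
            return 0;                 /* unambiguous for empty AND full buffers */
        if (!i)
            return -1;                /* first character is not a hex digit */
        *res = chunk;
        return i;
    }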
When a write activity is reported on a channel, it is important to keep this
information for the stream because it take part on the analyzers' triggering.
When some data are written, the flag CF_WRITE_PARTIAL is set. It participates to
the task's timeout updates and to the stream's waking. It is also used in
CF_MASK_ANALYSER mask to trigger channels anaylzers. In the past, it was cleared
by process_stream. Because of a bug (fixed in commit 95fad5ba4 ["BUG/MAJOR:
stream-int: don't re-arm recv if send fails"]), It is now cleared before each
send and in stream_int_notify. So it is possible to loss this information when
process_stream is called, preventing analyzers to be called, and possibly
leading to a stalled stream.
Today, this happens in HTTP2 when you call the stat page or when you use the
cache filter. In fact, this happens when the response is sent by an applet. In
HTTP1, everything seems to work as expected.
To fix the problem, we need to make the difference between the write activity
reported to lower layers and the one reported to the stream. So the flag
CF_WRITE_EVENT has been added to notify the stream of the write activity on a
channel. It is set when a send succedded and reset by process_stream. It is also
used in CF_MASK_ANALYSER. finally, it is checked in stream_int_notify to wake up
a stream and in channel_check_timeouts.
This bug is probably present in 1.7 but it seems to have no effect. So for now,
no needs to backport it.
The H1 parser used by the H2 gateway was a bit lax and could validate
non-numbers in the status code. Since it computes the code on the fly
it's problematic, as "30:" is read as status code 310. Let's properly
check that it's a number now. No backport needed.
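Schematically, the check amounts to this (a sketch):

    /* parse exactly three status digits, rejecting anything else */
    static int parse_status(const char *p)
    {
        int status = 0, i;

        for (i = 0; i < 3; i++) {
            if (p[i] < '0' || p[i] > '9')
                return -1;        /* "30:" is now rejected, not read as 310 */
            status = status * 10 + p[i] - '0';
        }
        return status;
    }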
This adds a new keyword on the "server" line, "allow-0rtt". If set, we'll try
to send early data to the server, as long as the client sent early data. In
case the server rejects the early data, we no longer have them and can't
resend them, so the only option we have is to send back a 425, and we need
to be sure the client knows how to interpret it correctly.
The spin locks used to rely on W locks, which involve a loop waiting
for readers to leave, and this doesn't happen here. It's more efficient
to use S locks instead, which are also mutually exclusive and do not
have this loop. This saves one test per spinlock and a few tens of
bytes allowing certain functions to be inlined.
Currently the task scheduler suffers from an O(n) lookup when
skipping tasks that are not for the current thread. The reason
is that eb32_lookup_ge() has no information about the current
thread so it always revisits many tasks for other threads before
finding its own tasks.
This is particularly visible with HTTP/2 since the number of
concurrent streams created at once causes long series of tasks
for the same stream in the scheduler. With only 10 connections
and 100 streams each, by running on two threads, the performance
drops from 640kreq/s to 11.2kreq/s! Lookup metrics show that for
only 200000 task lookups, 430 million skips had to be performed,
which means that on average, each lookup leads to 2150 nodes to
be visited.
This commit backports the principle of scope lookups for ebtrees
from the ebtree_v7 development tree. The idea is that each node
contains a mask indicating the union of the scopes for the nodes
below it, which is fed during insertion, and used during lookups.
Then during lookups, branches that do not contain any leaf matching
the requested scope are simply ignored. This perfectly matches a
thread mask, allowing a thread to only extract the tasks it cares
about from the run queue, and to always find them in O(log(n))
instead of O(n). Thus the scheduler uses tid_bit and
task->thread_mask as the ebtree scope here.
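Schematically, the pruning works like this (a sketch with hypothetical types,
not the actual ebtree code):

    struct scnode {
        struct scnode *left, *right;
        unsigned long scope;    /* union of the scopes of all leaves below,
                                 * fed during insertion */
        int is_leaf;
    };

    /* return the first leaf whose scope intersects <scope>, skipping any
     * branch that cannot contain a matching leaf */
    struct scnode *first_in_scope(struct scnode *n, unsigned long scope)
    {
        struct scnode *r;

        if (!n || !(n->scope & scope))
            return NULL;                    /* prune the whole subtree */
        if (n->is_leaf)
            return n;
        r = first_in_scope(n->left, scope);
        return r ? r : first_in_scope(n->right, scope);
    }

Called with scope == tid_bit, only the branches containing tasks for the
current thread are visited.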
Doing this has recovered most of the performance, as can be seen on
the test below with two threads, 10 connections, 100 streams each,
and 1 million requests total :
                             Before     After     Gain
  test duration            :  89.6s     4.73s      x19
  HTTP requests/s (DEBUG)  :  11200    211300      x19
  HTTP requests/s (PROD)   :  15900    447000      x28
  spin_lock time           :  85.2s     0.46s     /185
  time per lookup          :   13us      40ns     /325
Even when going to 6 threads (on 3 hyperthreaded CPU cores), the
performance stays around 284000 req/s, showing that the contention
is much lower.
A test showed that there's no benefit in using this for the wait queue
though.
The __appctx_wakeup() function already does it. It matters with threads
enabled because it simplifies the code in appctx_res_wakeup() to get rid
of this test.
unbind_listener() takes the listener lock, which is already held by
enable_listener(). This situation happens when starting with nbproc > 1
with some bind lines limited to a certain process, because in this case
enable_listener() tries to stop unneeded listeners.
This commit introduces __do_unbind_listeners() which must be called with
the lock held, and makes enable_listener() use this one. Given that the
only return code has never been used and that it starts to make the code
more complicated to propagate it before throwing it to the trash, the
function's return type was changed to void.
This function incorrectly dealt with the case where data doesn't
wrap but lies at the end of the buffer, resulting in Lukas' reported
data corruption with HTTP/2. No backport is needed, it was introduced
for HTTP/2 in 1.8-dev.
For now it only supports literals and a bit of static header table
references for the 9 most common header field names (date, server,
content-type, content-length, last-modified, accept-ranges, etag,
cache-control, location).
A previous incarnation of this commit used to strip the forbidden H2
header names (connection, proxy-connection, upgrade, transfer-encoding,
keep-alive) but this is no longer the case as this filtering is irrelevant
to HPACK encoding and is specific to H2, so this will have to be done by
the caller.
It's not quite optimal but works well enough to prepare some valid and
partially compressed responses during development.
The decoder is now fully functional. It makes use of the dynamic header
table. Dynamic header table size updates are currently ignored, as our
initially advertised value is the highest we support. Strictly speaking,
the impact is that a client referencing a header field after such an
update would not observe an error, whereas the connection would be dropped
if it was implemented.
Decoded header fields are copied into a target buffer in HTTP/1 format
using HTTP/1.1 as the version. The Host header field is automatically
appended if a ":authority" header field is present.
All decoded header fields can be displayed if the file is compiled with
DEBUG_HPACK.
This code deals with header insertion, retrieval and eviction, as well
as with dynamic header table defragmentation. It is functional for use
as a decoder and was heavily tested in this context. There's still some
room for optimization (eg: the defragmentation code currently does it
in place using a memcpy).
Also for now the dynamic header table is allocated using malloc() while
a pool needs to be created instead.
This code was mostly imported from https://github.com/wtarreau/http2-exp
with "hpack_" prepended in front of most names to avoid risks of conflicts.
Some small cleanups and renamings were applied during the import. This
version must be considered more recent.
Some HPACK error codes were placed here (HPACK_ERR_*), not exactly because
they're needed by the decoder but they'll be needed by all callers. Maybe
a different location should be found.
The code was borrowed from the HPACK experimental implementations
available here :
https://github.com/wtarreau/http2-exp
It contains the Huffman table as specified in RFC7541 Appendix B, and a
set of reverse tables used to decode a Huffman byte stream, and produced
by contrib/h2/gen-rht. The encoder is not finalized, it doesn't emit the
byte stream but this is not needed for now.
This callback will be used to release upper layers when a mux is in
use. Given that the mux can be asynchronously deleted, we need a way
to release the extra information such as the session.
This callback will be called directly by the mux upon releasing
everything and before the connection itself is released, so that
the callee can find its information inside the connection if needed.
The way it currently works is not perfect, and most likely this should
instead become a mux release callback, but for now we have no easy way
to add mux-specific stuff, and since there's one mux per connection,
it works fine this way.
For H2, only the mux's timeout or other conditions might cause a
release of the mux and the connection, no stream should be allowed
to kill such a shared connection. So a stream will only detach using
cs_destroy() which will call mux->detach() then free the cs.
For now it's only handled by mux_pt. The goal is that the data layer
never has to care about the connection, which will have to be released
depending on the mux's mood.
This basically calls cs_shutw() followed by cs_shutr(). Both of them
are called in the most conservative mode so that any previous call is
still respected. The CS flags are cleared so that it can be reused
(this is important for connection retries when conn and CS are reused
without being reallocated).
In order to support all shutdown modes on the CS, we introduce the
following flags :
CS_FL_SHRD : shut read, drain extra data
CS_FL_SHRR : shut read, reset extra data
CS_FL_SHWN : shut write, normal notification
CS_FL_SHWS : shut write, silent mode (no notification)
And the following modes for shutr/shutw :
CS_SHR_DRAIN, CS_SHR_RESET, CS_SHW_NORMAL, CS_SHW_SILENT.
Note: it's possible that we won't need to distinguish the two shutw
above as they're only an action.
For now they are not used.
All the references to connections in the data path from streams and
stream_interfaces were changed to use conn_streams. Most functions named
"something_conn" were renamed to "something_cs" for this. Sometimes the
connection still is what matters (eg during a connection establishment)
and were not always renamed. The change is significant and minimal at the
same time, and was quite thoroughly tested now. As of this patch, all
accesses to the connection from upper layers go through the pass-through
mux.
Most of the functions dealing with conn_streams are here. They act at
the data layer and interact with the mux. For now they are not used yet
but everything builds.
This patch introduces a new struct conn_stream. It's the stream-side of
a multiplexed connection. A pool is created and destroyed on exit. For
now the conn_streams are not used at all.
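Schematically (a sketch, not the exact definition):

    /* the stream side of a multiplexed connection */
    struct conn_stream {
        struct connection *conn;   /* the shared underlying connection */
        unsigned int flags;        /* CS_FL_* */
        void *data;                /* upper layer's context */
    };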
When an incoming connection is made on an HTTP mode frontend, the
session now looks up the mux to use based on the ALPN token and the
proxy mode. This will allow easier mux registration, and we don't
need to hard-code the mux_pt_ops anymore.
Selecting a mux based on ALPN and the proxy mode will quickly become a
pain. This commit provides new functions to register/lookup a mux based
on the ALPN string and the proxy mode to make this easier. Given that
we're not supposed to support a wide range of muxes, the lookup should
not have any measurable performance impact.
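A sketch of what such a registry can look like (hypothetical names, not the
actual code):

    #include <string.h>

    /* one registered mux, keyed by ALPN token and proxy mode */
    struct mux_entry {
        const char *token;          /* e.g. "h2"; "" catches everything */
        int mode;                   /* e.g. TCP vs HTTP proxy mode */
        const struct mux_ops *mux;
        struct mux_entry *next;
    };

    static struct mux_entry *mux_registry;

    const struct mux_ops *mux_lookup(const char *token, int mode)
    {
        const struct mux_entry *e;

        for (e = mux_registry; e; e = e->next)
            if (e->mode == mode && strcmp(e->token, token) == 0)
                return e->mux;
        return NULL;    /* caller falls back to the pass-through mux */
    }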
For HTTP/2 and QUIC, we'll need to deal with multiplexed streams inside
a connection. After quite a long brainstorming, it appears that the
connection interface to the existing streams is appropriate just like
the connection interface to the lower layers. In fact we need to have
the mux layer in the middle of the connection, between the transport
and the data layer.
A mux can exist on two directions/sides. On the inbound direction, it
instantiates new streams from incoming connections, while on the outbound
direction it muxes streams into outgoing connections. The difference is
visible on the mux->init() call : in one case, an upper context is already
known (outgoing connection), and in the other case, the upper context is
not yet known (incoming connection) and will have to be allocated by the
mux. The session doesn't have to create the new streams anymore, as this
is performed by the mux itself.
This patch introduces this and creates a pass-through mux called
"mux_pt" which is used for all new connections and which only
calls the data layer's recv,send,wake() calls. One incoming stream
is immediately created when init() is called on the inbound direction.
There should not be any visible impact.
Note that the connection's mux is purposely not set until the session
is completed so that we don't accidentally run with the wrong mux. This
must not cause any issue as the xprt_done_cb function is always called
prior to using mux's recv/send functions.
This is needed in the H2->H1 gateway so that we know how long the trailers
block is in chunked encoding. It returns the number of bytes, or 0 if some
are missing, or -1 in case of parse error.
It was a leftover from the last cleaning session; this mask applies
to threads and calling it process_mask is a bit confusing. It's the
same in fd, task and applets.
srv_set_fqdn() may be called with the DNS lock already held, but tries to
lock it anyway. So, add a new parameter to let it know if it was already
locked or not.
Commit 819fc6f ("MEDIUM: threads/stick-tables: handle multithreads on
stick tables") introduced a valid warning about an uninitialized return
value in stksess_kill_if_expired(). It just happens that this result is
never used, so let's turn the function back to void as previously.
The wrong bit was set to keep the lock on freq counter update. And the read
functions were reworked to use volatile.
Moreover, when a freq counter is updated, it is now rotated only if the current
counter is in the past (now.tv_sec > ctr->curr_sec). This is important with
threads because the current time (now) is thread-local. So, rounded to the
second, the time may vary by plus or minus 1 second. Thus a freq counter rotated
by one thread may be seen 1 second in the future by another thread. In this
case, it is updated but not rotated.
There was a flaw in the way the threads were created: the main one was only
used to create all the others and then just waited to exit. Now, it is used to
run a poll loop. So we only create nbthread-1 additional threads.
This also fixes a bug in the compression filter when there is only 1 thread
(nbthread == 1 or no thread support). The bug was in the way thread-local
resources were initialized: per-thread init/deinit callbacks were never called
for the main process. So, with nbthread set to 1, some buffers remained
uninitialized.
By default, no affinity is set for threads. To bind threads on CPU, you must
define a "thread-map" in the global section. The format is the same as for the
"cpu-map" parameter, with a small difference: the process number must be
defined, with the same format as cpu-map ("all", "even", "odd" or a number
between 1 and 31/63).
A thread will be bound on the intersection of its mapping and the one of the
process on which it is attached. If the intersection is null, no specific
binding will be set for the thread.
Because there is no migration mechanism yet, all runtime information about an
SPOE agent is thread-local and async exchanges with agents are disabled when we
have several threads. However, pipelining is still available. So for now, the
thread part of the SPOE is pretty simple.
We have two ways of ensuring that the data is not concurrently manipulated:
- locks
- running tasks on the same thread
Locks are expensive, so it is better to avoid them. This patch checks that
the Lua task runs on the same thread as the stream associated with the
coprocess.
TODO: in a next version, the error should be replaced by a yield
and thread migration request.
Note that the Lua processing is not really thread safe. Lua offers a heavy
mechanism which consists in adding our own lock functions in the Lua code
and recompiling the library. This system would probably not be accepted by
the maintainers of the various distros.
Our main execution point of the Lua is the function lua_resume(). A
quick look at the Lua sources shows a lua_lock() at the start of the
function and a lua_unlock() at its end. So I conclude that the Lua
thread-safe mode just wraps the whole execution in a mutex. So I prefer
to do this in the HAProxy code; it will be easier for distro maintainers.
Note that the HAProxy Lua functions surrounded by the macros SET_SAFE_LJMP
and RESET_SAFE_LJMP manipulate the Lua stack, so care must be taken to set
the mutex around these functions too.
Now, it is possible to define init_per_thread and deinit_per_thread callbacks
to deal with resource allocation for each thread.
It is the filter's responsibility to deal with concurrency. It is also the
filter's responsibility to know if HAProxy is started with some threads. A good
way to do so is to check the "global.nbthread" value. If it is greater than 1,
then _per_thread callbacks will be called.
A RW lock has been added to the vars structure to protect each list of
variables. And a global RW lock is used to protect registered names.
When a variable is fetched, we duplicate sample data because the variable could
be modified by another thread.
When a frequency counter must be updated, we use the curr_sec/curr_tick fields
as a lock, by setting the MSB to 1 in a compare-and-swap to lock and by
resetting it to unlock. And when we need to read it, we loop until the counter is
unlocked. This way, the frequency counters are thread-safe without any external
lock. It is important to avoid increasing the size of many structures (global,
proxy, server, stick_table).
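A simplified sketch of the scheme (not the actual code; field names follow the
freq counter struct, and "now" is the thread-local current time):

    static void freq_ctr_inc(struct freq_ctr *ctr)
    {
        unsigned int sec;

        /* lock: set the MSB of curr_sec with a compare-and-swap */
        do {
            sec = ctr->curr_sec & 0x7fffffff;
        } while (!__atomic_compare_exchange_n(&ctr->curr_sec, &sec,
                                              sec | 0x80000000, 0,
                                              __ATOMIC_SEQ_CST, __ATOMIC_SEQ_CST));

        if (now.tv_sec > sec) {         /* rotate only if really in the past */
            ctr->prev_ctr = ctr->curr_ctr;
            ctr->curr_ctr = 0;
            sec = now.tv_sec;
        }
        ctr->curr_ctr++;

        /* unlock: write the second back with the MSB reset */
        __atomic_store_n(&ctr->curr_sec, sec, __ATOMIC_SEQ_CST);
    }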
Locks have been added in the pat_ref and pattern_expr structures to protect all
accesses to an instance of one of them. Moreover, a global lock has been added
to protect the LRU cache used for pattern matching.
Patterns are now duplicated after a successful match, to avoid modification by
other threads while the result is used.
Finally, the function reloading a pattern list has been made thread-safe.
First, OpenSSL is now initialized to be thread-safe. This is done by setting 2
callbacks. The first one is ssl_locking_function; it handles the locks and
unlocks. The second one is ssl_id_function; it returns the current thread
id. During the init step, we create as many R/W locks as needed, i.e. the
number returned by the CRYPTO_num_locks function.
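A sketch of this setup with pthread rwlocks (haproxy uses its own lock
wrappers, but the shape is the same):

    #include <openssl/crypto.h>
    #include <pthread.h>

    static pthread_rwlock_t *ssl_locks;   /* CRYPTO_num_locks() entries */

    static void ssl_locking_function(int mode, int n, const char *file, int line)
    {
        if (mode & CRYPTO_LOCK) {
            if (mode & CRYPTO_READ)
                pthread_rwlock_rdlock(&ssl_locks[n]);
            else
                pthread_rwlock_wrlock(&ssl_locks[n]);
        }
        else
            pthread_rwlock_unlock(&ssl_locks[n]);
    }

    static unsigned long ssl_id_function(void)
    {
        return (unsigned long)pthread_self();
    }

    /* at init, after allocating CRYPTO_num_locks() rwlocks: */
    CRYPTO_set_locking_callback(ssl_locking_function);
    CRYPTO_set_id_callback(ssl_id_function);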
Next, the reusable SSL session in the server context is now thread-local.
Shctx is now also initialized if HAProxy is started with several threads.
And finally, a global lock has been added to protect the LRU cache used to store
generated certificates. The function ssl_sock_get_generated_cert is now
deprecated because the retrieved certificate can be removed by another threads
in same time. Instead, a new function has been added,
ssl_sock_assign_generated_cert. It must be used to search a certificate in the
cache and set it immediatly if found.
A lock is used to protect accesses to a peer structure.
The lock is taken in the applet handler when the peer is identified
and released when leaving the applet handler.
In the scheduling task for the peers section, the lock is taken for every
listed peer and released at the end of the process task function.
The peer 'force shutdown' function was also reworked.
A global lock has been added to protect accesses to the list of active
applets. A process mask has also been added on each applet. Like for FDs and
tasks, it is used to know which threads are allowed to process an
applet. Because applets are, most of the time, linked to a session, they should
be sticky on the same thread. But in all cases, it is the responsibility of the
applet handler to lock what has to be protected in the applet context.
This is done by passing the right stream's proxy (the frontend or the backend,
depending on the context) to lock the error snapshot used to store the error
info.
The stick table API was slightly reworked:
A global spin lock on stick table was added to perform lookup and
insert in a thread safe way. The handling of refcount on entries
is now handled directly by stick tables functions under protection
of this lock and was removed from the code of callers.
The "stktable_store" function is no longer exported and users should
now use "stktable_set_entry" for any insertion. This last one performs
a lookup followed by a store if not found. So the code using "stktable_store"
was reworked.
Lookup, and set_entry functions automatically increase the refcount
of the returned/stored entry.
The function "sticktable_touch" was renamed "sticktable_touch_local"
and is now able to decrease the refcount if the last arg is set to true,
allowing the entry to be released without taking the lock twice.
A new function "sticktable_touch_remote" is now used to insert
entries coming from remote peers at the right place in the update tree.
The code of peer update was re-worked to use this new function.
This function is also able to decrease the refcount if wanted.
The function "stksess_kill" also handles a parameter to decrease
the refcount on the entry.
A read/write lock is added on each entry to protect the data content
updates of the entry.
A lock for LB parameters has been added inside the proxy structure and atomic
operations have been used to update server variables related to LB.
The only significant change is about lb_map: because the servers' status is
updated in the sync-point, we can call the recalc_server_map function
synchronously in the map_set_server_status_up/down functions.
This list is used to save changes to the servers' state. So when several
threads are used, it must be locked. The changes are then applied in the
sync-point. To do so, servers_update_status has been moved to the sync-point.
So it is useless to lock it at this step, because the sync-point is a protected
area by itself.
For now, we have a list of each type per thread, so there is no need to lock
them. This is the easiest solution for now, but not the best one, because there
is no sharing between threads: an idle connection on one thread cannot be used
by a stream on another thread. So it could be a good idea to rework this
later.
Now, each proxy contains a lock that must be used when necessary to protect
it. Moreover, all proxy's counters are now updated using atomic operations.
First, we use atomic operations to update jobs/totalconn/actconn variables,
listener's nbconn variable and listener's counters. Then we add a lock on
listeners to protect access to their information. And finally, listener queues
(global and per proxy) are also protected by a lock. Here, because accesses to
these queues are unusual, we use the same lock for all queues instead of a
global one for the global queue and a lock per proxy for the others.
2 global locks have been added to protect, respectively, the run queue and the
wait queue. And a process mask has been added on each task. Like for FDs, this
mask is used to know which threads are allowed to process a task.
For many tasks, all threads are granted. And this must be your first intention
when you create a new task, unless you have a good reason to make a task sticky
on some threads. It is then the responsibility of the process callback to lock
what has to be locked in the task context.
Nevertheless, all tasks linked to a session must be sticky on the thread
creating the session. It is important that I/O handlers processing session FDs
and these tasks run on the same thread to avoid conflicts.
Many changes have been made to do so. First, the fd_updt array, where all
pending FDs for polling are stored, is now a thread-local array. Then 3 locks
have been added to protect, respectively, the fdtab array, the fd_cache array
and poll information. In addition, a lock for each entry in the fdtab array has
been added to protect all accesses to a specific FD or its information.
For pollers, the way to manage concurrency differs from one poller to
another. There is a poller loop on each thread, so the set of monitored FDs may
need to be protected. epoll and kqueue are thread-safe per se, so there are few
things to do to protect these pollers. This is not possible with select and
poll, so there is no sharing between the threads; the poller on each thread is
independent from the others.
Finally, per-thread init/deinit functions are used for each poller and for the
FD part to manage thread-local resources.
Now, you must be careful when an FD is created during the HAProxy startup. All
updates on the FD state must be made in the threads' context and never before
their creation. This is mandatory because the fd_updt array is thread-local and
initialized only for threads. Because there is no poller for the main thread,
this array remains uninitialized in this context. For this reason, listeners
are now enabled in the run_thread_poll_loop function, just like the worker
pipe.
Log buffers and static variables used in log functions are now thread-local. So
there is no need to lock anything to log messages. Moreover, per-thread
init/deinit functions are now used to initialize these buffers.
A sync-point is a protected area where you have the guarantee that no
concurrent access is possible. It is implemented as a thread barrier to enter
the sync-point and another one to exit from it. Inside the sync-point, all
threads that must do some synchronous processing will be called one after the
other while all other threads wait. All threads then exit from the sync-point
at the same time.
A sync-point is evaluated only when necessary because it is a costly
operation. To limit the waiting time of each thread, we must have a mechanism
to wake up all threads. This is done with a pipe shared by all threads. By
writing to this pipe, we interrupt all threads blocked on a poller. The pipe
is then flushed before exiting from the sync-point.
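In its simplest form, a sync-point boils down to this (a simplified sketch
with a hypothetical run_sync_tasks(); in reality each thread runs its own
synchronous work in turn):

    static pthread_barrier_t sync_enter, sync_exit;  /* sized for nbthread */

    static void enter_sync_point(int tid)
    {
        pthread_barrier_wait(&sync_enter);  /* all threads stop here */

        if (tid == 0)
            run_sync_tasks();               /* serialized processing while
                                             * the other threads wait */

        pthread_barrier_wait(&sync_exit);   /* all threads resume together */
    }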
hap_register_per_thread_init and hap_register_per_thread_deinit functions have
been added to register functions to perform, for each thread, respectively, some
initialization and deinitialization. These functions are added in the global
lists per_thread_init_list and per_thread_deinit_list.
These functions are called only when HAProxy is started with more than 1 thread
(global.nbthread > 1).
This file contains all functions and macros used to deal with concurrency in
HAProxy. It contains all the high-level functions to do atomic operations
(HA_ATOMIC_*). Note that for now we rely on the "__atomic" GCC builtins to do
atomic operations, so HAProxy can be compiled with thread support iff these
builtins are available.
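For illustration, these macros map almost directly to the builtins (a sketch):

    #define HA_ATOMIC_ADD(val, i)                                         \
        __atomic_add_fetch(val, i, __ATOMIC_SEQ_CST)
    #define HA_ATOMIC_SUB(val, i)                                         \
        __atomic_sub_fetch(val, i, __ATOMIC_SEQ_CST)
    #define HA_ATOMIC_CAS(val, old, new)                                  \
        __atomic_compare_exchange_n(val, old, new, 0,                     \
                                    __ATOMIC_SEQ_CST, __ATOMIC_SEQ_CST)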
It also contains wrappers around plocks to use spin or read/write locks. These
wrappers are used to abstract the internal representation of the locking system
and to add information to help debugging, when compiled with suitable
options.
To add extra info on locks, you need to add DEBUG=-DDEBUG_THREAD or
DEBUG=-DDEBUG_FULL compilation option. In addition to timing info on locks, we
keep info on where a lock was acquired the last time (function name, file and
line). There are also the thread id and a flag to know if it is still locked or
not. This will be useful to debug deadlocks.
Now memprintf relies on memvprintf. This new function does exactly what
memprintf did before, but it must be called with a va_list instead of a variable
number of arguments. So there is no change for the functions using
memprintf. But it is now also possible to get the same functionality from any
function with variadic arguments.
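The resulting pattern is the classic va_list wrapper (a sketch with a
simplified signature):

    #include <stdarg.h>

    char *memprintf(char **out, const char *format, ...)
    {
        va_list args;
        char *ret;

        va_start(args, format);
        ret = memvprintf(out, format, args);
        va_end(args);
        return ret;
    }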
Email alerts rely on checks to send emails. The link between a mailers section
and a proxy was resolved during configuration parsing, but initialization was
done when the first alert was triggered. This implied memory allocations and
task creations. With this patch, everything is now initialized during
configuration parsing. So when an alert is triggered, only the memory required
by this alert is dynamically allocated.
Moreover, alerts processing had a flaw: the task handler used to process alerts
to be sent to the same mailer, process_email_alert, was designed to give back
the control to the scheduler when an alert was sent. So there was a delay
between the sending of 2 consecutive alerts (the min of
"proxy->timeout.connect" and "mailer->timeout.mail"). To fix this problem, we
now try to process as many queued alerts as possible when the task is woken up.
This is a huge patch with many changes, all about the DNS. Initially, the idea
was to update the DNS part to ease the threads support integration. But quickly,
I started to refactor some parts. And after several iterations, it was
impossible for me to commit the different parts atomically. So, instead of
adding tens of patches, often reworking the same parts, it was easier to merge
all my changes in a single patch. Here are all the changes made on the DNS.
First, the DNS initialization has been refactored. The DNS configuration parsing
remains untouched, in cfgparse.c. But all checks have been moved to a
post-check callback. In the function dns_finalize_config, for each resolvers
section, the nameservers configuration is tested and the task used to manage
DNS resolutions is created. The links between the backend's servers and the
resolvers are also created at this step. Here no connections are kept alive, so
there is no need anymore to reopen them after the HAProxy fork. Connections
used to send DNS queries will be opened on demand.
Then, the way DNS requesters are linked to a DNS resolution has been
reworked. The resolution used by a requester is now referenced into the
dns_requester structure and the resolution pointers in server and dns_srvrq
structures have been removed. The wait and curr lists of requesters, for a DNS
resolution, have been replaced by a unique list. And finally, the way a
requester is removed from a DNS resolution has been simplified. Now everything
is done in dns_unlink_resolution.
The srv_set_fqdn function has been simplified. Now, there is only 1 way to set
the server's FQDN, whether it is done via the CLI or when a SRV record is
resolved.
The static DNS resolutions pool has been replaced by a dynamic pool. This part
has been modified by Baptiste Assmann.
The way the DNS resolutions are triggered by the task or by a health-check has
been totally refactored. Now, all timeouts are respected. Especially
hold.valid. The default frequency to wake up a resolvers is now configurable
using "timeout resolve" parameter.
Now, as documented, as long as invalid responses are received, we really wait
for all name servers' responses before retrying.
As far as possible, resources allocated during DNS configuration parsing are
released when HAProxy is shut down.
Beside all these changes, the code has been cleaned to ease code review and the
doc has been updated.
The message processing is done using existing functions. So here, the main task
is to find the SPOE engine to use. To do so, we loop on all filter instances
attached to the stream. For each one, we check if it is a SPOE filter and, if yes,
if its name is the one used to declare the "send-spoe-group" action.
We also take care to return an error if the action processing is interrupted by
HAProxy (because of a timeout or an error at the HAProxy level). This is done by
checking if the flag ACT_FLAG_FINAL is set.
The function spoe_send_group is the action_ptr callback of the
"send-spoe-group" action.
Because we can have messages chained by event or by group, we need a way to
know which kind of list we manipulate during the encoding. So 2 types of list
have been added, SPOE_MSGS_BY_EVENT and SPOE_MSGS_BY_GROUP. And the right type
is passed when spoe_encode_messages is called.
This action is used to trigger the sending of a group of SPOE messages. To do
so, the SPOE engine used to send messages must be defined, as well as the SPOE
group to send. Of course, the SPOE engine must refer to an existing SPOE
filter. If no engine name is provided on the SPOE filter line, the SPOE agent
name must be used. For example:
http-request send-spoe-group my-engine some-group
This action is available for "tcp-request content", "tcp-response content",
"http-request" and "http-response" rulesets. It cannot be used for tcp
connection/session rulesets because actions for these rulesets cannot yield.
For now, the action keyword is parsed and checked. But it does nothing. Its
processing will be added in another patch.
For now, this section is only parsed. It should have the following format:
spoe-group <grp-name>
messages <msg-name> ...
And then SPOE groups must be referenced in spoe-agent section:
spoe-agent <name>
...
groups <grp-name> ...
The purpose of these groups is to trigger message sending from TCP or HTTP
rules, directly from the HAProxy configuration, and not on a specific event.
This part will be added in another patch.
It is important to note that a message belongs to at most one group.
The engine name is now kept in the "spoe_config" structure. Because a SPOE
filter can be declared without an engine name, we use the SPOE agent name by
default. Then, its uniqueness is checked against all other SPOE engines
configured for the same proxy.
* TODO: Add documentation
Now, it is possible to conditionally send a SPOE message by adding an
ACL-based condition on the "event" line, in a "spoe-message" section. Here is
the example coming from the SPOE documentation:
spoe-message get-ip-reputation
args ip=src
event on-client-session if ! { src -f /etc/haproxy/whitelist.lst }
To avoid mixing with the proxy's ACLs, each SPOE message has its private ACL
list. It is possible to declare named ACLs in a "spoe-message" section, using
the same syntax as for proxies. So we can rewrite the previous example to use
a named ACL:
spoe-message get-ip-reputation
args ip=src
acl ip-whitelisted src -f /etc/haproxy/whitelist.lst
event on-client-session if ! ip-whitelisted
ACL-based conditions are executed in the context of the stream that handles
the client and the server connections.
It was painful not to have the status code available, especially when
it was computed. Let's store it and ensure we don't claim content-length
anymore on 1xx, only 0 body bytes.
This patch reorganize the shctx API in a generic storage API, separating
the shared SSL session handling from its core.
The shctx API only handles the generic data part, it does not know what
kind of data you use with it.
A shared_context is a storage structure allocated in a shared memory,
allowing its usage in a multithread or a multiprocess context.
The structure use 2 linked list, one containing the available blocks,
and another for the hot locked blocks. At initialization the available
list is filled with <maxblocks> blocks of size <blocksize>. An <extra>
space is initialized outside the list in case you need some specific
storage.
+-----------------------+--------+--------+--------+--------+----
| struct shared_context | extra  | block1 | block2 | block3 | ...
+-----------------------+--------+--------+--------+--------+----
                                 <-------- maxblocks --------->
                                          * blocksize
The API allows to store content on several linked blocks. For example,
if you allocated blocks of 16 bytes, and you want to store an object of
60 bytes, the object will be allocated in a row of 4 blocks.
The API was made for LRU usage: each time you get an object, it pushes the
object at the end of the list. When it needs more space, it discards the
least recently used objects at the head of the list.
The function names have been renamed in a more logical way: the parts
regarding shctx have been prefixed with shctx_ and the functions for the
shared SSL session cache have been prefixed with sh_ssl_sess_.
Move the ssl callback functions of the ssl shared session cache to
ssl_sock.c. The shctx functions still need to be separated from the SSL
tree and data.
A bind_conf does contain an ssl_bind_conf, which already has a flag to know
if early data are activated, so use that instead of adding a new flag in
the ssl_options field.
When compiled with OpenSSL >= 1.1.1, before attempting to do the handshake,
try to read any early data. If any early data is present, then we'll create
the session, read the data, and handle the request before doing the
handshake.
For this, we add a new connection flag, CO_FL_EARLY_SSL_HS, which is not
part of the CO_FL_HANDSHAKE set, allowing to proceed with a session even
before an SSL handshake is completed.
As early data do have security implications, we let the origin server know
the request comes from early data by adding the "Early-Data" header, as
specified in this draft from the HTTP working group :
https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-replay
This patch simply brings HAProxy internal regex system to the Lua API.
Lua doesn't embed regexes; it now inherits the regexes compiled with
haproxy.
Allow to register a function which will be called after the
configuration file parsing, at the end of the check_config_validity().
It's useful for checking dependencies between sections or for resolving
keywords, pointers or values.
This commit implements a post section callback. This callback will be
used at the end of a section parsing.
Every call to cfg_register_section must be modified to use the new
prototype:
int cfg_register_section(char *section_name,
int (*section_parser)(const char *, int, char **, int),
int (*post_section_parser)());
We used to have bo_{get,put}_{chr,blk,str} to retrieve/send data to
the output area of a buffer, but not the equivalent ones for the input
area. This will be needed to copy uploaded data frames in HTTP/2.
This one may be called by upper layers (eg: si_shutw()) or lower layers
(si_shutw() as well during stream_int_notify()) so we want it to take
care of updating the connection's flags if it's not going to be done
by the caller.
In transport-layer functions (snd_buf/rcv_buf), it's very problematic
never to know if polling changes made to the connection will be propagated
or not. This has led to some conn_cond_update_polling() calls being placed
at a few places to cover both the cases where the function is called from
the upper layer and when it's called from the lower layer. With the arrival
of the MUX, this becomes even more complicated, as the upper layer will not
have to manipulate anything from the connection layer directly and will not
have to push such updates directly either. But the snd_buf functions will
need to see their updates committed when called from upper layers.
The solution here is to introduce a connection flag set by the connection
handler (and possibly any other similar place) indicating that the caller
is committed to applying such changes on return. This way, the called
functions will be able to apply such changes by themselves before leaving
when the flag is not set, and the upper layer will not have to care about
that anymore.
This flag is only used when reading using splicing for now, and is only
set when a pipe full condition is met, so we can simplify its reset
condition in conn_refresh_polling_flags so that it's cleared at the
same time as the other ones, only when the control layer is ready.
This flag could be used more, to mark that a buffer full condition was
met with any receive method in order to simplify polling management.
This should probably be revisited after 1.8.
This is based on the git SHA1 implementation and optimized to do word
accesses rather than byte accesses, and to avoid unnecessary copies into
the context array.
BoringSSL switched OPENSSL_VERSION_NUMBER to 1.1.0 for compatibility.
Fix the BoringSSL calls and openssl-compat.h #defines accordingly.
This will not break openssl/libressl compat.
Now any call to trace() in the code will automatically appear interleaved
with the call sequence and timestamped in the trace file. They appear with
a '#' on the 3rd argument (caller's pointer) in order to make them easy to
spot. If the trace functionality is not used, a dummy weak function is used
instead so that it doesn't require recompiling every time traces are
enabled/disabled.
The trace decoder knows how to deal with these messages, detects them and
indents them similarly to the currently traced function. This can be used
to print function arguments for example.
Note that we systematically flush the log when calling trace() to ensure we
never miss important events, so this may impact performance.
The trace() function uses the same format as printf() so it should be easy
to setup during debugging sessions.
Now only conn_full_close() will be used. It will become more obvious
when the tracking is in place or not and will make it easier to
convert remaining call places to conn_streams.
Instead of having to manually handle lingering outside, let's make
conn_sock_shutw() check for it before calling shutdown(). We simply
don't want to emit the FIN if we're going to reset the connection
due to lingering. It's particularly important for silent-drop where
it's absolutely mandatory that no packet leaves the machine.
These flags are not exactly for the data layer, they instead indicate
what is expected from the transport layer. Since we're going to split
the connection between the transport and the data layers to insert a
mux layer, it's important to have a clear idea of what each layer does.
All function conn_data_* used to manipulate these flags were renamed to
conn_xprt_*.
The HTTP/2->HTTP/1 gateway will need to process HTTP/1 responses. We
cannot sanely rely on the HTTP/1 txn to parse a response because :
1) responses generated by haproxy such as error messages, redirects,
stats or Lua are neither parsed nor indexed ; this could be
addressed over the long term but will take time.
2) the http txn is useless to parse the body : the states present there
are only meaningful to received bytes (ie next bytes to parse) and
not at all to sent bytes. Thus chunks cannot be followed at all.
Even when implementing this later, it's unsure whether it will be
possible when dealing with compression.
So using the HTTP txn is now out of the equation and the only remaining
solution is to call an HTTP/1 message parser. We already have one, it was
slightly modified to avoid keeping states, benefitting from the fact
that the response was produced by haproxy and is entirely available.
It assumes the following rules are true, or that incurring an extra cost
to work around them is acceptable :
- the response buffer is read-write and supports modifications in place
- headers sent through / by haproxy are not folded. Folding is still
implemented by replacing CR/LF/tabs/spaces with spaces if encountered
- HTTP/0.9 responses are never sent by haproxy and have never been
supported at all
- haproxy will not send partial responses, the whole headers block will
be sent at once ; this means that we don't need to keep expensive
states and can afford to restart the parsing from the beginning when
facing a partial response ;
- response is contiguous (does not wrap). This was already the case
with the original parser and ensures we can safely dereference all
fields with (ptr,len)
The parser replaces all of the http_msg fields that were necessary with
local variables. The parser is not called on an http_msg but on a string
with a start and an end. The HTTP/1 states were reused for ease of use,
though the request-specific ones have not been implemented for now. The
error position and error state are supported and optional ; these ones
may be used later for bug hunting.
The parser issues the list of all the headers into a caller-allocated
array of struct ist.
The content-length/transfer-encoding headers are checked and the relevant
info is fed into the h1 message state (flags + body_len).
This will be used initially by the hpack table and hopefully later by a
new native http processor. These headers are made of name and value, both
an immediate string (ie: pointer and length).
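As a rough illustration (simplified; the actual field names may differ):

  /* one header entry: name and value as immediate strings */
  struct http_hdr {
      struct ist n;   /* name, e.g. "content-length" */
      struct ist v;   /* value, e.g. "1234" */
  };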
The chunk crlf parser used to depend on the channel and on the HTTP
message, even though it's not really needed. Let's remove this dependency
so that it can be used within the H2 to H1 gateway.
As part of this small API change, it was renamed to h1_skip_chunk_crlf()
to mention that it doesn't depend on http_msg anymore.
The chunk parser used to depend on the channel and on the HTTP message
but it's not really needed as they're only used to retrieve the buffer
as well as to return the number of bytes parsed and the chunk size.
Here instead we pass the (few) relevant information in arguments so that
the function may be reused without a channel nor an HTTP message (ie
from the H2 to H1 gateway).
As part of this API change, it was renamed to h1_parse_chunk_size() to
mention that it doesn't depend on http_msg anymore.
Functions http_parse_chunk_size(), http_skip_chunk_crlf() and
http_forward_trailers() were moved to h1.h and h1.c respectively so
that they can be called from outside. The parts that were inline
remained inline as it's critical for performance (+41% perf
difference reported in an earlier test). For now the "http_" prefix
remains in their name since they still depend on the http_msg type.
Certain types and enums are very specific to the HTTP/1 parser, and we'll
need to share them with the HTTP/2 to HTTP/1 translation code. Let's move
them to h1.c/h1.h. Those with very few occurrences or only used locally
were renamed to explicitly mention the relevant HTTP version :
enum ht_state -> h1_state.
http_msg_state_str -> h1_msg_state_str
HTTP_FLG_* -> H1_FLG_*
http_char_classes -> h1_char_classes
Others like HTTP_IS_*, HTTP_MSG_* are left to be done later.
This function returns the number of blocks. When a buffer is full and
properly aligned, buf->p loops back to the beginning, and the test in the
code doesn't cover that specific case, so it returns two chunks, a full
one and an empty one. It's harmless but can sometimes have a small impact
on performance and definitely makes the code hard to debug.
Fix regression introduced by commit:
'MAJOR: servers: propagate server status changes asynchronously.'
The building of the log line was reworked so that it is done at the
postponed point without missing any data.
[wt: this only affects 1.8-dev, no backport needed]
This function modifies the string to add a zero after the end, and returns
the start pointer. The purpose is to use it on strings extracted by parsers
from larger strings cut with delimiters that are not important and can be
destroyed. It allows any such string to be used with regular string
functions. It's also convenient to use with printf() to show data extracted
from writable areas.
There's no point having the channel marked writable as these functions
only extract data from the channel. The code was retrieved from their
ci/co ancestors.
For HTTP/2 we'll need some buffer-only equivalent functions to some of
the ones applying to channels and still squatting the bi_* / bo_*
namespace. Since these names have kept being misleading for quite some
time now and are really getting annoying, it's time to rename them. This
commit will use "ci/co" as the prefix (for "channel in", "channel out")
instead of "bi/bo". The following ones were renamed :
bi_getblk_nc, bi_getline_nc, bi_putblk, bi_putchr,
bo_getblk, bo_getblk_nc, bo_getline, bo_getline_nc, bo_inject,
bi_putchk, bi_putstr, bo_getchr, bo_skip, bi_swpbuf
This function returns true if the available buffer space wraps. This
will be used to detect if it's worth realigning a buffer when it lacks
contiguous space.
bi_istput() injects the ist string into the input region of the buffer,
it will be used to feed small data chunks into the conn_stream. bo_istput()
does the same into the output region of the buffer, it will be used to send
data via the transport layer and assumes there's no input data.
In order to match known patterns in wrapping buffer, we'll introduce new
string manipulation functions for buffers. The new function b_isteq()
relies on an ist string for the pattern and compares it against any
location in the buffer relative to <p>. The second function bi_eat()
is specially designed to match input contents.
This simply reduces the amount of output data from the buffer after
they have been transferred, in a way that is more natural than by
fiddling with buf->o. b_del() was renamed to bi_del() to avoid any
ambiguity (it's not yet used).
Commit 36eb3a3 ("MINOR: tools: make my_htonll() more efficient on x86_64")
brought an incorrect asm statement missing the input constraints, causing
the input value not necessarily to be placed into the same register as the
output one, resulting in random output. It happens to work when building at
-O0 but not above. This was only detected in the HTTP/2 parser, but in
mainline it could only affect the integer to binary sample cast.
No backport is needed since this bug was only introduced in the development
branch.
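For illustration, the corrected form looks roughly like this (a sketch,
not the exact haproxy code): the "0" input constraint ties the input to
the same register as output operand 0, which is what the broken version
was missing.

  static inline unsigned long long my_htonll(unsigned long long a)
  {
      __asm__ volatile("bswapq %0" : "=r"(a) : "0"(a));
      return a;
  }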
In order to prepare for multi-thread development, the code was reworked
to propagate changes asynchronously.
Servers with pending status changes are registered in a list
which is processed and emptied once per 'run poll' loop iteration.
Operational status changes are performed before administrative
status changes.
In case of multiple operational status changes or admin status
changes in the same 'run poll' loop iteration, those changes are
merged so that only the targeted status is reached.
Commit bcb86ab ("MINOR: session: add a streams field to the session
struct") added this list of streams that is not needed anymore. Let's
get rid of it now.
After some tests, gcc 5.x produces better code with likely()
than without, contrary to gcc 4.x where it was better to disable
it. Let's re-enable it for 5 and above.
It's not possible to use strlen() in const arrays even with const
strings, but we can use sizeof-1 via a macro. Let's provide this in
the IST() macro, as it saves the developer from having to count the
characters.
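Roughly, the trick looks like this (a simplified sketch; the real macro
may differ slightly):

  /* sizeof on a string literal includes the trailing zero, hence the -1 */
  #define IST(str) ((const struct ist){ .ptr = str, .len = sizeof str - 1 })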
After the removal of CO_FL_DATA_RD_SH and CO_FL_DATA_WR_SH, the
aggregate mask CO_FL_NOTIFY_DATA was not updated. It happens that
now CO_FL_NOTIFY_DATA and CO_FL_NOTIFY_DONE are similar, which may
reveal some overlap between the ->wake and ->xprt_done callbacks.
We'll see after the mux changes if both are still required.
These ones are the same as the previous ones but for 64 bit values.
We're using my_ntohll() and my_htonll() from standard.h for the byte
order conversion.
These ones are the equivalent of the read_* functions. They support
writing unaligned words, possibly wrapping, in host and network order.
The write_i*() functions were not implemented since the caller can
already use the unsigned version.
This patch adds the ability to read from a wrapping memory area (ie:
buffers). The new functions are called "readv_<type>". The original
ones were renamed to start with "read_" to make the difference more
obvious between the read method and the returned type.
It's worth noting that the memory barrier in readv_bytes() is critical,
as otherwise gcc decides that it doesn't need the resulting data, but
even worse, removes the length checks in readv_u64() and happily
performs an out-of-bounds unaligned read using read_u64()! Such
"optimizations" are a bit borderline, especially when they impact
security like this...
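The wrapping-read idea can be sketched as follows (a simplified
illustration, not the exact haproxy code):

  #include <stdint.h>
  #include <string.h>

  /* read a 32-bit host-order value possibly split across the wrapping
   * point: <l1> bytes at <p1>, the remainder at <p2> */
  static inline uint32_t readv_u32(const void *p1, size_t l1, const void *p2)
  {
      uint32_t v;

      if (l1 >= sizeof(v))
          memcpy(&v, p1, sizeof(v));
      else {
          memcpy(&v, p1, l1);
          memcpy((char *)&v + l1, p2, sizeof(v) - l1);
      }
      return v;
  }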
These ones return respectively the pointer to the end of the buffer and
the distance between b->p and the end. These will simplify a bit some
new code needed to parse directly from a wrapping buffer.
The current construct was made when developing on a 32-bit machine.
Having a simple bswap operation replaced with 2 bswap, 2 shift and
2 or is quite a waste of precious cycles... Let's provide a trivial
asm-based implementation for x86_64.
Instead of duplicating some sensitive listener-specific code in the
session and in the stream code, let's call listener_release() when
releasing a connection attached to a listener.
Some places call delete_listener() then decrement the number of
listeners and jobs. At least one other place calls delete_listener()
without doing so, but since it's in deinit(), it's harmless and cannot
risk causing zombie processes to survive. Given that the number of
listeners and jobs is incremented when creating the listeners, it's
much more logical to symmetrically decrement them when deleting such
listeners.
This function is used to create a series of listeners for a specific
address and a port range. It automatically calls the matching protocol
handlers to add them to the relevant lists. This way cfgparse doesn't
need to manipulate listeners anymore. As an added bonus, the memory
allocation is checked.
Since everything is self contained in proto_uxst.c there's no need to
export anything. The same should be done for proto_tcp.c but the file
contains other stuff that's not related to the TCP protocol itself
and which should first be moved somewhere else.
cfgparse has no business directly calling each individual protocol's 'add'
function to create a listener. Now that they're all registered, better
perform a protocol lookup on the family and have a standard ->add method
for all of them.
It's a shame that cfgparse() has to make special cases of each protocol
just to cast the port to the target address family. Let's pass the port
in argument to the function. The unix listener simply ignores it.
Adds cli commands to change at runtime whether informational messages
are prepended with severity level or not, with support for numeric and
worded severity in line with syslog severity level.
Adds stats socket config keyword severity-output to set default behavior
per socket on startup.
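For example (the runtime command name is an assumption here):

  stats socket /var/run/haproxy.sock level admin severity-output number

and at runtime on the CLI:

  set severity-output string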
These notification management functions and structs are generic and
it is better to move them into the common parts.
The notification management functions and structs have names
containing some "lua" references because they were written for
Lua. This patch also removes these references.
xref is used to create a relation between two elements.
Once an element is released, it breaks the relation. If the
relation is already broken, it frees the xref struct.
The pointer between two elements is a sort of refcount with
max value 1. The relation is only between two elements.
The pointer and the type of element a and b are conventional.
Note that xref is initialised from Lua files because Lua is
its only user.
smp_fetch_ssl_fc_cl_str has very limited usage (it only works with openssl
== 1.0.2 compiled with the enable-ssl-trace option). It uses the internal
cipher.algorithm_ssl attribute and SSL_CIPHER_standard_name (available with
ssl-trace).
This patch implements this (debug) function in a standard way. It uses the
common SSL_CIPHER_get_name to display the cipher name. It works with
openssl >= 1.0.2 and boringssl.
This function should be called by the poller to set FD_POLL_* flags on an FD and
update its state if needed. This function has been added to ease threads support
integration.
The server state and weight were reworked to handle
"pending" values updated by checks/CLI/LUA/agent.
These values are committed to be propagated to the
LB stack.
In further development related to multi-threading, the commit
will be handled at a sync point.
Pending values are named using the prefix 'next_' and
current values used by the LB stack are named 'cur_'.
This string is used in sample fetches so it is safe to use a preallocated trash
chunk instead of a buffer dynamically allocated during HAProxy startup.
First, this variable does not need to be publicly exposed because it is only
used by stick_table functions. So we declare it as a global static in
stick_table.c file. Then, it is useless to use a pointer. Using a plain struct
variable avoids any dynamic allocation.
swap_buffer is a global variable only used by buffer_slow_realign. So it has
been moved from global.h to buffer.c and it is allocated by the init_buffer
function. A deinit_buffer function has been added to release it. It is also
used to destroy the buffers' pool.
Now, we use init_log_buffers and deinit_log_buffers to, respectively, initialize
and deinitialize log buffers used for syslog messages.
These functions have been introduced to be used by threads, to deal with
thread-local log buffers.
Now, we use init_trash_buffers and deinit_trash_buffers to, respectively,
initialize and deinitialize trash buffers (trash, trash_buf1 and trash_buf2).
These functions have been introduced to be used by threads, to deal with
thread-local trash buffers.
Patch "MINOR: ssl: support ssl-min-ver and ssl-max-ver with crt-list"
introduce ssl_methods in struct ssl_bind_conf. struct bind_conf have now
ssl_methods and ssl_conf.ssl_methods (unused). It's error-prone. This patch
remove the duplicate structure to avoid any confusion.
After careful inspection, this flag is set at exactly two places :
- once in the health-check receive callback after receipt of a
response
- once in the stream interface's shutw() code where CF_SHUTW is
always set on chn->flags
The flag was checked in the checks before deciding to send data, but
when it is set, the wake() callback immediately closes the connection
so the CO_FL_SOCK_WR_SH flag is also set.
The flag was also checked in si_conn_send(), but checking the channel's
flag instead is enough and even reveals that one check involving it
could never match.
So it's time to remove this flag and replace its check with a check of
CF_SHUTW in the stream interface. This way each layer is responsible
for its shutdown, this will ease insertion of the mux layer.
This flag is both confusing and wrong. It is supposed to report the
fact that the data layer has received a shutdown, but in fact this is
reported by CO_FL_SOCK_RD_SH which is set by the transport layer after
this condition is detected. The only case where the flag above is set
is in the stream interface where CF_SHUTR is also set on the receiving
channel.
In addition, it was checked in the health checks code (while never set)
and was always tested jointly with CO_FL_SOCK_RD_SH everywhere, except in
conn_data_read0_pending() which incorrectly doesn't match the second
time it's called and is fortunately protected by an extra check on
(ic->flags & CF_SHUTR).
This patch gets rid of the flag completely. Now conn_data_read0_pending()
accurately reports the fact that the transport layer has detected the end
of the stream, regardless of the fact that this state was already consumed,
and the stream interface watches ic->flags&CF_SHUTR to know if the channel
was already closed by the upper layer (which it already used to do).
The now unused conn_data_read0() function was removed.
The session may need to enforce a timeout when waiting for a handshake.
Till now we used a trick to avoid allocating a pointer, we used to set
the connection's owner to the task and set the task's context to the
session, so that it was possible to circle between all of them. The
problem is that we'll really need to pass the pointer to the session
to the upper layers during initialization and that the only place to
store it is conn->owner, which is squatted for this trick.
So this patch moves the struct task* into the session where it should
always have been and ensures conn->owner points to the session until
the data layer is properly initialized.
Historically listeners used to have a handler depending on the upper
layer. But now it's exclusively process_stream() and nothing uses it
anymore so it can safely be removed.
Currently a task is allocated in session_new() and serves two purposes :
- either the handshake is complete and it is offered to the stream via
the second arg of stream_new()
- or the handshake is not complete and it's diverted to be used as a
timeout handler for the embryonic session and repurposed once we land
into conn_complete_session()
Furthermore, the task's process() function was taken from the listener's
handler in conn_complete_session() prior to being replaced by a call to
stream_new(). This will become a serious mess with the mux.
Since it's impossible to have a stream without a task, this patch removes
the second arg from stream_new() and makes this function allocate its own
task. In session_accept_fd(), we now only allocate the task if needed for
the embryonic session and delete it later.
The ->init() callback of the connection's data layer was only used to
complete the session's initialisation since sessions and streams were
split apart in 1.6. The problem is that it creates a big confusion in
the layers' roles as the session has to register a dummy data layer
when waiting for a handshake to complete, then hand it off to the
stream which will replace it.
The real need is to notify that the transport has finished initializing.
This should enable a better splitting between these layers.
This patch thus introduces a connection-specific callback called
xprt_done_cb() which informs about handshake successes or failures. With
this, data->init() can disappear, CO_FL_INIT_DATA as well, and we don't
need to register a dummy data->wake() callback to be notified of errors.
Till now connections used to rely exclusively on file descriptors. It
was planned in the past that alternative solutions would be implemented,
leading to member "union t" presenting sock.fd only for now.
With QUIC, the connection will need to continue to exist but will not
rely on a file descriptor but a connection ID.
So this patch introduces a "connection handle" which is either a file
descriptor or a connection ID, to replace the existing "union t". We've
now removed the intermediate "struct sock" which was never used. There
is no functional change at all, though the struct connection was inflated
by 32 bits on 64-bit platforms due to alignment.
Since commit 9d8dbbc ("MINOR: dns: Maximum DNS udp payload set to 8192") it's
possible to specify a packet size, but passing too large a size or a negative
size is not detected and results in memset() being performed over a 2GB+ area
upon receipt of the first DNS response, causing runtime crashes.
We now check that the size is not smaller than the smallest packet which is
the DNS header size (12 bytes).
No backport is needed.
Following up on the DNS extension introduction, this patch makes the
computation of the maximum number of records in a DNS response dynamic.
This computation is based on the announced payload size accepted by
HAProxy.
This patch fixes a bug where some servers managed by SRV record query
types never ever recover from a "no resolution" status.
The problem is due to a wrong function being called when breaking the
server/resolution (A/AAAA) relationship: this is performed when a server's
SRV record disappears from the SRV response.
Contrary to 64-bit libCs where the size of size_t is 8, on 32-bit systems
the size of size_t is 4 (the size of a long), which is not equal to the
size of the uint64_t type.
This was revealed by such GCC warnings on 32-bit systems:

  src/flt_spoe.c:2259:40: warning: passing argument 4 of spoe_decode_buffer from
  incompatible pointer type
     if (spoe_decode_buffer(&p, end, &str, &sz) == -1)
                                           ^

As the already existing code using spoe_decode_buffer() already uses such
pointers to uint64_t in place of pointers to size_t ;) (most of this code is
in the contrib directory), this simple patch modifies the prototype of
spoe_decode_buffer() to use a pointer to uint64_t in place of a pointer to
size_t, uint64_t being the type finally required for decode_varint().
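After the change, the prototype thus becomes (per the description above):

  int spoe_decode_buffer(char **buf, char *end, char **str, uint64_t *len);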
The two macros EXPECT_LF_HERE and EAT_AND_JUMP_OR_RETURN were exported
for use outside the HTTP parser. They now take extra arguments to avoid
implicit pointers and jump labels. These will be used to reimplement a
minimalist HTTP/1 parser in the H1->H2 gateway.
For HPACK we'll need to perform a lot of string manipulation between the
dynamic headers table and the output stream, and we need an efficient way
to deal with that, considering that the zero character is not an end of
string marker here. It turns out that gcc supports returning structs from
functions and is able to place up to two words directly in registers when
-freg-struct-return is used, which is the case by default on x86 and armv8. On
other architectures the caller reserves some stack space where the callee
can write, which is equivalent to passing a pointer to the return value.
So let's implement a few functions to deal with this as the resulting code
will be optimized on certain architectures where retrieving the length of
a string will simply consist in reading one of the two returned registers.
Extreme care was taken to ensure that the compiler gets maximum opportunities
to optimize out every bit of unused code. This is also the reason why no
call to regular string functions (such as strlen(), memcmp(), memcpy() etc)
were used. The code involving them is often larger than when they are open
coded. Given that strings are usually very small, especially when manipulating
headers, the time spent calling a function optimized for large vectors often
ends up being higher than the few cycles needed to count a few bytes.
An issue was met with __builtin_strlen() which can automatically convert
a constant string to its constant length. It doesn't accept NULLs and there
is no way to hide them using expressions as the check is made before the
optimizer is called. On gcc 4 and above, using an intermediary variable
is enough to hide it. On older versions, calls to ist() with an explicit
NULL argument will issue a warning. There is normally no reason to do this
but taking care of it the best possible still seems important.
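The core of the type can be illustrated as follows (simplified):

  /* a string with an explicit length; zero bytes are ordinary data */
  struct ist {
      char  *ptr;
      size_t len;
  };

  /* returned by value: with -freg-struct-return this fits in two
   * registers on x86 and armv8 */
  static inline struct ist ist2(const void *ptr, size_t len)
  {
      return (struct ist){ .ptr = (char *)ptr, .len = len };
  }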
Now each stream is added to the session's list of streams, so that it
will be possible to know all the streams belonging to a session, and
to know if any stream is still attached to a session.
These two functions respectively copy a memory area onto the chunk, and
append the contents of a memory area over a chunk. They are convenient
to prepare binary output data to be sent and will be used for HTTP/2.
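A usage sketch, assuming the names chunk_memcpy() and chunk_memcat()
(variable names hypothetical):

  /* replace the chunk's contents with a frame header, then append
   * the payload behind it */
  chunk_memcpy(&trash, frame_hdr, 9);
  chunk_memcat(&trash, payload, payload_len);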
EDNS extensions may be used to negotiate some settings between a DNS
client and a server.
For now we only use them to announce the maximum response payload size
accepted by HAProxy.
This size can be set through a configuration parameter in the resolvers
section. If not set, it defaults to 512 bytes.
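For instance (the parameter name below is an assumption based on what was
merged upstream):

  resolvers mydns
      nameserver dns1 192.168.0.53:53
      accepted_payload_size 8192    # defaults to 512 if not set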
Commit 48a8332a introduced SSL_CTX_get0_privatekey in openssl-compat.h but
SSL_CTX_get0_privatekey accesses an internal structure and can't be a
candidate for openssl-compat.h. The workaround with openssl < 1.0.2 is to
use SSL_new then SSL_get_privatekey.
Make it so for each server, instead of specifying a hostname, one can use
a SRV label.
When doing so, haproxy will first resolve the SRV label, then use the
resulting hostnames, as well as port and weight (priority is ignored right
now), for each server using the SRV label.
It is resolved periodically, and any server disappearing from the SRV records
will be removed, and any server appearing will be added, assuming there are
free servers in haproxy.
As DNS servers may not return all IPs in one answer, we want to cache the
previous entries. Those entries are removed when considered obsolete, which
happens when the IP hasn't been returned by the DNS server for a time
defined in the "hold obsolete" parameter of the resolver section. The default
is 30s.
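A hypothetical configuration combining the SRV resolution and the new hold
parameter could look like:

  resolvers mydns
      nameserver dns1 192.168.0.53:53
      hold obsolete 30s

  backend be_app
      server-template app 1-5 _http._tcp.example.com resolvers mydns check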
Since the commit f6b37c67 ["BUG/MEDIUM: ssl: in bind line, ssl-options after
'crt' are ignored."], certificate generation is broken.
To generate a certificate, we retrieved the private key of the default
certificate using the SSL object. But since the commit f6b37c67, the SSL object
is created with a dummy certificate (initial_ctx).
So to fix the bug, we use directly the default certificate in the bind_conf
structure. We use SSL_CTX_get0_privatekey function to do so. Because this
function does not exist for OpenSSL < 1.0.2 and for LibreSSL, it has been added
in openssl-compat.h with the right #ifdef.
If a server presents an unexpected certificate to haproxy, that is, a
certificate that doesn't match the expected name as configured in
verifyhost or as requested using SNI, we want to store that precious
information. Fortunately we have access to the connection in the
verification callback so it's possible to store an error code there.
For this purpose we use CO_ER_SSL_MISMATCH_SNI (for when the cert name
didn't match the one requested using SNI) and CO_ER_SSL_MISMATCH for
when it doesn't match verifyhost.
This patch fixes the commit 2ab8867 ("MINOR: ssl: compare server certificate
names to the SNI on outgoing connections")
When we check the certificate sent by a server, in the verify callback, we get
the SNI from the session (SSL_SESSION object). In OpenSSL, tlsext_hostname value
for this session is copied from the ssl connection (SSL object). But the copy is
done only if the "server_name" extension is found in the server hello
message. This means the server has found a certificate matching the client's
SNI.
When the server returns a default certificate not matching the client's SNI, it
doesn't set any "server_name" extension in the server hello message. So no SNI
is set on the SSL session and SSL_SESSION_get0_hostname always returns NULL.
To fix the problem, we get the SNI directly from the SSL connection. It is
always defined with the value set by the client.
If the commit 2ab8867 is backported in 1.7 and/or 1.6, this one must be
backported too.
Note: it's worth mentioning that by making the SNI check work, we
introduce another problem by which failed SNI checks can cause
long connection retries on the server, and in certain cases the
SNI value used comes from the client. So this patch series must
not be backported until this issue is resolved.
task_init() is called exclusively by task_new() which is the only way
to create a task. Most callers set t->expire to TICK_ETERNITY, some set
it to another value and a few like Lua don't set it at all as they don't
need a timeout, causing random values to be used in case the task gets
queued.
Let's always set t->expire to TICK_ETERNITY in task_init() so that all
tasks are now initialized in a clean state.
This patch can be backported as it will definitely make the code more
robust (at least the Lua code, possibly other places).
timegm() is not provided everywhere and the documentation on how to
replace it is bogus as it proposes an inefficient and non-thread safe
alternative.
Here we reimplement everything needed to compute the number of seconds
since Epoch based on the broken down fields in struct tm. It is only
guaranteed to return correct values for correct inputs. It was successfully
tested with all possible 32-bit values of time_t converted to struct tm
using gmtime() and back to time_t using the legacy timegm() and this
function, and both functions always produced the same result.
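The computation can be sketched as follows (a simplified illustration,
not haproxy's exact code; valid for correct UTC inputs from 1970 on):

  #include <time.h>

  static time_t my_timegm(const struct tm *tm)
  {
      /* days before each month in a non-leap year */
      static const int days[12] =
          { 0, 31, 59, 90, 120, 151, 181, 212, 243, 273, 304, 334 };
      int y = tm->tm_year + 1900;
      long d = (y - 1970) * 365L       /* whole years since the Epoch */
             + (y - 1969) / 4          /* completed leap years */
             - (y - 1901) / 100        /* minus century years */
             + (y - 1601) / 400        /* plus multiples of 400 */
             + days[tm->tm_mon] + tm->tm_mday - 1;

      /* add the current year's Feb 29th if already past */
      if (tm->tm_mon >= 2 && !(y % 4) && ((y % 100) || !(y % 400)))
          d++;
      return ((time_t)d * 24 + tm->tm_hour) * 3600
             + tm->tm_min * 60 + tm->tm_sec;
  }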
Thanks to Benoît Garnier for an instructive discussion and detailed
explanations of the various time functions, leading to this solution.
In some cases, the socket is misused. The user can open a socket and never
close it, or open the socket and close it without sending data. This
causes a resource leak on all resources associated with the stream (buffer,
spoe, ...).
This is caused by the stream_shutdown function being called outside
of the stream execution process. Sometimes, the shutdown is required
while the stream is not started, so the cleanup is ignored.
This patch changes the shutdown mode of the session. Now if the session is
no longer used and Lua wants to destroy it, it just sets a destroy flag
and the session kills itself.
This patch should be backported to 1.6 and 1.7
Functions hdr_idx_first_idx() and hdr_idx_first_pos() were missing a
"const" qualifier on their arguments which are not modified, causing
a warning in some experimental H2 code.
When several stick-tables were configured with several peers sections,
only a part of them could be synchronized: the ones attached to the last
parsed 'peers' section. This was due to the fact that, at least, the peer
I/O handler referred to the wrong peer section list, in fact always the
same: the last one parsed.
The fact that the global peer section list was named "struct peers *peers"
led to this issue. This variable name is dangerous ;).
So this patch renames the global 'peers' variable to 'cfg_peers' to ensure
that no such wrong references are still in use, then all the functions which
used the old 'peers' variable have been modified to refer to the correct
peer list.
Must be backported to 1.6 and 1.7.
When support for passing SNI to the server was added in 1.6-dev3, there
was no way to validate that the certificate presented by the server would
really match the name requested in the SNI, which is quite a problem as
it allows other (valid) certificates to be presented instead (when hitting
the wrong server or due to a man in the middle).
This patch adds the missing check against the value passed in the SNI.
The "verifyhost" value keeps precedence if set. If no SNI is used and
no verifyhost directive is specified, then the certificate name is not
checked (this is unchanged).
In order to extract the SNI value, it was necessary to make use of
SSL_SESSION_get0_hostname(), which appeared in openssl 1.1.0. This is
a trivial function which returns the value of s->tlsext_hostname, so
it was provided in the compat layer for older versions. After some
refinements from Emmanuel, it now builds with openssl 1.0.2, openssl
1.1.0 and boringssl. A test file was provided to ease testing all cases.
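In the compat layer, the shim is essentially the trivial accessor described
above (for openssl versions predating 1.1.0):

  #if (OPENSSL_VERSION_NUMBER < 0x1010000fL)
  static inline const char *SSL_SESSION_get0_hostname(const SSL_SESSION *sess)
  {
      return sess->tlsext_hostname;
  }
  #endif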
After some careful observation period it may make sense to backport
this to 1.7 and 1.6 as some users rightfully consider this limitation
as a bug.
Cc: Emmanuel Hocdet <manu@gandi.net>
Signed-off-by: Willy Tarreau <w@1wt.eu>
The bug: Maps/ACLs using the same file/id can mistakenly inherit
their flags from the last declared one.
For example:
  $ cat haproxy.conf
  listen mylistener
      mode http
      bind 0.0.0.0:8080
      acl myacl1 url -i -f mine.acl
      acl myacl2 url -f mine.acl
      acl myacl3 url -i -f mine.acl
      redirect location / if myacl2
  $ cat mine.acl
  foobar
Shows an unexpected redirect for request 'GET /FOObAR HTTP/1.0\n\n'.
This fix should be backported on mainline branches v1.6 and v1.7.
The reference to the current map/acl element to dump could
be destroyed if the map is updated from an 'http-request del-map'
configuration rule or through a 'del map/acl' on the CLI.
We use a 'back_refs' chaining element to fix this, as is
done to dump sessions.
This patch also needs the fix
'BUG/MAJOR: cli: fix custom io_release was crushed by NULL.'
to clean the back_ref and avoid a crash on a further
del/clear map operation.
Those fixes should be backported on mainline branches 1.7 and 1.6.
This patch won't directly apply to 1.6.
In order to authorize calls of appctx_wakeup on a running task:
- from within the task handler itself,
- in the future, from another thread.
The appctx is considered paused by default after running the handler.
The handler should explicitly call appctx_wakeup to be called again.
When appctx_free is called on a running handler, the real
free is postponed until the end of the handler's execution.
This will be used to retrieve the ALPN negotiated over SSL (or possibly
via the proxy protocol later). It's likely that this information should
be stored in the connection itself, but it requires adding an extra
pointer and an extra integer. Thus better rely on the transport layer
to pass this info for now.
In order to authorize calls of task_wakeup on a running task:
- from within the task handler itself,
- in the future, from another thread.
The lookups on the runqueue and waitqueue are reworked
to prepare for the multithread work.
If task_wakeup is called on a running task, the woken
message flags are saved in the task's 'pending_state' attribute.
The real wakeup is postponed until the end of the handler's
execution and the woken messages are copied from pending_state
to the task's state attribute.
It's important to note that this change will cause a very minor
(though measurable) performance loss but it is necessary to make
forward progress on a multi-threaded scheduler. Most users won't
ever notice.
Under certain circumstances, if a stream's task is first woken up
(eg: I/O event) then notified of the availability of a buffer it
was waiting for via stream_res_wakeup(), this second event is lost
because the flags are only merged after seeing that the task is
running. At the moment it seems that the TASK_WOKEN_RES event is
not explicitly checked for, but better fix this before getting
reports of lost events.
This fix removes this "task running" test which is properly
performed in task_wakeup(), while the flags are properly merged.
It must be backported to 1.7 and 1.6.
These functions were added in commit 637f8f2c ("BUG/MEDIUM: buffers: Fix how
input/output data are injected into buffers").
This patch fixes hidden bugs. When a buffer is full (buf->i + buf->o ==
buf->size), instead of returning 0, these functions can return buf->size. Today,
this never happens because callers already check if the buffer is full before
calling bi/bo_contig_space. But to avoid possible bugs if calling conditions
changed, we slightly refactored these functions.
The SSL/TLS version can be changed per certificate if and only if the openssl
lib supports an early callback on the handshake and, of course, haproxy
implements it. It's ok for BoringSSL. For OpenSSL, version 1.1.1 has such a
callback and could support it.
Very early in the connection rework process leading to v1.5-dev12, commit
56a77e5 ("MEDIUM: connection: complete the polling cleanups") marked the
end of use for this flag which since was never set anymore, but it continues
to be tested. Let's kill it now.
When dumping data at various places in the code, it's hard to figure
what is present where. To make this easier, this patch slightly modifies
debug_hexdump() to take a prefix string which is prepended in front of
each output line.
This patch is a major upgrade of the internal run-time DNS resolver in
HAProxy and it brings the following 3 main changes:
1. DNS resolution task
Up to now, DNS resolution was triggered by the health check task.
From now on, the DNS resolution task is autonomous. It is started by
HAProxy right after the scheduler is available and is woken either when
network IO occurs for one of its nameservers or when a timeout is
reached.
This means we can now enable DNS resolution for a server without
enabling health checking.
2. Introduction of a dns_requester structure
Up to now, DNS resolution was purposely made for resolving server
hostnames.
The idea is to ensure that any HAProxy internal object should be able
to trigger a DNS resolution. For this purpose, 2 things have to be done:
- clean up the DNS code from the server structure (this was already
quite clean actually) and clean up the server's callbacks from
manipulating too much DNS resolution
- create an agnostic structure which allows linking a DNS resolution
and a requester of any type (using obj_type enum)
3. Manage requesters through queues
Up to now, there was a unique relationship between a resolution and its
owner (aka the requester now). It's a shame, because in some cases,
multiple objects may share the same hostname and may benefit from a
resolution being performed by a third party.
This patch introduces the notion of queues, which are basically lists of
either currently running resolutions or waiting ones.
The resolutions are now available as a pool, which belongs to the resolvers.
The pool has a default size of 64 resolutions per resolvers section and is
allocated at configuration parsing.
Introduction of a DNS response LRU cache in HAProxy.
When a positive response is received from a DNS server, HAProxy stores
it in the struct resolution and then also populates a LRU cache with the
response.
For now, the key in the cache is a XXHASH64 of the hostname in the
domain name format concatenated with the query type in string format.
Prior to this patch, the DNS responses were stored in a pre-allocated
memory area (allocated at HAProxy's startup).
The problem is that this memory was erased for each new DNS response
received and processed.
This patch removes the global memory allocation (which was not thread
safe by the way) and introduces a storage of the dns response in the
struct resolution.
The memory in the struct resolution is also reserved at start up and is
thread safe, since each resolution structure will have its own memory
area.
For now, we simply store the response and use it atomically per
response per server.
In the process of breaking links between dns_* functions and other
structures (mainly server and a bit of resolution), the function
dns_get_ip_from_response needs to be reworked: it can now call
"callback" functions based on the resolution's owner type to allow
modifying the way the response is processed.
For now, the main purpose of the callback function is to check that an IP
address is not already assigned to an element of the same type.
For now, only server type has a callback.
This patch introduces some re-organisation of the DNS code in
HAProxy.
1. make the dns_* functions less dependent on 'struct server' and 'struct resolution'.
With this in mind, the following changes were performed:
- 'struct dns_options' has been removed from 'struct resolution' (well,
we might need it back at some point later, we'll see)
==> we'll use the 'struct dns_options' from the owner of the resolution
- dns_get_ip_from_response(): takes a 'struct dns_options' instead of
'struct resolution'
==> so the caller can pass its own dns options to get the most
appropriate IP from the response
- dns_process_resolve(): struct dns_option is deduced from new
resolution->requester_type parameter
2. add hostname_dn and hostname_dn_len into struct server
In order to avoid recomputing a server's hostname into its domain name
format (and use a trash buffer to store the result), it is safer to
compute it once at configuration parsing and to store it into the struct
server.
In the meantime, the struct resolution linked to the server no longer
needs to store the hostname in domain name format. A simple pointer to
the server's one will do the trick.
The function srv_alloc_dns_resolution() properly manages everything for
us: memory allocation, pointer updates, etc...
3. move resolvers pointer into struct server
This patch makes the pointer to struct dns_resolvers from struct
dns_resolution obsolete.
Purpose is to make the resolution as "neutral" as possible and since the
requester is already linked to the resolvers, then we don't need this
information anymore in the resolution itself.
A couple of new functions to allocate and free memory for a DNS
resolution structure. The main purpose is to make the code related to DNS
more consistent.
They allocate or free memory for the structure itself. Later, if needed,
they should also allocate / free the buffers, etc, used by this structure.
They don't set/unset any parameters, this is the role of the caller.
This patch also implements calls to these functions everywhere it is
required.
The default length of the request uri in log messages is 1024. In some use
cases, you need to keep the long trail of GET parameters. The only
way to increase this length was to recompile with DEFINE=-DREQURI_LEN=2048.
This commit introduces a tune.http.logurilen configuration directive,
allowing this to be tuned at runtime.
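For example:

  global
      tune.http.logurilen 4096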
This option exits every worker when one of the current workers dies.
It allows you to monitor the master process in order to relaunch
everything on a failure.
For example it can be used with systemd and Restart=on-failure in a spec
file.
This commit removes the -Ds systemd mode in HAProxy in order to replace
it with a more generic master worker system. It aims to entirely replace
the systemd wrapper in the near future.
The master worker mode implements a new way of managing HAProxy
processes. The master is in charge of parsing the configuration
file and is responsible for spawning child processes.
The master worker mode can be invoked by using the -W flag. It can be
used either in background mode (-D) or foreground mode. When used in
background mode, the master will fork to daemonize.
In master worker background mode, chroot, setuid and setgid are done in
each child rather than in the master process, because the master process
will still need access to filesystem to reload the configuration.
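For example, to start in master-worker mode as a daemon:

  $ haproxy -W -D -f /etc/haproxy/haproxy.cfg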
This patch adds support for a maximum of 32 engines
in async mode.
Some tests have been done using 2 engines simultaneously.
This patch also removes the specific 'async' attribute from the connection
structure. All the code relies only on OpenSSL functions.
ssl-mode-async is a global configuration parameter which enables
asynchronous processing in OpenSSL for all SSL connections haproxy
handles. With SSL_MODE_ASYNC set, TLS I/O operations may indicate a
retry with SSL_ERROR_WANT_ASYNC if an asynchronous-capable engine
is used to perform cryptographic operations. Currently
async mode only supports one async-capable engine.
This is the latest version of the patchset which includes Emeric's
updates :
- improved async fd cleaning when openssl reports an fd to delete
- prevent conn_fd_handler from calling SSL_{read,write,handshake} until
the async fd is ready, as these operations are very slow and waste CPU
- postponing of SSL_free to ensure the async operation can complete and
does not dereference a released SSL.
- proper removal of async fd from the fdtab and removal of the unused async
flag.
This patch adds the global 'ssl-engine' keyword. First arg is an engine
identifier followed by a list of default_algorithms the engine will
operate.
If the openssl version is too old, an error is reported when the option
is used.
This patch changes the stats socket rights required for sending
listening sockets.
The previous behavior was to allow any unix stats socket with admin
level to send sockets. That's not possible anymore; you have to set this
option to activate socket sending.
Example:
stats socket /var/run/haproxy4.sock mode 666 expose-fd listeners level user process 4
The current level variable uses only 2 bits to store the 3 access
levels (user, oper and admin).
This patch adds a bitmask which allows the remaining bits to be used for
other purposes.
The plan is to add min-tlsxx/max-tlsxx configuration, more consistent than
no-tlsxx. This patch introduces internal min/max and replaces the force-tlsxx
implementation.
The SSL method configuration is stored in 'struct tls_version_filter'. The
mapping of the SSL method configuration to openssl settings is abstracted in
the 'methodVersions' table.
With openssl < 1.1.0, SSL_CTX_set_ssl_version is used for force (min == max).
With openssl >= 1.1.0, SSL_CTX_set_min/max_proto_version is used.
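With the keywords later built on top of this (per the crt-list patch
mentioned earlier in this log), a bind line could look like:

  bind :443 ssl crt site.pem ssl-min-ver TLSv1.1 ssl-max-ver TLSv1.2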
This patch adds a new stats socket command to modify server
FQDNs at run time.
Its syntax:
set server <backend>/<server> fqdn <FQDN>
This patch also adds FQDNs to server state file at the end
of each line for backward compatibility ("-" if not present).
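For example, via socat (backend and server names hypothetical):

  $ echo "set server be_app/srv1 fqdn app1.example.com" | \
        socat stdio /var/run/haproxy.sock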
These encoding functions do general stuff and can be used in
contexts other than spoe. This patch moves the functions spoe_encode_varint
and spoe_decode_varint from spoe to common. It also removes the spoe prefix.
These functions will be used for encoding values in new binary sample fetches.
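A usage sketch of the renamed helpers (signatures and return conventions
assumed, for illustration only):

  char buf[16], *p = buf;
  uint64_t v = 0;

  /* serialize a 64-bit value then read it back */
  if (encode_varint(1337ULL, &p, buf + sizeof(buf)) != -1) {
      char *q = buf;
      decode_varint(&q, p, &v);   /* v == 1337 */
  }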
When we include the header proto/spoe.h in other files in the same
project, the compiler complains that the symbol has multiple definitions:
src/flt_spoe.o: In function `spoe_encode_varint':
~/git/haproxy/include/proto/spoe.h:45: multiple definition of `spoe_encode_varint'
src/proto_http.o:~/git/haproxy/include/proto/spoe.h:45: first defined here
This patch makes backend sections support the new 'server-template' keyword.
Such 'server-template' objects are parsed similarly to a 'server' object
by parse_server() function, but its first arguments are as follows:
server-template <ID prefix> <nb | range> <ip | fqdn>:<port> ...
The remaining arguments are the same as for 'server' lines.
With such server template declarations, servers may be allocated with IDs
built from <ID prefix> and <nb | range> arguments.
For instance declaring:
server-template foo 1-5 google.com:80 ...
or
server-template foo 5 google.com:80 ...
would be equivalent to declaring:
server foo1 google.com:80 ...
server foo2 google.com:80 ...
server foo3 google.com:80 ...
server foo4 google.com:80 ...
server foo5 google.com:80 ...
When running with multiple processes, if some proxies are just assigned
to some processes, the other processes will just close the file descriptors
for the listening sockets. However, we may still have to provide those
sockets when reloading, so instead we just try hard to pretend those proxies
are dead, while keeping the sockets opened.
A new global option, "no-reused-socket", has been added, to restore the old
behavior of closing the sockets not bound to this process.
Add the "-x" flag, that takes a path to a unix socket as an argument. If
used, haproxy will connect to the socket, and asks to get all the
listening sockets from the old process. Any failure is fatal.
This is needed to get seamless reloads on linux.
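For example, a seamless reload could then be performed with (paths
hypothetical):

  $ haproxy -f /etc/haproxy/haproxy.cfg -x /var/run/haproxy.sock \
            -sf $(cat /var/run/haproxy.pid)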
Add a new command that will send all the listening sockets, via the
stats socket, and their properties.
This is a first step to work around the linux problem when reloading
haproxy.
Released version 1.8-dev1 with the following main changes :
- BUG/MEDIUM: proxy: return "none" and "unknown" for unknown LB algos
- BUG/MINOR: stats: make field_str() return an empty string on NULL
- DOC: Spelling fixes
- BUG/MEDIUM: http: Fix tunnel mode when the CONNECT method is used
- BUG/MINOR: http: Keep the same behavior between 1.6 and 1.7 for tunneled txn
- BUG/MINOR: filters: Protect args in macros HAS_DATA_FILTERS and IS_DATA_FILTER
- BUG/MINOR: filters: Invert evaluation order of HTTP_XFER_BODY and XFER_DATA analyzers
- BUG/MINOR: http: Call XFER_DATA analyzer when HTTP txn is switched in tunnel mode
- BUG/MAJOR: stream: fix session abort on resource shortage
- OPTIM: stream-int: don't disable polling anymore on DONT_READ
- BUG/MINOR: cli: allow the backslash to be escaped on the CLI
- BUG/MEDIUM: cli: fix "show stat resolvers" and "show tls-keys"
- DOC: Fix map table's format
- DOC: Added 51Degrees conv and fetch functions to documentation.
- BUG/MINOR: http: don't send an extra CRLF after a Set-Cookie in a redirect
- DOC: mention that req_tot is for both frontends and backends
- BUG/MEDIUM: variables: some variable name can hide another ones
- MINOR: lua: Allow argument for actions
- BUILD: rearrange target files by build time
- CLEANUP: hlua: just indent functions
- MINOR: lua: give HAProxy variable access to the applets
- BUG/MINOR: stats: fix be/sessions/max output in html stats
- MINOR: proxy: Add fe_name/be_name fetchers next to existing fe_id/be_id
- DOC: lua: Documentation about some entry missing
- DOC: lua: Add documentation about variable manipulation from applet
- MINOR: Do not forward the header "Expect: 100-continue" when the option http-buffer-request is set
- DOC: Add undocumented argument of the trace filter
- DOC: Fix some typo in SPOE documentation
- MINOR: cli: Remove useless call to bi_putchk
- BUG/MINOR: cli: be sure to always warn the cli applet when input buffer is full
- MINOR: applet: Count number of (active) applets
- MINOR: task: Rename run_queue and run_queue_cur counters
- BUG/MEDIUM: stream: Save unprocessed events for a stream
- BUG/MAJOR: Fix how the list of entities waiting for a buffer is handled
- BUILD/MEDIUM: Fixing the build using LibreSSL
- BUG/MEDIUM: lua: In some case, the return of sample-fetches is ignored (2)
- SCRIPTS: git-show-backports: fix a harmless typo
- SCRIPTS: git-show-backports: add -H to use the hash of the commit message
- BUG/MINOR: stream-int: automatically release SI_FL_WAIT_DATA on SHUTW_NOW
- CLEANUP: applet/lua: create a dedicated ->fcn entry in hlua_cli context
- CLEANUP: applet/table: add an "action" entry in ->table context
- CLEANUP: applet: remove the now unused appctx->private field
- DOC: lua: documentation about time parser functions
- DOC: lua: improve links
- DOC: lua: section declared twice
- MEDIUM: cli: 'show cli sockets' list the CLI sockets
- BUG/MINOR: cli: "show cli sockets" wouldn't list all processes
- BUG/MINOR: cli: "show cli sockets" would always report process 64
- CLEANUP: lua: rename one of the lua appctx union
- BUG/MINOR: lua/cli: bad error message
- MEDIUM: lua: use memory pool for hlua struct in applets
- MINOR: lua/signals: Remove Lua part from signals.
- DOC: cli: show cli sockets
- MINOR: cli: automatically enable a CLI I/O handler when there's no parser
- CLEANUP: memory: remove the now unused cli_parse_show_pools() function
- CLEANUP: applet: group all CLI contexts together
- CLEANUP: stats: move a misplaced stats context initialization
- MINOR: cli: add two general purpose pointers and integers in the CLI struct
- MINOR: appctx/cli: remove the cli_socket entry from the appctx union
- MINOR: appctx/cli: remove the env entry from the appctx union
- MINOR: appctx/cli: remove the "be" entry from the appctx union
- MINOR: appctx/cli: remove the "dns" entry from the appctx union
- MINOR: appctx/cli: remove the "server_state" entry from the appctx union
- MINOR: appctx/cli: remove the "tlskeys" entry from the appctx union
- CONTRIB: tcploop: add limits.h to fix build issue with some compilers
- MINOR/DOC: lua: just precise one thing
- DOC: fix small typo in fe_id (backend instead of frontend)
- BUG/MINOR: Fix the sending function in Lua's cosocket
- BUG/MINOR: lua: memory leak executing tasks
- BUG/MINOR: lua: bad return code
- BUG/MINOR: lua: memleak when Lua/cli fails
- MEDIUM: lua: remove Lua struct from session, and allocate it with memory pools
- CLEANUP: haproxy: statify unexported functions
- MINOR: haproxy: add a registration for build options
- CLEANUP: wurfl: use the build options list to report it
- CLEANUP: 51d: use the build options list to report it
- CLEANUP: da: use the build options list to report it
- CLEANUP: namespaces: use the build options list to report it
- CLEANUP: tcp: use the build options list to report transparent modes
- CLEANUP: lua: use the build options list to report it
- CLEANUP: regex: use the build options list to report the regex type
- CLEANUP: ssl: use the build options list to report the SSL details
- CLEANUP: compression: use the build options list to report the algos
- CLEANUP: auth: use the build options list to report its support
- MINOR: haproxy: add a registration for post-check functions
- CLEANUP: checks: make use of the post-init registration to start checks
- CLEANUP: filters: use the function registration to initialize all proxies
- CLEANUP: wurfl: make use of the late init registration
- CLEANUP: 51d: make use of the late init registration
- CLEANUP: da: make use of the late init registration code
- MINOR: haproxy: add a registration for post-deinit functions
- CLEANUP: wurfl: register the deinit function via the dedicated list
- CLEANUP: 51d: register the deinitialization function
- CLEANUP: da: register the deinitialization function
- CLEANUP: wurfl: move global settings out of the global section
- CLEANUP: 51d: move global settings out of the global section
- CLEANUP: da: move global settings out of the global section
- MINOR: cfgparse: add two new functions to check arguments count
- MINOR: cfgparse: move parsing of "ca-base" and "crt-base" to ssl_sock
- MEDIUM: cfgparse: move all tune.ssl.* keywords to ssl_sock
- MEDIUM: cfgparse: move maxsslconn parsing to ssl_sock
- MINOR: cfgparse: move parsing of ssl-default-{bind,server}-ciphers to ssl_sock
- MEDIUM: cfgparse: move ssl-dh-param-file parsing to ssl_sock
- MEDIUM: compression: move the zlib-specific stuff from global.h to compression.c
- BUG/MEDIUM: ssl: properly reset the reused_sess during a forced handshake
- BUG/MEDIUM: ssl: avoid double free when releasing bind_confs
- BUG/MINOR: stats: fix be/sessions/current out in typed stats
- MINOR: tcp-rules: check that the listener exists before updating its counters
- MEDIUM: spoe: don't create a dummy listener for outgoing connections
- MINOR: listener: move the transport layer pointer to the bind_conf
- MEDIUM: move listener->frontend to bind_conf->frontend
- MEDIUM: ssl: remove the proxy argument from most functions
- MINOR: connection: add a new prepare_bind_conf() entry to xprt_ops
- MEDIUM: ssl_sock: implement ssl_sock_prepare_bind_conf()
- MINOR: connection: add a new destroy_bind_conf() entry to xprt_ops
- MINOR: ssl_sock: implement ssl_sock_destroy_bind_conf()
- MINOR: server: move the use_ssl field out of the ifdef USE_OPENSSL
- MINOR: connection: add a minimal transport layer registration system
- CLEANUP: connection: remove all direct references to raw_sock and ssl_sock
- CLEANUP: connection: unexport raw_sock and ssl_sock
- MINOR: connection: add new prepare_srv()/destroy_srv() entries to xprt_ops
- MINOR: ssl_sock: implement and use prepare_srv()/destroy_srv()
- CLEANUP: ssl: move tlskeys_finalize_config() to a post_check callback
- CLEANUP: ssl: move most ssl-specific global settings to ssl_sock.c
- BUG/MINOR: backend: nbsrv() should return 0 if backend is disabled
- BUG/MEDIUM: ssl: for a handshake when server-side SNI changes
- BUG/MINOR: systemd: potential zombie processes
- DOC: Add timings events schemas
- BUILD: lua: build failed on FreeBSD.
- MINOR: samples: add xx-hash functions
- MEDIUM: regex: pcre2 support
- BUG/MINOR: option prefer-last-server must be ignored in some cases
- MINOR: stats: Support "select all" for backend actions
- BUG/MINOR: sample-fetches/stick-tables: bad type for the sample fetches sc*_get_gpt0
- BUG/MAJOR: channel: Fix the definition order of channel analyzers
- BUG/MINOR: http: report real parser state in error captures
- BUILD: scripts: automatically update the branch in version.h when releasing
- MINOR: tools: add a generic hexdump function for debugging
- BUG/MAJOR: http: fix risk of getting invalid reports of bad requests
- MINOR: http: custom status reason.
- MINOR: connection: add sample fetch "fc_rcvd_proxy"
- BUG/MINOR: config: emit a warning if http-reuse is enabled with incompatible options
- BUG/MINOR: tools: fix off-by-one in port size check
- BUG/MEDIUM: server: consider AF_UNSPEC as a valid address family
- MEDIUM: server: split the address and the port into two different fields
- MINOR: tools: make str2sa_range() return the port in a separate argument
- MINOR: server: take the destination port from the port field, not the addr
- MEDIUM: server: disable protocol validations when the server doesn't resolve
- BUG/MEDIUM: tools: do not force an unresolved address to AF_INET:0.0.0.0
- BUG/MINOR: ssl: EVP_PKEY must be freed after X509_get_pubkey usage
- BUG/MINOR: ssl: assert on SSL_set_shutdown with BoringSSL
- MINOR: Use "500 Internal Server Error" for 500 error/status code message.
- MINOR: proto_http.c 502 error txt typo.
- DOC: add deprecation notice to "block"
- MINOR: compression: fix -vv output without zlib/slz
- BUG/MINOR: Reset errno variable before calling strtol(3)
- MINOR: ssl: don't show prefer-server-ciphers output
- OPTIM/MINOR: config: Optimize fullconn automatic computation loading configuration
- BUG/MINOR: stream: Fix how backend-specific analyzers are set on a stream
- MAJOR: ssl: bind configuration per certificate
- MINOR: ssl: add curve suite for ECDHE negotiation
- MINOR: checks: Add agent-addr config directive
- MINOR: cli: Add possibility to change agent config via CLI/socket
- MINOR: doc: Add docs for agent-addr configuration variable
- MINOR: doc: Add docs for agent-addr and agent-send CLI commands
- BUILD: ssl: fix to build (again) with boringssl
- BUILD: ssl: fix build on OpenSSL 1.0.0
- BUILD: ssl: silence a warning reported for ERR_remove_state()
- BUILD: ssl: eliminate warning with OpenSSL 1.1.0 regarding RAND_pseudo_bytes()
- BUILD: ssl: kill a build warning introduced by BoringSSL compatibility
- BUG/MEDIUM: tcp: don't poll for write when connect() succeeds
- BUG/MINOR: unix: fix connect's polling in case no data are scheduled
- MINOR: server: extend the flags to 32 bits
- BUG/MINOR: lua: Map.end are not reliable because "end" is a reserved keyword
- MINOR: dns: give ability to dns_init_resolvers() to close a socket when requested
- BUG/MAJOR: dns: restart sockets after fork()
- MINOR: chunks: implement a simple dynamic allocator for trash buffers
- BUG/MEDIUM: http: prevent redirect from overwriting a buffer
- BUG/MEDIUM: filters: Do not truncate HTTP response when body length is undefined
- BUG/MEDIUM: http: Prevent replace-header from overwriting a buffer
- BUG/MINOR: http: Return an error when a replace-header rule failed on the response
- BUG/MINOR: sendmail: The return of vsnprintf is not cleanly tested
- BUG/MAJOR: ssl: fix a regression in ssl_sock_shutw()
- BUG/MAJOR: lua segmentation fault when the request is like 'GET ?arg=val HTTP/1.1'
- BUG/MEDIUM: config: reject anything but "if" or "unless" after a use-backend rule
- MINOR: http: don't close when redirect location doesn't start with "/"
- MEDIUM: boringssl: support native multi-cert selection without bundling
- BUG/MEDIUM: ssl: fix verify/ca-file per certificate
- BUG/MEDIUM: ssl: switchctx should not return SSL_TLSEXT_ERR_ALERT_WARNING
- MINOR: ssl: removes SSL_CTX_set_ssl_version call and cleanup CTX creation.
- BUILD: ssl: fix build with -DOPENSSL_NO_DH
- MEDIUM: ssl: add new sample-fetch which captures the cipherlist
- MEDIUM: ssl: remove ssl-options from crt-list
- BUG/MEDIUM: ssl: in bind line, ssl-options after 'crt' are ignored.
- BUG/MINOR: ssl: fix cipherlist captures with sustainable SSL calls
- MINOR: ssl: improved cipherlist captures
- BUG/MINOR: spoe: Fix soft stop handler using a specific id for spoe filters
- BUG/MINOR: spoe: Fix parsing of arguments in spoe-message section
- MAJOR: spoe: Add support of pipelined and asynchronous exchanges with agents
- MINOR: spoe: Add support for pipelining/async capabilities in the SPOA example
- MINOR: spoe: Remove SPOE details from the appctx structure
- MINOR: spoe: Add status code in error variable instead of hardcoded value
- MINOR: spoe: Send a log message when an error occurred during event processing
- MINOR: spoe: Check the scope of sample fetches used in SPOE messages
- MEDIUM: spoe: Be sure to wakeup the good entity waiting for a buffer
- MINOR: spoe: Use the min of all known max_frame_size to encode messages
- MAJOR: spoe: Add support of payload fragmentation in NOTIFY frames
- MINOR: spoe: Add support for fragmentation capability in the SPOA example
- MAJOR: spoe: refactor the filter to clean up the code
- MINOR: spoe: Handle NOTIFY frames cancellation using ABORT bit in ACK frames
- REORG: spoe: Move struct and enum definitions in dedicated header file
- REORG: spoe: Move low-level encoding/decoding functions in dedicated header file
- MINOR: spoe: Improve implementation of the payload fragmentation
- MINOR: spoe: Add support of negation for options in SPOE configuration file
- MINOR: spoe: Add "pipelining" and "async" options in spoe-agent section
- MINOR: spoe: Rely on alertif_too_many_arg during configuration parsing
- MINOR: spoe: Add "send-frag-payload" option in spoe-agent section
- MINOR: spoe: Add "max-frame-size" statement in spoe-agent section
- DOC: spoe: Update SPOE documentation to reflect recent changes
- MINOR: config: warn when some HTTP rules are used in a TCP proxy
- BUG/MEDIUM: ssl: Clear OpenSSL error stack after trying to parse OCSP file
- BUG/MEDIUM: cli: Prevent double free in CLI ACL lookup
- BUG/MINOR: Fix "get map <map> <value>" CLI command
- MINOR: Add nbsrv sample converter
- CLEANUP: Replace repeated code to count usable servers with be_usable_srv()
- MINOR: Add hostname sample fetch
- CLEANUP: Remove comment that's no longer valid
- MEDIUM: http_error_message: txn->status / http_get_status_idx.
- MINOR: http-request tarpit deny_status.
- CLEANUP: http: make http_server_error() not set the status anymore
- MEDIUM: stats: Add JSON output option to show (info|stat)
- MEDIUM: stats: Add show json schema
- BUG/MAJOR: connection: update CO_FL_CONNECTED before calling the data layer
- MINOR: server: Add dynamic session cookies.
- MINOR: cli: Let configure the dynamic cookies from the cli.
- BUG/MINOR: checks: attempt clean shutw for SSL check
- CONTRIB: tcploop: make it build on FreeBSD
- CONTRIB: tcploop: fix time format to silence build warnings
- CONTRIB: tcploop: report action 'K' (kill) in usage message
- CONTRIB: tcploop: fix connect's address length
- CONTRIB: tcploop: use the trash instead of NULL for recv()
- BUG/MEDIUM: listener: do not try to rebind another process' socket
- BUG/MEDIUM: server: Fix crash when dynamic is defined, but no key is provided.
- CLEANUP: config: Typo in comment.
- BUG/MEDIUM: filters: Fix channels synchronization in flt_end_analyze
- TESTS: add a test configuration to stress handshake combinations
- BUG/MAJOR: stream-int: do not depend on connection flags to detect connection
- BUG/MEDIUM: connection: ensure to always report the end of handshakes
- MEDIUM: connection: don't test for CO_FL_WAKE_DATA
- CLEANUP: connection: completely remove CO_FL_WAKE_DATA
- BUG: payload: fix payload not retrieving arbitrary lengths
- BUILD: ssl: simplify SSL_CTX_set_ecdh_auto compatibility
- BUILD: ssl: fix OPENSSL_NO_SSL_TRACE for boringssl and libressl
- BUG/MAJOR: http: fix typo in http_apply_redirect_rule
- MINOR: doc: 2.4. Examples should be 2.5. Examples
- BUG/MEDIUM: stream: fix client-fin/server-fin handling
- MINOR: fd: add a new flag HAP_POLL_F_RDHUP to struct poller
- BUG/MINOR: raw_sock: always perform the last recv if RDHUP is not available
- OPTIM: poll: enable support for POLLRDHUP
- MINOR: kqueue: exclusively rely on the kqueue returned status
- MEDIUM: kqueue: take care of EV_EOF to improve polling status accuracy
- MEDIUM: kqueue: only set FD_POLL_IN when there are pending data
- DOC/MINOR: Fix typos in proxy protocol doc
- DOC: Protocol doc: add checksum, TLV type ranges
- DOC: Protocol doc: add SSL TLVs, rename CHECKSUM
- DOC: Protocol doc: add noop TLV
- MEDIUM: global: add a 'hard-stop-after' option to cap the soft-stop time
- MINOR: dns: improve DNS response parsing to use as many available records as possible
- BUG/MINOR: cfgparse: loop in tracked servers lists not detected by check_config_validity().
- MINOR: server: irrelevant error message with 'default-server' config file keyword.
- MINOR: server: Make 'default-server' support 'backup' keyword.
- MINOR: server: Make 'default-server' support 'check-send-proxy' keyword.
- CLEANUP: server: code alignment.
- MINOR: server: Make 'default-server' support 'non-stick' keyword.
- MINOR: server: Make 'default-server' support 'send-proxy' and 'send-proxy-v2' keywords.
- MINOR: server: Make 'default-server' support 'check-ssl' keyword.
- MINOR: server: Make 'default-server' support 'force-sslv3' and 'force-tlsv1[0-2]' keywords.
- CLEANUP: server: code alignment.
- MINOR: server: Make 'default-server' support 'no-ssl*' and 'no-tlsv*' keywords.
- MINOR: server: Make 'default-server' support 'ssl' keyword.
- MINOR: server: Make 'default-server' support 'send-proxy-v2-ssl*' keywords.
- CLEANUP: server: code alignment.
- MINOR: server: Make 'default-server' support 'verify' keyword.
- MINOR: server: Make 'default-server' support 'verifyhost' setting.
- MINOR: server: Make 'default-server' support 'check' keyword.
- MINOR: server: Make 'default-server' support 'track' setting.
- MINOR: server: Make 'default-server' support 'ca-file', 'crl-file' and 'crt' settings.
- MINOR: server: Make 'default-server' support 'redir' keyword.
- MINOR: server: Make 'default-server' support 'observe' keyword.
- MINOR: server: Make 'default-server' support 'cookie' keyword.
- MINOR: server: Make 'default-server' support 'ciphers' keyword.
- MINOR: server: Make 'default-server' support 'tcp-ut' keyword.
- MINOR: server: Make 'default-server' support 'namespace' keyword.
- MINOR: server: Make 'default-server' support 'source' keyword.
- MINOR: server: Make 'default-server' support 'sni' keyword.
- MINOR: server: Make 'default-server' support 'addr' keyword.
- MINOR: server: Make 'default-server' support 'disabled' keyword.
- MINOR: server: Add 'no-agent-check' server keyword.
- DOC: server: Add docs for "server" and "default-server" new "no-*" and other settings.
- MINOR: doc: fix use-server example (imap vs mail)
- BUG/MEDIUM: tcp: don't require privileges to bind to device
- BUILD: make the release script use shortlog for the final changelog
- BUILD: scripts: fix typo in announce-release error message
- CLEANUP: time: curr_sec_ms doesn't need to be exported
- BUG/MEDIUM: server: Wrong server default CRT filenames initialization.
- BUG/MEDIUM: peers: fix buffer overflow control in intdecode.
- BUG/MEDIUM: buffers: Fix how input/output data are injected into buffers
- BUG/MINOR: http: Fix conditions to clean up a txn and to handle the next request
- CLEANUP: http: Remove channel_congested function
- CLEANUP: buffers: Remove buffer_bounce_realign function
- CLEANUP: buffers: Remove buffer_contig_area and buffer_work_area functions
- MINOR: http: remove useless check on HTTP_MSGF_XFER_LEN for the request
- MINOR: http: Add debug messages when HTTP body analyzers are called
- BUG/MEDIUM: http: Fix blocked HTTP/1.0 responses when compression is enabled
- BUG/MINOR: filters: Don't force the stream's wakeup when we wait in flt_end_analyze
- DOC: fix parenthesis and add missing "Example" tags
- DOC: update the contributing file
- DOC: log-format/tcplog/httplog update
- MINOR: config parsing: add warning when log-format/tcplog/httplog is overridden in "defaults" sections
The function buffer_contig_space is buggy and could lead to pernicious bugs
(never hit until now, AFAIK). This function should return the number of bytes
that can be written into the buffer at once (without wrapping).
First, this function is used to inject input data (bi_putblk) and to inject
output data (bo_putblk and bo_inject). But there is no context, so it cannot
decide where the contiguous space should be placed. For input data, it should be
after bi_end(buf) (ie, buf->p + buf->i modulo wrapping calculation). For output
data, it should be after bo_end(buf) (ie, buf->p), and input data are assumed
not to exist (else there is no space at all).
Then, when we need to inject input data, this function does not always return
the right value. And when we need to inject output data, we must be sure to
have no input data at all (buf->i == 0), else the result can also be wrong
(but this is the caller's responsibility, so everything should be fine here).
The buffer can be in 3 different states:
1) no wrapping
<---- o ----><----- i ----->
+------------+------------+-------------+------------+
| |oooooooooooo|iiiiiiiiiiiii|xxxxxxxxxxxx|
+------------+------------+-------------+------------+
^ <contig_space>
p ^ ^
l r
2) input wrapping
...---> <---- o ----><-------- i -------...
+-----+------------+------------+--------------------+
|iiiii|xxxxxxxxxxxx|oooooooooooo|iiiiiiiiiiiiiiiiiiii|
+-----+------------+------------+--------------------+
<contig_space> ^
^ ^ p
l r
3) output wrapping
...------ o ------><----- i -----> <----...
+------------------+-------------+------------+------+
|oooooooooooooooooo|iiiiiiiiiiiii|xxxxxxxxxxxx|oooooo|
+------------------+-------------+------------+------+
^ <contig_space>
p ^ ^
l r
buffer_contig_space returns (r - l). Cases 1 and 3 are correctly handled. But
for the second case, r is wrong. It points to the buffer's end
(buf->data + buf->size) while it should point to the start of output data
(ie, buf->p - buf->o).
To fix the bug, the function has been split. Now, bi_contig_space and
bo_contig_space should be used to know the contiguous space available to insert,
respectively, input data and output data. For bo_contig_space, input data are
assumed not to exist. And the right version is used depending on what we want to
do, as sketched below.
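For illustration, here is a minimal sketch of the two variants, assuming the
1.7-era struct buffer (p: start of input, i: input byte count, o: output byte
count) and its usual helpers; this is an approximation of the logic, not the
literal patch:

    /* contiguous space available to insert input data */
    static inline int bi_contig_space(const struct buffer *buf)
    {
        const char *left  = bi_end(buf);   /* buf->p + buf->i, wrapped */
        const char *right = bo_ptr(buf);   /* buf->p - buf->o, wrapped */

        if (left >= right)                 /* free area does not wrap yet */
            return buf->data + buf->size - left;
        return right - left;               /* input already wrapped */
    }

    /* contiguous space available to insert output data (buf->i == 0) */
    static inline int bo_contig_space(const struct buffer *buf)
    {
        const char *left  = bo_end(buf);   /* buf->p */
        const char *right = bo_ptr(buf);

        if (left >= right)
            return buf->data + buf->size - left;
        return right - left;
    }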
In addition, to clarify the buffer's API, buffer_realign no longer returns a
value, so it now has the same API as buffer_slow_realign.
This patch can be backported to 1.7, 1.6 and 1.5.
This patch adds a 'no-agent-check' setting, supported by both 'default-server'
and 'server' directives, to disable an agent check for a specific server which
would otherwise inherit 'agent-check' as a default value (from a
'default-server' 'agent-check' setting), or, on 'default-server' lines, to
disable 'agent-check' as a default value for any further 'server' declarations.
For instance, provided this configuration:

    default-server agent-check
    server srv1
    server srv2 no-agent-check
    server srv3
    default-server no-agent-check
    server srv4
srv1 and srv3 would have an agent check enabled contrary to srv2 and srv4.
We no longer allocate anything when parsing the 'default-server' 'agent-check'
setting.
This patch makes 'default-server' directives support the 'sni' setting.
A field 'sni_expr' has been added to 'struct server' to temporarily store SNI
expressions as strings during the parsing of both 'default-server' and 'server'
lines. So, to duplicate SNI expressions from a 'default-server' 'sni' setting
for new 'server' instances, we only have to strdup() these strings, as is done
for most of the 'server' settings.
Then, sample expressions are computed by calling sample_parse_expr() (only for
'server' instances), as sketched below.
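A hedged sketch of the inheritance step (variable names illustrative):

    /* when creating a new server from the default-server template:
     * inherit the SNI expression as a plain string; it will only be
     * compiled with sample_parse_expr() for real 'server' instances */
    if (defsrv->sni_expr) {
        newsrv->sni_expr = strdup(defsrv->sni_expr);
        if (!newsrv->sni_expr)
            goto err;          /* out of memory */
    }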
A new function, display_parser_err(), has been added to produce the same error
output as before in case of any error during 'sni' setting parsing.
Should not break anything.
Before this patch, the 'check' setting was only supported by 'server'
directives. This patch makes 'default-server' directives support this setting
too.
A new 'no-check' keyword parser has been implemented to disable this setting
both in 'default-server' and 'server' directives.
Should not break anything.
When SIGUSR1 is received, haproxy enters soft-stop and quits when no
connection remains.
It can happen that the instance remains alive for a long time, depending
on timeouts and traffic. This option ensures that soft-stop won't run
for too long.
Example:

    global
        hard-stop-after 30s   # Once in soft-stop, the instance will remain
                              # alive for at most 30 seconds.
We'll need to differentiate pollers which can report hangup at the same time
as read (POLL_RDHUP) from the other ones, because only those may benefit from
the fd_done_recv() optimization. Epoll has had support for EPOLLRDHUP since
Linux 2.6.17 and has always been used this way in haproxy, so now we only set
the flag once we've observed it at least once in a response. It means that
some initial requests may try to perform a second recv() call, but after the
first closed connection this will be enough to know that the second call is
not needed anymore.
Later we may extend these flags to designate event-triggered pollers.
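As an illustration, the detection reduces to something like this in the
poll()-based poller (a sketch using the generic poller flags):

    /* in the event processing loop: once POLLRDHUP is reported by the
     * system at least once, remember that this poller supports it */
    if (e & POLLRDHUP)
        cur_poller.flags |= HAP_POLL_F_RDHUP;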
A tcp half connection can cause 100% CPU on expiration.
First reproduced with this haproxy configuration :

    global
        tune.bufsize 10485760
    defaults
        timeout server-fin 90s
        timeout client-fin 90s
    backend node2
        mode tcp
        timeout server 900s
        timeout connect 10s
        server def 127.0.0.1:3333
    frontend fe_api
        mode tcp
        timeout client 900s
        bind :1990
        use_backend node2

I.e. with timeout server-fin shorter than timeout server: the backend server
sends data, which is left in haproxy's buffer, then the backend server sends
a FIN packet, which haproxy receives. At this point the session information
is as follows:
0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2
srv=def ts=08 age=1s calls=3 rq[f=848000h,i=0,an=00h,rx=14m58s,wx=,ax=]
rp[f=8004c020h,i=0,an=00h,rx=,wx=14m58s,ax=] s0=[7,0h,fd=6,ex=]
s1=[7,18h,fd=7,ex=] exp=14m58s
rp has set the CF_SHUTR flag; next, the client sends its FIN packet, and the
session information is as follows:
0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2
srv=def ts=08 age=38s calls=4 rq[f=84a020h,i=0,an=00h,rx=,wx=,ax=]
rp[f=8004c020h,i=0,an=00h,rx=1m11s,wx=14m21s,ax=] s0=[7,0h,fd=6,ex=]
s1=[9,10h,fd=7,ex=] exp=1m11s
After waiting 90s, session information is as follows:
0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2
srv=def ts=04 age=4m11s calls=718074391 rq[f=84a020h,i=0,an=00h,rx=,wx=,ax=]
rp[f=8004c020h,i=0,an=00h,rx=?,wx=10m49s,ax=] s0=[7,0h,fd=6,ex=]
s1=[9,10h,fd=7,ex=] exp=? run(nice=0)
cpu information:
6899 root 20 0 112224 21408 4260 R 100.0 0.7 3:04.96 haproxy
Buffering is set up to ensure that there is data in the haproxy buffer, so
that haproxy can receive the FIN packet and set the CF_SHUTR flag. If the
CF_SHUTR flag has been set, the following code does not clear the timeout,
causing 100% cpu:

stream.c:process_stream:

    if (unlikely((res->flags & (CF_SHUTR|CF_READ_TIMEOUT)) == CF_READ_TIMEOUT)) {
        if (si_b->flags & SI_FL_NOHALF)
            si_b->flags |= SI_FL_NOLINGER;
        si_shutr(si_b);
    }
Once the read side has been closed, setting the read timeout does not make
sense. Yet, with or without CF_SHUTR, the read timeout is set:

    if (tick_isset(s->be->timeout.serverfin)) {
        res->rto = s->be->timeout.serverfin;
        res->rex = tick_add(now_ms, res->rto);
    }
After discussion on the mailing list, setting half-closed timeouts the
hard way here doesn't make sense. They should be set only at the moment
the shutdown() is performed. It will also solve a special case which was
already reported of some half-closed timeouts not working when the shutw()
is performed directly at the stream-interface layer (no analyser involved).
Since the stream interface layer cannot know the timeout values, we'll have
to store them directly in the stream interface so that they are used upon
shutw(). This patch does this, fixing the problem.
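A minimal sketch of the idea, assuming the half-closed timeout is copied into
a stream-interface field (called hcto here for illustration):

    /* on shutw(): arm the read timeout of the other direction with the
     * half-closed timeout stored in the stream interface, if any */
    static void stream_int_shutw(struct stream_interface *si)
    {
        struct channel *ic = si_ic(si);

        if (tick_isset(si->hcto)) {
            ic->rto = si->hcto;
            ic->rex = tick_add(now_ms, ic->rto);
        }
        /* ... then perform the actual write shutdown ... */
    }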
An easier reproducer to validate the fix is to keep the huge buffer and
shorten all timeouts, then call it under tcploop server and client, and
wait 3 seconds to see haproxy run at 100% CPU :
    global
        tune.bufsize 10485760
    listen px
        bind :1990
        timeout client 90s
        timeout server 90s
        timeout connect 1s
        timeout server-fin 3s
        timeout client-fin 3s
        server def 127.0.0.1:3333
$ tcploop 3333 L W N20 A P100 F P10000 &
$ tcploop 127.0.0.1:1990 C S10000000 F
"sample-fetch which captures the cipherlist" patch introduce #define
do deal with trace functions only available in openssl > 1.0.2.
Add this #define to libressl and boringssl environment.
Thanks to Piotr Kubaj for postponing and testing with libressl.
SSL_CTX_set_ecdh_auto is declared (when present) with a #define. A simple
#ifdef avoids having to enumerate all the ssl lib variants. The call is a
placebo (no-op) in newer ssl libs. It's ok with openssl 1.0.1, 1.0.2, 1.1.0,
libressl and boringssl.
Thanks to Piotr Kubaj for postponing and testing with libressl.
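Concretely, the guard described above boils down to:

    /* present as a function or macro in older libs, a no-op macro in
     * newer ones; absent altogether elsewhere */
    #ifdef SSL_CTX_set_ecdh_auto
        SSL_CTX_set_ecdh_auto(ctx, 1);
    #endif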
Despite the previous commit working fine in all tests, it's still not
sufficient to completely address the problem. If the connection handler
is called with an event validating an L4 connection but some handshakes
remain (eg: accept-proxy), it will still wake the function up, which
will not report the activity, and will not detect a change once the
handshake is complete, so it will not notify the ->wake() handler.
In fact the only reason why the ->wake() handler is still called here
is because after dropping the last handshake, we try to call ->recv()
and ->send() in turn and change the flags in order to detect data
activity. But if for any reason the data layer is not interested in
reading nor writing, it will not get these events.
A cleaner way to address this is to call the ->wake() handler only
on definitive status changes (shut, error), on real data activity,
and on a complete connection setup, measured as CONNECTED with no
more handshakes pending.
It could be argued that the handshake flags have to be made part of
the condition to set CO_FL_CONNECTED but that would currently break
a part of the health checks. Also a handshake could appear at any
moment even after a connection is established so we'd lose the
ability to detect a second end of handshake.
For now the situation around CO_FL_CONNECTED is not clean :
- session_accept() only sets CO_FL_CONNECTED if there's no pending
handshake ;
- conn_fd_handler() will set it once L4 and L6 are complete, which
will do what session_accept() above refrained from doing even if
an accept_proxy handshake is still pending ;
- ssl_sock_infocbk() and ssl_sock_handshake() consider that a
handshake performed with CO_FL_CONNECTED set is a renegotiation ;
=> they should instead filter on CO_FL_WAIT_L6_CONN
- all ssl_fc_* sample fetch functions wait for CO_FL_CONNECTED before
accepting to fetch information
=> they should also get rid of any pending handshake
- smp_fetch_fc_rcvd_proxy() uses !CO_FL_CONNECTED instead of
CO_FL_ACCEPT_PROXY
- health checks (standard and tcp-checks) don't check for HANDSHAKE
and may report a successful check based on CO_FL_CONNECTED while
not yet done (eg: send buffer full on send_proxy).
This patch aims at solving some of these side effects in a backportable
way before this is reworked in depth :
- we need to call ->wake() to report connection success, measure
connection time, notify that the data layer is ready and update
the data layer after activity ; this has to be done either if
we switch from pending {L4,L6}_CONN to nothing with no handshakes
left, or if we notice some handshakes were pending and are now
done.
- we document that CO_FL_CONNECTED exactly means "L4 connection
setup confirmed at least once, L6 connection setup confirmed
at least once or not necessary, all this regardless of any
possibly remaining handshakes or future L6 negotiations".
This patch also renames CO_FL_CONN_STATUS to the more explicit
CO_FL_NOTIFY_DATA, and works around the previous flag trick consisting
in setting an impossible combination of flags to notify the data layer,
by simply clearing the current flags, as sketched below.
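In conn_fd_handler() terms, the notification condition becomes roughly the
following (a sketch, not the literal patch; 'flags' holds the connection
flags saved when entering the handler):

    /* wake the data layer on definitive status changes, or when a
     * pending L4/L6 setup and/or handshake just completed */
    if (unlikely(conn->flags & (CO_FL_ERROR | CO_FL_SOCK_RD_SH | CO_FL_SOCK_WR_SH)) ||
        ((flags & (CO_FL_WAIT_L4_CONN | CO_FL_WAIT_L6_CONN | CO_FL_HANDSHAKE)) &&
         !(conn->flags & (CO_FL_WAIT_L4_CONN | CO_FL_WAIT_L6_CONN | CO_FL_HANDSHAKE)))) {
        if (conn->data->wake(conn) < 0)
            return;
    }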
This fix should be backported to 1.7, 1.6 and 1.5.
When a filter is used, there are 2 channel analyzers surrounding all the
others, flt_start_analyze and flt_end_analyze. This is the right place to
acquire and release resources used by filters, when needed. In addition, the
last one is used to synchronize both channels, especially for HTTP streams.
We must wait until the analysis is finished on both channels for an HTTP
transaction before restarting it for the next one.
But this part was buggy, leading to unexpected behaviours. First, depending on
which channel ends first, the request or the response can be switched into a
"forward forever" mode. Then, the HTTP transaction can be cleaned up too early,
while processing is still in progress on a channel.
To fix the bug, the flag CF_FLT_ANALYZE has been added. It is set on channels
in flt_start_analyze and is kept as long as at least one filter is still
analyzing the channel. So, we can trigger the channel synchronization once this
flag has been removed from both channels. In addition, the flag TX_WAIT_CLEANUP
has been added on the transaction to know whether the transaction must be
cleaned up or not during channel synchronization. This way, we are sure to
reset everything once all the processing is finished, as sketched below.
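Schematically (flag names as described above; the cleanup call itself is left
as an illustrative placeholder):

    /* in flt_end_analyze(): this channel is done with filter analysis */
    chn->flags &= ~CF_FLT_ANALYZE;

    /* resynchronize only once BOTH channels are done */
    if (!(s->req.flags & CF_FLT_ANALYZE) && !(s->res.flags & CF_FLT_ANALYZE)) {
        /* no filter is processing either side anymore: the HTTP
         * transaction may be cleaned up if it was flagged for it */
        if (s->txn && (s->txn->flags & TX_WAIT_CLEANUP)) {
            /* ... clean up the transaction and restart analysis ... */
        }
    }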
This patch should be backported in 1.7.
This adds 3 new commands to the cli :
- enable dynamic-cookie backend <backend>, which enables dynamic cookies
for a specified backend
- disable dynamic-cookie backend <backend>, which disables dynamic cookies
for a specified backend
- set dynamic-cookie-key backend <backend>, which lets one change the
dynamic cookie secret key, for a specified backend.
This adds a new "dynamic" keyword for the cookie option. If set, a cookie
will be generated for each server (assuming one isn't already provided on
the "server" line), from the IP of the server, the TCP port, and a secret
key provided. To provide the secret key, a new keyword as been added,
"dynamic-cookie-key", for backends.
Example :

    backend bk_web
        balance roundrobin
        dynamic-cookie-key "bla"
        cookie WEBSRV insert dynamic
        server s1 127.0.0.1:80 check
        server s2 192.168.56.1:80 check
This is a first step to be able to dynamically add and remove servers,
without modifying the configuration file, and still have all the load
balancers redirect the traffic to the right server.
Provide a way to generate session cookies, based on the IP address of the
server, the TCP port, and a secret key provided.
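A hedged sketch of the derivation (the hash choice, buffer handling and
variable names are illustrative only, not the exact implementation):

    /* concatenate <secret key> + <server address> + <TCP port>, hash
     * the result, then hex-encode the digest to get a stable
     * per-server cookie value */
    size_t pos = 0;

    memcpy(buf + pos, key, keylen);          pos += keylen;
    memcpy(buf + pos, &addr, addrlen);       pos += addrlen;
    memcpy(buf + pos, &port, sizeof(port));  pos += sizeof(port);

    hash(buf, pos, digest);                  /* e.g. a SHA-1 digest */
    /* hex-encode digest[] into srv->cookie */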
This may be used to output the JSON schema which describes the output of
show info json and show stats json.
The JSON output is without any extra whitespace in order to reduce the
volume of output. For human consumption passing the output through a
pretty printer may be helpful.
e.g.:
$ echo "show schema json" | socat /var/run/haproxy.stat stdio | \
python -m json.tool
The implementation does not generate the schema. Some consideration could
be given to integrating the output of the schema with the output of
typed and json info and stats. In particular the types (u32, s64, etc...)
and tags.
A sample verification of show info json and show stats json using
the schema is as follows. It uses the jsonschema python module:
cat > jschema.py << __EOF__
import json
from jsonschema import validate
from jsonschema.validators import Draft3Validator

with open('schema.txt', 'r') as f:
    schema = json.load(f)
Draft3Validator.check_schema(schema)

with open('instance.txt', 'r') as f:
    instance = json.load(f)
validate(instance, schema, Draft3Validator)
__EOF__
$ echo "show schema json" | socat /var/run/haproxy.stat stdio > schema.txt
$ echo "show info json" | socat /var/run/haproxy.stat stdio > instance.txt
python ./jschema.py
$ echo "show stats json" | socat /var/run/haproxy.stat stdio > instance.txt
python ./jschema.py
Signed-off-by: Simon Horman <horms@verge.net.au>
Add a json parameter to show (info|stat) which will output information
in JSON format. A follow-up patch will add a JSON schema which describes
the format of the JSON output of these commands.
The JSON output is without any extra whitespace in order to reduce the
volume of output. For human consumption passing the output through a
pretty printer may be helpful.
e.g.:
$ echo "show info json" | socat /var/run/haproxy.stat stdio | \
python -m json.tool
STAT_STARTED has been added in order to track whether show output has begun or
not. This is used to allow the JSON output routines to only insert a ","
between elements when needed. I would value any feedback on how this
might be done better.
Signed-off-by: Simon Horman <horms@verge.net.au>
This commit removes the second argument (msgnum) from http_error_message and
changes http_error_message to use s->txn->status / http_get_status_idx for
mapping status codes from 200..504 to HTTP_ERR_200..HTTP_ERR_504 (enum).
This is needed for the http-request tarpit deny_status commit.
This is like the nbsrv() sample fetch function except that it works as
a converter so it can count the number of available servers of a backend
name retrieved using a sample fetch or an environment variable.
Signed-off-by: Nenad Merdanovic <nmerdan@haproxy.com>
This option can be used to enable or to disable (prefixing the option line with
the "no" keyword) the sending of fragmented payloads to agents. By default,
this option is enabled.
These options can be used to enable or to disable (prefixing the option line
with the "no" keyword), respectively, pipelined and asynchronous exchanges
between HAProxy and agents. By default, the pipelining and async options are
enabled.
Now, when a payload is fragmented, the first frame must define the frame type
and the following ones must use the special type SPOE_FRM_T_UNSET. This way, it
is easy to know whether a fragment is the first one or not. Of course, all
frames must still share the same stream-id and frame-id, as sketched below.
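On the decoding side this makes the check trivial (sketch):

    /* the first fragment announces the real frame type; continuations
     * of the same fragmented payload use SPOE_FRM_T_UNSET */
    if (frame_type != SPOE_FRM_T_UNSET) {
        /* first fragment: start handling a new (fragmented) frame */
    }
    else {
        /* continuation: must match the stream-id/frame-id in progress */
    }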
Update SPOA example accordingly.
Now, as for peers, we use an opaque pointer to store information related to the
SPOE filter in the appctx structure. This information is now stored in a
dedicated structure (spoe_appctx) and allocated, using a pool, when the applet
is created.
This removes the dependency between applets and the SPOE filter and avoids
inflating the appctx structure.
Now, HAProxy and agents can announce support for the "pipelining" and/or
"async" capabilities during the HELLO handshake. For now, HAProxy always
announces support for both. In addition, in its HELLO frames, HAProxy adds the
"engine-id" key. It is a unique string that identifies a SPOE engine.
The "pipelining" capability is the ability for a peer to decouple NOTIFY and
ACK frames. This is a symmetrical capability. To be used, it must be supported
by both HAProxy and the agents. Unlike HTTP pipelining, the ACK frames can be
sent in any order, but always on the same TCP connection used for the
corresponding NOTIFY frame.
The "async" capability is similar to pipelining, but here any TCP connection
established between HAProxy and the agent can be used to send ACK frames. If an
agent accepts connections from multiple HAProxy instances, it can use the
"engine-id" value to group TCP connections.
Bug introduced with "removes SSL_CTX_set_ssl_version call and cleanup CTX
creation": ssl_sock_new_ctx was called before the whole bind line was parsed.
The fix consists in separating out the use of default_ctx as the initialization
context of the SSL connection, via bind_conf->initial_ctx. initial_ctx contains
all the necessary parameters before performing the selection of the CTX:
default_ctx is processed like the other ctxs, without unnecessary parameters.
This patch uses boringssl's callback to analyze the ClientHello before any
handshake, in order to extract key signature capabilities.
The certificate with the better signature (ECDSA before RSA) is chosen
transparently, if the client can support it. RSA and ECDSA certificates can
be declared in any order. This makes it possible to set
different ssl and filter parameters with crt-list.
The trash buffers are becoming increasingly complex to deal with due to
the code's modularity allowing some functions to be chained and causing
the same chunk buffers to be used multiple times along the chain, possibly
corrupting each other. In fact the trash buffers were explicitly designed
from scratch not to survive a function call, but string manipulation makes
this impossible most of the time, while not fulfilling the need for
reliable temporary chunks.
Here we introduce the ability to allocate a temporary trash chunk which
is reserved, so that it will not conflict with the trash chunks other
functions use, and will even support reentrant calls (eg: build_logline).
For this, we create a new pool which is exactly the size of a usual chunk
buffer plus the size of the chunk struct, so that these chunks, when
allocated, are exactly the same size as the ones returned by
get_trash_buffer(). Allocation may fail, so callers must check the result,
and they are also responsible for freeing these chunks, as in the sketch
below.
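Typical usage of the new calls looks like this (a sketch; error handling
reduced to the minimum):

    struct chunk *tmp;

    tmp = alloc_trash_chunk();
    if (!tmp)
        return 0;              /* allocation may fail: always check */

    chunk_printf(tmp, "%s: %d", name, value);
    /* tmp survives nested calls that use the regular trash chunks */
    free_trash_chunk(tmp);     /* the caller must release it */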
The code focuses on minimal changes and ease of reliable backporting
because it will be needed in stable versions in order to support next
patch.
The function dns_init_resolvers() is used to initialize the sockets used to
send DNS queries.
This patch gives the function the ability to close a socket before
re-opening it.
[wt: this needs to be backported to 1.7 for next fix]
Right now not only are we limited to 8 bits, but it's mentioned nowhere
and the limit was already reached. In addition, pp_opts (proxy protocol
options) were set to 32 bits while only 3 are needed. So let's swap
these two and group them together to avoid leaving two holes in the
structure, saving 64 bits on 64-bit machines, as sketched below.
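Schematically, in struct server (a sketch of the regrouped fields):

    /* before: flags on 8 bits (already full), pp_opts on 32 bits (3 used) */
    /* after: both on 32 bits, grouped together, no holes on 64-bit */
    unsigned int flags;    /* SRV_F_* server flags */
    unsigned int pp_opts;  /* SRV_PP_* proxy protocol options */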
A recent patch to support BoringSSL caused this warning to appear on
OpenSSL 1.1.0 :
src/ssl_sock.c:3062:4: warning: statement with no effect [-Wunused-value]
It's caused by SSL_CTX_set_ecdh_auto() which is now only a macro testing
that the last argument is zero, and the result is not used here. Let's
just kill it for both versions.
Tested with 0.9.8, 1.0.0, 1.0.1, 1.0.2, 1.1.0. This fix may be backported
to 1.7 if the boringssl fix is as well.
This function was deprecated in 1.1.0, causing this warning :
src/ssl_sock.c:551:3: warning: 'RAND_pseudo_bytes' is deprecated (declared at /opt/openssl-1.1.0/include/openssl/rand.h:47) [-Wdeprecated-declarations]
The man page suggests using RAND_bytes() instead. While the return codes
differ, it turns out that the function was already misused and was
relying on RAND_bytes()'s return code convention instead.
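I.e. the call becomes something like this (sketch; buffer name illustrative):

    unsigned char rnd[16];

    /* RAND_bytes() returns 1 on success; the pre-existing check
     * already assumed this convention */
    if (RAND_bytes(rnd, sizeof(rnd)) != 1)
        return 0;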
The patch was tested on 0.9.8, 1.0.0, 1.0.1, 1.0.2 and 1.1.0.
This fix must be backported to 1.7 and the return code check should
be backported to earlier versions if relevant.