haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-25 08:31:23 +02:00

Author	SHA1	Message	Date
Willy Tarreau	12e1027aa6	MINOR: tools: make url2ipv4 return the exact number of bytes parsed The function's return value is currently used as a boolean but we'll need it to return the number of bytes parsed. Right now it returns it minus one, unless the last char doesn't match what is permitted. Let's update this to make it more usable.	2021-03-25 15:18:47 +01:00
Christopher Faulet	a9a9e9aac9	BUG/MEDIUM: thread: Fix a deadlock if an isolated thread is marked as harmless If an isolated thread is marked as harmless, it will loop forever in thread_harmless_till_end() waiting no threads are isolated anymore. It never happens because the current thread is isolated. To fix the bug, we exclude the current thread for the test. We now wait for all other threads to leave the rendez-vous point. This bug only seems to occurr if HAProxy is compiled with DEBUG_UAF, when pool_gc() is called. pool_gc() isolates the current thread, while pool_free_area() set the thread as harmless when munmap is called. This patch must be backported as far as 2.0.	2021-03-25 14:31:50 +01:00
Amaury Denoyelle	65bf600cc3	BUG/MEDIUM: release lock on idle conn killing on reached pool high count Release the lock before calling mux destroy in connect_server when trying to kill an idle connection because the pool high count has been reached. The lock must be released because the mux destroy will call srv_release_conn which also takes the lock to remove the connection from the tree. As the connection was already deleted from the tree at this stage, it is safe to release the lock, and the removal in srv_release_conn will be a noop. It does not need to be backported because it is only present in the current release. It has been introduced by 5c7086f6b06d546c5800486ed9e4bb8d8d471e09 MEDIUM: connection: protect idle conn lists with locks	2021-03-25 11:55:35 +01:00
Olivier Houchard	c23b33764e	BUG/MEDIUM: fd: Take the fd_mig_lock when closing if no DWCAS is available. In fd_delete(), if we're running with no double-width cas, take the fd_mig_lock before setting thread_mask to 0 to make sure that another thread calling fd_set_running() won't miss the new value of thread_mask and set its bit in running_mask after we checked it. This should be backported to 2.2 as part of the series fixing fd_delete().	2021-03-25 07:34:35 +01:00
Willy Tarreau	2d4232901c	CLEANUP: fd: slightly simplify up _fd_delete_orphan() Let's release the port range earlier so that all zeroes are grouped together and that the compiler can slightly simplify the code.	2021-03-24 17:17:21 +01:00
Willy Tarreau	2c3f9818e8	BUG/MEDIUM: fd: do not wait on FD removal in fd_delete() Christopher discovered an issue mostly affecting 2.2 and to a less extent 2.3 and above, which is that it's possible to deadlock a soft-stop when several threads are using a same listener: thread1 thread2 unbind_listener() fd_set_running() lock(listener) listener_accept() fd_delete() lock(listener) while (running_mask); -----> deadlock unlock(listener) This simple case disappeared from 2.3 due to the removal of some locked operations at the end of listener_accept() on the regular path, but the architectural problem is still here and caused by a lock inversion built around the loop on running_mask in fd_clr_running_excl(), because there are situations where the caller of fd_delete() may hold a lock that is preventing other threads from dropping their bit in running_mask. The real need here is to make sure the last user deletes the FD. We have all we need to know the last one, it's the one calling fd_clr_running() last, or entering fd_delete() last, both of which can be summed up as the last one calling fd_clr_running() if fd_delete() calls fd_clr_running() at the end. And we can prevent new threads from appearing in running_mask by removing their bits in thread_mask. So what this patch does is that it sets the running_mask for the thread in fd_delete(), clears the thread_mask, thus marking the FD as orphaned, then clears the running mask again, and completes the deletion if it was the last one. If it was not, another thread will pass through fd_clr_running and will complete the deletion of the FD. The bug is easily reproducible in 2.2 under high connection rates during soft close. When the old process stops its listener, occasionally two threads will deadlock and the old process will then be killed by the watchdog. It's strongly believed that similar situations do exist in 2.3 and 2.4 (e.g. if the removal attempt happens during resume_listener() called from listener_accept()) but if so, they should be much harder to trigger. This should be backported to 2.2 as the issue appeared with the FD migration. It requires previous patches "fd: make fd_clr_running() return the remaining running mask" and "MINOR: fd: remove the unneeded running bit from fd_insert()". Notes for backport: in 2.2, the fd_dodelete() function requires an extra argument "do_close" indicating whether we want to remove and close the FD (fd_delete) or just delete it (fd_remove). While this information is not conveyed along the chain, we know that late calls always imply do_close=1 become do_close=0 exclusively results from fd_remove() which is only used by the config parser and the master, both of which are single-threaded, hence are always the last ones in the running_mask. Thus it is safe to assume that a postponed FD deletion always implies do_close=1. Thanks to Olivier for his help in designing this optimal solution.	2021-03-24 17:17:21 +01:00
Christopher Faulet	1e8433f594	BUG/MEDIUM: lua: Always init the lua stack before referencing the context When a lua context is allocated, its stack must be initialized to NULL before attaching it to its owner (task, stream or applet). Otherwise, if the watchdog is fired before the stack is really created, that may lead to a segfault because we try to dump the traceback of an uninitialized lua stack. It is easy to trigger this bug if a lua script do a blocking call while another thread try to initialize a new lua context. Because of the global lua lock, the init is blocked before the stack creation. Of course, it only happens if the script is executed in the shared global context. This patch must be backported as far as 2.0.	2021-03-24 16:36:36 +01:00
Christopher Faulet	cc2c4f8f4c	BUG/MEDIUM: debug/lua: Use internal hlua function to dump the lua traceback The commit reverts following commits: * 83926a04 BUG/MEDIUM: debug/lua: Don't dump the lua stack if not dumpable * a61789a1 MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Instead of relying on a Lua function to print the lua traceback into the debugger, we are now using our own internal function (hlua_traceback()). This one does not allocate memory and use a chunk instead. This avoids any issue with a possible deadlock in the memory allocator because the thread processing was interrupted during a memory allocation. This patch relies on the commit "BUG/MEDIUM: debug/lua: Use internal hlua function to dump the lua traceback". Both must be backported wherever the patches above are backported, thus as far as 2.0	2021-03-24 16:35:23 +01:00
Christopher Faulet	d09cc519bd	MINOR: lua: Slightly improve function dumping the lua traceback The separator string is now configurable, passing it as parameter when the function is called. In addition, the message have been slightly changed to be a bit more readable.	2021-03-24 16:33:26 +01:00
Ilya Shipitsin	a0fd35b054	BUILD: ssl: guard ecdh functions with SSL_CTX_set_tmp_ecdh macro let us use feature macro SSL_CTX_set_tmp_ecdh instead of comparing openssl version	2021-03-24 09:52:37 +01:00
Remi Tricot-Le Breton	fb00f31af4	BUG/MINOR: ssl: Prevent disk access when using "add ssl crt-list" If an unknown CA file was first mentioned in an "add ssl crt-list" CLI command, it would result in a call to X509_STORE_load_locations which performs a disk access which is forbidden during runtime. The same would happen if a "ca-verify-file" or "crl-file" was specified. This was due to the fact that the crt-list file parsing and the crt-list related CLI commands parsing use the same functions. The patch simply adds a new parameter to all the ssl_bind parsing functions so that they know if the call is made during init or by the CLI, and the ssl_store_load_locations function can then reject any new cafile_entry creation coming from a CLI call. It can be backported as far as 2.2.	2021-03-23 19:29:46 +01:00
Willy Tarreau	f23b1bc534	BUILD: tools: fix build error with new PA_O_DEFAULT_DGRAM Previous commit 69ba35146 ("MINOR: tools: introduce new option PA_O_DEFAULT_DGRAM on str2sa_range.") managed to introduce a parenthesis imbalance that broke the build. No backport is needed.	2021-03-23 18:38:13 +01:00
Emeric Brun	69ba35146f	MINOR: tools: introduce new option PA_O_DEFAULT_DGRAM on str2sa_range. str2sa_range function options PA_O_DGRAM and PA_O_STREAM are used to define the supported address types but also to set the default type if it is not explicit. If the used address support both STREAM and DGRAM, the default was always set to STREAM. This patch introduce a new option PA_O_DEFAULT_DGRAM to force the default to DGRAM type if it is not explicit in the address field and both STREAM and DGRAM are supported. If only DGRAM or only STREAM is supported, it continues to be considered as the default.	2021-03-23 15:32:22 +01:00
Willy Tarreau	8cc586c73f	BUG/MEDIUM: freq_ctr/threads: use the global_now_ms variable In commit a1ecbca0a ("BUG/MINOR: freq_ctr/threads: make use of the last updated global time"), for period-based counters, the millisecond part of the global_now variable was used as the date for the new period. But it's wrong, it only works with sub-second periods as it wraps every second, and for other periods the counters never rotate anymore. Let's make use of the newly introduced global_now_ms variable instead, which contains the global monotonic time expressed in milliseconds. This patch needs to be backported wherever the patch above is backported. It depends on previous commit "MINOR: time: also provide a global, monotonic global_now_ms timer".	2021-03-23 09:03:37 +01:00
Willy Tarreau	6064b34be0	MINOR: time: also provide a global, monotonic global_now_ms timer The period-based freq counters need the global date in milliseconds, so better calculate it and expose it rather than letting all call places incorrectly retrieve it. Here what we do is that we maintain a new globally monotonic timer, global_now_ms, which ought to be very close to the global_now one, but maintains the monotonic approach of now_ms between all threads in that global_now_ms is always ahead of any now_ms. This patch is made simple to ease backporting (it will be needed for a subsequent fix), but it also opens the way to some simplifications on the time handling: instead of computing the local time and trying to force it to the global one, we should soon be able to proceed in the opposite way, that is computing the new global time an making the local one just the latest snapshot of it. This will bring the benefit of making sure that the global time is always ahead of the local one.	2021-03-23 09:01:37 +01:00
Willy Tarreau	e44989369d	CLEANUP: quic: use pool_zalloc() instead of pool_alloc+memset Two places used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:20:21 +01:00
Willy Tarreau	6922e550eb	CLEANUP: tcpcheck: use pool_zalloc() instead of pool_alloc+memset Two places used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:20:03 +01:00
Willy Tarreau	f208ac0616	CLEANUP: ssl: use pool_zalloc() in ssl_init_keylog() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:19:48 +01:00
Willy Tarreau	70490ebb12	CLEANUP: resolvers: use pool_zalloc() in resolv_link_resolution() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:19:28 +01:00
Willy Tarreau	3ab0a0bc88	CLEANUP: mailers: use pool_zalloc() in enqueue_one_email_alert() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:19:13 +01:00
Willy Tarreau	ec4cfc3835	CLEANUP: frontend: use pool_zalloc() in frontend_accept() The capture buffers were allocated then zeroed, let's have the allocator do it.	2021-03-22 23:18:54 +01:00
Willy Tarreau	c9ef9bc9a5	CLEANUP: spoe: use pool_zalloc() instead of pool_alloc+memset Two places used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:18:26 +01:00
Willy Tarreau	1bbec3883a	CLEANUP: filters: use pool_zalloc() in flt_stream_add_filter() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:17:56 +01:00
Willy Tarreau	d68d4f1002	MEDIUM: dynbuf: remove last usages of b_alloc_margin() The function's purpose used to be to fail a buffer allocation if that allocation wouldn't result in leaving some buffers available. Thus, some allocations could succeed and others fail for the sole purpose of trying to provide 2 buffers at once to process_stream(). But things have changed a lot with 1.7 breaking the promise that process_stream() would always succeed with only two buffers, and later the thread-local pool caches that keep certain buffers available that are not accounted for in the global pool so that local allocators cannot guess anything from the number of currently available pools. Let's just replace all last uses of b_alloc_margin() with b_alloc() once for all.	2021-03-22 16:27:59 +01:00
Willy Tarreau	f499f50c8f	CLEANUP: l7-retries: do not test the buffer before calling b_alloc() The return value is enough now to know if the allocation succeeded or failed.	2021-03-22 16:17:37 +01:00
Willy Tarreau	862ad82f22	CLEANUP: compression: do not test for buffer before calling b_alloc() Now we know the function is idempotent, we don't need to run the preliminary test anymore.	2021-03-22 16:16:22 +01:00
Willy Tarreau	b454e908e5	MINOR: ssl: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Willy Tarreau	acc5b011e5	MINOR: cache: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Willy Tarreau	18f43d85a0	MINOR: fcgi-app: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Willy Tarreau	f1a91292dc	MINOR: spoe: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any real benefit, it only avoids the area being poisonned before being zeroed. Ideally a pool_calloc() function should be provided for this.	2021-03-22 15:35:53 +01:00
Willy Tarreau	5bfeb2139b	MINOR: compression: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Amaury Denoyelle	3b1c9a39fd	CLEANUP: mark defproxy as const on parse tune.fail-alloc This fixes a gcc warning about a missing const on defproxy for mem_parse_global_fail_alloc. This is needed since the commit : 018251667e4c95478ce0026f4d700e0420f8ce24 CLEANUP: config: make the cfg_keyword parsers take a const for the defproxy	2021-03-22 11:50:31 +01:00
Ilya Shipitsin	ba13f16aa2	CLEANUP: assorted typo fixes in the code and comments This is 21st iteration of typo fixes	2021-03-20 09:28:58 +01:00
Olivier Houchard	26c51097d8	MEDIUM: quic: Fix build. Put the ) at the right place. This should fix github issue #1190.	2021-03-19 20:09:22 +01:00
Olivier Houchard	7ab6d8bdf3	MEDIUM: quic: Fix build. Spell conn_xprt_start() correctly. This should fix github issue #1189.	2021-03-19 19:48:53 +01:00
Christopher Faulet	83926a04fe	BUG/MEDIUM: debug/lua: Don't dump the lua stack if not dumpable When we try to dump the stack of a lua context, if it is not dumpable, nothing is performed and a message is emitted instead. This happens when a lua execution was interrupted inside a non-reentrant part. This patch depends on following commit : * MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Thanks to this patch, we avoid a possible deadllock if the lua is interrupted by the watchdog in the lua memory allocator, because realloc() is not async-signal-safe. Both patches must be backported as far as 2.0.	2021-03-19 16:19:59 +01:00
Christopher Faulet	a61789a1d6	MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Some parts of the Lua are non-reentrant. We must be sure to carefully track these parts to not dump the lua stack when it is interrupted inside such parts. For now, we only identified the custom lua allocator. If the thread is interrupted during the memory allocation, we must not try to print the lua stack wich also allocate memory. Indeed, realloc() is not async-signal-safe. In this patch we introduce a thread-local counter. It is incremented before entering in a non-reentrant part and decremented when exiting. It is only performed in hlua_alloc() for now.	2021-03-19 16:16:23 +01:00
Christopher Faulet	a561ffb978	CLEANUP: tcp-rules: Fix a typo in error messages about expect-netscaler-cip It was misspelled (expect-netscaler-ip instead of expect-netscaler-cip). 2 commits are concerned : * db67b0ed7 MINOR: tcp-rules: suggest approaching action names on mismatch * 72d012fbd CLEANUP: tcp-rules: add missing actions in the tcp-request error message The first one will not be backported, but the second one was backported as far as 1.8. Thus this one may also be backported, but only the 2nd part about the list of accepted keywords.	2021-03-19 15:41:16 +01:00
Olivier Houchard	dae6975498	MINOR: muxes: garbage collect the reset() method. Now that connections aren't being reused when they failed, remove the reset() method. It was unimplemented anywhere, except for H1 where it did nothing, anyway.	2021-03-19 15:33:04 +01:00
Olivier Houchard	bc5ce9201a	MEDIUM: connections: Implement a start() method in ssl_sock. Add a start() method to ssl_sock. It is responsible with initiating the SSL handshake, currently by just scheduling the tasklet, instead of doing it in the init() method, when all the XPRT may not have been initialized.	2021-03-19 15:33:04 +01:00
Olivier Houchard	d54ede7d08	MEDIUM: connections: Implement a start() method for xprt_handshake. Add a start_method to xprt_handshake. It schedules the tasklet that does the handshake. This used to be done in xprt_handshake_add_xprt(), but that's a much better place.	2021-03-19 15:33:04 +01:00
Olivier Houchard	1b3c931bff	MEDIUM: connections: Introduce a new XPRT method, start(). Introduce a new XPRT method, start(). The init() method will now only initialize whatever is needed for the XPRT to run, but any action the XPRT has to do before being ready, such as handshakes, will be done in the new start() method. That way, we will be sure the full stack of xprt will be initialized before attempting to do anything. The init() call is also moved to conn_prepare(). There's no longer any reason to wait for the ctrl to be ready, any action will be deferred until start(), anyway. This means conn_xprt_init() is no longer needed.	2021-03-19 15:33:04 +01:00
Olivier Houchard	ca1a57f022	MINOR: raw_sock: Add a close method. Add a close() method, that explicitely cancels any subscription on the connection, in preparation for future evolutions.	2021-03-19 15:33:04 +01:00
Emeric Brun	8af3bb0abf	BUG/MINOR: protocol: add missing support of dgram unix socket. The proto "uxdg" (UNIX DGRAM) was not declared, causing an error trying to put a socket unix on "dgram-bind" into a log-forward section. This patch introduces the missing "uxdg" protocol by adding proto_uxdg.c which was fully created based on the code available for the other protocols. This patch should be backported to version 2.3 and above.	2021-03-18 18:30:29 +01:00
Amaury Denoyelle	304672320e	MINOR: server: support keyword proto in 'add server' cli Allow to specify the mux proto for a dynamic server. It must be compatible with the backend mode to be accepted. The reg-tests has been extended for this error case.	2021-03-18 16:22:10 +01:00
Amaury Denoyelle	fc465a54fd	MINOR: server: enable standard options for dynamic servers Enable a subset of server options to be used as keywords on the CLI command 'add server'. These options are safe and can be applied flawlessly for a dynamic server.	2021-03-18 16:22:10 +01:00
Amaury Denoyelle	f99f77a500	MEDIUM: server: implement 'add server' cli command Add a new cli command 'add server'. This command is used to create a new server at runtime attached on an existing backend. The syntax is the following one : $ add server <be_name>/<sv_name> [<kws>...] This command is only available through experimental mode for the moment. Currently, no server keywords are supported. They will be activated individually when deemed properly functional and safe. Another limitation is put on the backend load-balancing algorithm. The algorithm must use consistent hashing to guarantee a minimal reallocation of existing connections on the new server insertion.	2021-03-18 15:52:07 +01:00
Amaury Denoyelle	216a1ce3b9	MINOR: stats: export function to allocate extra proxy counters Remove static qualifier on stats_allocate_proxy_counters_internal. This function will be used to allocate extra counters at runtime for dynamic servers.	2021-03-18 15:52:07 +01:00
Amaury Denoyelle	76e10e78bb	MINOR: server: prepare parsing for dynamic servers Prepare the server parsing API to support dynamic servers. - define a new parsing flag to be used for dynamic servers - each keyword contains a new field dynamic_ok to indicate if it can be used for a dynamic server. For now, no keyword are supported. - do not copy settings from the default server for a new dynamic server. - a dynamic server is created in a maintenance mode and requires an explicit 'enable server' command. - a new server flag named SRV_F_DYNAMIC is created. This flag is set for all servers created at runtime. It might be useful later, for example to know if a server can be purged.	2021-03-18 15:51:12 +01:00
Amaury Denoyelle	30c0537f5a	REORG: server: use flags for parse_server Modify the API of parse_server function. Use flags to describe the type of the parsed server instead of discrete arguments. These flags can be used to specify if a server/default-server/server-template is parsed. Additional parameters are also specified (parsing of the address required, resolve of a name must be done immediately). It is now unneeded to use strcmp on args[0] in parse_server. Also, the calls to parse_server are more explicit thanks to the flags.	2021-03-18 15:37:05 +01:00

... 99 100 101 102 103 ...

16131 Commits