haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-23 23:51:19 +02:00

Author	SHA1	Message	Date
Willy Tarreau	9c768fdca1	BUG/MEDIUM: http: don't store url_decode() result in the samples's length By convenience or laziness we used to store url_decode()'s return code into smp->data.u.str.data. The result checks applied there compare it to 0 while it's now unsigned since commit 843b7cb ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	6e27be1a5d	BUG/MEDIUM: http: don't store exp_replace() result in the trash's length By convenience or laziness we used to store exp_replace()'s return code into trash.data. The result checks applied there compare trash.data to -1 while it's now unsigned since commit 843b7cb ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct "). Let's clean this up and test the result itself without storing it first. No backport is needed.	2018-08-22 05:16:32 +02:00
Willy Tarreau	5f6333caca	BUG/MINOR: chunks: do not store -1 into chunk_printf() in case of error Since commit 843b7cb ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct") a chunk length is unsigned so we can't reliably store -1 and check for negative values in the caller. Only one such location was found in proto_http's http-request auth rules (which cannot realistically fail). No backport is needed.	2018-08-22 05:16:31 +02:00
Willy Tarreau	49725a0977	BUG/MEDIUM: check/threads: do not involve the rendez-vous point for status updates thread_isolate() is currently being called with the server lock held. This is not acceptable because it prevents other threads from reaching the rendez-vous point. Now that the LB algos are thread-safe, let's get rid of this call. No backport is nedeed.	2018-08-21 19:54:09 +02:00
Willy Tarreau	1b87748ff5	BUG/MEDIUM: lb/threads: always properly lock LB algorithms on maintenance operations Since commit 3ff577e ("MAJOR: server: make server state changes synchronous again"), srv_update_status() calls the various maintenance operations of the LB algorithms (->set_server_up, ->set_server_down, ->update_server_weight()). These ones are called with a single thread guaranteed by the rendez-vous point, so the fact that they're lacking some locks has no effect. However we'll need to remove the rendez-vous point so we have to take care of properly locking all the LB algos. The comments have been properly updated on the various functions to mention their locking expectations. All these functions are called with the server lock held, and all of them now support concurrent calls by using the lbprm's lock. This fix doesn't need to be backported at the moment, though if any check-specific issue surfaced in 1.8, it could make sense to reuse it.	2018-08-21 19:44:53 +02:00
Willy Tarreau	deca26c452	BUG/MAJOR: queue/threads: make pendconn_redistribute not lock the server Since commit 3ff577e ("MAJOR: server: make server state changes synchronous again"), srv_update_status() is called with the server lock held. It calls (among others) pendconn_redistribute() which used to take this lock, causing CPU loops by default, or crashes if build with -DDEBUG_THREAD. Since this function is not called from any other place anymore, it doesn't require the lock on its own so let's simply drop it from there. No backport is needed, this is 1.9-specific.	2018-08-21 18:11:03 +02:00
Olivier Houchard	80c56790d9	BUG/MEDIUM: stream_interface: Call the wake callback after sending. If we subscribed to send, and the callback is called, call the wake callback after, so that process_stream() may be woken up if needed. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:57 +02:00
Olivier Houchard	fab7c7e91c	BUG/MEDIUM: H2: Activate polling after successful h2_snd_buf(). Make sure h2_send() is called after h2_snd_buf() by activating polling. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:57 +02:00
Olivier Houchard	a6ff035770	BUG/MEDIUM: stream-int: Check if the conn_stream exist in si_cs_io_cb. It is possible that the conn_stream gets detached from the stream_interface, and as it subscribed to the wait list, si_cs_io_cb() gets called anyway, so make sure we have a conn_stream before attempting to send more data. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:54 +02:00
Olivier Houchard	18a85fe602	BUG/MEDIUM: streams: Don't forget to remove the si from the wait list. When freeing the stream, make sure we remove the stream interfaces from the wait lists, in case it was in there. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:33 +02:00
Willy Tarreau	3bcc2699ba	BUG/MEDIUM: cli/threads: protect some server commands against concurrent operations The server-specific CLI commands "set weight", "set maxconn", "disable agent", "enable agent", "disable health", "enable health", "disable server" and "enable server" were not protected against concurrent accesses. Now they take the server lock around the sensitive part. This patch must be backported to 1.8.	2018-08-21 15:35:31 +02:00
Willy Tarreau	46b7f53ad9	DOC: server/threads: document which functions need to be called with/without locks At the moment it's totally unclear while reading the server's code which functions require to be called with the server lock held and which ones grab it and cannot be called this way. This commit simply inventories all of them to indicate what is detected depending on how these functions use the struct server. Only functions used at runtime were checked, those dedicated to config parsing were skipped. Doing so already has uncovered a few bugs on some CLI actions.	2018-08-21 14:58:25 +02:00
Willy Tarreau	a275a3710e	BUG/MEDIUM: cli/threads: protect all "proxy" commands against concurrent updates The proxy-related commands like "{enable\|disable\|shutdown} frontend", "{enable\|disable} dynamic-cookie", "set dynamic-cookie-key" were not protected against concurrent accesses making their use dangerous with threads. This patch must be backported to 1.8.	2018-08-21 14:58:25 +02:00
Willy Tarreau	eeba36b3af	BUG/MEDIUM: server: update our local state before propagating changes Commit 3ff577e ("MAJOR: server: make server state changes synchronous again") reintroduced synchronous server state changes. However, during the previous change from synchronous to asynchronous, the server state propagation was placed at the end of the function to ease the code changes, and the commit above didn't put it back at its place. This has resulted in propagated states to be incomplete. For example, making a server leave maintenance would make it up but would leave its tracking servers down because they see their tracked server is still down. Let's just move the status update right to its place. It also adds the benefit of reporting state changes in the order they appear and not in reverse. No backport is needed.	2018-08-21 08:29:25 +02:00
Cyril Bont�	7ee465f1ad	BUG/MINOR: lua: fix extra 500ms added to socket timeouts Since commit #56cc12509, haproxy accepts double values for timeouts. The value is then converted to milliseconds before being rounded up and cast to int. The issue is that to round up the value, a constant value of 0.5 is added to it, but too early in the conversion, resulting in an additional 500ms to the value. We are talking about a precision of 1ms, so we can safely get rid of this rounding trick and adjust resulting timeouts equal to 0 to a minimum of 1ms. This patch is specific to the 1.9 branch and doesn't require to be backported.	2018-08-19 22:11:28 +02:00
Cyril Bont�	7bb6345497	BUG/MEDIUM: lua: socket timeouts are not applied Sachin Shetty reported that socket timeouts set in LUA code have no effect. Indeed, connect timeout is never modified and is always set to its default, set to 5 seconds. Currently, this patch will apply the specified timeout value to the connect timeout. For the read and write timeouts, the issue is that the timeout is updated but the expiration dates were not updated. This patch should be backported up to the 1.6 branch.	2018-08-18 00:23:52 +02:00
Olivier Houchard	19bdf2428d	MINOR: tasks: Don't special-case when nbthreads == 1 Instead of checking if nbthreads == 1, just and thread_mask with all_threads_mask to know if we're supposed to add the task to the local or the global runqueue.	2018-08-17 14:50:37 +02:00
Bertrand Jacquin	a25282bb39	DOC: ssl: Use consistent naming for TLS protocols In most cases, "TLSv1.x" naming is used across and documentation, lazy people tend to grep too much and may not find what they are looking for. Fixing people is hard.	2018-08-16 20:20:26 +02:00
Emeric Brun	271022150d	BUG/MINOR: map: fix map_regm with backref Due to a cascade of get_trash_chunk calls the sample is corrupted when we want to read it. The fix consist to use a temporary chunk to copy the sample value and use it. [wt: for 1.8 and older, a backport was successfully tested here : https://www.mail-archive.com/haproxy@formilux.org/msg30694.html]	2018-08-16 19:44:04 +02:00
Emeric Brun	e1b4ed4352	BUG/MEDIUM: ssl: loading dh param from certifile causes unpredictable error. If the dh parameter is not found, the openssl's error global stack was not correctly cleared causing unpredictable error during the following parsing (chain cert parsing for instance). This patch should be backported in 1.8 (and perhaps 1.7)	2018-08-16 19:36:08 +02:00
Emeric Brun	eb155b6ca6	BUG/MEDIUM: ssl: fix missing error loading a keytype cert from a bundle. If there was an issue loading a keytype's part of a bundle, the bundle was implicitly ignored without errors. This patch should be backported in 1.8 (and perhaps 1.7)	2018-08-16 19:36:06 +02:00
Olivier Houchard	fde2a09a15	BUG/MEDIUM: sessions: Don't use t->state. In session_expire_embryonic(), don't use t->state, use the "state" argument instead, as t->state has been cleaned before we're being called.	2018-08-16 19:25:56 +02:00
Olivier Houchard	d8b7a4701d	BUG/MEDIUM: tasks: Don't insert in the global rqueue if nbthread == 1 Make sure we don't insert a task in the global run queue if nbthread == 1, as, as an optimisation, we avoid reading from it if nbthread == 1.	2018-08-16 19:25:46 +02:00
Olivier Houchard	5c110b924e	MINOR: checks: Add event_srv_chk_io(). In checks, introduce event_srv_chk_io() as a callback to be called if data can be sent again, instead of abusing event_srv_chk_w.	2018-08-16 17:29:54 +02:00
Olivier Houchard	29fb89dc5e	MINOR: mux_h2: Don't use h2_send() as a callback. Instead of using h2_send() directly as a callback, introcude h2_io_cb(), that will call h2_send() if it is possible to send data.	2018-08-16 17:29:54 +02:00
Olivier Houchard	8f0b4c66f5	MINOR: stream_interface: Give stream_interface its own wait_list. Instead of just using the conn_stream wait_list, give the stream_interface its own. When the conn_stream will have its own buffers, the stream_interface may have to wait on it.	2018-08-16 17:29:54 +02:00
Olivier Houchard	91894cbf4c	MINOR: stream_interface: Don't use si_cs_send() as a task handler. Instead of using si_cs_send() as a task handler, define a new function, si_cs_io_cb(), and give si_cs_send() its original prototype. Right now si_cs_io_cb() just handles send, but later it'll handle recv() too.	2018-08-16 17:29:54 +02:00
Olivier Houchard	e1c6dbcd70	MINOR: connections/mux: Add the wait reason(s) to wait_list. Add a new element to the wait_list, that let us know which event(s) we are waiting on.	2018-08-16 17:29:53 +02:00
Olivier Houchard	ed0f207ef5	MINOR: connections: Get rid of txbuf. Remove txbuf from conn_stream. It is not used yet, and its only user will probably be the mux_h2, so it will be better suited in the struct h2s.	2018-08-16 17:29:51 +02:00
Olivier Houchard	638b799b09	MINOR: connections: Move rxbuf from the conn_stream to the h2s. As the mux_h2 is the only user of rxbuf, move it to the struct h2s, instead of conn_stream.	2018-08-16 17:28:11 +02:00
Olivier Houchard	511efeae7e	MINOR: connections: Make rcv_buf mandatory and nuke cs_recv(). Reintroduce h2_rcv_buf(), right now it just does what cs_recv() did, but should be modified later.	2018-08-16 17:23:44 +02:00
Emeric Brun	77e8919fc6	BUG/MINOR: ssl: empty connections reported as errors. Empty connection is reported as handshake error even if dont-log-null is specified. This bug affect is a regression du to: BUILD: ssl: fix to build (again) with boringssl New openssl 1.1.1 defines OPENSSL_NO_HEARTBEATS as boring ssl so the test was replaced by OPENSSL_IS_BORINGSSL This fix should be backported on 1.8	2018-08-16 11:59:59 +02:00
Patrick Hemmer	248cb4c503	MEDIUM: queue: adjust position based on priority-class and priority-offset The priority values are used when connections are queued to determine which connections should be served first. The lowest priority class is served first. When multiple requests from the same class are found, the earliest (according to queue_time + offset) is served first. The queue offsets can span over roughly 17 minutes after which the offsets will wrap around. This allows up to 8 minutes spent in the queue with no reordering.	2018-08-10 15:06:48 +02:00
Patrick Hemmer	268a707a3d	MEDIUM: add set-priority-class and set-priority-offset This adds the set-priority-class and set-priority-offset actions to http-request and tcp-request content. At this point they are not used yet, which is the purpose of the next commit, but all the logic to set and clear the values is there.	2018-08-10 15:06:31 +02:00
Patrick Hemmer	0355dabd7c	MINOR: queue: replace the linked list with a tree We'll need trees to manage the queues by priorities. This change replaces the list with a tree based on a single key. It's effectively a list but allows us to get rid of the list management right now.	2018-08-10 15:06:27 +02:00
Patrick Hemmer	da282f4a8f	MINOR: queue: store the queue index in the stream when enqueuing We store the queue index in the stream and check it on dequeueing to figure how many entries were processed in between. This way we'll be able to count the elements that may later be added before ours.	2018-08-10 15:06:25 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00
Willy Tarreau	66425e31b5	MINOR: queue: make sure the pendconn is released before logging We'll soon need to rely on the pendconn position at the time of dequeuing to figure the position a stream took in the queue. Usually it's not a problem since pendconn_free() is called once the connection starts, but it will make a difference for failed dequeues (eg: queue timeout reached). Thus it's important to call pendconn_free() before logging in cases we are not certain whether it was already performed, and to call pendconn_unlink() after we know the pendconn will not be used so that we collect the queue state as accurately as possible. As a benefit it will also make the server's and backend's queues count more accurate in these cases.	2018-08-10 15:04:08 +02:00
Christopher Faulet	7ce0c891ab	MEDIUM: mux: Use the mux protocol specified on bind/server lines To do so, mux choices are split to handle incoming and outgoing connections in a different way. The protocol specified on the bind/server line is used in priority. Then, for frontend connections, the ALPN is retrieved and used to choose the best mux. For backend connection, there is no ALPN. Finaly, if no protocol is specified and no protocol matches the ALPN, we fall back on a default mux, choosing in priority the first mux with exactly the same mode.	2018-08-08 10:42:08 +02:00
Christopher Faulet	8ed0a3e32a	MINOR: mux/server: Add 'proto' keyword to force the multiplexer's protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the server's definition.	2018-08-08 10:42:08 +02:00
Christopher Faulet	a717b99284	MINOR: mux/frontend: Add 'proto' keyword to force the mux protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the proxy's definition.	2018-08-08 10:41:11 +02:00
Christopher Faulet	7c42eacbe9	BUG/MEDIUM: stream_int: Don't check CO_FL_SOCK_RD_SH flag to trigger cs receive It is mandatory to be sure to process data blocked in the RX buffer of the conn_stream while the shutr/read0 was already processed. The stream interface doesn't need to rely on this flags because it already tests CS_FL_EOS.	2018-08-08 10:41:11 +02:00
Willy Tarreau	91c2826e1d	CLEANUP: server: remove the update list and the update lock These ones are not more used, let's get rid of them.	2018-08-08 09:57:45 +02:00
Willy Tarreau	3ff577e165	MAJOR: server: make server state changes synchronous again Now we try to synchronously push updates as they come using the new rdv point, so that the call to the server update function from the main poll loop is not needed anymore. It further reduces the apparent latency in the health checks as the response time almost always appears as 0 ms, resulting in a slightly higher check rate of ~1960 conn/s. Despite this, the CPU consumption has slightly dropped again to ~32% for the same test. The only trick is that the checks code is built with a bit of recursivity because srv_update_status() calls server_recalc_eweight(), and the latter needs to signal srv_update_status() in case of updates. Thus we added an extra argument to this function to indicate whether or not it must propagate updates (no if it comes from srv_update_status).	2018-08-08 09:57:45 +02:00
Willy Tarreau	647c70b681	MINOR: threads: remove the previous synchronization point It's not needed anymore as it is fully covered by the new rendez-vous point. This also removes the pipe and its polling.	2018-08-08 09:57:45 +02:00
Willy Tarreau	85c459d7e8	MEDIUM: haproxy: don't use sync_poll_loop() anymore in the main loop This partially reverts commit d8fd2af ("BUG/MEDIUM: threads: Use the sync point to check active jobs and exit") which used to address an issue in the way the sync point used to check for present threads, which was later addressed by commit ddb6c16 ("BUG/MEDIUM: threads: Fix the exit condition of the thread barrier"). Thus there is no need anymore to use the sync point for exiting and we can completely remove this call in the main loop.	2018-08-08 09:56:32 +02:00
Willy Tarreau	3d3700f216	MEDIUM: checks: use the new rendez-vous point to spread check result The current sync point causes some important stress when a high number of threads is in use on a config with lots of checks, because it wakes up all threads every time a server state changes. A config like the following can easily saturate a 4-core machine reaching only 750 checks per second out of the ~2000 configured : global nbthread 4 defaults mode http timeout connect 5s timeout client 5s timeout server 5s frontend srv bind :8001 process 1/1 redirect location / if { method OPTIONS } { rand(100) ge 50 } stats uri / backend chk option httpchk server-template srv 1-100 127.0.0.1:8001 check rise 1 fall 1 inter 50 The reason is that the random on the fake server causes the responses to randomly match an HTTP check, and results in a lot of up/down events that are broadcasted to all threads. It's worth noting that the CPU usage already dropped by about 60% between 1.8 and 1.9 just due to the scheduler updates, but the sync point remains expensive. In addition, it's visible on the stats page that a lot of requests end up with an L7TOUT status in ~60ms. With smaller timeouts, it's even L4TOUT around 20-25ms. By not using THREAD_WANT_SYNC() anymore and only calling the server updates under thread_isolate(), we can avoid all these wakeups. The CPU usage on the same config drops to around 44% on the same machine, with all checks being delivered at ~1900 checks per second, and the stats page shows no more timeouts, even at 10 ms check interval. The difference is mainly caused by the fact that there's no more need to wait for a thread to wake up from poll() before starting to process check results.	2018-08-08 09:56:32 +02:00
Christopher Faulet	98d9fe21e0	MINOR: mux: Print the list of existing mux protocols during HA startup This is done in verbose/debug mode and when build options are reported.	2018-08-08 09:54:22 +02:00
Christopher Faulet	32f61c0421	MINOR: mux: Unlink ALPN and multiplexers to rather speak of mux protocols Multiplexers are not necessarily associated to an ALPN. ALPN is a TLS extension, so it is not always defined or used. Instead, we now rather speak of multiplexer's protocols. So in this patch, there are no significative changes, some structures and functions are just renamed.	2018-08-08 09:54:22 +02:00
Christopher Faulet	2d5292a412	MINOR: mux: Add info about the supported side in alpn_mux_list structure Now, a multiplexer can specify if it can be install on incoming connections (ALPN_SIDE_FE), on outgoing connections (ALPN_SIDE_BE) or both (ALPN_SIDE_BOTH). These flags are compatible with proxies' ones.	2018-08-08 09:54:22 +02:00

... 94 95 96 97 98 ...

10974 Commits