haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-31 03:21:24 +02:00

Author	SHA1	Message	Date
Olivier Houchard	a6ff035770	BUG/MEDIUM: stream-int: Check if the conn_stream exist in si_cs_io_cb. It is possible that the conn_stream gets detached from the stream_interface, and as it subscribed to the wait list, si_cs_io_cb() gets called anyway, so make sure we have a conn_stream before attempting to send more data. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:54 +02:00
Olivier Houchard	18a85fe602	BUG/MEDIUM: streams: Don't forget to remove the si from the wait list. When freeing the stream, make sure we remove the stream interfaces from the wait lists, in case it was in there. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:33 +02:00
Willy Tarreau	3bcc2699ba	BUG/MEDIUM: cli/threads: protect some server commands against concurrent operations The server-specific CLI commands "set weight", "set maxconn", "disable agent", "enable agent", "disable health", "enable health", "disable server" and "enable server" were not protected against concurrent accesses. Now they take the server lock around the sensitive part. This patch must be backported to 1.8.	2018-08-21 15:35:31 +02:00
Willy Tarreau	46b7f53ad9	DOC: server/threads: document which functions need to be called with/without locks At the moment it's totally unclear while reading the server's code which functions require to be called with the server lock held and which ones grab it and cannot be called this way. This commit simply inventories all of them to indicate what is detected depending on how these functions use the struct server. Only functions used at runtime were checked, those dedicated to config parsing were skipped. Doing so already has uncovered a few bugs on some CLI actions.	2018-08-21 14:58:25 +02:00
Willy Tarreau	a275a3710e	BUG/MEDIUM: cli/threads: protect all "proxy" commands against concurrent updates The proxy-related commands like "{enable\|disable\|shutdown} frontend", "{enable\|disable} dynamic-cookie", "set dynamic-cookie-key" were not protected against concurrent accesses making their use dangerous with threads. This patch must be backported to 1.8.	2018-08-21 14:58:25 +02:00
Willy Tarreau	eeba36b3af	BUG/MEDIUM: server: update our local state before propagating changes Commit 3ff577e ("MAJOR: server: make server state changes synchronous again") reintroduced synchronous server state changes. However, during the previous change from synchronous to asynchronous, the server state propagation was placed at the end of the function to ease the code changes, and the commit above didn't put it back at its place. This has resulted in propagated states to be incomplete. For example, making a server leave maintenance would make it up but would leave its tracking servers down because they see their tracked server is still down. Let's just move the status update right to its place. It also adds the benefit of reporting state changes in the order they appear and not in reverse. No backport is needed.	2018-08-21 08:29:25 +02:00
Cyril Bont�	7ee465f1ad	BUG/MINOR: lua: fix extra 500ms added to socket timeouts Since commit #56cc12509, haproxy accepts double values for timeouts. The value is then converted to milliseconds before being rounded up and cast to int. The issue is that to round up the value, a constant value of 0.5 is added to it, but too early in the conversion, resulting in an additional 500ms to the value. We are talking about a precision of 1ms, so we can safely get rid of this rounding trick and adjust resulting timeouts equal to 0 to a minimum of 1ms. This patch is specific to the 1.9 branch and doesn't require to be backported.	2018-08-19 22:11:28 +02:00
Cyril Bont�	7bb6345497	BUG/MEDIUM: lua: socket timeouts are not applied Sachin Shetty reported that socket timeouts set in LUA code have no effect. Indeed, connect timeout is never modified and is always set to its default, set to 5 seconds. Currently, this patch will apply the specified timeout value to the connect timeout. For the read and write timeouts, the issue is that the timeout is updated but the expiration dates were not updated. This patch should be backported up to the 1.6 branch.	2018-08-18 00:23:52 +02:00
Olivier Houchard	19bdf2428d	MINOR: tasks: Don't special-case when nbthreads == 1 Instead of checking if nbthreads == 1, just and thread_mask with all_threads_mask to know if we're supposed to add the task to the local or the global runqueue.	2018-08-17 14:50:37 +02:00
Bertrand Jacquin	a25282bb39	DOC: ssl: Use consistent naming for TLS protocols In most cases, "TLSv1.x" naming is used across and documentation, lazy people tend to grep too much and may not find what they are looking for. Fixing people is hard.	2018-08-16 20:20:26 +02:00
Emeric Brun	271022150d	BUG/MINOR: map: fix map_regm with backref Due to a cascade of get_trash_chunk calls the sample is corrupted when we want to read it. The fix consist to use a temporary chunk to copy the sample value and use it. [wt: for 1.8 and older, a backport was successfully tested here : https://www.mail-archive.com/haproxy@formilux.org/msg30694.html]	2018-08-16 19:44:04 +02:00
Emeric Brun	e1b4ed4352	BUG/MEDIUM: ssl: loading dh param from certifile causes unpredictable error. If the dh parameter is not found, the openssl's error global stack was not correctly cleared causing unpredictable error during the following parsing (chain cert parsing for instance). This patch should be backported in 1.8 (and perhaps 1.7)	2018-08-16 19:36:08 +02:00
Emeric Brun	eb155b6ca6	BUG/MEDIUM: ssl: fix missing error loading a keytype cert from a bundle. If there was an issue loading a keytype's part of a bundle, the bundle was implicitly ignored without errors. This patch should be backported in 1.8 (and perhaps 1.7)	2018-08-16 19:36:06 +02:00
Olivier Houchard	fde2a09a15	BUG/MEDIUM: sessions: Don't use t->state. In session_expire_embryonic(), don't use t->state, use the "state" argument instead, as t->state has been cleaned before we're being called.	2018-08-16 19:25:56 +02:00
Olivier Houchard	d8b7a4701d	BUG/MEDIUM: tasks: Don't insert in the global rqueue if nbthread == 1 Make sure we don't insert a task in the global run queue if nbthread == 1, as, as an optimisation, we avoid reading from it if nbthread == 1.	2018-08-16 19:25:46 +02:00
Olivier Houchard	5c110b924e	MINOR: checks: Add event_srv_chk_io(). In checks, introduce event_srv_chk_io() as a callback to be called if data can be sent again, instead of abusing event_srv_chk_w.	2018-08-16 17:29:54 +02:00
Olivier Houchard	29fb89dc5e	MINOR: mux_h2: Don't use h2_send() as a callback. Instead of using h2_send() directly as a callback, introcude h2_io_cb(), that will call h2_send() if it is possible to send data.	2018-08-16 17:29:54 +02:00
Olivier Houchard	8f0b4c66f5	MINOR: stream_interface: Give stream_interface its own wait_list. Instead of just using the conn_stream wait_list, give the stream_interface its own. When the conn_stream will have its own buffers, the stream_interface may have to wait on it.	2018-08-16 17:29:54 +02:00
Olivier Houchard	91894cbf4c	MINOR: stream_interface: Don't use si_cs_send() as a task handler. Instead of using si_cs_send() as a task handler, define a new function, si_cs_io_cb(), and give si_cs_send() its original prototype. Right now si_cs_io_cb() just handles send, but later it'll handle recv() too.	2018-08-16 17:29:54 +02:00
Olivier Houchard	e1c6dbcd70	MINOR: connections/mux: Add the wait reason(s) to wait_list. Add a new element to the wait_list, that let us know which event(s) we are waiting on.	2018-08-16 17:29:53 +02:00
Olivier Houchard	ed0f207ef5	MINOR: connections: Get rid of txbuf. Remove txbuf from conn_stream. It is not used yet, and its only user will probably be the mux_h2, so it will be better suited in the struct h2s.	2018-08-16 17:29:51 +02:00
Olivier Houchard	638b799b09	MINOR: connections: Move rxbuf from the conn_stream to the h2s. As the mux_h2 is the only user of rxbuf, move it to the struct h2s, instead of conn_stream.	2018-08-16 17:28:11 +02:00
Olivier Houchard	511efeae7e	MINOR: connections: Make rcv_buf mandatory and nuke cs_recv(). Reintroduce h2_rcv_buf(), right now it just does what cs_recv() did, but should be modified later.	2018-08-16 17:23:44 +02:00
Emeric Brun	77e8919fc6	BUG/MINOR: ssl: empty connections reported as errors. Empty connection is reported as handshake error even if dont-log-null is specified. This bug affect is a regression du to: BUILD: ssl: fix to build (again) with boringssl New openssl 1.1.1 defines OPENSSL_NO_HEARTBEATS as boring ssl so the test was replaced by OPENSSL_IS_BORINGSSL This fix should be backported on 1.8	2018-08-16 11:59:59 +02:00
Patrick Hemmer	248cb4c503	MEDIUM: queue: adjust position based on priority-class and priority-offset The priority values are used when connections are queued to determine which connections should be served first. The lowest priority class is served first. When multiple requests from the same class are found, the earliest (according to queue_time + offset) is served first. The queue offsets can span over roughly 17 minutes after which the offsets will wrap around. This allows up to 8 minutes spent in the queue with no reordering.	2018-08-10 15:06:48 +02:00
Patrick Hemmer	268a707a3d	MEDIUM: add set-priority-class and set-priority-offset This adds the set-priority-class and set-priority-offset actions to http-request and tcp-request content. At this point they are not used yet, which is the purpose of the next commit, but all the logic to set and clear the values is there.	2018-08-10 15:06:31 +02:00
Patrick Hemmer	0355dabd7c	MINOR: queue: replace the linked list with a tree We'll need trees to manage the queues by priorities. This change replaces the list with a tree based on a single key. It's effectively a list but allows us to get rid of the list management right now.	2018-08-10 15:06:27 +02:00
Patrick Hemmer	da282f4a8f	MINOR: queue: store the queue index in the stream when enqueuing We store the queue index in the stream and check it on dequeueing to figure how many entries were processed in between. This way we'll be able to count the elements that may later be added before ours.	2018-08-10 15:06:25 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00
Willy Tarreau	66425e31b5	MINOR: queue: make sure the pendconn is released before logging We'll soon need to rely on the pendconn position at the time of dequeuing to figure the position a stream took in the queue. Usually it's not a problem since pendconn_free() is called once the connection starts, but it will make a difference for failed dequeues (eg: queue timeout reached). Thus it's important to call pendconn_free() before logging in cases we are not certain whether it was already performed, and to call pendconn_unlink() after we know the pendconn will not be used so that we collect the queue state as accurately as possible. As a benefit it will also make the server's and backend's queues count more accurate in these cases.	2018-08-10 15:04:08 +02:00
Christopher Faulet	7ce0c891ab	MEDIUM: mux: Use the mux protocol specified on bind/server lines To do so, mux choices are split to handle incoming and outgoing connections in a different way. The protocol specified on the bind/server line is used in priority. Then, for frontend connections, the ALPN is retrieved and used to choose the best mux. For backend connection, there is no ALPN. Finaly, if no protocol is specified and no protocol matches the ALPN, we fall back on a default mux, choosing in priority the first mux with exactly the same mode.	2018-08-08 10:42:08 +02:00
Christopher Faulet	8ed0a3e32a	MINOR: mux/server: Add 'proto' keyword to force the multiplexer's protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the server's definition.	2018-08-08 10:42:08 +02:00
Christopher Faulet	a717b99284	MINOR: mux/frontend: Add 'proto' keyword to force the mux protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the proxy's definition.	2018-08-08 10:41:11 +02:00
Christopher Faulet	7c42eacbe9	BUG/MEDIUM: stream_int: Don't check CO_FL_SOCK_RD_SH flag to trigger cs receive It is mandatory to be sure to process data blocked in the RX buffer of the conn_stream while the shutr/read0 was already processed. The stream interface doesn't need to rely on this flags because it already tests CS_FL_EOS.	2018-08-08 10:41:11 +02:00
Willy Tarreau	91c2826e1d	CLEANUP: server: remove the update list and the update lock These ones are not more used, let's get rid of them.	2018-08-08 09:57:45 +02:00
Willy Tarreau	3ff577e165	MAJOR: server: make server state changes synchronous again Now we try to synchronously push updates as they come using the new rdv point, so that the call to the server update function from the main poll loop is not needed anymore. It further reduces the apparent latency in the health checks as the response time almost always appears as 0 ms, resulting in a slightly higher check rate of ~1960 conn/s. Despite this, the CPU consumption has slightly dropped again to ~32% for the same test. The only trick is that the checks code is built with a bit of recursivity because srv_update_status() calls server_recalc_eweight(), and the latter needs to signal srv_update_status() in case of updates. Thus we added an extra argument to this function to indicate whether or not it must propagate updates (no if it comes from srv_update_status).	2018-08-08 09:57:45 +02:00
Willy Tarreau	647c70b681	MINOR: threads: remove the previous synchronization point It's not needed anymore as it is fully covered by the new rendez-vous point. This also removes the pipe and its polling.	2018-08-08 09:57:45 +02:00
Willy Tarreau	85c459d7e8	MEDIUM: haproxy: don't use sync_poll_loop() anymore in the main loop This partially reverts commit d8fd2af ("BUG/MEDIUM: threads: Use the sync point to check active jobs and exit") which used to address an issue in the way the sync point used to check for present threads, which was later addressed by commit ddb6c16 ("BUG/MEDIUM: threads: Fix the exit condition of the thread barrier"). Thus there is no need anymore to use the sync point for exiting and we can completely remove this call in the main loop.	2018-08-08 09:56:32 +02:00
Willy Tarreau	3d3700f216	MEDIUM: checks: use the new rendez-vous point to spread check result The current sync point causes some important stress when a high number of threads is in use on a config with lots of checks, because it wakes up all threads every time a server state changes. A config like the following can easily saturate a 4-core machine reaching only 750 checks per second out of the ~2000 configured : global nbthread 4 defaults mode http timeout connect 5s timeout client 5s timeout server 5s frontend srv bind :8001 process 1/1 redirect location / if { method OPTIONS } { rand(100) ge 50 } stats uri / backend chk option httpchk server-template srv 1-100 127.0.0.1:8001 check rise 1 fall 1 inter 50 The reason is that the random on the fake server causes the responses to randomly match an HTTP check, and results in a lot of up/down events that are broadcasted to all threads. It's worth noting that the CPU usage already dropped by about 60% between 1.8 and 1.9 just due to the scheduler updates, but the sync point remains expensive. In addition, it's visible on the stats page that a lot of requests end up with an L7TOUT status in ~60ms. With smaller timeouts, it's even L4TOUT around 20-25ms. By not using THREAD_WANT_SYNC() anymore and only calling the server updates under thread_isolate(), we can avoid all these wakeups. The CPU usage on the same config drops to around 44% on the same machine, with all checks being delivered at ~1900 checks per second, and the stats page shows no more timeouts, even at 10 ms check interval. The difference is mainly caused by the fact that there's no more need to wait for a thread to wake up from poll() before starting to process check results.	2018-08-08 09:56:32 +02:00
Christopher Faulet	98d9fe21e0	MINOR: mux: Print the list of existing mux protocols during HA startup This is done in verbose/debug mode and when build options are reported.	2018-08-08 09:54:22 +02:00
Christopher Faulet	32f61c0421	MINOR: mux: Unlink ALPN and multiplexers to rather speak of mux protocols Multiplexers are not necessarily associated to an ALPN. ALPN is a TLS extension, so it is not always defined or used. Instead, we now rather speak of multiplexer's protocols. So in this patch, there are no significative changes, some structures and functions are just renamed.	2018-08-08 09:54:22 +02:00
Christopher Faulet	2d5292a412	MINOR: mux: Add info about the supported side in alpn_mux_list structure Now, a multiplexer can specify if it can be install on incoming connections (ALPN_SIDE_FE), on outgoing connections (ALPN_SIDE_BE) or both (ALPN_SIDE_BOTH). These flags are compatible with proxies' ones.	2018-08-08 09:54:22 +02:00
Christopher Faulet	b75bb21092	MEDIUM: backend: don't rely on mux_pt_ops in connect_server() The comment above the change remains true. We assume there is always 1 conn_stream per outgoing connectionq. Today, it is always true because H2 is not supported yet for server connections.	2018-08-08 09:54:22 +02:00
Christopher Faulet	6cc7afa04e	MINOR: backend: Try to find the best mux for outgoing connections For now, there is no effect. mux-pt will always be used because this is only available mux for backend connections.	2018-08-08 09:54:22 +02:00
Christopher Faulet	063f786553	MINOR: conn_stream: add cs_send() as a default snd_buf() function This function is generic and is able to automatically transfer data from a buffer to the conn_stream's tx buffer. It does this automatically if the mux doesn't define another snd_buf() function. It cannot yet be used as-is with the conn_stream's txbuf without risking to lose data on close since conn_streams need to be orphaned for this.	2018-08-08 09:53:58 +02:00
Christopher Faulet	2bf88c05d0	CLEANUP: backend: Move mux install to call it at only one place It makes the code readability simpler. It will also ease futur changes.	2018-08-07 14:37:37 +02:00
Christopher Faulet	d44a9b3627	MEDIUM: mux: Remove const on the buffer in mux->snd_buf() This is a partial revert of the commit deccd1116 ("MEDIUM: mux: make mux->snd_buf() take the byte count in argument"). It is a requirement to do zero-copy transfers. This will be mandatory when the TX buffer of the conn_stream will be used. So, now, data are consumed by mux->snd_buf() and not only sent. So it needs to update the buffer state. On its side, the caller must be aware the buffer can be replaced y an empty or unallocated one. As a side effet of this change, the function co_set_data() is now only responsible to update the channel set, by update ->output field.	2018-08-07 14:36:52 +02:00
Willy Tarreau	a8694654ba	BUG/MEDIUM: queue: prevent a backup server from draining the proxy's connections When switching back from a backup to an active server, the backup server currently continues to drain the proxy's connections, which is a problem because it's not expected to be able to pick them. This patch ensures that a backup server will only pick backend connections if there is no active server and it is the selected backup server or all backup servers are supposed to be used. This issue seems to have existed forever, so this fix should be backported to all stable versions.	2018-08-07 10:52:01 +02:00
Willy Tarreau	6a78e61694	BUG/MEDIUM: servers: check the queues once enabling a server Commit 64cc49c ("MAJOR: servers: propagate server status changes asynchronously.") heavily changed the way the server states are updated since they became asynchronous. During this change, some code was lost, which is used to shut down some sessions from a backup server and to pick pending connections from a proxy once a server is turned back from maintenance to ready state. The effect is that when temporarily disabling a server, connections stay in the backend's queue, and when re-enabling it, they are not picked and they expire in the backend's queue. Now they're properly picked again. This fix must be backported to 1.8.	2018-08-07 10:14:53 +02:00
Willy Tarreau	ab657ce251	BUG/MEDIUM: threads: fix the no-thread case after the change to the sync point In commit 0c026f4 ("MINOR: threads: add more consistency between certain variables in no-thread case"), we ensured that we don't have all_threads_mask zeroed anymore. But one test was missed for the write() to the sync pipe. This results in a situation where when running single-threaded, once a server status changes, a wake-up message is written to the pipe and never consumed, showing a 100% CPU usage. No backport is needed.	2018-08-07 10:07:15 +02:00

... 167 168 169 170 171 ...

14616 Commits