haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-22 07:01:20 +02:00

Author	SHA1	Message	Date
Willy Tarreau	5b654ad42c	BUILD: stick-table: shut up invalid "uninitialized" warning in gcc 8.3 gcc 8.3.0 spews a bunch of: src/stick_table.c: In function 'action_inc_gpc0': include/haproxy/freq_ctr.h:66:12: warning: 'period' may be used uninitialized in this function [-Wmaybe-uninitialized] curr_tick += period; ^~ src/stick_table.c:2241:15: note: 'period' was declared here unsigned int period; ^~~~~~ but they're incorrect because all accesses are guarded by the exact same condition (ptr1 not being null), it's just the compiler being overzealous about the uninitialized detection that seems to be stronger than its ability to follow its own optimizations. This code path is not critical, let's just pre-initialize the period to zero. No backport is needed.	2021-07-06 18:54:07 +02:00
Marno Krahmer	07954fb069	MEDIUM: stats: include disabled proxies that hold active sessions to stats After reloading HAProxy, the old process may still hold active sessions. Currently there is no way to gather information, how many sessions such a process still holds. This patch will not exclude disabled proxies from stats output when they hold at least one active session. This will allow sending `!@<PID> show stat` through a master socket to the disabled process and have it returning its stats data.	2021-07-06 11:54:08 +02:00
Christopher Faulet	23048875a4	Revert "MINOR: tcp-act: Add set-src/set-src-port for "tcp-request content" rules" This reverts commit 19bbbe05629ea947dd60d5b96d96f0066b047b97. For now, set-src/set-src-port actions are directly performed on the client connection. Using these actions at the stream level is really a problem with HTTP connection (See #90) because all requests are affected by this change and not only the current request. And it is worse with the H2, because several requests can set their source address into the same connection at the same time. It is already an issue when these actions are called from "http-request" rules. It is safer to wait a bit before adding the support to "tcp-request content" rules. The solution is to be able to set src/dst address on the stream and not on the connection when the action if performed from the L7 level.. Reverting the above commit means the issue #1303 is no longer fixed. This patch must be backported in all branches containing the above commit (as far as 2.0 for now).	2021-07-06 11:44:04 +02:00
Willy Tarreau	dfb34a8f87	BUG/MINOR: cli: fix server name output in "show fd" A server name was displayed as <srv>/<proxy> instead of the reverse. It only confuses diagnostics. This was introduced by commit 7a4a0ac71 ("MINOR: cli: add a new "show fd" command") so this fix can be backport down to 1.8.	2021-07-06 11:41:10 +02:00
Willy Tarreau	5a9c637bf3	BUG/MEDIUM: sock: make sure to never miss early connection failures As shown in issue #1251, it is possible for a connect() to report an error directly via the poller without ever reporting send readiness, but currentlt sock_conn_check() manages to ignore that situation, leading to high CPU usage as poll() wakes up on these FDs. The bug was apparently introduced in 1.5-dev22 with commit fd803bb4d ("MEDIUM: connection: add check for readiness in I/O handlers"), but was likely only woken up by recent changes to conn_fd_handler() that made use of wakeups instead of direct calls between 1.8 and 1.9, voiding any chance to catch such errors in the early recv() callback. The exact sequence that leads to this situation remains obscure though because the poller does not report send readiness nor does it report an error. Only HUP and IN are reported on the FD. It is also possible that some recent kernel updates made this condition appear while it never used to previously. This needs to be backported to all stable branches, at least as far as 2.0. Before 2.2 the code was in tcp_connect_probe() in proto_tcp.c.	2021-07-06 10:52:19 +02:00
Emeric Brun	726783db18	MEDIUM: stick-table: make the use of 'gpc' excluding the use of 'gpc0/1'' This patch makes the use of 'gpc' excluding the use of the legacy types 'gpc0' and 'gpc1" on the same table. It also makes the use of 'gpc_rate' excluding the use of the legacy types 'gpc0_rate' and 'gpc1_rate" on the same table. The 'gpc0' and 'gpc1' related fetches and actions will apply to the first two elements of the 'gpc' array if stored in table. The 'gpc0_rate' and 'gpc1_rate' related fetches and actions will apply to the first two elements of the 'gpc_rate' array if stored in table.	2021-07-06 07:24:42 +02:00
Emeric Brun	4d7ada8f9e	MEDIUM: stick-table: add the new arrays of gpc and gpc_rate This patch adds the definition of two new array data_types: 'gpc': This is an array of 32bits General Purpose Counters. 'gpc_rate': This is an array on increment rates of General Purpose Counters. Like for all arrays, they are limited to 100 elements. This patch also adds actions and fetches to handle elements of those arrays. Note: As documented, those new actions and fetches won't apply to the legacy 'gpc0', 'gpc1', 'gpc0_rate' nor 'gpc1_rate'.	2021-07-06 07:24:42 +02:00
Emeric Brun	f7ab0bfb62	MEDIUM: stick-table: make the use of 'gpt' excluding the use of 'gpt0' This patch makes the use of 'gpt' excluding the use of the legacy type 'gpt0' on the same table. It also makes the 'gpt0' related fetches and actions applying to the first element of the 'gpt' array if stored in table.	2021-07-06 07:24:42 +02:00
Emeric Brun	877b0b5a7b	MEDIUM: stick-table: add the new array of gpt data_type This patch adds the definition of a new array data_type 'gpt'. This is an array of 32bits General Purpose Tags. Like for all arrays, it is limited to 100 elements. This patch also adds actions and fetches to handle elements of this array. Note: As documented, those new actions and fetches won't apply to the legacy 'gpt0' data type.	2021-07-06 07:24:42 +02:00
Emeric Brun	90a9b676a8	MEDIUM: peers: handle arrays of std types in peers protocol This patch adds support of array data_types on the peer protocol. The table definition message will provide an additionnal parameter for array data-types: the number of elements of the array. In case of array of frqp it also provides a second parameter: the period used to compute freq counter. The array elements are std_type values linearly encoded in the update message. Note: if a remote peer announces an array data_type without parameters into the table definition message, all updates on this table will be ignored because we can not parse update messages consistently.	2021-07-06 07:24:42 +02:00
Emeric Brun	c64a2a307c	MEDIUM: stick-table: handle arrays of standard types into stick-tables This patch provides the code to handle arrays of some standard types (SINT, UINT, ULL and FRQP) in stick table. This way we could define new "array" data types. Note: the number of elements of an array was limited to 100 to put a limit and to ensure that an encoded update message will continue to fit into a buffer when the peer protocol will handle such data types.	2021-07-06 07:24:42 +02:00
Emeric Brun	0e3457b63a	MINOR: stick-table: make skttable_data_cast to use only std types This patch replaces all advanced data type aliases on stktable_data_cast calls by standard types. This way we could call the same stktable_data_cast regardless of the used advanced data type as long they are using the same std type. It also removes all the advanced data type aliases.	2021-07-06 07:24:42 +02:00
Emeric Brun	08b0f6780c	BUG/MINOR: peers: fix data_type bit computation more than 32 data_types This patch fixes the computation of the bit of the current data_type in some part of code of peer protocol where the computation is limited to 32bits whereas the bitfield of data_types can support 64bits. Without this patch it could result in bugs when we will define more than 32 data_types. Backport is useless because there is currently less than 32 data_types	2021-07-06 07:24:42 +02:00
Emeric Brun	01928ae56b	BUG/MINOR: stick-table: fix several printf sign errors dumping tables This patch fixes several errors printing integers of stick table entry values and args during dump on cli. This patch should be backported since the dump of entries is supported. [wt: roughly 1.5-dev1 hence all stable branches]	2021-07-06 07:24:42 +02:00
David Carlier	bae4cb2790	BUILD/MEDIUM: tcp: set-mark support for OpenBSD set-mark support for this platform, for routing table purpose. Follow-up from f7f53afcf9d367d19, this time for OpenBSD.	2021-07-05 10:53:18 +02:00
Emeric Brun	5ea07d9e91	CLEANUP: peers: re-write intdecode function comment. The varint decoding function comment was not clear enough and didn't reflect the current usage. This patch re-writes this.	2021-06-30 13:49:12 +02:00
Christopher Faulet	81ba74ae50	BUG/MEDIUM: resolvers: Make 1st server of a template take part to SRV resolution The commit 3406766d5 ("MEDIUM: resolvers: add a ref between servers and srv request or used SRV record") introduced a regression. The first server of a template based on SRV record is no longer resolved. The same bug exists for a normal server based on a SRV record. In fact, the server used during parsing (used as reference when a server-template line is parsed) is never attached to the corresponding srvrq object. Thus with following lines, no resolution is performed because "srvrq->attached_servers" is empty: server-template test 1 _http.domain.tld resolvers dns ... server test1 _http.domain.tld resolvers dns ... This patch should fix the issue #1295 (but not confirmed yet it is the same bug). It must be backported everywhere the above commit is.	2021-06-29 20:52:37 +02:00
Christopher Faulet	0de0becf0b	BUG/MINOR: mqtt: Support empty client ID in CONNECT message As specified by the MQTT specification (MQTT-3.1.3-6), the client ID may be empty. That means the length of the client ID string may be 0. However, The MQTT parser does not support empty strings. So, to fix the bug, the mqtt_read_string() function may now parse empty string. 2 bytes must be found to decode the string length, but the length may be 0 now. It is the caller responsibility to test the string emptiness if necessary. In addition, in mqtt_parse_connect(), the client ID may be empty now. This patch should partely fix the issue #1310. It must be backported to 2.4.	2021-06-28 16:29:44 +02:00
Christopher Faulet	ca925c9c28	BUG/MINOR: mqtt: Fix parser for string with more than 127 characters Parsing of too long strings (> 127 characters) was buggy because of a wrong cast on the length bytes. To fix the bug, we rely on mqtt_read_2byte_int() function. This way, the string length is properly decoded. This patch should partely fix the issue #1310. It must be backported to 2.4.	2021-06-28 16:29:44 +02:00
Willy Tarreau	5bbfff107b	BUILD: tcp-act: avoid warning when set-mark / set-tos are not supported Since recent commit 469c06c30 ("MINOR: http-act/tcp-act: Add "set-mark" and "set-tos" for tcp content rules") there's a build warning (or error) on Windows due to static function tcp_action_set_mark() not being used because the set-mark functionality is not supported there. It's caused by the fact that only the parsing function uses it so if the code is ifdefed out the function remains unused. Let's surround it with ifdefs as well, and do the same for tcp_action_set_tos() which could suffer the same fate on operating systems not defining IP_TOS. This may need to be backported if the patch above is backported. Also be careful, the condition was adjusted to cover FreeBSD after commit f7f53afcf ("BUILD/MEDIUM: tcp: set-mark setting support for FreeBSD.").	2021-06-28 07:12:22 +02:00
David Carlier	f7f53afcf9	BUILD/MEDIUM: tcp: set-mark setting support for FreeBSD. This platform has a similar socket option from Linux's SO_MARK, marking a socket with an id for packet filter purpose, DTrace monitoring and so on.	2021-06-28 07:03:35 +02:00
Christopher Faulet	ee9c98d81b	CLEANUP: tcp-act: Sort action lists Sort the lists used to register tcp actions.	2021-06-25 16:12:02 +02:00
Christopher Faulet	469c06c30e	MINOR: http-act/tcp-act: Add "set-mark" and "set-tos" for tcp content rules It is now possible to set the Netfilter MARK and the TOS field value in all packets sent to the client from any tcp-request rulesets or the "tcp-response content" one. To do so, the parsing of "set-mark" and "set-tos" actions are moved in tcp_act.c and the actions evaluation is handled in dedicated functions. This patch may be backported as far as 2.2 if necessary.	2021-06-25 16:11:58 +02:00
Christopher Faulet	1da374af2f	MINOR: http-act/tcp-act: Add "set-nice" for tcp content rules It is now possible to set the "nice" factor of the current stream from a "tcp-request content" or "tcp-response content" ruleset. To do so, the action parsing is moved in stream.c and the action evaluation is handled in a dedicated function. This patch may be backported as far as 2.2 if necessary.	2021-06-25 16:11:53 +02:00
Christopher Faulet	551a641cff	MINOR: http-act/tcp-act: Add "set-log-level" for tcp content rules It is now possible to set the stream log level from a "tcp-request content" or "tcp-response content" ruleset. To do so, the action parsing is moved in stream.c and the action evaluation is handled in a dedicated function. This patch should fix issue #1306. It may be backported as far as 2.2 if necessary.	2021-06-25 16:11:46 +02:00
Christopher Faulet	fa5880bd53	BUG/MINOR: tcpcheck: Fix numbering of implicit HTTP send/expect rules The index of the failing rule is reported in the health-check log message. The rules index is also used in the check traces. But for implicit HTTP send/expect rules, the index is wrong. It must be incremented by one compared to the preceding rule. This patch may be backported as far as 2.2.	2021-06-25 14:03:45 +02:00
Dirkjan Bussink	dfee217b68	BUG/MINOR: checks: return correct error code for srv_parse_agent_check In srv_parse_agent_check the error code is not returned in case something goes wrong. The value 0 is always return. Additionally, there's a small cleanup of unreachable returns that in most checks are not present either and removed in two places they were present. This makes the code consistent across the different checks.	2021-06-25 08:55:39 +02:00
Christopher Faulet	07ecff589d	MINOR: resolvers: Reset server IP on error in resolv_get_ip_from_response() If resolv_get_ip_from_response() returns an error (or an unexpected return value), the server is set to RMAINT status. However, its address must also be reset. Otherwise, it is still reported by the cli on "show servers state" commands. This may be confusing. Note that it is a theorical patch because this code path does not exist. Thus it is not tagged as a BUG. This patch may be backported as far as 2.0.	2021-06-24 17:22:36 +02:00
Christopher Faulet	a8ce497aac	BUG/MINOR: resolvers: Reset server IP when no ip is found in the response For A/AAAA resolution, if no ip is found for a server in the response, the server is set to RMAINT status. However, its address must also be reset. Otherwise, it is still reported by the cli on "show servers state" commands. This may be confusing. This patch may be backported as far as 2.0.	2021-06-24 17:22:36 +02:00
Christopher Faulet	d7bb23490c	BUG/MINOR: resolvers: Always attach server on matching record on resolution On A/AAAA resolution, for a given server, if a record is matching, we must always attach the server to this record. Before it was only done if the server IP was not the same than the record one. However, it is a problem if the server IP was not set for a previous resolution. From the libc during startup for instance. In this case, the server IP is not updated and the server is not attached to any record. It remains in this state while a matching record is found in the DNS response. It is especially a problem when the resolution is used for server-templates. This bug was introduced by the commit bd78c912f ("MEDIUM: resolvers: add a ref on server to the used A/AAAA answer item"). This patch should solve the issue #1305. It must be backported to all versions containing the above commit.	2021-06-24 17:15:33 +02:00
Willy Tarreau	47ee44fb71	BUG/MINOR: queue/debug: use the correct lock labels on the queue lock A dedicated queue lock was added by commit 16fbdda3c ("MEDIUM: queue: use a dedicated lock for the queues (v2)") but during its rebase, some labels were lost and left to SERVER_LOCK / PROXY_LOCK instead of QUEUE_LOCK. It's harmless but can confuse the lock debugger, so better fix it. No backport is needed.	2021-06-24 16:00:18 +02:00
Willy Tarreau	19c5581b43	BUG: backend: stop looking for queued connections once there's no more Commit ae0b12ee0 ("MEDIUM: queue: use a trylock on the server's queue") introduced a hard to trigger bug that's more visible with a single thread: if a server dequeues a connection and finds another free slot with no connection to place there, process_srv_queue() will never break out of the loop. In multi-thread it almost does not happen because other threads bring new connections. No backport is needed as it's only in -dev.	2021-06-24 15:56:07 +02:00
Willy Tarreau	d03adce575	MINOR: queue: simplify pendconn_unlink() regarding srv vs px Since the code paths became exactly the same except for what log field to update, let's simplify the code and move further code out of the lock. The queue position update and the test for server vs proxy do not need to be inside the lock.	2021-06-24 10:52:31 +02:00
Willy Tarreau	51c63f0f0a	MINOR: queue: remove the px/srv fields from pendconn Now we directly use p->queue to get to the queue, which is much more straightforward. The performance on 100 servers and 16 threads increased from 560k to 574k RPS, or 2.5%. A lot more simplifications are possible, but the minimum was done at this point.	2021-06-24 10:52:31 +02:00
Willy Tarreau	8429097c61	MINOR: queue: store a pointer to the queue into the pendconn By following the queue pointer in the pendconn it will now be possible to always retrieve the elements (index, srv, px, etc).	2021-06-24 10:52:31 +02:00
Willy Tarreau	cdc83e0192	MINOR: queue: add a pointer to the server and the proxy in the queue A queue is specific to a server or a proxy, so we don't need to place this distinction inside all pendconns, it can be in the queue itself. This commit adds the relevant fields "px" and "sv" into the struct queue, and initializes them accordingly.	2021-06-24 10:52:31 +02:00
Willy Tarreau	df3b0cbe31	MINOR: queue: add queue_init() to initialize a queue This is better and cleaner than open-coding this in the server and proxy code, where it has all chances of becoming wrong once forgotten.	2021-06-24 10:52:31 +02:00
Willy Tarreau	ae0b12ee03	MEDIUM: queue: use a trylock on the server's queue Doing so makes sure that threads attempting to wake up new connections for a server will give up early if another thread is already in charge of this. The goal is to avoid unneeded contention on low server counts. Now with a single server with 16 threads in roundrobin we get the same performance as with multiple servers, i.e. ~575kreq/s instead of ~496k before. Leastconn is seeing a similar jump, from ~460 to ~560k (the difference being the calls to fwlc_srv_reposition). The overhead of process_srv_queue() is now around 2% instead of ~20% previously.	2021-06-24 10:52:31 +02:00
Willy Tarreau	49667c14ba	MEDIUM: queue: take the proxy lock only during the px queue accesses There's no point keeping the proxy lock held for a long time, it's only needed when checking the proxy's queue, and keeping it prevents multiple servers from dequeuing in parallel. Let's move it into pendconn_process_next_strm() and release it ASAP. The pendconn remains under the server queue lock's protection, guaranteeing that no stream will release it while it's being touched. For roundrobin, the performance increases by 76% (327k to 575k) on 16 threads. Even with a single server and maxconn=100, the performance increases from 398 to 496 kreq/s. For leastconn, almost no change is visible (less than one percent) but this is expected since most of the time there is spent in fwlc_reposition() and fwlc_get_next_server().	2021-06-24 10:52:31 +02:00
Willy Tarreau	98c8910be7	MINOR: queue: use atomic-ops to update the queue's index (v2) Doing so allows to retrieve and update the pendconn's queue index outside of the queue's lock and to save one more percent CPU on a highly-contented backend.	2021-06-24 10:52:31 +02:00
Willy Tarreau	12529c0ed3	MINOR: queue: factor out the proxy/server queuing code (v2) The code only differed by the nbpend_max counter. Let's have a pointer to it and merge the two variants to always use a generic queue. It was initially considered to put the max inside the queue structure itself, but the stats support clearing values and maxes and this would have been the only counter having to be handled separately there. Given that we don't need this max anywhere outside stats, let's keep it where it is and have a pointer to it instead. The CAS loop to update the max remains. It was naively thought that it would have been faster without atomic ops inside the lock, but this is not the case for the simple reason that it is a max, it converges very quickly and never has to perform the check anymore. Thus this code is better out of the lock. The queue_idx is still updated inside the lock since that's where the idx is updated, though it could be performed using atomic ops given that it's only used to roughly count places for logging.	2021-06-24 10:52:31 +02:00
Willy Tarreau	a0e9c55ab1	MEDIUM: queue: determine in process_srv_queue() if the proxy is usable (v2) By doing so we can move some evaluations outside of the lock and the loop.	2021-06-24 10:52:31 +02:00
Willy Tarreau	9ab78293bf	MEDIUM: queue: simplify again the process_srv_queue() API (v2) This basically undoes the API changes that were performed by commit 0274286dd ("BUG/MAJOR: server: fix deadlock when changing maxconn via agent-check") to address the deadlock issue: since process_srv_queue() doesn't use the server lock anymore, it doesn't need the "server_locked" argument, so let's get rid of it before it gets used again.	2021-06-24 10:52:31 +02:00
Willy Tarreau	16fbdda3c3	MEDIUM: queue: use a dedicated lock for the queues (v2) Till now whenever a server or proxy's queue was touched, this server or proxy's lock was taken. Not only this requires distinct code paths, but it also causes unnecessary contention with other uses of these locks. This patch adds a lock inside the "queue" structure that will be used the same way by the server and the proxy queuing code. The server used to use a spinlock and the proxy an rwlock, though the queue only used it for locked writes. This new version uses a spinlock since we don't need the read lock part here. Tests have not shown any benefit nor cost in using this one versus the rwlock so we could change later if needed. The lower contention on the locks increases the performance from 362k to 374k req/s on 16 threads with 20 servers and leastconn. The gain with roundrobin even increases by 9%. This is tagged medium because the lock is changed, but no other part of the code touches the queues, with nor without locking, so this should remain invisible.	2021-06-24 10:52:31 +02:00
Willy Tarreau	9cef43acab	MEDIUM: queue: update px->served and lb's take_conn once per loop There's no point doing atomic incs over px->served/px->totpend under the locks from the inner loop, as this value is used by the LB algorithms but not during the dequeuing step. In addition, the LB algo's take_conn() doesn't need to be refreshed for each and every connection taken under the lock, it can be performed once at the end and out of the lock. While the gain on roundrobin is not noticeable (only the atomic inc), on leastconn which uses take_conn(), the performance increases from 355k to 362k req/s on 16 threads.	2021-06-24 10:09:40 +02:00
Willy Tarreau	a48905bad8	Revert "MEDIUM: queue: make pendconn_process_next_strm() only return the pendconn" This reverts commit 5304669e1b1a213d2754755a47735ecd5549ce7b. The recent changes since 5304669e1 MEDIUM: queue: make pendconn_process_next_strm() only return the pendconn opened a tiny race condition between stream_free() and process_srv_queue(), as the pendconn is accessed outside of the lock, possibly while it's being freed. A different approach is required.	2021-06-24 09:55:59 +02:00
Willy Tarreau	d83c98eb14	Revert "MINOR: queue: update proxy->served once out of the loop" This reverts commit 3e92a31783b545dd58c4be6c588808763e0042bc. The recent changes since 5304669e1 MEDIUM: queue: make pendconn_process_next_strm() only return the pendconn opened a tiny race condition between stream_free() and process_srv_queue(), as the pendconn is accessed outside of the lock, possibly while it's being freed. A different approach is required.	2021-06-24 09:55:14 +02:00
Willy Tarreau	e76fc3253d	Revert "MEDIUM: queue: refine the locking in process_srv_queue()" This reverts commit 1b648c857bb9e0fb857e86838bcca0c9ed01e2bd. The recent changes since 5304669e1 MEDIUM: queue: make pendconn_process_next_strm() only return the pendconn opened a tiny race condition between stream_free() and process_srv_queue(), as the pendconn is accessed outside of the lock, possibly while it's being freed. A different approach is required.	2021-06-24 09:55:14 +02:00
Willy Tarreau	3f70fb9ea2	Revert "MEDIUM: queue: use a dedicated lock for the queues" This reverts commit fcb8bf8650ec6b5614d1b88db54f1200ebd96cbd. The recent changes since 5304669e1 MEDIUM: queue: make pendconn_process_next_strm() only return the pendconn opened a tiny race condition between stream_free() and process_srv_queue(), as the pendconn is accessed outside of the lock, possibly while it's being freed. A different approach is required.	2021-06-24 07:26:28 +02:00
Willy Tarreau	ccd85a3e08	Revert "MEDIUM: queue: simplify again the process_srv_queue() API" This reverts commit c83e45e9b001591633188a480a896c935d3c9625. The recent changes since 5304669e1 MEDIUM: queue: make pendconn_process_next_strm() only return the pendconn opened a tiny race condition between stream_free() and process_srv_queue(), as the pendconn is accessed outside of the lock, possibly while it's being freed. A different approach is required.	2021-06-24 07:22:18 +02:00

... 86 87 88 89 90 ...

16131 Commits