haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-07 07:37:02 +02:00

Author	SHA1	Message	Date
Willy Tarreau	fd9417ba3f	CLEANUP: stconn: rename cs_conn() to sc_conn() It's mostly used from upper layers. Both the checked and unchecked functions were updated, or ~150 entries.	2022-05-27 19:33:34 +02:00
Willy Tarreau	ea27f48c5a	CLEANUP: stconn: rename cs_{check,strm,strm_task} to sc_strm_* These functions return the app-layer associated with an stconn, which is a check, a stream or a stream's task. They're used a lot to access channels, flags and for waking up tasks. Let's just name them appropriately for the stream connector.	2022-05-27 19:33:34 +02:00
Willy Tarreau	2f2318df87	MEDIUM: stconn: merge the app_ops and the data_cb fields For historical reasons (stream-interface and connections), we used to require two independent fields for the application level callbacks and the transport-level functions. Over time the distinction faded away so much that the low-level functions became specific to the application and conversely. For example, applets may only work with streams on top since they rely on the channels, and the stream-level functions differ between applets and connections. Right now the application level only contains a wake() callback and the low-level ones contain the functions that act at the lower level to perform the shutr/shutw and at the upper level to notify about readability and writability. Let's just merge them together into a single set and get rid of this confusing distinction. Note that the check ops do not define any app-level function since these are only called by streams.	2022-05-27 19:33:34 +02:00
Willy Tarreau	f3ae34b67d	MINOR: check: export wake_srv_chk() We'll need it to centralize the stream connectors definitions.	2022-05-27 19:33:34 +02:00
Willy Tarreau	cb04166525	CLEANUP: stconn: tree-wide rename stream connector flags CS_FL_* to SC_FL_* This follows the natural naming. There are roughly 100 changes, all totally trivial.	2022-05-27 19:33:34 +02:00
Willy Tarreau	4596fe20d9	CLEANUP: conn_stream: tree-wide rename to stconn (stream connector) This renames the "struct conn_stream" to "struct stconn" and updates the descriptions in all comments (and the rare help descriptions) to "stream connector" or "connector". This touches a lot of files but the change is minimal. The local variables were not even renamed, so there's still a lot of "cs" everywhere.	2022-05-27 19:33:34 +02:00
Willy Tarreau	b605c4213f	CLEANUP: conn_stream: rename the stream endpoint flags CS_EP_* to SE_FL_* Let's now use the new flag names for the stream endpoint.	2022-05-27 19:33:34 +02:00
Willy Tarreau	0cfcc40812	CLEANUP: conn_stream: apply cs_endp_flags.cocci tree-wide This changes all main uses of cs->endp->flags to the sc_ep_*() equivalent by applying coccinelle script cs_endp_flags.cocci. Note: 143 locations were touched, manually reviewed and found to be OK, except a single one that was adjusted in cs_reset_endp() where the flags are read and filtered to be used as-is and not as a boolean, hence was replaced with sc_ep_get() & $FLAGS. The script was applied with all includes: spatch --in-place --recursive-includes -I include --sp-file $script $files	2022-05-27 19:33:34 +02:00
Christopher Faulet	c95eaefbfd	MEDIUM: check: Use the CS to handle subscriptions for read/write events Instead of using the health-check to subscribe to read/write events, we now rely on the conn-stream. Indeed, on the server side, the conn-stream's endpoint is a multiplexer. Thus it seems appropriate to handle subscriptions for read/write events the same way than for the streams. Of course, the I/O callback function is not the same. We use srv_chk_io_cb() instead of cs_conn_io_cb().	2022-05-19 10:12:38 +02:00
Christopher Faulet	361417f9b4	REORG: check: Rename and export I/O callback function event_srv_chk_io() function is renamed srv_chk_io_cb() to be consistant with the I/O callback function of connections. In addition, this function is exported. It will be required to use the conn-stream's subscriptions.	2022-05-19 10:12:38 +02:00
Christopher Faulet	08c8f8e20d	MEDIUM: check: No longer shutdown the connection in .wake callback function The connection is already closed by the health-check itself. Thus there is now reason to duplicate this part in the .wake callback function. It is enough to wake the health-check and wait.	2022-05-19 10:12:38 +02:00
Christopher Faulet	6d781f612a	BUG/MINOR: check: Reinit the buffer wait list at the end of a check The buffer wait list is used to deal with buffer allocation failure. But at the end of health-check, it must be reinitialized. There is no reason to reason to get a buffer between two health-check runs. And in fact, the associated flags, CHK_ST_IN_ALLOC and CHK_ST_OUT_ALLOC, are already cleared at the end of a health-check. This patch must be backported as far as 2.2. On the 2.2, MT_LIST_ADDED and MT_LIST_DEL must be used instead of LIST_INLIST and LIST_DEL_INIT.	2022-05-19 10:12:38 +02:00
Christopher Faulet	a6c4a48341	BUG/MEDIUM: conn-stream: Don't erase endpoint flags on reset Only CS_EP_ERROR flag is now removed from the endpoint when a reset is performed. When a new the endpoint is allocated, flags are preserved. It is the caller responsibility to remove other flags, depending on its need. Concretly, during a connection retry or a L7 retry, we must preserve flags. In tcpcheck and the CLI, we reset flags. This patch is 2.6-specific. No backport needed.	2022-04-29 14:12:42 +02:00
Willy Tarreau	7e2e4f8401	CLEANUP: tree-wide: remove 25 occurrences of unneeded fcntl.h There were plenty of leftovers from old code that were never removed and that are not needed at all since these files do not use any definition depending on fcntl.h, let's drop them.	2022-04-26 10:59:48 +02:00
Willy Tarreau	acef5e27b0	MINOR: tree-wide: always consider EWOULDBLOCK in addition to EAGAIN Some older systems may routinely return EWOULDBLOCK for some syscalls while we tend to check only for EAGAIN nowadays. Modern systems define EWOULDBLOCK as EAGAIN so that solves it, but on a few older ones (AIX, VMS etc) both are different, and for portability we'd need to test for both or we never know if we risk to confuse some status codes with plain errors. There were few entries, the most annoying ones are the switch/case because they require to only add the entry when it differs, but the other ones are really trivial.	2022-04-25 20:32:15 +02:00
Christopher Faulet	eb50c01fef	MINOR: conn-stream: Make cs_detach_* private and use cs_destroy() from outside A conn-stream is never detached from an endpoint or an application alone, except on a reset. Thus, to avoid any error, these functions are now private. And cs_destroy() function is added to destroy a conn-stream. This function is called when a stream is released, on the front and back conn-streams, and when a health-check is finished.	2022-04-22 14:32:30 +02:00
Christopher Faulet	ff022a2b8c	CLEANUP: conn-stream: Rename cs_conn_close() and cs_conn_drain_and_close() These functions don't close the connection but only perform shutdown for reads and writes at the mux level. It is a bit ambiguous. Thus, cs_conn_close() is renamed cs_conn_shut() and cs_conn_drain_and_close() is renamed cs_conn_drain_and_shut(). These both functions rely on cs_conn_shutw() and cs_conn_shutr().	2022-04-22 14:14:27 +02:00
Christopher Faulet	177a0e60ee	MEDIUM: check: Use a new conn-stream for each health-check run It is a partial revert of `54e85cbfc` ("MAJOR: check: Use a persistent conn-stream for health-checks"). But with the CS refactoring, the result is cleaner now. A CS is allocated when a new health-check run is started. The same CS is then used throughout the run. If there are several connections, the endpoint is just reset. At the end of the run, the CS is released. It means, in the tcp-check part, the CS is always defined.	2022-04-13 15:10:16 +02:00
Christopher Faulet	6b0a0fb2f9	CLEANUP: tree-wide: Remove any ref to stream-interfaces Stream-interfaces are gone. Corresponding files can be safely be removed. In addition, comments are updated accordingly.	2022-04-13 15:10:16 +02:00
Christopher Faulet	69ef6c9ef4	MINOR: conn-stream: Rename CS functions dedicated to connections Some conn-stream functions are only used when there is a connection. Thus, they was renamed with "cs_conn_" prefix. In addition, we expect to have a connection, so a BUG_ON is added to be sure the functions are never called in another context.	2022-04-13 15:10:15 +02:00
Christopher Faulet	b041b23ae4	MEDIUM: conn-stream: Move remaning flags from CS to endpoint All old flags CS_FL_* are now moved in the endpoint scope and renamed CS_EP_* accordingly. It is a systematic replacement. There is no true change except for the health-check and the endpoint reset. Here it is a bit special because the same conn-stream is reused. Thus, we must handle endpoint allocation errors. To do so, cs_reset_endp() has been adapted. Thanks to this last change, it will now be possible to simplify the multiplexer and probably the applets too. A review must also be performed to remove some flags in the channel or the stream-interface. The HTX will probably be simplified too. Finally, there is now some place in the conn-stream to move info from the stream-interface.	2022-04-13 15:10:14 +02:00
Christopher Faulet	9ec2f4dc7c	MAJOR: conn-stream: Share endpoint struct between the CS and the mux/applet The conn-stream endpoint is now shared between the conn-stream and the applet or the multiplexer. If the mux or the applet is created first, it is responsible to also create the endpoint and share it with the conn-stream. If the conn-stream is created first, it is the opposite. When the endpoint is only owned by an applet or a mux, it is called an orphan endpoint (there is no conn-stream). When it is only owned by a conn-stream, it is called a detached endpoint (there is no mux/applet). The last entity that owns an endpoint is responsible to release it. When a mux or an applet is detached from a conn-stream, the conn-stream relinquishes the endpoint to recreate a new one. This way, the endpoint state is never lost for the mux or the applet.	2022-04-13 15:10:14 +02:00
Christopher Faulet	a9e8b3979d	MEDIUM: conn-stream: Pre-allocate endpoint to create CS from muxes and applets It is a transient commit to prepare next changes. Now, when a conn-stream is created from an applet or a multiplexer, an endpoint is always provided. In addition, the API to create a conn-stream was specialized to have one function per type. The next step will be to share the endpoint structure.	2022-04-13 15:10:14 +02:00
Christopher Faulet	b669d684c0	MEDIUM: conn-stream: Be able to pass endpoint to create a conn-stream It is a transient commit to prepare next changes. It is possible to pass a pre-allocated endpoint to create a new conn-stream. If it is NULL, a new endpoint is created, otherwise the existing one is used. There no more change at the conn-stream level. In the applets, all conn-stream are created with no pre-allocated endpoint. But for multiplexers, an endpoint is systematically created before creating the conn-stream.	2022-04-13 15:10:14 +02:00
Willy Tarreau	07ecfc5e88	MEDIUM: connection: panic when calling FD-specific functions on FD-less conns Certain functions cannot be called on an FD-less conn because they are normally called as part of the protocol-specific setup/teardown sequence. Better place a few BUG_ON() to make sure none of them is called in other situations. If any of them would trigger in ambiguous conditions, it would always be possible to replace it with an error.	2022-04-11 19:31:47 +02:00
Amaury Denoyelle	7d098bea2b	MEDIUM: check: do not auto configure SSL/PROXY for dynamic servers For server checks, SSL and PROXY is automatically inherited from the server settings if no specific check port is specified. Change this behavior for dynamic servers : explicit "check-ssl"/"check-send-proxy" are required for them. Without this change, it is impossible to add a dynamic server with SSL/PROXY settings and checks without, if the check port is not explicit. This is because "no-check-ssl"/"no-check-send-proxy" keywords are not available for dynamic servers. This change respects the principle that dynamic servers on the CLI should not reuse the same shortcuts used during the config file parsing. Mostly because we expect this feature to be manipulated by automated tools, contrary to the config file which should aim to be the shortest possible for human readability. Update the documentation of the "check" keyword to reflect this change.	2022-03-11 14:28:28 +01:00
Christopher Faulet	693b23bb10	MEDIUM: tree-wide: Use unsafe conn-stream API when it is relevant The unsafe conn-stream API (__cs_*) is now used when we are sure the good endpoint or application is attached to the conn-stream. This avoids compiler warnings about possible null derefs. It also simplify the code and clear up any ambiguity about manipulated entities.	2022-02-28 17:13:36 +01:00
Christopher Faulet	54e85cbfc7	MAJOR: check: Use a persistent conn-stream for health-checks In the same way a stream has always valid conn-streams, when a health-checks is created, a conn-stream is now created and the health-check is attached on it, as an app. This simplify a bit the connect part when a health-check is running.	2022-02-24 11:00:03 +01:00
Christopher Faulet	f835dea939	MEDIUM: conn_stream: Add a pointer to the app object into the conn-stream In the same way the conn-stream has a pointer to the stream endpoint , this patch adds a pointer to the application entity in the conn-stream structure. For now, it is a stream or a health-check. It is mandatory to merge the stream-interface with the conn-stream.	2022-02-24 11:00:02 +01:00
Christopher Faulet	0256da14a5	MINOR: connection: Be prepared to handle conn-stream with no connection The conn-stream will progressively replace the stream-interface. Thus, a stream will have to allocate the backend conn-stream during its creation. This means it will be possible to have a conn-stream with no connection. To prepare this change, we test the conn-stream's connection when we retrieve it.	2022-02-24 11:00:01 +01:00
Willy Tarreau	95d3eaff36	BUILD: checks: fix inlining issue on set_srv_agent_[addr,port} These functions are declared as external functions in check.h and as inline functions in check.c. Let's move them as static inline in check.h. This appeared in 2.4 with the following commits: `4858fb2e1` ("MEDIUM: check: align agentaddr and agentport behaviour") `1c921cd74` ("BUG/MINOR: check: consitent way to set agentaddr") While harmless (it only triggers build warnings with some gcc 4.x), it should probably be backported where the paches above are present to keep the code consistent.	2022-01-28 19:04:02 +01:00
Willy Tarreau	a89c19127d	BUG/MEDIUM: checks: fix the starting thread for external checks When cleaning up the code to remove most explicit task masks in commit `beeabf531` ("MINOR: task: provide 3 task_new_* wrappers to simplify the API"), a mistake was done with the external checks where the call does task_new_on(1) instead of task_new_on(0) due to the confusion with the previous mask 1. No backport is needed as that's only 2.5-dev.	2021-10-20 18:43:30 +02:00
Christopher Faulet	dfd10ab5ee	MINOR: proxy: Introduce proxy flags to replace disabled bitfield This change is required to support TCP/HTTP rules in defaults sections. The 'disabled' bitfield in the proxy structure, used to know if a proxy is disabled or stopped, is replaced a generic bitfield named 'flags'. PR_DISABLED and PR_STOPPED flags are renamed to PR_FL_DISABLED and PR_FL_STOPPED respectively. In addition, everywhere there is a test to know if a proxy is disabled or stopped, there is now a bitwise AND operation on PR_FL_DISABLED and/or PR_FL_STOPPED flags.	2021-10-15 14:12:19 +02:00
Willy Tarreau	b4e34766a3	REORG: thread/sched: move the last dynamic thread_info to thread_ctx The last 3 fields were 3 list heads that are per-thread, and which are: - the pool's LRU head - the buffer_wq - the streams list head Moving them into thread_ctx completes the removal of dynamic elements from the struct thread_info. Now all these dynamic elements are packed together at a single place for a thread.	2021-10-08 17:22:26 +02:00
Willy Tarreau	beeabf5314	MINOR: task: provide 3 task_new_* wrappers to simplify the API We'll need to improve the API to pass other arguments in the future, so let's start to adapt better to the current use cases. task_new() is used: - 18 times as task_new(tid_bit) - 18 times as task_new(MAX_THREADS_MASK) - 2 times with a single bit (in a loop) - 1 in the debug code that uses a mask This patch provides 3 new functions to achieve this: - task_new_here() to create a task on the calling thread - task_new_anywhere() to create a task to be run anywhere - task_new_on() to create a task to run on a specific thread The change is trivial and will allow us to later concentrate the required adaptations to these 3 functions only. It's still possible to call task_new() if needed but a comment was added to encourage the use of the new ones instead. The debug code was not changed and still uses it.	2021-10-01 18:36:29 +02:00
Amaury Denoyelle	29d1ac1330	REORG: server: move slowstart init outside of checks 'slowstart' can be used without check on a server, with the CLI handlers 'enable/disable server'. Move the code to initialize and start the slowstart task outside of check.c. This change will also be reused to enable slowstart for dynamic servers.	2021-09-21 14:00:32 +02:00
Amaury Denoyelle	bc2ebfa5a4	MEDIUM: server: extend refcount for all servers In a future patch, it will be possible to remove at runtime every servers, both static and dynamic. This requires to extend the server refcount for all instances. First, refcount manipulation functions have been renamed to better express the API usage. * srv_refcount_use -> srv_take The refcount is always initialize to 1 on the server creation in new_server. It's also incremented for each check/agent configured on a server instance. * free_server -> srv_drop This decrements the refcount and if null, the server is freed, so code calling it must not use the server reference after it. As a bonus, this function now returns the next server instance. This is useful when calling on the server loop without having to save the next pointer before each invocation. In these functions, remove the checks that prevent refcount on non-dynamic servers. Each reference to "dynamic" in variable/function naming have been eliminated as well.	2021-08-25 15:53:54 +02:00
Amaury Denoyelle	26cb8342ad	BUG/MEDIUM: check: fix leak on agent-check purge There is currently a leak on agent-check for dynamic servers. When deleted, the check rules and vars are not liberated. This leak grows each time a dynamic server with agent-check is deleted. Replace the manual purge code by a free_check invocation which centralizes all the details on check cleaning. There is no leak for health check because in this case the proxy is the owner of the check vars and rules. This should not be backported, unless dynamic server checks are backported.	2021-08-11 14:40:21 +02:00
Amaury Denoyelle	6d7fc446b4	BUG/MINOR: check: fix leak on add dynamic server with agent-check error If an error occured during a dynamic server creation, free_check is used to liberate a possible agent-check. However, this does not free associated vars and rules associated as this is done on another function named deinit_srv_agent_check. To simplify the check free and avoid a leak, move free vars/rules in free_check. This is valid because deinit_srv_agent_check also uses free_check. This operation is done only for an agent-check because for a health check, the proxy instance is the owner of check vars/rules. This should not be backported, unless dynamic server checks are backported.	2021-08-11 14:37:42 +02:00
Amaury Denoyelle	25fe1033cb	BUG/MINOR: check: do not reset check flags on purge Do not reset check flags when setting CHK_ST_PURGE. Currently, this change has no impact. However, it is semantically wrong to clear important flags such as CHK_ST_AGENT on purge. Furthermore, this change will become mandatoy for a future fix to properly free agent checks on dynamic servers removal. For this, it will be needed to differentiate health/agent-check on purge via CHK_ST_AGENT to properly free agent checks. This must not be backported unless dynamic servers checks are backported.	2021-08-11 14:33:34 +02:00
Amaury Denoyelle	9ba34ae710	BUG/MINOR: check: test if server is not null in purge Test if server is not null before using free_server in the check purge operation. Currently, the null server scenario should not occured as purge is used with refcounted dynamic servers. However, this might not be always the case if purge is use in the future in other cases; thus the test is useful for extensibility. No need to backport, unless dynamic server checks are backported. This has been reported through a coverity report in github issue #1343.	2021-08-09 17:48:34 +02:00
Amaury Denoyelle	b65f4cab6a	MEDIUM: server: implement agent check for dynamic servers This commit is the counterpart for agent check of "MEDIUM: server: implement check for dynamic servers". The "agent-check" keyword is enabled for dynamic servers. The agent check must manually be activated via "enable agent" CLI. This can enable the dynamic server if the agent response is "ready" without an explicit "enable server" CLI.	2021-08-06 11:09:48 +02:00
Amaury Denoyelle	2fc4d39577	MEDIUM: server: implement check for dynamic servers Implement check support for dynamic servers. The "check" keyword is now enabled for dynamic servers. If used, the server check is initialized and the check task started in the "add server" CLI handler. The check is explicitely disabled and must be manually activated via "enable health" CLI handler. The dynamic server refcount is incremented if a check is configured. On "delete server" handler, the check is purged, which decrements the refcount.	2021-08-06 11:09:48 +02:00
Amaury Denoyelle	9ecee0fa36	MINOR: check: enable safe keywords for dynamic servers Implement a collection of keywords deemed safe and useful to dynamic servers. The list of the supported keywords is : - addr - check-proto - check-send-proxy - check-via-socks4 - rise - fall - fastinter - downinter - port - agent-addr - agent-inter - agent-port - agent-send	2021-08-06 11:09:48 +02:00
Amaury Denoyelle	b33a0abc0b	MEDIUM: check: implement check deletion for dynamic servers Implement a mechanism to free a started check on runtime for dynamic servers. A new function check_purge is created for this. The check task will be marked for deletion and scheduled to properly close connection elements and free the task/tasklet/buf_wait elements. This function will be useful to delete a dynamic server wich checks.	2021-08-06 11:09:48 +02:00
Amaury Denoyelle	403dce8e5a	MINOR: check: do not increment global maxsock at runtime global maxsock is used to estimate a number of fd to reserve for internal use, such as checks. It is incremented at startup with the info from the config file. Disable this incrementation in checks functions at runtime. First, it currently serves no purpose to increment it after startup. Worse, it may lead to out-of-bound accesse on the fdtab. This will be useful to initiate checks for dynamic servers.	2021-08-06 11:08:24 +02:00
Amaury Denoyelle	3c2ab1a0d4	MINOR: check: export check init functions Remove static qualifier on init_srv_check, init_srv_agent_check and start_check_task. These functions will be called in server.c for dynamic servers with checks.	2021-08-06 11:08:04 +02:00
Amaury Denoyelle	f2c27a5c67	MINOR: check: allocate default check ruleset for every backends Allocate default tcp ruleset for every backend without explicit rules defined, even if no server in the backend use check. This change is required to implement checks for dynamic servers. This allocation is done on check_config_validity. It must absolutely be called before check_proxy_tcpcheck (called via post proxy check) which allocate the implicit tcp connect rule.	2021-08-06 11:08:04 +02:00
Willy Tarreau	acff309753	BUG/MINOR: check: fix the condition to validate a port-less server A config like the below fails to validate because of a bogus test: backend b1 tcp-check connect port 1234 option tcp-check server s1 1.2.3.4 check [ALERT] (18887) : config : config: proxy 'b1': server 's1' has neither service port nor check port, and a tcp_check rule 'connect' with no port information. A \|\| instead of a && only validates the connect rule when both the address and the port are set. A work around is to set the rule like this: tcp-check connect addr 0:1234 port 1234 This needs to be backported as far as 2.2 (2.0 is OK).	2021-07-22 11:21:33 +02:00
Dirkjan Bussink	dfee217b68	BUG/MINOR: checks: return correct error code for srv_parse_agent_check In srv_parse_agent_check the error code is not returned in case something goes wrong. The value 0 is always return. Additionally, there's a small cleanup of unreachable returns that in most checks are not present either and removed in two places they were present. This makes the code consistent across the different checks.	2021-06-25 08:55:39 +02:00
Willy Tarreau	a05704582c	MINOR: server: replace the pendconns-related stuff with a struct queue Just like for proxies, all three elements (pendconns, nbpend, queue_idx) were moved to struct queue.	2021-06-22 18:43:14 +02:00
Willy Tarreau	7f3c1df248	MINOR: proxy: replace the pendconns-related stuff with a struct queue All three elements (pendconns, nbpend, queue_idx) were moved to struct queue.	2021-06-22 18:43:14 +02:00
Amaury Denoyelle	111243003e	MINOR: errors: specify prefix "config" for parsing output Set "config :" as a prefix for the user messages context before starting the configuration parsing. All following stderr output will be prefixed by it. As a consequence, remove extraneous prefix "config" already specified in various ha_alert/warning/notice calls.	2021-06-07 17:19:16 +02:00
Christopher Faulet	ea86083718	BUG/MINOR: checks: Reschedule check on observe mode only if fastinter is set On observe mode, if a server is marked as DOWN, the server's health-check is rescheduled using the fastinter timeout if the new expiration date is newer that the current one. But this must only be performed if the fastinter timeout is defined. Internally, tick_is_lt() function only checks the date and does not perform any verification on the provided args. Thus, we must take care of it. However, it is possible to disable the server health-check by setting its task expiration date to TICK_ETERNITY. This patch must be backported as far as 2.2. It is related to	2021-05-07 12:10:30 +02:00
Christopher Faulet	92017a3215	BUG/MINOR: checks: Handle synchronous connect when a tcpcheck is started A connection may be synchronously established. In the tcpcheck context, it may be a problem if several connections come one after another. In this case, there is no event to close the very first connection before starting the next one. The checks is thus blocked and timed out, a L7 timeout error is reported. To fix the bug, when a tcpcheck is started, we immediately evaluate its state. Most of time, nothing is performed and we must wait. But it is thus possible to handle the result of a successfull connection. This patch should fix the issue #1234. It must be backported as far as 2.2.	2021-05-07 12:00:56 +02:00
Ilya Shipitsin	b2be9a1ea9	CLEANUP: assorted typo fixes in the code and comments This is 22nd iteration of typo fixes	2021-04-26 10:42:58 +02:00
Willy Tarreau	2b71810cb3	CLEANUP: lists/tree-wide: rename some list operations to avoid some confusion The current "ADD" vs "ADDQ" is confusing because when thinking in terms of appending at the end of a list, "ADD" naturally comes to mind, but here it does the opposite, it inserts. Several times already it's been incorrectly used where ADDQ was expected, the latest of which was a fortunate accident explained in `6fa922562` ("CLEANUP: stream: explain why we queue the stream at the head of the server list"). Let's use more explicit (but slightly longer) names now: LIST_ADD -> LIST_INSERT LIST_ADDQ -> LIST_APPEND LIST_ADDED -> LIST_INLIST LIST_DEL -> LIST_DELETE The same is true for MT_LISTs, including their "TRY" variant. LIST_DEL_INIT keeps its short name to encourage to use it instead of the lazier LIST_DELETE which is often less safe. The change is large (~674 non-comment entries) but is mechanical enough to remain safe. No permutation was performed, so any out-of-tree code can easily map older names to new ones. The list doc was updated.	2021-04-21 09:20:17 +02:00
Christopher Faulet	0f1fc23d4e	BUG/MINOR: checks: Set missing id to the dummy checks frontend The dummy frontend used to create the session of the tcp-checks is initialized without identifier. However, it is required because this id may be used without any guard, for instance in log-format string via "%f" or when fe_name sample fetch is called. Thus, an unset id may lead to crashes. This patch must be backported as far as 2.2.	2021-04-17 11:14:58 +02:00
Christopher Faulet	147b8c919c	MINOIR: checks/trace: Register a new trace source with its events Add the trace support for the checks. Only tcp-check based health-checks are supported, including the agent-check. In traces, the first argument is always a check object. So it is easy to get all info related to the check. The tcp-check ruleset, the conn-stream and the connection, the server state...	2021-04-12 12:09:36 +02:00
Willy Tarreau	4781b1521a	CLEANUP: atomic/tree-wide: replace single increments/decrements with inc/dec This patch replaces roughly all occurrences of an HA_ATOMIC_ADD(&foo, 1) or HA_ATOMIC_SUB(&foo, 1) with the equivalent HA_ATOMIC_INC(&foo) and HA_ATOMIC_DEC(&foo) respectively. These are 507 changes over 45 files.	2021-04-07 18:18:37 +02:00
Willy Tarreau	d68d4f1002	MEDIUM: dynbuf: remove last usages of b_alloc_margin() The function's purpose used to be to fail a buffer allocation if that allocation wouldn't result in leaving some buffers available. Thus, some allocations could succeed and others fail for the sole purpose of trying to provide 2 buffers at once to process_stream(). But things have changed a lot with 1.7 breaking the promise that process_stream() would always succeed with only two buffers, and later the thread-local pool caches that keep certain buffers available that are not accounted for in the global pool so that local allocators cannot guess anything from the number of currently available pools. Let's just replace all last uses of b_alloc_margin() with b_alloc() once for all.	2021-03-22 16:27:59 +01:00
Amaury Denoyelle	76e10e78bb	MINOR: server: prepare parsing for dynamic servers Prepare the server parsing API to support dynamic servers. - define a new parsing flag to be used for dynamic servers - each keyword contains a new field dynamic_ok to indicate if it can be used for a dynamic server. For now, no keyword are supported. - do not copy settings from the default server for a new dynamic server. - a dynamic server is created in a maintenance mode and requires an explicit 'enable server' command. - a new server flag named SRV_F_DYNAMIC is created. This flag is set for all servers created at runtime. It might be useful later, for example to know if a server can be purged.	2021-03-18 15:51:12 +01:00
Willy Tarreau	144f84a09d	MEDIUM: task: extend the state field to 32 bits It's been too short for quite a while now and is now full. It's still time to extend it to 32-bits since we have room for this without wasting any space, so we now gained 16 new bits for future flags. The values were not reassigned just in case there would be a few hidden u16 or short somewhere in which these flags are placed (as it used to be the case with stream->pending_events). The patch is tagged MEDIUM because this required to update the task's process() prototype to use an int instead of a short, that's quite a bunch of places.	2021-03-05 08:30:08 +01:00
Tim Duesterhus	dcf753aabe	CLEANUP: Use the ist() macro whenever possible Refactoring performed with the following Coccinelle patch: @@ char *s; @@ ( - ist2(s, strlen(s)) + ist(s) \| - ist2(strdup(s), strlen(s)) + ist(strdup(s)) ) Note that this replacement is safe even in the strdup() case, because `ist()` will not call `strlen()` on a `NULL` pointer. Instead is inserts a length of `0`, effectively resulting in `IST_NULL`.	2021-03-05 08:28:53 +01:00
Willy Tarreau	61cfdf4fd8	CLEANUP: tree-wide: replace free(x);x=NULL with ha_free(&x) This makes the code more readable and less prone to copy-paste errors. In addition, it allows to place some __builtin_constant_p() predicates to trigger a link-time error in case the compiler knows that the freed area is constant. It will also produce compile-time error if trying to free something that is not a regular pointer (e.g. a function). The DEBUG_MEM_STATS macro now also defines an instance for ha_free() so that all these calls can be checked. 178 occurrences were converted. The vast majority of them were handled by the following Coccinelle script, some slightly refined to better deal with "&*x" or with long lines: @ rule @ expression E; @@ - free(E); - E = NULL; + ha_free(&E); It was verified that the resulting code is the same, more or less a handful of cases where the compiler optimized slightly differently the temporary variable that holds the copy of the pointer. A non-negligible amount of {free(str);str=NULL;str_len=0;} are still present in the config part (mostly header names in proxies). These ones should also be cleaned for the same reasons, and probably be turned into ist strings.	2021-02-26 21:21:09 +01:00
Willy Tarreau	4d77bbf856	MINOR: dynbuf: pass offer_buffers() the number of buffers instead of a threshold Historically this function would try to wake the most accurate number of process_stream() waiters. But since the introduction of filters which could also require buffers (e.g. for compression), things started not to be as accurate anymore. Nowadays muxes and transport layers also use buffers, so the runqueue size has nothing to do anymore with the number of supposed users to come. In addition to this, the threshold was compared to the number of free buffer calculated as allocated minus used, but this didn't work anymore with local pools since these counts are not updated upon alloc/free! Let's clean this up and pass the number of released buffers instead, and consider that each waiter successfully called counts as one buffer. This is not rocket science and will not suddenly fix everything, but at least it cannot be as wrong as it is today. This could have been marked as a bug given that the current situation is totally broken regarding this, but this probably doesn't completely fix it, it only goes in a better direction. It is possible however that it makes sense in the future to backport this as part of a larger series if the situation significantly improves.	2021-02-20 12:38:18 +01:00
Willy Tarreau	90f366b595	MINOR: dynbuf: use regular lists instead of mt_lists for buffer_wait There's no point anymore in keeping mt_lists for the buffer_wait and buffer_wq since it's thread-local now.	2021-02-20 12:38:18 +01:00
Willy Tarreau	e8e5091510	MINOR: dynbuf: make the buffer wait queue per thread The buffer wait queue used to be global historically but this doest not make any sense anymore given that the most common use case is to have thread-local pools. Thus there's no point waking up waiters of other threads after releasing an entry, as they won't benefit from it. Let's move the queue head to the thread_info structure and use ti->buffer_wq from now on.	2021-02-20 12:38:18 +01:00
Willy Tarreau	4e9df2737d	BUG/MEDIUM: checks: don't needlessly take the server lock in health_adjust() The server lock was taken preventively for anything in health_adjust(), including the static config checks needed to detect that the lock was not needed, while the function is always called on the response path to update a server's status. This was responsible for huge contention causing a performance drop of about 17% on 16 threads. Let's move the lock only where it should be, i.e. inside the function around the critical sections only. By doing this, a 16-thread process jumped back from 575 to 675 krps. This should be backported to 2.3 as the situation degraded there, and maybe later to 2.2.	2021-02-18 10:06:45 +01:00
Willy Tarreau	64ba5ebadc	BUG/MINOR: checks: properly handle wrapping time in __health_adjust() There's an issue when a server state changes, we use an integer comparison to decide whether or not to reschedule a test instead of using a wrapping timer comparison. This will cause some health-checks not to be immediately triggered half of the time, and some unneeded calls to task_queue() to be performed in other cases. This bug has always been there as it was introduced with the commit that added the feature, `97f07b832` ("[MEDIUM] Decrease server health based on http responses / events, version 3"). This may be backported everywhere.	2021-02-18 10:06:45 +01:00
William Dauchy	b26122b032	CLEANUP: check: fix get_check_status_info declaration we always put a \n between function name and `{` Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-02-15 11:56:31 +01:00
Emeric Brun	c943799c86	MEDIUM: resolvers/dns: split dns.c into dns.c and resolvers.c This patch splits current dns.c into two files: The first dns.c contains code related to DNS message exchange over UDP and in future other TCP. We try to remove depencies to resolving to make it usable by other stuff as DNS load balancing. The new resolvers.c inherit of the code specific to the actual resolvers. Note: It was really difficult to obtain a clean diff dur to the amount of moved code. Note2: Counters and stuff related to stats is not cleany separated because currently counters for both layers are merged and hard to separate for now.	2021-02-13 10:03:46 +01:00
Emeric Brun	d30e9a1709	MINOR: resolvers: rework prototype suffixes to split resolving and dns. A lot of prototypes in dns.h are specific to resolvers and must be renamed to split resolving and DNS layers.	2021-02-13 09:43:18 +01:00
Emeric Brun	08622d3c0a	MINOR: resolvers: renames some resolvers specific types to not use dns prefix This patch applies those changes on names: -struct dns_resolution { +struct resolv_resolution { -struct dns_requester { +struct resolv_requester { -struct dns_srvrq { +struct resolv_srvrq { @@ -185,12 +185,12 @@ struct stream { struct { - struct dns_requester dns_requester; + struct resolv_requester requester; ... - } dns_ctx; + } resolv_ctx;	2021-02-13 09:43:18 +01:00
William Dauchy	f4300902b9	CLEANUP: check: fix some typo in comments a few obvious english typo in comments, some of which introduced by myself quite recently Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-02-08 10:49:08 +01:00
William Dauchy	4858fb2e18	MEDIUM: check: align agentaddr and agentport behaviour in the same manner of agentaddr, we now: - permit to set agentport through `port` keyword, like it is the case for agentaddr through `addr` - set the priority on `agent-port` keyword when used - add a flag to be able to test when the value is set like for agentaddr it makes the behaviour between `addr` and `port` more consistent. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-02-04 14:00:38 +01:00
William Dauchy	1c921cd748	BUG/MINOR: check: consitent way to set agentaddr small consistency problem with `addr` and `agent-addr` options: for the both options, the last one parsed is always used to set the agent-check addr. Thus these two lines don't have the same behavior: server ... addr <addr1> agent-addr <addr2> server ... agent-addr <addr2> addr <addr1> After this patch `agent-addr` will always be the priority option over `addr`. It means we test the flag before setting agentaddr. We also fix all the places where we did not set the flag to be coherent everywhere. I was not really able to determine where this issue is coming from. So it is probable we may backport it to all stable version where the agent is supported. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-02-04 13:55:04 +01:00
William Dauchy	69f118d7b6	MEDIUM: check: remove checkport checkaddr flag While trying to fix some consistency problem with the config file/cli (e.g. check-port cli command does not set the flag), we realised checkport flag was not necessarily needed. Indeed tcpcheck uses service port as the last choice if check.port is zero. So we can assume if check.port is zero, it means it was never set by the user, regardless if it is by the cli or config file. In the longterm this will avoid to introduce a new consistency issue if we forget to set the flag. in the same manner of checkport flag, we don't really need checkaddr flag. We can assume if checkaddr is not set, it means it was never set by the user or config. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-02-04 10:43:00 +01:00
Christopher Faulet	7aa3271439	MINOR: checks: Add function to get the result code corresponding to a status The function get_check_status_result() can now be used to get the result code (CHK_RES_) corresponding to a check status (HCHK_STATUS_). It will be used by the Prometheus exporter when reporting the check status of a server.	2021-02-01 15:16:33 +01:00
Willy Tarreau	025fc71b47	MINOR: checks: export a few functions that appear often in trace dumps The check I/O handler, process_chk_conn and server_warmup are often present in complex backtraces as they're impacted by locking or I/O issues. Let's export them so that they resolve cleanly.	2021-01-29 12:35:24 +01:00
Christopher Faulet	8f100427c4	BUG/MEDIUM: tcpcheck: Don't destroy connection in the wake callback context When a tcpcheck ruleset uses multiple connections, the existing one must be closed and destroyed before openning the new one. This part is handled in the tcpcheck_main() function, when called from the wake callback function (wake_srv_chk). But it is indeed a problem, because this function may be called from the mux layer. This means a mux may call the wake callback function of the data layer, which may release the connection and the mux. It is easy to see how it is hazardous. And actually, depending on the scheduling, it leads to crashes. Thus, we must avoid to release the connection in the wake callback context, and move this part in the check's process function instead. To do so, we rely on the CHK_ST_CLOSE_CONN flags. When a connection must be replaced by a new one, this flag is set on the check, in tcpcheck_main() function, and the check's task is woken up. Then, the connection is really closed in process_chk_conn() function. This patch must be backported as far as 2.2, with some adaptations however because the code is not exactly the same.	2021-01-21 15:21:12 +01:00
Christopher Faulet	6ecd59326f	BUG/MINOR: check: Don't perform any check on servers defined in a frontend If a server is defined in a frontend, thus a proxy without the backend capability, the 'check' and 'agent-check' keywords are ignored. This way, no check is performed on an ignored server. This avoids a segfault because some part of the tcpchecks are not fully initialized (or released for frontends during the post-check). In addition, an test on the server's proxy capabilities is performed when checks or agent-checks are initialized and nothing is performed for servers attached to a non-backend proxy. This patch should fix the issue #1043. It must be backported as far as 2.2.	2021-01-12 17:55:22 +01:00
Willy Tarreau	f7e4a6fc07	MINOR: checks: don't call conn_cond_update_polling() anymore This was a leftover of the pre-mux v1.8-dev3 era. It makes no sense anymore to try to disable polling on a connection we don't own, it's the mux's job and it's properly done upon shutdowns and closes.	2020-12-11 11:11:06 +01:00
Willy Tarreau	30bd4efb1b	MINOR: checks: use cs_drain_and_close() instead of draining the connection As explained in previous commit, the situation is absurd as we try to cleanly drain pending data before impolitely shutting down, and it could be counter productive on real muxes. Let's use cs_drain_and_close() instead.	2020-12-11 11:09:29 +01:00
Christopher Faulet	1e527cbf53	MINOR: check: Always increment check health counter on CONPASS Regarding the health counter, a check finished with the CONDPASS result is now the same than with the PASSED result: The health counter is always incemented. Before, it was only performed is the health counter was not 0. There is no change for the disable-on-404 option because it is only evaluated for running or stopping servers. So with an health check counter greater than 0. But it will make possible to handle (STOPPED -> STOPPING) transition for servers.	2020-11-27 10:30:23 +01:00
Christopher Faulet	97b7bdfcf7	REORG: tcpcheck: Move check option parsing functions based on tcp-check The parsing of the check options based on tcp-check rules (redis, spop, smtp, http...) are moved aways from check.c. Now, these functions are placed in tcpcheck.c. These functions are only related to the tcpcheck ruleset configured on a proxy and not to the health-check attached to a server.	2020-11-27 10:30:23 +01:00
Christopher Faulet	b1bb069c15	MINOR: tcpcheck: Don't handle anymore in-progress connect rules in tcpcheck_main The special handling of in-progress connect rules at the begining of tcpcheck_main() function can be removed. Instead, at the begining of the tcpcheck_eval_connect() function, we test is there is already an existing connection. In this case, it means we are waiting for a connection establishment. In addition, before evaluating a new connect rule, we take care to release any previous connection.	2020-11-27 10:29:41 +01:00
Christopher Faulet	b381a505c1	BUG/MAJOR: tcpcheck: Allocate input and output buffers from the buffer pool Historically, the input and output buffers of a check are allocated by hand during the startup, with a specific size (not necessarily the same than other buffers). But since the recent refactoring of the checks to rely exclusively on the tcp-checks and to use the underlying mux layer, this part is totally buggy. Indeed, because these buffers are now passed to a mux, they maybe be swapped if a zero-copy is possible. In fact, for now it is only possible in h2_rcv_buf(). Thus the bug concretely only exists if a h2 health-check is performed. But, it is a latent bug for other muxes. Another problem is the size of these buffers. because it may differ for the other buffer size, it might be source of bugs. Finally, for configurations with hundreds of thousands of servers, having 2 buffers per check always allocated may be an issue. To fix the bug, we now allocate these buffers when required using the buffer pool. Thus not-running checks don't waste memory and muxes may swap them if possible. The only drawback is the check buffers have now always the same size than buffers used by the streams. This deprecates indirectly the "tune.chksize" global option. In addition, the http-check regtest have been update to perform some h2 health-checks. Many thanks to @VigneshSP94 for its help on this bug. This patch should solve the issue #936. It relies on the commit "MINOR: tcpcheck: Don't handle anymore in-progress send rules in tcpcheck_main". Both must be backport as far as 2.2. bla	2020-11-27 10:29:41 +01:00
Christopher Faulet	1faf18ae39	BUG/MINOR: tcpcheck: Don't forget to reset tcp-check flags on new kind of check When a new kind of check is found during the parsing of a proxy section (via an option directive), we must reset tcpcheck flags for this proxy. It is mandatory to not inherit some flags from a previously declared check (for instance in the default section). This patch must be backported as far as 2.2.	2020-11-27 10:08:18 +01:00
Christopher Faulet	fc633b6eff	CLEANUP: config: Return ERR_NONE from config callbacks instead of 0 Return ERR_NONE instead of 0 on success for all config callbacks that should return ERR_* codes. There is no change because ERR_NONE is a macro equals to 0. But this makes the return value more explicit.	2020-11-13 16:26:10 +01:00
Christopher Faulet	c751b4508d	BUG/MINOR: tcpcheck: Don't warn on unused rules if check option is after When tcp-check or http-check rules are used, if the corresponding check option (option tcp-check and option httpchk) is declared after the ruleset, a warning is emitted about an unused check ruleset while there is no problem in reality. This patch must be backported as far as 2.2.	2020-11-13 16:26:10 +01:00
Amaury Denoyelle	7c14890183	MINOR: check: report error on incompatible proto If the check mux has been explicitly defined but is incompatible with the selected check type (tcp-check vs http-check), report a warning and prevent haproxy startup.	2020-11-13 16:26:10 +01:00
Amaury Denoyelle	0519bd4d04	BUG/MEDIUM: check: reuse srv proto only if using same mode Only reuse the mux from server if the check is using the same mode. For example, this prevents a tcp-check on a h2 server to select the h2 multiplexer instead of passthrough. This bug was introduced by the following commit : BUG/MEDIUM: checks: Use the mux protocol specified on the server line It must be backported up to 2.2. Fixes github issue #945.	2020-11-13 16:26:10 +01:00
Christopher Faulet	5e29376efb	BUG/MINOR: checks: Report a socket error before any connection attempt When a health-check fails, if no connection attempt was performed, a socket error must be reported. But this was only done if the connection was not allocated. It must also be done if there is no control layer. Otherwise, a L7TOUT will be reported instead. It is possible to not having a control layer for a connection if the connection address family is invalid or not defined. This patch must be backported to 2.2.	2020-11-03 10:23:00 +01:00
Willy Tarreau	c3914d4fff	MEDIUM: proxy: replace proxy->state with proxy->disabled The remaining proxy states were only used to distinguish an enabled proxy from a disabled one. Due to the initialization order, both PR_STNEW and PR_STREADY were equivalent after startup, and they would only differ from PR_STSTOPPED when the proxy is disabled or shutdown (which is effectively another way to disable it). Now we just have a "disabled" field which allows to distinguish them. It's becoming obvious that start_proxies() is only used to print a greeting message now, that we'd rather get rid of. Probably that zombify_proxy() and stop_proxy() should be merged once their differences move to the right place.	2020-10-09 11:27:30 +02:00
Willy Tarreau	65ec4e3ff7	MEDIUM: tools: make str2sa_range() check that the protocol has ->connect() Most callers of str2sa_range() need the protocol only to check that it provides a ->connect() method. It used to be used to verify that it's a stream protocol, but it might be a bit early to get rid of it. Let's keep the test for now but move it to str2sa_range() when the new flag PA_O_CONNECT is present. This way almost all call places could be cleaned from this. There's a strange test in the server address parsing code that rechecks the family from the socket which seems to be a duplicate of the previously removed tests. It will have to be rechecked.	2020-09-16 22:08:08 +02:00
Willy Tarreau	5fc9328aa2	MINOR: tools: make str2sa_range() directly return the protocol We'll need this so that it can return pointers to stacked protocol in the future (for QUIC). In addition this removes a lot of tests for protocol validity in the callers. Some of them were checked further apart, or after a call to str2listener() and they were simplified as well. There's still a trick, we can fail to return a protocol in case the caller accepts an fqdn for use later. This is what servers do and in this case it is valid to return no protocol. A typical example is: server foo localhost:1111	2020-09-16 22:08:08 +02:00
Willy Tarreau	a93e5c7fae	MINOR: tools: make str2sa_range() optionally return the fd If a file descriptor was passed, we can optionally return it. This will be useful for listening sockets which are both a pre-bound FD and a ready socket.	2020-09-16 22:08:08 +02:00
Willy Tarreau	328199348b	MINOR: tools: add several PA_O_* flags in str2sa_range() callers These flags indicate whether the call is made to fill a bind or a server line, or even just send/recv calls (like logs or dns). Some special cases are made for outgoing FDs (e.g. pipes for logs) or socket FDs (e.g external listeners), and there's a distinction between stream or dgram usage that's expected to significantly help str2sa_range() proceed appropriately with the input information. For now they are not used yet.	2020-09-16 22:08:08 +02:00
Willy Tarreau	8b0fa8f0ab	MEDIUM: config: remove all checks for missing/invalid ports/ranges Now that str2sa_range() checks for appropriate port specification, we don't need to implement adhoc test cases in every call place, if the result is valid, the conditions are met otherwise the error message is appropriately filled.	2020-09-16 22:08:08 +02:00

1 2 3 4

170 Commits