haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2026-01-16 06:11:00 +01:00

Author	SHA1	Message	Date
William Lallemand	7ad501e6a1	MINOR: acme: emit a log when the scheduler can't start the task Emit an error log when the renewal scheduler can't start the task.	2025-05-02 16:12:41 +02:00
William Lallemand	7fe59ebb88	MEDIUM: acme: add a basic scheduler This patch implements a very basic scheduler for the ACME tasks. The scheduler is a task which is started from the postparser function when at least one acme section was configured. The scheduler will loop over the certificates in the ckchs_tree, and for each certificate will start an ACME task if the notAfter date is past curtime + (notAfter - notBefore) / 12, or 7 days if notBefore is not available. Once the lookup over all certificates is terminated, the task will sleep and will wakeup after 12 hours.	2025-05-02 16:01:32 +02:00
William Lallemand	7251c13c77	MINOR: acme: move the acme task init in a dedicated function acme_start_task() is a dedicated function which starts an acme task for a specified <store> certificate. The initialization code was moved from the "acme renew" command parser to this function, in order to be called from a scheduler.	2025-05-02 16:01:32 +02:00
William Lallemand	626de9538e	MINOR: ssl: add function to extract X509 notBefore date in time_t Add x509_get_notbefore_time_t() which returns the notBefore date in time_t format.	2025-05-02 16:01:32 +02:00
Valentine Krasnobaeva	8a4b3216f9	MINOR: cfgparse-global: add explicit error messages in cfg_parse_global_env_opts When the env variable name or value is not provided for setenv/presetenv, it's not clear from the old error message shown on stderr what exactly is missing. The user has to search through the configuration. Let's add more explicit error messages about these inconsistencies. No need to be backported.	2025-05-02 15:37:45 +02:00
Olivier Houchard	994cc58576	MEDIUM: stick-tables: Limit the number of entries we expire In process_table_expire(), limit the number of entries we remove in one call, and just reschedule the task if there's more to do. Removing entries requires taking the heavily contended update write lock, and we don't want to hold it for too long. This helps stick tables perform better under heavy load.	2025-05-02 15:27:55 +02:00
Olivier Houchard	d2d4c3eb65	MEDIUM: stick-tables: Limit the number of old entries we remove Limit the number of old entries we remove in one call of stktable_trash_oldest(), as we do so while holding the heavily contended update write lock, so we'd rather not hold it for too long. This helps stick tables perform better under heavy load.	2025-05-02 15:27:55 +02:00
Olivier Houchard	388539faa3	MEDIUM: stick-tables: defer adding updates to a tasklet There is a lot of contention trying to add updates to the tree. So instead of trying to add the updates to the tree right away, just add them to a mt-list (with one mt-list per thread group, so that the mt-list does not become the new point of contention that much), and create a tasklet dedicated to adding updates to the tree, in batches, to avoid keeping the update lock for too long. This helps stick tables perform better under heavy load.	2025-05-02 15:27:55 +02:00
Olivier Houchard	b3ad7b6371	MEDIUM: peers: Give up if we fail to take locks in hot path In peer_send_msgs(), give up in order to retry later if we failed at getting the update read lock. Similarly, in __process_running_peer_sync(), give up and just reschedule the task if we failed to get the peer lock. There is heavy contention on both those locks, so we could spend a lot of time trying to get them. This helps peers perform better under heavy load.	2025-05-02 15:27:55 +02:00
Aurelien DARRAGON	7a8d1a3122	MINOR: hlua: ignore "tune.lua.bool-sample-conversion" if set after "lua-load" tune.lua.bool-sample-conversion must be set before any lua-load or lua-load-per-thread is used for it to be considered. Indeed, lua-load directives are parsed on the fly and will cause some parts of the scripts to be executed during init already (script body/init contexts). As such, we cannot afford to have "tune.lua.bool-sample-conversion" set after some Lua code was loaded, because it would mean that the setting would be handled differently for Lua's code executed during or after config parsing. To avoid ambiguities, the documentation now states that the setting must be set before any lua-load(-per-thread) directive, and if the setting is met after some Lua was already loaded, the directive is ignored and a warning informs about that. It should fix GH #2957 It may be backported with 29b6d8af16 ("MINOR: hlua: rename "tune.lua.preserve-smp-bool" to "tune.lua.bool-sample-conversion"")	2025-05-02 14:38:37 +02:00
Willy Tarreau	1ed238101a	CLEANUP: tasks: use the local state, not t->state, to check for tasklets There's no point reading t->state to check for a tasklet after we've atomically read the state into the local "state" variable. Not only it's more expensive, it's also less clear whether that state is supposed to be atomic or not. And in any case, tasks and tasklets have their type forever and the one reflected in state is correct and stable.	2025-05-02 11:09:28 +02:00
Willy Tarreau	45e83e8e81	BUG/MAJOR: tasks: fix task accounting when killed After recent commit b81c9390f ("MEDIUM: tasks: Mutualize the TASK_KILLED code between tasks and tasklets"), the task accounting was no longer correct for killed tasks due to the decrement of tasks in list that was no longer done, resulting in infinite loops in process_runnable_tasks(). This just illustrates that this code remains complex and should be further cleaned up. No backport is needed, as this was in 3.2.	2025-05-02 11:09:28 +02:00
Olivier Houchard	faa18c1ad8	BUG/MEDIUM: quic: Let it be known if the tasklet has been released. quic_conn_release() may, or may not, free the tasklet associated with the connection. So make it return 1 if it was, and 0 otherwise, so that if it was called from the tasklet handler itself, the said handler can act accordingly and return NULL if the tasklet was destroyed. This should be backported if 9240cd4a2771245fae4d0d69ef025104b14bfc23 is backported.	2025-05-02 11:09:28 +02:00
William Lallemand	f63ceeded0	MINOR: acme: delay of 5s after the finalize Give the server 5 seconds by default after the finalize to generate the certificate. Some servers would not send a Retry-After during processing.	2025-05-02 10:34:48 +02:00
William Lallemand	2db4848fc8	MINOR: acme: emit a log when starting Emit an administrative log when starting the ACME client for a certificate.	2025-05-02 10:23:42 +02:00
William Lallemand	fbd740ef3e	MINOR: acme: wait 5s before checking the challenges results Wait 5 seconds before trying to check if the challenges are ready, so that the server has time to execute the challenges.	2025-05-02 10:18:24 +02:00
William Lallemand	f7cae0e55b	MINOR: acme: allow a delay after a valid response Use the retryafter value to set a delay before doing the next request when the previous response was valid.	2025-05-02 10:16:12 +02:00
William Lallemand	24fbd1f724	BUG/MINOR: acme: reinit the retries only at next request The retries were reinitialized incorrectly, it must be reinit only when we didn't retry. So any valid response would reinit the retries number.	2025-05-02 09:34:45 +02:00
William Lallemand	6626011720	MINOR: acme: does not leave task for next request The next request was always leaving the task before initializing the httpclient. This patch optimizes it by jumping to the next step at the end of the current one. This way, only the httpclient is doing a task_wakeup() to handle the response. But transitioning from response to the next request does not leave the task.	2025-05-02 09:31:39 +02:00
William Lallemand	51f9415d5e	MINOR: acme: retry label always do a request Doing a retry always results in initializing a request again, so set ACME_HTTP_REQ directly in the label instead of doing it for each step.	2025-05-02 09:15:07 +02:00
Olivier Houchard	81e4083efb	BUILD/MEDIUM: quic: Make sure we build with recent changes TASK_IN_LIST has been changed to TASK_QUEUED, but one was missed in quic_conn.c, so fix that.	2025-04-30 18:00:56 +02:00
Olivier Houchard	b138eab302	BUG/MEDIUM: connections: Report connection closing in conn_create_mux() Add an extra parameter to conn_create_mux(), "closed_connection". If a pointer is provided, then let it know if the connection was closed. Callers have no way to determine that otherwise, and we need to know that, at least in ssl_sock_io_cb(), as if the connection was closed we need to return NULL, as the tasklet was freed, otherwise that can lead to memory corruption and crashes. This should be backported if 9240cd4a2771245fae4d0d69ef025104b14bfc23 is backported too.	2025-04-30 17:17:36 +02:00
Olivier Houchard	b81c9390f4	MEDIUM: tasks: Mutualize the TASK_KILLED code between tasks and tasklets The code to handle a task/tasklet when it's been killed before it were to run is mostly identical, so move it outside of task and tasklet specific code, and inside the common code. This commit is just cosmetic, and should have no impact.	2025-04-30 17:09:14 +02:00
Olivier Houchard	2bab043c8c	MEDIUM: tasks: Remove TASK_IN_LIST and use TASK_QUEUED instead. TASK_QUEUED was used to mean "the task has been scheduled to run", TASK_IN_LIST was used to mean "the tasklet has been scheduled to run", remove TASK_IN_LIST and just use TASK_QUEUED for tasklets instead. This commit is just cosmetic, and should not have any impact.	2025-04-30 17:08:57 +02:00
Olivier Houchard	35df7cbe34	MEDIUM: tasks: More code factorization There is some code that should run no matter if the task was killed or not, and was needlessly duplicated, so only use one instance. This also fixes a small bug when a tasklet that got killed before it could run would still count as a tasklet that ran, when it should not, which just means that we'd run one less useful task before going back to the poller. This commit is mostly cosmetic, and should not have any impact.	2025-04-30 17:08:57 +02:00
Olivier Houchard	438c000e9f	MEDIUM: tasks: Mutualize code between tasks and tasklets. The code that checks if we're currently running, and waits if so, was identical between tasks and tasklets, so move it in code common to tasks and tasklets. This commit is just cosmetic, and should not have any impact.	2025-04-30 17:08:57 +02:00
William Lallemand	6462f183ad	MINOR: acme: use acme_ctx_destroy() upon error Use acme_ctx_destroy() instead of a simple free() upon error in the "acme renew" error handling. It's better to use this function to be sure that everything has been freed.	2025-04-30 17:18:46 +02:00
William Lallemand	b8a5270334	MINOR: acme: acme_ctx_destroy() returns upon NULL acme_ctx_destroy() returns when its argument is NULL.	2025-04-30 17:17:58 +02:00
William Lallemand	563ca94ab8	MINOR: ssl/cli: "acme ps" shows the acme tasks Implement a way to display the running acme tasks over the CLI. It currently only displays a "Running" status with the certificate name and the acme section from the configuration. The displayed running tasks are limited to the size of a buffer for now, it will require a backref list later to be called multiple times to resume the list.	2025-04-30 17:12:50 +02:00
Aurelien DARRAGON	7f418ac7d2	MINOR: hlua_fcn: enforce yield after _get_stats() methods {listener,proxy,server}_get_stats() methods are known to be expensive, especially if used under an iteration. Indeed, while automatic yield is performed every X lua instructions (defaults to 10k), computing an object's stats 10K times in a single cpu loop is not desirable and could create contention. In this patch we leverage hlua_yield_asap() at the end of _get_stats() methods in order to force the automatic yield to occur ASAP after the method returns. Hopefully this should help in similar scenarios as the one described in GH #2903	2025-04-30 17:00:31 +02:00
Aurelien DARRAGON	97363015a5	MINOR: add hlua_yield_asap() helper When called, this function will try to enforce a yield (if available) as soon as possible. Indeed, automatic yield is already enforced every X Lua instructions. However, there may be some cases where we know after running heavy operation that we should yield already to avoid taking too much CPU at once. This is what this function offers, instead of asking the user to manually yield using "core.yield()" from Lua itself after using an expensive Lua method offered by haproxy, we can directly enforce the yield without the need to do it in the Lua script.	2025-04-30 17:00:27 +02:00
Amaury Denoyelle	df50d3e39f	MINOR: mux-quic: limit emitted MSD frames count per qcs The previous commit has implemented a new calculation method for MAX_STREAM_DATA frame emission. Now, a frame may be emitted as soon as a buffer was consumed by a QCS instance. This will probably increase the number of MAX_STREAM_DATA frame emissions. It may even cause a series of frames emitted for the same stream with increasing values under high load, which is completely unnecessary. To improve this, limit the number of MAX_STREAM_DATA frames built to one per QCS instance. This is implemented by storing a reference to this frame in QCS structure via a new member <tx.msd_frm>. Note that to properly reset QCS msd_frm member, emission of flow-control frames has been changed. Now, each frame is emitted individually. On one side, it is better as it prevents emitting frames related to different streams in a single datagram, which is not desirable in case of packet loss. However, this can also increase sendto() syscall invocations.	2025-04-30 16:08:47 +02:00
Amaury Denoyelle	14a3fb679f	MEDIUM: mux-quic: increase flow-control on each bufsize Recently, QCS Rx allocation buffer method has been improved. It is now possible to allocate multiple buffers per QCS instance, which was necessary to improve HTTP/3 POST throughput. However, a limitation remained related to the emission of MAX_STREAM_DATA. These frames are only emitted once at least half of the receive capacity has been consumed by its QCS instance. This may be too restrictive when a client needs to upload a large payload. Improve this by adjusting MAX_STREAM_DATA allocation. If QCS capacity is still limited to 1 or 2 buffers max, the old calculation is still used. This is necessary when user has limited upload throughput via their configuration. If QCS capacity is more than 2 buffers, a new frame is emitted if at least a buffer was consumed. This patch has reduced the number of STREAM_DATA_BLOCKED frames received in POST tests with some specific clients.	2025-04-30 16:08:47 +02:00
Christopher Faulet	2ccfebcebf	BUG/MINOR: mux-spop: Use the right bitwise operator in spop_ctl() Because of a typo, '||' was used instead of '|' to test the SPOP connection flags and decide if the mux is ready or not. The regression was introduced in the commit fd7ebf117 ("BUG/MEDIUM: mux-spop: Wait end of handshake to declare a spop connection ready"). This patch must be backported to 3.1 with the commit above.	2025-04-30 16:01:36 +02:00
Remi Tricot-Le Breton	f191a830d8	BUILD: ssl: Fix wolfssl build The newly added SSL traces require an extra 'conn' parameter to ssl_sock_chose_sni_ctx which was added in the "regular" code but not in the wolfssl specific one. Wolfssl also has a different prototype for some getter functions (SSL_get_servername for instance), which do not expect a const SSL while openssl version does.	2025-04-30 15:50:10 +02:00
Christopher Faulet	7dc4e94830	BUG/MINOR: mux-h1: Fix trace message in h1_destroy() to not rely on connection h1_destroy() may be called to release a H1C after a multiplexer upgrade. In that case, the connection is no longer attached to the H1C. It must not be used in the h1 trace message because the connection context is no longer a H1C. Because of this bug, when a H1>H2 upgrade is performed, a crash may be experienced if the H1 traces are enabled. This patch must be backported to all stable versions.	2025-04-30 14:44:42 +02:00
Christopher Faulet	2dc334be61	BUG/MINOR: mux-h1: Don't pretend connection was released for TCP>H1>H2 upgrade When an applicative upgrade of the H1 multiplexer is performed, we must not pretend the connection was released. Indeed, in that case, a H1 stream is still there with a stream connector attached on it. It must be detached first before releasing the H1 connection and the underlying connection. So it is important to not pretend the connection was already released. Concretely, in that case h1_process() must return 0 instead of -1. It is a minor error because, AFAIK, it is harmless. But it is not correct. So let's fix it to avoid future bugs. To be clear, this happens when a TCP connection is upgraded to H1 connection and a H2 preface is detected, leading to a second upgrade from H1 to H2. This patch may be backported to all stable versions.	2025-04-30 14:44:42 +02:00
Christopher Faulet	53c3046898	BUG/MEDIUM: mux-spop: Handle CLOSING state and wait for AGENT DISCONNECT frame In the SPOE specification, when an error occurred on the SPOP connection, HAProxy must send a DISCONNECT frame and wait for the agent DISCONNECT frame in return before truly closing the connection. However, this part was not properly handled by the SPOP multiplexer. In this case, the SPOP connection should be in the CLOSING state. But this state was not used at all. Depending on when the error was encountered, the connection could be closed immediately, without sending any DISCONNECT frame. It was the case when an early error was detected during the AGENT-HELLO frame parsing. Or it could be moved from ERROR to FRAME_H state, as if no error were detected. This case was less dramatic than it seemed because some flags were also set to prevent any problem. But it was not obvious. So now, the SPOP connection is properly switched to CLOSING state when a DISCONNECT is sent to the agent to be able to wait for its DISCONNECT in reply. spop_process_demux() was updated to parse frames in that state and some validity checks were added. This patch must be backported to 3.1.	2025-04-30 14:44:42 +02:00
Christopher Faulet	fd7ebf117b	BUG/MEDIUM: mux-spop: Wait end of handshake to declare a spop connection ready A SPOP connection must not be considered as ready while the hello handshake is not finished with success. In addition, no error or shutdown must have been reported for the underlying connection. Otherwise a freshly opened spop connection may be reused while it is in fact dead, leading to a connection retry. This patch must be backported to 3.1.	2025-04-30 14:44:42 +02:00
Remi Tricot-Le Breton	047fb37b19	MINOR: Add 'conn' param to ssl_sock_chose_sni_ctx This is only useful in the traces, the conn parameter won't be used otherwise.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	6519cec2ed	MINOR: ssl: Add traces about sigalg extension parsing in clientHello callback We had to parse the sigAlg extension by hand in order to properly select the certificate used by the SSL frontends. These traces allow to dump the allowed sigAlg list sent by the client in its clientHello.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	105c1ca139	MINOR: ssl: Add traces to the switchctx callback This callback allows to pick the used certificate on an SSL frontend. The certificate selection is made according to the information sent by the client in the clientHello. The traces that were added will allow to better understand what certificate was chosen and why. It will also warn us if the chosen certificate was the default one. The actual certificate parsing happens in ssl_sock_chose_sni_ctx. It's in this function that we actually get the filename of the certificate used.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	dbdd0630e1	MINOR: ssl: Add ocsp stapling callback traces If OCSP stapling fails because of a missing or invalid OCSP response we used to silently disable stapling for the given session. We can now know a bit more what happened regarding OCSP stapling.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	0fb05540b2	MINOR: ssl: Add traces to verify callback Those traces allow to know which errors were met during certificate chain validation as well as which ones were ignored.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	4a8fa28e36	MINOR: ssl: Add traces around SSL_do_handshake call Those traces dump information about the multiple SSL_do_handshake calls (renegotiation and regular call). Some errors could also be dumped in case of rejected early data. Depending on the chosen verbosity, some information about the current handshake can be dumped as well (servername, tls version, chosen cipher for instance). In case of failed handshake, the error codes and messages will also be dumped in the log to ease debugging.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	9f146bdab3	MINOR: ssl: Add traces to ssl_sock_io_cb function Add new SSL traces.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	475bb8d843	MINOR: ssl: Add traces to recv/send functions Those traces will allow to identify sessions on which early data is used as well as some forcefully closed connections.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	9bb8d6dcd1	MINOR: ssl: Add traces to ssl init/close functions Add a dedicated trace for some unlikely allocation failures and async errors. Those traces will mostly be used to identify the start and end of a given SSL connection.	2025-04-30 11:11:26 +02:00
Remi Tricot-Le Breton	08e40f4589	MINOR: Add "sigalg" to "sigalg name" helper function This function can be used to convert a TLSv1.3 sigAlg entry (2 bytes) from the signature_algorithms client hello extension into a string. In order to ease debugging, some TLSv1.2 combinations can also be dumped. In TLSv1.2 those signature algorithm pairs were built out of a one-byte signature identifier combined with a one-byte hash identifier. In TLSv1.3 those identifiers are two-byte blocks that must be treated as such.	2025-04-30 11:11:26 +02:00
Willy Tarreau	566b384e4e	MINOR: tools: make my_strndup() take a size_t len instead of an int In relation to issue #2954, it appears that turning some size_t length calculations to the int that my_strndup() uses upsets coverity a bit. Instead of dealing with such warnings each time, better address it at the root. An inspection of all call places shows that the size passed there is always positive so we can safely use an unsigned type, and size_t will always suit it like for strndup() where it's available.	2025-04-30 05:17:43 +02:00
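
The renewal rule described in commit 7fe59ebb88 (start an ACME task once the certificate is within (notAfter - notBefore) / 12 of its notAfter date, or within 7 days when notBefore is not available) could be sketched as follows. This is only one reading of the commit message, not HAProxy's actual code: the names renewal_margin() and needs_renewal() are hypothetical, and treating a non-positive not_before as "not available" is an assumption made for this illustration.

```c
#include <stdbool.h>
#include <time.h>

/* Fallback margin when notBefore is unknown: 7 days, in seconds. */
#define FALLBACK_MARGIN ((time_t)7 * 24 * 3600)

/* Hypothetical helper: how long before notAfter a renewal should start.
 * Assumption: not_before <= 0 means "notBefore is not available".
 */
static time_t renewal_margin(time_t not_before, time_t not_after)
{
	if (not_before > 0 && not_after > not_before)
		return (not_after - not_before) / 12;
	return FALLBACK_MARGIN;
}

/* Hypothetical helper: true once <now> has entered the renewal window. */
static bool needs_renewal(time_t now, time_t not_before, time_t not_after)
{
	return now + renewal_margin(not_before, not_after) >= not_after;
}
```

With a 90-day certificate the margin is 7.5 days, so under this reading a scheduler that wakes up every 12 hours would see the certificate become eligible within at most 12 hours of entering the window.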


19286 Commits
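
The class of bug fixed in commit 2ccfebcebf (a logical '||' used where a bitwise '|' was meant when testing flags) can be illustrated with a minimal sketch. The flag names and values below are hypothetical and are not the SPOP multiplexer's real definitions; the two functions only show why the logical operator silently tests the wrong bit.

```c
#include <stdbool.h>

/* Hypothetical flag values, chosen only for this illustration. */
#define EX_CF_ERROR    0x00000002u
#define EX_CF_SHUTDOWN 0x00000004u

/* Buggy form: (EX_CF_ERROR || EX_CF_SHUTDOWN) is a logical OR that
 * evaluates to 1, so this really tests bit 0 of <flags>.
 */
static bool is_dead_buggy(unsigned int flags)
{
	return (flags & (EX_CF_ERROR || EX_CF_SHUTDOWN)) != 0;
}

/* Fixed form: the bitwise OR builds the mask 0x6, so either flag matches. */
static bool is_dead_fixed(unsigned int flags)
{
	return (flags & (EX_CF_ERROR | EX_CF_SHUTDOWN)) != 0;
}
```

Some compilers warn about a logical operator applied to constant operands, which is one way this kind of typo can be caught early.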