haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2026-04-28 01:31:01 +02:00

Author	SHA1	Message	Date
William Lallemand	1494cd7137	MAJOR: ssl: use the msg callback mecanism for backend connections Backend SSL connections never used the ssl_sock_msg_callbacks() which prevent the use of keylog on the server side. The impact should be minimum, though it add a major callback system for protocol analysis, which is the same used on frontend connections. https://www.openssl.org/docs/man1.1.1/man3/SSL_CTX_set_msg_callback.html The patch add a call to SSL_CTX_set_msg_callback() in ssl_sock_prepare_srv_ssl_ctx() the same way it's done for bind lines in ssl_sock_prepare_ctx().	2024-04-19 14:48:44 +02:00
William Lallemand	64201ad2c3	MEDIUM: ssl: crt-base and key-base local keywords for crt-store Add support for crt-base and key-base local keywords for the crt-store. current_crtbase and current_keybase are filed with a copy of the global keyword argument when a crt-store is declared, and updated with a new path when the keywords are in the crt-store section. The ckch_conf_kws[] array was updated with &current_crtbase and &current_keybase instead of the global_ssl ones so the parser can use them. The keyword must be used before any "load" line in a crt-store section. Example: crt-store web crt-base /etc/ssl/certs/ key-base /etc/ssl/private/ load crt "site3.crt" alias "site3" load crt "site4.crt" key "site4.key" frontend in2 bind *:443 ssl crt "@web/site3" crt "@web/site4.crt"	2024-04-18 17:47:24 +02:00
Amaury Denoyelle	0109c0658d	REORG: stats: extract JSON related functions This commit is similar to the previous one. This time it deals with functions related to stats JSON output.	2024-04-18 17:04:08 +02:00
Amaury Denoyelle	b8c1fdf24e	REORG: stats: extract HTML related functions Extract functions related to HTML stats webpage from stats.c into a new module named stats-html. This allows to reduce stats.c to roughly half of its original size.	2024-04-18 17:04:08 +02:00
Amaury Denoyelle	b3d5708adc	MINOR: stats: remove implicit static trash_chunk usage A static variable trash_chunk was used as implicit buffer in most of stats output function. It was a oneline buffer uses as temporary storage before emitting to the final applet or CLI buffer. Replaces it by a buffer defined in show_stat_ctx structure. This allows to retrieve it in most of stats output function. An additional parameter was added for the function where context was not already used. This renders the code cleaner and will allow to split stats.c in several source files. As a result of a new member into show_stat_ctx, per-command context max size has increased. This forces to increase APPLET_MAX_SVCCTX to ensure pool size is big enough. Increase it to 128 bytes which includes some extra room for the future.	2024-04-18 17:04:08 +02:00
William Lallemand	ffea2e1a13	MEDIUM: ssl: support a named crt-store section This patch introduces named crt-store section. A named crt-store allows to add a scope to the crt name. For example, a crt named "foo.crt" in a crt-store named "web" will result in a certificate called "@web/foo.crt".	2024-04-18 16:10:09 +02:00
Aurelien DARRAGON	81a8a2cae1	MINOR: peers: stop relying on srv->addr to find peer port Now that peers entirely rely on peer->srv for connection settings, and that it was confirmed that it works properly thanks to previous commit, let's finish what we started in f6ae258 ("MINOR: peers: rely on srv->addr and remove peer->addr") and stop using srv->addr to find out peers port and instead rely on srv->svc_port as it's already done for other proxy types.	2024-04-18 11:18:26 +02:00
Aurelien DARRAGON	f51f438875	BUG/MEDIUM: peers: fix localpeer regression with 'bind+server' config style A dumb mistake was made in f6ae25858 ("MINOR: peers: rely on srv->addr and remove peer->addr"). I completely overlooked the part where the bind address settings are used as implicit server's address settings when the peers are declared using the new bind+server config style (which is the new recommended method to declare peers as it follows the same logic as the one used in other proxy sections). As such, the peers synchro fails to work between previous and new process (localpeer mechanism) upon reload when declaring peers with way: global localpeer local peers mypeers bind 127.0.0.1:10001 server local And one has to use the 'old' config style to make it work: global localpeer local peers mypeers peer local 127.0.0.1:10001 -- To fix the issue, let's explicitly set the server's addr:port according to the bind's address settings (only the first listener is considered) when local peer was declared using the 'bind+server' method. No backport needed.	2024-04-18 11:18:13 +02:00
Christopher Faulet	494bc03ff7	BUG/MEDIUM: peers: Fix exit condition when max-updates-at-once is reached When a peer applet is pushing updates, we limit the number of update sent at once via a global parameter to not spend too much time in the applet. On interrupt, we claimed for more room to be woken up quickly. However, this statement is only true if something was pushed in the buffer. Otherwise, with an empty buffer, if the stream itself is not woken up, the applet remains also blocked because there is no send activity on the other side to unblock it. In this case, instead of requesting more room, it is sufficient to state the applet have more data to send. This patch must be backported as far as 2.6.	2024-04-18 09:17:03 +02:00
Christopher Faulet	4fd656e311	BUG/MEDIUM: spoe: Always retry when an applet fails to send a frame This bug is related to the previous one ("BUG/MEDIUM: spoe: Always retry when an applet fails to send a frame"). applet_putblk() function retruns -1 on error and it should always be interpreted as a missing of room in the buffer. However, on the spoe, this was processed as an I/O error. This patch must be backported as far as 2.8.	2024-04-18 09:17:03 +02:00
William Lallemand	10224d72fd	BUG/MINOR: ssl: fix crt-store load parsing The crt-store load line parser relies on offsets of member of the ckch_conf struct. However the new "alias" keyword as an offset to -1, because it does not need to be used. Plan was to handle it that way in the parser, but it wasn't supported yet. So -1 was still used in an offset computation which was not used, but ASAN could see the problem. This patch fixes the issue by using a signed type for the offset value, so any negative value would be skipped. It also introduced a PARSE_TYPE_NONE for the parser. No backport needed.	2024-04-17 21:00:34 +02:00
William Lallemand	ff4a0f6562	BUG/MINOR: ssl: check on forbidden character on wrong value The check on the forbidden '/' for the crt-store load keyword was done on the keyword instead of the value itself. No backport needed.	2024-04-17 21:00:25 +02:00
William Lallemand	bdee8ace81	MEDIUM: ssl: support aliases in crt-store The crt-store load line now allows to put an alias. This alias is used as the key in the ckch_tree instead of the certificate. This way an alias can be referenced in the configuration with the '@/' prefix. This can only be define with a crt-store.	2024-04-17 17:24:49 +02:00
Willy Tarreau	e6662bf706	MEDIUM: evports: permit to report multiple events at once Since the beginning in 2.0 the nevlist parameter was set to 1 before calling port_getn(), which means that a single FD event will be reported per polling loop. This is extremely inefficient, and all the code was designed to use global.tune.maxpollevents. It looks like it's a leftover of a temporary debugging change. No apparent issues were found by setting it to a higher value, so better do that. That code is not much used nowadays with Solaris disappearing from the landscape, so even if this definitely was a bug, it's preferable not to backport that fix as it could uncover other subtle bugs that were never raised yet.	2024-04-17 16:37:04 +02:00
Willy Tarreau	36d92dcd9b	BUG/MEDIUM: evports: do not clear returned events list on signal Since 2.0 with commit 0ba4f483d2 ("MAJOR: polling: add event ports support (Solaris)"), the polling system on Solaris suffers from a signal handling problem. It turns out that this API is very bizarre, as reported events are automatically unregistered and their counter is updated in the same variable that was used to pass the count on input, making it difficult to handle certain error codes (how should one handle ENOSYS for example?). And to complete everything, the API is able to return both EINTR and an event if a signal is reported. The code tries to deal with certain such cases (e.g. ETIME for timeout can also report an event), otherwise it defaults to clearing the event counter upon error. This has the effect that EINTR clears the list of events, which are also automatically cleared from the set by the system. This is visible when using external checks where the SIGCHLD of the leaving child causes a wakeup that ruins the event counter and causes endless loops, apparently due to the queued inter-thread byte in the pipe used to wake threads up that never gets removed in this case. Note that extcheck would also deserve deeper investigation because it can immediately re-trigger a check in such a case, which is not normal. Removing the wiping of the nevlist variable fixes the problem. This can be backported to all versions since it affects 2.0.	2024-04-17 16:25:20 +02:00
Ilya Shipitsin	ab7f05daba	CLEANUP: assorted typo fixes in the code and comments This is 41st iteration of typo fixes	2024-04-17 11:14:44 +02:00
Willy Tarreau	1c944eab08	BUILD: cache: fix a build warning with gcc < 7 Gcc before 7 does really not like direct operations on cast pointers such as "((struct a)b)->c += d;". It turns our that we have exactly that construct in 3.0 since commit 5baa9ea168 ("MEDIUM: cache: Save body size of cached objects and track it on delivery"). It's generally sufficient to use an intermediary variable such as : "({ (struct a) _ = b; _; })->c +=d;" but that's ugly. Fortunately DISGUISE() implicitly does something very similar and works fine, so let's use that. No backport is needed.	2024-04-17 09:43:32 +02:00
Christopher Faulet	50d8c18742	BUG/MEDIUM: stconn: Don't forward channel data if input data must be filtered Once data are received and placed in a channel buffer, if it is possible, outgoing data are immediately forwarded. But we must take care to not do so if there is also pending input data and a filter registered on the channel. It is especially important for HTX streams because the HTX may be altered, especially the extra field. And it is indeed an issue with the HTTP compression filter and the H1 multiplexer. The wrong chunk size may be announced leading to an internal error. This patch should fix the issue #2530. It must be backported to all stable versions.	2024-04-16 11:36:54 +02:00
Christopher Faulet	ffe0874cfb	MINOR: peer: Restore previous peer flags value to ease debugging The last fixes on the peers to improve the locking mechanism introduced new peer flags and the value of some old flags was changed. This was done in the commit 9b78e33837 ("MINOR: peers: Add 2 peer flags about the peer learn status"). But, to ease the debugging of the peers team, old values are restored. This patch must be backported with the commit above.	2024-04-16 11:35:47 +02:00
Christopher Faulet	9075a7e32f	MEDIUM: peers: Only lock one peer at a time in the sync process function Thanks to all previous changes, it is now possible to stop locking all peers at once in the resync process function. Peer are locked one after the other. Wen a peer is locked, another one may be locked when all peer sharing the same shard must be updated. Otherwise, at anytime, at most one peer is locked. This should significantly improve the situation. This patch depends on the following patchs: * BUG/MAJOR: peers: Update peers section state from a thread-safe manner * BUG/MINOR: peers: Report a resync was explicitly requested from a thread-safe manner * MINOR: peers: Add functions to commit peer changes from the resync task * MINOR: peers: sligthly adapt part processing the stopping signal * MINOR: peers: Add flags to report the peer state to the resync task * MINOR: peers: Add 2 peer flags about the peer learn status * MINOR: peers: Split resync process function to separate running/stopping states It may be good to backport it to 2.9. All the seris should fix the issue #2470.	2024-04-16 10:29:21 +02:00
Christopher Faulet	9425aeaffb	BUG/MAJOR: peers: Update peers section state from a thread-safe manner It is the main part of this series. In the peer applet, only the peer flags are updated. It is now the responsibility of the resync process function to check changes on each peer to update the peers section state accordingly. Concretly, changes on the connection state (accepted, connected, released or renewed) are first reported at the peer level and then handled in __process_peer_state() function. In the same manner, when the learn status of a peer changes, the peers section state is no longer updated immediately. The resync task is woken up to deal with this changes. Thanks to these changes, the peers should be now really thread-safe. This patch relies on the following ones: * BUG/MINOR: peers: Report a resync was explicitly requested from a thread-safe manner * MINOR: peers: Add functions to commit peer changes from the resync task * MINOR: peers: sligthly adapt part processing the stopping signal * MINOR: peers: Add flags to report the peer state to the resync task * MINOR: peers: Add 2 peer flags about the peer learn status * MINOR: peers: Split resync process function to separate running/stopping states No bug was reported about the thread-safety of peers. Only a performance issue was encountered with a huge number of peers (> 50). So there is no reason to backport all these patches further than 2.9.	2024-04-16 10:29:21 +02:00
Christopher Faulet	ef066fa186	BUG/MINOR: peers: Report a resync was explicitly requested from a thread-safe manner Flags on the peers section state must be updated from a thread-safe manner. It is not true today. With this patch we take care PEERS_F_RESYNC_REQUESTED flag is only set by the resync task. To do so, a peer flag is used. This flag is only set once and never removed. It is juste used for debugging purpose. So it is enough to set it on a peer and be sure to report it on the peers section when the sync task is executed. This patch relies on previous ones: * MINOR: peers: Add functions to commit peer changes from the resync task * MINOR: peers: sligthly adapt part processing the stopping signal * MINOR: peers: Add flags to report the peer state to the resync task * MINOR: peers: Add 2 peer flags about the peer learn status * MINOR: peers: Split resync process function to separate running/stopping states	2024-04-16 10:29:21 +02:00
Christopher Faulet	bdf1634883	MINOR: peers: Add functions to commit peer changes from the resync task For now, nothing is done in these functions. It is only a patch to prepare the huge part of the refactoring about the locking mechanism of the peers. These functions will be responsible to check peers state and their learn status to update the peers section flags accordingly.	2024-04-16 10:29:21 +02:00
Christopher Faulet	4a16560315	MINOR: peers: sligthly adapt part processing the stopping signal The signal and the PEERS_F_DONOTSTOP flag are now handled in the loop on peers to force sessions shutdown. We will need to loop on all peers to update their state. It is easier this way.	2024-04-16 10:29:21 +02:00
Christopher Faulet	4ca8a00955	MINOR: peers: Add flags to report the peer state to the resync task As the previous patch, this patch is also part of the refactoring of peer locking mechanisme. Here we add flags to represent a transitional state for a peer. It will be the resync task responsibility to update the peers state accordingly. A peer may be in 4 transitional states: * accepted : a connection was accepted from a peer * connected: a connection to a peer was established * release : a peer session was released * renewed : a peer session was released because it was replaced by a new one. Concretly, this will be equivalent to released+accepted If none of these flags is set, it means the transition, if any, was processed by the resync task, or no transition happened.	2024-04-16 10:29:21 +02:00
Christopher Faulet	9b78e33837	MINOR: peers: Add 2 peer flags about the peer learn status PEER_F_LEARN_PROCESS and PEER_F_LEARN_FINISHED flags are added to help to fix locking issue about peers. Indeed, a peer is able to update the peers "section" state under its own lock. Because the resync task locks all peers at once, there is no conflict at this level. But there is nothing to prevent 2 peers to update the peers state in same time. So it seems there is no real issue here, but there is a theorical thread-safety issue here. And it means the locking mechanism of the peers must be reviewed. In this context, the 2 flags above will help to move all update of the peers state in the scope of resync task. Each peer will be able to update its own state and the resync task will be responsible to update the peers state accordingly.	2024-04-16 10:29:21 +02:00
Christopher Faulet	4078893049	MINOR: peers: Split resync process function to separate running/stopping states The function responsible to deal with resynchro between all peers is now split in two subfunctions. The first one is used when HAProxy is running while the other one is used in soft-stop case. This patch is required to be able to refactor locking mechanism of the peers.	2024-04-16 10:29:21 +02:00
Frederic Lecaille	98583c4256	BUG/MEDIUM: grpc: Fix several unaligned 32/64 bits accesses There were several places in grpc and its dependency protobuf where unaligned accesses were done. Read accesses to 32 (resp. 64) bits values should be performed by read_u32() (resp. read_u64()). Replace these unligned read accesses by correct calls to these functions. Same fixes for doubles and floats. Such unaligned read accesses could lead to crashes with bus errors on CPU archictectures which do not fix them at run time. This patch depends on this previous commit: 861199fa71 MINOR: net_helper: Add support for floats/doubles. Must be backported as far as 2.6.	2024-04-16 07:37:28 +02:00
William Lallemand	fa5c4cc6ce	MINOR: ssl: 'key-base' allows to load a 'key' from a specific path The global 'key-base' keyword allows to read the 'key' parameter of a crt-store load line using a path prefix. This is the equivalent of the 'crt-base' keyword but for 'key'. It only applies on crt-store.	2024-04-15 15:27:10 +02:00
William Lallemand	6567d09af5	MINOR: ssl: supports crt-base in crt-store Add crt-base support for "crt-store". It will be used by 'crt', 'ocsp', 'issuer', 'sctl' load line parameter. In order to keep compatibility with previous configurations and scripts for the CLI, a crt-store load line will save its ckch_store using the absolute crt path with the crt-base as the ckch tree key. This way, a `show ssl cert` on the CLI will always have the completed path.	2024-04-15 15:25:36 +02:00
William Lallemand	785d5ef3f0	CLEANUP: ssl: remove dead code in cfg_parse_crtstore() Remove dead code reported in #2531.	2024-04-15 09:05:27 +02:00
Willy Tarreau	3ef7daa731	BUG/MAJOR: ring: use the correct size to reallocate startup_logs In 3.0-dev, with commit 7c9ce715c9 ("MINOR: ring: make callers use ring_data() and ring_size(), not ring->buf"), we made startup_logs_dup() use ring_size() to get the old ring size and pass it to ring_new() to create a new ring. But due to the ambiguity of the allocate vs usable size, this resulted in slightly shrinking the buffer compared to the previous one, occasionally causing crashes if the first one was already full of warnings, as seen in GH issue #2529. We need to use the allocated size instead, thanks to the function brought by previous commit. No backport is needed, this only affects 3.0-dev. Thanks to @felipewd for the detailed report that allowed to spot the problem.	2024-04-15 08:26:41 +02:00
Willy Tarreau	b662c5d2b8	MINOR: ring: clarify the usage of ring_size() and add ring_allocated_size() There's currently an abiguity around ring_size(), it's said to return the allocated size but returns the usable size. We can't change it as it's used everywhere in the code like this. Let's fix the comment and add ring_allocated_size() instead for anything related to allocation.	2024-04-15 08:25:03 +02:00
Willy Tarreau	da6bb13790	BUG/MINOR: lru: fix the standalone test case for invalid revision In 2.6, a build issue for LRU in standalone test mode was addressed by commit bf9c07fd9 ("BUILD/DEBUG: lru: update the standalone code to support the revision"), but using revision 1 while looking up rev 0 results in 100% misses. Let's fix this and commit with revision 0 as well. No backport is needed, this only happens when hacking on the code.	2024-04-13 08:43:12 +02:00
Valentine Krasnobaeva	985d458571	MINOR: proto_quic: add proto name in alert In quic_alloc_dghdlrs() add proto name in the last alert. This helps to identify potential problem immediately and makes log messages more uniform.	2024-04-12 18:51:50 +02:00
Valentine Krasnobaeva	7041c078d6	MINOR: listener/protocol: add proto name in alerts Frontend and listen sections allow unlimited number of bind statements, it is often, when there is a bind statement per supported protocol, like below: listen test mode http bind quic4@0.0.0.0:443 name quic ssl crt ... bind 0.0.0.0:443 name https ssl alpn http/1.1,h2 crt ... bind 0.0.0.0:8080 ... ... It seems useful to show corresponded protocol name in alerts and warnings, when problem occures with port binding, connection resuming or sharding. This helps to figure out immediately, which bind statement has a wrong setting or which protocol module is the root cause of the issue.	2024-04-12 18:51:40 +02:00
Willy Tarreau	c0ee2d78d7	DEBUG: pools: report the data around the offending area in case of mismatch When the integrity check fails, it's useful to get a dump of the area around the first faulty byte. That's what this patch does. For example it now shows this before reporting info about the tag itself: Contents around first corrupted address relative to pool item:. Contents around address 0xe4febc0792c0+40=0xe4febc0792e8: 0xe4febc0792c8 [80 75 56 d8 fe e4 00 00] [.uV.....] 0xe4febc0792d0 [a0 f7 23 a4 fe e4 00 00] [..#.....] 0xe4febc0792d8 [90 75 56 d8 fe e4 00 00] [.uV.....] 0xe4febc0792e0 [d9 93 fb ff fd ff ff ff] [........] 0xe4febc0792e8 [d9 93 fb ff ff ff ff ff] [........] 0xe4febc0792f0 [d9 93 fb ff ff ff ff ff] [........] 0xe4febc0792f8 [d9 93 fb ff ff ff ff ff] [........] 0xe4febc079300 [d9 93 fb ff ff ff ff ff] [........] This may be backported to 2.9 and maybe even 2.8 as it does help spot the cause of the memory corruption.	2024-04-12 18:01:55 +02:00
Willy Tarreau	16e3655fbd	REORG: pool: move the area dump with symbol resolution to tools.c This function is particularly useful to dump unknown areas watching for opportunistic symbols, so let's move it to tools.c so that we can reuse it a little bit more.	2024-04-12 18:01:20 +02:00
Willy Tarreau	b21aaef4e5	DEBUG: pool: improve decoding of corrupted pools When a corruption was detected in an object, it's often said that the tag doesn't match the pool, but it should also check if it matches the location of an earlier pool_free() call, which happens when -dMcaller is used. That's what we're doing now.	2024-04-12 18:01:05 +02:00
Willy Tarreau	21447b1dd4	BUG/MAJOR: stick-tables: fix race with peers in entry expiration In 2.9 with commit 7968fe3889 ("MEDIUM: stick-table: change the ref_cnt atomically") we significantly relaxed the stick-tables locking when dealing with peers by adjusting the ref_cnt atomically and moving it out of the lock. However it opened a tiny window that became problematic in 3.0-dev7 when the table's contention was lowered by commit 1a088da7c2 ("MAJOR: stktable: split the keys across multiple shards to reduce contention"). What happens is that some peers may access the entry for reading at the moment it's about to expire, and while the read accesses to push the data remain unnoticed (possibly that from time to time we push crap), but the releasing of the refcount causes a new write that may damage anything else. The scenario is the following: process_table_expire() peer_send_teachmsgs() RDLOCK(&updt_lock); tick_is_expired() != 0 ebmb_delete(ts->key); if (ts->upd.node.leaf_p) { HA_ATOMIC_INC(&ts->ref_cnt); RDUNLOCK(&updt_lock); WRLOCK(&updt_lock); eb32_delete(&ts->upd); } __stksess_free(t, ts); peer_send_updatemsg(ts); RDLOCK(&updt_lock); HA_ATOMIC_DEC(&ts->ref_cnt); Here it's clear that the bottom part of peer_send_teachmsgs() believes to be protected but may act on freed data. This is more visible when enabling -dMtag,no-merge,integrity because the ATOMIC_DEC(&ref_cnt) decrements one byte in the area, that makes the eviction check fail while the tag has the address of the left __stksess_free(), proving a completed pool_free() before the decrement, and the anomaly there is pretty visible in the crash dump. Changing INC()/DEC() with ADD(2)/DEC(2) shows that the byte is now off by two, confirming that the operation happened there. The solution is not very hard, it consists in checking for the ref_cnt on the left after grabbing the lock, and doing both before deleting the element, so that we have the guarantee that either the peer will not take it or that it has already started taking it. This was proven to be sufficient, as instead of crashing after 3s of injection with 4 peers, 16 threads and 130k RPS, it survived for 15mn. In order to stress the setup, a config involving 4+ peers, tracking HTTP request with randoms and applying a bwlim-out filter with a random key, with a client made of 160 h2 conns downloading 10 streams of 4MB objects in parallel managed to trigger it within a few seconds: frontend ft http-request track-sc0 rand(100000) table tbl filter bwlim-out lim-out limit 2047m key rand(100000000),ipmask(32) min-size 1 table tbl http-request set-bandwidth-limit lim-out use_backend bk backend bk server s1 198.18.0.30:8000 server s2 198.18.0.34:8000 backend tbl stick-table type ip size 1000k expire 1s store http_req_cnt,bytes_in_rate(1s),bytes_out_rate(1s) peers peers This seems to be very dependent on the timing and setup though. This will need to be backported to 2.9. This part of the code was reindented with shards but the block should remain mostly unchanged. The logic to apply is the same.	2024-04-12 18:00:13 +02:00
Willy Tarreau	d8c2f5c586	BUG/MEDIUM: peers/trace: fix crash when listing event types Sending "trace peers event" on the CLI crashes because the event list in the peers is not finished. This was introduced in 2.4 by commit d865935f32 ("MINOR: peers: Add traces to peer_treat_updatemsg().") so this must be backported to 2.4.	2024-04-12 17:59:55 +02:00
Willy Tarreau	90efe8a877	CLEANUP: stick-tables: always respect the to_batch limit when trashing When adding the shards support to tables with commit 1a088da7c ("MAJOR: stktable: split the keys across multiple shards to reduce contention"), the condition to stop eliminating entries based on the batch size being reached is based on a pre-decrement of the max_search counter, but now it goes back into the outer loop which doesn't check it, so next time it does it when entering the next shard, it will become even more negative and will properly stop, but at first glance it looks like an int overflow (which it is not). Let's make sure the outer loop stops on this condition so that we don't continue searching when the limit is reached.	2024-04-12 17:58:54 +02:00
Willy Tarreau	44a8f9e7fc	BUG/MEDIUM: stick-tables: fix the task's next expiration date While changing the stick-table indexing that led to commit 1a088da7c ("MAJOR: stktable: split the keys across multiple shards to reduce contention"), I met a problem with the task's expiration date being incorrectly updated, I fixed it and apparently I committed the wrong version :-/ The effect is that the task's date is only correctly reset if the table is empty, otherwise the task wakes up again and is queued at the previous date, eating 100% CPU. The tick_isfirst() must not be used when storing the last result. No backport is needed as this was only merged in 3.0-dev7.	2024-04-12 17:58:54 +02:00
William Lallemand	d308c9a9f6	MINOR: ssl/crtlist: alloc ssl_conf only when a valid keyword is found crt-list will be enhanced with ckch_conf keywords, however these keywords does not fill the 'ssl_conf' structure. So we don't need to allocate the ssl_conf for every options between [ ] but only when we found a relevant one.	2024-04-12 15:38:54 +02:00
William Lallemand	81e54ef197	MINOR: ssl: rename ckchs_load_cert_file to new_ckch_store_load_files_path Remove the ambiguous "ckchs" name and reflect the fact that its loaded from a path.	2024-04-12 15:38:54 +02:00
William Lallemand	00eb44864b	MINOR: ssl: add the section parser for 'crt-store' 'crt-store' is a new section useful to define the struct ckch_store. The "load" keyword in the "crt-store" section allows to define which files you want to load for a specific certificate definition. Ex: crt-store load crt "site1.crt" key "site1.key" load crt "site2.crt" key "site2.key" frontend in bind *:443 ssl crt "site1.crt" crt "site2.crt" This is part of the certificate loading which was discussed in #785.	2024-04-12 15:38:54 +02:00
Christopher Faulet	aaa72e06e5	BUG/MEDIUM: cache/stats: Handle inbuf allocation failure in the I/O handler When cache and stats applets were changed to use their own buffers, a change was also performed to no longer access the stream from the I/O handller. Among other things, the HTTP start-line of the request is now retrieved to get the method. But, when these changes were brought, the inbuf buffer allocation failures were not handled. It is of course not so common. But if this happens, a crash may be experienced. To fix the issue, we now check for inbuf allocation failures before accessing it. No backported needed.	2024-04-12 15:00:04 +02:00
Damien Claisse	0797e05d9f	BUG/MINOR: server: fix slowstart behavior We observed that a dynamic server which health check is down for longer than slowstart delay at startup doesn't trigger the warmup phase, it receives full traffic immediately. This has been confirmed by checking haproxy UI, weight is immediately the full one (e.g. 75/75), without any throttle applied. Further tests showed that it was similar if it was in maintenance, and even when entering a down or maintenance state after being up. Another issue is that if the server is down for less time than slowstart, when it comes back up, it briefly has a much higher weight than expected for a slowstart. An easy way to reproduce is to do the following: - Add a server with e.g. a 20s slowstart and a weight of 10 in config file - Put it in maintenance using CLI (set server be1/srv1 state maint) - Wait more than 20s, enable it again (set server be1/srv1 state ready) - Observe UI, weight will show 10/10 immediately. If server was down for less than 20s, you'd briefly see a weight and throttle value that is inconsistent, e.g. 50% throttle value and a weight of 5 if server comes back up after 10s before going back to 6% after a second or two. Code analysis shows that the logic in server_recalc_eweight stops the warmup task by setting server's next state to SRV_ST_RUNNING if it didn't change state for longer than the slowstart duration, regardless of its current state. As a consequence, a server being down or disabled for longer than the slowstart duration will never enter the warmup phase when it will be up again. Regarding the weight when server comes back up, issue is that even if the server is down, we still compute its next weight as if it was up, hence when it comes back up, it can briefly have a much higher weight than expected during slowstart, until the warmup task is called again after last_change is updated. This patch aims to fix both issues.	2024-04-11 19:24:01 +02:00
Willy Tarreau	018443b8a1	BUILD: makefile: get rid of the CPU variable The CPU variable, when used, is almost always exclusively used with "generic" to disable any CPU-specific optimizations, or "native" to enable "-march=native". Other options are not used and are just making CPU_CFLAGS more confusing. This commit just drops all pre-configured variants and replaces them with documentation about examples of supported options. CPU_CFLAGS is preserved as it appears that it's mostly used as a proxy to inject the distro's CFLAGS, and it's just empty by default. The CPU variable is checked, and if set to anything but "generic", it emits a warning about its deprecation and invites the user to read INSTALL. Users who would just set CPU_CFLAGS will be able to continue to do so, those who were using CPU=native will have to pass CPU_CFLAGS=-march=native and those who were passing one of the other options will find it in the doc as well. Note that this also removes the "CPU=" line from haproxy -vv, that most users got used to seeing set to "generic" or occasionally "native" anyway, thus that didn't provide any useful information.	2024-04-11 17:33:28 +02:00
Willy Tarreau	772f9a5874	BUILD: pools: make DEBUG_MEMORY_POOLS=1 the default option This option has been set by default for a very long time and also complicates the manipulation of the DEBUG variable. Let's make it the official default and permit to unset it by setting it to zero. The other pool-related DEBUG options were adjusted to also explicitly check for the zero value for consistency.	2024-04-11 17:25:45 +02:00

1 2 3 4 5 ...

17338 Commits