haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-23 11:51:00 +01:00

Author	SHA1	Message	Date
Tim Duesterhus	ad54273cf9	MINOR: systemd: Include MONOTONIC_USEC field in RELOADING=1 message As per the `sd_notify` manual: > A field carrying the monotonic timestamp (as per CLOCK_MONOTONIC) formatted > in decimal in μs, when the notification message was generated by the client. > This is typically used in combination with "RELOADING=1", to allow the > service manager to properly synchronize reload cycles. See systemd.service(5) > for details, specifically "Type=notify-reload". Thus this change allows users with a recent systemd to switch to `Type=notify-reload`, should they desire to do so. Correct behavior was verified with a Fedora 39 VM. see systemd/systemd#25916 [wla: the service file should be updated this way:] diff --git a/admin/systemd/haproxy.service.in b/admin/systemd/haproxy.service.in index 22a53d8aab..8c6dadb5e5 100644 --- a/admin/systemd/haproxy.service.in +++ b/admin/systemd/haproxy.service.in @@ -8,12 +8,11 @@ EnvironmentFile=-/etc/default/haproxy EnvironmentFile=-/etc/sysconfig/haproxy Environment="CONFIG=/etc/haproxy/haproxy.cfg" "PIDFILE=/run/haproxy.pid" "EXTRAOPTS=-S /run/haproxy-master.sock" ExecStart=@SBINDIR@/haproxy -Ws -f $CONFIG -p $PIDFILE $EXTRAOPTS -ExecReload=@SBINDIR@/haproxy -Ws -f $CONFIG -c $EXTRAOPTS -ExecReload=/bin/kill -USR2 $MAINPID KillMode=mixed Restart=always SuccessExitStatus=143 -Type=notify +Type=notify-reload +ReloadSignal=SIGUSR2 # The following lines leverage SystemD's sandboxing options to provide # defense in depth protection at the expense of restricting some flexibility Signed-off-by: William Lallemand <wlallemand@haproxy.com>	2024-04-04 15:58:29 +02:00
Frederic Lecaille	fcb096f7cd	BUG/MINOR: stick-tables: Missing stick-table key nullity check This bug arrived with this commit: MAJOR: stktable: split the keys across multiple shards to reduce contention At this time, there are no callers which call stktable_get_entry() without checking the nullity of <key> passed as parameter. But the documentation of this function says it supports this case where the <key> passed as parameter could be null. Move the nullity test on <key> at first statement of this function. Thanks to @chipitsine for having reported this issue in GH #2518.	2024-04-04 11:08:56 +02:00
Willy Tarreau	1a088da7c2	MAJOR: stktable: split the keys across multiple shards to reduce contention In order to reduce the contention on the table when keys expire quickly, we're spreading the load over multiple trees. That counts for keys and expiration dates. The shard number is calculated from the key value itself, both when looking up and when setting it. The "show table" dump on the CLI iterates over all shards so that the output is not fully sorted, it's only sorted within each shard. The Lua table dump just does the same. It was verified with a Lua program to count stick-table entries that it works as intended (the test case is reproduced here as it's clearly not easy to automate as a vtc): function dump_stk() local dmp = core.proxies['tbl'].stktable:dump({}); local count = 0 for _, __ in pairs(dmp) do count = count + 1 end core.Info('Total entries: ' .. count) end core.register_action("dump_stk", {'tcp-req', 'http-req'}, dump_stk, 0); ## global tune.lua.log.stderr on lua-load-per-thread lua-cnttbl.lua listen front bind :8001 http-request lua.dump_stk if { path_beg /stk } http-request track-sc1 rand(),upper,hex table tbl http-request redirect location / backend tbl stick-table size 100k type string len 12 store http_req_cnt ## $ h2load -c 16 -n 10000 0:8001/ $ curl 0:8001/stk ## A count close to 100k appears on haproxy's stderr ## On the CLI, "show table tbl" \| wc will show the same. Some large parts were reindented only to add a top-level loop to iterate over shards (e.g. process_table_expire()). Better check the diff using git show -b. The number of shards is decided just like for the pools, at build time based on the max number of threads, so that we can keep a constant. Maybe this should be done differently. For now CONFIG_HAP_TBL_BUCKETS is used, and defaults to CONFIG_HAP_POOL_BUCKETS to keep the benefits of all the measurements made for the pools. It turns out that this value seems to be the most reasonable one without inflating the struct stktable too much. By default for 1024 threads the value is 32 and delivers 980k RPS in a test involving 80 threads, while adding 1kB to the struct stktable (roughly doubling it). The same test at 64 gives 1008 kRPS and at 128 it gives 1040 kRPS for 8 times the initial size. 16 would be too low however, with 675k RPS. The stksess already have a shard number, it's the one used to decide which peer connection to send the entry. Maybe we should also store the one associated with the entry itself instead of recalculating it, though it does not happen that often. The operation is done by hashing the key using XXH32(). The peers also take and release the table's lock but the way it's used it not very clear yet, so at this point it's sure this will not work. At this point, this allowed to completely unlock the performance on a 80-thread setup: before: 5.4 Gbps, 150k RPS, 80 cores 52.71% haproxy [.] stktable_lookup_key 36.90% haproxy [.] stktable_get_entry.part.0 0.86% haproxy [.] ebmb_lookup 0.18% haproxy [.] process_stream 0.12% haproxy [.] process_table_expire 0.11% haproxy [.] fwrr_get_next_server 0.10% haproxy [.] eb32_insert 0.10% haproxy [.] run_tasks_from_lists after: 36 Gbps, 980k RPS, 80 cores 44.92% haproxy [.] stktable_get_entry 5.47% haproxy [.] ebmb_lookup 2.50% haproxy [.] fwrr_get_next_server 0.97% haproxy [.] eb32_insert 0.92% haproxy [.] process_stream 0.52% haproxy [.] run_tasks_from_lists 0.45% haproxy [.] conn_backend_get 0.44% haproxy [.] __pool_alloc 0.35% haproxy [.] process_table_expire 0.35% haproxy [.] connect_server 0.35% haproxy [.] h1_headers_to_hdr_list 0.34% haproxy [.] eb_delete 0.31% haproxy [.] srv_add_to_idle_list 0.30% haproxy [.] h1_snd_buf WIP: uint64_t -> long WIP: ulong -> uint code is much smaller	2024-04-03 17:34:47 +02:00
Willy Tarreau	864ac31174	OPTIM: stick-tables: check the stksess without taking the read lock Thanks to the previous commit, we can now simply perform an atomic read on stksess->seen and take the write lock to recreate the entry only if at least one peer has seen it, otherwise leave it untouched. On a test on 40 cores, the performance used to drop from 2.10 to 1.14M RPS when one peer was connected, now it drops to 2.05, thus there's basically no impact of connecting a peer vs ~45% previously, all spent in the read lock. This can be particularly important when often updating the same entries (user-agent, source address during an attack etc).	2024-04-03 17:34:47 +02:00
Willy Tarreau	4c1480f13b	MINOR: stick-tables: mark the seen stksess with a flag "seen" Right now we're taking the stick-tables update lock for reads just for the sake of checking if the update index is past it or not. That's costly because even taking the read lock is sufficient to provoke a cache line write, while when under load or attack it's frequent that the update has not yet been propagated and wouldn't require anything. This commit brings a new field to the stksess, "seen", which is zeroed when the entry is updated, and set to one as soon as at least one peer starts to consult it. This way it will reflect that the entry must be updated again so that this peer can see it. Otherwise no update will be necessary. For now the flag is only set/reset but not exploited. A great care is taken to avoid writes whenever possible.	2024-04-03 17:34:47 +02:00
Willy Tarreau	15522fc243	BUG/MINOR: bwlim/config: fix missing '\n' after error messages Some bwlim error messages at parsing time were missing the trailing '\n' in commit 2b6777021d ("MEDIUM: bwlim: Add support of bandwith limitation at the stream level"). This commit can be backported wherever the commit above is (likely as far as 2.7).	2024-04-03 17:34:36 +02:00
Willy Tarreau	f821a3983e	BUILD: systemd: fix build error on non-systemd systems with USE_SYSTEMD=1 Thanks to previous commit, we can now build with USE_SYSTEMD=1 on any system without requiring any parts from systemd. It just turns our that there was one remaining include in haproxy.c that needed to be replaced with haproxy/systemd.h to build correctly. That's what this commit does.	2024-04-03 17:34:36 +02:00
William Lallemand	aa3632962f	MEDIUM: mworker: get rid of libsystemd Given the xz drama which allowed liblzma to be linked to openssh, lets remove libsystemd to get rid of useless dependencies. The sd_notify API seems to be stable and is now documented. This patch replaces the sd_notify() and sd_notifyf() function by a reimplementation inspired by the systemd documentation. This should not change anything functionnally. The function will be built when haproxy is built using USE_SYSTEMD=1. References: https://github.com/systemd/systemd/issues/32028 https://www.freedesktop.org/software/systemd/man/devel/sd_notify.html#Notes Before: wla@kikyo:~% ldd /usr/sbin/haproxy linux-vdso.so.1 (0x00007ffcfaf65000) libcrypt.so.1 => /lib/x86_64-linux-gnu/libcrypt.so.1 (0x000074637fef4000) libssl.so.3 => /lib/x86_64-linux-gnu/libssl.so.3 (0x000074637fe4f000) libcrypto.so.3 => /lib/x86_64-linux-gnu/libcrypto.so.3 (0x000074637f400000) liblua5.4.so.0 => /lib/x86_64-linux-gnu/liblua5.4.so.0 (0x000074637fe0d000) libsystemd.so.0 => /lib/x86_64-linux-gnu/libsystemd.so.0 (0x000074637f92a000) libpcre2-8.so.0 => /lib/x86_64-linux-gnu/libpcre2-8.so.0 (0x000074637f365000) libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x000074637f000000) libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x000074637f27a000) libcap.so.2 => /lib/x86_64-linux-gnu/libcap.so.2 (0x000074637fdff000) libgcrypt.so.20 => /lib/x86_64-linux-gnu/libgcrypt.so.20 (0x000074637eeb8000) liblzma.so.5 => /lib/x86_64-linux-gnu/liblzma.so.5 (0x000074637fdcd000) libzstd.so.1 => /lib/x86_64-linux-gnu/libzstd.so.1 (0x000074637ee01000) liblz4.so.1 => /lib/x86_64-linux-gnu/liblz4.so.1 (0x000074637fda8000) /lib64/ld-linux-x86-64.so.2 (0x000074637ff5d000) libgpg-error.so.0 => /lib/x86_64-linux-gnu/libgpg-error.so.0 (0x000074637f904000) After: wla@kikyo:~% ldd /usr/sbin/haproxy linux-vdso.so.1 (0x00007ffd51901000) libcrypt.so.1 => /lib/x86_64-linux-gnu/libcrypt.so.1 (0x00007f758d6c0000) libssl.so.3 => /lib/x86_64-linux-gnu/libssl.so.3 (0x00007f758d61b000) libcrypto.so.3 => /lib/x86_64-linux-gnu/libcrypto.so.3 (0x00007f758ca00000) liblua5.4.so.0 => /lib/x86_64-linux-gnu/liblua5.4.so.0 (0x00007f758d5d9000) libpcre2-8.so.0 => /lib/x86_64-linux-gnu/libpcre2-8.so.0 (0x00007f758d365000) libz.so.1 => /lib/x86_64-linux-gnu/libz.so.1 (0x00007f758d5ba000) libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f758c600000) libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f758c915000) /lib64/ld-linux-x86-64.so.2 (0x00007f758d729000) A backport to all stable versions could be considered at some point.	2024-04-03 15:53:18 +02:00
Aurelien DARRAGON	837a26ab05	BUG/MEDIUM: server/lbprm: fix crash in _srv_set_inetaddr_port() Since faa8c3e ("MEDIUM: lb-chash: Deterministic node hashes based on server address") the following configuration will cause haproxy to crash: backend test1 mode http balance hash int(1) server s1 haproxy.org:80 This is because lbprm.update_server_eweight() method is now systematically called in _srv_set_inetaddr_port() upon srv addr/port change (and with the above config it happens during startup after initial dns resolution). However, depending on the chosen lbprm algo, update_server_eweight function may not be set (it is not a mandatory method, some lb implementations don't define it). Thus, using 'balance hash' with map-based hashing or 'balance sticky' will cause a crash due to a NULL de-reference in _srv_set_inetaddr_port(). To fix the issue, we first check that the update_server_eweight() method is set before using it. No backport needed unless faa8c3e ("MEDIUM: lb-chash: Deterministic node hashes based on server address") gets backported.	2024-04-03 11:58:03 +02:00
Frederic Lecaille	0e14bac7bd	BUILD: quic: 32 bits compilation issue (QUIC_MIN() usage) This issue arrived with this commit: "MINOR: quic: HyStart++ implementation (RFC 9406)" Thanks to @chipitsine for having reported this issue in GH #2513. Should be backported where the previous commit will be backported.	2024-04-03 11:14:50 +02:00
Willy Tarreau	6a2f09de1c	OPTIM: peers: avoid the locking dance around peer_send_teach_process_msgs() In peer_send_msg(), we take a lock before calling peer_send_teach_process_msgs because of the check on the flags and update indexes, and the function then drops it then takes it again just to resume in the same situation, so that on return we can drop it again! Not only this is absurd because it doubles the costs of taking the lock, it's also totally inefficient because it takes a write lock while the only usage that is done with it is to read the indexes! Let's drop the lock from peer_send_teach_process_msgs() and move it explicitly in its only caller around the condition, and turn it into a read lock only.	2024-04-03 09:34:08 +02:00
Willy Tarreau	1ea18fa8a3	BUG/MAJOR: applet: fix a MIN vs MAX usage in appctx_raw_rcv_buf() The MAX() macro was used to limit the count of bytes to be transferred in appctx_raw_rcv_buf() by commit ee53d8421f ("MEDIUM: applet: Simplify a bit API to exchange data with applets") instead of MIN(). It didn't seem to have any consequences until commit f37ddbeb4b ("MAJOR: cli: Update the CLI applet to handle its own buffers") that triggers a BUG_ON() in __b_putblk() when the other side is slow to read, because we're trying to append a full buffer on top of a non-empty one. A way to reproduce it is to dump a heavy stick table on the CLI with a screen scrolling. No backport is needed since this was introduced in 3.0-dev3 and revealed after dev5 only.	2024-04-03 09:34:08 +02:00
Willy Tarreau	ed45d13321	BUG/MEDIUM: stick-table: use the update lock when reading tables from peers In 2.9, the stick-tables' locking was split between the lock used to manipulate the contents (->lock) and the lock used to manipulate the list of updates and the update indexes (->updt_lock). This was done with commit 87e072eea5 ("MEDIUM: stick-table: use a distinct lock for the updates tree"). However a part was overlooked in the peers code, the parts that consult (and update) the indexes use the table's lock instead of the update lock. It's surprising that it hasn't caused more trouble. It's likely due to the fact that the tree nodes are not often immediately freed and that their memory area remains connected to valid nodes in the tree during peer_stksess_lookup(), while other parts only check or update indexes, thus are not that critical. This needs to be backported wherever the commit above is, thus logically 2.9.	2024-04-03 09:33:10 +02:00
Christopher Faulet	3abf6934a4	BUG/MEDIUM: stconn: Don't forward shutdown to SE if iobuf is not empty It is only an issue when the kernel splicing is used. The zero-copy forwarding via the buffers is not affected. When a shutdown is received on the producer side and some data are blocked in the pipe for a while, the shutdown may be forwarded to the other side. Usually, in this case, the shutdown must be scheduled, waiting all output data (from the channel and the consumer's iobuf) are sent. But only the channel was considered. The bug was introduced by commit 20c463955d ("MEDIUM: channel: don't look at iobuf to report an empty channel"). To fix the issue, we must also check data blocked in the consummer iobuf. This patch should solve the issue #2505. It must be backported to 2.9.	2024-04-03 08:46:37 +02:00
Frederic Lecaille	a305bb92b9	MINOR: quic: HyStart++ implementation (RFC 9406) This is a simple algorithm to replace the classic slow start phase of the congestion control algorithms. It should reduce the high packet loss during this step. Implemented only for Cubic.	2024-04-02 18:47:19 +02:00
Willy Tarreau	e9b774f4b3	BUG/MINOR: backend: properly handle redispatch 0 According to the documentation, "option redispatch 0" is expected to disable redispatch just like "no option redispatch", but due to the fact that it keeps PR_O_REDISP set, it doesn't actually work. Let's make sure value 0 is properly handled and drops PR_O_REDISP. This can be backported to all versions since it seems it has been broken since its introduction in 1.6 with commit 726ab7145c ("MEDIUM: backend: Allow redispatch on retry intervals"). As a workaround, "no option redispatch" does work though.	2024-04-02 15:19:18 +02:00
Tim Duesterhus	ec38e1b39b	CLEANUP: Reapply ha_free.cocci This reapplies ha_free.cocci across the whole src/ tree.	2024-04-02 07:27:33 +02:00
Tim Duesterhus	7c317f4619	CLEANUP: Reapply xalloc_cast.cocci This reapplies xalloc_cast.cocci across the whole src/ tree.	2024-04-02 07:27:33 +02:00
Tim Duesterhus	f88ea5949c	CLEANUP: Reapply strcmp.cocci (2) This reapplies strcmp.cocci across the whole src/ tree.	2024-04-02 07:27:33 +02:00
Tim Duesterhus	cd5d62249f	CLEANUP: Reapply ist.cocci (3) This reapplies ist.cocci across the whole src/ tree.	2024-04-02 07:27:33 +02:00
Willy Tarreau	5fc1afb341	BUG/MEDIUM: stick-tables: fix a small remaining race in expiration task In 2.7 we addressed a race condition in the stick tables expiration task with commit fbb934d ("BUG/MEDIUM: stick-table: fix a race condition when updating the expiration task"). The issue was that the task could be running on another thread which would destroy its expiration timer while one had just recalculated it and prepares to queue it, causing a bug due to the attempt to queue an expired task. The fix consisted in enclosing the change into the stick-table's lock, which had a very low cost since it's done only after having checked that the date changed, i.e. no more than once every millisecond. But as reported by Ricardo and Felipe from Taghos in github issue #2508, a tiny race remained after the fix: the unlock() was done before the call to task_queue(), leaving a tiny window for another thread to run between unlock() and task_queue() and erase the timer. As confirmed, it's sufficient to also protect the task_queue() call. But overall this raises a point regarding the task_queue() API on tasks that may run anywhere. A while ago an attempt was made at removing the timer for woken up tasks, but something like this would be deserved with more atomicity on the timer manipulation (e.g. atomically use task_schedule() instead maybe). This should be backported to all stable branches.	2024-04-02 07:07:57 +02:00
Anthony Deschamps	faa8c3e024	MEDIUM: lb-chash: Deterministic node hashes based on server address Motivation: When services are discovered through DNS resolution, the order in which DNS records get resolved and assigned to servers is arbitrary. Therefore, even though two HAProxy instances using chash balancing might agree that a particular request should go to server3, it is likely the case that they have assigned different IPs and ports to the server in that slot. This patch adds a server option, "hash-key <key>" which can be set to "id" (the existing behaviour, default), "addr", or "addr-port". By deriving the keys for the chash tree nodes from a server's address and port we ensure that independent HAProxy instances will agree on routing decisions. If an address is not known then the key is derived from the server's puid as it was previously. When adjusting a server's weight, we now check whether the server's hash has changed. If it has, we have to remove all its nodes first, since the node keys will also have to change.	2024-04-02 07:00:10 +02:00
Amaury Denoyelle	da03396bb3	BUG/BUILD: debug: fix unused variable error A compilation error occurs when using DEBUG_MEM_STATS due to a variable now being unused in debug_iohandler_memstats() : src/debug.c: In function ‘debug_iohandler_memstats’: src/debug.c:1862:24: error: unused variable ‘sc’ [-Werror=unused-variable] 1862 \| struct stconn *sc = appctx_sc(appctx); \| ^~ This is caused since the following commit : 94b8ed446f70a381dde0ea2dc891b62fcc3cc8e1 MEDIUM: cli/applet: Stop to test opposite SC in I/O handler of CLI commands This must not be backported.	2024-03-29 17:21:04 +01:00
Aurelien DARRAGON	3c6dfa618a	MEDIUM: log/balance: leverage lbprm api for log load-balancing log load-balancing implementation was not seamlessly integrated within lbprm API. The consequence is that it could become harder to maintain over time since it added some specific cases just for the log backend. Moreover, it resulted in some code duplication since balance algorithms that are common to logs and regular (tcp, http) backends were specifically rewritten for log backends. Thanks to the previous commit, we now have all the prerequisites to make log load-balancing fully leverage lbprm logic. Thus in this patch we make __do_send_log_backend() use existing lbprm algorithms, and we no longer require log-specific lbprm initialization in cfgparse.c and in postcheck_log_backend(). As a bonus, for log backends this allows weighed algorithms to properly support weights (ie: roundrobin, random and log-hash) since we now leverage the same lb algorithms that we use for tcp/http backends (doc was updated).	2024-03-29 17:08:37 +01:00
Aurelien DARRAGON	9aea6df81f	MINOR: lbprm: implement true "sticky" balance algo As previously mentioned in cd352c0db ("MINOR: log/balance: rename "log-sticky" to "sticky""), let's define a sticky algorithm that may be used from any protocol. Sticky algorithm sticks on the same server as long as it remains available. The documentation was updated accordingly.	2024-03-29 17:08:37 +01:00
Aurelien DARRAGON	d0692d7019	BUG/MINOR: log/balance: detect if user tries to use unsupported algo b61147fd ("MEDIUM: log/balance: merge tcp/http algo with log ones") introduced some ambiguities, because while it shares some algos with the ones from mode {tcp,http}, we forgot report an error when the user tries to use an algorithm that is not available in this mode (as per the doc). Because of that, haproxy would silently drop log messages during runtime. To fix that, we ensure that algo is one of the supported ones during log backend postparsing. If the algo is not supported, we raise an error. This should be backported in 2.9 with b61147fd	2024-03-29 17:08:36 +01:00
Christopher Faulet	87426e82ec	MAJOR: cli: Use a custom .snd_buf function to only copy the current command The CLI applet is now using its own snd_buf callback function. Instead of copying as most output data as possible, only one command is copied at a time. To do so, a new state CLI_ST_PARSEREQ is added for the CLI applet. In this state, the CLI I/O handle knows a full command was copied into its input buffer and it must parse this command to evaluate it.	2024-03-28 17:32:55 +01:00
Christopher Faulet	838fb54de6	MINOR: stconn: Add a connection flag to notify sending data are the last ones This flag can be use by endpoints to know the data to send, via .snd_buf callback function are the last ones. It is useful to know a shutdown is pending but it cannot be delivered while sedning data are not consumed.	2024-03-28 17:32:55 +01:00
Christopher Faulet	a933569b52	MINOR: applet: Let's applets .snd_buf function deal with full input buffers It is now the responsbility of applets .snd_buf callback function to notify the input buffer is full. This will allow the applets to not consume all data waiting for more data. Of course, it is only useful for applets using a custom .snd_buf callback function.	2024-03-28 17:32:55 +01:00
Christopher Faulet	f37ddbeb4b	MAJOR: cli: Update the CLI applet to handle its own buffers It is the third applet to be refactored to use its own buffers. In addition to the CLI applet, some I/O handlers of CLI commands were also updated, especially the stats ones. Some command I/O handlers were updated to use applet's buffers instead of channels ones.	2024-03-28 17:32:51 +01:00
Christopher Faulet	b8ca114031	BUG/MEDIUM: applet: State appctx have more data if its EOI/EOS/ERROR flag is set It is an harmless bug for now because only stats and cache applets are using their own buffers and it is not possible to trigger this bug with these applets. However, it remains important to try a receive if EOI, EOS or ERROR is reached by the applet while no data was produced. Otherwise, it is not possible to ack these events at the SE level. No backport needed.	2024-03-28 17:28:21 +01:00
Christopher Faulet	d2403a412c	MINOR: applet: Always use applet API to set appctx flags Some appctx flags were still set manually while there is a dedicated function to do so. Be sure to always use applet_fl_set() to set appctx flags.	2024-03-28 17:28:20 +01:00
Christopher Faulet	94b8ed446f	MEDIUM: cli/applet: Stop to test opposite SC in I/O handler of CLI commands The main CLI I/O handle is responsible to interrupt the processing on shutdown/abort. It is not the responsibility of the I/O handler of CLI commands to take care of it.	2024-03-28 17:28:20 +01:00
Christopher Faulet	2e6733eb45	MEDIUM: stream: Use generic version to perform sync receives and sends Instead of using connection versions, we now use generic versions. It means we will also perfom sync receives and sync sends on applets too, but only for applets using their own buffers. Old applets are not concerned.	2024-03-28 17:28:20 +01:00
Christopher Faulet	5056cbdb86	MINOR: sc_strm: Add generic version to perform sync receives and sends sc_sync_recv() and sc_sync_send() were added to use connection or applet versions, depending on the endpoint type. For now these functions are not used. But this will be used by process_stream() to replace the connection version.	2024-03-28 17:28:20 +01:00
Christopher Faulet	498520fdf5	BUG/MINOR: cli: Report an error to user if command or payload is too big Too big command, larger than a buffer, was silently rejected by the CLI applet. It was handled as an error and the connection was closed, but no error message was reported to user to notify him. Now an error is reported before closing. It is only displayed if the chunk buffer used by the CLI applet is full and no delimiter (\n or ;) is found to mark the end of the command. It works for a simple command but also for a command with a huge payload. This patch could be backported to all stable versions.	2024-03-28 17:28:20 +01:00
Amaury Denoyelle	6333e6ec8e	MINOR: server: allow cookie for dynamic servers This commit allows "cookie" keyword for dynamic servers. After code review, nothing was found which could prevent a dynamic server to use it. An extra warning is added under cli_parse_add_server() if cookie value is ignored due to a non HTTP backend. This patch is not considered a bugfix. However, it may backported if needed as its impact seems minimal.	2024-03-28 11:54:21 +01:00
Damien Claisse	9a0e0d3a19	BUG/MINOR: server: fix persistence cookie for dynamic servers When adding a server dynamically, we observe that when a backend has a dynamic persistence cookie, the new server has no cookie as we receive the following HTTP header: set-cookie: test-cookie=; Expires=Thu, 01-Jan-1970 00:00:01 GMT; path=/ Whereas we were expecting to receive something like the following, which is what we receive for a server added in the config file: set-cookie: test-cookie=abcdef1234567890; path=/ After investigating code path, srv_set_dyncookie() is never called when adding a server through CLI, it is only called when parsing config file or using "set server bkd1/srv1 addr". To fix this, call srv_set_dyncookie() inside cli_parse_add_server(). This patch must be backported up to 2.4.	2024-03-28 11:54:21 +01:00
Amaury Denoyelle	250c19032f	BUG/MINOR: server: reject enabled for dynamic server Since their first implementation, dynamic servers are created into maintenance state. This has been done purposely to avoid immediate activation of a newly inserted server. However, this principle is incompatible if "enabled" keyword is used on "add server". The newly created instance will be unreacheable as proxy load-balancing algorithm is not informed of its presence via srv_lb_propagate(). The new server could be unblocked by toggling its state with "disable server" / "enable server" commands, which will trigger srv_lb_propagate() invocation. To avoid this unexpected state, simply forbid "enabled" keyword for dynamic servers. In the long-term, it could be possible to re authorize it but at least this requires to call srv_lb_propagate() on dynamic server creation. This should fix github issue #2497. This patch should not be backported as-is, to avoid breaking dynamic servers API on stable versions. "enabled" should instead be ignored for them. This will be implemented in a dedicated patch on top of 2.9.	2024-03-28 11:51:05 +01:00
Remi Tricot-Le Breton	7359c0c7f4	MEDIUM: ssl: Add 'tune.ssl.ocsp-update.mode' global option This option can be used to set a default ocsp-update mode for all certificates of a given conf file. It allows to activate ocsp-update on certificates without the need to create separate crt-lists. It can still be superseded by the crt-list 'ocsp-update' option. It takes either "on" or "off" as value and defaults to "off". Since setting this new parameter to "on" would mean that we try to enable ocsp-update on any certificate, and also certificates that don't have an OCSP URI, the checks performed in ssl_sock_load_ocsp were softened. We don't systematically raise an error when trying to enable ocsp-update on a certificate that does not have an OCSP URI, be it via the global option or the crt-list one. We will still raise an error when a user tries to load a certificate that does have an OCSP URI but a missing issuer certificate (if ocsp-update is enabled).	2024-03-27 11:38:28 +01:00
Remi Tricot-Le Breton	b1d623949c	BUG/MINOR: ssl: Detect more 'ocsp-update' incompatibilities The inconsistencies in 'ocsp-update' parameter were only checked when parsing a crt-list line so if a certificate was used on a bind line after being used in a crt-list with 'ocsp-update' set to 'on', then no error would be raised. This patch helps detect such inconsistencies. This patch can be backported up to branch 2.8.	2024-03-27 11:38:28 +01:00
Remi Tricot-Le Breton	97c2734f44	BUG/MINOR: ssl: Wrong ocsp-update "incompatibility" error message In a crt-list such as the following: foo.pem [ocsp-update off] foo.com foo.pem bar.com we would get a wrong "Incompatibilities found in OCSP update mode ..." error message during init when the two lines are actually saying the same thing since the default for 'ocsp-update' option is 'off'. This patch can be backported up to branch 2.8.	2024-03-27 11:38:28 +01:00
Willy Tarreau	40d1c84bf0	BUG/MAJOR: ring: free the ring storage not the ring itself when using maps A recent issue was uncovered by the CI which started to randomly report segfaults on a few tests, and more systematically on FreeBSD. It turn out that it was introduced by recent commit 03816ccfa9 ("MAJOR: ring: insert an intermediary ring_storage level"), which overlooked the munmap() path of the sink and startup logs: once the ring and its storage were split, it was no longer correct to munmap() the ring, only its storage area needs to be unmapped, and the ring must always be freed separately. Thanks to Christopher and William for their help at trying to reproduce it and figure the circumstances that triggers it. No backport is needed.	2024-03-26 15:15:59 +01:00
Aurelien DARRAGON	bd98db5078	BUG/MINOR: server: 'source' interface ignored from 'default-server' directive Sebastien Gross reported that 'interface' keyword ('source' subargument) is silently ignored when used from 'default-server' directive despite the documentation implicitly stating that the keyword should be supported there. When support for 'source' keyword was added to 'default-server' directive in dba97077 ("MINOR: server: Make 'default-server' support 'source' keyword."), we properly duplicated the conn iface_name from the default- server but we forgot to copy the conn iface_len which must be set as well since it is used as setsockopt()'s 'optlen' argument in tcp_connect_server(). It should be backported to all stable versions.	2024-03-26 11:09:02 +01:00
Willy Tarreau	2431b20640	BUILD: ssl: fix build error on older compilers with openssl-3.2 OpenSSL 3.2 triggers the code part added by commit 25da217 ("MINOR: ssl: Update ssl_fc_curve/ssl_bc_curve to use SSL_get0_group_name") which contains a variable declaration in the for() statement and breaks on older compilers, as reported in GH issues #2501. Let's just declare it normally to fix the problem. This must be backported wherever the commit above is (at least 2.9).	2024-03-25 21:21:47 +01:00
Willy Tarreau	4bc81ec985	CLEANUP: ring: use only curr_cell and not next_cell in the main write loop It turns out that we can reduce by one variable in the loop and this clobbers one less register, making it slightly faster on Cortex A72.	2024-03-25 17:34:19 +00:00
Willy Tarreau	0a0a64ef02	OPTIM: ring: use relaxed stores to release the threads We don't care in what order the threads are released, so we can write their sent value using relaxed atomic stores. This brings a 3-5% perf boost on ARM with 80 cores, reaching 7.25M/s, and doesn't change anything on x86 since it keeps using strict ordering.	2024-03-25 17:34:19 +00:00
Willy Tarreau	cabe945876	MINOR: ring: avoid writes to cells during copy It has been found that performing a first pass consisting in copying all messages, and a second one to notify about releases is more efficient on AMD than updating all of them on the fly using a CAS, despite making writers wait longer to be released. Maybe it's related to the ability for the CPU to prefetch the contents during a simple load while it wouldn't do it for an XCHG, it's unsure at this point. This will also mater permit to use relaxed stores to release threads. On ARM the performance increased to 7.0M/s. If this patch is applied before the dropping of the intermediary step, instead it drops to 3.9M/s. This shows the dependency between such changes that strive to limit the number of writes on the fast path. On x86_64, the EPYC at 3C6T saw a small drop from 4.57M to 4.45M, but the 24C48T setup saw a nice 33% boost from 3.33M to 4.44M, i.e. we get stable perf at 3 and 24 cores, despite having 8 CCX involved and fighting with each other. Other possibilities are: - use of HA_ATOMIC_XCHG() instead of FETCH_OR() => slightly faster (4.62/7.37 vs 4.58/7.34). Pb: requires to modify the readers to wait much longer since the tail value won't be valid in this case during updates, and it will have to wait by looping over it. - use other conditions to release a cell => to be tested	2024-03-25 17:34:19 +00:00
Willy Tarreau	39df8c903d	MINOR: ring: it's not x86 but all non-ARMv8.1 which needs the read before OR Archs relying on CAS benefit from a read prior to FETCH_OR, so it's not just x86 that benefits from this. Let's just change the condition to only exclude __ARM_FEATURE_ATOMICS which is the only one faster without.	2024-03-25 17:34:19 +00:00
Willy Tarreau	e6fc167aec	CLEANUP: ring: further simplify the write loop The loop was cleaned up a little bit so that the inner loops are more readable and that the ifdef'd parts are whole blocks and not just an "if" condition. A few conditions were adjusted to benefit from "break" and "continue".	2024-03-25 17:34:19 +00:00

... 29 30 31 32 33 ...

18755 Commits