haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-20 14:11:19 +02:00

Author	SHA1	Message	Date
Remi Tricot-Le Breton	a996763619	BUG/MINOR: ssl: Store client SNI in SSL context in case of ClientHello error If an error is raised during the ClientHello callback on the server side (ssl_sock_switchctx_cbk), the servername callback won't be called and the client's SNI will not be saved in the SSL context. But since we use the SSL_get_servername function to return this SNI in the ssl_fc_sni sample fetch, that means that in case of error, such as an SNI mismatch with a frontend having the strict-sni option enabled, the sample fetch would not work (making strict-sni related errors hard to debug). This patch fixes that by storing the SNI as an ex_data in the SSL context in case the ClientHello callback returns an error. This way the sample fetch can fallback to getting the SNI this way. It will still first call the SSL_get_servername function first since it is the proper way of getting a client's SNI when the handshake succeeded. In order to avoid memory allocations are runtime into this highly used runtime function, a new memory pool was created to store those client SNIs. Its entry size is set to 256 bytes since SNIs can't be longer than 255 characters. This fixes GitHub #1484. It can be backported in 2.5.	2022-01-10 16:31:22 +01:00
William Lallemand	f82afbb9cd	BUG/MEDIUM: mworker: don't use _getsocks in wait mode Since version 2.5 the master is automatically re-executed in wait-mode when the config is successfully loaded, puting corner cases of the wait mode in plain sight. When using the -x argument and with the right timing, the master will try to get the FDs again in wait mode even through it's not needed anymore, which will harm the worker by removing its listeners. However, if it fails, (and it's suppose to, sometimes), the master will exit with EXIT_FAILURE because it does not have the MODE_MWORKER flag, but only the MODE_MWORKER_WAIT flag. With the consequence of killing the workers. This patch fixes the issue by restricting the use of _getsocks to some modes. This patch must be backported in every version supported, even through the impact should me more harmless in version prior to 2.5.	2022-01-07 18:44:27 +01:00
Frédéric Lécaille	99942d6f4c	MINOR: quic: Non-optimal use of a TX buffer When full, after having reset the writer index, let's reuse the TX buffer in any case.	2022-01-07 17:58:26 +01:00
Frédéric Lécaille	f010f0aaf2	MINOR: quic: Missing retransmission from qc_prep_fast_retrans() In fact we must look for the first packet with some ack-elicting frame to in the packet number space tree to retransmit from. Obviously there may be already retransmit packets which are not deemed as lost and still present in the packet number space tree for TX packets.	2022-01-07 17:58:26 +01:00
Frédéric Lécaille	d4ecf94827	MINOR: quic: Only one CRYPTO frame by encryption level When receiving CRYPTO data from the TLS stack, concatenate the CRYPTO data to the first allocated CRYPTO frame if present. This reduces by one the number of handshake packets built for a connection with a standard size certificate.	2022-01-07 17:58:26 +01:00
Ilya Shipitsin	37d3e38130	CLEANUP: assorted typo fixes in the code and comments This is 30th iteration of typo fixes	2022-01-07 14:42:54 +01:00
David CARLIER	df91cbd584	MINOR: cpuset: switch to sched_setaffinity for FreeBSD 14 and above. Following up previous update on cpuset-t.h. Ultimately, at some point the cpuset_setaffinity code path could be removed.	2022-01-07 06:53:51 +01:00
William Dauchy	a9dd901143	MINOR: proxy: add option idle-close-on-response Avoid closing idle connections if a soft stop is in progress. By default, idle connections will be closed during a soft stop. In some environments, a client talking to the proxy may have prepared some idle connections in order to send requests later. If there is no proper retry on write errors, this can result in errors while haproxy is reloading. Even though a proper implementation should retry on connection/write errors, this option was introduced to support back compat with haproxy < v2.4. Indeed before v2.4, we were waiting for a last request to be able to add a "connection: close" header and advice the client to close the connection. In a real life example, this behavior was seen in AWS using the ALB in front of a haproxy. The end result was ALB sending 502 during haproxy reloads. This patch was tested on haproxy v2.4, with a regular reload on the process, and a constant trend of requests coming in. Before the patch, we see regular 502 returned to the client; when activating the option, the 502 disappear. This patch should help fixing github issue #1506. In order to unblock some v2.3 to v2.4 migraton, this patch should be backported up to v2.4 branch. Signed-off-by: William Dauchy <wdauchy@gmail.com> [wt: minor edits to the doc to mention other options to care about] Signed-off-by: Willy Tarreau <w@1wt.eu>	2022-01-06 09:09:51 +01:00
Frédéric Lécaille	6b6631593f	MINOR: quic: Re-arm the PTO timer upon datagram receipt When block by the anti-amplification limit, this is the responsability of the client to unblock it sending new datagrams. On the server side, even if not well parsed, such datagrams must trigger the PTO timer arming.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	078634d126	MINOR: quic: PTO timer too often reset It must be reset when the anti-amplication was reached but only if the peer address was not validated.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	41a076087b	MINOR: quic: Flag asap the connection having reached the anti-amplification limit The best location to flag the connection is just after having built the packet which reached the anti-amplication limit.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	de6f7c503e	MINOR: quic: Prepare Handshake packets asap after completed handshake Switch back to QUIC_HS_ST_SERVER_HANDSHAKE state after a completed handshake if acks must be send. Also ensure we build post handshake frames only one time without using prev_st variable and ensure we discard the Handshake packet number space only one time.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	917a7dbdc7	MINOR: quic: Do not drop secret key but drop the CRYPTO data We need to be able to decrypt late Handshake packets after the TLS secret keys have been discarded. If not the peer send Handshake packet which have not been acknowledged. But for such packets, we discard the CRYPTO data.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	ee2b8b377f	MINOR: quic: Improve qc_prep_pkts() flexibility We want to be able to choose the encryption levels to be used by qc_prep_pkts() outside of it.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	f7ef97698a	MINOR: quic: Comment fix. When we drop a packet with unknown length, this is the entire datagram which must be skipped.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	a56054e438	MINOR: quic: Probe several packet number space upon timer expiration When the loss detection timer expires, we SHOULD include new data in our probing packets (RFC 9002 par 6.2.4. Sending Probe Packets).	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	db6a4727cf	MINOR: quic: Probe Initial packet number space more often Especially when the PTO expires for Handshake packet number space and when Initial packets are still flying (for QUIC servers).	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	3bb457c4ba	MINOR: quic: Speeding up Handshake Completion According to RFC 9002 par. 6.2.3. when receving duplicate Initial CRYPTO data a server may a packet containing non unacknowledged before the PTO expiry.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	63556772cc	MINOR: quic: qc_prep_pkts() code moving Move the switch default case code out of the switch to improve the readibily.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	5732062cd2	MINOR: quic: Useless test in qc_prep_pkts() These tests were there to initiate PTO probing but they are not correct. Furthermore they may break the PTO probing process and lead to useless packet building.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	dd51da599e	MINOR: quic: Wrong packet number space trace in qc_prep_pkts() It was always the first packet number space information which was dumped.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	466e9da145	MINOR: quic: Remove nb_pto_dgrams quic_conn struct member For now on we rely on tx->pto_probe pktns struct member to inform the packet building function we want to probe.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	22576a2e55	MINOR: quic: Wrong ack_delay compution before calling quic_loss_srtt_update() RFC 9002 5.3. Estimating smoothed_rtt and rttvar: MUST use the lesser of the acknowledgment delay and the peer's max_ack_delay after the handshake is confirmed.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	dc90c07715	MINOR: quic: Wrong loss time computation in qc_packet_loss_lookup() This part as been modified by the RFC since our first implementation.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	22cfd83890	MINOR: quic: Add trace about in flight bytes by packet number space This parameter is useful to diagnose packet loss detection issues.	2022-01-04 17:30:00 +01:00
Frédéric Lécaille	fde2a98dd1	MINOR: quic: Wrong traces after rework TRACE_*() macros must take a quic_conn struct as first argument.	2022-01-04 17:30:00 +01:00
Christopher Faulet	7bf46bb9a9	BUG/MEDIUM: http-ana: Preserve response's FLT_END analyser on L7 retry When a filter is attached on a stream, the FLT_END analyser must not be removed from the response channel on L7 retry. It is especially important because CF_FLT_ANALYZE flag is still set. This means the synchronization between the two sides when the filter ends can be blocked. Depending on the timing, this can freeze the stream infinitely or lead to a spinning loop. Note that the synchronization between the two sides at the end of the analysis was introduced because the stream was reused in HTTP between two transactions. But, since the HTX was introduced, a new stream is created for each transaction. So it is probably possible to remove this step for 2.2 and higher. This patch must be backported as far as 2.0.	2022-01-04 10:56:04 +01:00
David Carlier	ae5c42f4d0	BUILD/MINOR: tools: solaris build fix on dladdr. dladdr takes a mutable address on this platform.	2022-01-03 14:43:51 +01:00
Ilya Shipitsin	5e87bcf870	CLEANUP: assorted typo fixes in the code and comments This is 29th iteration of typo fixes	2022-01-03 14:40:58 +01:00
Willy Tarreau	1513c5479a	MEDIUM: pools: release cached objects in batches With this patch pool_evict_last_items builds clusters of up to CONFIG_HAP_POOL_CLUSTER_SIZE entries so that accesses to the shared pools are reduced by CONFIG_HAP_POOL_CLUSTER_SIZE and the inter- thread contention is reduced by as much..	2022-01-02 19:35:26 +01:00
Willy Tarreau	43937e920f	MEDIUM: pools: start to batch eviction from local caches Since previous patch we can forcefully evict multiple objects from the local cache, even when evicting basd on the LRU entries. Let's define a compile-time configurable setting to batch releasing of objects. For now we set this value to 8 items per round. This is marked medium because eviction from the LRU will slightly change in order to group the last items that are freed within a single cache instead of accurately scanning only the oldest ones exactly in their order of appearance. But this is required in order to evolve towards batched removals.	2022-01-02 19:35:26 +01:00
Willy Tarreau	a0b5831eed	MEDIUM: pools: centralize cache eviction in a common function We currently have two functions to evict cold objects from local caches: pool_evict_from_local_cache() to evict from a single cache, and pool_evict_from_local_caches() to evict oldest objects from all caches. The new function pool_evict_last_items() focuses on scanning oldest objects from a pool and releasing a predefined number of them, either to the shared pool or to the system. For now they're evicted one at a time, but the next step will consist in creating clusters.	2022-01-02 19:35:26 +01:00
Willy Tarreau	337410c5a4	MINOR: pools: pass the objects count to pool_put_to_shared_cache() This is in order to let the caller build the cluster of items to be released. For now single items are released hence the count is always 1.	2022-01-02 19:35:26 +01:00
Willy Tarreau	148160b027	MINOR: pools: prepare pool_item to support chained clusters In order to support batched allocations and releases, we'll need to prepare chains of items linked together and that can be atomically attached and detached at once. For this we implement a "down" pointer in each pool_item that points to the other items belonging to the same group. For now it's always NULL though freeing functions already check them when trying to release everything.	2022-01-02 19:35:26 +01:00
Willy Tarreau	361e31e3fe	MEDIUM: pool: compute the number of evictable entries once per pool In pool_evict_from_local_cache() we used to check for room left in the pool for each and every object. Now we compute the value before entering the loop and keep into a local list what has to be released, and call the OS-specific functions for the other ones. It should already save some cycles since it's not needed anymore to recheck for the pool's filling status. But the main expected benefit comes from the ability to pre-construct a list of all releasable objects, that will later help with grouping them.	2022-01-02 19:35:26 +01:00
Willy Tarreau	c16ed3b090	MINOR: pool: introduce pool_item to represent shared pool items In order to support batch allocation from/to shared pools, we'll have to support a specific representation for pool objects. The new pool_item structure will be used for this. For now it only contains a "next" pointer that matches exactly the current storage model. The few functions that deal with the shared pool entries were adapted to use the new type. There is no functionality difference at this point.	2022-01-02 19:35:26 +01:00
Willy Tarreau	b46674a283	MINOR: pool: check for pool's fullness outside of pool_put_to_shared_cache() Instead of letting pool_put_to_shared_cache() pass the object to the underlying OS layer when there's no more room, let's have the caller check if the pool is full and either call pool_put_to_shared_cache() or call pool_free_nocache(). Doing this sensibly simplifies the code as this function now only has to deal with a pool and an item and only for cases where there are local caches and shared caches. As the code was simplified and the calls more isolated, the function was moved to pool.c. Note that it's only called from pool_evict_from_local_cache{,s}() and that a part of its logic might very well move there when dealing with batches.	2022-01-02 19:35:26 +01:00
Willy Tarreau	afe2c4a1fc	MINOR: pool: allocate from the shared cache through the local caches One of the thread scaling challenges nowadays for the pools is the contention on the shared caches. There's never any situation where we have a shared cache and no local cache anymore, so we can technically afford to transfer objects from the shared cache to the local cache before returning them to the user via the regular path. This adds a little bit more work per object per miss, but will permit batch processing later. This patch simply moves pool_get_from_shared_cache() to pool.c under the new name pool_refill_local_from_shared(), and this function does not return anything but it places the allocated object at the head of the local cache.	2022-01-02 19:27:57 +01:00
Willy Tarreau	8c4927098e	CLEANUP: pools: get rid of the POOL_LINK macro The POOL_LINK macro is now only used for debugging, and it still requires ifdefs around, which needlessly complicates the code. Let's replace it and the calling code with a new pair of macros: POOL_DEBUG_SET_MARK() and POOL_DEBUG_CHECK_MARK(), that respectively store and check the pool pointer in the extra location at the end of the pool. This removes 4 pairs of ifdefs in the middle of the code.	2022-01-02 12:44:19 +01:00
Willy Tarreau	799f6143ca	CLEANUP: pools: do not use the extra pointer to link shared elements This practice relying on POOL_LINK() dates from the era where there were no pool caches, but given that the structures are a bit more complex now and that pool caches do not make use of this feature, it is totally useless since released elements have already been overwritten, and yet it complicates the architecture and prevents from making simplifications and optimizations. Let's just get rid of this feature. The pointer to the origin pool is preserved though, as it helps detect incorrect frees and serves as a canary for overflows.	2022-01-02 12:44:19 +01:00
Willy Tarreau	d5ec100661	MINOR: pools: always evict oldest objects first in pool_evict_from_local_cache() For an unknown reason, despite the comment stating that we were evicting oldest objects first from the local caches, due to the use of LIST_NEXT, the newest were evicted, since pool_put_to_cache() uses LIST_INSERT(). Some tests on 16 threads show that evicting oldest objects instead can improve performance by 0.5-1% especially when using shared pools.	2022-01-02 12:40:14 +01:00
William Lallemand	e69563fd8e	BUG/MEDIUM: ssl: free the ckch instance linked to a server This patch unlinks and frees the ckch instance linked to a server during the free of this server. This could have locked certificates in a "Used" state when removing servers dynamically from the CLI. And could provoke a segfault once we try to dynamically update the certificate after that. This must be backported as far as 2.4.	2021-12-30 16:56:52 +01:00
William Lallemand	231610ad9c	BUG/MINOR: ssl: free the fields in srv->ssl_ctx A lot of free are missing in ssl_sock_free_srv_ctx(), this could result in memory leaking when removing dynamically a server via the CLI. This must be backported in every branches, by removing the fields that does not exist in the previous branches.	2021-12-30 13:43:04 +01:00
William Lallemand	2c776f1c30	BUG/MEDIUM: ssl: initialize correctly ssl w/ default-server This bug was introduced by d817dc73 ("MEDIUM: ssl: Load client certificates in a ckch for backend servers") in which the creation of the SSL_CTX for a server was moved to the configuration parser when using a "crt" keyword instead of being done in ssl_sock_prepare_srv_ctx(). The patch 0498fa40 ("BUG/MINOR: ssl: Default-server configuration ignored by server") made it worse by setting the same SSL_CTX for every servers using a default-server. Resulting in any SSL option on a server applied to every server in its backend. This patch fixes the issue by reintroducing a string which store the path of certificate inside the server structure, and loading the certificate in ssl_sock_prepare_srv_ctx() again. This is a quick fix to backport, a cleaner way can be achieve by always creating the SSL_CTX in ssl_sock_prepare_srv_ctx() and splitting properly the ssl_sock_load_srv_cert() function. This patch fixes issue #1488. Must be backported as far as 2.4.	2021-12-29 14:42:16 +01:00
Willy Tarreau	654726db5a	MINOR: debug: add support for -dL to dump library names at boot This is a second help to dump loaded library names late at boot, once external code has already been initialized. The purpose is to provide a format that makes it easy to pass to "tar" to produce an archive containing the executable and the list of dependencies. For example if haproxy is started as "haproxy -f foo.cfg", a config check only will suffice to quit before starting, "-q" will be used to disable undesired output messages, and -dL will be use to dump libraries. This will result in such a command to trivially produce a tarball of loaded libraries: ./haproxy -q -c -dL -f foo.cfg \| tar -T - -hzcf archive.tgz	2021-12-28 17:07:13 +01:00
Willy Tarreau	6ab7b21a11	MINOR: debug: add ability to dump loaded shared libraries Many times core dumps reported by users who experience trouble are difficult to exploit due to missing system libraries. Sometimes, having just a list of loaded libraries and their respective addresses can already provide some hints about some problems. This patch makes a step in that direction by adding a new "show libs" command that will try to enumerate the list of object files that are loaded in memory, relying on the dynamic linker for this. It may also be used to detect that some foreign code embarks other undesired libs (e.g. some external Lua modules). At the moment it's only supported on glibc when USE_DL is set, but it's implemented in a way that ought to make it reasonably easy to be extended to other platforms.	2021-12-28 16:59:00 +01:00
Willy Tarreau	b4ff6f4ae9	BUG/MEDIUM: peers: properly skip conn_cur from incoming messages The approach used for skipping conn_cur in commit db2ab8218 ("MEDIUM: stick-table: never learn the "conn_cur" value from peers") was wrong, it only works with simple tables but as soon as frequency counters or arrays are exchanged after conn_cur, the stream is desynchronized and incorrect values are read. This is because the fields have a variable length depending on their types and cannot simply be skipped by a "continue" statement. Let's change the approach to make sure we continue to completely parse these local-only fields, and only drop the value at the moment we're about to store them, since this is exactly the intent. A simpler approach could consist in having two sets of stktable_data_ptr() functions, one for retrieval and one for storage, and to make the store function return a NULL pointer for local types. For now this doesn't seem worth the trouble. This fixes github issue #1497. Thanks to @brenc for the reproducer. This must be backported to 2.5.	2021-12-24 13:48:39 +01:00
Willy Tarreau	266d540549	BUG/MEDIUM: backend: fix possible sockaddr leak on redispatch A subtle change of target address allocation was introduced with commit 68cf3959b ("MINOR: backend: rewrite alloc of stream target address") in 2.4. Prior to this patch, a target address was allocated by function assign_server_address() only if none was previously allocated. After the change, the allocation became unconditional. Most of the time it makes no difference, except when we pass multiple times through connect_server() with SF_ADDR_SET cleared. The most obvious fix would be to avoid allocating that address there when already set, but the root cause is that since introduction of dynamically allocated addresses, the SF_ADDR_SET flag lies. It can be cleared during redispatch or during a queue redistribution without the address being released. This patch instead gives back all its correct meaning to SF_ADDR_SET and guarantees that when not set no address is allocated, by freeing that address at the few places the flag is cleared. The flag could even be removed so that only the address is checked but that would require to touch many areas for no benefit. The easiest way to test it is to send requests to a proxy with l7 retries enabled, which forwards to a server returning 500: defaults mode http timeout client 1s timeout server 1s timeout connect 1s retry-on all-retryable-errors retries 1 option redispatch listen proxy bind *:5000 server app 0.0.0.0:5001 frontend dummy-app bind :5001 http-request return status 500 Issuing "show pools" on the CLI will show that pool "sockaddr" grows as requests are redispatched, and remains stable with the fix. Even "ps" will show that the process' RSS grows by ~160B per request. This fix will need to be backported to 2.4. Note that before 2.5, there's no strm->si[1].dst, strm->target_addr must be used instead. This addresses github issue #1499. Special thanks to Daniil Leontiev for providing a well-documented reproducer.	2021-12-24 11:50:01 +01:00
Amaury Denoyelle	9979d0d1ea	BUG/MINOR: quic: fix potential use of uninit pointer Properly initialized the ssl_sock_ctx pointer in qc_conn_init. This is required to avoid to set an undefined pointer in qc.xprt_ctx if argument *xprt_ctx is NULL.	2021-12-23 16:33:47 +01:00
Amaury Denoyelle	c6fab98f9b	BUG/MINOR: quic: fix potential null dereference This is not a real issue because found_in_dcid can not be set if qc is NULL.	2021-12-23 16:32:19 +01:00

... 67 68 69 70 71 ...

16131 Commits