haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 17:17:06 +02:00

Author	SHA1	Message	Date
Christopher Faulet	0a52c17f81	BUG/MEDIUM: lb-chash: Ensure the tree integrity when server weight is increased When the server weight is increased in consistant hash, extra nodes have to be allocated. So a realloc() is performed on the nodes array of the server. the previous commit 962ea7732 ("BUG/MEDIUM: lb-chash: Remove all server's entries before realloc() to re-insert them after") have fixed the size used during the realloc() to avoid segfaults. But another bug remains. After the realloc(), the memory area allocated for the nodes array may change, invalidating all node addresses in the chash tree. So, to fix the bug, we must remove all server's entries from the chash tree before the realloc to insert all of them after, old nodes and new ones. The insert will be automatically handled by the loop at the end of the function chash_queue_dequeue_srv(). Note that if the call to realloc() failed, no new entries will be created for the server, so the effective server weight will be unchanged. This issue was reported on Github (#189). This patch must be backported to all versions since the 1.6.	2019-08-01 11:35:29 +02:00
Christopher Faulet	366ad86af7	BUG/MEDIUM: lb-chash: Fix the realloc() when the number of nodes is increased When the number of nodes is increased because the server weight is changed, the nodes array must be realloc. But its new size is not correctly set. Only the total number of nodes is used to set the new size. But it must also depends on the size of a node. It must be the total nomber of nodes times the size of a node. This issue was reported on Github (#189). This patch must be backported to all versions since the 1.6.	2019-07-26 14:12:59 +02:00
Willy Tarreau	76e84f5091	MINOR: backend: move hash_balance_factor out of chash This one is a proxy option which can be inherited from defaults even if the LB algo changes. Move it out of the lb_chash struct so that we don't need to keep anything separate between these structs. This will allow us to merge them into an union later. It even takes less room now as it fills a hole and removes another one.	2019-01-14 19:33:17 +01:00
Willy Tarreau	59884a646c	MINOR: lb: allow redispatch when using consistent hash Redispatch traditionally only worked for cookie based persistence. Adding redispatch support for consistent hash based persistence - also update docs. Reported by Oskar Stenman on discourse: https://discourse.haproxy.org/t/balance-uri-consistent-hashing-redispatch-3-not-redispatching/3344 Should be backported to 1.8. Cc: Lukas Tribus <lukas@ltri.eu>	2019-01-02 20:22:17 +01:00
Willy Tarreau	1b87748ff5	BUG/MEDIUM: lb/threads: always properly lock LB algorithms on maintenance operations Since commit `3ff577e` ("MAJOR: server: make server state changes synchronous again"), srv_update_status() calls the various maintenance operations of the LB algorithms (->set_server_up, ->set_server_down, ->update_server_weight()). These ones are called with a single thread guaranteed by the rendez-vous point, so the fact that they're lacking some locks has no effect. However we'll need to remove the rendez-vous point so we have to take care of properly locking all the LB algos. The comments have been properly updated on the various functions to mention their locking expectations. All these functions are called with the server lock held, and all of them now support concurrent calls by using the lbprm's lock. This fix doesn't need to be backported at the moment, though if any check-specific issue surfaced in 1.8, it could make sense to reuse it.	2018-08-21 19:44:53 +02:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Willy Tarreau	1ed90ac377	BUG/MAJOR: threads/lb: fix missing unlock on consistent hash LB If no matching node was found, the function was left without unlocking the tree.	2017-11-05 10:54:50 +01:00
Christopher Faulet	5b51755aef	MEDIUM: threads/lb: Make LB algorithms (lb_*.c) thread-safe A lock for LB parameters has been added inside the proxy structure and atomic operations have been used to update server variables releated to lb. The only significant change is about lb_map. Because the servers status are updated in the sync-point, we can call recalc_server_map function synchronously in map_set_server_status_up/down function.	2017-10-31 13:58:31 +01:00
Olivier Houchard	f8eb8d56a7	MINOR: server: Handle weight increase in consistent hash. When the server weight is rised using the CLI, extra nodes have to be allocated, or the weight will be effectively the same as the original one. [wt: given that the doc made no explicit mention about this limitation, this patch could even be backported as it fixes an unexpected behaviour]	2017-10-17 18:08:38 +02:00
Emeric Brun	52a91d3d48	MEDIUM: check: server states and weight propagation re-work The server state and weight was reworked to handle "pending" values updated by checks/CLI/LUA/agent. These values are commited to be propagated to the LB stack. In further dev related to multi-thread, the commit will be handled into a sync point. Pending values are named using the prefix 'next_' Current values used by the LB stack are named 'cur_'	2017-09-05 15:23:16 +02:00
Andrew Rodland	18330ab17f	BUG/MINOR: hash-balance-factor isn't effective in certain circumstances in chash_get_server_hash, we find the nearest server entries both before and after the request hash. If the next and prev entries both point to the same server, the function would exit early and return that server, to save work. Before hash-balance-factor this was a valid optimization -- one of nsrv and psrv would definitely be chosen, so if they are the same there's no need to choose between them. But with hash-balance-factor it's possible that adding another request to that server would overload it (chash_server_is_eligible returns false) and we go further around the ring. So it's not valid to return before checking for that. This commit simply removes the early return, as it provides a minimal savings even when it's correct.	2017-04-26 15:45:27 +02:00
Andrew Rodland	4f88c63609	MEDIUM: server: Implement bounded-load hash algorithm The consistent hash lookup is done as normal, then if balancing is enabled, we progress through the hash ring until we find a server that doesn't have "too much" load. In the case of equal weights for all servers, the allowed number of requests for a server is either the floor or the ceil of (num_requests * hash-balance-factor / num_servers); with unequal weights things are somewhat more complicated, but the spirit is the same -- a server should not be able to go too far above (its relative weight times) the average load. Using the hash ring to make the second/third/etc. choice maintains as much locality as possible given the load limit. Signed-off-by: Andrew Rodland <andrewr@vimeo.com>	2016-10-25 20:21:32 +02:00
Vincent Bernat	3c2f2f207f	CLEANUP: remove unneeded casts In C89, "void " is automatically promoted to any pointer type. Casting the result of malloc/calloc to the type of the LHS variable is therefore unneeded. Most of this patch was built using this Coccinelle patch: @@ type T; @@ - (T ) (\(lua_touserdata\\|malloc\\|calloc\\|SSL_get_app_data\\|hlua_checkudata\\|lua_newuserdata\)(...)) @@ type T; T x; void data; @@ x = - (T ) data @@ type T; T x; T data; @@ x = - (T ) data Unfortunately, either Coccinelle or I is too limited to detect situation where a complex RHS expression is of type "void *" and therefore casting is not needed. Those cases were manually examined and corrected.	2016-04-03 14:17:42 +02:00
Willy Tarreau	c93cd16b6c	REORG/MEDIUM: server: split server state and flags in two different variables Till now, the server's state and flags were all saved as a single bit field. It causes some difficulties because we'd like to have an enum for the state and separate flags. This commit starts by splitting them in two distinct fields. The first one is srv->state (with its counter-part srv->prev_state) which are now enums, but which still contain bits (SRV_STF_*). The flags now lie in their own field (srv->flags). The function srv_is_usable() was updated to use the enum as input, since it already used to deal only with the state. Note that currently, the maintenance mode is still in the state for simplicity, but it must move as well.	2014-05-22 11:27:00 +02:00
Willy Tarreau	87eb1d6994	MINOR: server: create srv_was_usable() from srv_is_usable() and use a pointer We used to call srv_is_usable() with either the current state and weights or the previous ones. This causes trouble for future changes, so let's first split it in two variants : - srv_is_usable(srv) considers the current status - srv_was_usable(srv) considers the previous status	2014-05-13 22:34:55 +02:00
Willy Tarreau	c5150dafd8	MINOR: server: use functions to detect state changes and to update them Detecting that a server's status has changed is a bit messy, as well as it is to commit the status changes. We'll have to add new conditions soon and we'd better avoid to multiply the number of touched locations with the high risk of forgetting them. This commit introduces : - srv_lb_status_changed() to report if the status changed from the previously committed one ; - svr_lb_commit_status() to commit the current status The function is now used by all load-balancing algorithms.	2014-05-13 22:18:22 +02:00
Willy Tarreau	004e045f31	BUG/MAJOR: server: weight calculation fails for map-based algorithms A crash was reported by Igor at owind when changing a server's weight on the CLI. Lukas Tribus could reproduce a related bug where setting a server's weight would result in the new weight being multiplied by the initial one. The two bugs are the same. The incorrect weight calculation results in the total farm weight being larger than what was initially allocated, causing the map index to be out of bounds on some hashes. It's easy to reproduce using "balance url_param" with a variable param, or with "balance static-rr". It appears that the calculation is made at many places and is not always right and not always wrong the same way. Thus, this patch introduces a new function "server_recalc_eweight()" which is dedicated to this task of computing ->eweight from many other elements including uweight and current time (for slowstart), and all users now switch to use this function. The patch is a bit large but the code was not trivially fixable in a way that could guarantee this situation would not occur anymore. The fix is much more readable and has been verified to work with all algorithms, with both consistent and map-based hashes, and even with static-rr. Slowstart was tested as well, just like enable/disable server. The same bug is very likely present in 1.4 as well, so the patch will probably need to be backported eventhough it will not apply as-is. Thanks to Lukas and Igor for the information they provided to reproduce it.	2013-11-21 15:09:02 +01:00
Willy Tarreau	d16a1b2a81	BUG/MAJOR: backend: consistent hash can loop forever in certain circumstances When the parameter passed to a consistent hash is not found, we fall back to round-robin using chash_get_next_server(). This one stores the last visited server in lbprm.chash.last, which can be NULL upon the first invocation or if the only server was recently brought up. The loop used to scan for a server is able to skip the previously attempted server in case of a redispatch, by passing this previous server in srvtoavoid. For this reason, the loop stops when the currently considered server is different from srvtoavoid and different from the original chash.last. A problem happens in a special sequence : if a connection to a server fails, then all servers are removed from the farm, then the original server is added again before the redispatch happens, we have chash.last = NULL and srvtoavoid set to the only server in the farm. Then this server is always equal to srvtoavoid and never to NULL, and the loop never stops. The fix consists in assigning the stop point to the first encountered node if it was not yet set. This issue cannot happen with the map-based algorithm since it's based on an index and not a stop point. This issue was reported by Henry Qian who kindly provided lots of critically useful information to figure out the conditions to reproduce the issue. The fix needs to be backported to 1.4 which is also affected.	2013-04-12 14:46:51 +02:00
Willy Tarreau	798a39cdc9	[MEDIUM] hash: add support for an 'avalanche' hash-type When the number of servers is a multiple of the size of the input set, map-based hash can be inefficient. This typically happens with 64 servers when doing URI hashing. The "avalanche" hash-type applies an avalanche hash before performing a map lookup in order to smooth the distribution. The result is slightly less smooth than the map for small numbers of servers, but still better than the consistent hashing.	2010-11-29 07:28:16 +01:00
Willy Tarreau	4c14eaa0d4	[CLEANUP] hash: move the avalanche hash code globally available We'll use this hash at other places, let's make it globally available. The function has also been renamed because its "chash_hash" name was not appropriate.	2010-11-29 07:28:16 +01:00
Willy Tarreau	45cb4fb640	[MEDIUM] build: switch ebtree users to use new ebtree version All files referencing the previous ebtree code were changed to point to the new one in the ebtree directory. A makefile variable (EBTREE_DIR) is also available to use files from another directory. The ability to build the libebtree library temporarily remains disabled because it can have an impact on some existing toolchains and does not appear worth it in the medium term if we add support for multi-criteria stickiness for instance.	2009-10-26 21:10:04 +01:00
Willy Tarreau	6b2e11be1e	[MEDIUM] backend: implement consistent hashing variation Consistent hashing provides some interesting advantages over common hashing. It avoids full redistribution in case of a server failure, or when expanding the farm. This has a cost however, the hashing is far from being perfect, as we associate a server to a request by searching the server with the closest key in a tree. Since servers appear multiple times based on their weights, it is recommended to use weights larger than approximately 10-20 in order to smoothen the distribution a bit. In some cases, playing with weights will be the only solution to make a server appear more often and increase chances of being picked, so stats are very important with consistent hashing. In order to indicate the type of hashing, use : hash-type map-based (default, old one) hash-type consistent (new one) Consistent hashing can make sense in a cache farm, in order not to redistribute everyone when a cache changes state. It could also probably be used for long sessions such as terminal sessions, though that has not be attempted yet. More details on this method of hashing here : http://www.spiteful.com/2008/03/17/programmers-toolbox-part-3-consistent-hashing/	2009-10-09 07:17:58 +02:00

22 Commits