haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-21 06:31:18 +02:00

Author	SHA1	Message	Date
Willy Tarreau	71bada5ca4	MINOR: fd: remove the unneeded running bit from fd_insert() There's no point taking the running bit in fd_insert() since by definition there will never be more than one thread inserting the FD, and that fd_insert() may only be done after the fd was allocated by the system, indicating the end of use by any other thread. This will need to be backported to 2.2 to fix an issue.	2021-03-24 17:17:21 +01:00
Willy Tarreau	6e8e10b415	MINOR: fd: make fd_clr_running() return the remaining running mask We'll need to know that a thread is the last one to use an fd, so let's make fd_clr_running() return the remaining bits after removal. Note that in practice we're only interested in knowing if it's zero but the compiler doesn't make use of the flags after the AND and emits a CMPXCHG anyway :-/ This will need to be backported to 2.2 to fix an issue.	2021-03-24 17:17:21 +01:00
Christopher Faulet	cc2c4f8f4c	BUG/MEDIUM: debug/lua: Use internal hlua function to dump the lua traceback The commit reverts following commits: * 83926a04 BUG/MEDIUM: debug/lua: Don't dump the lua stack if not dumpable * a61789a1 MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Instead of relying on a Lua function to print the lua traceback into the debugger, we are now using our own internal function (hlua_traceback()). This one does not allocate memory and use a chunk instead. This avoids any issue with a possible deadlock in the memory allocator because the thread processing was interrupted during a memory allocation. This patch relies on the commit "BUG/MEDIUM: debug/lua: Use internal hlua function to dump the lua traceback". Both must be backported wherever the patches above are backported, thus as far as 2.0	2021-03-24 16:35:23 +01:00
Christopher Faulet	d09cc519bd	MINOR: lua: Slightly improve function dumping the lua traceback The separator string is now configurable, passing it as parameter when the function is called. In addition, the message has been slightly changed to be a bit more readable.	2021-03-24 16:33:26 +01:00
Ilya Shipitsin	8cd1627599	CLEANUP: ssl: remove unused definitions not needed since e7eb1fec2f2349359c752c8fbb82357b14c7e4cf	2021-03-24 09:52:32 +01:00
Remi Tricot-Le Breton	fb00f31af4	BUG/MINOR: ssl: Prevent disk access when using "add ssl crt-list" If an unknown CA file was first mentioned in an "add ssl crt-list" CLI command, it would result in a call to X509_STORE_load_locations which performs a disk access which is forbidden during runtime. The same would happen if a "ca-verify-file" or "crl-file" was specified. This was due to the fact that the crt-list file parsing and the crt-list related CLI commands parsing use the same functions. The patch simply adds a new parameter to all the ssl_bind parsing functions so that they know if the call is made during init or by the CLI, and the ssl_store_load_locations function can then reject any new cafile_entry creation coming from a CLI call. It can be backported as far as 2.2.	2021-03-23 19:29:46 +01:00
Emeric Brun	69ba35146f	MINOR: tools: introduce new option PA_O_DEFAULT_DGRAM on str2sa_range. str2sa_range function options PA_O_DGRAM and PA_O_STREAM are used to define the supported address types but also to set the default type if it is not explicit. If the used address supports both STREAM and DGRAM, the default was always set to STREAM. This patch introduces a new option PA_O_DEFAULT_DGRAM to force the default to DGRAM type if it is not explicit in the address field and both STREAM and DGRAM are supported. If only DGRAM or only STREAM is supported, it continues to be considered as the default.	2021-03-23 15:32:22 +01:00
Willy Tarreau	8cc586c73f	BUG/MEDIUM: freq_ctr/threads: use the global_now_ms variable In commit a1ecbca0a ("BUG/MINOR: freq_ctr/threads: make use of the last updated global time"), for period-based counters, the millisecond part of the global_now variable was used as the date for the new period. But it's wrong, it only works with sub-second periods as it wraps every second, and for other periods the counters never rotate anymore. Let's make use of the newly introduced global_now_ms variable instead, which contains the global monotonic time expressed in milliseconds. This patch needs to be backported wherever the patch above is backported. It depends on previous commit "MINOR: time: also provide a global, monotonic global_now_ms timer".	2021-03-23 09:03:37 +01:00
Willy Tarreau	6064b34be0	MINOR: time: also provide a global, monotonic global_now_ms timer The period-based freq counters need the global date in milliseconds, so better calculate it and expose it rather than letting all call places incorrectly retrieve it. Here what we do is that we maintain a new globally monotonic timer, global_now_ms, which ought to be very close to the global_now one, but maintains the monotonic approach of now_ms between all threads in that global_now_ms is always ahead of any now_ms. This patch is made simple to ease backporting (it will be needed for a subsequent fix), but it also opens the way to some simplifications on the time handling: instead of computing the local time and trying to force it to the global one, we should soon be able to proceed in the opposite way, that is computing the new global time and making the local one just the latest snapshot of it. This will bring the benefit of making sure that the global time is always ahead of the local one.	2021-03-23 09:01:37 +01:00
Willy Tarreau	5d110b25dd	CLEANUP: connection: use pool_zalloc() in conn_alloc_hash_node() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:17:24 +01:00
Willy Tarreau	18759079b6	MINOR: pools: add pool_zalloc() to return a zeroed area It's like pool_alloc() but the output is zeroed before being returned and is never poisoned.	2021-03-22 22:05:05 +01:00
Willy Tarreau	de749a9333	MINOR: pools: make the pool allocator support a few flags The pool_alloc_dirty() function was renamed to __pool_alloc() and now takes a set of flags indicating whether poisoning is permitted or not and whether zeroing the area is needed or not. The pool_alloc() function is now just a wrapper calling __pool_alloc(pool, 0).	2021-03-22 20:54:15 +01:00
Willy Tarreau	a213b683f7	CLEANUP: pools: remove the unused pool_get_first() function This one used to maintain a shortcut in the pools allocation path that was only justified by b_alloc_fast() which was not used! Let's get rid of it as well so that the allocator becomes a bit more straightforward.	2021-03-22 16:28:08 +01:00
Willy Tarreau	7be7ffac15	CLEANUP: dynbuf: remove the unused b_alloc_fast() function It is never used anymore since 1.7 where it was used by b_alloc_margin() then replaced by direct calls to the pools function, and it maintains a dependency on the exposed pools functions. It's time to get rid of it, as it's not even certain it still works.	2021-03-22 16:28:05 +01:00
Willy Tarreau	f44ca97fcb	CLEANUP: dynbuf: remove b_alloc_margin() It's not used anymore, let's completely remove it before anyone uses it again by accident.	2021-03-22 16:28:02 +01:00
Willy Tarreau	0f495b3d87	MINOR: channel: simplify the channel's buffer allocation The channel's buffer allocator, channel_alloc_buffer(), was still relying on the principle of a margin for the request and not for the response. But this margin stopped working around 1.7 with the introduction of the content filters such as SPOE, and was completely annihilated with the local pools that came with threads. Let's simplify this and just use b_alloc().	2021-03-22 16:19:45 +01:00
Willy Tarreau	766b6cf206	MINOR: dynbuf: make b_alloc() always check if the buffer is allocated Right now there is a discrepancy between b_alloc() and b_alloc_margin(): the former forcefully overwrites the target pointer while the latter tests it and returns it as-is if already allocated. As a matter of fact, all callers of b_alloc() either test the buffer beforehand, or assume it's already null. Let's remove this pain and make the function test the buffer's allocation before doing it again, and match call places' expectations.	2021-03-22 16:14:45 +01:00
Christopher Faulet	a61789a1d6	MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Some parts of Lua are non-reentrant. We must be sure to carefully track these parts to not dump the lua stack when it is interrupted inside such parts. For now, we only identified the custom lua allocator. If the thread is interrupted during the memory allocation, we must not try to print the lua stack, which also allocates memory. Indeed, realloc() is not async-signal-safe. In this patch we introduce a thread-local counter. It is incremented before entering a non-reentrant part and decremented when exiting. It is only performed in hlua_alloc() for now.	2021-03-19 16:16:23 +01:00
Olivier Houchard	dae6975498	MINOR: muxes: garbage collect the reset() method. Now that connections aren't being reused when they failed, remove the reset() method. It was unimplemented anywhere, except for H1 where it did nothing, anyway.	2021-03-19 15:33:04 +01:00
Olivier Houchard	1b3c931bff	MEDIUM: connections: Introduce a new XPRT method, start(). Introduce a new XPRT method, start(). The init() method will now only initialize whatever is needed for the XPRT to run, but any action the XPRT has to do before being ready, such as handshakes, will be done in the new start() method. That way, we will be sure the full stack of xprt will be initialized before attempting to do anything. The init() call is also moved to conn_prepare(). There's no longer any reason to wait for the ctrl to be ready, any action will be deferred until start(), anyway. This means conn_xprt_init() is no longer needed.	2021-03-19 15:33:04 +01:00
Amaury Denoyelle	216a1ce3b9	MINOR: stats: export function to allocate extra proxy counters Remove static qualifier on stats_allocate_proxy_counters_internal. This function will be used to allocate extra counters at runtime for dynamic servers.	2021-03-18 15:52:07 +01:00
Amaury Denoyelle	76e10e78bb	MINOR: server: prepare parsing for dynamic servers Prepare the server parsing API to support dynamic servers. - define a new parsing flag to be used for dynamic servers - each keyword contains a new field dynamic_ok to indicate if it can be used for a dynamic server. For now, no keyword are supported. - do not copy settings from the default server for a new dynamic server. - a dynamic server is created in a maintenance mode and requires an explicit 'enable server' command. - a new server flag named SRV_F_DYNAMIC is created. This flag is set for all servers created at runtime. It might be useful later, for example to know if a server can be purged.	2021-03-18 15:51:12 +01:00
Amaury Denoyelle	30c0537f5a	REORG: server: use flags for parse_server Modify the API of parse_server function. Use flags to describe the type of the parsed server instead of discrete arguments. These flags can be used to specify if a server/default-server/server-template is parsed. Additional parameters are also specified (parsing of the address required, resolve of a name must be done immediately). It is now unneeded to use strcmp on args[0] in parse_server. Also, the calls to parse_server are more explicit thanks to the flags.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	828adf0121	REORG: server: add a free server function Create a new server function named free_server. It can be used to deallocate a server and its member.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	18487fb532	MINOR: cli: implement experimental-mode Experimental mode is similar to expert-mode. It can be used to access to features still in development.	2021-03-18 15:37:05 +01:00
Willy Tarreau	6f9f2c0857	MINOR: freq_ctr/threads: relax when failing to update a sliding window value The swrate_add* functions would spin fast on a failed CAS, better place a cpu_relax() call there to reduce contention if any.	2021-03-17 19:36:15 +01:00
Willy Tarreau	a1ecbca0a5	BUG/MINOR: freq_ctr/threads: make use of the last updated global time The freq counters were using the thread's own time as the start of the current period. The problem is that in case of contention, it was occasionally possible to perform non-monotonic updates on the edge of the next second, because if the upfront thread updates a counter first, it causes a rotation, then the second thread loses the race from its older time, and tries again, and detects a different time again, but in the past so it only updates the counter, then a third thread on the new date would detect a change again, thus provoking a rotation again. The effect was triple: - rare loss of stored values during certain transitions from one period to the next one, causing counters to report 0 - half of the threads forced to go through the slow path every second - difficult convergence when using many threads where the CAS can fail a lot and we can observe N(N-1) attempts for N threads to complete This patch fixes this issue in two ways: - first, it now makes use of the monotonic global_now value which also happens to be volatile and to carry the latest known time; this way time will never jump backwards anymore and only the first thread updates it on transition, the other ones do not need to. - second, re-read the time in the loop after each failure, because if the date changed in the counter, it means that one thread knows a more recent one and we need to update. In this case if it matches the new current second, the fast path is usable. This patch relies on previous patch "MINOR: time: export the global_now variable" and must be backported as far as 1.8.	2021-03-17 19:36:15 +01:00
Willy Tarreau	650f374f24	MINOR: time: export the global_now variable This is the process-wide monotonic time that is used to update each thread's own time. It may be required at a few places where a strictly monotonic clock is required such as freq_ctr. It will have to be backported as a dependency of a forthcoming fix.	2021-03-17 19:25:47 +01:00
Willy Tarreau	31a3cea84f	MINOR: cfgparse/proxy: also support spelling fixes on options Some are not always easy to spot with "chk" vs "check" or hyphens at some places and not at others. Now entering "option http-close" properly suggests "httpclose" and "option tcp-chk" suggests "tcp-check". There's no need to consider the proxy's capabilities, what matters is to figure what related word the user tried to spell, and there are not that many options anyway.	2021-03-15 11:14:57 +01:00
Willy Tarreau	b12bc646d5	MINOR: cli: limit spelling suggestions to 5 There's no need to suggest up to 10 entries for matching keywords, most of the times 5 are plenty, and will be more readable.	2021-03-15 10:40:13 +01:00
Willy Tarreau	9294e8822f	MINOR: tools: improve word fingerprinting by counting presence The distance between two words can be high due to a sub-word being missing and in this case it happens that other totally unrelated words are proposed because their average score looks lower thanks to being shorter. Here we're introducing the notion of presence of each character so that word sequences that contain existing sub-words are favored against the shorter ones having nothing in common. In addition we do not distinguish begin/end from a regular delimiter anymore. That made it harder to spot inverted words.	2021-03-15 09:38:42 +01:00
Ilya Shipitsin	f3ede874a5	CLEANUP: assorted typo fixes in the code and comments This is 20th iteration of typo fixes	2021-03-13 11:45:17 +01:00
Willy Tarreau	7416314145	CLEANUP: task: make sure tasklet handlers always indicate their statuses When tasklets were derived from tasks, there was no immediate need for the scheduler to know their status after execution, and in a spirit of simplicity they just started to always return NULL. The problem is that it simply prevents the scheduler from 1) accounting their execution time, and 2) keeping track of their current execution status. Indeed, a remote wake-up could very well end up manipulating a tasklet that's currently being executed. And this is the reason why those handlers have to take the idle lock before checking their context. In 2.5 we'll take care of making tasklets and tasks work more similarly, but trouble is to be expected if we continue to propagate the trend of returning NULL everywhere, especially if some fixes relying on a stricter model later need to be backported. For this reason this patch updates all known tasklet handlers to make them return NULL only when the tasklet was freed. It has no effect for now and isn't even guaranteed to always be 100% safe but it puts the code into the right direction for this.	2021-03-13 11:30:19 +01:00
Willy Tarreau	4975d1482f	CLEANUP: cli: rename the last few "stats_" to "cli_" There were still a very small list of functions, variables and fields called "stats_" while they were really purely CLI-centric. There's the frontend called "stats_fe" in the global section, which instantiates a "cli_applet" called "<CLI>" so it was renamed "cli_fe". The "alloc_stats_fe" function was renamed to "cli_alloc_fe" which also better matches the naming convention of all cli-specific functions. Finally the "stats_permission_denied_msg" used to return an error on the CLI was renamed "cli_permission_denied_msg". Now there's no more "stats_something" that designates the CLI.	2021-03-13 11:04:35 +01:00
Willy Tarreau	f14c7570d6	CLEANUP: cli: rename MAX_STATS_ARGS to MAX_CLI_ARGS This is the number of args accepted on a command received on the CLI, it has long been totally independent of stats and should not carry this misleading "stats" name anymore.	2021-03-13 10:59:23 +01:00
Willy Tarreau	e33c4b3c11	MINOR: tools: add the ability to update a word fingerprint Instead of making a new one from scratch, let's support not wiping the existing fingerprint and updating it, and to do the same char by char. The word-by-word one will still result in multiple beginnings and ends, but that will accurately translate word boundaries. The char-based one has more flexibility and requires that the caller maintains the previous char to indicate the transition, which also allows to insert delimiters for example.	2021-03-12 19:09:19 +01:00
Willy Tarreau	b736458bfa	MEDIUM: cli: apply spelling fixes for known commands before listing them Entering "show tls" would still emit 35 entries. By measuring the distance between all unknown words and the candidates, we can sort them and pick the 10 most likely candidates. This works reasonably well, as now "show tls" only proposes "show tls-keys", "show threads", "show pools" and "show tasks". If the distance is still too high or if a word is missing, the whole prefix list continues to be dumped, thus "show" alone will still report the entire list of commands beginning with "show". It's still impossible to skip a word, for example "show conn" will not propose "show servers conn" because the distance is calculated for each word individually. Some changes to the distance calculation to support updating an existing map could easily address this. But this is already a great improvement.	2021-03-12 19:09:19 +01:00
Willy Tarreau	4451150251	CLEANUP: cli: fix misleading comment and better indent the access level flags It was mentioned that ACCESS_MASTER_ONLY was for workers only instead of master-only. And it wasn't clear that all ACCESS_* would belong to the same thing.	2021-03-12 19:09:19 +01:00
Christopher Faulet	55c1c4053f	MINOR: resolvers: Use milliseconds for cached items in resolver responses The last time when an item was seen in a resolver response is now stored in milliseconds instead of seconds. This avoids some corner-cases at the edges. This also simplifies time comparisons.	2021-03-12 17:41:28 +01:00
Christopher Faulet	0efc0993ec	BUG/MEDIUM: resolvers: Don't release resolution from a requester callbacks Another way to say it: "Safely unlink requester from a requester callbacks". Requester callbacks must never try to unlink a requester from a resolution, for the current requester or another one. First, these callback functions are called in a loop on a request list, not necessarily safe. Thus unlinking a resolution at this place may be unsafe. And it is useless to try to make these loops safe because all this stuff is placed in a loop on a resolution list. Unlinking a requester may lead to releasing a resolution if it is the last requester. However, the unlink is necessary because we cannot reset the server state (hostname and IP) with some pending DNS resolution on it. So, to work around this issue, we introduce the "safe" unlink. It is only performed from a requester callback. In this case, the unlink function never releases the resolution, it only resets it if necessary. And when a resolution is found with an empty requester list, it is released. This patch depends on the following commits : * MINOR: resolvers: Purge answer items when a SRV resolution triggers an error * MINOR: resolvers: Use a function to remove answers attached to a resolution * MINOR: resolvers: Directly call srvrq_update_srv_state() when possible * MINOR: resolvers: Add function to change the srv status based on SRV resolution All the series must be backported as far as 2.2. It fixes a regression introduced by the commit b4badf720 ("BUG/MINOR: resolvers: new callback to properly handle SRV record errors").	2021-03-12 17:41:28 +01:00
Christopher Faulet	5efdef24c1	MINOR: resolvers: Add function to change the srv status based on SRV resolution srvrq_update_srv_status() updates the server status based on the result of SRV resolution. For now, it is only used from snr_update_srv_status() when appropriate.	2021-03-12 17:41:28 +01:00
Christopher Faulet	1dec5c7934	MINOR: resolvers: Use a function to remove answers attached to a resolution resolv_purge_resolution_answer_records() must be used to remove all answers attached to a resolution. For now, it is only used when a resolution is released.	2021-03-12 17:41:28 +01:00
Baptiste Assmann	6a8d11dc80	MINOR: resolvers: new function find_srvrq_answer_record() This function searches for a SRV answer item associated to a requester whose type is server. This is mainly useful to "link" a server to its SRV record when no additional records were found to configure the IP address. This patch is required by a bug fix.	2021-03-12 17:41:28 +01:00
Willy Tarreau	99eb2cc1cc	MINOR: actions: add a function to suggest an action resembling a given word action_suggest() will return a pointer to an action whose keyword more or less resembles the passed argument. It also accepts to be more tolerant against prefixes (since actions taking arguments are handled as prefixes). This will be used to suggest approaching words.	2021-03-12 14:13:21 +01:00
Willy Tarreau	433b05fa64	MINOR: cfgparse/bind: suggest correct spelling for unknown bind keywords Just like with the server keywords, now's the turn of "bind" keywords. The difference is that 100% of the bind keywords are registered, thus we do not need the list of extra keywords. There are multiple bind line parsers today, all were updated: - peers - log - dgram-bind - cli $ printf "listen f\nbind :8000 tcut\n" \| ./haproxy -c -f /dev/stdin [NOTICE] 070/101358 (25146) : haproxy version is 2.4-dev11-7b8787-26 [NOTICE] 070/101358 (25146) : path to executable is ./haproxy [ALERT] 070/101358 (25146) : parsing [/dev/stdin:2] : 'bind :8000' unknown keyword 'tcut'; did you mean 'tcp-ut' maybe ? [ALERT] 070/101358 (25146) : Error(s) found in configuration file : /dev/stdin [ALERT] 070/101358 (25146) : Fatal errors found in configuration.	2021-03-12 14:13:21 +01:00
Willy Tarreau	e2afcc4509	MINOR: cfgparse: add cfg_find_best_match() to suggest an existing word Instead of just reporting "unknown keyword", let's provide a function which will look through a list of registered keywords for a similar-looking word to the one that wasn't matched. This will help callers suggest correct spelling. Also, given that a large part of the config parser still relies on a long chain of strcmp(), we'll need to be able to pass extra candidates. Thus the function supports an optional extra list for this purpose.	2021-03-12 14:13:21 +01:00
Willy Tarreau	ba2c4459a5	MINOR: tools: add simple word fingerprinting to find similar-looking words This introduces two functions, one which creates a fingerprint of a word, and one which computes a distance between two words fingerprints. The fingerprint is made by counting the transitions between one character and another one. Here we consider the 26 alphabetic letters regardless of their case, then any digit as a digit, and anything else as "other". We also consider the first and last locations as transitions from begin to first char, and last char to end. The distance is simply the sum of the squares of the differences between two fingerprints. This way, doubling/ missing a letter has the same cost, however some repeated transitions such as "e"->"r" like in "server" are very unlikely to match against situations where they do not exist. This is a naive approach but it seems to work sufficiently well for now. It may be refined in the future if needed.	2021-03-12 14:13:21 +01:00
Willy Tarreau	133c8c412e	CLEANUP: actions: the keyword must always be const from the rule There's no reason for a rule to want to modify an action keyword, let's make sure it is always const.	2021-03-12 14:13:21 +01:00
Christopher Faulet	77e376783e	BUG/MINOR: proxy/session: Be sure to have a listener to increment its counters It is possible to have a session without a listener. It happens for applets on the client side. Thus all accesses to the listener info from the session must be guarded. It was the purpose of the commit 36119de18 ("BUG/MEDIUM: session: NULL dereference possible when accessing the listener"). However, some tests on the session's listener existence are missing in proxy_inc_* functions. This patch should fix the issues #1171, #1172, #1173, #1174 and #1175. It must be backported with the above commit as far as 1.8.	2021-03-12 09:25:45 +01:00
Willy Tarreau	3b728a92bb	BUILD: atomic/arm64: force the register pairs to use in __ha_cas_dw() Since commit f8fb4f75f ("MINOR: atomic: implement a more efficient arm64 __ha_cas_dw() using pairs"), on some modern arm64 (armv8.1+) compiled with -march=armv8.1-a under gcc-7.5.0, a build error may appear on ev_poll.o : /tmp/ccHD2lN8.s:1771: Error: reg pair must start from even reg at operand 1 -- `casp x27,x28,x22,x23,[x12]' Makefile:927: recipe for target 'src/ev_poll.o' failed It appears that the compiler cannot always assign register pairs there for a structure made of two u64. It was possibly later addressed since gcc-9.3 never caused this, but there's no trivially available info on the subject in the changelogs. Unsuprizingly, using a u128 instead does fix this, but it significantly inflates the code (+4kB for just 6 places, very likely that it loaded some extra stubs) and the comparison is ugly, involving two slower conditional jumps instead of a single one and a conditional comparison. For example, ha_random64() grew from 144 bytes to 232. However, simply forcing the base register does work pretty well, and makes the code even cleaner and more efficient by further reducing it by about 4.5kB, possibly because it helps the compiler to pick suitable registers for the pair there. And the perf on 64-cores looks steadily 0.5% above the previous one, so let's do this. Note that the commit above was backported to 2.3 to fix scalability issues on AWS Graviton2 platform, so this one will need to be as well.	2021-03-12 06:26:22 +01:00
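
The word fingerprinting described in commit ba2c4459a5 (count character-to-character transitions, then sum the squared differences) can be illustrated with a small sketch. This is not HAProxy's code: the names `struct fingerprint`, `fp_class()`, `fp_make()` and `fp_distance()` and the 29-class layout are assumptions made for illustration, following only what the commit message states.

```c
#include <string.h>

/* Hypothetical layout: 26 letters + 1 digit class + 1 "other" class
 * + 1 boundary class standing for both "begin" and "end" of word.
 */
#define FP_CLASSES  29
#define FP_BOUNDARY 28

struct fingerprint {
    /* count[from][to] = number of transitions from class <from> to <to>.
     * An unsigned char is enough for short keywords (assumption).
     */
    unsigned char count[FP_CLASSES][FP_CLASSES];
};

/* Map a character to its class: letters regardless of case,
 * any digit as "digit", anything else as "other".
 */
static int fp_class(unsigned char c)
{
    if (c >= 'a' && c <= 'z')
        return c - 'a';
    if (c >= 'A' && c <= 'Z')
        return c - 'A';
    if (c >= '0' && c <= '9')
        return 26;
    return 27;
}

/* Build the fingerprint of <word>, including the begin->first and
 * last->end transitions.
 */
static void fp_make(struct fingerprint *fp, const char *word)
{
    int prev = FP_BOUNDARY;

    memset(fp, 0, sizeof(*fp));
    for (; *word; word++) {
        int cur = fp_class((unsigned char)*word);

        fp->count[prev][cur]++;
        prev = cur;
    }
    fp->count[prev][FP_BOUNDARY]++;
}

/* Distance = sum of the squares of the per-transition differences. */
static int fp_distance(const struct fingerprint *a, const struct fingerprint *b)
{
    int dist = 0;

    for (int i = 0; i < FP_CLASSES; i++) {
        for (int j = 0; j < FP_CLASSES; j++) {
            int d = (int)a->count[i][j] - (int)b->count[i][j];

            dist += d * d;
        }
    }
    return dist;
}
```

With this sketch, "tcut" and "tcp-ut" (the pair from the bind keyword example in commit 433b05fa64) differ by four transitions, so their distance is 4, while two spellings that differ only by case have a distance of 0. A caller looking for a suggestion would compute the distance to every registered keyword and keep the smallest one, presumably under some threshold.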


7236 Commits
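
The flag-based pool allocation introduced by commits de749a9333 and 18759079b6 can be sketched as follows. This is only an illustration on top of malloc(): the `sk_` names, the flag values, the poison byte and the `struct sk_pool` layout are all hypothetical and do not reflect HAProxy's actual definitions. Only the overall shape comes from the commit messages: one flag-taking allocator, a plain wrapper passing 0, and a zeroing wrapper that is never poisoned.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical flags, named for this sketch only */
#define SK_F_NO_POISON 0x1   /* never fill the area with the poison byte */
#define SK_F_MUST_ZERO 0x2   /* zero the area before returning it */

#define SK_POISON_BYTE 0xA5  /* arbitrary value chosen for illustration */

/* Minimal stand-in for a pool: object size and a "poisoning enabled" switch */
struct sk_pool {
    size_t size;
    int poison;
};

/* Flag-taking allocator, playing the role the commit gives to __pool_alloc() */
static void *sk_pool_alloc_flags(struct sk_pool *pool, unsigned int flags)
{
    void *p = malloc(pool->size);

    if (!p)
        return NULL;
    if (flags & SK_F_MUST_ZERO)
        memset(p, 0, pool->size);
    else if (pool->poison && !(flags & SK_F_NO_POISON))
        memset(p, SK_POISON_BYTE, pool->size);
    return p;
}

/* Plain allocation: just a wrapper passing no flags */
static inline void *sk_pool_alloc(struct sk_pool *pool)
{
    return sk_pool_alloc_flags(pool, 0);
}

/* Zeroed allocation: the area is zeroed and never poisoned */
static inline void *sk_pool_zalloc(struct sk_pool *pool)
{
    return sk_pool_alloc_flags(pool, SK_F_NO_POISON | SK_F_MUST_ZERO);
}
```

A caller that used to allocate and then clear the area itself, as conn_alloc_hash_node() did before commit 5d110b25dd, would simply call the zeroing wrapper and drop its own memset().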