haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 17:17:06 +02:00

Author	SHA1	Message	Date
Aurelien DARRAGON	d0e0105181	BUG/MEDIUM: hlua: make hlua_ctx_renew() safe hlua_ctx_renew() is called from unsafe places where the caller doesn't expect it to LJMP.. however hlua_ctx_renew() makes use of Lua library function that could potentially raise errors, such as lua_newthread(), and it does nothing to catch errors. Because of this, haproxy could unexpectedly crash. This was discovered and reported by GH user @JB0925 on #2745. To fix the issue, let's simply make hlua_ctx_renew() safe by applying the same logic implemented for hlua_ctx_init() or hlua_ctx_destroy(), which is catching Lua errors by leveraging SET_SAFE_LJMP_PARENT() helper. It should be backported to all stable versions.	2024-10-08 12:00:36 +02:00
Aperence	a7b04e383a	MINOR: tools: extend str2sa_range to add an alt parameter Add a new parameter "alt" that will store wether this configuration use an alternate protocol. This alt pointer will contain a value that can be transparently passed to protocol_lookup to obtain an appropriate protocol structure. This change is needed to allow for example the servers to know if it need to use an alternate protocol or not.	2024-08-30 18:53:49 +02:00
Christopher Faulet	e5e36ce097	BUG/MEDIUM: hlua/cli: Fix lua CLI commands to work with applet's buffers In 3.0, the CLI applet was rewritten to use its own buffers. However, the lua part, used to register CLI commands at runtime, was not updated accordingly. It means the lua CLI commands still try to write in the channel buffers. This is of course totally unexepected and not supported. Because of this bug, the applet hangs intead of returning the command result. The registration of lua CLI commands relies on the lua TCP applets. So the send and receive functions were fixed to use the applet's buffer when it is required and still use the channel buffers otherwies. This way, other lua TCP applets can still run on the legacy mode, without the applet's buffers. This patch must be backported to 3.0.	2024-07-02 10:05:40 +02:00
Aurelien DARRAGON	185d230e2c	BUG/MINOR: hlua: report proper context upon error in hlua_cli_io_handler_fct() As a result of copy pasting, hlua_cli_io_handler_fct() used to report lua exceptions like E_ETMOUT as "Lua converter" instead of "Lua cli". Let's fix that. It could be backported to all stable versions. [ada: for older versions, HLUA_E_BTMOUT case didn't exist so it has to be skipped]	2024-06-26 11:06:24 +02:00
Aurelien DARRAGON	983513d901	DEBUG: hlua: distinguish burst timeout errors from exec timeout errors hlua burst timeout was introduced in `58e36e5b1` ("MEDIUM: hlua: introduce tune.lua.burst-timeout"). It is a safety measure that allows to detect when too much time is spent on a single lua execution (between 2 interruptions/yields), meaning that the current thread is not able to perform other tasks. Such scenario should be avoided because it will cause thread contention which may have negative performance impact and could cause the watchdog to trigger. When the burst timeout is exceeded, the current Lua execution is aborted and a timeout error is reported to the user. Unfortunately, the same error is currently being reported for cumulative (AKA execution) timeout and for burst timeout, which may be confusing to the user. Indeed, "execution timeout" error historically results from the current hlua context exceeding the total (cumulative) time it's allowed to run. It is set per lua context using the dedicated tunables: - tune.lua.session-timeout - tune.lua.task-timeout - tune.lua.service-timeout We've already faced an user report where the user was able to trigger the burst timeout and got "Lua task: execution timeout." error while the user didn't set cumulative timeout. Thus the error was actually confusing because it was indeed the burst timeout which was causing it due to the use of cpu-intensive call from within the task without sufficient manual "yield" keypoints around the cpu-intensive call to ensure it runs on a dedicated scheduler cycle. In this patch we make it so burst timeout related errors are reported as "burst timeout" errors instead of "execution timeout" errors (which in fact became the generic timeout errors catchall with `58e36e5b1`). To do this, hlua_timer_check() now returns a different value depending if the exeeded timeout is the burst one or the cumulative one, which allows us to return either HLUA_E_ETMOUT or HLUA_E_BTMOUT in hlua_ctx_resume(). It should improve the situation described in GH #2356 and may possibly be backported with `58e36e5b1` to improve error reporting if it applies without resistance.	2024-06-14 18:25:58 +02:00
Aurelien DARRAGON	2bde0d64dd	CLEANUP: hlua: simplify ambiguous lua_insert() usage in hlua_ctx_resume() 'lua_insert(lua->T, -lua_gettop(lua->T))' is actually used to rotate the top value with the bottom one, thus the code was overkill and the comment was actually misleading, let's fix that by using explicit equivalent form (absolute index). It may be backported with `5508db9a2` ("BUG/MINOR: hlua: fix unsafe lua_tostring() usage with empty stack") to all stable versions to ease code maintenance.	2024-06-04 16:31:38 +02:00
Aurelien DARRAGON	755c2daf0f	BUG/MINOR: hlua: fix leak in hlua_ckch_set() error path in hlua_ckch_commit_yield() and hlua_ckch_set(), when an error occurs, we enter the error path and try to raise an error from the <err> msg pointer which must be freed afterwards. However, the fact that luaL_error() never returns was overlooked, because of that <err> msg is never freed in such case. To fix the issue, let's use hlua_pushfstring_safe() helper to push the err on the lua stack and then free it before throwing the error using lua_error(). It should be backported up to 2.6 with `30fcca18` ("MINOR: ssl/lua: CertCache.set() allows to update an SSL certificate file")	2024-06-04 16:31:30 +02:00
Aurelien DARRAGON	2be94c008e	CLEANUP: hlua: get rid of hlua_traceback() security checks Thanks to the previous commit, we may now assume that hlua_traceback() won't LJMP, so it's safe to use it from unprotected environment without any precautions.	2024-06-04 16:31:22 +02:00
Aurelien DARRAGON	365ee28510	BUG/MINOR: hlua: prevent LJMP in hlua_traceback() Function is often used on error paths where no precaution is taken against LJMP. Since the function is used on error paths (which include out-of-memory error paths) the function lua_getinfo() could also raise a memory exception, causing the process to crash or improper error handling if the caller isn't prepared against that eventually. Since the function is only used on rare events (error handling) and is lacking the __LJMP prototype pefix, let's make it safe by protecting the lua_getinfo() call so that hlua_traceback() callers may use it safely now (the function will always succeed, output will be truncated in case of error). This could be backported to all stable versions.	2024-06-04 16:31:15 +02:00
Aurelien DARRAGON	f0e5b825cf	BUG/MINOR: hlua: fix unsafe hlua_pusherror() usage Following previous commit's logic: hlua_pusherror() is mainly used from cleanup paths where the caller isn't protected against LJMPs. Caller was tempted to think that the function was safe because func prototype was lacking the __LJMP prefix. Let's make the function really LJMP-safe by wrapping the sensitive calls under lua_pcall(). This may be backported to all stable versions.	2024-06-04 16:31:09 +02:00
Aurelien DARRAGON	c0a3c1281f	BUG/MINOR: hlua: don't use lua_pushfstring() when we don't expect LJMP lua_pushfstring() is used in multiple cleanup paths (upon error) to push the error message that will be raised by lua_error(). However this is often done from an unprotected environment, or in the middle of a cleanup sequence, thus we don't want the function to LJMP! (it may cause various issues ranging from memory leaks to crashing the process..) Hopefully this has very few chances of happening but since the use of lua_pushfstring() is limited to error reporting here, it's ok to use our own hlua_pushfstring_safe() implementation with a little overhead to ensure that the function will never LJMP. This could be backported to all stable versions.	2024-06-04 16:31:01 +02:00
Aurelien DARRAGON	6e484996c6	CLEANUP: hlua: use hlua_pusherror() where relevant In hlua_map_new(), when error occurs we use a combination of luaL_where, lua_pushfstring and lua_concat to build the error string before calling lua_error(). It turns out that we already have the hlua_pusherror() macro which is exactly made for that purpose so let's use it. It could be backported to all stable versions to ease code maintenance.	2024-06-04 16:30:55 +02:00
Aurelien DARRAGON	a63f2cde94	CLEANUP: hlua: fix CertCache class comment CLASS_CERTCACHE is used to declare CertCache global object, not Regex one This copy-paste typo introduced was in `30fcca18` ("MINOR: ssl/lua: CertCache.set() allows to update an SSL certificate file")	2024-06-03 17:00:06 +02:00
Aurelien DARRAGON	4f906a9c38	BUG/MINOR: hlua: use CertCache.set() from various hlua contexts Using CertCache.set() from init context wasn't explicitly supported and caused the process to crash: crash.lua: core.register_init(function() CertCache.set{filename="reg-tests/ssl/set_cafile_client.pem", ocsp=""} end) crash.conf: global lua-load crash.lua listen front bind localhost:9090 ssl crt reg-tests/ssl/set_cafile_client.pem ca-file reg-tests/ssl/set_cafile_interCA1.crt verify none ./haproxy -f crash.conf [NOTICE] (267993) : haproxy version is 3.0-dev2-640ff6-910 [NOTICE] (267993) : path to executable is ./haproxy [WARNING] (267993) : config : missing timeouts for proxy 'front'. \| While not properly invalid, you will certainly encounter various problems \| with such a configuration. To fix this, please ensure that all following \| timeouts are set to a non-zero value: 'client', 'connect', 'server'. [1] 267993 segmentation fault (core dumped) ./haproxy -f crash.conf This is because in hlua_ckch_set/hlua_ckch_commit_yield, we always consider that we're being called from a yield-capable runtime context. As such, hlua_gethlua() is never checked for NULL and we systematically try to wake hlua->task and yield every 10 instances. In fact, if we're called from the body or init context (that is, during haproxy startup), hlua_gethlua() will return NULL, and in this case we shouldn't care about yielding because it is ok to commit all instances at once since haproxy is still starting up. Also, when calling CertCache.set() from a non-yield capable runtime context (such as hlua fetch context), we kept doing as if the yield succeeded, resulting in unexpected function termination (operation would be aborted and the CertCache lock wouldn't be released). Instead, now we explicitly state in the doc that CertCache.set() cannot be used from a non-yield capable runtime context, and we raise a runtime error if it is used that way. These bugs were discovered by reading the code when trying to address Svace report documented by @Bbulatov GH #2586. It should be backported up to 2.6 with `30fcca18` ("MINOR: ssl/lua: CertCache.set() allows to update an SSL certificate file")	2024-06-03 17:00:00 +02:00
Aurelien DARRAGON	231d3d32be	MEDIUM: hlua: take nbthread into account in hlua_get_nb_instruction() Based on Willy's idea (from 3.0-dev6 announcement message): in this patch we try to reduce the max latency that can be caused by running lua scripts with default settings. Indeed, by default, hlua engine is allowed to process up to 10k instructions per batch. While this value was found to be the optimal one for a single thread, it turns out that keeping a thread busy for 10k lua instructions could increase thread contention. This is especially true when the script is loaded with 'lua-load', because in that case the current thread owns the main lua lock and prevent other threads from making any progress if they're also waiting on the main lock. Thanks to Thierry Fournier's work, we know that performance-wise we can reach optimal performance by sticking between 500 and 10k instructions per batch. Given that, when the script is loaded using 'lua-load', if no "tune.lua.forced-yield" was set by the user, we automatically divide the default value (10K) by the number of threads haproxy can use to reduce thread contention (given that all threads could compete for the main lua lock), however we make sure not to return a value below 500, because Thierry's work showed that this would come with a significant performance loss. The historical behavior may still be enforced by setting "tune.lua.forced-yield" to 10000 in the global config section.	2024-05-15 11:59:44 +02:00
Aurelien DARRAGON	e60d9dddf8	MINOR: hlua: add hlua_nb_instruction getter No functional behavior change, but this will ease the work of dynamically computing hlua_nb_instruction value depending on various inputs.	2024-05-15 11:59:37 +02:00
Aurelien DARRAGON	07b2e84bce	BUG/MEDIUM: hlua: streams don't support mixing lua-load with lua-load-per-thread (2nd try) While trying to reproduce another crash case involving lua filters reported by @bgrooot on GH #2467, we found out that mixing filters loaded from different contexts ('lua-load' vs 'lua-load-per-thread') for the same stream isn't supported and may even cause the process to crash. Historically, mixing lua-load and lua-load-per-threads for a stream wasn't supported, but this changed thanks to `0913386` ("BUG/MEDIUM: hlua: streams don't support mixing lua-load with lua-load-per-thread"). However, the above fix didn't consider lua filters's use-case properly: unlike lua fetches, actions or even services, lua filters don't simply use the stream hlua context as a "temporary" hlua running context to process some hlua code. For fetches, actions.. hlua executions are processed sequentially, so we simply reuse the hlua context from the previous action/fetch to run the next one (this allows to bypass memory allocations and initialization, thus it increases performance), unless we need to run on a different hlua state-id, in which case we perform a reset of the hlua context. But this cannot work with filters: indeed, once registered, a filter will last for the whole stream duration. It means that the filter will rely on the stream hlua context from ->attach() to ->detach(). And here is the catch, if for the same stream we register 2 lua filters from different contexts ('lua-load' + 'lua-load-per-thread'), then we have an issue, because the hlua stream will be re-created each time we switch between runtime contexts, which means each time we switch between the filters (may happen for each stream processing step), and since lua filters rely on the stream hlua to carry context between filtering steps, this context will be lost upon a switch. Given that lua filters code was not designed with that in mind, it would confuse the code and cause unexpected behaviors ranging from lua errors to crashing process. So here we take another approach: instead of re-creating the stream hlua context each time we switch between "global" and "per-thread" runtime context, let's have both of them inside the stream directly as initially suggested by Christopher back then when talked about the original issue. For this we leverage hlua_stream_ctx_prepare() and hlua_stream_ctx_get() helper functions which return the proper hlua context for a given stream and state_id combination. As for debugging infos reported after ha_panic(), we check for both hlua runtime contexts to check if one of them was active when the panic occured (only 1 runtime ctx per stream may be active at a given time). This should be backported to all stable versions with `0913386` ("BUG/MEDIUM: hlua: streams don't support mixing lua-load with lua-load-per-thread") This commit depends on: - "DEBUG: lua: precisely identify if stream is stuck inside lua or not" [for versions < 2.9 the ha_thread_dump_one() part should be skipped] - "MINOR: hlua: use accessors for stream hlua ctx" For 2.4, the filters API didn't exist. However it may be a good idea to backport it anyway because ->set_priv()/->get_priv() from tcp/http lua applets may also be affected by this bug, plus it will ease code maintenance. Of course, filters-related parts should be skipped in this case.	2024-03-13 09:24:46 +01:00
Aurelien DARRAGON	aa554be69c	MINOR: hlua: use accessors for stream hlua ctx Change hlua_stream_ctx_prepare() prototype so that it now returns the proper hlua ctx on success instead of returning a boolean. Add hlua_stream_ctx_get() to retrieve hlua ctx out of a given stream. This way we may easily change the storage mechanism for hlua stream in the future without extensive code changes. No backport needed unless a commit depends on it.	2024-03-13 09:24:46 +01:00
Aurelien DARRAGON	1a2cdf64c9	DEBUG: lua: precisely identify if stream is stuck inside lua or not When ha_panic() is called by the watchdog, we try to guess from ha_task_dump() and ha_thread_dump_one() if the thread was stuck while executing lua from the stream context. However we consider this is the case by simply checking if the stream hlua context was set, but this is not very precise because if the hlua context is set, then it simply means that at least one lua instruction was executed at the stream level, not that the stuck was currently executing lua when the panic occured. This is especially true with filters, one could simply register a lua filter that does nothing but this will still end up initializing the stream hlua context for each stream. If the thread end up being stuck during the stream handling, then debug dumping functions will report that the stream was stuck while handling lua, which is not necessarilly true, and could in fact confuse us even more. So here we take another approach, we add the BUSY flag to hlua context: this flag is set by hlua_ctx_resume() around lua_resume() call, this way we can precisely tell if the thread was handling lua when it was interrupted, and we rely on this flag in debug functions to check if the thread was effectively stuck inside lua or not while processing the stream No backport needed unless a commit depends on it.	2024-03-13 09:24:46 +01:00
Aurelien DARRAGON	85d81e4d0a	BUG/MINOR: hlua: fix missing lock in hlua_filter_delete() hlua_filter_delete() calls hlua_unref() on the stream hlua stack, but we should own the lock prior to manipulating the stack. This should be backported up to 2.6.	2024-03-13 09:24:46 +01:00
Aurelien DARRAGON	ecd8f3bfd7	BUG/MINOR: hlua: missing lock in hlua_filter_new() This is a complementary patch to `8670db7` ("BUG/MAJOR: hlua: improper lock usage with hlua_ctx_resume()") for hlua_filter_new(). Indeed, the HLUA_E_ERRMSG case still relies on the lua stack but didn't take the lock to do so. This should be backported up to 2.6.	2024-03-13 09:24:46 +01:00
Aurelien DARRAGON	4aefffc38c	BUG/MINOR: hlua: segfault when loading the same filter from different contexts Trying to register the same lua filter from global and per-thread context (using 'lua-load' + 'lua-load-per-thread') causes a segmentation fault in hlua_post_init(). This is due to a simple copy paste error as we try to print the function name in the error message (like we do when loading the same lua function from different contexts) instead of the filter name. This should be backported up to 2.6.	2024-03-13 09:24:46 +01:00
Aurelien DARRAGON	75c8a1bc2d	CLEANUP: hlua: txn class functions may LJMP Clarify that some txn related class functions may LJMP by adding the __LJMP tag to their prototype.	2024-03-04 16:48:51 +01:00
Aurelien DARRAGON	f364f4670b	MINOR: hlua: use SEND_ERR to report errors in hlua_event_runner() Instead of reporting lua errors using ha_alert(), let's use SEND_ERR() helper which will also try to generate a log message according to lua log settings.	2024-03-04 16:48:48 +01:00
Aurelien DARRAGON	e1b0031650	BUG/MINOR: hlua: don't call ha_alert() in hlua_event_subscribe() hlua_event_subscribe() is meant to be called from a protected lua env during init and/or runtime. As such, only hlua_event_sub() makes uses of it: when an error happens hlua_event_sub() will already raise a Lua exception. Thus it's not relevant to use ha_alert() there as it could generate log pollution (error is relevant from Lua script point of view, not from haproxy one). This could be backported in 2.8.	2024-03-04 16:48:42 +01:00
Aurelien DARRAGON	8670db7a89	BUG/MAJOR: hlua: improper lock usage with hlua_ctx_resume() hlua_ctx_resume() itself can safely be used as-is in a multithreading context because it takes care of taking the lua lock. However, when hlua_ctx_resume() returns, the lock is released and it is thus the caller's responsibility to ensure it owns the lock prior to performing additional manipulations on the Lua stack. Unfortunately, since early haproxy lua implementation, we used to do it wrong: The most common hlua_ctx_resume() pattern we can find in the code (because it was duplicated over and over over time) is the following: \|ret = hlua_ctx_resume() \|switch (ret) { \| case HLUA_E_OK: \| break; \| case HLUA_E_ERRMSG: \| break; \| [...] \|} Problem is: for some of the switch cases, we still perform lua stack manipulations. This is the case for the HLUA_E_ERRMSG for instance where we often use lua_tostring() to retrieve last lua error message on the top of the stack, or sometimes for the HLUA_E_OK case, when we need to perform some lua cleanup logic once the resume ended. But all of this is done WITHOUT the lua lock, so this means that the main lua stack could be accessed simultaneously by concurrent threads when a script was loaded using 'lua-load'. While it is not critical for switch-cases dedicated to error handling, (those are not supposed to happen very often), it can be very problematic for stack manipulations occuring in the HLUA_E_OK case under heavy load for instance. In this case, main lua stack corruptions will eventually happen. This is especially true inside hlua_filter_new(), where this bug was known to cause lua stack corruptions under load, leading to lua errors and even crashing the process as reported by @bgrooot in GH #2467. The fix is relatively simple, once hlua_ctx_resume() returns: we should consider that ANY lua stack access should be lua-lock protected. If the related lua calls may raise lua errors, then (RE)SET_SAFE_LJMP combination should be used as usual (it allows to lock the lua stack and catch lua exceptions at the same time), else hlua_{lock,unlock} may be used if no exceptions are expected. This patch should fix GH #2467. It should be backported to all stable versions. [ada: some ctx adj will be required for older versions as event_hdl doesn't exist prior to 2.8 and filters were implemented in 2.5, thus some chunks won't apply]	2024-03-04 16:48:31 +01:00
Aurelien DARRAGON	19b016f9f8	BUG/MEDIUM: hlua: improper lock usage with SET_SAFE_LJMP() When we want to perform some unsafe lua stack manipulations from an unprotected lua environment, we use SET_SAFE_LJMP() RESET_SAFE_LJMP() combination to lock lua stack and catch potential lua exceptions that may occur between the two. Hence, we regularly find this pattern (duplicated over and over): \|if (!SET_SAFE_LJMP(hlua)) { \| const char error; \| \| if (lua_type(hlua->T, -1) == LUA_TSTRING) \| error = hlua_tostring_safe(hlua->T, -1); \| else \| error = "critical error"; \| SEND_ERR(NULL, ": %s.\n", error); \|} This is wrong because when SET_SAFE_LJMP() returns false (meaning that an exception was caught), then the lua lock was released already, thus the caller is not expected to perform lua stack manipulations (because the main lua stack may be shared between multiple threads). In the pattern above we only want to retrieve the lua exception message which may be found at the top of the stack, to do so we now explicitly take the lua lock before accessing the lua stack. Note that hlua_lock() doesn't catch lua exceptions so only safe lua functions are expected to be used there (lua functions that may NOT raise exceptions). It should be backported to every stable versions. [ada: some ctx adj will be required for older versions as event_hdl doesn't exist prior to 2.8 and filters were implemented in 2.5, thus some chunks won't apply, but other fixes should stay relevant]	2024-03-04 16:47:20 +01:00
Aurelien DARRAGON	d81c2205a3	BUG/MINOR: hlua: improper lock usage in hlua_filter_new() In hlua_filter_new(), after each hlua resume, we systematically try to empty the stack by calling lua_settop(). However we're doing this without locking the lua context, so it is unsafe in multithreading context if the script is loaded using 'lua-load'. To fix the issue, we protect the call with hlua_{lock,unlock}() helpers. This should be backported up to 2.6.	2024-03-04 16:47:18 +01:00
Aurelien DARRAGON	51f291c795	BUG/MINOR: hlua: improper lock usage in hlua_filter_callback() In hlua_filter_callback(), some lua stack work is performed under SET_SAFE_LJMP() guard which also takes care of locking the hlua context when needed. However, a lua_gettop() call is performed out of the guard, thus it is unsafe in multithreading context if the script is loaded using 'lua-load' because in this case the main lua stack is shared between threads and each access to a lua stack must be performed under the lock, thus we move lua_gettop() call under the lock. It should be backported up to 2.6.	2024-03-04 16:47:17 +01:00
Aurelien DARRAGON	9578524091	BUG/MINOR: hlua: fix possible crash in hlua_filter_new() under load hlua_filter_new() handles memory allocation errors by jumping to the "end:" cleanup label in case of errors. Such errors may happen when the system is heavily loaded for instance. In hlua_filter_new(), we try to allocate two hlua contexts in a row before checking if one of them failed (in which case we jump to the cleanup part of the function), and only then we initialize them both. If a memory allocation failure happens for only one out of the two flt_ctx->hlua[] contexts pair, we still jump to the cleanup part. It means that the hlua context that was successfully allocated and wasn't initialized yet will be passed to hlua_ctx_destroy(), resulting in invalid reads in the cleanup function, which may ultimately cause the process to crash. To fix the issue: we make sure flt_ctx hlua contexts are initialized right after they are allocated, that is before any error handling condition that may force the cleanup. This bug was discovered when trying to reproduce GH #2467 with haproxy started with "-dMfail" argument. It should be backported up to 2.6.	2024-03-04 16:47:03 +01:00
Aurelien DARRAGON	369bfa0b50	BUG/MINOR: hlua: don't use lua_tostring() from unprotected contexts As per lua documentation, lua_tostring() may raise a memory error. However, we're often using it to fetch the error message at the top of the stack (ie: after a failing lua call) from unprotected environments. In practise, lua_tostring() has rare chances of failing, but still, if it happens to be the case, it could crash the process and we better not risk it. So here, we add hlua_tostring_safe() function, which works exactly as lua_tostring(), but the function cannot LJMP as it will catch lua_tostring() exceptions to return NULL instead. Everywhere lua_tostring() was used to retrieve error string from such unprotected contexts, we now rely on hlua_tostring_safe(). This should be backported to all stable versions. [ada: ctx adj will be required, for versions prior to 2.8 event_hdl API didn't exist so some chunks won't apply, and prior to 2.5 filters API didn't exist either, so again, some chunks should be ignored]	2024-03-04 16:46:55 +01:00
Aurelien DARRAGON	5508db9a20	BUG/MINOR: hlua: fix unsafe lua_tostring() usage with empty stack Lua documentation says that lua_tostring() returns a pointer that remains valid as long as the object is not removed from the stack. However there are some places were we use the returned string AFTER the corresponding object is removed from the stack. In practise this doesn't seem to cause visible bugs (probably because the pointer remains valid waiting for a GC cycle), but let's fix that to comply with the documentation and avoid undefined behavior. It should be backported in all stable versions.	2024-03-04 16:46:53 +01:00
Christopher Faulet	31ec9f18bb	MINOR: hlua: Be able to disable logging from lua Add core.silent (-1) value to be able to disable logging via TXN:set_loglevel() call. Otherwise, there is no way to do so and it may be handy. This special value cannot be used with TXN:log() function. This patch may be backported if necessary.	2024-03-01 15:01:18 +01:00
Christopher Faulet	75fb0afde4	BUG/MINOR: hlua: Fix log level to the right value when set via TXN:set_loglevel When the log level is changed in lua, by calling TXN:set_loglevel function, it must be incremented by one because it is decremented in strm_log() function. This patch must be backport to all stable versions.	2024-03-01 15:01:18 +01:00
Christopher Faulet	56e73df37d	BUG/MEDIUM: hlua: Don't loop if a lua socket does not consume received data If some data are received for a lua socket while the lua script responsible to consume these data is not ready to do so, for instance because it is sleeping, the applet is woken up in loop because it never states it will not consume these data yet. To fix the issue, in the applet I/O handle, when there are outgoing data, we always pretend the applet will not consume it. It is the responsibility to the lua script to reactivate receives by calling Socket.receive() function. This patch must be backported to every stable version. For 2.4 and older, si_want_get()/si_cant_get() must be used instead of applet_will_consume()/applet_wont_consume().	2024-02-16 15:48:08 +01:00
Christopher Faulet	38534d344b	BUG/MEDIUM: hlua: Be able to garbage collect uninitialized lua sockets It is poosible to create a lua socket without performing any connect. In this case, the lua socket is released because of the garbage collector. However, the garbarge collector does not release the applet, it wakes it up. Since commit `751b59c40b` ("BUG/MEDIUM: hlua: Initialize appctx used by a lua socket on connect only"), the applet initialization is performed on connect. So, here, it is possible to wake an uninitialized applet. It is an unexpected case for the applet's I/O handler, leading to a segfault because some resources are not initialized (the stream's target in this case). So, now, in the lua socket GC function, we take care to immediately release uninitialized applets. At worst, the release itself is delayed. But it is safe because we are sure the applet's I/O handler will never be executed. In addition, we take case to increment the GC counter when the lua socket is created. The way, uninitialized lua socket are released more quickly. This patch should fix the issue #2451. It must be backported as far as 2.6.	2024-02-16 15:48:08 +01:00
Christopher Faulet	dcd917d972	MINOR: applet: Remove uselelss test on SE_FL_SHR/SHW flags These both flags are set after releasing the applet, in appctx_shut(). Concretly, it means the applet is shutdown for reads and writes. Once set, the applet's I/O handler was no longer called. Tests on these flags are useless. There is no chance to match them.	2024-02-14 14:22:36 +01:00
Aurelien DARRAGON	03cb782bcb	MINOR: hlua: Rename set_{tos, mark} to set_fc_{tos, mark} This is a complementary patch to "MINOR: tcp-act: Rename "set-{mark,tos}" to "set-fc-{mark,tos}"", but for the Lua API. set_mark and set_tos were kept as aliases for set_fc_mark and set_fc_tos but they were marked as deprecated. Using this opportunity to reorder set_mark and set_tos by alphabetical order.	2024-02-01 10:58:30 +01:00
Aurelien DARRAGON	f41402ab29	CLEANUP: hlua: fix indent, remove extra return in hlua_core_get_var() This is cleanup patch to address cosmetic issues introduced in `f034139bc0` ("MINOR: lua: Allow reading "proc." scoped vars from LUA core.") Also taking this opportunity to prefix the function with __LJMP to indicate that it may longjump. No backport needed.	2024-01-24 16:27:47 +01:00
Aurelien DARRAGON	564addcb72	BUG/MINOR: hlua: fix uninitialized var in hlua_core_get_var() As raised by Coverity in GH #2223, `f034139bc0` ("MINOR: lua: Allow reading "proc." scoped vars from LUA core.") causes uninitialized reads due to smp being passed to vars_get_by_name() without being initialized first. Indeed, vars_get_by_name() tries to read smp->sess and smp->strm pointers. As we're only interested in the PROC var scope, it is safe to call vars_get_by_name() with sess and strm pointers set to NULL, thus we simply memset smp prior to calling vars_get_by_name() to fix the issue. This should be backported in 2.9 with `f034139bc0`.	2024-01-24 16:27:38 +01:00
Aurelien DARRAGON	835263047e	OPTIM: server: ebtree lookups for findserver_unique_* functions `4e5e2664` ("MINOR: proxy: add findserver_unique_id() and findserver_unique_name()") added findserver_unique_id() and findserver_unique_name() functions that were inspired from the historical findserver() function, so unfortunately they don't perform well when used on large backend farms because they scan the whole server list linearly. I was about to provide a patch to optimize such functions when I stumbled on Baptiste's work: `19a106d24` ("MINOR: server: server_find functions: id, name, best_match") It turns out Baptiste already implemented helper functions to supersed the unoptimized findserver() function (at least at runtime when servers have been assigned their final IDs and inserted in the lookup trees): they offer more matching options and rely on eb lookups so they are much more suitable for fast queries. I don't know how I missed that, but they are a perfect base for the server rid matching functions. So in this patch, we essentially revert `4e5e2664` to provide the optimized equivalent functions named server_find_by_id_unique() and server_find_by_name_unique(), then we force existing findserver_unique_*() callers to switch to the new functions. This patch depends on: - "OPTIM: server: eb lookup for server_find_by_name()" This could be backported up to 2.8.	2023-12-21 14:22:26 +01:00
Frédéric Lécaille	917f7c74d3	BUG/MINOR: lua: Wrong OCSP CID after modifying an SSL certficate (LUA) This bugfix is the same as the following one: "BUG/MINOR: ssl_ckch: Wrong OCSP CID after modifying an SSL certficate" where the OCSP CID had to be reset when updating a certificate. Must be backported to 2.8.	2023-12-06 16:12:08 +01:00
Ilya Shipitsin	80813cdd2a	CLEANUP: assorted typo fixes in the code and comments This is 37th iteration of typo fixes	2023-11-23 16:23:14 +01:00
Willy Tarreau	cf07cb96be	BUG/MEDIUM: proxy: always initialize the default settings after init The proxy's initialization is rather odd. First, init_new_proxy() is called to zero all the lists and certain values, except those that can come from defaults, which are initialized by proxy_preset_defaults(). The default server settings are also only set there. This results in these settings not to be set for a number of internal proxies that do not explicitly call proxy_preset_defaults() after allocation, such as sink and log forwarders. This was revealed by last commit `79aa63823` ("MINOR: server: always initialize pp_tlvs for default servers") which crashes in log parsers when applied to certain proxies which did not initialize their default servers. In theory this should be backported, however it would be desirable to wait a bit before backporting it, in case certain parts would rely on these elements not being initialized.	2023-11-13 09:17:05 +01:00
Aurelien DARRAGON	12582eb8e5	MINOR: tools: make str2sa_range() directly return type hints str2sa_range() already allows the caller to provide <proto> in order to get a pointer on the protocol matching with the string input thanks to `5fc9328a` ("MINOR: tools: make str2sa_range() directly return the protocol") However, as stated into the commit message, there is a trick: "we can fail to return a protocol in case the caller accepts an fqdn for use later. This is what servers do and in this case it is valid to return no protocol" In this case, we're unable to return protocol because the protocol lookup depends on both the [proto type + xprt type] and the [family type] to be known. While family type might not be directly resolved when fqdn is involved (because family type might be discovered using DNS queries), proto type and xprt type are already known. As such, the caller might be interested in knowing those address related hints even if the address family type is not yet resolved and thus the matching protocol cannot be looked up. Thus in this patch we add the optional net_addr_type (custom type) argument to str2sa_range to enable the caller to check the protocol type and transport type when the function succeeds.	2023-11-10 17:49:57 +01:00
Tristan	8da0e45382	MINOR: lua: change tune.lua.log.stderr default from 'on' to 'auto' After making it configurable in previous commit "MINOR: lua: Add flags to configure logging behaviour", this patch changes the default value of tune.lua.log.stderr from 'on' (unconditionally forward LUA logs to stderr) to 'auto' (only forward LUA logs to stderr if logging via a standard logger is disabled, or none is configured for the current context) Since this is a change in behaviour, it shouldn't be backported	2023-10-25 07:49:03 +02:00
Tristan	97dacbbb86	MINOR: lua: Add flags to configure logging behaviour Until now, messages printed from LUA log functions were sent both to the any logger configured for the current proxy, and additionally to stderr (in most cases) This introduces two flags to configure LUA log handling: - tune.lua.log.loggers to use standard loggers or not - tune.lua.log.stderr to use stderr, or not, or only conditionally This addresses github feature request #2316 This can be backported to 2.8 as it doesn't change previous behaviour.	2023-10-25 07:48:48 +02:00
Christopher Faulet	322d660d08	MINOR: tree-wide: Only rely on co_data() to check channel emptyness Because channel_is_empty() function does now only check the channel's buffer, we can remove it and rely on co_data() instead. Of course, all tests must be inverted. channel_is_empty() is thus removed.	2023-10-17 18:51:13 +02:00
Christopher Faulet	2f9db80cc6	CLEANUP: hlua: Remove dead-code on error path in hlua_socket_new() Since last fixes about the lua cosocket, the appctx is no longer initialized in hlua_socket_new(). The code to deal with error at this stage can be removed. This patch should fix the issue #2308.	2023-10-17 18:11:04 +02:00
Christopher Faulet	751b59c40b	BUG/MEDIUM: hlua: Initialize appctx used by a lua socket on connect only Ths appctx used by a lua socket was synchronously initialized after the appctx creation. The connect itself is performed later. However it is an issue because the script may be interrupted beteween the two operation. In this case, the stream attached to the appctx is woken up before any destination is set. The stream will try to connect but without destination, it fails. When the lua script is rescheduled and the connect is performed, the connection has already failed and an error is returned. To fix the issue, we must be sure to not woken up the stream before the connect. To do so, we must defer the appctx initilization. It is now perform on connect. This patch relies on the following commits: * MINOR: hlua: Test the hlua struct first when the lua socket is connecting * MINOR: hlua: Save the lua socket's server in its context * MINOR: hlua: Save the lua socket's timeout in its context * MINOR: hlua: Don't preform operations on a not connected socket * MINOR: hlua: Set context's appctx when the lua socket is created All the series must be backported as far as 2.6.	2023-10-04 15:34:13 +02:00
Christopher Faulet	66fc9238f0	MINOR: hlua: Test the hlua struct first when the lua socket is connecting It makes sense to first verify the hlua context is valid. It is probably better than doing it after updated the appctx.	2023-10-04 15:34:10 +02:00
Christopher Faulet	6f4041c75d	MINOR: hlua: Save the lua socket's server in its context For the same reason than the timeout, the server used by a lua socket is now saved in its context. This will be mandatory to fix issues with the lua sockets.	2023-10-04 15:34:06 +02:00
Christopher Faulet	0be1ae2fa2	MINOR: hlua: Save the lua socket's timeout in its context When the lua socket timeout is set, it is now saved in its context. If there is already a stream attached to the appctx, the timeout is then immediately modified. Otherwise, it is modified when the stream is created, thus during the appctx initialization. For now, the appctx is initialized when it is created. But this will change to fix issues with the lua sockets. Thus, this patch is mandatory.	2023-10-04 15:34:03 +02:00
Christopher Faulet	ee687aa18d	MINOR: hlua: Don't preform operations on a not connected socket There is nothing that prevent someone to create a lua socket and try to receive or to write before the connection was established ot after the shutdown was performed. The same is true when info about the socket are retrieved. It is not an issue because this will fail later. But now, we check the socket is connected or not earlier. It is more effecient but it will be also mandatory to fix issue with the lua sockets.	2023-10-04 15:34:00 +02:00
Christopher Faulet	ed9333827a	MINOR: hlua: Set context's appctx when the lua socket is created The lua socket's context referenced the owning appctx. It was set when the appctx was initialized. It is now performed when the appctx is created. It is a small change but this will be required to fix several issues with the lua sockets.	2023-10-04 15:33:57 +02:00
Sébastien Gross	6a9ba85322	MINOR: hlua: Add support for the "http-after-res" action This commit introduces support for the "http-after-res" action in hlua, enabling the invocation of a Lua function in a "http-after-response" rule. With this enhancement, a Lua action can be registered using the "http-after-res" action type: core.register_action('myaction', {'http-after-res'}, myaction) A new "lua.myaction" is created and can be invoked in a "http-after-response" rule: http-after-response lua.myaction This addition provides greater flexibility and extensibility in handling post-response actions using Lua. This commit depends on: - `4457783` ("MINOR: http_ana: position the FINAL flag for http_after_res execution") Signed-off-by: Sébastien Gross <sgross@haproxy.com>	2023-09-21 16:31:20 +02:00
Aurelien DARRAGON	1115fc348e	BUG/MINOR: hlua/init: coroutine may not resume itself It's not supported to call lua_resume with <L> and <from> designating the same lua coroutine. It didn't cause visible bugs so far because Lua 5.3 used to be more permissive about this, and moreover, yielding is not involved during the hlua init state. But this is wrong usage, and the doc clearly specifies that the <from> argument can be NULL when there is no such coroutine, which is the case here. This should be backported in every stable versions.	2023-09-12 19:50:17 +02:00
Aurelien DARRAGON	e7281f3f5d	BUG/MEDIUM: hlua: don't pass stale nargs argument to lua_resume() In hlua_ctx_resume(), we call lua_resume() function like this: lua_resume(lua->T, hlua_states[lua->state_id], lua->nargs) Once the call returns, we may call the function again with the same hlua context when E_YIELD is returned (the execution was interrupted and may be resumed through another lua_resume() call). The 3rd argument to lua_resume(), 'nargs', is a hint passed to Lua to know how many (optional) arguments were pushed on the stack prior to resuming the execution (arguments that Lua will then expose to the Lua script). But here is the catch: we never reset lua->nargs between successive lua_resume() calls, meaning that next lua_resume() calls will still inherit from the initial nargs value that was set in hlua ctx prior to calling hlua_ctx_resume() (our wrapper function) for the first time. This is problematic, because despite not being explicitly mentioned in the Lua documentation, passed arguments (to which `nargs` refer to), are already consumed once lua_resume() returns. This means that we cannot keep calling lua_resume() with non-zero nargs if we don't push new arguments on the stack prior to resuming lua after the initial call: nargs is proper to a single lua_resume() invocation. Despite improper use of lua_resume() for a long time, this didn't cause visible issues in the past with Lua 5.3, but it is particularly sensitive starting with Lua 5.4.3 due to debugging hooks improvements that led to some internal changes (see: lua/lua@58aa09a). Not using nargs properly now exposes us to undefined behavior when resuming after a yield triggered from a debugging hook, which may cause running scripts to crash unexpectedly: for instance with Lua raising errors and complaining about values being NULL where it should not be the case. For reference, this issue was initially raised on the Lua mailing list: http://lua-users.org/lists/lua-l/2023-09/msg00005.html In this patch, we immediately reset nargs when lua_resume() returns to prevent any misuse. It should be backported to every maintained versions.	2023-09-12 19:50:17 +02:00
Aurelien DARRAGON	967608a432	BUG/MINOR: hlua/action: incorrect message on E_YIELD error When hlua_action error messages were reworked in `d5b073cf1` ("MINOR: lua: Improve error message"), an error was made for the E_YIELD case. Indeed, everywhere E_YIELD error is handled: "yield is not allowed" or similar error message is reported to the user. But instead we currently have: "aborting Lua processing on expired timeout". It is quite misleading because this error message often refers to the HLUA_E_ETMOUT case. Thus, we now report the proper error message thanks to this patch. This should be backported to all stable versions. [on 2.0, the patch needs to be slightly adapted]	2023-09-06 11:42:34 +02:00
Fr�d�ric L�caille	81815a9a83	MEDIUM: map/acl: Replace map/acl spin lock by a read/write lock. Replace ->lock type of pat_ref struct by HA_RWLOCK_T. Replace all calls to HA_SPIN_LOCK() (resp. HA_SPIN_UNLOCK()) by HA_RWLOCK_WRLOCK() (resp. HA_RWLOCK_WRUNLOCK()) when a write access is required. There is only one read access which is needed. This is in the "show map" command callback, cli_io_handler_map_lookup() where a HA_SPIN_LOCK() call is replaced by HA_RWLOCK_RDLOCK() (resp. HA_SPIN_UNLOCK() by HA_RWLOCK_RDUNLOCK). Replace HA_SPIN_INIT() calls by HA_RWLOCK_INIT() calls.	2023-08-25 15:42:03 +02:00
Fr�d�ric L�caille	0844bed7d3	MEDIUM: map/acl: Improve pat_ref_set() efficiency (for "set-map", "add-acl" action perfs) Organize reference to pattern element of map (struct pat_ref_elt) into an ebtree: - add an eb_root member to the map (pat_ref struct) and an ebpt_node to its element (pat_ref_elt struct), - modify the code to insert these nodes into their ebtrees each time they are allocated. This is done in pat_ref_append(). Note that ->head member (struct list) of map (struct pat_ref) is not removed could have been removed. This is not the case because still necessary to dump the map contents from the CLI in the order the map elememnts have been inserted. This patch also modifies http_action_set_map() which is the callback at least used by "set-map" action. The pat_ref_elt element returned by pat_ref_find_elt() is no more ignored, but reused if not NULL by pat_ref_set() as first element to lookup from. This latter is also modified to use the ebtree attached to the map in place of the ->head list attached to each map element (pat_ref_elt struct). Also modify pat_ref_find_elt() to makes it use ->eb_root map ebtree added to the map by this patch in place of inspecting all the elements with a strcmp() call.	2023-08-25 15:41:56 +02:00
Aurelien DARRAGON	09133860bf	BUG/MEDIUM: hlua: streams don't support mixing lua-load with lua-load-per-thread Michel Mayen reported that mixing lua actions loaded from 'lua-load' and 'lua-load-per-thread' directives within a single http/tcp session yields unexpected results. When executing action defined in another running context from the one of the previously executed action (from lua-load, then from lua-load-per-thread or the opposite, order doesn't matter), it would yield this kind of error: "Lua function 'name': [state-id x] runtime error: attempt to call a nil value from ." He also noted that when loading all actions using the same loading directive, the issue is gone. This is due to the fact that for lua actions, fetches and converters, lua code is being executed from the stream lua context. However, the stream lua context, which is created on the fly when first executing some lua code related to the stream, is reused between multiple lua executions. But the thing is, despite successive executions referring to the same parent "stream" (which is also assigned to a given thread id), they don't necessarily depend on the same running context from lua point of view. Indeed, since the function which is about to be executed could have been loaded from either 'lua-load' or 'lua-load-per-thread', the function declaration and related dependencies are defined in a specific stack ID which is known by calling fcn_ref_to_stack_id() on the given function. Thus, in order to make streams capable of chaining lua actions, fetches and converters loaded in different lua stacks, we add a new detection logic in hlua_stream_ctx_prepare() to be able to recreate the lua context in the proper stack space when the existing one conflicts with the expected stack id. This must be backported in every stable versions. It depends on: - "MINOR: hlua: add hlua_stream_prepare helper function" [for < 2.5, skip the filter part since they didn't exist] [wt: warning, wait a little bit before backporting too far, we need to be certain the added BUG_ON() will never trigger]	2023-08-11 19:02:59 +02:00
Aurelien DARRAGON	2fdb9d41b3	MINOR: hlua: add hlua_stream_ctx_prepare helper function Stream-dedicated hlua ctx creation and attachment is now performed in hlua_stream_ctx_prepare() helper function to ease code maintenance. No functional behavior change should be expected.	2023-08-11 19:00:57 +02:00
Aurelien DARRAGON	12cf8d4db7	BUG/MINOR: hlua: fix invalid use of lua_pop on error paths Multiple error paths made invalid use of lua_pop(): When the stack is emptied using lua_settop(0), lua_pop() (which is implemented as a lua_settop() macro) should not be used right after, because it could lead to invalid reads since the stack is already empty. Unfortunately, some remnants from initial lua stack implementation kept doing so, resulting in haproxy crashs on some lua runtime errors paths from time to time (ie: ERRRUN, ERRMEM). Moreover, the extra lua_pop() instruction, even if it was safe, is totally pointless in such case. Removing such unsafe lua_pop() statements when we know that the stack is already empty. This must be backported in every stable versions.	2023-08-11 19:00:55 +02:00
Daan van Gorkum	f034139bc0	MINOR: lua: Allow reading "proc." scoped vars from LUA core. This adds the "core.get_var()" method allow the reading of "proc." scoped variables outside of TXN or HTTP/TCPApplet. Fixes: #2212 Signed-off-by: Daan van Gorkum <djvg@djvg.net>	2023-07-20 10:55:28 +02:00
firexinghe	bfff46f411	BUG/MINOR: hlua: add check for lua_newstate Calling lual_newstate(Init main lua stack) in the hlua_init_state() function, the return value of lua_newstate() may be NULL (for example in case of OOM). In this case, L will be NULL, and then crash happens in lua_getextraspace(). So, we add a check for lua_newstate. This should be backported at least to 2.4, maybe further.	2023-07-19 10:16:14 +02:00
Aurelien DARRAGON	2e7d3d2e5c	BUG/MINOR: hlua: hlua_yieldk ctx argument should support pointers lua_yieldk ctx argument is of type lua_KContext which is typedefed to intptr_t when available so it can be used to store pointers. But the wrapper function hlua_yieldk() passes it as a regular it so it breaks that promise. Changing hlua_yieldk() prototype so that ctx argument is of type lua_KContext. This bug had no functional impact because ctx argument is not being actively used so far. This may be backported to all stable versions anyway.	2023-07-17 07:42:47 +02:00
Christopher Faulet	28d17e26b8	BUG/MEDIUM: hlua: Use front SC to detect EOI in HTTP applets' receive functions When an HTTP applet tries to get request data, we must take care to properly detect the end of the message. It an empty HTX message with the SC_FL_EOI flag set on the front SC. However, an issue was introduced during the SC refactoring performed in the 2.8. The backend SC is tested instead of the frontend one. Because of this bug, the receive functions hang because the test on SC_FL_EOI flag never succeeds. Of course, by checking the frontend SC (the opposite SC to the one attached to the appctx), it works. This patch should fix the issue #2180. It must be backported to the 2.8.	2023-06-12 09:16:29 +02:00
Aurelien DARRAGON	1c07da4b48	BUG/MINOR: hlua: unsafe hlua_lua2smp() usage Fixing hlua_lua2smp() usage in hlua's code since it was assumed that hlua_lua2smp() makes a standalone smp out of lua data, but it is not the case. This is especially true when dealing with lua strings (string is extracted using lua_tolstring() which returns a pointer to lua string memory location that may be reclaimed by lua at any time when no longer used from lua's point of view). Thus, smp generated by hlua_lua2smp() may only be used from the lua context where the call was initially made, else it should be explicitly duplicated before exporting it out of lua's context to ensure safe (standalone) usage. This should be backported to all stable versions.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	a3624cb528	DOC: hlua: document hlua_lua2smp() function Add some developer notes to hlua_lua2smp() function description since it lacks some important infos, including a critical usage restriction.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	0aaf6c45ca	DOC: hlua: document hlua_lua2arg() function Add some developer notes to hlua_lua2arg() function description since it lacks some important infos, including an usage restriction.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	e5c048a72d	MINOR: hlua: hlua_arg2lua() may LJMP Add LJMP hint to hlua_arg2lua() prototype since it relies on functions (e.g.: lua_pushlstring()) which may raise lua memory errors.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	4121772c50	MINOR: hlua: hlua_smp2lua() may LJMP Add LJMP hint to hlua_smp2lua() prototype since it relies on functions (e.g.: lua_pushstring()) which may raise lua memory errors.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	742b1a8797	MINOR: hlua: hlua_smp2lua_str() may LJMP Add LJMP hint to hlua_smp2lua_str() prototype since it relies on functions (e.g.: lua_pushstring()) which may raise lua memory errors.	2023-05-24 16:48:17 +02:00
Aurelien DARRAGON	7428adaf0d	BUG/MINOR: hlua: SET_SAFE_LJMP misuse in hlua_event_runner() When hlua_event_runner() pauses the subscription (ie: if the consumer can't keep up the pace), hlua_traceback() is used to get the current lua trace (running context) to provide some info to the user. However, as hlua_traceback() may raise an error (__LJMP) is set, it is used within a SET_SAFE_LJMP() / RESET_SAFE_LJMP() combination to ensure lua errors are properly handled and don't result in unexpected behavior. But the current usage of SET_SAFE_LJMP() within the function is wrong since hlua_traceback() will run a second time (unprotected) if the first (protected) attempt fails. This is undefined behavior and could even lead to crashes. Hopefully it is very hard to trigger this code path, thus we can consider this as a minor bug. Also using this as an opportunity to enhance the message report to make it more meaningful to the user. This should fix GH #2159. It is a 2.8 specific bug, no backport needed unless `c84899c636` ("MEDIUM: hlua/event_hdl: initial support for event handlers") gets backported.	2023-05-17 16:48:40 +02:00
Aurelien DARRAGON	40cd44f52c	MINOR: hlua: declare hlua_gethlua() function Declaring hlua_gethlua() function to make it usable from hlua_fcn.c.	2023-05-05 16:28:32 +02:00
Aurelien DARRAGON	e0b16355ce	CLEANUP: hlua: hlua_register_task() may longjmp Adding __LJMP prefix to hlua_register_task() to indicate that the function may longjmp when executed.	2023-05-05 16:28:32 +02:00
Aurelien DARRAGON	a2c5321045	BUG/MINOR: hlua: spinning loop in hlua_socket_handler() Since `3157222` ("MEDIUM: hlua/applet: Use the sedesc to report and detect end of processing"), hlua_socket_handler() might spin loop if the hlua socket is destroyed and some data was left unconsumed in the applet. Prior to the above commit, the stream was explicitly KILLED (when ctx->die == 1) so the app couldn't spinloop on unconsumed data. But since the refactor this is no longer the case. To prevent unconsumed data from waking the applet indefinitely, we consume pending data when either one of EOS\|ERROR\|SHR\|SHW flags are set, as it is done everywhere else this check is performed in the code. Hence it was probably overlooked in the first place during the refacto. This bug is 2.8 specific only, so no backport needed.	2023-05-05 16:28:32 +02:00
Aurelien DARRAGON	5bed48fec8	MINOR: mailers/hlua: disable email sending from lua Exposing a new hlua function, available from body or init contexts, that forcefully disables the sending of email alerts even if the mailers are defined in haproxy configuration. This will help for sending email directly from lua. (prevent legacy email sending from intefering with lua)	2023-05-05 16:28:32 +02:00
Aurelien DARRAGON	0bd53b2152	MINOR: hlua/event_hdl: expose SERVER_CHECK event Exposing SERVER_CHECK event through the lua API. New lua class named ServerEventCheck was added to provide additional data for SERVER_CHECK event. Lua documentation was updated accordingly.	2023-05-05 16:28:32 +02:00
Aurelien DARRAGON	948dd3ddfb	MINOR: hlua: expose SERVER_ADMIN event Exposing SERVER_ADMIN event in lua and updating the documentation.	2023-05-05 16:28:32 +02:00
Aurelien DARRAGON	c99f3adf10	MINOR: hlua: expose SERVER_STATE event Exposing SERVER_STATE event in lua and updating the documentation.	2023-05-05 16:28:32 +02:00
Christopher Faulet	7b3d38a633	MEDIUM: tree-wide: Change sc API to specify required free space to progress sc_need_room() now takes the required free space to receive more data as parameter. All calls to this function are updated accordingly. For now, this value is set but not used. When we are waiting for a buffer, 0 is used. So we expect to be unblocked ASAP. However this must be reviewed because SC_FL_NEED_BUF is probably enough in this case and this flag is already set if the input buffer allocation fails.	2023-05-05 15:44:23 +02:00
Willy Tarreau	69530f59ae	MEDIUM: clock: replace timeval "now" with integer "now_ns" This puts an end to the occasional confusion between the "now" date that is internal, monotonic and not synchronized with the system's date, and "date" which is the system's date and not necessarily monotonic. Variable "now" was removed and replaced with a 64-bit integer "now_ns" which is a counter of nanoseconds. It wraps every 585 years, so if all goes well (i.e. if humanity does not need haproxy anymore in 500 years), it will just never wrap. This implies that now_ns is never nul and that the zero value can reliably be used as "not set yet" for a timestamp if needed. This will also simplify date checks where it becomes possible again to do "date1<date2". All occurrences of "tv_to_ns(&now)" were simply replaced by "now_ns". Due to the intricacies between now, global_now and now_offset, all 3 had to be turned to nanoseconds at once. It's not a problem since all of them were solely used in 3 functions in clock.c, but they make the patch look bigger than it really is. The clock_update_local_date() and clock_update_global_date() functions are now much simpler as there's no need anymore to perform conversions nor to round the timeval up or down. The wrapping continues to happen by presetting the internal offset in the short future so that the 32-bit now_ms continues to wrap 20 seconds after boot. The start_time used to calculate uptime can still be turned to nanoseconds now. One interrogation concerns global_now_ms which is used only for the freq counters. It's unclear whether there's more value in using two variables that need to be synchronized sequentially like today or to just use global_now_ns divided by 1 million. Both approaches will work equally well on modern systems, the difference might come from smaller ones. Better not change anyhting for now. One benefit of the new approach is that we now have an internal date with a resolution of the nanosecond and the precision of the microsecond, which can be useful to extend some measurements given that timestamps also have this resolution.	2023-04-28 16:08:08 +02:00
Willy Tarreau	ad5a5f6779	MEDIUM: tree-wide: replace timeval with nanoseconds in tv_accept and tv_request Let's get rid of timeval in storage of internal timestamps so that they are no longer mistaken for wall clock time. These were exclusively used subtracted from each other or to/from "now" after being converted to ns, so this patch removes the tv_to_ns() conversion to use them natively. Two occurrences of tv_isge() were turned to a regular wrapping subtract.	2023-04-28 16:08:08 +02:00
Ilya Shipitsin	ccf8012f28	CLEANUP: assorted typo fixes in the code and comments This is 36th iteration of typo fixes	2023-04-23 09:44:53 +02:00
Tim Duesterhus	fe83f58906	CLEANUP: Stop checking the pointer before calling `task_free()` Changes performed with this Coccinelle patch: @@ expression e; @@ - if (e != NULL) { task_destroy(e); - } @@ expression e; @@ - if (e) { task_destroy(e); - } @@ expression e; @@ - if (e) task_destroy(e); @@ expression e; @@ - if (e != NULL) task_destroy(e);	2023-04-23 00:28:25 +02:00
Aurelien DARRAGON	096b383e16	MINOR: hlua/event_hdl: timestamp for events 'when' optional argument is provided to lua event handlers. It is an integer representing the number of seconds elapsed since Epoch and may be used in conjunction with lua `os.date()` function to provide a custom format string.	2023-04-21 14:36:45 +02:00
Aurelien DARRAGON	2f6a07dce8	MINOR: hlua/event_hdl: fix return type for hlua_event_hdl_cb_data_push_args Changing hlua_event_hdl_cb_data_push_args() return type to void since it does not return anything useful. Also changing its name to hlua_event_hdl_cb_push_args() since it does more than just pushing cb data argument (it also handles event type and mgmt). Errors catched by the function are reported as lua errors.	2023-04-21 14:36:45 +02:00
Aurelien DARRAGON	55f84c7cab	MINOR: hlua/event_hdl: expose proxy_uuid variable in server events Adding proxy_uuid to ServerEvent class. proxy_uuid contains the uuid of the proxy to which the server belongs	2023-04-21 14:36:45 +02:00
Aurelien DARRAGON	3d9bf4e1a5	MINOR: hlua/event_hdl: rely on proxy_uuid instead of proxy_name for lookups Since "MINOR: server/event_hdl: add proxy_uuid to event_hdl_cb_data_server" we may now use proxy_uuid variable to perform proxy lookups when handling a server event. It is more reliable since proxy_uuid isn't subject to any size limitation	2023-04-21 14:36:45 +02:00
Aurelien DARRAGON	8cd620b46f	MINOR: hlua: safe coroutine.create() Overriding global coroutine.create() function in order to link the newly created subroutine with the parent hlua ctx. (hlua_gethlua() function from a subroutine will return hlua ctx from the hlua ctx on which the coroutine.create() was performed, instead of NULL) Doing so allows hlua_hook() function to support being called from subroutines created using coroutine.create() within user lua scripts. That is: the related subroutine will be immune to the forced-yield, but it will still be checked against hlua timeouts. If the subroutine fails to yield or finish before the timeout, the related lua handler will be aborted (instead of going rogue unnoticed like it would be the case prior to this commit)	2023-04-19 11:03:31 +02:00
Aurelien DARRAGON	cf0f792490	MINOR: hlua: hook yield on known lua state When forcing a yield attempt from hlua_hook(), we should perform it on the known hlua state, not on a potential substate created using coroutine.create() from an existing hlua state from lua script. Indeed, only true hlua couroutines will properly handle the yield and perform the required timeout checks when returning in hlua_ctx_resume(). So far, this was not a concern because hlua_gethlua() would return NULL if hlua_hook() is not directly being called from a hlua coroutine anyway. But with this we're trying to make hlua_hook() ready for being called from a subcoroutine which inherits from a parent hlua ctx. In this case, no yield attempt will be performed, we will simply check for hlua timeouts. Not doing so would result in the timeout checks not being performed since hlua_ctx_resume() is completely bypassed when yielding from the subroutine, resulting in a user-defined coroutine potentially going rogue unnoticed.	2023-04-19 11:03:31 +02:00
Aurelien DARRAGON	2a9764baae	CLEANUP: hlua: avoid confusion between internal timers and tick based timers Not all hlua "time" variables use the same time logic. hlua->wake_time relies on ticks since its meant to be used in conjunction with task scheduling. Thus, it should be stored as a signed int and manipulated using the tick api. Adding a few comments about that to prevent mixups with hlua internal timer api which doesn't rely on the ticks api.	2023-04-19 11:03:31 +02:00
Aurelien DARRAGON	58e36e5b14	MEDIUM: hlua: introduce tune.lua.burst-timeout The "burst" execution timeout applies to any Lua handler. If the handler fails to finish or yield before timeout is reached, handler will be aborted to prevent thread contention, to prevent traffic from not being served for too long, and ultimately to prevent the process from crashing because of the watchdog kicking in. Default value is 1000ms. Combined with forced-yield default value of 10000 lua instructions, it should be high enough to prevent any existing script breakage, while still being able to catch slow lua converters or sample fetches doing thread contention and risking the process stability. Setting value to 0 completely bypasses this check. (not recommended but could be required to restore original behavior if this feature breaks existing setups somehow...) No backport needed, although it could be used to prevent watchdog crashes due to poorly coded (slow/cpu consuming) lua sample fetches/converters.	2023-04-19 11:03:31 +02:00
Aurelien DARRAGON	da9503ca9a	MEDIUM: hlua: reliable timeout detection For non yieldable lua handlers (converters, fetches or yield incompatible lua functions), current timeout detection relies on now_ms thread local variable. But within non-yieldable contexts, now_ms won't be updated if not by us (because we're momentarily stuck in lua context so we won't re-enter the polling loop, which is responsible for clock updates). To circumvent this, clock_update_date(0, 1) was manually performed right before now_ms is being read for the timeout checks. But this fails to work consistently, because if no other concurrent threads periodically run clock_update_global_date(), which do happen if we're the only active thread (nbthread=1 or low traffic), our clock_update_date() call won't reliably update our local now_ms variable Moreover, clock_update_date() is not the right tool for this anyway, as it was initially meant to be used from the polling context. Using it could have negative impact on other threads relying on now_ms to be stable. (because clock_update_date() performs global clock update from time to time) -> Introducing hlua multipurpose timer, which is internally based on now_cpu_time_fast() that provides per-thread consistent clock readings. Thanks to this new hlua timer API, hlua timeout logic is less error-prone and more robust. This allows the timeout detection to work as expected for both yieldable and non-yieldable lua handlers. This patch depends on commit "MINOR: clock: add now_cpu_time_fast() function" While this could theorically be backported to all stable versions, it is advisable to avoid backports unless we're confident enough since it could cause slight behavior changes (timing related) in existing setups.	2023-04-19 11:03:31 +02:00
Christopher Faulet	12762f09a5	MINOR: channel/stconn: Replace channel_shutr_now() by sc_schedule_abort() After the flag renaming, it is now the turn for the channel function to be renamed and moved in the SC scope. channel_shutr_now() is replaced by sc_schedule_abort(). The request channel is replaced by the front SC and the response is replace by the back SC.	2023-04-14 14:08:49 +02:00
Christopher Faulet	3db538ac2f	MINOR: channel: Forwad close to other side on abort Most of calls to channel_abort() are associated to a call to channel_auto_close(). Others are in areas where the auto close is the default. So, it is now systematically enabled when an abort is performed on a channel, as part of channel_abort() function.	2023-04-14 13:56:28 +02:00
Christopher Faulet	f65cf3684d	MINOR: hlua: Stop to check the SC state when executing a hlua cli command This part has changed but it was already handled by the CLI applet. There is no reason to performe this test when a hlua cli command is executed.	2023-04-11 08:19:06 +02:00
Willy Tarreau	22450af22a	BUG/MINOR: lua: remove incorrect usage of strncat() As every time strncat() is used, it's wrong, and this one is no exception. Users often think that the length applies to the destination except it applies to the source and makes it hard to use correctly. The bug did not have an impact because the length was preallocated from the sum of all the individual lengths as measured by strlen() so there was no chance one of them would change in between. But it could change in the future. Let's fix it to use memcpy() instead for strings, or byte copies for delimiters. No backport is needed, though it can be done if it helps to apply other fixes.	2023-04-07 16:04:54 +02:00

1 2 3 4 5 ...

970 Commits