haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-24 16:11:24 +02:00

Author	SHA1	Message	Date
Willy Tarreau	6bdf3e9b11	MINOR: debug/cli: add some debugging commands for developers When haproxy is built with DEBUG_DEV, the following commands are added to the CLI : debug dev close <fd> : close this file descriptor debug dev delay [ms] : sleep this long debug dev exec [cmd] ... : show this command's output debug dev exit [code] : immediately exit the process debug dev hex <addr> [len]: dump a memory area debug dev log [msg] ... : send this msg to global logs debug dev loop [ms] : loop this long debug dev panic : immediately trigger a panic debug dev tkill [thr] [sig] : send signal to thread These are essentially aimed at helping developers trigger certain conditions and are expected to be complemented over time.	2019-05-20 16:59:30 +02:00
Willy Tarreau	56131ca58e	MINOR: debug: implement ha_panic() This function dumps all existing threads using the thread dump mechanism then aborts. This will be used by the lockup detection and by debugging tools.	2019-05-20 16:51:30 +02:00
Willy Tarreau	9fc5dcbd71	MINOR: tools: add dump_hex() This is used to dump a memory area into a buffer for debugging purposes.	2019-05-20 16:51:30 +02:00
Willy Tarreau	da5a63f8f1	CLEANUP: stream: remove an obsolete debugging test The test consisted in checking that there was always a timeout on a stream's task and was only enabled when built in development mode, but 1) it is never tested and 2) if it had been tested it would have been noticed that it triggers a bit too easily on the CLI. Let's get rid of this old one.	2019-05-20 16:19:40 +02:00
Willy Tarreau	91e6df01fa	MINOR: threads: add each thread's clockid into the global thread_info This is the per-thread CPU runtime clock, it will be used to measure the CPU usage of each thread and by the lockup detection mechanism. It must only be retrieved at the beginning of run_thread_poll_loop() since the thread must already have been started for this. But it must be done before performing any per-thread initcall so that all thread init functions have access to the clock ID. Note that it could make sense to always have this clockid available even in non-threaded situations and place the process' clock there instead. But it would add portability issues which are currently easy to deal with by disabling threads so it may not be worth it for now.	2019-05-20 11:42:25 +02:00
Willy Tarreau	522cfbc1ea	MINOR: init/threads: make the global threads an array of structs This way we'll be able to store more per-thread information than just the pthread pointer. The storage became an array of struct instead of an allocated array since it's very small (typically 512 bytes) and not worth the hassle of dealing with memory allocation on this. The array was also renamed thread_info to make its intended usage more explicit.	2019-05-20 11:37:57 +02:00
Willy Tarreau	64a47b943c	CLEANUP: memory: make the fault injection code use the OTHER_LOCK label The mem_should_fail() function sets a lock while it's building its messages, and when this was done there was no relevant label available hence the confusing use of START_LOCK. Now OTHER_LOCK is available for such use cases, so let's switch to this one instead as START_LOCK is going to disappear.	2019-05-20 11:26:12 +02:00
Willy Tarreau	619a95f5ad	MEDIUM: init/mworker: make the pipe register function a regular initcall Now that we have the guarantee that init calls happen before any other thread starts, we don't need anymore the workaround installed by commit 1605c7ae6 ("BUG/MEDIUM: threads/mworker: fix a race on startup") and we can instead rely on a regular per-thread initcall for this function. It will only be performed on worker thread #0, the other ones and the master have nothing to do, just like in the original code that was only moved to the function.	2019-05-20 11:26:12 +02:00
Willy Tarreau	3078e9f8e2	MINOR: threads/init: synchronize the threads startup It's a bit dangerous to let threads initialize at different speeds on startup. Some are still in their init functions while others are already running. It was even subject to some race condition bugs like the one fixed by commit 1605c7ae6 ("BUG/MEDIUM: threads/mworker: fix a race on startup"). Here in order to secure all this, we take a very simplistic approach consisting in using half of the rendez-vous point, which is made exactly for this purpose : we first initialize the mask of the threads requesting a rendez-vous to the mask of all threads, and we simply call thread_release() once the init is complete. This guarantees that no thread will go further than the initialization code during this time. This could even safely be backported if any other issue related to an init race was discovered in a stable release.	2019-05-20 11:26:12 +02:00
William Lallemand	7b302d8dd5	MINOR: init: setenv HAPROXY_CFGFILES Set the HAPROXY_CFGFILES environment variable which contains the list of configuration files used to start haproxy, separated by semicolon.	2019-05-20 11:21:00 +02:00
Willy Tarreau	c7091d89ae	MEDIUM: debug/threads: implement an advanced thread dump system The current "show threads" command was too limited as it was not possible to dump other threads' detailed states (e.g. their tasks). This patch goes further by using thread signals so that each thread can dump its own state in turn into a shared buffer provided by the caller. Threads are synchronized using a mechanism very similar to the rendez-vous point and using this method, each thread can safely dump any of its contents and the caller can finally report the aggregated ones from the buffer. It is important to keep in mind that the list of signal-safe functions is limited, so we take care of only using chunk_printf() to write to a pre-allocated buffer. This mechanism is enabled by USE_THREAD_DUMP and is enabled by default on Linux 2.6.28+. On other platforms it falls back to the previous solution using the loop and the less precise dump.	2019-05-17 17:16:20 +02:00
Willy Tarreau	0ad46fa6f5	MINOR: stream: detach the stream from its own task on stream_free() This makes sure that the stream is not visible from its own task just before starting to free some of its components. This way we have the guarantee that a stream found in a task list is totally valid and can safely be dereferenced.	2019-05-17 17:16:20 +02:00
Willy Tarreau	01f3489752	MINOR: task: put barriers after each write to curr_task This one may be watched by signal handlers, we don't want the compiler to optimize its assignment away at the end of the loop and leave some wandering pointers there.	2019-05-17 17:16:20 +02:00
Willy Tarreau	38171daf21	MINOR: thread: implement ha_thread_relax() At some places we're using a painful ifdef to decide whether to use sched_yield() or pl_cpu_relax() to relax in loops, this is hardly exportable. Let's move this to ha_thread_relax() instead and use this one only.	2019-05-17 17:16:20 +02:00
Willy Tarreau	20db9115dc	BUG/MINOR: debug: don't check the call date on tasklets tasklets don't have a call date, so when a tasklet is cast into a task and is present at the end of a page we run a risk of dereferencing unmapped memory when dumping them in ha_task_dump(). This commit simplifies the test and uses two distinct calls for tasklets and tasks. No backport is needed.	2019-05-17 17:16:20 +02:00
Willy Tarreau	5cf64dd1bd	MINOR: debug: make ha_thread_dump() and ha_task_dump() take a buffer Instead of having them dump into the trash and initialize it, let's have the caller initialize a buffer and pass it. This will be convenient to dump multiple threads at once into a single buffer.	2019-05-17 17:16:20 +02:00
Willy Tarreau	14a1ab75d0	BUG/MINOR: debug: make ha_task_dump() actually dump the requested task It used to only dump the current task, which isn't different for now but the purpose clearly is to dump the requested task. No backport is needed.	2019-05-17 17:16:20 +02:00
Willy Tarreau	231ec395c1	BUG/MINOR: debug: make ha_task_dump() always check the task before dumping it For now it cannot happen since we're calling it from a task but it will break with signals. No backport is needed.	2019-05-17 17:16:20 +02:00
Olivier Houchard	6db1699f77	BUG/MEDIUM: streams: Try to L7 retry before aborting the connection. In htx_wait_for_response, in case of error, attempt a L7 retry before aborting the connection if the TX_NOT_FIRST flag is set. If we don't do that, then we wouldn't attempt L7 retries after the first request, or if we use HTTP/2, as with HTTP/2 that flag is always set.	2019-05-17 15:49:21 +02:00
Olivier Houchard	ce1a0292bf	BUG/MEDIUM: streams: Don't use CF_EOI to decide if the request is complete. In si_cs_send(), don't check CF_EOI on the request channel to decide if the request is complete and if we should save the buffer to eventually attempt L7 retries. The flag may not be set yet, and it may also be set too early, before we're done modifying the buffer. Instead, get the msg, and make sure its state is HTTP_MSG_DONE. That way we will store the request buffer when sending it even in H2.	2019-05-17 15:49:21 +02:00
Willy Tarreau	4e2b646d60	MINOR: cli/debug: add a thread dump function The new function ha_thread_dump() will dump debugging info about all known threads. The current thread will contain a bit more info. The long-term goal is to make it possible to use it in signal handlers to improve the accuracy of some dumps. The function dumps its output into the trash so as it was trivial to add, a new "show threads" command appeared on the CLI.	2019-05-16 18:06:45 +02:00
Willy Tarreau	58d9621fc8	MINOR: cli/activity: show the dumping thread ID starting at 1 Both the config and gdb report thread IDs starting at 1, so better do the same in "show activity" to limit confusion. We also display the full permitted range. This could be backported to 1.9 since it was present there.	2019-05-16 18:02:03 +02:00
Tim Duesterhus	3506dae342	MEDIUM: Make 'resolution_pool_size' directive fatal This directive never appeared in a stable release and instead was introduced and deprecated within 1.8-dev. While it technically could be outright removed we detect it and error out for good measure.	2019-05-16 18:02:03 +02:00
Tim Duesterhus	10c6c16cde	MEDIUM: Make 'option forceclose' actually warn It is deprecated since 315b39c3914f4c2301ce19a93564566caa2ede50 (1.9-dev), but only was deprecated in the docs. Make it warn when being used and remove it from the docs.	2019-05-16 18:02:03 +02:00
Christopher Faulet	c1f40dd492	BUG/MINOR: http_fetch: Rely on the smp direction for "cookie()" and "hdr()" A regression was introduced in the commit 89dc49935 ("BUG/MAJOR: http_fetch: Get the channel depending on the keyword used") on the samples "cookie()" and "hdr()". Unlike other samples manipulating the HTTP headers, these ones depend on the sample direction. To fix the bug, these samples now use their own functions. Depending on the sample direction, they call smp_fetch_cookie() and smp_fetch_hdr() with the appropriate keyword. Thanks to Yves Lafon for reporting this issue. This patch must be backported wherever the commit 89dc49935 was backported. For now, 1.9 and 1.8.	2019-05-16 11:31:28 +02:00
Olivier Houchard	35d116885d	MINOR: connections: Use BUG_ON() to enforce rules in subscribe/unsubscribe. It is not legal to subscribe if we're already subscribed, or to unsubscribe if we did not subscribe, so instead of trying to handle those cases, just assert that it's ok using the new BUG_ON() macro.	2019-05-14 18:18:25 +02:00
Olivier Houchard	00b8f7c60b	MINOR: h1: Use BUG_ON() to enforce rules in subscribe/unsubscribe. It is not legal to subscribe if we're already subscribed, or to unsubscribe if we did not subscribe, so instead of trying to handle those cases, just assert that it's ok using the new BUG_ON() macro.	2019-05-14 18:18:25 +02:00
Olivier Houchard	f8338151a3	MINOR: h2: Use BUG_ON() to enforce rules in subscribe/unsubscribe. It is not legal to subscribe if we're already subscribed, or to unsubscribe if we did not subscribe, so instead of trying to handle those cases, just assert that it's ok using the new BUG_ON() macro.	2019-05-14 18:18:25 +02:00
Christopher Faulet	fa922f03a3	BUG/MEDIUM: mux-h2: Set EOI on the conn_stream during h2_rcv_buf() Just like CS_FL_REOS previously, the CS_FL_EOI flag is abused as a proxy for H2_SF_ES_RCVD. The problem is that this flag is consumed by the application layer and is set immediately when an end of stream was met, which is too early since the application must retrieve the rxbuf's contents first. The effect is that some transfers are truncated (mostly the first one of a connection in most tests). The problem of mixing CS flags and H2S flags in the H2 mux is not new (and is currently being addressed) but this specific one was emphasized in commit 63768a63d ("MEDIUM: mux-h2: Don't mix the end of the message with the end of stream") which was backported to 1.9. Note that other flags, particularly CS_FL_REOS still need to be asynchronously reported, though their impact seems more limited for now. This patch makes sure that all internal uses of CS_FL_EOI are replaced with a test on H2_SF_ES_RCVD (as there is a 1-to-1 equivalence) and that CS_FL_EOI is only reported once the rxbuf is empty. This should ideally be backported to 1.9 unless it causes too much trouble due to the recent changes in this area, as 1.9 seems not to be directly affected by this bug.	2019-05-14 15:47:57 +02:00
Willy Tarreau	99ad1b3e8c	MINOR: mux-h2: stop relying on CS_FL_REOS This flag was introduced early in 1.9 development (a3f7efe00) to report the fact that the rxbuf that was present on the conn_stream was followed by a shutr. Since then the rxbuf moved from the conn_stream to the h2s (638b799b0) but the flag remained on the conn_stream. It is problematic because some state transitions inside the mux depend on it, thus depend on the CS, and as such have to test for its existence before proceeding. This patch replaces the test on CS_FL_REOS with a test on the only states that set this flag (H2_SS_CLOSED, H2_SS_HREM, H2_SS_ERROR). The few places where the flag was set were removed (the flag is not used by the data layer).	2019-05-14 15:47:57 +02:00
Willy Tarreau	4c688eb8d1	MINOR: mux-h2: add macros to check multiple stream states at once At many places we need to test for several stream states at once, let's have macros to make a bit mask from a state to ease this.	2019-05-14 15:47:57 +02:00
Willy Tarreau	f8fe3d63f0	CLEANUP: mux-h2: don't test for impossible CS_FL_REOS conditions This flag is currently set when an incoming close was received, which results in the stream being in either H2_SS_HREM, H2_SS_CLOSED, or H2_SS_ERROR states, so let's remove the test for the OPEN and HLOC cases.	2019-05-14 15:47:57 +02:00
Willy Tarreau	3cf69fe6b2	BUG/MINOR: mux-h2: make sure to honor KILL_CONN in do_shut{r,w} If the stream closes and quits while there's no room in the mux buffer to send an RST frame, next time it is attempted it will not lead to the connection being closed because the conn_stream will have been released and the KILL_CONN flag with it as well. This patch reserves a new H2_SF_KILL_CONN flag that is copied from the CS when calling shut{r,w} so that the stream remains autonomous on this even when the conn_stream leaves. This should ideally be backported to 1.9 though it depends on several previous patches that may or may not be suitable for backporting. The severity is very low so there's no need to insist in case of trouble.	2019-05-14 15:47:57 +02:00
Willy Tarreau	aebbe5ef72	MINOR: mux-h2: make h2s_wake_one_stream() not depend on temporary CS flags In h2s_wake_one_stream() we used to rely on the temporary flags used to adjust the CS to determine the new h2s state. This really is not convenient and creates far too many dependencies. This commit just moves the same condition to the places where the temporary flags were set so that we don't have to rely on these anymore. Whether these are relevant or not was not the subject of the operation, what matters was to make sure the conditions to adjust the stream's state and the CS's flags remain the same. Later it could be studied if these conditions are correct or not.	2019-05-14 15:47:57 +02:00
Willy Tarreau	13b6c2e8b3	MINOR: mux-h2: make h2s_wake_one_stream() the only function to deal with CS h2s_wake_one_stream() has access to all the required elements to update the connstream's flags and figure the necessary state transitions, so let's move the conditions there from h2_wake_some_streams().	2019-05-14 15:47:57 +02:00
Willy Tarreau	234829111f	MINOR: mux-h2: make h2_wake_some_streams() not depend on the CS flags It's problematic to have to pass some CS flags to this function because that forces some h2s state transitions to update them just in time while some of them are supposed to only be updated during I/O operations. As a first step this patch transfers the decision to pass CS_FL_ERR_PENDING from the caller to the leaf function h2s_wake_one_stream(). It is easy since this is the only flag passed there and it depends on the position of the stream relative to the last_sid if it was set.	2019-05-14 15:47:57 +02:00
Willy Tarreau	c3b1183f57	MINOR: mux-h2: remove useless test on stream ID vs last in wake function h2_wake_some_streams() first looks up streams whose IDs are greater than or equal to last+1, then checks if the id is lower than or equal to last, which by definition will never match. Let's remove this confusing leftover from ancient code.	2019-05-14 15:47:57 +02:00
William Lallemand	920fc8bbe4	BUG/MINOR: mworker: use after free when the PID not assigned Commit 4528611 ("MEDIUM: mworker: store the leaving state of a process") introduced a bug in the mworker_env_to_proc_list() function. This is very unlikely to occur since the PID should always be assigned. It can probably happen if the environment variable is corrupted. No backport needed.	2019-05-14 11:28:16 +02:00
Willy Tarreau	f983d00a1c	BUG/MINOR: mux-h2: make the do_shut{r,w} functions more robust against retries These functions may fail to emit an RST or an empty DATA frame because the mux is full or busy. Then they subscribe the h2s and try again. However when doing so, they will already have marked the error state on the stream and will not pass anymore through the sequence resulting in the failed frame to be attempted to be sent again nor to the close to be done, instead they will return a success. It is important to only leave when the stream is already closed, but to go through the whole sequence otherwise. This patch should ideally be backported to 1.9 though it's possible that the lack of the WANT_SHUT* flags makes this difficult or dangerous. The severity is low enough to avoid this in case of trouble.	2019-05-14 11:13:06 +02:00
Frédéric Lécaille	90a10aeb65	BUG/MINOR: log: Wrong log format initialization. This patch fixes an issue introduced by 0bad840b commit "MINOR: log: Extract some code to send syslog messages" which led to wrong log format variable initializations at least for "short" and "raw" format. This commit skipped the cases where even if passed to __do_send_log(), the syslog tag and syslog pid string must not be used to format the log message with "short" and "raw". This is done by initializing "tag_max" and "pid_max" variables (the lengths of the tag and pid strings) to 0, then updating them to the length of the tag and pid strings passed as variables to __do_send_log() depending on the log format and in every case using this length for the iovec variable used to send() the log. This bug is specific to 2.0.	2019-05-14 11:12:00 +02:00
Willy Tarreau	8bdb5c9bb4	CLEANUP: connection: remove the handle field from the wait_event struct It was only set and not consumed after the previous change. The reason is that the task's context always contains the relevant information, so there is no need for a second pointer.	2019-05-13 19:14:52 +02:00
Willy Tarreau	88bdba31fa	CLEANUP: mux-h2: simply use h2s->flags instead of ret in h2_deferred_shut() This one used to rely on the combined return statuses of the shutr/w functions but now that we have the H2_SF_WANT_SHUT{R,W} flags we don't need this anymore if we properly remove these flags after their operations succeed. This is what this patch does.	2019-05-13 19:14:52 +02:00
Willy Tarreau	2c249ebc75	MINOR: mux-h2: add two H2S flags to report the need for shutr/shutw Currently when a shutr/shutw fails due to lack of buffer space, we abuse the wait_event's handle pointer to place up to two bits there in addition to the original pointer. This pointer is not used for anything but this and overall the intent becomes clearer with h2s flags than with these two alien bits in the pointer, so let's use clean flags now.	2019-05-13 19:14:52 +02:00
Willy Tarreau	c234ae38f8	CLEANUP: mux-h2: use LIST_ADDED() instead of LIST_ISEMPTY() where relevant Lots of places were using LIST_ISEMPTY() to detect if a stream belongs to one of the send lists or to detect if a connection was already waiting for a buffer or attached to an idle list. Since these ones are not list heads but list elements, let's use LIST_ADDED() instead.	2019-05-13 19:14:52 +02:00
William Lallemand	7e1770b151	BUG/MAJOR: ssl: segfault upon an heartbeat request 7b5fd1e ("MEDIUM: connections: Move some fields from struct connection to ssl_sock_ctx.") introduced a bug in the heartbleed mitigation code. Indeed the code used conn->ctx instead of conn->xprt_ctx for the ssl context, resulting in a null dereference.	2019-05-13 16:03:44 +02:00
Tim Duesterhus	a6cc7e872a	BUG/MINOR: vars: Fix memory leak in vars_check_arg vars_check_arg previously leaked the string containing the variable name: Consider this config: frontend fe1 mode http bind :8080 http-request set-header X %[var(txn.host)] Starting HAProxy and immediately stopping it by sending a SIGINT makes Valgrind report this leak: ==7795== 9 bytes in 1 blocks are definitely lost in loss record 15 of 71 ==7795== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==7795== by 0x4AA2AD: my_strndup (standard.c:2227) ==7795== by 0x51FCC5: make_arg_list (arg.c:146) ==7795== by 0x4CF095: sample_parse_expr (sample.c:897) ==7795== by 0x4BA7D7: add_sample_to_logformat_list (log.c:495) ==7795== by 0x4BBB62: parse_logformat_string (log.c:688) ==7795== by 0x4E70A9: parse_http_req_cond (http_rules.c:239) ==7795== by 0x41CD7B: cfg_parse_listen (cfgparse-listen.c:1466) ==7795== by 0x480383: readcfgfile (cfgparse.c:2089) ==7795== by 0x47A081: init (haproxy.c:1581) ==7795== by 0x4049F2: main (haproxy.c:2591) This leak can be detected even in HAProxy 1.6, this patch thus should be backported to all supported branches [Cf: This fix was reverted because the chunk's area was unconditionally released, making haproxy crash when spoe was enabled. Now the chunk is released by calling chunk_destroy(). This function takes care of the chunk's size to release it or not. It is the responsibility of callers to set or not the chunk's size.]	2019-05-13 11:09:12 +02:00
Christopher Faulet	bf9bcb0a00	MINOR: spoe: Set the argument chunk size to 0 when SPOE variables are checked When SPOE variables are registered during HAProxy startup, the argument used to call the function vars_check_arg() uses the trash area. To be sure it is never released by the callee function, the size of the internal chunk (arg.data.str) is set to 0. It is important to do so because, to fix a memory leak, this buffer must be released by the function vars_check_arg(). This patch must be backported to 1.9.	2019-05-13 11:07:00 +02:00
Willy Tarreau	ce9bbf523c	BUG/MINOR: htx: make sure to always initialize the HTTP method when parsing a buffer smp_prefetch_htx() is used when trying to access the contents of an HTTP buffer from the TCP rulesets. The method was not properly set in this case, which will cause the sample fetch methods relying on the method to randomly fail in this case. Thanks to Tim Düsterhus for reporting this issue (#97). This fix must be backported to 1.9.	2019-05-13 10:10:44 +02:00
Tim Duesterhus	04bcaa1f9f	BUG/MINOR: peers: Fix memory leak in cfg_parse_peers cfg_parse_peers previously leaked the contents of the `kws` string, as it was unconditionally filled using bind_dump_kws, but only used (and freed) within the error case. Move the dumping into the error case to: 1. Ensure that the registered keywords are actually printed at least once. 2. The contents of kws are not leaked. This move allows to narrow the scope of `kws`, so this is done as well. This bug was found using valgrind: ==28217== 590 bytes in 1 blocks are definitely lost in loss record 51 of 71 ==28217== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==28217== by 0x4AD4C7: indent_msg (standard.c:3676) ==28217== by 0x47E962: cfg_parse_peers (cfgparse.c:700) ==28217== by 0x480273: readcfgfile (cfgparse.c:2147) ==28217== by 0x479D51: init (haproxy.c:1585) ==28217== by 0x404A02: main (haproxy.c:2585) with this super simple configuration: peers peers bind :8081 server A This bug exists since the introduction of cfg_parse_peers in commit 355b2033ec0c89660db179b23d6f77b678d8c26d (which was introduced for HAProxy 2.0, but marked as backportable). It should be backported to all branches containing that commit.	2019-05-13 10:10:01 +02:00
Willy Tarreau	f7b0523425	Revert "BUG/MINOR: vars: Fix memory leak in vars_check_arg" This reverts commit 6ea00195c479d96c5aa651adcca3bc3637e3eceb. As found by Christopher, this fix is not correct due to the way args are built at various places. For example some config or runtime parsers will place a substring pointer there, and calling free() on it will immediately crash the program. A quick audit of the code shows that there are not that many users, but the way it's done requires to properly set the string as a regular chunk (size=0 if free not desired, then call chunk_destroy() at release time), and given that the size is currently set to len+1 in all parsers, a deeper audit needs to be done to figure the impacts of not setting it anymore. Thus for now better leave this harmless leak which impacts only the config parsing time. This fix must be backported to all branches containing the fix above.	2019-05-13 10:10:01 +02:00

9436 Commits