Willy Tarreau 205f1cbf4c BUG/MEDIUM: wdt: improve stuck task detection accuracy
The fact that the watchdog timer measures the execution time from the
last return from the poller tends to amplify the impact of multiple
bad tasks, and may explain some of the panics reported by Felipe and
Ricardo in GH issues #3084, #3092 and #3101. The problem is that we
check the time if we see that the scheduler appears not to be moving
anymore, but one situation may still arise and catch a bad task:
  - one slow task takes so long a time that it triggers the watchdog
    twice, emitting a warning the second time (~200ms). The scheduler
    is rightfully marked as stuck.
  - then it completes and the scheduler is no longer stuck. Many other
    tasks run in turn; they all take quite some time, but not enough to
    trigger a warning. Collectively, however, their cost adds up.
  - then a task takes more than the warning time (100ms) and causes the
    total execution time to cross the one-second mark. The watchdog is
    called, sees that we've spent more than 1 second since we left the
    poller, and marks the thread as stuck.
  - the task is still not finished, so the watchdog is called again, sees
    more than one second with a stuck thread, and panics 100ms later.

The total time away from the poller is indeed more than one second,
which is very bad, but no single task caused this individually, and
while the warnings are OK, the watchdog should not panic in this case.
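
To make the arithmetic concrete, here is a tiny standalone illustration
(not HAProxy code, all durations made up) of how per-task times that
individually stay well below one second still add up past the panic
threshold when measured from the last return from the poller:

    #include <stdio.h>

    int main(void)
    {
        /* one warning-level task (~200ms), several medium ones, then one
         * more crossing the warn delay: none comes close to 1s alone.
         */
        int task_ms[] = { 200, 90, 90, 90, 90, 90, 90, 90, 90, 150 };
        int since_poller = 0, worst = 0;

        for (unsigned i = 0; i < sizeof(task_ms) / sizeof(*task_ms); i++) {
            since_poller += task_ms[i];
            if (task_ms[i] > worst)
                worst = task_ms[i];
        }

        /* worst single task: 200ms (no panic deserved), yet the time
         * since the poller is 1070ms, which the old check punished.
         */
        printf("worst=%dms total=%dms\n", worst, since_poller);
        return 0;
    }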

This patch revisits the approach to store the moment the scheduler was
marked as stuck in the wdt context. The idea is that this date will be
used to detect warnings and panics. By doing so and by exploiting the
new is_sched_alive(thr), we can greatly simplify the mechanism so that
the signal handling thread does the strict minimum (mark the scheduler
as possibly stuck and update the stuck_start date), and only bounces to
the reporting thread if the scheduler made no progress since last call.
This means that without even doing computations in the handling thread,
we can continue to avoid all bounces unless a warning is required. Then
when the reporting thread is signaled, it will check the dates from the
last moment the scheduler was marked, and will decide to warn or panic.
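
The following standalone sketch illustrates that split of work. All
names (wdt_ctx, stuck_start, mark_sched_stuck(), wdt_report()) and the
exact thresholds are placeholders chosen for the example, not the actual
HAProxy symbols, so treat it as a rough model of the behaviour described
above rather than the patched code:

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    #define WDT_WARN_MS   100
    #define WDT_PANIC_MS 1000

    struct wdt_ctx {
        uint32_t stuck_start;   /* date the scheduler was last marked stuck */
    };

    static struct wdt_ctx wdt_ctx[1];
    static uint32_t now_ms;               /* current date in milliseconds */
    static bool sched_progressed;         /* toggled by the simulation below */

    /* placeholder for the real check: did the scheduler make progress
     * since it was last marked as possibly stuck?
     */
    static bool is_sched_alive(int thr) { (void)thr; return sched_progressed; }

    /* placeholder: mark the scheduler on <thr> as possibly stuck; in the
     * real code the scheduler clears this mark when it makes progress.
     */
    static void mark_sched_stuck(int thr) { (void)thr; }

    /* reporting thread: warn or panic based on the time spent stuck */
    static void wdt_report(int thr)
    {
        uint32_t stuck_ms = now_ms - wdt_ctx[thr].stuck_start;

        if (stuck_ms >= WDT_PANIC_MS)
            printf("thread %d: panic, stuck for %ums\n", thr, (unsigned)stuck_ms);
        else if (stuck_ms >= WDT_WARN_MS)
            printf("thread %d: warning, stuck for %ums\n", thr, (unsigned)stuck_ms);
    }

    /* signal handling thread: strict minimum, no computations */
    static void wdt_handler(int thr)
    {
        if (is_sched_alive(thr)) {
            /* progress was made: just re-mark as possibly stuck and
             * record the date, then return without bouncing.
             */
            mark_sched_stuck(thr);
            wdt_ctx[thr].stuck_start = now_ms;
            return;
        }
        /* no progress since the previous call: only now bounce to the
         * reporting thread, which decides from stuck_start.
         */
        wdt_report(thr);
    }

    int main(void)
    {
        now_ms = 0;    sched_progressed = true;  wdt_handler(0); /* re-arm  */
        now_ms = 200;  sched_progressed = false; wdt_handler(0); /* warning */
        now_ms = 1100; sched_progressed = false; wdt_handler(0); /* panic   */
        return 0;
    }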

The panic decision continues to be conveyed via the TH_FL_STUCK flag to
the code being probed, so that exceptionally slow code (e.g. live cert
generation etc.) can still find a way to avoid the panic if it is
absolutely certain that things are still moving.
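
For illustration only, a long but legitimately progressing operation
could periodically clear that mark; the sketch below uses made-up names
(thread_flags, FL_STUCK, note_progress()) rather than HAProxy's real
TH_FL_STUCK handling, and only shows the general idea:

    #include <stdatomic.h>

    #define FL_STUCK 0x1

    static _Atomic unsigned int thread_flags;

    /* to be called only when absolutely certain work is still moving */
    static void note_progress(void)
    {
        atomic_fetch_and(&thread_flags, ~FL_STUCK);
    }

    static void generate_certificate(void)
    {
        for (int step = 0; step < 1000; step++) {
            /* ... one expensive step of the computation ... */
            note_progress();   /* prevents the watchdog from panicking */
        }
    }

    int main(void)
    {
        thread_flags = FL_STUCK;    /* pretend the watchdog marked us */
        generate_certificate();
        return (thread_flags & FL_STUCK) ? 1 : 0;
    }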

This means that now we have the guarantee that panics will only happen
if a given task spends more than one full second not moving, and that
warnings will be issued for other calls crossing the warn delay boundary.

This was tested using artificially slow operations, and all combinations
which individually took less than a second only resulted in floods of
warnings even if the total reported time in the warning was much higher,
while those above one second provoked the panic.

One improvement could consist in reporting the time since last stuck
in the thread dumps to differentiate the individual task from the whole
set.

This needs to be backported to 3.2 along with the two previous patches:

    MINOR: sched: let's permit to share the local ctx between threads
    MINOR: sched: pass the thread number to is_sched_alive()

HAProxy


HAProxy is a free, very fast and reliable reverse-proxy offering high availability, load balancing, and proxying for TCP and HTTP-based applications.

Installation

The INSTALL file describes how to build HAProxy. A list of packages is also available on the wiki.

Getting help

The discourse and the mailing-list are available for questions or configuration assistance. You can also use the slack or IRC channel. Please don't use the issue tracker for these.

The issue tracker is only for bug reports or feature requests.

Documentation

The HAProxy documentation has been split into a number of different files for ease of use. It is available in text format as well as HTML. The wiki is also meant to replace the old architecture guide.

Please refer to the following files depending on what you're looking for:

  • INSTALL for instructions on how to build and install HAProxy
  • BRANCHES to understand the project's life cycle and what version to use
  • LICENSE for the project's license
  • CONTRIBUTING for the process to follow to submit contributions

The more detailed documentation is located in the doc/ directory:

License

HAProxy is licensed under GPL 2 or any later version, the headers under LGPL 2.1. See the LICENSE file for a more detailed explanation.
