haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-18 01:11:01 +01:00

Go to file

Willy Tarreau ba4c7a1597 BUG/MEDIUM: sched: allow a bit more TASK_HEAVY to be processed when needed

As reported in github issue #1881, there are situations where an excess
of TLS handshakes can cause a livelock. What's happening is that normally
we process at most one TLS handshake per loop iteration to maintain the
latency low. This is done by tagging them with TASK_HEAVY, queuing these
tasklets in the TL_HEAVY queue. But if something slows down the loop, such
as a connect() call when no more ports are available, we could end up
processing no more than a few hundred or thousands handshakes per second.

If the llmit becomes lower than the rate of incoming handshakes, we will
accumulate them and at some point users will get impatient and give up or
retry. Then a new problem happens: the queue fills up with even more
handshake attempts, only one of which will be handled per iteration, so
we can end up processing only outdated handshakes at a low rate, with
basically nothing else in the queue. This can for example happen in
parallel with health checks that don't require incoming handshakes to
succeed to continue to cause some activity that could maintain the high
latency stuff active.

Here we're taking a slightly different approach. First, instead of always
allowing only one handshake per loop (and usually it's critical for
latency), we take the current situation into account:
  - if configured with tune.sched.low-latency, the limit remains 1
  - if there are other non-heavy tasks, we set the limit to 1 + one
    per 1024 tasks, so that a heavily loaded queue of 4k handshakes
    per thread will be able to drain them at ~4 per loops with a
    limited impact on latency
  - if there are no other tasks, the limit grows to 1 + one per 128
    tasks, so that a heavily loaded queue of 4k handshakes per thread
    will be able to drain them at ~32 per loop with still a very
    limited impact on latency since only I/O will get delayed.

It was verified on a 56-core Xeon-8480 that this did not degrade the
latency; all requests remained below 1ms end-to-end in full close+
handshake, and even 500us under low-lat + busy-polling.

This must be backported to 2.4.

2023-02-17 16:01:34 +01:00

.github

CI: Reformat matrix.py using black

2023-01-03 16:28:34 +01:00

addons

BUG/MINOR: promex: Don't forget to consume the request on error

2023-01-13 09:45:23 +01:00

admin

BUILD: halog: fix missing double-quote at end of help line

2022-11-25 11:11:41 +01:00

dev

DEV: hpack: fix trash build regression

2023-01-27 10:22:20 +01:00

doc

MINOR: haproxy: Add an command option to disable data fast-forward

2023-02-17 10:17:02 +01:00

examples

EXAMPLES: remove completely outdated acl-content-sw.cfg

2022-05-30 18:14:24 +02:00

include

MINOR: global: Add an option to disable the data fast-forward

2023-02-17 10:17:02 +01:00

reg-tests

REGTESTS: Remove unsupported feature command in http_splicing.vtc

2023-02-17 15:27:11 +01:00

scripts

SCRIPTS: run-regtests: add a version check

2022-11-30 18:44:33 +01:00

src

BUG/MEDIUM: sched: allow a bit more TASK_HEAVY to be processed when needed

2023-02-17 16:01:34 +01:00

tests

TESTS: add a unit test for one_among_mask()

2022-06-21 20:29:57 +02:00

.cirrus.yml

CI: cirrus-ci: bump FreeBSD image to 13-1

2022-09-09 13:30:17 +02:00

.gitattributes

MINOR: Configure the cpp userdiff driver for *.[ch] in .gitattributes

2021-02-22 18:17:57 +01:00

.gitignore

CLEANUP: exclude udp-perturb with .gitignore

2022-09-16 15:47:04 +02:00

.mailmap

DOC: update Tim's address in .mailmap

2021-09-16 09:14:14 +02:00

.travis.yml

CI: travis-ci: temporarily disable arm64 builds

2021-08-07 07:28:15 +02:00

BRANCHES

DOC: fix some spelling issues over multiple files

2021-01-08 14:53:47 +01:00

CHANGELOG

[RELEASE] Released version 2.8-dev4

2023-02-14 16:55:17 +01:00

CONTRIBUTING

CLEANUP: assorted typo fixes in the code and comments

2021-08-16 12:37:59 +02:00

INSTALL

MINOR: version: mention that it's development again

2022-12-01 15:24:10 +01:00

LICENSE

LICENSE: add licence exception for OpenSSL

2012-09-07 13:52:26 +02:00

MAINTAINERS

CLEANUP: assorted typo fixes in the code and comments

2022-11-30 14:02:36 +01:00

Makefile

BUILD: makefile: fix PCRE overriding specific lib path

2023-02-03 09:42:49 +01:00

README

DOC: create a BRANCHES file to explain the life cycle

2019-06-15 22:00:14 +02:00

SUBVERS

BUILD: use format tags in VERDATE and SUBVERS files

2013-12-10 11:22:49 +01:00

VERDATE

[RELEASE] Released version 2.8-dev4

2023-02-14 16:55:17 +01:00

VERSION

[RELEASE] Released version 2.8-dev4

2023-02-14 16:55:17 +01:00

README

The HAProxy documentation has been split into a number of different files for
ease of use.

Please refer to the following files depending on what you're looking for :

  - INSTALL for instructions on how to build and install HAProxy
  - BRANCHES to understand the project's life cycle and what version to use
  - LICENSE for the project's license
  - CONTRIBUTING for the process to follow to submit contributions

The more detailed documentation is located into the doc/ directory :

  - doc/intro.txt for a quick introduction on HAProxy
  - doc/configuration.txt for the configuration's reference manual
  - doc/lua.txt for the Lua's reference manual
  - doc/SPOE.txt for how to use the SPOE engine
  - doc/network-namespaces.txt for how to use network namespaces under Linux
  - doc/management.txt for the management guide
  - doc/regression-testing.txt for how to use the regression testing suite
  - doc/peers.txt for the peers protocol reference
  - doc/coding-style.txt for how to adopt HAProxy's coding style
  - doc/internals for developer-specific documentation (not all up to date)

Languages

C 98%

Shell 0.9%

Makefile 0.5%

Lua 0.2%

Python 0.2%