haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-11-21 02:41:17 +01:00

Author	SHA1	Message	Date
Amaury Denoyelle	9534e59bb9	MINOR: mux-quic: refactor snd_buf Factorize common code between h3 and hq-interop snd_buf operation. This is inserted in MUX QUIC snd_buf own callback. The h3/hq-interop API has been adjusted to directly receive a HTX message instead of a plain buf. This led to extracting part of MUX QUIC snd_buf in qmux_http module. This should be backported up to 2.6.	2022-09-20 15:35:29 +02:00
Amaury Denoyelle	d80fbcaca2	REORG: mux-quic: export HTTP related function in a dedicated file Extract function dealing with HTX outside of MUX QUIC. For the moment, only rcv_buf stream operation is concerned. The main objective is to be able to support both TCP and HTTP proxy mode with a common base and add specialized modules on top of it. This should be backported up to 2.6.	2022-09-20 15:35:23 +02:00
Amaury Denoyelle	36d50bff22	REORG: mux-quic: extract traces in a dedicated source file QUIC MUX implements several APIs to interface with stream, quic-conn and app-ops layers. It is planified to better separate this roles, possibly by using several files. The first step is to extract QUIC MUX traces in a dedicated source files. This will allow to reuse traces in multiple files. The main objective is to be able to support both TCP and HTTP proxy mode with a common base and add specialized modules on top of it. This should be backported up to 2.6.	2022-09-20 15:35:09 +02:00
Amaury Denoyelle	3dc4e5a5b9	BUG/MINOR: mux-quic: do not keep detached qcs with empty Tx buffers A qcs instance free may be postponed in stream detach operation if the stream is not locally closed. This condition is there to achieve transfering data still present in Tx buffer. Once all data have been emitted to quic-conn layer, qcs instance can be released. However, the stream is only closed locally if HTX EOM has been seen or it has been resetted. In case the transfer finished without EOM, a detached qcs won't be freed even if there is no more activity on it. This bug was not reproduced but was found on code analysis. Its precise impact is unknown but it should not cause any leak as all qcs instances are freed with its parent qcc connection : this should eventually happen on MUX timeout or QUIC idle timeout. To adjust this, condition to mark a stream as locally closed has been extended. On qcc_streams_sent_done() notification, if its Tx buffer has been fully transmitted, it will be closed if either FIN STREAM was set or the stream is detached. This must be backported up to 2.6.	2022-09-20 10:46:59 +02:00
Willy Tarreau	9f4f6b038c	OPTIM: hpack-huff: reduce the cache footprint of the huffman decoder Some tables are currently used to decode bit blocks and lengths. We do see such lookups in perf top. We have 4 512-byte tables and one 64-byte one. Looking closer, the second half of the table (length) has so few variations that most of the time it will be computed in a single "if", and never more than 3. This alone allows to cut the tables in half. In addition, one table (bits 15-11) is only 32-element long, while another one (bits 11-4) starts at 0x60, so we can merge the two as they do not overlap, and further save size. We're now down to 4 256-entries tables. This is visible in h3 and h2 where the max request rate is slightly higher (e.g. +1.6% for h2). The huff_dec() function got slightly larger but the overall code size shrunk: $ nm --size haproxy-before \| grep huff_dec 000000000000029e T huff_dec $ nm --size haproxy-after \| grep huff_dec 0000000000000345 T huff_dec $ size haproxy-before haproxy-after text data bss dec hex filename 7591126 569268 2761348 10921742 a6a70e haproxy-before 7591082 568180 2761348 10920610 a6a2a2 haproxy-after	2022-09-20 07:41:58 +02:00
Miroslav Zagorac	cbfee3a9f6	MINOR: httpclient: enabled the use of SNI presets This commit allows setting SNI outside http_client.c code.	2022-09-19 14:39:28 +02:00
Miroslav Zagorac	133e2a23d0	CLEANUP: httpclient: deleted unused variables The locally defined static variables 'httpclient_srv_raw' and 'httpclient_srv_ssl' are not used anywhere in the source code, except that they are set in the httpclient_precheck() function.	2022-09-19 14:39:28 +02:00
Amaury Denoyelle	afb7b9d8e5	BUG/MEDIUM: mux-quic: fix nb_hreq decrement nb_hreq is a counter on qcc for active HTTP requests. It is incremented for each qcs where a full HTTP request was received. It is decremented when the stream is closed locally : - on HTTP response fully transmitted - on stream reset A bug will occur if a stream is resetted without having processed a full HTTP request. nb_hreq will be decremented whereas it was not incremented. This will lead to a crash when building with DEBUG_STRICT=2. If BUG_ON_HOT are not active, nb_hreq counter will wrap which may break the timeout logic for the connection. This bug was triggered on haproxy.org. It can be reproduced by simulating the reception of a STOP_SENDING frame instead of a STREAM one by patching qc_handle_strm_frm() : + if (quic_stream_is_bidi(strm_frm->id)) + qcc_recv_stop_sending(qc->qcc, strm_frm->id, 0); + //ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len, + // strm_frm->offset.key, strm_frm->fin, + // (char *)strm_frm->data); To fix this bug, a qcs is now flagged with a new QC_SF_HREQ_RECV. This is set when the full HTTP request is received. When the stream is closed locally, nb_hreq will be decremented only if this flag was set. This must be backported up to 2.6.	2022-09-19 12:12:21 +02:00
Erwan Le Goas	b0c0501516	MINOR: config: add command-line -dC to dump the configuration file This commit adds a new command line option -dC to dump the configuration file. An optional key may be appended to -dC in order to produce an anonymized dump using this key. The anonymizing process uses the same algorithm as the CLI so that the same key will produce the same hashes for the same identifiers. This way an admin may share an anonymized extract of a configuration to match against live dumps. Note that key 0 will not anonymize the output. However, in any case, the configuration is dumped after tokenizing, thus comments are lost.	2022-09-17 11:27:09 +02:00
Erwan Le Goas	acfdf7600b	MINOR: cli: anonymize 'show servers state' and 'show servers conn' Modify proxy.c in order to anonymize the following confidential data on commands 'show servers state' and 'show servers conn': - proxy name - server name - server address	2022-09-17 11:27:09 +02:00
Erwan Le Goas	57e35f4b87	MINOR: cli: anonymize commands 'show sess' and 'show sess all' Modify stream.c in order to hash the following confidential data if the anonymized mode is enabled: - configuration elements such as frontend/backend/server names - IP addresses	2022-09-17 11:27:09 +02:00
Erwan Le Goas	54966dffda	MINOR: anon: store the anonymizing key in the CLI's appctx In order to allow users to dump internal states using a specific key without changing the global one, we're introducing a key in the CLI's appctx. This key is preloaded from the global one when "set anon on" is used (and if none exists, a random one is assigned). And the key can optionally be assigned manually for the whole CLI session. A "show anon" command was also added to show the anon state, and the current key if the users has sufficient permissions. In addition, a "debug dev hash" command was added to test the feature.	2022-09-17 11:27:09 +02:00
Erwan Le Goas	fad9da83da	MINOR: anon: store the anonymizing key in the global structure Add a uint32_t key in global to hash words with it. A new CLI command 'set global-key <key>' was added to change the global anonymizing key. The global may also be set in the configuration using the global "anonkey" directive. For now this key is not used.	2022-09-17 11:24:53 +02:00
Erwan Le Goas	9c76637fff	MINOR: anon: add new macros and functions to anonymize contents These macros and functions will be used to anonymize strings by producing a short hash. This will allow to match config elements against dump elements without revealing the original data. This will later be used to anonymize configuration parts and CLI commands output. For now only string, identifiers and addresses are supported, but the model is easily extensible.	2022-09-17 11:24:53 +02:00
Willy Tarreau	85af760704	BUILD: fd: fix a build warning on the DWCAS Ilya reported in issue #1816 a build warning on armhf (promoted to error here since -Werror): src/fd.c: In function fd_rm_from_fd_list: src/fd.c:209:87: error: passing argument 3 of __ha_cas_dw discards volatile qualifier from pointer target type [-Werror=discarded-array-qualifiers] 209 \| unlikely(!_HA_ATOMIC_DWCAS(((long )&fdtab[fd].update), (uint32_t )&cur_list.u32, &next_list.u32)) \| ^~~~~~~~~~~~~~ This happens only on such an architecture because the DWCAS requires the pointer not the value, and gcc seems to be needlessly picky about reading a const from a volatile! This may safely be backported to older versions.	2022-09-17 11:20:44 +02:00
Willy Tarreau	da9f258759	BUG/MEDIUM: captures: free() an error capture out of the proxy lock Ed Hein reported in github issue #1856 some occasional watchdog panics in 2.4.18 showing extreme contention on the proxy's lock while the libc was in malloc()/free(). One cause of this problem is that we call free() under the proxy's lock in proxy_capture_error(), which makes no sense since if we can free the object under the lock after it's been detached, we can also free it after releasing the lock (since it's not referenced anymore). This should be backported to all relevant versions, likely all supported ones.	2022-09-17 11:07:19 +02:00
cui fliter	a94bedc0de	CLEANUP: quic,ssl: fix tiny typos in C comments This fixes 4 tiny and harmless typos in mux_quic.c, quic_tls.c and ssl_sock.c. Originally sent via GitHub PR #1843. Signed-off-by: cui fliter <imcusg@gmail.com> [Tim: Rephrased the commit message] [wt: further complete the commit message]	2022-09-17 10:59:59 +02:00
Aurelien DARRAGON	8d0ff28406	BUG/MEDIUM: server: segv when adding server with hostname from CLI When calling 'add server' with a hostname from the cli (runtime), str2sa_range() does not resolve hostname because it is purposely called without PA_O_RESOLVE flag. This leads to 'srv->addr_node.key' being NULL. According to Willy it is fine behavior, as long as we handle it properly, and is already handled like this in srv_set_addr_desc(). This patch fixes GH #1865 by adding an extra check before inserting 'srv->addr_node' into 'be->used_server_addr'. Insertion and removal will be skipped if 'addr_node.key' is NULL. It must be backported to 2.6 and 2.5 only.	2022-09-17 06:30:59 +02:00
Amaury Denoyelle	d1310f8d32	BUG/MINOR: mux-quic: do not remotely close stream too early A stream is considered as remotely closed once we have received all the data with the FIN bit set. The condition to close the stream was wrong. In particular, if we receive an empty STREAM frame with FIN bit set, this would have close the stream even if we do not have yet received all the data. The condition is now adjusted to ensure that Rx buffer contains all the data up to the stream final size. In most cases, this bug is harmless. However, if compiled with DEBUG_STRICT=2, a BUG_ON_HOT crash would have been triggered if close is done too early. This was most notably the case sometimes on interop test suite with quinn or kwik clients. This can also be artificially reproduced by simulating reception of an empty STREAM frame with FIN bit set in qc_handle_strm_frm() : + if (strm_frm->fin) { + qcc_recv(qc->qcc, strm_frm->id, 0, + strm_frm->len, strm_frm->fin, + (char )strm_frm->data); + } ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len, strm_frm->offset.key, strm_frm->fin, (char )strm_frm->data); This must be backported up to 2.6.	2022-09-16 14:17:27 +02:00
Amaury Denoyelle	8d4ac48d3d	CLEANUP: mux-quic: remove stconn usage in h3/hq Small cleanup on snd_buf for application protocol layer. * do not export h3_snd_buf * replace stconn by a qcs argument. This is better as h3/hq-interop only uses the qcs instance. This should be backported up to 2.6.	2022-09-16 13:53:30 +02:00
Christopher Faulet	18ad15f5c4	REORG: mux-h1: extract flags and enums into mux_h1-t.h The same was performed for the H2 multiplexer. H1C and H1S flags are moved in a dedicated header file. It will be mainly used to be able to decode mux-h1 flags from the flags utility. In this patch, we only move the flags to mux_h1-t.h.	2022-09-15 11:01:59 +02:00
Amaury Denoyelle	f8aaf8bdfa	BUG/MEDIUM: mux-quic: fix crash on early app-ops release H3 SETTINGS emission has recently been delayed. The idea is to send it with the first STREAM to reduce sendto syscall invocation. This was implemented in the following patch : 3dd79d378c86b3ebf60e029f518add5f1ed54815 MINOR: h3: Send the h3 settings with others streams (requests) This patch works fine under nominal conditions. However, it will cause a crash if a HTTP/3 connection is released before having sent any data, for example when receiving an invalid first request. In this case, qc_release will first free qcc.app_ops HTTP/3 application protocol layer via release callback. Then qc_send is called to emit any closing frames built by app_ops release invocation. However, in qc_send, as no data has been sent, it will try to complete application layer protocol intialization, with a SETTINGS emission for HTTP/3. Thus, qcc.app_ops is reused, which is invalid as it has been just freed. This will cause a crash with h3_finalize in the call stack. This bug can be reproduced artificially by generating incomplete HTTP/3 requests. This will in time trigger http-request timeout without any data send. This is done by editing qc_handle_strm_frm function. - ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len, + ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len - 1, strm_frm->offset.key, strm_frm->fin, (char *)strm_frm->data); To fix this, application layer closing API has been adjusted to be done in two-steps. A new shutdown callback is implemented : it is used by the HTTP/3 layer to generate GOAWAY frame in qc_release prologue. Application layer context qcc.app_ops is then freed later in qc_release via the release operation which is now only used to liberate app layer ressources. This fixes the problem as the intermediary qc_send invocation will be able to reuse app_ops before it is freed. This patch fixes the crash, but it would be better to adjust H3 SETTINGS emission in case of early connection closing : in this case, there is no need to send it. This should be implemented in a future patch. This should fix the crash recently experienced by Tristan in github issue #1801. This must be backported up to 2.6.	2022-09-15 10:41:44 +02:00
William Lallemand	95fc737fc6	MEDIUM: quic: separate path for rx and tx with set_encryption_secrets With quicTLS the set_encruption_secrets callback is always called with the read_secret and the write_secret. However this is not the case with libreSSL, which uses the set_read_secret()/set_write_secret() mecanism. It still provides the set_encryption_secrets() callback, which is called with a NULL parameter for the write_secret during the read, and for the read_secret during the write. The exchange key was not designed in haproxy to be called separately for read and write, so this patch allow calls with read or write key to NULL.	2022-09-14 18:16:37 +02:00
William Lallemand	992ad62e3c	MEDIUM: httpclient: allow to use another proxy httpclient_new_from_proxy() is a variant of httpclient_new() which allows to create the requests from a different proxy. The proxy and its 2 servers are now stored in the httpclient structure. The proxy must have been created with httpclient_create_proxy() to be used. The httpclient_postcheck() callback will finish the initialization of all proxies created with PR_CAP_HTTPCLIENT.	2022-09-13 17:12:38 +02:00
William Lallemand	54aec5f678	MEDIUM: httpclient: httpclient_create_proxy() creates a proxy for httpclient httpclient_create_proxy() is a function which creates a proxy that could be used for the httpclient. It will allocate a proxy, a raw server and an ssl server. This patch moves most of the code from httpclient_precheck() into a generic function httpclient_create_proxy(). The proxy will have the PR_CAP_HTTPCLIENT capability. This could be used for specifics httpclient instances that needs different proxy settings.	2022-09-13 17:12:38 +02:00
Emeric Brun	d6e581de4b	BUG/MEDIUM: sink: bad init sequence on tcp sink from a ring. The init of tcp sink, particularly for SSL, was done too early in the code, during parsing, and this can cause a crash specially if nbthread was not configured. This was detected by William using ASAN on a new regtest on log forward. This patch adds the 'struct proxy' created for a sink to a list and this list is now submitted to the same init code than the main proxies list or the log_forward's proxies list. Doing this, we are assured to use the right init sequence. It also removes the ini code for ssl from post section parsing. This patch should be backported as far as v2.2 Note: this fix uses 'goto' labels created by commit 'BUG/MAJOR: log-forward: Fix log-forward proxies not fully initialized' but this code didn't exist before v2.3 so this patch needs to be adapted for v2.2.	2022-09-13 17:03:30 +02:00
Willy Tarreau	6c0fadfb7d	REORG: mux-h2: extract flags and enums into mux_h2-t.h Originally in 1.8 we wanted to have an independent mux that could possibly be disabled and would not impose dependencies on the outside. Everything would fit into a single C file and that was fine. Nowadays muxes are unavoidable, and not being able to easily inspect them from outside is sometimes a bit of a pain. In particular, the flags utility still cannot be used to decode their flags. As a first step towards this, this patch moves the flags and enums to mux_h2-t.h, as well as the two state decoding inline functions. It also dropped the H2_SS_*_BIT defines that nobody uses. The mux_h2.c file remains the only one to include that for now.	2022-09-12 19:33:07 +02:00
Aurelien DARRAGON	a57786e87d	BUG/MINOR: listener: null pointer dereference suspected by coverity Please refer to GH #1859 for more info. Coverity suspected improper proxy pointer handling. Without the fix it is considered safe for the moment, but it might not be the case in the future as we want to keep the ability to have isolated listeners. Making sure stop_listener(), pause_listener(), resume_listener() and listener_release() functions make proper use of px pointer in that context. No need for backport except if multi-connection protocols (ie:FTP) were to be backported as well.	2022-09-12 10:12:18 +02:00
Aurelien DARRAGON	187396e34e	CLEANUP: listener: function comment typo in stop_listener() A minor typo related to stop_listener() function comment was introduced in 0013288. This makes stop_listener() function comment easier to read.	2022-09-12 10:12:13 +02:00
Christopher Faulet	af5336fd23	BUG/MINOR: mux-h1: Increment open_streams counter when H1 stream is created Since this counter was added, it was incremented at the wrong place for client streams. It was incremented when the stream-connector (formely the conn-stream) was created while it should be done when the H1 stream is created. Thus, on parsing error, on H1>H2 upgrades or TCP>H1 upgrades, the counter is not incremented. However, it is always decremented when the H1 stream is destroyed. On bakcned side, there is no issue. This patch must be backported to 2.6.	2022-09-12 09:54:11 +02:00
Willy Tarreau	af985e0151	CLEANUP: pollers: remove dead code in the polling loop As reported by Ilya and Coverity in issue #1858, since recent commit eea152ee6 ("BUG/MINOR: signals/poller: ensure wakeup from signals") which removed the test for the global signal flag from the pollers' loop, the remaining "wake" flag doesn't need to be tested since it already participates to zeroing the wait_time and will be caught on the previous line. Let's just remove that test now.	2022-09-12 09:35:44 +02:00
Aurelien DARRAGON	cddec0aef5	BUG/MINOR: stats: fixing stat shows disabled frontend status as 'OPEN' This patch adresses the issue #1626. Adding support for PR_FL_PAUSED flag in the function stats_fill_fe_stats(). The command 'show stat' now properly reports a disabled frontend using "PAUSED" state label. This patch depends on the following commits: - 7d00077fd5 "BUG/MEDIUM: proxy: ensure pause_proxy() and resume_proxy() own PROXY_LOCK". - 001328873c "MINOR: listener: small API change" - d46f437de6 "MINOR: proxy/listener: support for additional PAUSED state" It should be backported to 2.6, 2.5 and 2.4	2022-09-09 17:24:22 +02:00
Aurelien DARRAGON	d46f437de6	MINOR: proxy/listener: support for additional PAUSED state This patch is a prerequisite for #1626. Adding PAUSED state to the list of available proxy states. The flag is set when the proxy is paused at runtime (pause_listener()). It is cleared when the proxy is resumed (resume_listener()). It should be backported to 2.6, 2.5 and 2.4	2022-09-09 17:23:01 +02:00
Aurelien DARRAGON	001328873c	MINOR: listener: small API change A minor API change was performed in listener(.c/.h) to restore consistency between stop_listener() and (resume/pause)_listener() functions. LISTENER_LOCK was never locked prior to calling stop_listener(): lli variable hint is thus not useful anymore. Added PROXY_LOCK locking in (resume/pause)_listener() functions with related lpx variable hint (prerequisite for #1626). It should be backported to 2.6, 2.5 and 2.4	2022-09-09 17:23:01 +02:00
Aurelien DARRAGON	7d00077fd5	BUG/MEDIUM: proxy: ensure pause_proxy() and resume_proxy() own PROXY_LOCK There was a race involving hlua_proxy_* functions and some proxy management functions. pause_proxy() and resume_proxy() can be used directly from lua code, but that could lead to some race as lua code didn't make sure PROXY_LOCK was owned before calling the proxy functions. This patch makes sure it won't happen again elsewhere in the code by locking PROXY_LOCK directly in resume and pause proxy functions so that it's not the caller's responsibility anymore. (based on stop_proxy() behavior that was already safe prior to the patch) This should be backported to stable series. Note that the API will likely differ < 2.4	2022-09-09 17:23:01 +02:00
Matthias Wirth	eea152ee68	BUG/MINOR: signals/poller: ensure wakeup from signals Add self-wake in signal_handler() to fix a race condition with a signal coming in between checking signal_queue_len and entering polling sleep. The changes in commit 43c891dda ("BUG/MINOR: signals/poller: set the poller timeout to 0 when there are signals") were insufficient. Move the signal_queue_len check from the poll implementations to run_poll_loop() to keep that logic in one place. The poll loops are terminated either by the parameter wake being set or wake up due to a write to their poller_wr_pipe by wake_thread() in signal_handler(). This fixes issue #1841. Must be backported in every stable version.	2022-09-09 11:15:22 +02:00
Frédéric Lécaille	3dd79d378c	MINOR: h3: Send the h3 settings with others streams (requests) This is the ->finalize application callback which prepares the unidirectional STREAM frames for h3 settings and wakeup the mux I/O handler to send them. As haproxy is at the same time always waiting for the client request, this makes haproxy call sendto() to send only about 20 bytes of stream data. Furthermore in case of heavy loss, this give less chances to short h3 requests to succeed. Drawback: as at this time the mux sends its streams by their IDs ascending order the stream 0 is always embedded before the unidirectional stream 3 for h3 settings. Nevertheless, as these settings may be lost and received after other h3 request streams, this is permitted by the RFC. Perhaps there is a better way to do. This will have to be checked with Amaury. Must be backported to 2.6.	2022-09-08 18:04:58 +02:00
Frédéric Lécaille	befcf7031d	MINOR: h3: Missing connection argument for a TRACE_LEAVE() argument This should help in debbuging issues to be able to associate this trace to a QUIC connection. Must be backported to 2.6.	2022-09-08 18:04:58 +02:00
Frédéric Lécaille	2eb5faa2ad	MINOR: h3: Add the quic_conn object to h3 traces This is very useful to associate h3 traces to a QUIC connection when debugging. Must be backported to 2.6.	2022-09-08 18:04:58 +02:00
Frédéric Lécaille	1c725aa9cd	BUG/MINOR: h3: Crash when h3 trace verbosity is "minimal" This was due to a missing check in h3_trace() about the first argument presence (connection) and h3_parse_settings_frm() which calls TRACE_LEAVE() without any argument. Then this argument was dereferenced. Must be backported to 2.6	2022-09-08 18:04:58 +02:00
Frédéric Lécaille	3c1b81fdd7	BUG/MINOR: quic: Trace fix about packet number space information. <qc> variable was confused with <qel>. The consequence was that it was always the same packet number space which was displayed: the first one (or the Initial packet number space). Must be backported to 2.6.	2022-09-08 18:04:58 +02:00
Frédéric Lécaille	bb995eafc7	BUG/MINOR: quic: Speed up the handshake completion only one time It is possible to speed up the handshake completion but only one time by connection as mentionned in RFC 9002 "6.2.3. Speeding up Handshake Completion". Add a flag to prevent this process to be run several times (see https://www.rfc-editor.org/rfc/rfc9002#name-speeding-up-handshake-compl). Must be backported to 2.6.	2022-09-08 18:04:58 +02:00
William Lallemand	43c891dda0	BUG/MINOR: signals/poller: set the poller timeout to 0 when there are signals When receiving a signal before entering the poller, and without any activity in the process, the poller will be entered with a timeout calculated without checking the signals. Since commit 4f59d3 ("MINOR: time: increase the minimum wakeup interval to 60s") the issue is much more visible because it could be stuck for 60s. When in mworker mode, if a worker quits and the SIGCHLD signal deliver at the right time to the master, this one could be stuck for the time of the timeout. This should fix issue #1841 Must be backported in every stable version.	2022-09-08 17:46:31 +02:00
Willy Tarreau	e86bc35672	MINOR: activity/cli: support sorting task profiling by total CPU time The new "bytime" sorting criterion uses the reported CPU time instead of the usage. This is convenient to spot tasks that are mostly reponsible for the CPU usage in a running process. It supports both the detailed and the aggregated format. The output looks like this: > show profiling tasks bytime Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg qc_io_cb 117739 1.961m 999.1us 37.45s 318.1us <- h3_snd_buf@src/h3.c:1084 tasklet_wakeup process_stream 7376273 1.384m 11.26us 1.013h 494.2us <- stream_new@src/stream.c:563 task_wakeup process_stream 8104400 1.133m 8.389us 1.130h 502.0us <- sc_notify@src/stconn.c:1209 task_wakeup qc_io_cb 43280 45.76s 1.057ms 13.95s 322.3us <- qc_stream_desc_ack@src/quic_stream.c:128 tasklet_wakeup h1_io_cb 11025715 24.82s 2.251us 5.406m 29.42us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup quic_conn_app_io_cb 312861 23.86s 76.27us 2.373s 7.584us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after qc_io_cb 37063 12.65s 341.4us 6.409s 172.9us <- qc_treat_acked_tx_frm@src/xprt_quic.c:1695 tasklet_wakeup h1_io_cb 4783520 11.79s 2.463us 1.419h 1.068ms <- conn_subscribe@src/connection.c:732 tasklet_wakeup sc_conn_io_cb 12269693 11.51s 938.0ns 2.117h 621.2us <- sc_app_chk_rcv_conn@src/stconn.c:762 tasklet_wakeup sc_conn_io_cb 6479006 10.94s 1.689us 7.984m 73.93us <- h1_wake_stream_for_recv@src/mux_h1.c:2600 tasklet_wakeup qc_io_cb 12011 10.72s 892.5us 2.120s 176.5us <- qcc_release_remote_stream@src/mux_quic.c:1200 tasklet_wakeup h2_io_cb 246423 6.225s 25.26us 56.52s 229.4us <- h2_snd_buf@src/mux_h2.c:6712 tasklet_wakeup h2_io_cb 137744 6.076s 44.11us 16.59s 120.4us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup quic_lstnr_dghdlr 323575 3.062s 9.462us 3.424m 634.9us <- quic_lstnr_dgram_dispatch@src/quic_sock.c:255 tasklet_wakeup sc_conn_io_cb 1206939 1.616s 1.338us 27.62m 1.373ms <- qcs_notify_send@src/mux_quic.c:529 tasklet_wakeup h2_io_cb 212370 251.2ms 1.182us 6.476s 30.49us <- h2c_restart_reading@src/mux_h2.c:856 tasklet_wakeup h1_io_cb 44109 197.0ms 4.466us 31.89s 723.0us <- h1_takeover@src/mux_h1.c:4085 tasklet_wakeup quic_conn_app_io_cb 3029 87.59ms 28.92us 999.0ms 329.8us <- qc_process_timer@src/xprt_quic.c:4635 tasklet_wakeup task_run_applet 40 35.77ms 894.3us 4.407ms 110.2us <- sc_applet_create@src/stconn.c:489 appctx_wakeup task_run_applet 18 27.36ms 1.520ms 19.56us 1.086us <- sc_app_chk_snd_applet@src/stconn.c:996 appctx_wakeup sc_conn_io_cb 2186 11.76ms 5.377us 963.0ms 440.5us <- h1_wake_stream_for_send@src/mux_h1.c:2610 tasklet_wakeup qc_io_cb 8 9.880ms 1.235ms 5.871ms 733.9us <- qcs_consume@src/mux_quic.c:800 tasklet_wakeup quic_conn_io_cb 4 5.951ms 1.488ms 38.85us 9.713us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after qc_io_cb 101 4.975ms 49.26us 13.91ms 137.8us <- qc_process_timer@src/xprt_quic.c:4602 tasklet_wakeup h1_io_cb 2186 1.809ms 827.0ns 720.2ms 329.5us <- sock_conn_iocb@src/sock.c:849 tasklet_wakeup qc_process_timer 3031 1.735ms 572.0ns 1.153s 380.3us <- wake_expired_tasks@src/task.c:344 task_wakeup accept_queue_process 359 1.362ms 3.793us 80.32ms 223.7us <- listener_accept@src/listener.c:1099 tasklet_wakeup quic_conn_app_io_cb 2 921.1us 460.6us 203.1us 101.5us <- qc_xprt_start@src/xprt_quic.c:7122 tasklet_wakeup h1_timeout_task 2618 526.8us 201.0ns 1.121s 428.4us <- h1_release@src/mux_h1.c:1087 task_wakeup process_resolvers 316 283.3us 896.0ns 14.96ms 47.33us <- wake_expired_tasks@src/task.c:429 task_drop_running sc_conn_io_cb 420 235.6us 560.0ns 116.7ms 277.8us <- h2s_notify_recv@src/mux_h2.c:1298 tasklet_wakeup qc_idle_timer_task 1 225.5us 225.5us 506.0ns 506.0ns <- wake_expired_tasks@src/task.c:344 task_wakeup accept_queue_process 36 153.0us 4.250us 5.834ms 162.1us <- accept_queue_process@src/listener.c:165 tasklet_wakeup sc_conn_io_cb 18 54.05us 3.003us 11.50us 638.0ns <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup h2_io_cb 6 38.88us 6.480us 2.089ms 348.2us <- h2_do_shutw@src/mux_h2.c:4656 tasklet_wakeup srv_cleanup_idle_conns 54 37.72us 698.0ns 14.21ms 263.1us <- wake_expired_tasks@src/task.c:429 task_drop_running sc_conn_io_cb 50 32.86us 657.0ns 28.83ms 576.5us <- qcs_notify_recv@src/mux_quic.c:519 tasklet_wakeup qc_io_cb 2 30.25us 15.12us 6.093us 3.046us <- qc_init@src/mux_quic.c:2057 tasklet_wakeup srv_cleanup_toremove_conns 1 27.16us 27.16us 905.6us 905.6us <- srv_cleanup_idle_conns@src/server.c:5948 task_wakeup task_run_applet 39 19.61us 502.0ns 818.7us 20.99us <- run_tasks_from_lists@src/task.c:652 task_drop_running quic_accept_run 2 15.46us 7.727us 305.5us 152.8us <- quic_accept_push_qc@src/quic_sock.c:458 tasklet_wakeup h2_timeout_task 32 12.91us 403.0ns 4.207ms 131.5us <- h2_release@src/mux_h2.c:1191 task_wakeup quic_conn_app_io_cb 1 9.645us 9.645us 1.445us 1.445us <- qc_process_timer@src/xprt_quic.c:4589 tasklet_wakeup > show profiling tasks bytime aggr Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg qc_io_cb 212301 3.147m 889.5us 1.009m 285.2us process_stream 15503573 2.519m 9.747us 2.148h 498.7us h1_io_cb 15916733 36.95s 2.321us 1.535h 347.1us quic_conn_app_io_cb 318845 24.21s 75.92us 3.410s 10.70us sc_conn_io_cb 20037058 24.19s 1.207us 2.737h 491.8us h2_io_cb 596543 12.55s 21.04us 1.326m 133.4us quic_lstnr_dghdlr 326624 3.094s 9.473us 3.462m 635.9us task_run_applet 100 64.43ms 644.3us 5.285ms 52.85us quic_conn_io_cb 4 5.951ms 1.488ms 38.85us 9.713us qc_process_timer 3061 1.750ms 571.0ns 1.162s 379.5us accept_queue_process 396 1.521ms 3.840us 86.16ms 217.6us h1_timeout_task 2618 526.8us 201.0ns 1.121s 428.4us process_resolvers 319 286.0us 896.0ns 16.82ms 52.73us qc_idle_timer_task 1 225.5us 225.5us 506.0ns 506.0ns srv_cleanup_idle_conns 54 37.72us 698.0ns 14.21ms 263.1us srv_cleanup_toremove_conns 1 27.16us 27.16us 905.6us 905.6us quic_accept_run 2 15.46us 7.727us 305.5us 152.8us h2_timeout_task 32 12.91us 403.0ns 4.207ms 131.5us	2022-09-08 16:38:10 +02:00
Willy Tarreau	dc89b1806c	MINOR: activity/cli: support aggregating task profiling outputs By default we now dump stats between caller and callee, but by specifying "aggr" on the command line, stats get aggregated by callee again as it used to be before the feature was available. It may sometimes be helpful when comparing total call counts, though that's about all.	2022-09-08 16:32:17 +02:00
Willy Tarreau	64435aaa85	MINOR: tasks/activity: improve the caller-callee activity hash The previous dump already showed that the "other" category was getting a few entries. Let's proceed like for the memory profiling, by scanning a limited range of adjacent slots to find a spare one (16 max). That's pretty fast since close and likely prefetched and the comparison is cheap. The new dump now shows up to 45 entries below without "other": Now: Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg task_run_applet 22 34.56ms 1.571ms 1.145ms 52.04us <- sc_applet_create@src/stconn.c:489 appctx_wakeup task_run_applet 21 11.11us 529.0ns 2.590ms 123.3us <- run_tasks_from_lists@src/task.c:652 task_drop_running task_run_applet 5 7.715ms 1.543ms 2.186us 437.0ns <- sc_app_chk_snd_applet@src/stconn.c:996 appctx_wakeup accept_queue_process 345 3.129ms 9.068us 72.84ms 211.1us <- listener_accept@src/listener.c:1099 tasklet_wakeup accept_queue_process 32 113.0us 3.529us 3.070ms 95.94us <- accept_queue_process@src/listener.c:165 tasklet_wakeup sc_conn_io_cb 5026032 3.037s 604.0ns 17.47m 208.5us <- sc_app_chk_rcv_conn@src/stconn.c:762 tasklet_wakeup sc_conn_io_cb 4361192 7.626s 1.748us 3.179m 43.74us <- h1_wake_stream_for_recv@src/mux_h1.c:2600 tasklet_wakeup sc_conn_io_cb 178293 275.4ms 1.544us 2.740m 922.0us <- qcs_notify_send@src/mux_quic.c:529 tasklet_wakeup sc_conn_io_cb 2561 15.84ms 6.185us 1.036s 404.4us <- h1_wake_stream_for_send@src/mux_h1.c:2610 tasklet_wakeup sc_conn_io_cb 453 261.4us 577.0ns 86.79ms 191.6us <- h2s_notify_recv@src/mux_h2.c:1298 tasklet_wakeup sc_conn_io_cb 89 50.05us 562.0ns 100.7ms 1.131ms <- qcs_notify_recv@src/mux_quic.c:519 tasklet_wakeup sc_conn_io_cb 8 19.04us 2.379us 472.5us 59.06us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup process_resolvers 50 57.50us 1.149us 1.116ms 22.32us <- wake_expired_tasks@src/task.c:429 task_drop_running srv_cleanup_idle_conns 8 5.669us 708.0ns 216.6us 27.08us <- wake_expired_tasks@src/task.c:429 task_drop_running process_stream 4599847 48.79s 10.61us 16.92m 220.7us <- sc_notify@src/stconn.c:1209 task_wakeup process_stream 4530081 52.82s 11.66us 14.92m 197.6us <- stream_new@src/stream.c:563 task_wakeup process_stream 15 201.7us 13.45us 31.58ms 2.105ms <- sc_app_chk_snd_conn@src/stconn.c:857 task_wakeup h1_io_cb 7861205 18.22s 2.317us 2.408m 18.38us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup h1_io_cb 474763 1.379s 2.905us 6.578m 831.4us <- conn_subscribe@src/connection.c:732 tasklet_wakeup h1_io_cb 34830 38.64ms 1.109us 18.85s 541.2us <- h1_takeover@src/mux_h1.c:4085 tasklet_wakeup h1_io_cb 2561 2.150ms 839.0ns 674.4ms 263.3us <- sock_conn_iocb@src/sock.c:849 tasklet_wakeup h1_timeout_task 2634 588.5us 223.0ns 890.5ms 338.1us <- h1_release@src/mux_h1.c:1087 task_wakeup h2_timeout_task 16 7.519us 469.0ns 1.146ms 71.63us <- h2_release@src/mux_h2.c:1191 task_wakeup h2_io_cb 99601 2.212s 22.21us 19.33s 194.1us <- h2_snd_buf@src/mux_h2.c:6712 tasklet_wakeup h2_io_cb 79777 146.6ms 1.837us 3.529s 44.24us <- h2c_restart_reading@src/mux_h2.c:856 tasklet_wakeup h2_io_cb 60698 2.259s 37.21us 4.704s 77.50us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup h2_io_cb 5 36.90us 7.380us 2.045ms 409.0us <- h2_do_shutw@src/mux_h2.c:4656 tasklet_wakeup qc_io_cb 26595 8.007s 301.1us 4.261s 160.2us <- qc_treat_acked_tx_frm@src/xprt_quic.c:1695 tasklet_wakeup qc_io_cb 7921 5.284s 667.1us 2.171s 274.1us <- qc_stream_desc_ack@src/quic_stream.c:128 tasklet_wakeup qc_io_cb 6229 5.851s 939.3us 1.856s 297.9us <- h3_snd_buf@src/h3.c:1084 tasklet_wakeup qc_io_cb 994 699.1ms 703.3us 174.9ms 176.0us <- qcc_release_remote_stream@src/mux_quic.c:1200 tasklet_wakeup qc_io_cb 65 9.883ms 152.0us 13.33ms 205.1us <- qc_process_timer@src/xprt_quic.c:4602 tasklet_wakeup qc_io_cb 1 293.5us 293.5us 105.9us 105.9us <- qcs_consume@src/mux_quic.c:800 tasklet_wakeup qc_io_cb 1 10.87us 10.87us 3.307us 3.307us <- qc_init@src/mux_quic.c:2057 tasklet_wakeup quic_conn_io_cb 2 2.531ms 1.265ms 2.839us 1.419us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after quic_conn_app_io_cb 61392 2.620s 42.67us 268.0ms 4.365us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after quic_conn_app_io_cb 408 10.56ms 25.88us 124.0ms 303.8us <- qc_process_timer@src/xprt_quic.c:4635 tasklet_wakeup quic_conn_app_io_cb 2 15.61us 7.806us 103.2us 51.59us <- qc_process_timer@src/xprt_quic.c:4589 tasklet_wakeup quic_conn_app_io_cb 1 410.6us 410.6us 11.52us 11.52us <- qc_xprt_start@src/xprt_quic.c:7122 tasklet_wakeup quic_lstnr_dghdlr 62716 409.2ms 6.523us 21.81s 347.8us <- quic_lstnr_dgram_dispatch@src/quic_sock.c:255 tasklet_wakeup qc_process_timer 410 245.4us 598.0ns 238.5ms 581.7us <- wake_expired_tasks@src/task.c:344 task_wakeup quic_accept_run 1 7.711us 7.711us 82.28us 82.28us <- quic_accept_push_qc@src/quic_sock.c:458 tasklet_wakeup	2022-09-08 16:25:36 +02:00
Willy Tarreau	3d4cdb198c	MEDIUM: tasks/activity: combine the called function with the caller Now instead of getting aggregate stats per called function, we have them per function AND per call place. The "byaddr" sort considers the function pointer first, then the call count, so that dominant callers of a given callee are instantly spotted. This allows to get sorted outputs like this: Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg h1_io_cb 17357952 40.91s 2.357us 4.849m 16.76us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup sc_conn_io_cb 10357182 6.297s 607.0ns 27.93m 161.8us <- sc_app_chk_rcv_conn@src/stconn.c:762 tasklet_wakeup process_stream 9891131 1.809m 10.97us 53.61m 325.2us <- sc_notify@src/stconn.c:1209 task_wakeup process_stream 9823934 1.887m 11.52us 48.31m 295.1us <- stream_new@src/stream.c:563 task_wakeup sc_conn_io_cb 9347863 16.59s 1.774us 6.143m 39.43us <- h1_wake_stream_for_recv@src/mux_h1.c:2600 tasklet_wakeup h1_io_cb 501344 1.848s 3.686us 6.544m 783.2us <- conn_subscribe@src/connection.c:732 tasklet_wakeup sc_conn_io_cb 239717 492.3ms 2.053us 3.213m 804.3us <- qcs_notify_send@src/mux_quic.c:529 tasklet_wakeup h2_io_cb 173019 4.204s 24.30us 40.95s 236.7us <- h2_snd_buf@src/mux_h2.c:6712 tasklet_wakeup h2_io_cb 149487 424.3ms 2.838us 14.63s 97.87us <- h2c_restart_reading@src/mux_h2.c:856 tasklet_wakeup other 101893 4.626s 45.40us 14.84s 145.7us quic_lstnr_dghdlr 94389 614.0ms 6.504us 30.54s 323.6us <- quic_lstnr_dgram_dispatch@src/quic_sock.c:255 tasklet_wakeup quic_conn_app_io_cb 92205 3.735s 40.51us 390.9ms 4.239us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after qc_io_cb 50355 19.01s 377.5us 10.65s 211.4us <- qc_treat_acked_tx_frm@src/xprt_quic.c:1695 tasklet_wakeup h1_io_cb 44427 155.0ms 3.489us 21.50s 484.0us <- h1_takeover@src/mux_h1.c:4085 tasklet_wakeup qc_io_cb 9018 4.924s 546.0us 3.084s 342.0us <- qc_stream_desc_ack@src/quic_stream.c:128 tasklet_wakeup h1_timeout_task 3236 1.172ms 362.0ns 1.119s 345.9us <- h1_release@src/mux_h1.c:1087 task_wakeup h1_io_cb 2804 7.974ms 2.843us 1.980s 706.0us <- sock_conn_iocb@src/sock.c:849 tasklet_wakeup sc_conn_io_cb 2804 33.44ms 11.92us 2.597s 926.2us <- h1_wake_stream_for_send@src/mux_h1.c:2610 tasklet_wakeup qc_io_cb 2623 2.669s 1.017ms 1.347s 513.5us <- h3_snd_buf@src/h3.c:1084 tasklet_wakeup qc_process_timer 662 526.4us 795.0ns 1.081s 1.633ms <- wake_expired_tasks@src/task.c:344 task_wakeup quic_conn_app_io_cb 648 12.62ms 19.47us 225.7ms 348.2us <- qc_process_timer@src/xprt_quic.c:4635 tasklet_wakeup accept_queue_process 286 1.571ms 5.494us 72.55ms 253.7us <- listener_accept@src/listener.c:1099 tasklet_wakeup process_resolvers 176 157.8us 896.0ns 7.835ms 44.52us <- wake_expired_tasks@src/task.c:429 task_drop_running qc_io_cb 167 10.71ms 64.12us 32.47ms 194.4us <- qc_process_timer@src/xprt_quic.c:4602 tasklet_wakeup sc_conn_io_cb 123 80.05us 650.0ns 50.35ms 409.4us <- qcs_notify_recv@src/mux_quic.c:519 tasklet_wakeup h2_timeout_task 32 30.69us 958.0ns 9.038ms 282.4us <- h2_release@src/mux_h2.c:1191 task_wakeup task_run_applet 24 33.79ms 1.408ms 5.838ms 243.3us <- sc_applet_create@src/stconn.c:489 appctx_wakeup accept_queue_process 17 56.34us 3.314us 7.505ms 441.5us <- accept_queue_process@src/listener.c:165 tasklet_wakeup srv_cleanup_toremove_conns 16 1.133ms 70.81us 5.685ms 355.3us <- srv_cleanup_idle_conns@src/server.c:5948 task_wakeup srv_cleanup_idle_conns 16 74.57us 4.660us 2.797ms 174.8us <- wake_expired_tasks@src/task.c:429 task_drop_running quic_conn_app_io_cb 12 786.9us 65.58us 2.042ms 170.1us <- qc_process_timer@src/xprt_quic.c:4589 tasklet_wakeup sc_conn_io_cb 9 20.55us 2.283us 2.475ms 275.0us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup h2_io_cb 8 34.12us 4.265us 1.784ms 223.0us <- h2_do_shutw@src/mux_h2.c:4656 tasklet_wakeup task_run_applet 4 6.615ms 1.654ms 2.306us 576.0ns <- sc_app_chk_snd_applet@src/stconn.c:996 appctx_wakeup quic_conn_io_cb 4 4.278ms 1.069ms 6.469us 1.617us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after qc_io_cb 2 20.81us 10.40us 4.943us 2.471us <- qc_init@src/mux_quic.c:2057 tasklet_wakeup quic_conn_app_io_cb 2 752.9us 376.4us 63.97us 31.99us <- qc_xprt_start@src/xprt_quic.c:7122 tasklet_wakeup quic_accept_run 2 13.84us 6.920us 172.8us 86.42us <- quic_accept_push_qc@src/quic_sock.c:458 tasklet_wakeup qc_idle_timer_task 2 295.0us 147.5us 8.761us 4.380us <- wake_expired_tasks@src/task.c:344 task_wakeup qc_io_cb 1 867.1us 867.1us 812.8us 812.8us <- qcs_consume@src/mux_quic.c:800 tasklet_wakeup ... and calls sorted by address like this: Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg task_run_applet 23 32.73ms 1.423ms 5.837ms 253.8us <- sc_applet_create@src/stconn.c:489 appctx_wakeup task_run_applet 4 6.615ms 1.654ms 2.306us 576.0ns <- sc_app_chk_snd_applet@src/stconn.c:996 appctx_wakeup accept_queue_process 285 1.566ms 5.495us 72.49ms 254.3us <- listener_accept@src/listener.c:1099 tasklet_wakeup accept_queue_process 17 56.34us 3.314us 7.505ms 441.5us <- accept_queue_process@src/listener.c:165 tasklet_wakeup sc_conn_io_cb 10357182 6.297s 607.0ns 27.93m 161.8us <- sc_app_chk_rcv_conn@src/stconn.c:762 tasklet_wakeup sc_conn_io_cb 9347863 16.59s 1.774us 6.143m 39.43us <- h1_wake_stream_for_recv@src/mux_h1.c:2600 tasklet_wakeup sc_conn_io_cb 239717 492.3ms 2.053us 3.213m 804.3us <- qcs_notify_send@src/mux_quic.c:529 tasklet_wakeup sc_conn_io_cb 2804 33.44ms 11.92us 2.597s 926.2us <- h1_wake_stream_for_send@src/mux_h1.c:2610 tasklet_wakeup sc_conn_io_cb 123 80.05us 650.0ns 50.35ms 409.4us <- qcs_notify_recv@src/mux_quic.c:519 tasklet_wakeup sc_conn_io_cb 9 20.55us 2.283us 2.475ms 275.0us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup process_resolvers 159 145.9us 917.0ns 7.823ms 49.20us <- wake_expired_tasks@src/task.c:429 task_drop_running srv_cleanup_idle_conns 16 74.57us 4.660us 2.797ms 174.8us <- wake_expired_tasks@src/task.c:429 task_drop_running srv_cleanup_toremove_conns 16 1.133ms 70.81us 5.685ms 355.3us <- srv_cleanup_idle_conns@src/server.c:5948 task_wakeup process_stream 9891130 1.809m 10.97us 53.61m 325.2us <- sc_notify@src/stconn.c:1209 task_wakeup process_stream 9823933 1.887m 11.52us 48.31m 295.1us <- stream_new@src/stream.c:563 task_wakeup h1_io_cb 17357952 40.91s 2.357us 4.849m 16.76us <- sock_conn_iocb@src/sock.c:869 tasklet_wakeup h1_io_cb 501344 1.848s 3.686us 6.544m 783.2us <- conn_subscribe@src/connection.c:732 tasklet_wakeup h1_io_cb 44427 155.0ms 3.489us 21.50s 484.0us <- h1_takeover@src/mux_h1.c:4085 tasklet_wakeup h1_io_cb 2804 7.974ms 2.843us 1.980s 706.0us <- sock_conn_iocb@src/sock.c:849 tasklet_wakeup h1_timeout_task 3236 1.172ms 362.0ns 1.119s 345.9us <- h1_release@src/mux_h1.c:1087 task_wakeup h2_timeout_task 32 30.69us 958.0ns 9.038ms 282.4us <- h2_release@src/mux_h2.c:1191 task_wakeup h2_io_cb 173019 4.204s 24.30us 40.95s 236.7us <- h2_snd_buf@src/mux_h2.c:6712 tasklet_wakeup h2_io_cb 149487 424.3ms 2.838us 14.63s 97.87us <- h2c_restart_reading@src/mux_h2.c:856 tasklet_wakeup h2_io_cb 8 34.12us 4.265us 1.784ms 223.0us <- h2_do_shutw@src/mux_h2.c:4656 tasklet_wakeup qc_io_cb 50355 19.01s 377.5us 10.65s 211.4us <- qc_treat_acked_tx_frm@src/xprt_quic.c:1695 tasklet_wakeup qc_io_cb 9018 4.924s 546.0us 3.084s 342.0us <- qc_stream_desc_ack@src/quic_stream.c:128 tasklet_wakeup qc_io_cb 2623 2.669s 1.017ms 1.347s 513.5us <- h3_snd_buf@src/h3.c:1084 tasklet_wakeup qc_io_cb 167 10.71ms 64.12us 32.47ms 194.4us <- qc_process_timer@src/xprt_quic.c:4602 tasklet_wakeup qc_io_cb 2 20.81us 10.40us 4.943us 2.471us <- qc_init@src/mux_quic.c:2057 tasklet_wakeup qc_io_cb 1 867.1us 867.1us 812.8us 812.8us <- qcs_consume@src/mux_quic.c:800 tasklet_wakeup qc_idle_timer_task 2 295.0us 147.5us 8.761us 4.380us <- wake_expired_tasks@src/task.c:344 task_wakeup quic_conn_io_cb 4 4.278ms 1.069ms 6.469us 1.617us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after quic_conn_app_io_cb 92205 3.735s 40.51us 390.9ms 4.239us <- qc_lstnr_pkt_rcv@src/xprt_quic.c:6184 tasklet_wakeup_after quic_conn_app_io_cb 648 12.62ms 19.47us 225.7ms 348.2us <- qc_process_timer@src/xprt_quic.c:4635 tasklet_wakeup quic_conn_app_io_cb 12 786.9us 65.58us 2.042ms 170.1us <- qc_process_timer@src/xprt_quic.c:4589 tasklet_wakeup quic_conn_app_io_cb 2 752.9us 376.4us 63.97us 31.99us <- qc_xprt_start@src/xprt_quic.c:7122 tasklet_wakeup quic_lstnr_dghdlr 94389 614.0ms 6.504us 30.54s 323.6us <- quic_lstnr_dgram_dispatch@src/quic_sock.c:255 tasklet_wakeup qc_process_timer 662 526.4us 795.0ns 1.081s 1.633ms <- wake_expired_tasks@src/task.c:344 task_wakeup quic_accept_run 2 13.84us 6.920us 172.8us 86.42us <- quic_accept_push_qc@src/quic_sock.c:458 tasklet_wakeup other 101892 4.626s 45.40us 14.84s 145.7us It already becomes visible that some tasks have different very costs depending where they're called (e.g. process_stream). The method used to wake them up is also shown. Applets are handled specially and shown as appctx_wakeup.	2022-09-08 16:21:22 +02:00
Willy Tarreau	41e701e2c1	DEBUG: quic: export the few task handlers that often appear in task dumps The following task/tasklet handlers often appear in "show profiling tasks" but were not resolved since static: qc_io_cb, quic_conn_app_io_cb, process_timer, quic_accept_run, qc_idle_timer_task This commit simply exports them so they can be resolved now. "process_timer" which was a bit too generic and renamed to qc_process_timer.	2022-09-08 16:13:38 +02:00
Willy Tarreau	0fbc16cfb9	DEBUG: resolvers: unstatify process_resolvers() to make it appear in profiling The function appears like this in "show profiling tasks", so let's export it: function calls cpu_tot cpu_avg lat_tot lat_avg main+0x1463f0 92 77.28us 839.0ns 2.018ms 21.93us <- wake_expired_tasks@src/task.c:429 task_drop_running	2022-09-08 16:13:38 +02:00
Willy Tarreau	a3423873fe	CLEANUP: activity: make the number of sched activity entries more configurable This removes all the hard-coded 8-bit and 256 entries to use a pair of macros instead so that we can more easily experiment with larger table sizes if needed.	2022-09-08 14:55:09 +02:00

... 8 9 10 11 12 ...

14970 Commits