haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
Willy Tarreau	ed4464e6c6	BUG/MINOR: mux_h2: missing space between "st" and ".flg" in the "show fd" helper That was causing confusing outputs like this one whenan H2S is known: 1030 : ... last_h2s=0x2ed8390 .id=775 .st=HCR.flg=0x4001 .rxbuf=... ^^^^ This was introduced by commit `ab2ec4540` in 2.1-dev2 so the fix can be backported as far as 2.1.	2021-01-20 17:17:39 +01:00
Fr�d�ric L�caille	2b0ba54ddb	BUG/MINOR: peers: Wrong "new_conn" value for "show peers" CLI command. This counter could be hugely incremented by the peer task responsible of managing peer synchronizations and reconnections, for instance when a peer is not reachable there is a period where the appctx is not created. If we receive stick-table updates before the peer session (appctx) is instantiated, we reach the code responsible of incrementing the "new_conn" counter. With this patch we increment this counter only when we really instantiate a new peer session thanks to peer_session_create(). May be backported as far as 2.0.	2021-01-19 10:08:18 +01:00
Tim Duesterhus	ed84d84a29	CLEANUP: Rename accept_encoding_hash_cmp to accept_encoding_bitmap_cmp For the `accept-encoding` header a bitmap and not a hash is stored.	2021-01-18 15:01:48 +01:00
Tim Duesterhus	5897cfe18e	CLEANUP: cache: Use proper data types in secondary_key_cmp() - hash_length is `unsigned int` and so should offset. - idx is compared to a `size_t` and thus it should also be.	2021-01-18 15:01:46 +01:00
Tim Duesterhus	1d66e396bf	MINOR: cache: Remove the `hash` part of the accept-encoding secondary key As of commit `6ca89162dc` this hash no longer is required, because unknown encodings are not longer stored and known encodings do not use the cache.	2021-01-18 15:01:41 +01:00
Fr�d�ric L�caille	4b1a05fcf8	BUG/MINOR: peers: Possible appctx pointer dereference. This bug may occur when enabling peers traces. It is possible that peer->appctx is NULL when entering peer_session_release().	2021-01-17 21:58:03 +01:00
Remi Tricot-Le Breton	6ca89162dc	MINOR: cache: Do not store responses with an unknown encoding If a server varies on the accept-encoding header and it sends a response with an encoding we do not know (see parse_encoding_value function), we will not store it. This will prevent unexpected errors caused by cache collisions that could happen in accept_encoding_hash_cmp.	2021-01-15 22:33:05 +01:00
Adis Nezirovic	b62b78be13	BUG/MEDIUM: stats: add missing INF_BUILD_INFO definition commit `5a982a7165` ("MINOR: contrib/prometheus-exporter: export build_info") is breaking lua `core.get_info()`. This patch makes sure build_info is correctly initialised in all cases. Reviewed-by: William Dauchy <wdauchy@gmail.com>	2021-01-15 18:47:19 +01:00
Willy Tarreau	81d7092dbd	BUILD: peers: fix build warning about unused variable Previous commit `da2b0844f` ("MINOR: peers: Add traces for peer control messages.") introduced a build warning on some compiler versions after the removal of variable "peers" in peer_send_msgs() because variable "s" was used only to assign this one, and variable "si" to assign "s". Let's remove both to fix the warning. No backport is needed.	2021-01-15 17:08:38 +01:00
Baptiste Assmann	6554742b15	BUG/MINOR: dns: SRV records ignores duplicated AR records (v2) V2 of this fix which includes a missing pointer initialization which was causing a segfault in v1 (`949a7f6459`) This bug happens when a service has multiple records on the same host and the server provides the A/AAAA resolution in the response as AR (Additional Records). In such condition, the first occurence of the host will be taken from the Additional section, while the second (and next ones) will be process by an independent resolution task (like we used to do before 2.2). This can lead to a situation where the "synchronisation" of the resolution may diverge, like described in github issue #971. Because of this behavior, HAProxy mixes various type of requests to resolve the full list of servers: SRV+AR for all "first" occurences and A/AAAA for all other occurences of an existing hostname. IE: with the following type of response: ;; ANSWER SECTION: _http._tcp.be2.tld. 3600 IN SRV 5 500 80 A2.tld. _http._tcp.be2.tld. 3600 IN SRV 5 500 86 A3.tld. _http._tcp.be2.tld. 3600 IN SRV 5 500 80 A1.tld. _http._tcp.be2.tld. 3600 IN SRV 5 500 85 A3.tld. ;; ADDITIONAL SECTION: A2.tld. 3600 IN A 192.168.0.2 A3.tld. 3600 IN A 192.168.0.3 A1.tld. 3600 IN A 192.168.0.1 A3.tld. 3600 IN A 192.168.0.3 the first A3 host is resolved using the Additional Section and the second one through a dedicated A request. When linking the SRV records to their respective Additional one, a condition was missing (chek if said SRV record is already attached to an Additional one), leading to stop processing SRV only when the target SRV field matches the Additional record name. Hence only the first occurence of a target was managed by an additional record. This patch adds a condition in this loop to ensure the record being parsed is not already linked to an Additional Record. If so, we can carry on the parsing to find a possible next one with the same target field value. backport status: 2.2 and above	2021-01-15 17:01:24 +01:00
Fr�d�ric L�caille	da2b0844fc	MINOR: peers: Add traces for peer control messages. Display traces when sending/receiving peer control messages (synchronisation, heartbeat). Add remaining traces when parsing malformed messages (acks, stick-table definitions) or ignoring them. Also add traces when releasing session or when reaching the PEER_SESS_ST_ERRPROTO peer protocol state.	2021-01-15 16:57:17 +01:00
Willy Tarreau	dc2410d093	CLEANUP: pattern: rename pat_ref_commit() to pat_ref_commit_elt() It's about the third time I get confused by these functions, half of which manipulate the reference as a whole and those manipulating only an entry. For me "pat_ref_commit" means committing the pattern reference, not just an element, so let's rename it. A number of other ones should really be renamed before 2.4 gets released :-/	2021-01-15 14:11:59 +01:00
David CARLIER	6a9060189d	BUG/MINOR: threads: Fixes the number of possible cpus report for Mac. There is no low level api to achieve same as Linux/FreeBSD, we rely on CPUs available. Without this, the number of threads is just 1 for Mac while having 8 cores in my M1. Backporting to 2.1 should be enough if that's possible. Signed-off-by: David CARLIER <devnexen@gmail.com>	2021-01-15 11:58:46 +01:00
Christopher Faulet	e3bdc81f8a	MINOR: server: Forbid server definitions in frontend sections An fatal error is now reported if a server is defined in a frontend section. til now, a warning was just emitted and the server was ignored. The warning was added in the 1.3.4 when the frontend/backend keywords were introduced to allow a smooth transition and to not break existing configs. It is old enough now to emit an fatal error in this case. This patch is related to the issue #1043. It may be backported at least as far as 2.2, and possibly to older versions. It relies on the previous commit ("MINOR: config: Add failifnotcap() to emit an alert on proxy capabilities").	2021-01-13 17:45:34 +01:00
Christopher Faulet	4e36682d51	BUG/MINOR: init: Use a dynamic buffer to set HAPROXY_CFGFILES env variable The HAPROXY_CFGFILES env variable is built using a static trash chunk, via a call to get_trash_chunk() function. This chunk is reserved during the whole configuration parsing. It is far too large to guarantee it will not be reused during the configuration parsing. And in fact, it happens in the lua code since the commit `f67442efd` ("BUG/MINOR: lua: warn when registering action, conv, sf, cli or applet multiple times"), when a lua script is loaded. To fix the bug, we now use a dynamic buffer instead. And we call memprintf() function to handle both the allocation and the formatting. Allocation errors at this stage are fatal. This patch should fix the issue #1041. It must be backported as far as 2.0.	2021-01-13 17:45:25 +01:00
William Dauchy	5d9b8f3c93	MINOR: contrib/prometheus-exporter: use fill_info for process dump use `stats_fill_info` when possible to avoid duplicating code. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-01-13 15:19:00 +01:00
Jerome Magnin	50f757c5fd	BUG/MINOR: init: enforce strict-limits when using master-worker The strict-limits global option was introduced with commit `0fec3ab7b` ("MINOR: init: always fail when setrlimit fails"). When used in conjuction with master-worker, haproxy will not fail when a setrlimit fails. This happens because we only exit() if master-worker isn't used. This patch removes all tests for master-worker mode for all cases covered by strict-limits scope. This should be backported from 2.1 onward. This should fix issue #1042. Reviewed by William Dauchy <wdauchy@gmail.com>	2021-01-13 13:17:11 +01:00
Christopher Faulet	6ecd59326f	BUG/MINOR: check: Don't perform any check on servers defined in a frontend If a server is defined in a frontend, thus a proxy without the backend capability, the 'check' and 'agent-check' keywords are ignored. This way, no check is performed on an ignored server. This avoids a segfault because some part of the tcpchecks are not fully initialized (or released for frontends during the post-check). In addition, an test on the server's proxy capabilities is performed when checks or agent-checks are initialized and nothing is performed for servers attached to a non-backend proxy. This patch should fix the issue #1043. It must be backported as far as 2.2.	2021-01-12 17:55:22 +01:00
Remi Tricot-Le Breton	22e0d9b39c	BUG/MINOR: sample: Memory leak of sample_expr structure in case of error If an errors occurs during the sample expression parsing, the alloced sample_expr is not freed despite having its main pointer reset. This fixes GitHub issue #1046. It could be backported as far as 1.8.	2021-01-12 17:00:59 +01:00
Christopher Faulet	a1eea3bbb1	Revert "BUG/MINOR: dns: SRV records ignores duplicated AR records" This reverts commit `949a7f6459`. The first part of the patch introduces a bug. When a dns answer item is allocated, its <ar_item> is only initialized at the end of the parsing, when the item is added in the answer list. Thus, we must not try to release it during the parsing. The second part is also probably buggy. It fixes the issue #971 but reverts a fix for the issue #841 (see commit fb0884c8297 "BUG/MEDIUM: dns: Don't store additional records in a linked-list"). So it must be at least revalidated. This revert fixes a segfault reported in a comment of the issue #971. It must be backported as far as 2.2.	2021-01-12 16:37:54 +01:00
William Dauchy	e997010acc	BUG/MINOR: sample: check alloc_trash_chunk return value in concat() like it is done in other places, check the return value of `alloc_trash_chunk` before using it. This was detected by coverity. this patch fixes commit `591fc3a330` ("BUG/MINOR: sample: fix concat() converter's corruption with non-string variables" As a consequence, this patch should be backported as far as 2.0 this should fix github issue #1039 Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-01-11 14:10:11 +01:00
William Dauchy	aabde71332	MINOR: reg-tests: add a way to add service dependency I was looking at writing a simple first test for prometheus but I realised there is no proper way to exclude it if haproxy was not built with prometheus plugin. Today we have `REQUIRE_OPTIONS` in reg-tests which is based on `Feature list` from `haproxy -vv`. Those options are coming from the Makefile itself. A plugin is build this way: EXTRA_OBJS="contrib/prometheus-exporter/service-prometheus.o" It does register service actions through `service_keywords_register`. Those are listed through `list_services` in `haproxy -vv`. To facilitate parsing, I slightly changed the output to a single line and integrate it in regtests shell script so that we can now specify a dependency while writing a reg-test for prometheus, e.g: #REQUIRE_SERVICE=prometheus-exporter #REQUIRE_SERVICES=prometheus-exporter,foo There might be other ways to handle this, but that's the cleanest I found; I understand people might be concerned by this output change in `haproxy -vv` which goes from: Available services : foo bar to: Available services : foo bar Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-01-10 07:42:33 +01:00
William Dauchy	5417e898ff	CLEANUP: sample: remove uneeded check in json validation - check functions are never called with a NULL args list, it is always an array, so first check can be removed - the expression parser guarantees that we can't have anything else, because we mentioned json converter takes a mandatory string argument. Thus test on `ARGT_STR` can be removed as well - also add breaking line between enum and function declaration In order to validate it, add a simple json test testing very simple cases but can be improved in the future: - default json converter without args - json converter failing on error (utf8) - json converter with error being removed (utf8s) Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-01-10 07:39:58 +01:00
Thayne McCombs	4fb255df03	BUG/MINOR: server: Memory leak of proxy.used_server_addr during deinit GitHub Issue #1037 Reported a memory leak in deinit() caused by an allocation made in sa2str() that was stored in srv_set_addr_desc(). When destroying each server for a proxy in deinit, include freeing the memory in the key of server->addr_node. The leak was introduced in commit `92149f9a8` ("MEDIUM: stick-tables: Add srvkey option to stick-table") which is not in any released version so no backport is needed. Cc: Tim Duesterhus <tim@bastelstu.be>	2021-01-10 07:22:15 +01:00
Willy Tarreau	591fc3a330	BUG/MINOR: sample: fix concat() converter's corruption with non-string variables Patrick Hemmer reported that calling concat() with an integer variable causes a %00 to appear at the beginning of the output. Looking at the code, it's not surprising. The function uses get_trash_chunk() to get one of the trashes, but can call casting functions which will also use their trash in turn and will cycle back to ours, causing the trash to be overwritten before being assigned to a sample. By allocating the trash from a pool using alloc_trash_chunk(), we can avoid this. However we must free it so the trash's contents must be moved to a permanent trash buffer before returning. This is what's achieved using smp_dup(). This should be backported as far as 2.0.	2021-01-08 16:08:43 +01:00
Thayne McCombs	8f0cc5c4ba	CLEANUP: Fix spelling errors in comments This is from the output of codespell. It's done at once over a bunch of files and only affects comments, so there is nothing user-visible. No backport needed.	2021-01-08 14:56:32 +01:00
Tim Duesterhus	22586524e3	BUG/MINOR: hlua: Fix memory leak in hlua_alloc During a configuration check valgrind reports: ==14425== 0 bytes in 106 blocks are definitely lost in loss record 1 of 107 ==14425== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==14425== by 0x4C2FDEF: realloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==14425== by 0x443CFC: hlua_alloc (hlua.c:8662) ==14425== by 0x5F72B11: luaM_realloc_ (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==14425== by 0x5F78089: luaH_free (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==14425== by 0x5F707D3: sweeplist (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==14425== by 0x5F710D0: luaC_freeallobjects (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==14425== by 0x5F7715D: close_state (in /usr/lib/x86_64-linux-gnu/liblua5.3.so.0.0.0) ==14425== by 0x443D4C: hlua_deinit (hlua.c:9302) ==14425== by 0x543F88: deinit (haproxy.c:2742) ==14425== by 0x5448E7: deinit_and_exit (haproxy.c:2830) ==14425== by 0x5455D9: init (haproxy.c:2044) This is due to Lua calling `hlua_alloc()` with `ptr = NULL` and `nsize = 0`. While `realloc` is supposed to be equivalent `free()` if the size is `0` this is only required for a non-NULL pointer. Apparently my allocator (or valgrind) actually allocates a zero size area if the pointer is NULL, possibly taking up some memory for management structures. Fix this leak by specifically handling the case where both the pointer and the size are `0`. This bug appears to have been introduced with the introduction of the multi-threaded Lua, thus this fix is specific for 2.4. No backport needed.	2021-01-08 14:46:43 +01:00
Ilya Shipitsin	76837bc948	CLEANUP: cfgparse: replace "realloc" with "my_realloc2" to fix to memory leak on error my_realloc2 frees variable in case of allocation failure. fixes #1030 realloc was introduced in `9e1758efbd` this might be backported to 2.2, 2.3	2021-01-08 14:45:39 +01:00
Ilya Shipitsin	761d64c7ae	BUILD: ssl: guard openssl specific with SSL_READ_EARLY_DATA_SUCCESS let us switch to SSL_READ_EARLY_DATA_SUCCESS instead of openssl versions	2021-01-07 10:20:04 +01:00
Ilya Shipitsin	ec36c91c69	BUILD: ssl: guard EVP_PKEY_get_default_digest_nid with ASN1_PKEY_CTRL_DEFAULT_MD_NID let us switch to openssl specific macro instead of versions	2021-01-07 10:20:00 +01:00
Ilya Shipitsin	2aa4b3a083	BUILD: SSL: guard TLS13 ciphersuites with HAVE_SSL_CTX_SET_CIPHERSUITES accidently src/server.c still used earlier guarding	2021-01-07 10:19:56 +01:00
William Dauchy	888b0ae8cf	MINOR: converter: adding support for url_enc add base support for url encode following RFC3986, supporting `query` type only. - add test checking url_enc/url_dec/url_enc - update documentation - leave the door open for future changes this should resolve github issue #941 Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-01-06 23:43:04 +01:00
Willy Tarreau	421ed3952d	[RELEASE] Released version 2.4-dev5 Released version 2.4-dev5 with the following main changes : - BUG/MEDIUM: mux_h2: Add missing braces in h2_snd_buf()around trace+wakeup - BUILD: hpack: hpack-tbl-t.h uses VAR_ARRAY but does not include compiler.h - MINOR: time: increase the minimum wakeup interval to 60s - MINOR: check: do not ignore a connection header for http-check send - REGTESTS: complete http-check test - CI: travis-ci: drop coverity scan builds - MINOR: atomic: don't use ; to separate instruction on aarch64. - IMPORT: xxhash: update to v0.8.0 that introduces stable XXH3 variant - MEDIUM: xxhash: use the XXH3 functions to generate 64-bit hashes - MEDIUM: xxhash: use the XXH_INLINE_ALL macro to inline all functions - CLEANUP: xxhash: remove the unused src/xxhash.c - MINOR: sample: add the xxh3 converter - REGTESTS: add tests for the xxh3 converter - MINOR: protocol: Create proto_quic QUIC protocol layer. - MINOR: connection: Attach a "quic_conn" struct to "connection" struct. - MINOR: quic: Redefine control layer callbacks which are QUIC specific. - MINOR: ssl_sock: Initialize BIO and SSL objects outside of ssl_sock_init() - MINOR: connection: Add a new xprt to connection. - MINOR: ssl: Export definitions required by QUIC. - MINOR: cfgparse: Do not modify the QUIC xprt when parsing "ssl". - MINOR: tools: Add support for QUIC addresses parsing. - MINOR: quic: Add definitions for QUIC protocol. - MINOR: quic: Import C source code files for QUIC protocol. - MINOR: listener: Add QUIC info to listeners and receivers. - MINOR: server: Add QUIC definitions to servers. - MINOR: ssl: SSL CTX initialization modifications for QUIC. - MINOR: ssl: QUIC transport parameters parsing. - MINOR: quic: QUIC socket management finalization. - MINOR: cfgparse: QUIC default server transport parameters init. - MINOR: quic: Enable the compilation of QUIC modules. - MAJOR: quic: Make usage of ebtrees to store QUIC ACK ranges. - MINOR: quic: Attempt to make trace more readable - MINOR: quic: Make usage of the congestion control window. - MINOR: quic: Flag RX packet as ack-eliciting from the generic parser. - MINOR: quic: Code reordering to help in reviewing/modifying. - MINOR: quic: Add traces to congestion avoidance NewReno callback. - MINOR: quic: Display the SSL alert in ->ssl_send_alert() callback. - MINOR: quic: Update the initial salt to that of draft-29. - MINOR: quic: Add traces for in flght ack-eliciting packet counter. - MINOR: quic: make a packet build fails when qc_build_frm() fails. - MINOR: quic: Add traces for quic_packet_encrypt(). - MINOR: cache: Refactoring of secondary_key building functions - MINOR: cache: Avoid storing responses whose secondary key was not correctly calculated - BUG/MINOR: cache: Manage multiple headers in accept-encoding normalization - MINOR: cache: Add specific secondary key comparison mechanism - MINOR: http: Add helper functions to trim spaces and tabs - MEDIUM: cache: Manage a subset of encodings in accept-encoding normalizer - REGTESTS: cache: Simplify vary.vtc file - REGTESTS: cache: Add a specific test for the accept-encoding normalizer - MINOR: cache: Remove redundant test in http_action_req_cache_use - MINOR: cache: Replace the "process-vary" option's expected values - CI: GitHub Actions: enable daily Coverity scan - BUG/MEDIUM: cache: Fix hash collision in `accept-encoding` handling for `Vary` - MEDIUM: stick-tables: Add srvkey option to stick-table - REGTESTS: add test for stickiness using "srvkey addr" - BUILD: Makefile: disable -Warray-bounds until it's fixed in gcc 11 - BUG/MINOR: sink: Return an allocation failure in __sink_new if strdup() fails - BUG/MINOR: lua: Fix memory leak error cases in hlua_config_prepend_path - MINOR: lua: Use consistent error message 'memory allocation failed' - CLEANUP: Compare the return value of `XXXcmp()` functions with zero - CLEANUP: Apply the coccinelle patch for `XXXcmp()` on include/ - CLEANUP: Apply the coccinelle patch for `XXXcmp()` on contrib/ - MINOR: qpack: Add static header table definitions for QPACK. - CLEANUP: qpack: Wrong comment about the draft for QPACK static header table. - CLEANUP: quic: Remove useless QUIC event trace definitions. - BUG/MINOR: quic: Possible CRYPTO frame building errors. - MINOR: quic: Pass quic_conn struct to frame parsers. - BUG/MINOR: quic: Wrong STREAM frames parsing. - MINOR: quic: Drop packets with STREAM frames with wrong direction. - CLEANUP: ssl: Remove useless loop in tlskeys_list_get_next() - CLEANUP: ssl: Remove useless local variable in tlskeys_list_get_next() - MINOR: ssl: make tlskeys_list_get_next() take a list element - Revert "BUILD: Makefile: disable -Warray-bounds until it's fixed in gcc 11" - BUG/MINOR: cfgparse: Fail if the strdup() for `rule->be.name` for `use_backend` fails - CLEANUP: mworker: remove duplicate pointer tests in cfg_parse_program() - CLEANUP: Reduce scope of `header_name` in http_action_store_cache() - CLEANUP: Reduce scope of `hdr_age` in http_action_store_cache() - CLEANUP: spoe: fix typo on `var_check_arg` comment - BUG/MINOR: tcpcheck: Report a L7OK if the last evaluated rule is a send rule - CI: github actions: build several popular "contrib" tools - DOC: Improve the message printed when running `make` w/o `TARGET` - BUG/MEDIUM: server: srv_set_addr_desc() crashes when a server has no address - REGTESTS: add unresolvable servers to srvkey-addr - BUG/MINOR: stats: Make stat_l variable used to dump a stat line thread local - BUG/MINOR: quic: NULL pointer dereferences when building post handshake frames. - SCRIPTS: improve announce-release to support different tag and versions - SCRIPTS: make announce release support preparing announces before tag exists - CLEANUP: assorted typo fixes in the code and comments - BUG/MINOR: srv: do not init address if backend is disabled - BUG/MINOR: srv: do not cleanup idle conns if pool max is null - CLEANUP: assorted typo fixes in the code and comments - CLEANUP: few extra typo and fixes over last one ("ot" -> "to")	2021-01-06 17:41:32 +01:00
Willy Tarreau	94a01e1cb7	CLEANUP: few extra typo and fixes over last one ("ot" -> "to") As noticed by Tim there were a few incorrect fixes in the previous patch ("ot" -> "to" and not "or").	2021-01-06 17:35:52 +01:00
Ilya Shipitsin	b8888ab557	CLEANUP: assorted typo fixes in the code and comments This is 15th iteration of typo fixes	2021-01-06 17:32:03 +01:00
Amaury Denoyelle	10d5c3172b	BUG/MINOR: srv: do not cleanup idle conns if pool max is null If a server is configured to not have any idle conns, returns immediatly from srv_cleanup_connections. This avoids a segfault when a server is configured with pool-max-conn to 0. This should be backported up to 2.2.	2021-01-06 16:57:17 +01:00
Amaury Denoyelle	e3c4192962	BUG/MINOR: srv: do not init address if backend is disabled Do not proceed on init_addr if the backend of the server is marked as disabled. When marked as disabled, the server is not fully initialized and some operation must be avoided to prevent segfault. It is correct because there is no way to activate a disabled backend. This fixes the github issue #1031. This should be backported to 2.2.	2021-01-06 16:57:17 +01:00
Ilya Shipitsin	1e9a66603f	CLEANUP: assorted typo fixes in the code and comments This is 14th iteration of typo fixes	2021-01-06 16:26:50 +01:00
Fr�d�ric L�caille	153d4a89d0	BUG/MINOR: quic: NULL pointer dereferences when building post handshake frames. The second one was detected by cppcheck contrary to the first one. Fixes issue #1032. Thank you to Ilya for having reported this.	2021-01-06 13:59:05 +01:00
Christopher Faulet	de79cd28ec	BUG/MINOR: stats: Make stat_l variable used to dump a stat line thread local Since `ee63d4bd6` ("MEDIUM: stats: integrate static proxies stats in new stats"), all dumped stats for a given domain, the default ones and the modules ones, are merged in a signle array to dump them in a generic way. For this purpose, the stat_l global variable is allocated at startup to store a line of stats before the dump, i.e. all stats of an entity (frontend, backend, listener, server or dns nameserver). But this variable is not thread safe. If stats are retrieved concurrently by several clients on different threads, the same variable is used. This leads to corrupted stats output. To fix the bug, the stat_l variable is now thread local. This patch should probably solve issues #972 and #992. It must be backported to 2.3.	2021-01-06 10:34:12 +01:00
Thayne McCombs	24da7e1aa6	BUG/MEDIUM: server: srv_set_addr_desc() crashes when a server has no address GitHub Issue #1026 reported a crash during configuration check for the following example config: backend 0 server 0 0 server 0 0 HAProxy crashed in srv_set_addr_desc() due to a NULL pointer dereference caused by `sa2str` returning NULL for an `AF_UNSPEC` address (`0`). Check to make sure the address key is non-null before using it for comparison or inserting it into the tree. The crash was introduced in commit `92149f9a8` ("MEDIUM: stick-tables: Add srvkey option to stick-table") which not in any released version so no backport is needed. Cc: Tim Duesterhus <tim@bastelstu.be>	2021-01-06 09:19:15 +01:00
Christopher Faulet	8d4977ae86	BUG/MINOR: tcpcheck: Report a L7OK if the last evaluated rule is a send rule When all rules of a tcpcheck ruleset are successfully evaluated, the right check status must always be reported. It is true if the last evaluated rule is an expect or a connect rule. But not if it is a send rule. In this situation, nothing more is done until the check timeout expiration and a L7TOUT is reported instead of a L7OK. Now, by default, when all rules were successfully evaluated, a L7OK is reported. When the last evaluated rule is an expect or a connect, the behavior remains unchanged. This patch should fix the issue #1027. It must be backported as far as 2.2.	2021-01-05 17:31:49 +01:00
William Dauchy	afb9368221	CLEANUP: spoe: fix typo on `var_check_arg` comment there was an extra `s` added to the `var_check_arg` function Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-01-05 17:23:32 +01:00
Tim Duesterhus	c294284e33	CLEANUP: Reduce scope of `hdr_age` in http_action_store_cache() This is only required to process the `age` header.	2021-01-05 17:05:58 +01:00
Tim Duesterhus	e2fff10a19	CLEANUP: Reduce scope of `header_name` in http_action_store_cache() This variable is only needed deeply nested in a single location and clang's static analyzer complains about a dead initialization. Reduce the scope to satisfy clang and the human that reads the function.	2021-01-05 17:05:58 +01:00
Willy Tarreau	8f7efcddd6	CLEANUP: mworker: remove duplicate pointer tests in cfg_parse_program() As reported in issue #1017, there are two harmless duplicate tests in cfg_parse_program(), one made of a "if" using the same condition as the loop it's in, and the other one being a null test before a free. This just removes them. No backport is needed.	2021-01-05 15:58:37 +01:00
Tim Duesterhus	5ce5a1586d	BUG/MINOR: cfgparse: Fail if the strdup() for `rule->be.name` for `use_backend` fails This patch fixes GitHub issue #1024. I could track the `strdup` back to commit `3a1f5fda10` which is 1.9-dev8. It's probably not worth the effort to backport it across this refactoring. This patch should be backported to 1.9+.	2021-01-05 11:37:41 +01:00
Willy Tarreau	b6fc524f05	MINOR: ssl: make tlskeys_list_get_next() take a list element As reported in issue #1010, gcc-11 as of 2021-01-05 is overzealous in its -Warray-bounds check as it considers that a cast of a global struct accesses the entire struct even if only one specific element is accessed. This instantly breaks all lists making use of container_of() to build their iterators as soon as the starting point is known if the next element is retrieved from the list head in a way that is visible to the compiler's optimizer, because it decides that accessing the list's next element dereferences the list as a larger struct (which it does not). The temporary workaround consisted in disabling -Warray-bounds, but this warning is traditionally quite effective at spotting real bugs, and we actually have is a single occurrence of this issue in the whole code. By changing the tlskeys_list_get_next() function to take a list element as the starting point instead of the current element, we can avoid the starting point issue but this requires to change all call places to write hideous casts made of &((struct blah*)ref)->list. At the moment we only have two such call places, the first one being used to initialize the list (which is the one causing the warning) and which is thus easy to simplify, and the second one for which we already have an aliased pointer to the reference that is still valid at the call place, and given the original pointer also remained unchanged, we can safely use this alias, and this is safer than leaving a cast there. Let's make this change now while it's still easy. The generated code only changed in function cli_io_handler_tlskeys_files() due to register allocation and the change of variable scope between the old one and the new one.	2021-01-05 11:15:45 +01:00
Tim Duesterhus	cb8b281c02	CLEANUP: ssl: Remove useless local variable in tlskeys_list_get_next() `getnext` was only used to fill `ref` at the beginning of the function. Both have the same type. Replace the parameter name by `ref` to remove the useless local variable.	2021-01-05 10:25:20 +01:00
Tim Duesterhus	2c7bb33144	CLEANUP: ssl: Remove useless loop in tlskeys_list_get_next() This loop was always exited in the first iteration by `return`.	2021-01-05 10:24:36 +01:00
Fr�d�ric L�caille	242fb1b639	MINOR: quic: Drop packets with STREAM frames with wrong direction. A server initiates streams with odd-numbered stream IDs. Also add useful traces when parsing STREAM frames.	2021-01-04 12:31:28 +01:00
Fr�d�ric L�caille	129a351a3f	BUG/MINOR: quic: Wrong STREAM frames parsing. After having re-read the RFC, we noticed there are two bugs in the STREAM frame parser. When the OFF bit (0x04) in the frame type is not set we must set the offset to 0 (it was not set at all). When the LEN bit (0x02) is not set we must extend the length of the data field to the end of the packet (it was not set at all).	2021-01-04 12:31:28 +01:00
Fr�d�ric L�caille	50044adc60	MINOR: quic: Pass quic_conn struct to frame parsers. This is only for debugging purposes.	2021-01-04 12:31:28 +01:00
Fr�d�ric L�caille	ea60499912	BUG/MINOR: quic: Possible CRYPTO frame building errors. This is issue is due to the fact that when we call the function responsible of building CRYPTO frames to fill a buffer, the Length field of this packet did not take into an account the trailing 16 bytes for the AEAD tag. Furthermore, the remaining <room> available in this buffer was not decremented by the CRYPTO frame length, but only by the CRYPTO data length of this frame.	2021-01-04 12:31:28 +01:00
Fr�d�ric L�caille	6c1e36ce55	CLEANUP: quic: Remove useless QUIC event trace definitions. Remove QUIC_EV_CONN_E* event trace macros which were defined for errors. Replace QUIC_EV_CONN_ECHPKT by QUIC_EV_CONN_BCFRMS used in qc_build_cfrms()	2021-01-04 12:31:28 +01:00
Fr�d�ric L�caille	d341fc3609	CLEANUP: qpack: Wrong comment about the draft for QPACK static header table. This came with a "copy and paste" from the definition for HPACK.	2021-01-04 12:31:28 +01:00
Fr�d�ric L�caille	164096eb76	MINOR: qpack: Add static header table definitions for QPACK. As HPACK, QPACK makes usage of a static header table.	2021-01-04 12:31:28 +01:00
Tim Duesterhus	e5ff14100a	CLEANUP: Compare the return value of `XXXcmp()` functions with zero According to coding-style.txt it is recommended to use: `strcmp(a, b) == 0` instead of `!strcmp(a, b)` So let's do this. The change was performed by running the following (very long) coccinelle patch on src/: @@ statement S; expression E; expression F; @@ if ( ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) != 0 ) ( S \| { ... } ) @@ statement S; expression E; expression F; @@ if ( - ! ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) == 0 ) ( S \| { ... } ) @@ expression E; expression F; expression G; @@ ( G && ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) != 0 ) @@ expression E; expression F; expression G; @@ ( G \|\| ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) != 0 ) @@ expression E; expression F; expression G; @@ ( ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) != 0 && G ) @@ expression E; expression F; expression G; @@ ( ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) != 0 \|\| G ) @@ expression E; expression F; expression G; @@ ( G && - ! ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) == 0 ) @@ expression E; expression F; expression G; @@ ( G \|\| - ! ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) == 0 ) @@ expression E; expression F; expression G; @@ ( - ! ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) == 0 && G ) @@ expression E; expression F; expression G; @@ ( - ! ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) == 0 \|\| G ) @@ expression E; expression F; expression G; @@ ( - ! ( dns_hostname_cmp \| eb_memcmp \| memcmp \| strcasecmp \| strcmp \| strncasecmp \| strncmp ) - (E, F) + (E, F) == 0 )	2021-01-04 10:09:02 +01:00
Tim Duesterhus	f89d43a381	MINOR: lua: Use consistent error message 'memory allocation failed' Other locations in the configuration parser use 'memory allocation failed', so use this one as well.	2021-01-03 20:37:16 +01:00
Tim Duesterhus	621e74afd1	BUG/MINOR: lua: Fix memory leak error cases in hlua_config_prepend_path In case of an error `p` is not properly freed. Minor leak during configuration parsing in out of memory situations, no backport needed.	2021-01-03 20:37:16 +01:00
Tim Duesterhus	a7ebffef66	BUG/MINOR: sink: Return an allocation failure in __sink_new if strdup() fails This patch fixes GitHub issue #1023. The function was introduced in commit `99c453d` ("MEDIUM: ring: new section ring to declare custom ring buffers."), which first appeared in 2.2-dev9. The fix should be backported to 2.2+.	2021-01-03 20:35:45 +01:00
Thayne McCombs	92149f9a82	MEDIUM: stick-tables: Add srvkey option to stick-table This allows using the address of the server rather than the name of the server for keeping track of servers in a backend for stickiness. The peers code was also extended to support feeding the dictionary using this key instead of the name. Fixes #814	2020-12-31 10:04:54 +01:00
Tim Duesterhus	dc38bc4a1a	BUG/MEDIUM: cache: Fix hash collision in `accept-encoding` handling for `Vary` This patch fixes GitHub Issue #988. Commit `ce9e7b2521` was not sufficient, because it fell back to a hash comparison if the bitmap of known encodings was not acceptable instead of directly returning the the cached response is not compatible. This patch also extends the reg-test to test the hash collision that was mentioned in #988. Vary handling is 2.4, no backport needed.	2020-12-31 09:39:08 +01:00
Remi Tricot-Le Breton	e6cc5b5974	MINOR: cache: Replace the "process-vary" option's expected values Replace the <0/1> expected values of the process-vary option by a more usual <on/off> pair.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	42efffd7f6	MINOR: cache: Remove redundant test in http_action_req_cache_use The suppressed check is fully covered by the next one and can then be removed.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	ce9e7b2521	MEDIUM: cache: Manage a subset of encodings in accept-encoding normalizer The accept-encoding normalizer now explicitely manages a subset of encodings which will all have their own bit in the encoding bitmap stored in the cache entry. This way two requests with the same primary key will be served the same cache entry if they both explicitely accept the stored response's encoding, even if their respective secondary keys are not the same and do not match the stored response's one. The actual hash of the accept-encoding will still be used if the response's encoding is unmanaged. The encoding matching and the encoding weight parsing are done for every subpart of the accept-encoding values, and a bitmap of accepted encodings is built for every request. It is then tested upon any stored response that has the same primary key until one with an accepted encoding is found. The specific "identity" and "*" accept-encoding values are managed too. When storing a response in the key, we also parse the content-encoding header in order to only set the response's corresponding encoding's bit in its cache_entry encoding bitmap. This patch fixes GitHub issue #988. It does not need to be backported.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	56e46cb393	MINOR: http: Add helper functions to trim spaces and tabs Add two helper functions that trim leading or trailing spaces and horizontal tabs from an ist string.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	6a34b2b65d	MINOR: cache: Add specific secondary key comparison mechanism Add the possibility to define custom comparison functions for every sub-part of the secondary key hash instead of using a global memcmp.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	e4421dec7e	BUG/MINOR: cache: Manage multiple headers in accept-encoding normalization The accept-encoding part of the secondary key (vary) was only built out of the first occurrence of the header. So if a client had two accept-encoding headers, gzip and br for instance, the key would have been built out of the gzip string. So another client that only managed gzip would have been sent the cached resource, even if it was a br resource. The http_find_header function is now called directly by the normalizers so that they can manage multiple headers if needed. A request that has more than 16 encodings will be considered as an illegitimate request and its response will not be stored. This fixes GitHub issue #987. It does not need any backport.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	2b5c5cbef6	MINOR: cache: Avoid storing responses whose secondary key was not correctly calculated If any of the secondary hash normalizing functions raises an error, the secondary hash will be unusable. In this case, the response will not be stored anymore.	2020-12-24 17:18:00 +01:00
Remi Tricot-Le Breton	bba2912758	MINOR: cache: Refactoring of secondary_key building functions The two secondary_key building functions (prebuild_full_key and build_key) have roughly the same content so their code can be mutualized.	2020-12-24 17:18:00 +01:00
Fr�d�ric L�caille	f63921fc24	MINOR: quic: Add traces for quic_packet_encrypt(). Add traces to have an idea why this function may fail. In fact in never fails when the passed parameters are correct, especially the lengths. This is not the case when a packet is not correctly built before being encrypted.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	133e8a7146	MINOR: quic: make a packet build fails when qc_build_frm() fails. Even if the size of frames built by qc_build_frm() are computed so that not to overflow a buffer, do not rely on this and always makes a packet build fails if we could not build a frame. Also add traces to have an idea where qc_build_frm() fails. Fixes a memory leak in qc_build_phdshk_apkt().	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	f7e0b8d6ae	MINOR: quic: Add traces for in flght ack-eliciting packet counter. Add trace for this counter. Also shorten its variable name (->ifae_pkts).	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	b4e17386cb	MINOR: quic: Update the initial salt to that of draft-29. This salt is ued at leat up to draft-32. At this date ngtcp2 always uses this salt even if it started the draft-33 development. Note that when the salt is not correct, we cannot remove the header protection. In this case the packet number length is wrong.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	47c433fdcb	MINOR: quic: Display the SSL alert in ->ssl_send_alert() callback. At least displays the SSL alert error code passed to ->ssl_send_alert() QUIC BIO method and the SSL encryption level. This function is newly called when using picoquic client with a recent version of BoringSSL (Nov 19 2020). This is not the case with OpenSSL with 32 as QUIC draft implementation.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	26c49d9eb0	MINOR: quic: Add traces to congestion avoidance NewReno callback. These traces are missing and are useful do diagnose issue in the congestion avoidance callback for NewReno algorithm.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	0c14020f11	MINOR: quic: Code reordering to help in reviewing/modifying. Reorder by increasing type the switch/case in qc_parse_pkt_frms() which is the high level frame parser. Add new STREAM_X frame types to support some tests with ngtcp2 client.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	f7fe9659f0	MINOR: quic: Flag RX packet as ack-eliciting from the generic parser. Add ->flags to the QUIC frame parser as this has been done for the builder so that to flag RX packets as ack-eliciting at low level. This should also be helpful to maintain the code if we have to add new flags to RX packets. Remove the statements which does the same thing as higher level in qc_parse_pkt_frms().	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	04ffb66bc9	MINOR: quic: Make usage of the congestion control window. Remove ->ifcdata which was there to control the CRYPTO data sent to the peer so that not to saturate its reception buffer. This was a sort of flow control. Add ->prep_in_flight counter to the QUIC path struct to control the number of bytes prepared to be sent so that not to saturare the congestion control window. This counter is increased each time a packet was built. This has nothing to see with ->in_flight which is the real in flight number of bytes which have really been sent. We are olbiged to maintain two such counters to know how many bytes of data we can prepared before sending them. Modify traces consequently which were useful to diagnose issues about the congestion control window usage.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	c5e72b9868	MINOR: quic: Attempt to make trace more readable As there is a lot of information in this protocol, this is not easy to make the traces readable. We remove here a few of them and shorten some line shortening the variable names.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	8090b51e92	MAJOR: quic: Make usage of ebtrees to store QUIC ACK ranges. Store QUIC ACK ranges in ebtrees in place of lists with a 0(n) time complexity for insertion.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	0a76901926	MINOR: cfgparse: QUIC default server transport parameters init. This patch is there to initialize the default transport parameters for QUIC as a preparation for one of the QUIC next steps to come: fully support QUIC protocol for haproxy servers.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	026a7921a5	MINOR: quic: QUIC socket management finalization. Implement ->accept_conn() callback for QUIC listener sockets. Note that this patch also implements quic_session_accept() function which is similar to session_accept_fd() without calling conn_complete_session() at this time because we do not have any real QUIC mux.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	e9473c7833	MINOR: ssl: QUIC transport parameters parsing. This patch modifies the TLS ClientHello message callback so that to parse the QUIC client transport parameters.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	ec216523f7	MINOR: ssl: SSL CTX initialization modifications for QUIC. Makes TLS/TCP and QUIC share the same CTX initializer so that not to modify the caller which is an XPRT callback used both by the QUIC xprt and the SSL xprt over TCP.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	f46c10cfb1	MINOR: server: Add QUIC definitions to servers. This patch adds QUIC structs to server struct so that to make the QUIC code compile. Also initializes the ebtree to store the connections by connection IDs.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	884f2e9f43	MINOR: listener: Add QUIC info to listeners and receivers. This patch adds a quic_transport_params struct to bind_conf struct used for the listeners. This is to store the QUIC transport parameters for the listeners. Also initializes them when calling str2listener(). Before str2sa_range() it's too early to figure we're going to speak QUIC, and after it's too late as listeners are already created. So it seems that doing it in str2listener() when the protocol is discovered is the best place. Also adds two ebtrees to the underlying receivers to store the connection by connections IDs (one for the original connection IDs, and another one for the definitive connection IDs which really identify the connections. However it doesn't seem normal that it is stored in the receiver nor the listener. There should be a private context in the listener so that protocols can store internal information. This element should in fact be the listener handle. Something still feels wrong, and probably we'll have to make QUIC and SSL co-exist: a proof of this is that there's some explicit code in bind_parse_ssl() to prevent the "ssl" keyword from replacing the xprt.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	a7e7ce957d	MINOR: quic: Import C source code files for QUIC protocol. This patch imports all the C files for QUIC protocol implementation with few modifications from 20200720-quic branch of quic-dev repository found at https://github.com/haproxytech/quic-dev. Traces were implemented to help with the development.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	10caf65634	MINOR: tools: Add support for QUIC addresses parsing. Add "quic4" and "quic6" keywords to str2sa_range() to parse QUIC IPv4 and IPv6 addresses respectively.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	e50afbd4e4	MINOR: cfgparse: Do not modify the QUIC xprt when parsing "ssl". When parsing "ssl" keyword for TLS bindings, we must not use the same xprt as the one for TLS/TCP connections. So, do not modify the QUIC xprt which will be initialized when parsing QUIC addresses wich "ssl" bindings.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	901ee2f37b	MINOR: ssl: Export definitions required by QUIC. QUIC needs to initialize its BIO and SSL session the same way as for SSL over TCP connections. It needs also to use the same ClientHello callback. This patch only exports functions and variables shared between QUIC and SSL/TCP connections.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	5aa92411fb	MINOR: ssl_sock: Initialize BIO and SSL objects outside of ssl_sock_init() This patch extraces the code which initializes the BIO and SSL session objects so that to reuse it elsewhere later for QUIC conections which only needs SSL and BIO objects at th TLS layer stack level to work.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	70da889d57	MINOR: quic: Redefine control layer callbacks which are QUIC specific. We add src/quic_sock.c QUIC specific socket management functions as callbacks for the control layer: ->accept_conn, ->default_iocb and ->rx_listening. accept_conn() will have to be defined. The default I/O handler only recvfrom() the datagrams received. Furthermore, ->rx_listening callback always returns 1 at this time but should returns 0 when reloading the processus.	2020-12-23 11:57:26 +01:00
Fr�d�ric L�caille	ca42b2c9d3	MINOR: protocol: Create proto_quic QUIC protocol layer. As QUIC is a connection oriented protocol, this file is almost a copy of proto_tcp without TCP specific features. To suspend/resume a QUIC receiver we proceed the same way as for proto_udp receivers. With the recent updates to the listeners, we don't need a specific set of quic*_add_listener() functions, the default ones are sufficient. The fields declaration were reordered to make the various layers more visible like in other protocols. udp_suspend_receiver/udp_resume_receiver are up-to-date (the check for INHERITED is present) and the code being UDP-specific, it's normal to use UDP here. Note that in the future we might more reasily reference stacked layers so that there's no more need for specifying the pointer here.	2020-12-23 11:57:26 +01:00
Dragan Dosen	04bf0cc086	MINOR: sample: add the xxh3 converter This patch adds support for the XXH3 variant of hash function that generates a 64-bit hash.	2020-12-23 06:39:21 +01:00
Dragan Dosen	6bfe425679	CLEANUP: xxhash: remove the unused src/xxhash.c The source file src/xxhash.c is removed, as we use XXH_INLINE_ALL.	2020-12-23 06:39:21 +01:00
Dragan Dosen	967e7e79af	MEDIUM: xxhash: use the XXH3 functions to generate 64-bit hashes Replace the XXH64() function calls with the XXH3 variant function XXH3_64bits_withSeed() where possible.	2020-12-23 06:39:21 +01:00
Dragan Dosen	de37443e64	IMPORT: xxhash: update to v0.8.0 that introduces stable XXH3 variant A new XXH3 variant of hash functions shows a noticeable improvement in performance (especially on small data), and also brings 128-bit support, better inlining and streaming capabilities. Performance comparison is available here: https://github.com/Cyan4973/xxHash/wiki/Performance-comparison	2020-12-23 06:39:21 +01:00
Amaury Denoyelle	6d975f0af6	MINOR: check: do not ignore a connection header for http-check send Allow the user to specify a custom Connection header for http-check send. This is useful for example to implement a websocket upgrade check. If no connection header has been set, a 'Connection: close' header is automatically appended to allow the server to close the connection immediately after the request/response. Update the documentation related to http-check send. This fixes the github issue #1009.	2020-12-22 14:22:44 +01:00
Tim Duesterhus	12a08d8849	BUG/MEDIUM: mux_h2: Add missing braces in h2_snd_buf()around trace+wakeup This is a regression in `7838a79ba` ("MEDIUM: mux-h2/trace: add lots of traces all over the code"). The issue was found using -Wmisleading-indentation. This patch fixes GitHub issue #1015. The impact of this bug is that it could in theory cause occasional delays on some long responses for connections having otherwise no traffic. This patch should be backported to 2.1+, the commit was first tagged in v2.1-dev2.	2020-12-22 09:02:11 +01:00
Ilya Shipitsin	f38a01884a	CLEANUP: assorted typo fixes in the code and comments This is 13n iteration of typo fixes	2020-12-21 11:24:48 +01:00
Baptiste Assmann	949a7f6459	BUG/MINOR: dns: SRV records ignores duplicated AR records This bug happens when a service has multiple records on the same host and the server provides the A/AAAA resolution in the response as AR (Additional Records). In such condition, the first occurence of the host will be taken from the Additional section, while the second (and next ones) will be process by an independent resolution task (like we used to do before 2.2). This can lead to a situation where the "synchronisation" of the resolution may diverge, like described in github issue #971. Because of this behavior, HAProxy mixes various type of requests to resolve the full list of servers: SRV+AR for all "first" occurences and A/AAAA for all other occurences of an existing hostname. IE: with the following type of response: ;; ANSWER SECTION: _http._tcp.be2.tld. 3600 IN SRV 5 500 80 A2.tld. _http._tcp.be2.tld. 3600 IN SRV 5 500 86 A3.tld. _http._tcp.be2.tld. 3600 IN SRV 5 500 80 A1.tld. _http._tcp.be2.tld. 3600 IN SRV 5 500 85 A3.tld. ;; ADDITIONAL SECTION: A2.tld. 3600 IN A 192.168.0.2 A3.tld. 3600 IN A 192.168.0.3 A1.tld. 3600 IN A 192.168.0.1 A3.tld. 3600 IN A 192.168.0.3 the first A3 host is resolved using the Additional Section and the second one through a dedicated A request. When linking the SRV records to their respective Additional one, a condition was missing (chek if said SRV record is already attached to an Additional one), leading to stop processing SRV only when the target SRV field matches the Additional record name. Hence only the first occurence of a target was managed by an additional record. This patch adds a condition in this loop to ensure the record being parsed is not already linked to an Additional Record. If so, we can carry on the parsing to find a possible next one with the same target field value. backport status: 2.2 and above	2020-12-21 11:19:09 +01:00
Ilya Shipitsin	af204881a3	BUILD: ssl: fine guard for SSL_CTX_get0_privatekey call SSL_CTX_get0_privatekey is openssl/boringssl specific function present since openssl-1.0.2, let us define readable guard for it, not depending on HA_OPENSSL_VERSION	2020-12-21 11:17:36 +01:00
Willy Tarreau	c7ead07b9c	CLEANUP: debug: mark the RNG's seed as unsigned Since commit `8a069eb9a` ("MINOR: debug: add a trivial PRNG for scheduler stress-tests"), 32-bit gcc 4.7 emits this warning when parsing the initial seed for the debugger's RNG (2463534242): src/debug.c:46:1: warning: this decimal constant is unsigned only in ISO C90 [enabled by default] Let's mark it explicitly unsigned.	2020-12-18 16:31:08 +01:00
Christopher Faulet	0c366a8761	BUG/MEDIUM: mux-h1: Handle h1_process() failures on a pipelined request On frontend side, when a conn-stream is detached from a H1 connection, the H1 stream is destroyed and if we already have some data to parse (a pipelined request), we process these data immedialtely calling h1_process(). Then we adjust the H1 connection timeout. But h1_process() may fail and release the H1 connection. For instance, a parsing error may be reported. Thus, when that happens, we must not use anymore the H1 connection and exit. This patch must be backported as far as the 2.2. This bug can impact the 2.3 and the 2.2, in theory, if h1 stream creation fails. But, concretly, it only fails on the 2.4 because the requests are now parsed at this step.	2020-12-18 15:13:58 +01:00
Christopher Faulet	fac0f8f029	CLEANUP: mux-h2: Rename h2c_frt_handle_data() to be generic h2c_frt_handle_data() is now used to parse DATA frames on the frontend and the backend side. Thus it is renamed into h2c_handle_data().	2020-12-18 15:05:57 +01:00
Christopher Faulet	142854b1da	CLEANUP: mux-h2: Rename h2s_frt_make_resp_data() to be generic h2s_frt_make_resp_data() is now used to emit DATA frames on the frontend and the backend side. Thus it is renamed into h2s_make_data().	2020-12-18 15:05:57 +01:00
Christopher Faulet	198ef8b1de	BUG/MEDIUM: http-ana: Never for sending data in TUNNEL mode When a channel is set in TUNNEL mode, we now always set the CF_NEVER_WAIT flag, to be sure to never wait for sending data. It is important because in TUNNEL mode, we have no idea if more data are expected or not. Setting this flag prevent the MSG_MORE flag to be set on the connection. It is only a problem with the HTX, since the 2.2. On previous versions, the MSG_MORE flag is only set on the mux initiative. In fact, the problem arises because there is an ambiguity in tunnel mode about the HTX_FL_EOI flag. In this mode, from the mux point of view, while the SHUTR is not received more data are expected. But from the channel point of view, we want to send data asap. At short term, this fix is good enough and is valid anyway. But for the long term more reliable solution must be found. At least, the to_forward field must regain its original meaning. This patch must be backported as far as 2.2.	2020-12-18 15:05:57 +01:00
Christopher Faulet	3e1748bbf3	BUG/MINOR: mux-h1: Don't set CS_FL_EOI too early for protocol upgrade requests When a protocol upgrade request is received, once parsed, it is waiting for the response in the DONE state. But we must not set the flag CS_FL_EOI because we don't know if a protocol upgrade will be performed or not. Now, it is set on the response path, if both sides reached the DONE state. If a protocol upgrade is finally performed, both side are switched in TUNNEL state. Thus the CS_FL_EOI flag is not set. If backported, this patch must be adapted because for now it relies on last 2.4-dev changes. It may be backported as far as 2.0.	2020-12-18 15:05:57 +01:00
Christopher Faulet	c75668ebff	BUG/MINOR: http: Establish a tunnel for all 2xx responses to a CONNECT As stated in the rfc7231, section 4.3.6, an HTTP tunnel via a CONNECT method is successfully established if the server replies with any 2xx status code. However, only 200 responses are considered as valid. With this patch, any 2xx responses are now considered to estalish the tunnel. This patch may be backported on demand to all stable versions and adapted for the legacy HTTP. It works this way since a very long time and nobody complains.	2020-12-18 15:05:57 +01:00
Miroslav Zagorac	7f8314c8d1	MINOR: opentracing: add ARGC_OT enum Due to the addition of the OpenTracing filter it is necessary to define ARGC_OT enum. This value is used in the functions fmt_directive() and smp_resolve_args().	2020-12-16 15:49:53 +01:00
Miroslav Zagorac	6deab79d59	MINOR: vars: replace static functions with global ones The OpenTracing filter uses several internal HAProxy functions to work with variables and therefore requires two static local HAProxy functions, var_accounting_diff() and var_clear(), to be declared global. In fact, the var_clear() function was not originally defined as static, but it lacked a declaration.	2020-12-16 14:20:08 +01:00
Remi Tricot-Le Breton	5853c0c0d5	MINOR: cache: Add a max-secondary-entries cache option This new option allows to tune the maximum number of simultaneous entries with the same primary key in the cache (secondary entries). When we try to store a response in the cache and there are already max-secondary-entries living entries in the cache, the storage will fail (but the response will still be sent to the client). It defaults to 10 and does not have a maximum number.	2020-12-15 16:35:09 +01:00
Remi Tricot-Le Breton	73be796462	MEDIUM: cache: Avoid going over duplicates lists too often The secondary entry counter cannot be updated without going over all the items of a duplicates list periodically. In order to avoid doing it too often and to impact the cache's performances, a timestamp is added to the cache_entry. It will store the timestamp (with second precision) of the last iteration over the list (actually the last call of the clear_expired_duplicates function). This way, this function will not be called more than once per second for a given duplicates list.	2020-12-15 16:35:09 +01:00
Remi Tricot-Le Breton	65904e4f07	MEDIUM: cache: Add a secondary entry counter and insertion limitation Add an arbitrary maximum number of secondary entries per primary hash (10 for now) to the cache. This prevents the cache from being filled with duplicates of the same resource. This works thanks to an entry counter that is kept in one of the duplicates of the list (the last one). When an entry is added to the list, the ebtree's implementation ensures that it will be added to the end of the existing list so the only thing to do to keep the counter updated is to get the previous counter from the second to last entry. Likewise, when an entry is explicitely deleted, we update the counter from the list's last item.	2020-12-15 16:35:09 +01:00
Ilya Shipitsin	ec60909871	BUILD: SSL: fine guard for SSL_CTX_add_server_custom_ext call SSL_CTX_add_server_custom_ext is openssl specific function present since openssl-1.0.2, let us define readable guard for it, not depending on HA_OPENSSL_VERSION	2020-12-15 16:13:35 +01:00
Remi Tricot-Le Breton	964caaff0e	BUG/MAJOR: cache: Crash because of disabled entry not removed from the tree The cache entries are now added into the tree even when they are not complete yet. If we realized while trying to add a response's payload that the shctx was full, the entry was disabled through the disable_cache_entry function, which cleared the key field of the entry's node, but without actually removing it from the tree. So the shctx row could be stolen from the entry and the row's content be rewritten while a lookup in the tree would still find a reference to the old entry. This caused a random crash in case of cache saturation and row reuse. This patch adds the missing removal of the node from the tree next to the reset of the key in disable_cache_entry. This bug was introduced by commit `3243447` ("MINOR: cache: Add entry to the tree as soon as possible") It does not need to be backported.	2020-12-15 15:31:30 +01:00
William Lallemand	a55685bfea	BUG/MEDIUM: ssl/crt-list: bad behavior with "commit ssl cert" In issue #1004, it was reported that it is not possible to remove correctly a certificate after updating it when it came from a crt-list. Indeed the "commit ssl cert" command on the CLI does not update the list of ckch_inst in the crtlist_entry. Because of this, the "del ssl crt-list" command does not remove neither the instances nor the SNIs because they were never linked to the crtlist_entry. This patch fixes the issue by inserting the ckch_inst in the crtlist_entry once generated. Must be backported as far as 2.2.	2020-12-15 15:13:21 +01:00
Christopher Faulet	cc043f66b7	BUG/MEDIUM: mux-h1: Fix a deadlock when a 408 error is pending for a client When a frontend H1 connection timed out waiting for the next request, a 408 error message is returned to the client. It is performed into the H1C task process function, h1_timeout_task(), and under the idle connection takeover lock. If the 408 error message cannot be sent immediately, we wait for a next retry. In this case, the lock must be released. This bug was introduced by the commit `c4bfa59f1d` ("MAJOR: mux-h1: Create the client stream as later as possible") and is specific to the 2.4-DEV. No backport needed.	2020-12-14 10:06:13 +01:00
Christopher Faulet	cb33d3ac7f	BUG/MEDIUM: lb-leastconn: Reposition a server using the right eweight Depending on the context, the current eweight or the next one must be used to reposition a server in the tree. When the server state is updated, for instance its weight, the next eweight must be used because it is not yet committed. However, when the server is used, on normal conditions, the current eweight must be used. In fact, it is only a bug on the 1.8. On newer versions, the changes on a server are performed synchronously. But it is safer to rely on the right eweight value to avoid any futur bugs. On the 1.8, it is important to do so, because the server state is updated and committed inside the rendez-vous point. Thus, the next server state may be unsync with the current state for a short time, waiting all threads join the rendez-vous point. It is especially a problem if the next eweight is set to 0. Because otherwise, it must not be used to reposition the server in the tree, leading to a divide by 0. This patch must be backported as far as 1.8.	2020-12-14 09:52:34 +01:00
Willy Tarreau	746b0515a4	MEDIUM: connection: make use of the control layer check_events/ignore_events This changes the subscribe/unsubscribe functions to rely on the control layer's check_events/ignore_events. At the moment only the socket version of these functions is present so the code should basically be the same.	2020-12-11 17:06:11 +01:00
Willy Tarreau	472125bc04	MINOR: protocol: add a pair of check_events/ignore_events functions at the ctrl layer Right now the connection subscribe/unsubscribe code needs to manipulate FDs, which is not compatible with QUIC. In practice what we need there is to be able to either subscribe or wake up depending on readiness at the moment of subscription. This commit introduces two new functions at the control layer, which are provided by the socket code, to check for FD readiness or subscribe to it at the control layer. For now it's not used.	2020-12-11 17:02:50 +01:00
Willy Tarreau	2ded48dd27	MINOR: connection: make conn_sock_drain() use the control layer's ->drain() Now we don't touch the fd anymore there, instead we rely on the ->drain() provided by the control layer. As such the function was renamed to conn_ctrl_drain().	2020-12-11 16:26:01 +01:00
Willy Tarreau	427c846cc9	MINOR: protocol: add a ->drain() function at the connection control layer This is what we need to drain pending incoming data from an connection. The code was taken from conn_sock_drain() without the connection-specific stuff. It still takes a connection for now for API simplicity.	2020-12-11 16:26:00 +01:00
Willy Tarreau	586f71b43f	REORG: connection: move the socket iocb (conn_fd_handler) to sock.c conn_fd_handler() is 100% specific to socket code. It's about time it moves to sock.c which manipulates socket FDs. With it comes conn_fd_check() which tests for the socket's readiness. The ugly connection status check at the end of the iocb was moved to an inlined function in connection.h so that if we need it for other socket layers it's not too hard to reuse. The code was really only moved and not changed at all.	2020-12-11 16:26:00 +01:00
Willy Tarreau	827fee7406	MINOR: connection: remove sock-specific code from conn_sock_send() The send() loop present in this function and the error handling is already present in raw_sock_from_buf(). Let's rely on it instead and stop touching the FD from this place. The send flag was changed to use a more agnostic CO_SFL_*. The name was changed to "conn_ctrl_send()" to remind that it's meant to be used to send at the lowest level.	2020-12-11 16:25:11 +01:00
Amaury Denoyelle	f7719a25db	MINOR: stream: add timeout sample fetches Add cur_server_timeout and cur_tunnel_timeout. These sample fetches return the current timeout value for a stream. This is useful to retrieve the value of a timeout which was changed via a set-timeout rule.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	12bada5662	MINOR: stream: add sample fetches Prepare the possibility to register sample fetches on the stream. This commit is necessary to implement sample fetches to retrieve the current timeout values.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	d91d779618	MINOR: backend: add timeout sample fetches Add be_server_timeout and be_tunnel_timeout. These sample fetches return the configuration value for server or tunnel timeout on the backend side.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	da184d5306	MINOR: frontend: add client timeout sample fetch Add a sample fetch named fe_client_timeout to return the configuration value for the client timeout on a frontend.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	8d22823ade	MEDIUM: http_act: define set-timeout server/tunnel action Add a new http-request action 'set-timeout [server/tunnel]'. This action can be used to update the server or tunnel timeout of a stream. It takes two parameters, the timeout name to update and the new timeout value. This rule is only valid for a proxy with backend capabilities. The timeout value cannot be null. A sample expression can also be used instead of a plain value.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	fb50443517	MEDIUM: stream: support a dynamic tunnel timeout Allow the modification of the tunnel timeout on the stream side. Use a new field in the stream for the tunnel timeout. It is initialized by the tunnel timeout from backend unless it has already been set by a set-timeout tunnel rule.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	90d3d882e3	MEDIUM: stream: support a dynamic server timeout Allow the modification of the timeout server value on the stream side. Do not apply the default backend server timeout in back_establish if it is already defined. This is the case if a set-timeout server rule has been executed.	2020-12-11 12:01:07 +01:00
Amaury Denoyelle	b715078821	MINOR: stream: prepare the hot refresh of timeouts Define a stream function to allow to update the timeouts. This commit is in preparation for the support of dynamic timeouts with the set-timeout rule.	2020-12-11 12:01:07 +01:00
Christopher Faulet	82635a0fc1	BUG/MINOR: tools: Reject size format not starting by a digit parse_size_err() function is now more strict on the size format. The first character must be a digit. Otherwise an error is returned. Thus "size k" is now rejected. This patch must be backported to all stable versions.	2020-12-11 12:01:07 +01:00
Christopher Faulet	c20ad0d8db	BUG/MINOR: tools: make parse_time_err() more strict on the timer validity First, an error is now reported if the first character is not a digit. Thus, "timeout client s" triggers an error now. Then 'u' is also rejected now. 'us' is valid and should be used set the timer in microseconds. However 'u' alone is not a valid unit. It was just ignored before (default to milliseconds). Now, it is an error. Finally, a warning is reported if the end of the text is not reached after the timer parsing. This warning will probably be switched to an error in a futur version. This patch must be backported to all stable versions.	2020-12-11 12:01:04 +01:00
Christopher Faulet	cad5f5e1ed	MINOR: tcpcheck: Only wait for more payload data on HTTP expect rules For HTTP expect rules, if the buffer is not empty, it is guarantee that all responses headers are received, with the start-line. Thus, except for payload matching, there is no reason to wait for more data from the moment the htx message is not empty. This patch may be backported as far as 2.2.	2020-12-11 11:48:15 +01:00
Christopher Faulet	c878f56f7c	BUG/MINOR: tcpcheck: Don't rearm the check timeout on each read The check timeout is used to limit a health-check execution. By default inter timeout is used. But when defined the check timeout is used. In this case, the inter timeout (or connect timeout) is used for the connection establishment only. And the check timeout for the health-check execution. Thus, it must be set after a successfull connect. It means it is rearm at the end of each connect rule. This patch with the previous one (BUG/MINOR: http-check: Use right condition to consider HTX message as full) should solve the issue #991. It must be backported as far as 2.2. On the 2.3 and 2.2, there are 2 places were the connection establishement is handled. The check timeout must be set on both.	2020-12-11 11:48:15 +01:00
Christopher Faulet	3f527197cd	BUG/MINOR: http-check: Use right condition to consider HTX message as full When an HTTP expect rule is evaluated, we must know if more data is expected or not to wait if the matching fails. If the whole response is received or if the HTX message is full, we must not wait. In this context, htx_free_data_space() must be used instead of htx_free_space(). The fisrt one count down the block size. Otherwise at the edge, when only the block size remains free (8 bytes), we may think there is some place for more data while the mux is unable to add more block. This bug explains the loop described on the GH issue #991. It should be backported as far as 2.2.	2020-12-11 11:48:15 +01:00
Willy Tarreau	8b250ba738	CLEANUP: connection: open-code conn_cond_update_polling() and update the comment This last call to conn_cond_update_polling() is now totally misleading as the function only stops polling in case of unrecoverable connection error. Let's open-code the test to make it more prominent and explain what we're trying to do there. It's even almost certain this code is never executed anymore, as the only remaining case should be a mux's wake function setting CO_FL_ERROR without disabling the polling, but they need to be audited first to make sure this is the case.	2020-12-11 11:19:24 +01:00
Willy Tarreau	f7e4a6fc07	MINOR: checks: don't call conn_cond_update_polling() anymore This was a leftover of the pre-mux v1.8-dev3 era. It makes no sense anymore to try to disable polling on a connection we don't own, it's the mux's job and it's properly done upon shutdowns and closes.	2020-12-11 11:11:06 +01:00
Willy Tarreau	30bd4efb1b	MINOR: checks: use cs_drain_and_close() instead of draining the connection As explained in previous commit, the situation is absurd as we try to cleanly drain pending data before impolitely shutting down, and it could be counter productive on real muxes. Let's use cs_drain_and_close() instead.	2020-12-11 11:09:29 +01:00
Willy Tarreau	7d7b11cf93	MINOR: mux-pt: take care of CS_SHR_DRAIN in shutr() When the shutr() requests CS_SHR_DRAIN and there's no particular shutr implemented on the underlying transport layer, we must drain pending data. This is what happens when cs_drain_and_close() is called. It is important for TCP checks to drain large responses and close cleanly.	2020-12-11 11:07:19 +01:00
Willy Tarreau	a5ea751922	MINOR: stream-int: don't touch polling anymore on shutdown Not only it's become totally useless with muxes, in addition it's dangerous to play with the mux's FD while shutting a stream down for writes. It's already done if necessary by the cs_shutw() code at the mux layer. Fortunately it doesn't seem to have any impact, most likely the polling updates used to immediately revert this operation.	2020-12-11 10:29:11 +01:00
Willy Tarreau	5a1d439225	CLEANUP: connection: use fd_stop_both() instead of conn_stop_polling() conn_stop_polling() in fact only calls fd_stop_both() after checking that the ctrl layer is ready. It's the case in conn_fd_check() so let's get rid of this next-to-last user of this function.	2020-12-11 09:56:53 +01:00
Remi Tricot-Le Breton	e3e1e5f34b	MINOR: cache: Dump secondary entries in "show cache" The duplicated entries (in case of vary) were not taken into account by the "show cache" command. They are now dumped too. A new "vary" column is added to the output. It contains the complete seocndary key (in hex format).	2020-12-10 15:59:49 +01:00
Willy Tarreau	29885f0308	MINOR: udp: export udp_suspend_receiver() and udp_resume_receiver() QUIC will rely on UDP at the receiver level, and will need these functions to suspend/resume the receivers. In the future, protocol chaining may simplify this.	2020-12-08 18:10:18 +01:00
Willy Tarreau	de471c4655	MINOR: protocol: add a set of ctrl_init/ctrl_close methods for setup/teardown Currnetly conn_ctrl_init() does an fd_insert() and conn_ctrl_close() does an fd_delete(). These are the two only short-term obstacles against using a non-fd handle to set up a connection. Let's have pur these into the protocol layer, along with the other connection-level stuff so that the generic connection code uses them instead. This will allow to define new ones for other protocols (e.g. QUIC). Since we only support regular sockets at the moment, the code was placed into sock.c and shared with proto_tcp, proto_uxst and proto_sockpair.	2020-12-08 15:50:56 +01:00
Willy Tarreau	b366c9a59a	CLEANUP: protocol: group protocol struct members by usage For the sake of an improved readability, let's group the protocol field members according to where they're supposed to be defined: - connection layer (note: for now even UDP needs one) - binding layer - address family - socket layer Nothing else was changed.	2020-12-08 14:58:24 +01:00
Willy Tarreau	b9b2fd7cf4	MINOR: protocol: export protocol definitions The various protocols were made static since there was no point in exporting them in the past. Nowadays with QUIC relying on UDP we'll significantly benefit from UDP being exported and more generally from being able to declare some functions as being the same as other protocols'. In an ideal world it should not be these protocols which should be exported, but the intermediary levels: - socket layer (sock.c only right now), already exported as functions but nothing structured at the moment ; - family layer (sock_inet, sock_unix, sockpair etc): already structured and exported - binding layer (the part that relies on the receiver): currently fused within the protocol - connectiong layer (the part that manipulates connections): currently fused within the protocol - protocol (connection's control): shouldn't need to be exposed ultimately once the elements above are in an easily sharable way.	2020-12-08 14:54:08 +01:00
Willy Tarreau	f9ad06cb26	MINOR: protocol: remove the redundant ->sock_domain field This field used to be needed before commit `2b5e0d8b6` ("MEDIUM: proto_udp: replace last AF_CUST_UDP* with AF_INET*") as it was used as a protocol entry selector. Since this commit it's always equal to the socket family's value so it's entirely redundant. Let's remove it now to simplify the protocol definition a little bit.	2020-12-08 12:13:54 +01:00
Christopher Faulet	c43fca0139	BUG/MINOR: stream: Don't use input buffer after the ownership xfer At the end of stream_new(), once the input buffer is transfer to the request channel, it must not be used anymore. The previous patch (`16df178b6` "BUG/MEDIUM: stream: Xfer the input buffer to a fully created stream") was pushed to quickly. No backport needed.	2020-12-04 17:22:50 +01:00
Christopher Faulet	16df178b6e	BUG/MEDIUM: stream: Xfer the input buffer to a fully created stream The input buffer passed as argument to create a new stream must not be transferred when the request channel is initialized because the channel flags are not set at this stage. In addition, the API is a bit confusing regarding the buffer owner when an error occurred. The caller remains the owner, but reading the code it is not obvious. So, first of all, to avoid any ambiguities, comments are added on the calling chain to make it clear. The buffer owner is the caller if any error occurred. And the ownership is transferred to the stream on success. Then, to make things simple, the ownership is transferred at the end of stream_new(), in case of success. And the input buffer is updated to point on BUF_NULL. Thus, in all cases, if the caller try to release it calling b_free() on it, it is not a problem. Of course, it remains the caller responsibility to release it on error. The patch fixes a bug introduced by the commit `26256f86e` ("MINOR: stream: Pass an optional input buffer when a stream is created"). No backport is needed.	2020-12-04 17:15:03 +01:00
William Lallemand	b7fdfdfd92	MEDIUM: ssl: fatal error with bundle + openssl < 1.1.1 Since HAProxy 2.3, OpenSSL 1.1.1 is a requirement for using a multi-certificate bundle in the configuration. This patch emits a fatal error when HAProxy tries to load a bundle with an older version of HAProxy. This problem was encountered by an user in issue #990. This must be backported in 2.3.	2020-12-04 15:45:02 +01:00
Willy Tarreau	d1f250f87b	MINOR: listener: now use a generic add_listener() function With the removal of the family-specific port setting, all protocol had exactly the same implementation of ->add(). A generic one was created with the name "default_add_listener" so that all other ones can now be removed. The API was slightly adjusted so that the protocol and the listener are passed instead of the listener and the port. Note that all protocols continue to provide this ->add() method instead of routinely calling default_add_listener() from create_listeners(). This makes sure that any non-standard protocol will still be able to intercept the listener addition if needed. This could be backported to 2.3 along with the few previous patches on listners as a pure code cleanup.	2020-12-04 15:08:00 +01:00
Willy Tarreau	07400c56bb	MINOR: listener: automatically set the port when creating listeners In create_listeners() we iterate over a port range and call the protocol's ->add() function to add a new listener on the specified port. Only tcp4/tcp6/udp4/udp6 support a port, the other ones ignore it. Now that we can rely on the address family to properly set the port, better do it this way directly from create_listeners() and remove the family-specific case from the protocol layer.	2020-12-04 15:08:00 +01:00
Willy Tarreau	73bed9ff13	MINOR: protocol: add a ->set_port() helper to address families At various places we need to set a port on an IPv4 or IPv6 address, and it requires casts that are easy to get wrong. Let's add a new set_port() helper to the address family to assist in this. It will be directly accessible from the protocol and will make the operation seamless. Right now this is only implemented for sock_inet as other families do not need a port.	2020-12-04 15:08:00 +01:00
Christopher Faulet	c31bc724d4	MINOR: h1-htx/http-ana: Set BODYLESS flag on message in TUNNEL state When a H1 message is parsed, if the parser state is switched to TUNNEL mode just after the header parsing, the BODYLESS flag is set on the HTX start-line. By transitivity, the corresponding flag is set on the message in HTTP analysers. Thus it is possible to rely on it to not wait for the request body.	2020-12-04 14:41:49 +01:00
Christopher Faulet	2a40854244	MINOR: http-ana: Properly set message flags from the start-line flags CNT_LEN and TE_CHNK flags must be set on the message only when the corresponding flag is set on the HTX start-line. Before, when the transfer length was known XFER_LEN set), the HTTP_MSGF_TE_CHNK was the default. But it is not appropriate. Now, it is only set if the message is chunked. Thus, it is now possible to have a known transfer length without CNT_LEN or TE_CHNK. In addition, the BODYLESS flags may be set, independently on XFER_LEN one.	2020-12-04 14:41:49 +01:00
Christopher Faulet	6ad06066cd	CLEANUP: connection: Remove CS_FL_READ_PARTIAL flag Since the recent refactoring of the H1 multiplexer, this flag is no more used. Thus it is removed.	2020-12-04 14:41:49 +01:00
Christopher Faulet	da831fa068	CLEANUP: http-ana: Remove TX_WAIT_NEXT_RQ unsued flag This flags is now unused. It was used in REQ_WAIT_HTTP analyser, when a stream was waiting for a request, to set the keep-alive timeout or to avoid to send HTTP errors to client.	2020-12-04 14:41:49 +01:00
Christopher Faulet	8bebd2fe52	MEDIUM: http-ana: Don't process partial or empty request anymore It is now impossible to start the HTTP request processing in the stream analysers with a partial or empty request message. The mux-h2 was already waiting of the request headers before creating the stream. Now the mux-h1 does the same. All errors (aborts, timeout or invalid requests) waiting for the request headers are now handled by the multiplexers. So there is no reason to still handle them in the REQ_WAIT_HTTP (http_wait_for_request) analyser. To ensure there is no ambiguity, a BUG_ON() was added to exit if a partial request is received in this analyser.	2020-12-04 14:41:49 +01:00
Christopher Faulet	2afd874704	CLEANUP: htx: Remove HTX_FL_UPGRADE unsued flag Now the H1 to H2 upgrade is handled before the stream creation. HTX_FL_UPGRADE flag is now unused.	2020-12-04 14:41:49 +01:00
Christopher Faulet	4a8779f808	MINOR: http-ana: Remove useless update of t_idle duration of the stream Becaues the stream is now created after the request headers parsing, the idle duration from the session is always up-to-date.	2020-12-04 14:41:49 +01:00
Christopher Faulet	3ced1d1db4	CLEANUP: mux-h1: Rename H1C_F_CS_* flags and reorder H1C flags H1C_F_CS_* flags are renamed into H1C_F_ST_*. They reflect the connection state. So "ST" is well suited. "CS" is confusing because it is also the abbreviation for conn-stream. In addition, H1C flags are reordered.	2020-12-04 14:41:49 +01:00
Christopher Faulet	c4bfa59f1d	MAJOR: mux-h1: Create the client stream as later as possible This is the reason for all previous patches. The conn-stream and the associated stream are created as later as possible. It only concerns the frontend connections. But it means the request headers, and possibly the first data block, are received and parsed before the conn-stream creation. To do so, an embryonic H1 stream, with no conn-stream, is created. The result of this "early parsing" is stored in its rx buffer, used to fill the request channel when the stream is created. During this step, some HTTP errors may be returned by the mux. It must also handle http-request/keep-alive timeouts. A significative change is about H1 to H2 upgrade. It happens very early now, and no H1 stream are created (and thus of course no conn-stream). The most important part of this patch is located to the h1_process() function. Because it must trigger the parsing when there is no H1 stream. h1_recv() function has also been simplified.	2020-12-04 14:41:49 +01:00
Christopher Faulet	c18fc234d9	MINOR: mux-h1: Add functions to send HTTP errors from the mux For now, this part is unsued. But this patch adds functions to handle errors on idle and embryonic H1 connections and send corresponding HTTP error messages to the client (400, 408 or 500). Thanks to previous patches, these functions take care to update the right stats counters, but also the counters tracked by the session. A field to store the HTTP error code has been added in the H1C structure. It is used for error retransmits, if any, and to get it in http logs. It is used to return the mux exit status code when the MUX_EXIT_STATUS ctl parameter is requested.	2020-12-04 14:41:49 +01:00
Christopher Faulet	ce5e6bcb04	MINOR: logs: Get the multiplexer exist status when no stream is provided When a log message is emitted from the session level, by a multiplexer, there is no stream. Thus for HTTP session, there no status code and the termination flags are not correctly set. Thanks to previous patch, the HTTP status code is deduced from the mux exist status, using the MUX_EXIT_STATE ctl param. This is only done for HTTP frontends. If it is defined ( != 0), it is used to deduce the termination flags.	2020-12-04 14:41:49 +01:00
Christopher Faulet	4c8ad84232	MINOR: mux: Add a ctl parameter to get the exit status of the multiplexers The ctl param MUX_EXIT_STATUS can be request to get the exit status of a multiplexer. For instance, it may be an HTTP status code or an H2 error. For now, 0 is always returned. When the mux h1 will be able to return HTTP errors itself, this ctl param will be used to get the HTTP status code from the logs. the mux_exit_status enum has been created to map internal mux exist status to generic one. Thus there is 5 possible status for now: success, invalid error, timeout error, internal error and unknown.	2020-12-04 14:41:49 +01:00
Christopher Faulet	84600631cd	MINOR: stick-tables: Add functions to update some values of a tracked counter The cumulative numbers of http requests, http errors, bytes received and sent and their respective rates for a tracked counters are now updated using specific stream independent functions. These functions are used by the stream but the aim is to allow the session to do so too. For now, there is no reason to perform these updates from the session, except from the mux-h2 maybe. But, the mux-h1, on the frontend side, will be able to return some errors to the client, before the stream creation. In this case, it will be mandatory to update counters tracked at the session level.	2020-12-04 14:41:49 +01:00
Christopher Faulet	dbe57794c4	MINOR: mux-h1: Add a idle expiration date on the H1 connection An idle expiration date is added on the H1 connection with the function to set it depending on connection state. First, there is no idle timeout on backend connections, For idle frontend connections, the http-request or keep-alive timeout are used depending on which timeout is defined and if it is the first request or not. For embryonic connections, the http-request is always used, if defined. For attached or shutted down connections, no idle timeout is applied. For now the idle expiration date is never set and the h1_set_idle_expiration function remains unused.	2020-12-04 14:41:49 +01:00
Christopher Faulet	5d3c93cd43	MINOR: mux-h1: Process next request for IDLE connection only When the conn-stream is detached for a H1 connection, there is no reason to subscribe for reads or process pending input data if the connection is not idle. Because, it means a shutdown is pending.	2020-12-04 14:41:49 +01:00
Christopher Faulet	adcd789d92	MINOR: mux-h1: Rework h1_refresh_timeout to be easier to read Conditions to set a timeout on the H1C task have been simplified or at least changed to rely on H1 connection flags. Now, following rules are used : * the shutdown timeout is applied on dead (not alive) or shutted down connections. * The client/server timeout is applied if there are still some pending outgoing data. * The client timeout is applied on alive frontend connections with no conn-stream. It means on idle or embryionic frontend connections. * For all other connections (backend or attached connections), no timeout is applied. For frontend or backend attached connections, the timeout is handled by the application layer. For idle backend connections, there is no timeout.	2020-12-04 14:41:49 +01:00
Christopher Faulet	3c82d8b328	MINOR: mux-h1: Rework how shutdowns are handled We now only rely on one flag to notify a shutdown. The shutdown is performed at the connection level when there are no more pending outgoing data. So, it means it is performed immediately if the output buffer is empty. Otherwise it is deferred after the outgoing data are sent. This simplify a bit the mux because there is now only one flag to check.	2020-12-04 14:41:49 +01:00
Christopher Faulet	119ac870ce	MINOR: mux-h1: Disable reads if an error was reported on the H1 stream Don't try to read more data if a parsing or a formatting error was reported on the H1 stream. There is no reason to continue to process the messages for the current connection in this case. If a parsing error occurs, it means the input is invalid. If a formatting error occurs, it is an internal error and it is probably safer to give up.	2020-12-04 14:41:49 +01:00
Christopher Faulet	295b8d1649	MINOR: mux-h1: Reset more H1C flags when a H1 stream is destroyed When a H1 stream is destroyed, all dynamic flags on the H1 connection are reset to be sure to leave it in a clean state.	2020-12-04 14:41:49 +01:00
Christopher Faulet	c1c66a4759	MINOR: mux-h1: rework the h1_timeout_task() function Mainly to make it easier to read. First of all, when a H1 connection is still there, we check if the connection was stolen by another thread or not. If yes we release the task and leave. Then we check if the task is expired or not. Only expired tasks are considered. Finally, if a conn-stream is still attached to the connection (H1C_F_CS_ATTACHED flag set), we return. Otherwise, the task and the H1 connection are released.	2020-12-04 14:41:48 +01:00
Christopher Faulet	bb8baf477d	MINOR: mux-h1: Add embryonic and attached states on the H1 connection Be prepared to have a H1 connection in one of the following states : * A H1 connection waiting for a new message with no H1 stream. H1C_F_CS_IDLE flag is set. * A H1 connection processing a new message with a H1 stream but no conn-stream attached. H1C_F_CS_EMBRYONIC flag is set * A H1 connection with a H1 stream and a conn-stream attached. H1C_F_CS_ATTACHED flag is set. * A H1 connection with no H1 stream, waiting to be released. No flag is set. These flags are mutually exclusives. When none is set, it means the connection will be released ASAP, just remaining outgoing data must be sent before. For now, the second state (H1C_F_CS_EMBRYONIC) is transient.	2020-12-04 14:41:48 +01:00
Christopher Faulet	a583af6333	MINOR: mux-h1: Don't set CS flags in internal parsing functions Now, only h1_process_input() function set or unset the conn-stream flags. This way, internal parsing functions don't rely anymore on the conn-stream.	2020-12-04 14:41:48 +01:00
Christopher Faulet	d17ad8214f	MINOR: mux-h1: Add a rxbuf into the H1 stream For now this buffer is not used. But it will be used to parse the headers, and possibly the first block of data, when no stream is attached to the H1 connection. The aim is to use it to create the stream, thanks to recent changes on the streams creation api.	2020-12-04 14:41:48 +01:00
Christopher Faulet	2f0ec66613	MINOR: mux-h1: Split front/back h1 stream creation in 2 functions Dedicated functions are now used to create frontend and backend H1 streams. h1c_frt_stream_new() is now used to create frontend H1 streams and h1c_bck_stream_new() to create backend ones. Both rely on h1s_new() function to allocate the stream itself. It is a bit easier to add specific processing depending we are on the frontend or the backend side.	2020-12-04 14:41:48 +01:00
Christopher Faulet	60ef12c80b	MINOR: mux-h1: Separate parsing and formatting errors at H1 stream level Instead of using H1S flags to report an error on the request or the response, independently it is a parsing or a formatting error, we now use a flag to report parsing errors and another one to report formatting ones. This simplify the message parsing. It is also easier to figure out what error happened when one of this flag is set. The side may be deduced checking the H1C_F_IS_BACK flag.	2020-12-04 14:41:48 +01:00
Christopher Faulet	0a799aa3d6	MINOR: mux-h1: Introduce H1C_F_IS_BACK flag on the H1 connection This flag is only set on the backend side and is tested instead of calling conn_is_back() function.	2020-12-04 14:41:48 +01:00
Christopher Faulet	ae635766f6	MEDIUM: mux-h1: Use a h1c flag to block reads when splicing is in-progress Instead of using 2 flags on the H1 stream (H1S_F_BUF_FLUSH and H1S_F_SPLICED_DATA), we now only use one flag on the H1 connection (H1C_F_WANT_SPLICE) to notify we want to use splicing or we are using splicing. This flag blocks the calls to rcv_buf() connection callback. It is a bit easier to set the H1 connection capability to receive data in its input buffer instead of relying on the H1 stream.	2020-12-04 14:41:48 +01:00
Christopher Faulet	089acd5b0d	MINOR: mux-h1: Add a flag to disable reads to wait opposite side H1C_F_WAIT_OPPOSITE must be set on the H1 conenction to don't read more data because we must be sync with the opposite side. This flag replaces the H1C_F_IN_BUSY flag. Its name is a bit explicit. It is automatically set on the backend side when the mux is created. It is safe to do so because at this stage, the request has not yet been sent to the server. This way, in h1_recv_allowed(), a test on this flag is enough to block the reads instead of testing the H1 stream state on the backend side.	2020-12-04 14:41:48 +01:00
Christopher Faulet	26256f86e1	MINOR: stream: Pass an optional input buffer when a stream is created It is now possible to set the buffer used by the channel request buffer when a stream is created. It may be useful if input data are already received, instead of waiting the first call to the mux rcv_buf() callback. This change is mandatory to support H1 connection with no stream attached. For now, the multiplexers don't pass any buffer. BUF_NULL is thus used to call stream_create_from_cs().	2020-12-04 14:41:48 +01:00
Christopher Faulet	3b536a3131	MINOR: mux-h1: Don't provide anymore timing info using cs_info structure The cs_info are now unused. The stream uses the session to get these info. So we can safely remove it from the mux-h1.	2020-12-04 14:41:48 +01:00
Christopher Faulet	15e525f495	MINOR: stream: Don't retrieve anymore timing info from the mux csinfo These info are only provided by the mux-h1. But, thanks to previous patches, we can get them from the session directly. There is no need to retrieve them from the mux anymore.	2020-12-04 14:41:48 +01:00
Christopher Faulet	7a6c513246	MINOR: stream: Always get idle duration from the session Since the idle duration provided by the session is always up-to-date, there is no more reason to rely on the multiplexer cs_info to set it to the stream.	2020-12-04 14:41:48 +01:00
Christopher Faulet	dd78921c66	MINOR: logs: Use session idle duration when no stream is provided When a log message is emitted from the session, using sess_log() function, there is no stream available. In this case, instead of deducing the idle duration from the accept date, we use the one provided by the session. 0 is used if it is undefined (i.e set to -1).	2020-12-04 14:41:48 +01:00
Christopher Faulet	42849b047a	MINOR: mux-h1: Reset session dates and durations info when the CS is detached These info are reset for the next transaction, if the connection is kept alive. From the stream point of view, it should be the same a new connection, except there is no handshake. Thus the handshake duration is set to 0.	2020-12-04 14:41:48 +01:00
Christopher Faulet	4e74155466	MINOR: mux-h1: Update session idle duration when data are received The session idle duration is set if not already done when data are received. For now, this value is still unused.	2020-12-04 14:41:48 +01:00
Christopher Faulet	d517396f8e	MINOR: session: Add the idle duration field into the session The idle duration between two streams is added to the session structure. It is not necessarily pertinent on all protocols. In fact, it is only defined for H1 connections. It is the duration between two H1 transactions. But the .get_cs_info() callback function on the multiplexers only exists because this duration is missing at the session level. So it is a simplification opportunity for a really low cost. To reduce the cost, a hole in the session structure is filled by moving .srv_list field at the end of the structure.	2020-12-04 14:41:48 +01:00
Christopher Faulet	268c92e2f8	BUG/MINOR: mux-h1: Handle keep-alive timeout for idle frontend connections IDLE frontend connections have no stream attached. The stream is only created when new data are received, when the parsing of the next request starts. Thus the keep-alive timeout, handled into the HTTP analysers, is not considered while nothing is received. But this is especially when this timeout must be considered. Concretely the http-keep-alive is ignored while no data are received. Only the client timeout is used. It will only be considered on incomplete requests, if the http-request timeout is not set. To fix the bug, the http-keep-alive timeout must be handled at the mux level, for IDLE frontend connection only. This patch should fix the issue #984. It must be backported as far as 2.2. On prior versions, the stream is created earlier. So, it is not a problem, except if this behavior changes of course (it was an optim of the 2.2, but don't remember the commit).	2020-12-04 14:41:48 +01:00
Willy Tarreau	7da02dd308	BUG/MINOR: listener: use sockaddr_in6 for IPv6 A copy-paste bug between {tcp,udp}{4,6}_add_listener() resulted in using a struct sockaddr_in to set the TCP/UDP port while it ought to be a struct sockaddr_in6. Fortunately, the port has the same offset (2) in both so it was harmless. A cleaner way to proceed would be to have a set_port function exported by the address family layer. This needs to be backported to 2.3.	2020-12-04 14:28:23 +01:00
Willy Tarreau	186f37674c	BUG/MINOR: lua-thread: close all states on deinit It seems to me that lua_close() must be called on all states at deinit time, not just the first two ones. This is likely a remnant of commit `59f11be43` ("MEDIUM: lua-thread: Add the lua-load-per-thread directive"). There should likely be some memory leak reports when using Lua without this fix, though none were observed for now. No backport is needed as this was merged into 2.4-dev.	2020-12-04 12:00:11 +01:00
Thierry Fournier	aafc777854	BUG/MEDIUM: lua-thread: some parts must be initialized once Lua dedicated TCP, HTTP and SSL socket and proxies must be initialized once. Right now, they are initialized from the Lua init state, but since commit `59f11be43` ("MEDIUM: lua-thread: Add the lua-load-per-thread directive") this function is called one time per lua context. This caused some fields to be cleared and overwritten, and pre-allocated object to be lost. This is why the address sanitizer detected memory leaks from the socket_ssl server initialization. Let's move all the state-independent part of the function to the hlua_init() function to avoid this. No backport is needed, this is only 2.4-dev.	2020-12-04 11:55:05 +01:00
Remi Tricot-Le Breton	51058d64a6	MINOR: cache: Consider invalid Age values as stale Do not store responses that have an invalid age header (non numerical, negative ...).	2020-12-04 10:21:56 +01:00
Remi Tricot-Le Breton	72cffaf440	MEDIUM: cache: Remove cache entry in case of POST on the same resource In case of successful unsafe method on a stored resource, the cached entry must be invalidated (see RFC7234#4.4). A "non-error response" is one with a 2xx (Successful) or 3xx (Redirection) status code. This implies that the primary hash must now be calculated on requests that have an unsafe method (POST or PUT for instance) so that we can disable the corresponding entries when we process the response.	2020-12-04 10:21:56 +01:00
Remi Tricot-Le Breton	fcea374fdf	MINOR: cache: Add extra "cache-control" value checks The Cache-Control max-age and s-maxage directives should be followed by a positive numerical value (see RFC 7234#5.2.1.1). According to the specs, a sender "should not" generate a quoted-string value but we will still accept this format.	2020-12-04 10:21:56 +01:00
Remi Tricot-Le Breton	795e1412b0	MINOR: cache: Do not store stale entry When a response has an Age header (filled in by another cache on the message's path) that is greater than its defined maximum age (extracted either from cache-control directives or an expires header), it is already stale and should not be cached.	2020-12-04 10:21:56 +01:00
David Carlier	2d0493af49	BUILD/MINOR: haproxy DragonFlyBSD affinity build update. sched_setaffinity supported by this platform.	2020-12-02 22:43:57 +01:00
Thierry Fournier	46278ff828	MINOR: lua-thread: Add verbosity in errors Because lua-load-per-thread could not load the same code for each thread, this patch displays the state-id associated with the error.	2020-12-02 21:53:16 +01:00
Thierry Fournier	59f11be436	MEDIUM: lua-thread: Add the lua-load-per-thread directive The goal is to allow execution of one main lua state per thread. This patch contains the main job. The lua init is done using these steps: - "lua-load-per-thread" loads the lua code in the first thread - it creates the structs - it stores loaded files - the 1st step load is completed (execution of hlua_post_init) and now, we known the number of threads - we initilize lua states for all remaining threads - for each one, we load the lua file - for each one, we execute post-init Once all is loaded, we control consistency of functions references. The rules are: - a function reference cannot be in the shared lua state and in a per-thread lua state at the same time. - if a function reference is declared in a per-thread lua state, it must be declared in all per-thread lua states	2020-12-02 21:53:16 +01:00
Thierry Fournier	c749259dff	MINOR: lua-thread: Store each function reference and init reference in array The goal is to allow execution of one main lua state per thread. The array introduces storage of one reference per thread, because each lua state can have different reference id for a same function. A function returns the preferred state id according to configuration and current thread id.	2020-12-02 21:53:16 +01:00
Thierry Fournier	021d986ecc	MINOR: lua-thread: Replace state_from by state_id The goal is to allow execution of one main lua state per thread. "state_from" is a pointer to the parent lua state. "state_id" is the index of the parent state id in the reference lua states array. "state_id" is better because the lock is a "== 0" test which is quick than pointer comparison. In other way, the state_id index could index other things the the Lua state concerned. I think to the function references.	2020-12-02 21:53:16 +01:00
Thierry Fournier	62a22aa23f	MINOR: lua-thread: Replace "struct hlua_function" allocation by dedicated function The goal is to allow execution of one main lua state per thread. This function will initialize the struct with other things than 0. With this function helper, the initialization is centralized and it prevents mistakes. This patch also keeps a reference to each declared function in a list. It will be useful in next patches to control consistency of declared references.	2020-12-02 21:53:16 +01:00
Thierry Fournier	afc63e2cb1	MINOR: lua-thread: Replace global gL var with an array of states The goal is to allow execution of one main lua state per thread. The array of states is initialized at the max number of thread +1. We define the index 0 is the common state shared by all threads and should be locked. Other index index are dedicated to each one thread. The old gL now becomes hlua_states[0].	2020-12-02 21:53:16 +01:00
Thierry Fournier	7cbe5046e8	MEDIUM: lua-thread: Apply lock only if the parent state is the main thread The goal is to allow execution of one main lua state per thread. This patch opens the way to addition of a per-thread dedicated lua state. By passing the hlua we can figure the original state that's been used and decide to lock or not.	2020-12-02 21:53:16 +01:00
Thierry Fournier	3c539327f4	MEDIUM: lua-thread: No longer use locked context in initialization parts The goal is to allow execution of one main lua state per thread. Stop using locks in init part, we will use only in parts where the parent lua state is known, so we could take decision about lock according with the lua parent state.	2020-12-02 21:53:16 +01:00
Thierry Fournier	ecb83c24c4	MINOR: lua-thread: Add the "thread" core variable The goal is to allow execution of one main lua state per thread. This commit introduces this variable in the core. Lua state initialized by thread will have access to this variable, which reports the executing thread. 0 indicates the shared thread. Programs which must be executed only once can check for core.thread <= 1.	2020-12-02 21:53:16 +01:00
Thierry Fournier	b8cef175bd	MINOR: lua-thread: Split hlua_post_init() function in two parts The goal is to allow execution of one main lua state per thread. This function will be called for each initialized lua state, so one per thread. The split transforms the lua state variable from global to local.	2020-12-02 21:53:16 +01:00
Thierry Fournier	c93c15cf8c	MINOR: lua-thread: Split hlua_load function in two parts The goal is to allow execution of one main lua state per thread. This function will be called once per thread, using different Lua states. This patch prepares the work.	2020-12-02 21:53:16 +01:00
Thierry Fournier	75fc02956b	MINOR: lua-thread: make hlua_ctx_init() get L from its caller The goal is to allow execution of one main lua state per thread. The function hlua_ctx_init() now gets the original lua state from its caller. This allows the initialisation of lua_thread (coroutines) from any master lua state. The parent lua state is stored in the hlua struct. This patch is a temporary transition, it will be modified later.	2020-12-02 21:53:16 +01:00
Thierry Fournier	1eac28f5fc	MINOR: lua-thread: Split hlua_init() function in two parts The goal is to allow execution of one main lua state per thread. This is a preparative work in order to init more than one stack in the lua-thread objective.	2020-12-02 21:53:16 +01:00
Thierry Fournier	ad5345fed7	MINOR: lua-thread: Replace embedded struct hlua_function by a pointer The goal is to allow execution of one main lua state per thread. Because this struct will be filled after the configuration parser, we cannot copy the content. The actual state of the Haproxy code doesn't justify this change, it is an update preparing next steps.	2020-12-02 21:53:16 +01:00
Thierry Fournier	92689e651e	MINOR: lua-thread: Stop usage of struct hlua for the global lua state The goal is to no longer use "struct hlua" with global main lua_state. The usage of the "struct hlua" is no longer required. This patch replaces this struct by another one. Now, the usage of runtime Lua phase is separated from the start lua phase.	2020-12-02 21:53:16 +01:00
Thierry Fournier	4234dbd03b	MINOR: lua-thread: Use NULL context for main lua state The goal is to no longer use "struct hlua" with global main lua_state. This patch returns NULL value when some code tries go get the hlua struct associated with a task through hlua_gethlua(). This functions is useful only during runtime because the struct hlua contains only runtime states. Some Lua functions allowed to yield are called from init environment. I'm not sure this is a good practice. Maybe it will be clever to disallow calling this kind of functions.	2020-12-02 21:53:16 +01:00
Thierry Fournier	9eb3230b7c	MINOR: lua-thread: hlua_ctx_renew() is never called with main gL lua state The goal is no longer using "struct hlua" with global main lua_state. if somewhere in the code, hlua_ctx_renew() is called with a global Lua context, we have a serious bug. A crash is better than working with this bug, so this patch remove a useless control. In other way, this control were used during hlua_post_init() function. The function hlua_post_init() used a call to the runtime hlua_ctx_resume() function. This call no longer exists.	2020-12-02 21:53:16 +01:00
Thierry Fournier	670db24329	MEDIUM: lua-thread: make hlua_post_init() no longer use the runtime execution function The goal is to no longer use "struct hlua" with global main lua_state. The hlua_post_init() is executed during start phase, it does not require yielding nor any advanced runtime error processing. Let's simplify this by re-implementing the code using lower-level functions which directly take a state and not an hlua anymore.	2020-12-02 21:53:16 +01:00
Thierry Fournier	3fb9e5133a	MINOR: lua-thread: remove struct hlua from function hlua_prepend_path() The goal is to no longer use "struct hlua" with global main lua_state and directly take the state instead. This patch removes the implicit dependency to this struct with the function hlua_prepend_path()	2020-12-02 21:53:16 +01:00
Willy Tarreau	cdb53465f4	MEDIUM: lua-thread: use atomics for memory accounting Let's switch memory accounting to atomics so that the allocator function may safely be used from concurrent Lua states. Given that this function is extremely hot on the call path, we try to optimize it for the most common case, which is: - no limit - there's enough memory The accounting is what is particuarly expensive in threads since all CPUs compete for a cache line, so when the limit is not used, we don't want to use accounting. However we need to preserve it during the boot phase until we may parse a "tune.lua.maxmem" value. For this, we turn the unlimited "0" value to ~0 at the end of the boot phase to mark the definite end of accounting. The function then detects this value and directly jumps to realloc() in this case. When the limit is enforced however, we use a CAS to check and reserve our share of memory, and we roll back on failure. The CAS is used both for increments and decrements so that a single operation is enough to update the counters.	2020-12-02 21:53:16 +01:00
Willy Tarreau	d36c7fa5ec	MINOR: lua: simplify hlua_alloc() to only rely on realloc() The function really has the semantics of a realloc() except that it also passes the old size to help with accounting. No need to special case the free or malloc, realloc does everything we need.	2020-12-02 21:53:16 +01:00
Emeric Brun	fdabf49548	BUG/MAJOR: ring: tcp forward on ring can break the reader counter. If the session is not established, the applet handler could leave with the applet detached from the ring. At next call, the attach counter will be decreased again causing unpredectable behavior. This patch should be backported on branches >=2.2	2020-12-02 20:17:19 +01:00
Fr�d�ric L�caille	fd1831499e	BUG/MINOR: trace: Wrong displayed trace level With commit `a1f12746b` ("MINOR: traces: add a new level "error" below the "user" level") a new trace level was inserted, resulting in shifting all exiting ones by one. But the levels reported in the __trace() function were not updated accordingly, resulting in the TRACE_LEVEL_DEVELOPER not to be properly reported anymore. This patch fixes it by extending the number of levels to 6. No backport is needed.	2020-12-02 17:44:40 +01:00
Remi Tricot-Le Breton	3243447f83	MINOR: cache: Add entry to the tree as soon as possible When many concurrent requests targeting the same resource were seen, the cache could sometimes be filled by too many partial responses resulting in the impossibility to cache a single one of them. This happened because the actual tree insertion happened only after all the payload of every response was seen. So until then, every response was added to the cache because none of the streams knew that a similar request/response was already being treated. This patch consists in adding the cache_entry as soon as possible in the tree (right after the first packet) so that the other responses do not get cached as well (if they have the same primary key). A "complete" flag is also added to the cache_entry so that we know if all the payload is already stored in the entry or if it is still being processed.	2020-12-02 16:38:42 +01:00
Remi Tricot-Le Breton	8bb72aa82f	MINOR: cache: Improve accept_encoding_normalizer Turn the "Accept-Encoding" value to lower case before processing it. Calculate the CRC on every token instead of a sorted concatenation of them all (in order to avoir copying them) then XOR all the CRCs into a single hash (while ignoring duplicates).	2020-12-02 16:32:54 +01:00
Thierry Fournier	f67442efdb	BUG/MINOR: lua: warn when registering action, conv, sf, cli or applet multiple times Lua allows registering multiple sample-fetches, converters, action, cli, applet/services with the same name. This is absolutely useless since only the first registration will be used. This patch sends a warning if the case is encountered. This pach could be backported until 1.8, with the 3 associated patches: - MINOR: actions: Export actions lookup functions - MINOR: actions: add a function returning a service pointer from its name - MINOR: cli: add a function to look up a CLI service description	2020-12-02 09:45:18 +01:00
Thierry Fournier	a51a1fd174	MINOR: cli: add a function to look up a CLI service description This function will be useful to check if the keyword is already registered. Also add a define for the max number of args. This will be needed by a next patch to fix a bug and will have to be backported.	2020-12-02 09:45:18 +01:00
Thierry Fournier	87e539906b	MINOR: actions: add a function returning a service pointer from its name This function simply calls action_lookup() on the private service_keywords, to look up a service name. This will be used to detect double registration of a same service from Lua. This will be needed by a next patch to fix a bug and will have to be backported.	2020-12-02 09:45:18 +01:00
Thierry Fournier	7a71a6d9d2	MINOR: actions: Export actions lookup functions These functions will be useful to check if a keyword is already registered. This will be needed by a next patch to fix a bug, and will need to be backported.	2020-12-02 09:45:18 +01:00
Thierry Fournier	2f05cc6f86	BUG/MINOR: lua: Some lua init operation are processed unsafe Operation luaL_openlibs() and lua_prepend path are processed whithout the safe context, so in case of failure Haproxy aborts or stops without error message. This patch could be backported until 1.8	2020-12-02 09:45:18 +01:00
Thierry Fournier	13d08b73eb	BUG/MINOR: lua: Post init register function are not executed beyond the first one Just because if the first init is a success we return success in place of continuing the loop. This patch could be backported until 1.8	2020-12-02 09:45:18 +01:00
Thierry Fournier	77a88943d6	BUG/MINOR: lua: lua-load doesn't check its parameters "lua-load" doesn't check if the expected parameter is present. It tries to open() directly the argument at second position. So if the filename is omitted, it tries to load an empty filename. This patch could be backported until 1.8	2020-12-02 09:42:43 +01:00
Thierry Fournier	de6145f747	BUG/MINOR: lua: missing "\n" in error message Just replace ".n" by "\n" This could be backported until 1.9, but it is not so important.	2020-12-02 09:31:33 +01:00
Willy Tarreau	f965b2ad13	BUG/MINOR: mux-h2/stats: not all GOAWAY frames are errors The stats on haproxy.org reported ~12k GOAWAY for ~34k connections, with only 2 protocol errorss. It turns out that the GOAWAY frame counter added in commit `a8879238c` ("MINOR: mux-h2: report detected error on stats") matches a bit too many situations. First it counts those which are not sent as well as failed retries, second it counts as errors the cases of attempts to cleanly close, while it's titled "GOAWAY sent on detected error". Let's address this by moving the counter up one line and excluding the clean codes. This can be backported to 2.3.	2020-12-01 10:47:18 +01:00
Willy Tarreau	5dd36ac8a0	MINOR: mux-h2/trace: add traces at level ERROR for protocol errors A number of traces could be added, and a few TRACE_PROTO were replaced with TRACE_ERROR. The goal is to be able to enable error tracing only to detect anomalies. It looks like they're mostly correct as they don't seem to strike on valid H2 traffic but are very verbose on h2spec.	2020-12-01 10:30:37 +01:00
Willy Tarreau	a1f12746b1	MINOR: traces: add a new level "error" below the "user" level Sometimes it would be nice to be able to only trace abnormal events such as protocol errors. Let's add a new "error" level below the "user" level for this. This will allow to add TRACE_ERROR() at various error points and only see them.	2020-12-01 10:25:20 +01:00
Willy Tarreau	a307528fe2	BUG/MINOR: mux-h2/stats: make stream/connection proto errors more accurate Since commit `a8879238c` ("MINOR: mux-h2: report detected error on stats") we now have some error stats on stream/connection level protocol errors, but some were improperly marked as stream while they're connection, and 2 or 3 relevant ones were missing and have now been added. This could be backported to 2.3.	2020-12-01 10:25:20 +01:00
Maciej Zdeb	fcdfd857b3	MINOR: log: Logging HTTP path only with %HPO This patch adds a new logging variable '%HPO' for logging HTTP path only (without query string) from relative or absolute URI. For example: log-format "hpo=%HPO hp=%HP hu=%HU hq=%HQ" GET /r/1 HTTP/1.1 => hpo=/r/1 hp=/r/1 hu=/r/1 hq= GET /r/2?q=2 HTTP/1.1 => hpo=/r/2 hp=/r/2 hu=/r/2?q=2 hq=?q=2 GET http://host/r/3 HTTP/1.1 => hpo=/r/3 hp=http://host/r/3 hu=http://host/r/3 hq= GET http://host/r/4?q=4 HTTP/1.1 => hpo=/r/4 hp=http://host/r/4 hu=http://host/r/4?q=4 hq=?q=4	2020-12-01 09:32:44 +01:00
Emeric Brun	0237c4e3f5	BUG/MEDIUM: local log format regression. Since 2.3 default local log format always adds hostame field. This behavior change was due to log/sink re-work, because according to rfc3164 the hostname field is mandatory. This patch re-introduce a legacy "local" format which is analog to rfc3164 but with hostname stripped. This is the new default if logs are generated by haproxy. To stay compliant with previous configurations, the option "log-send-hostname" acts as if the default format is switched to rfc3164. This patch addresses the github issue #963 This patch should be backported in branches >= 2.3.	2020-12-01 06:58:42 +01:00
Willy Tarreau	4d6c594998	BUG/MEDIUM: task: close a possible data race condition on a tasklet's list link In issue #958 Ashley Penney reported intermittent crashes on AWS's ARM nodes which would not happen on x86 nodes. After investigation it turned out that the Neoverse N1 CPU cores used in the Graviton2 CPU are much more aggressive than the usual Cortex A53/A72/A55 or any x86 regarding memory ordering. The issue that was triggered there is that if a tasklet_wakeup() call is made on a tasklet scheduled to run on a foreign thread and that tasklet is just being dequeued to be processed, there can be a race at two places: - if MT_LIST_TRY_ADDQ() happens between MT_LIST_BEHEAD() and LIST_SPLICE_END_DETACHED() if the tasklet is alone in the list, because the emptiness tests matches ; - if MT_LIST_TRY_ADDQ() happens during LIST_DEL_INIT() in run_tasks_from_lists(), then depending on how LIST_DEL_INIT() ends up being implemented, it may even corrupt the adjacent nodes while they're being reused for the in-tree storage. This issue was introduced in 2.2 when support for waking up remote tasklets was added. Initially the attachment of a tasklet to a list was enough to know its status and this used to be stable information. Now it's not sufficient to rely on this anymore, thus we need to use a different information. This patch solves this by adding a new task flag, TASK_IN_LIST, which is atomically set before attaching a tasklet to a list, and is only removed after the tasklet is detached from a list. It is checked by tasklet_wakeup_on() so that it may only be done while the tasklet is out of any list, and is cleared during the state switch when calling the tasklet. Note that the flag is not set for pure tasks as it's not needed. However this introduces a new special case: the function tasklet_remove_from_tasklet_list() needs to keep both states in sync and cannot check both the state and the attachment to a list at the same time. This function is already limited to being used by the thread owning the tasklet, so in this case the test remains reliable. However, just like its predecessors, this function is wrong by design and it should probably be replaced with a stricter one, a lazy one, or be totally removed (it's only used in checks to avoid calling a possibly scheduled event, and when freeing a tasklet). Regardless, for now the function exists so the flag is removed only if the deletion could be done, which covers all cases we're interested in regarding the insertion. This removal is safe against a concurrent tasklet_wakeup_on() since MT_LIST_DEL() guarantees the atomic test, and will ultimately clear the flag only if the task could be deleted, so the flag will always reflect the last state. This should be carefully be backported as far as 2.2 after some observation period. This patch depends on previous patch "MINOR: task: remove __tasklet_remove_from_tasklet_list()".	2020-11-30 18:17:59 +01:00
Willy Tarreau	2da4c316c2	MINOR: task: remove __tasklet_remove_from_tasklet_list() This function is only used at a single place directly within the scheduler in run_tasks_from_lists() and it really ought not be called by anything else, regardless of what its comment says. Let's delete it, move the two lines directly into the call place, and take this opportunity to factor the atomic decrement on tasks_run_queue. A comment was added on the remaining one tasklet_remove_from_tasklet_list() to mention the risks in using it.	2020-11-30 18:17:44 +01:00
Willy Tarreau	c309dbdd99	MINOR: task: perform atomic counter increments only once per wakeup In process_runnable_tasks(), we walk the run queue and pick tasks to insert them into the local list. And for each of these operations we perform a few increments, some of which are atomic, and they're even performed under the runqueue's lock. This is useless inside the loop, better do them at the end, since we don't use these values inside the loop and they're not used anywhere else either during this time. The only one is task_list_size which is accessed in parallel by other threads performing remote tasklet wakeups, but it's already approximative and is used to decide to get out of the loop when the limit is reached. So now we compute it first as an initial budget instead.	2020-11-30 18:17:44 +01:00
Willy Tarreau	a868c2920b	MINOR: task: remove tasklet_insert_into_tasklet_list() This function is only called at a single place and adds more confusion than it removes. It also makes one think it could be used outside of the scheduler while it must absolutely not. Let's just move its two lines to the call place, making the code more readable there. In addition this clearly shows that the preliminary LIST_INIT() is useless since the entry is immediately overwritten.	2020-11-30 18:17:44 +01:00
Willy Tarreau	8a069eb9a4	MINOR: debug: add a trivial PRNG for scheduler stress-tests Commit `a5a447984` ("MINOR: debug: add "debug dev sched" to stress the scheduler.") doesn't scale with threads because ha_random64() takes care of being totally thread-safe for use with UUIDs. We don't need this for the stress-testing functions, let's just implement a xorshift PRNG instead. On 8 threads the performance jumped from 230k ctx/s with 96% spent in ha_random64() to 14M ctx/s.	2020-11-30 17:07:32 +01:00
Willy Tarreau	a5a4479849	MINOR: debug: add "debug dev sched" to stress the scheduler. This command supports starting a bunch of tasks or tasklets, either on the current thread (mask=0), all (default), or any set, either single-threaded or multi-threaded, and possibly auto-scheduled. These tasks/tasklets will randomly pick another one to wake it up. The tasks only do it 50% of the time while tasklets always wake two tasks up, in order to achieve roughly 50% load (since the target might already be woken up).	2020-11-29 17:43:07 +01:00
Christopher Faulet	a9ffc41637	BUG/MINOR: http-fetch: Fix smp_fetch_body() when called from a health-check res.body may be called from a health-check. It is probably never used. But it is possibe. In such case, there is no channel. Thus we must not use it unconditionally to set the flag SMP_F_MAY_CHANGE on the smp. Now the condition test the channel first. In addtion, the flag is not set if the payload is fully received. This patch must be backported as far as 2.2.	2020-11-27 10:30:23 +01:00
Christopher Faulet	83662b5431	MINOR: tcpcheck: Add support of L7OKC on expect rules error-status argument L7OKC may now be used as an error status for an HTTP/TCP expect rule. Thus it is for instance possible to write: option httpchk GET /isalive http-check expect status 200,404 http-check expect status 200 error-status L7OKC It is more or less the same than the disable-on-404 option except that if a DOWN is up again but still replying a 404 will be set to NOLB state. While it will stay in DOWN state with the disable-on-404 option.	2020-11-27 10:30:23 +01:00
Christopher Faulet	1e527cbf53	MINOR: check: Always increment check health counter on CONPASS Regarding the health counter, a check finished with the CONDPASS result is now the same than with the PASSED result: The health counter is always incemented. Before, it was only performed is the health counter was not 0. There is no change for the disable-on-404 option because it is only evaluated for running or stopping servers. So with an health check counter greater than 0. But it will make possible to handle (STOPPED -> STOPPING) transition for servers.	2020-11-27 10:30:23 +01:00
Christopher Faulet	97b7bdfcf7	REORG: tcpcheck: Move check option parsing functions based on tcp-check The parsing of the check options based on tcp-check rules (redis, spop, smtp, http...) are moved aways from check.c. Now, these functions are placed in tcpcheck.c. These functions are only related to the tcpcheck ruleset configured on a proxy and not to the health-check attached to a server.	2020-11-27 10:30:23 +01:00
Christopher Faulet	f8c869bac4	MINOR: config: Add a warning if tune.chksize is used This option is now deprecated. It is recent, but it is now marked as deprecated as far as 2.2. Thus, there is now a warning in the 2.4 if this option is still used. It will be removed in 2.5. Becaue the 2.3 is quite new, this patch may be backported to 2.3.	2020-11-27 10:30:23 +01:00
Christopher Faulet	bb9fb8b7f8	MINOR: config: Deprecate and ignore tune.chksize global option This option is now ignored because I/O check buffers are now allocated using the buffer pool. Thus, it is marked as deprecated in the documentation and ignored during the configuration parsing. The field is also removed from the global structure. Because this option is ignored since a recent fix, backported as fare as 2.2, this patch should be backported too. Especially because it updates the documentation.	2020-11-27 10:30:23 +01:00
Christopher Faulet	b1bb069c15	MINOR: tcpcheck: Don't handle anymore in-progress connect rules in tcpcheck_main The special handling of in-progress connect rules at the begining of tcpcheck_main() function can be removed. Instead, at the begining of the tcpcheck_eval_connect() function, we test is there is already an existing connection. In this case, it means we are waiting for a connection establishment. In addition, before evaluating a new connect rule, we take care to release any previous connection.	2020-11-27 10:29:41 +01:00
Christopher Faulet	b381a505c1	BUG/MAJOR: tcpcheck: Allocate input and output buffers from the buffer pool Historically, the input and output buffers of a check are allocated by hand during the startup, with a specific size (not necessarily the same than other buffers). But since the recent refactoring of the checks to rely exclusively on the tcp-checks and to use the underlying mux layer, this part is totally buggy. Indeed, because these buffers are now passed to a mux, they maybe be swapped if a zero-copy is possible. In fact, for now it is only possible in h2_rcv_buf(). Thus the bug concretely only exists if a h2 health-check is performed. But, it is a latent bug for other muxes. Another problem is the size of these buffers. because it may differ for the other buffer size, it might be source of bugs. Finally, for configurations with hundreds of thousands of servers, having 2 buffers per check always allocated may be an issue. To fix the bug, we now allocate these buffers when required using the buffer pool. Thus not-running checks don't waste memory and muxes may swap them if possible. The only drawback is the check buffers have now always the same size than buffers used by the streams. This deprecates indirectly the "tune.chksize" global option. In addition, the http-check regtest have been update to perform some h2 health-checks. Many thanks to @VigneshSP94 for its help on this bug. This patch should solve the issue #936. It relies on the commit "MINOR: tcpcheck: Don't handle anymore in-progress send rules in tcpcheck_main". Both must be backport as far as 2.2. bla	2020-11-27 10:29:41 +01:00
Christopher Faulet	39066c2738	MINOR: tcpcheck: Don't handle anymore in-progress send rules in tcpcheck_main The special handling of in-progress send rules at the begining of tcpcheck_main() function can be removed. Instead, at the begining of the tcpcheck_eval_send() function, we test is there is some data in the output buffer. In this case, it means we are evaluating an unfinished send rule and we can jump to the sending part, skipping the formatting part. This patch is mandatory for a major fix on the checks and must be backported as far as 2.2.	2020-11-27 10:08:21 +01:00
Christopher Faulet	1faf18ae39	BUG/MINOR: tcpcheck: Don't forget to reset tcp-check flags on new kind of check When a new kind of check is found during the parsing of a proxy section (via an option directive), we must reset tcpcheck flags for this proxy. It is mandatory to not inherit some flags from a previously declared check (for instance in the default section). This patch must be backported as far as 2.2.	2020-11-27 10:08:18 +01:00
Willy Tarreau	5a7d6ebf2c	MINOR: fd/threads: silence a build warning with threads disabled Building with gcc-9.3.0 without threads may result in this warning: In file included from include/haproxy/api-t.h:36, from include/haproxy/api.h:33, from src/fd.c:90: src/fd.c: In function 'updt_fd_polling': include/haproxy/fd.h:507:11: warning: array subscript 63 is above array bounds of 'int[1]' [-Warray-bounds] 507 \| DISGUISE(write(poller_wr_pipe[tid], &c, 1)); include/haproxy/compiler.h:92:41: note: in definition of macro 'DISGUISE' 92 \| #define DISGUISE(v) ({ typeof(v) __v = (v); ALREADY_CHECKED(__v); __v; }) \| ^ src/fd.c:113:5: note: while referencing 'poller_wr_pipe' 113 \| int poller_wr_pipe[MAX_THREADS]; // Pipe to wake the threads \| ^~~~~~~~~~~~~~ gcc is wrong but this time it cannot be blamed because it doesn't know that the FD's thread_mask always has at least one bit set. Let's add the test for all_threads_mask there. It will also remove that test and drop the else block.	2020-11-26 22:28:41 +01:00
Willy Tarreau	345ebcfc01	BUG/MAJOR: peers: fix partial message decoding Another bug in the peers message parser was uncovered by last commit `1dfd4f106` ("BUG/MEDIUM: peers: fix decoding of multi-byte length in stick-table messages"): the function return on incomplete message does not check if the channel has a pending close before deciding to return 0. It did not hurt previously because the loop calling co_getblk() once per character would have depleted the buffer and hit the end, causing <0 to be returned and matching the condition. But now that we process at once what is available this cannot be relied on anymore and it's now clearly visible that the final check is missing. What happens when this strikes is that if a peer connection breaks in the middle of a message, the function will return 0 (missing data) but the caller doesn't check for the closed buffer, subscribes to reads, and the applet handler is immediately called again since some data are still available. This is detected by the loop prevention and the process dies complaining that an appctx is spinning. This patch simply adds the check for closed channel. It must be backported to the same versions as the fix above.	2020-11-26 17:12:47 +01:00
Tim Duesterhus	23b2945c1c	BUG/CRITICAL: cache: Fix trivial crash by sending accept-encoding header Since commit `3d08236cb3` HAProxy can be trivially crashed remotely by sending an `accept-encoding` HTTP request header that contains 16 commas. This is because the `values` array in `accept_encoding_normalizer` accepts only 16 entries and it is not verified whether the end is reached during looping. Fix this issue by checking the length. This patch also simplifies the ist processing in the loop, because it manually calculated offsets and lengths, when the ist API exposes perfectly safe functions to advance and truncate ists. I wonder whether the accept_encoding_normalizer function is able to re-use some existing function for parsing headers that may contain lists of values. I'll leave this evaluation up to someone else, only patching the obvious crash. This commit is 2.4-dev specific and was merged just a few hours ago. No backport needed.	2020-11-25 10:23:00 +01:00
Remi Tricot-Le Breton	754b2428d3	MINOR: cache: Add a process-vary option that can enable/disable Vary processing The cache section's process-vary option takes a 0 or 1 value to disable or enable the vary processing. When disabled, a response containing such a header will never be cached. When enabled, we will calculate a preliminary hash for a subset of request headers on all the incoming requests (which might come with a cpu cost) which will be used to build a secondary key for a given request (see RFC 7234#4.1). The default value is 0 (disabled).	2020-11-24 16:52:57 +01:00
Remi Tricot-Le Breton	1785f3dd96	MEDIUM: cache: Add the Vary header support Calculate a preliminary secondary key for every request we see so that we can have a real secondary key if the response is cacheable and contains a manageable Vary header. The cache's ebtree is now allowed to have multiple entries with the same primary key. Two of those entries will be distinguished thanks to secondary keys stored in the cache_entry (based on hashes of a subset of their headers). When looking for an entry in the cache (cache_use), we still use the primary key (built the same way as before), but in case of match, we also need to check if the entry has a vary signature. If it has one, we need to perform an extra check based on the newly built secondary key. We will only be able to forge a response out of the cache if both the primary and secondary keys match with one of our entries. Otherwise the request will be forwarder to the server.	2020-11-24 16:52:57 +01:00
Remi Tricot-Le Breton	3d08236cb3	MINOR: cache: Prepare helper functions for Vary support The Vary functionality is based on a secondary key that needs to be calculated for every request to which a server answers with a Vary header. The Vary header, which can only be found in server responses, determines which headers of the request need to be taken into account in the secondary key. Since we do not want to have to store all the headers of the request until we have the response, we will pre-calculate as many sub-hashes as there are headers that we want to manage in a Vary context. We will only focus on a subset of headers which are likely to be mentioned in a Vary response (accept-encoding and referer for now). Every managed header will have its own normalization function which is in charge of transforming the header value into a core representation, more robust to insignificant changes that could exist between multiple clients. For instance, two accept-encoding values mentioning the same encodings but in different orders should give the same hash. This patch adds a function that parses a Vary header value and checks if all the values belong to our supported subset. It also adds the normalization functions for our two headers, as well as utility functions that can prebuild a secondary key for a given request and transform it into an actual secondary key after the vary signature is determined from the response.	2020-11-24 16:52:57 +01:00
Christopher Faulet	401e6dbff3	BUG/MAJOR: filters: Always keep all offsets up to date during data filtering When at least one data filter is registered on a channel, the offsets of all filters must be kept up to date. For data filters but also for others. It is safer to do it in that way. Indirectly, this patch fixes 2 hidden bugs revealed by the commit `22fca1f2c` ("BUG/MEDIUM: filters: Forward all filtered data at the end of http filtering"). The first one, the worst of both, happens at the end of http filtering when at least one data filtered is registered on the channel. We call the http_end() callback function on the filters, when defined, to finish the http filtering. But it is performed for all filters. Before the commit `22fca1f2c`, the only risk was to call the http_end() callback function unexpectedly on a filter. Now, we may have an overflow on the offset variable, used at the end to forward all filtered data. Of course, from the moment we forward an arbitrary huge amount of data, all kinds of bad things may happen. So offset computation is performed for all filters and http_end() callback function is called only for data filters. The other one happens when a data filter alter the data of a channel, it must update the offsets of all previous filters. But the offset of non-data filters must be up to date, otherwise, here too we may have an integer overflow. Another way to fix these bugs is to always ignore non-data filters from the offsets computation. But this patch is safer and probably easier to maintain. This patch must be backported in all versions where the above commit is. So as far as 2.0.	2020-11-24 14:17:32 +01:00
Maciej Zdeb	6dee9969b9	BUG/MEDIUM: http_act: Restore init of log-format list Restore init of log-format list in parse_http_del_header which was accidently deleted by commit `ebdd4c55da` (implementation of different header matching methods for http-request/response del-header). This is related to GitHub issue #909	2020-11-24 10:33:46 +01:00
Ilya Shipitsin	d9a16dc0f2	BUILD: SSL: add BoringSSL guarding to "RAND_keep_random_devices_open" "RAND_keep_random_devices_open" is OpenSSL specific, does not present in other OpenSSL variants like LibreSSL or BoringSSL. BoringSSL recently "updated" its internal openssl version to 1.1.1, we temporarily set it back to 1.1.0, as we are going to remove that hack, let us add proper guarding.	2020-11-24 09:54:44 +01:00
Julien Pivotto	2de240a676	MINOR: stream: Add level 7 retries on http error 401, 403 Level-7 retries are only possible with a restricted number of HTTP return codes. While it is usually not safe to retry on 401 and 403, I came up with an authentication backend which was not synchronizing authentication of users. While not perfect, being allowed to also retry on those return codes is really helpful and acts as a hotfix until we can fix the backend. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-11-23 09:33:14 +01:00
Tim Duesterhus	c8d19702f4	BUILD: Show the value of DEBUG= in haproxy -vv Previously this was not visible after building.	2020-11-21 18:27:33 +01:00
Maciej Zdeb	ebdd4c55da	MINOR: http_act: Add -m flag for del-header name matching method This patch adds -m flag which allows to specify header name matching method when deleting headers from http request/response. Currently beg, end, sub, str and reg are supported. This is related to GitHub issue #909	2020-11-21 15:54:30 +01:00
Maciej Zdeb	302b9f8d7a	BUG/MINOR: http_htx: Fix searching headers by substring Function __http_find_header is used to search headers by name using specified matching method. Matching by substring returned unexpected results due to wrong length of substring supplied to strnistr function. Fixed also the boolean condition by inverting it, as we're interested in headers that contains the substring. This patch should be backported as far as 2.2	2020-11-21 15:54:26 +01:00
Willy Tarreau	3aab17bd56	BUG/MAJOR: connection: reset conn->owner when detaching from session list Baptiste reported a new crash affecting 2.3 which can be triggered when using H2 on the backend, with http-reuse always and with a tens of clients doing close only. There are a few combined cases which cause this to happen, but each time the issue is the same, an already freed session is dereferenced in session_unown_conn(). Two cases were identified to cause this: - a connection referencing a session as its owner, which is detached from the session's list and is destroyed after this session ends. The test on conn->owner before calling session_unown_conn() is not sufficent as the pointer is not null but is not valid anymore. - a connection that never goes idle and that gets killed form the mux, where session_free() is called first, then conn_free() calls session_unown_conn() which scans the just freed session for older connections. This one is only triggered with DEBUG_UAF The reason for this session to be present here is that it's needed during the connection setup, to be passed to conn_install_mux_be() to mux->init() as the owning session, but it's never deleted aftrewards. Furthermore, even conn_session_free() doesn't delete this pointer after freeing the session that lies there. Both do definitely result in a use-after-free that's more easily triggered under DEBUG_UAF. This patch makes sure that the owner is always deleted after detaching or killing the session. However it is currently not possible to clear the owner right after a synchronous init because the proxy protocol apparently needs it (a reg test checks this), and if we leave it past the connection setup with the session not attached anywhere, it's hard to catch the right moment to detach it. This means that the session may remain in conn->owner as long as the connection has never been added to nor removed from the session's idle list. Given that this patch needs to remain simple enough to be backported, instead it adds a workaround in session_unown_conn() to detect that the element is already not attached anywhere. This fix absolutely requires previous patch "CLEANUP: connection: do not use conn->owner when the session is known" otherwise the situation will be even worse, as some places used to rely on conn->owner instead of the session. The fix could theorically be backported as far as 1.8. However, the code in this area has significantly changed along versions and there are more risks of breaking working stuff than fixing real issues there. The issue was really woken up in two steps during 2.3-dev when slightly reworking the idle conns with commit `08016ab82` ("MEDIUM: connection: Add private connections synchronously in session server list") and when adding support for storing used H2 connections in the session and adding the necessary call to session_unown_conn() in the muxes. But the same test managed to crash 2.2 when built in DEBUG_UAF and patched like this, proving that we used to already leave dangling pointers behind us: \| diff --git a/include/haproxy/connection.h b/include/haproxy/connection.h \| index f8f235c1a..dd30b5f80 100644 \| --- a/include/haproxy/connection.h \| +++ b/include/haproxy/connection.h \| @@ -458,6 +458,10 @@ static inline void conn_free(struct connection conn) \| sess->idle_conns--; \| session_unown_conn(sess, conn); \| } \| + else { \| + struct session sess = conn->owner; \| + BUG_ON(sess && sess->origin != &conn->obj_type); \| + } \| \| sockaddr_free(&conn->src); \| sockaddr_free(&conn->dst); It's uncertain whether an existing code path there can lead to dereferencing conn->owner when it's bad, though certain suspicious memory corruption bugs make one think it's a likely candidate. The patch should not be hard to adapt there. Backports to 2.1 and older are left to the appreciation of the person doing the backport. A reproducer consists in this: global nbthread 1 listen l bind :9000 mode http http-reuse always server s 127.0.0.1:8999 proto h2 frontend f bind :8999 proto h2 mode http http-request return status 200 Then this will make it crash within 2-3 seconds: $ h1load -e -r 1 -c 10 http://0:9000/ If it does not, it might be that DEBUG_UAF was not used (it's harder then) and it might be useful to restart.	2020-11-21 15:29:22 +01:00
Willy Tarreau	38b4d2eb22	CLEANUP: connection: do not use conn->owner when the session is known At a few places we used to rely on conn->owner to retrieve the session while the session is already known. This is not correct because at some of these points the reason the connection's owner was still the session (instead of NULL) is a mistake. At one place a comparison is even made between the session and conn->owner assuming it's valid without checking if it's NULL. Let's clean this up to use the session all the time. Note that this will be needed for a forthcoming fix and will have to be backported.	2020-11-21 15:29:22 +01:00
Ilya Shipitsin	f34ed0b74c	BUILD: SSL: guard TLS13 ciphersuites with HAVE_SSL_CTX_SET_CIPHERSUITES HAVE_SSL_CTX_SET_CIPHERSUITES is newly defined macro set in openssl-compat.h, which helps to identify ssl libs (currently OpenSSL-1.1.1 only) that supports TLS13 cipersuites manipulation on TLS13 context	2020-11-21 11:04:36 +01:00
William Lallemand	77e1c6fb0a	BUG/MEDIUM: ssl/crt-list: fix error when no file found When a file from a crt-list was not found, this one was ignored silently letting HAProxy starts without it. This bug was introduced by `47da821` ("MEDIUM: ssl: emulates the multi-cert bundles in the crtlist"). This commit adds a found variable which is checked once we tried every bundle combination so we can exits with an error if none were found. Must be backported in 2.3.	2020-11-20 18:38:56 +01:00
William Lallemand	7340457158	BUG/MINOR: ssl/crt-list: load bundle in crt-list only if activated Don't try to load a bundle from a crt-list if the bundle support was disabled with ssl-load-extra-files. Must be backported to 2.3.	2020-11-20 18:38:56 +01:00
William Lallemand	06ce84a100	BUG/MEDIUM: ssl: error when no certificate are found When a non-existing file was specified in the configuration, haproxy does not exits with an error which is not normal. This bug was introduced by `dfa93be` ("MEDIUM: ssl: emulate multi-cert bundles loading in standard loading") which does nothing if the stat failed. This patch introduce a "found" variable which is checked at the end of the function so we exit with an error if no find were found. Must be backported to 2.3.	2020-11-20 18:38:56 +01:00
William Lallemand	86c2dd60f1	BUG/MEDIUM: ssl/crt-list: bundle support broken in crt-list In issue #970 it was reported that the bundle loading does not work anymore with crt-list. This bug was introduced by `47da821` ("MEDIUM: ssl: emulates the multi-cert bundles in the crtlist") which incorrectly uses "path" instead of "crt_path" in the name resolution. Must be backported to 2.3.	2020-11-20 18:38:51 +01:00
Christopher Faulet	aab1b67383	BUG/MEDIUM: http-ana: Don't eval http-after-response ruleset on empty messages It is not possible on response comming from a server, but an errorfile may be empty. In this case, the http-after-response ruleset must not be evaluated because it is totally unexpected to manipulate headers on an empty HTX message. This patch must be backported everywhere the http-after-response rules are supported, i.e as far as 2.2.	2020-11-20 09:43:31 +01:00
Ilya Shipitsin	bdec3ba796	BUILD: ssl: use SSL_MODE_ASYNC macro instead of OPENSSL_VERSION	2020-11-19 19:59:32 +01:00
William Lallemand	f69cd68737	BUG/MINOR: ssl: segv on startup when AKID but no keyid In bug #959 it was reported that haproxy segfault on startup when trying to load a certifcate which use the X509v3 AKID extension but without the keyid field. This field is not mandatory and could be replaced by the serial or the DirName. For example: X509v3 extensions: X509v3 Basic Constraints: CA:FALSE X509v3 Subject Key Identifier: 42:7D:5F:6C:3E:0D:B7:2C:FD:6A:8A:32:C6:C6:B9:90:05:D1:B2:9B X509v3 Authority Key Identifier: DirName:/O=HAProxy Technologies/CN=HAProxy Test Intermediate CA serial:F2:AB:C1:41:9F:AB:45:8E:86:23:AD:C5:54:ED:DF:FA This bug was introduced by 70df7b ("MINOR: ssl: add "issuers-chain-path" directive"). This patch must be backported as far as 2.2.	2020-11-19 16:24:13 +01:00
William Dauchy	f63704488e	MEDIUM: cli/ssl: configure ssl on server at runtime in the context of a progressive backend migration, we want to be able to activate SSL on outgoing connections to the server at runtime without reloading. This patch adds a `set server ssl` command; in order to allow that: - add `srv_use_ssl` to `show servers state` command for compatibility, also update associated parsing - when using default-server ssl setting, and `no-ssl` on server line, init SSL ctx without activating it - when triggering ssl API, de/activate SSL connections as requested - clean ongoing connections as it is done for addr/port changes, without checking prior server state example config: backend be_foo default-server ssl server srv0 127.0.0.1:6011 weight 1 no-ssl show servers state: 5 be_foo 1 srv0 127.0.0.1 2 0 1 1 15 1 0 4 0 0 0 0 - 6011 - -1 where srv0 can switch to ssl later during the runtime: set server be_foo/srv0 ssl on 5 be_foo 1 srv0 127.0.0.1 2 0 1 1 15 1 0 4 0 0 0 0 - 6011 - 1 Also update existing tests and create a new one. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2020-11-18 17:22:28 +01:00
William Dauchy	fc52f524b0	MINOR: ssl: create common ssl_ctx init a common init for ssl_ctx will be later usable in other functions in order to support hot enable of ssl during runtime. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2020-11-18 17:22:28 +01:00
Amaury Denoyelle	034c162b9b	MEDIUM: stats: add counters for failed handshake Report on ssl stats the total number of handshakes terminated in a failure.	2020-11-18 16:10:42 +01:00
Amaury Denoyelle	f70b7db825	MINOR: ssl: remove client hello counters Remove the ssl client hello received counter. This counter is not meaningful and was only implemented on the fronted.	2020-11-18 16:10:42 +01:00
Christopher Faulet	47d9a4e870	MINOR: flt-trace: Use a bitfield for the trace options Instead of using a integer for each option, we now use a bitfield. Each option is represented as a flag now.	2020-11-17 11:34:36 +01:00
Christopher Faulet	96a577acae	MINOR: flt-trace: Add an option to inhibits trace messages The 'quiet' option may be set to inibits the trace messages. The trace filter is a bit verbose. This option may be used to not display the messages.	2020-11-17 11:34:36 +01:00
Christopher Faulet	c41d8bd65a	CLEANUP: flt-trace: Remove unused random-parsing option This option was only used by the legacy HTTP mode. In HTX, it is not used. So it can be removed.	2020-11-17 11:34:30 +01:00
Christopher Faulet	63c69a9b4e	BUG/MINOR: http-ana: Don't wait for the body of CONNECT requests CONNECT requests are bodyless messages but with no EOM blocks. Thus, conditions to stop waiting for the message payload are not suited to this kind of messages. Indeed, the message finishes on an EOH block. But the tunnel mode at the stream level is only set in HTTP_XFER_BODY analyser. So, the stream is blocked, waiting for a body that does not exist till a timeout expires. To fix this bug, we just stop waiting for a body for CONNECT requests. Another solution is to rely on HTX_SL_F_BODYLESS/HTTP_MSGF_BODYLESS flags. But this one is less intrusive. This message must be backported as far as 2.0. For the 2.0, only the HTX part must be fixed.	2020-11-17 10:03:12 +01:00
Christopher Faulet	22fca1f2c8	BUG/MEDIUM: filters: Forward all filtered data at the end of http filtering When http filtering ends, if there are some filtered data not forwarded yet, we forward them, in flt_http_end(). Most of time, this doesn't happen, except when a tunnel is established using a CONNECT. In this case, there is not EOM on the request and there is no body. Thus the headers are never forwarded, blocking the stream. This patch must be backported as far as 2.0. Prior versions don't suffer of this bug because there is no HTX support. On the 2.0, the change is only applicable on HTX streams. A special test must be performed to make sure.	2020-11-17 09:59:35 +01:00
Eric Salama	9139ec34ed	MINOR: cfgparse: tighten the scope of newnameserver variable, free it on error. This should fix issue GH #931. Also remove a misleading comment. This commit can be backported as far as 1.9	2020-11-13 16:26:10 +01:00
Christopher Faulet	fc633b6eff	CLEANUP: config: Return ERR_NONE from config callbacks instead of 0 Return ERR_NONE instead of 0 on success for all config callbacks that should return ERR_* codes. There is no change because ERR_NONE is a macro equals to 0. But this makes the return value more explicit.	2020-11-13 16:26:10 +01:00
Christopher Faulet	5214099233	MINOR: config/mux-h2: Return ERR_ flags from init_h2() instead of a status post-check function callbacks must return ERR_* flags. Thus, init_h2() is fixed to return ERR_NONE on success or (ERR_ALERT\|ERR_FATAL) on error. This patch may be backported as far as 2.2.	2020-11-13 16:26:10 +01:00
Christopher Faulet	83fefbcdff	MINOR: init: Fix the prototype for per-thread free callbacks Functions registered to release memory per-thread have no return value. But the registering function and the function pointer in per_thread_free_fct structure specify it should return an integer. This patch fixes it. This patch may be backported as far as 2.0.	2020-11-13 16:26:10 +01:00
Christopher Faulet	c751b4508d	BUG/MINOR: tcpcheck: Don't warn on unused rules if check option is after When tcp-check or http-check rules are used, if the corresponding check option (option tcp-check and option httpchk) is declared after the ruleset, a warning is emitted about an unused check ruleset while there is no problem in reality. This patch must be backported as far as 2.2.	2020-11-13 16:26:10 +01:00
Christopher Faulet	c7ba91039a	MINOR: spoe: Don't close connection in sync mode on processing timeout In sync mode, if an applet receives a ack while the processing delay has already expired, there is not frame waiting for this ack. But there is no reason to close the connection in this case. The ack may be ignored and the connection may be reused to process another frame. The only reason to trigger an error and close the connection is when the wrong ack is received while there is still a frame waiting for its ack. In sync mode, this should never happen. This patch may be backported in all versions supporting the SPOE.	2020-11-13 16:26:10 +01:00
Christopher Faulet	cf181c76e3	BUG/MAJOR: spoe: Be sure to remove all references on a released spoe applet When a SPOE applet is used to send a frame, a reference on this applet is saved in the spoe context of the offladed stream. But, if the applet is released before receving the corresponding ack, we must be sure to remove this reference. This was performed for fragmented frames only. But it must also be performed for a spoe contexts in the applet waiting_queue and in the thread waiting_queue (used in async mode). This bug leads to a memory corruption when an offloaded stream try to update the state of a released applet because it still have a reference on it. There are many ways to trigger this bug. The easiest is probably during reloads. On the old process, all applets are woken up to be released ASAP. Many thanks to Maciej Zdeb to report the bug and to work on it for 2 months. Without his help, it would have been much more difficult to fix the bug. It is always a huge pleasure to see how some users are enthousiast and helpful. Thanks again Maciej ! This patch must be backported to all versions where the spoe is supported (>= 1.7).	2020-11-13 16:26:10 +01:00
Christopher Faulet	3005d28eb8	BUG/MINOR: http-htx: Handle warnings when parsing http-error and http-errors First of all, this patch is tagged as a bug. But in fact, it only fixes a bug in the 2.2. On the 2.3 and above, it only add the ability to display warnings, when an http-error directive is parsed from a proxy section and when an errorfile directive is parsed from a http-errors section. But on the 2.2, it make sure to display the warning emitted on a content-length mismatch when an errorfile is parsed. The following is only applicable to the 2.2. commit "BUG/MINOR: http-htx: Just warn if payload of an errorfile doesn't match the C-L" (which is only present in 2.2, 2.1 and 2.0 trees, i.e see commit 7bf3d81d3cf4b9f4587 in 2.2 tree), is changing the behavior of `http_str_to_htx` function. It may now emit warnings. And, it is the caller responsibility to display it. But the warning is missing when an 'http-error' directive is parsed from a proxy section. It is also missing when an 'errorfile' directive is parsed from a http-errors section. This bug only exists on the 2.2. On earlier versions, these directives are not supported and on later ones, an error is triggered instead of a warning. Thanks to William Dauchy that spotted the bug. This patch must be backported as far as 2.2.	2020-11-13 16:26:10 +01:00
Amaury Denoyelle	90eb93f792	MINOR: check: report error on incompatible connect proto Report an error when using an explicit proto for a connect rule with non-compatible mode in regards with the selected check type (tcp-check vs http-check).	2020-11-13 16:26:10 +01:00
Amaury Denoyelle	7c14890183	MINOR: check: report error on incompatible proto If the check mux has been explicitly defined but is incompatible with the selected check type (tcp-check vs http-check), report a warning and prevent haproxy startup.	2020-11-13 16:26:10 +01:00
Amaury Denoyelle	0519bd4d04	BUG/MEDIUM: check: reuse srv proto only if using same mode Only reuse the mux from server if the check is using the same mode. For example, this prevents a tcp-check on a h2 server to select the h2 multiplexer instead of passthrough. This bug was introduced by the following commit : BUG/MEDIUM: checks: Use the mux protocol specified on the server line It must be backported up to 2.2. Fixes github issue #945.	2020-11-13 16:26:10 +01:00
Christopher Faulet	97fc8da264	BUG/MINOR: http-fetch: Fix calls w/o parentheses of the cookie sample fetches req.cook, req.cook_val, req.cook_cnt and and their response counterparts may be called without cookie name. In this case, empty parentheses may be used, or no parentheses at all. In both, the result must be the same. But only the first one works. The second one always returns a failure. This patch fixes this bug. Note that on old versions (< 2.2), both cases fail. This patch must be backported in all stable versions.	2020-11-13 16:26:10 +01:00
Maciej Zdeb	dea7c209f8	BUG/MINOR: http-fetch: Extract cookie value even when no cookie name HTTP sample fetches dealing with the cookies (req/res.cook, req/res.cook_val and req/res.cook_cnt) must be prepared to be called without cookie name. For the first two, the first cookie value is returned, regardless its name. For the last one, all cookies are counted. To do so, http_extract_cookie_value() may now be called with no cookie name (cookie_name_l set to 0). In this case, the matching on the cookie name is ignored and the first value found is returned. Note this patch also fixes matching on cookie values in ACLs. This should be backported in all stable versions.	2020-11-13 16:26:10 +01:00
Willy Tarreau	1dfd4f106f	BUG/MEDIUM: peers: fix decoding of multi-byte length in stick-table messages There is a bug in peer_recv_msg() due to an incorrect cast when trying to decode the varint length of a stick-table message, causing lengths comprised between 128 and 255 to consume one extra byte, ending in protocol errors. The root cause of this is that peer_recv_msg() tries hard to reimplement all the parsing and control that is already done in intdecode() just to measure the length before calling it. And it got it wrong. Let's just get rid of this unneeded code duplication and solely rely on intdecode() instead. The bug was introduced in 2.0 as part of a cleanup pass on this code with commit `95203f218` ("MINOR: peers: Move high level receive code to reduce the size of I/O handler."), so this patch must be backported to 2.0. Thanks to Yves Lafon for reporting the problem.	2020-11-13 15:21:50 +01:00
Fr�d�ric L�caille	ea875e62e6	BUG/MINOR: peers: Missing TX cache entries reset. The TX part of a cache for a dictionary is made of an reserved array of ebtree nodes which are pointers to dictionary entries. So when we flush the TX part of such a cache, we must not only remove these nodes to dictionary entries from their ebtree. We must also reset their values. Furthermore, the LRU key and the last lookup result must also be reset.	2020-11-13 06:04:18 +01:00
Fr�d�ric L�caille	f9e51beec1	BUG/MINOR: peers: Do not ignore a protocol error for dictionary entries. If we could not decode the ID of a dictionary entry from a peer update message, we must inform the remote peer about such an error as this is done for any other decoding error.	2020-11-13 06:04:08 +01:00
Fr�d�ric L�caille	d865935f32	MINOR: peers: Add traces to peer_treat_updatemsg(). Add minimalistic traces for peers with only one event to diagnose potential issues when decode peer update messages.	2020-11-12 17:38:49 +01:00
Amaury Denoyelle	7f8f6cb926	BUG/MEDIUM: stats: prevent crash if counters not alloc with dummy one Define a per-thread counters allocated with the greatest size of any stat module counters. This variable is named trash_counters. When using a proxy without allocated counters, return the trash counters from EXTRA_COUNTERS_GET instead of a dangling pointer to prevent segfault. This is useful for all the proxies used internally and not belonging to the global proxy list. As these objects does not appears on the stat report, it does not matter to use the dummy counters. For this fix to be functional, the extra counters are explicitly initialized to NULL on proxy/server/listener init functions. Most notably, the crash has already been detected with the following vtc: - reg-tests/lua/txn_get_priv.vtc - reg-tests/peers/tls_basic_sync.vtc - reg-tests/peers/tls_basic_sync_wo_stkt_backend.vtc There is probably other parts that may be impacted (SPOE for example). This bug was introduced in the current release and do not need to be backported. The faulty commits are "MINOR: ssl: count client hello for stats" and "MINOR: ssl: add counters for ssl sessions".	2020-11-12 15:16:05 +01:00
Amaury Denoyelle	a2a6899bee	BUG/MINOR: stats: free dynamically stats fields/lines on shutdown Register a new function on POST DEINIT to free stats fields/lines for each domain. This patch does not fix a critical bug but may be backported to 2.3.	2020-11-12 15:16:05 +01:00
Remi Tricot-Le Breton	cc9bf2e5fe	MEDIUM: cache: Change caching conditions Do not cache responses that do not have an explicit expiration time (s-maxage or max-age Cache-Control directives or Expires header) or a validator (ETag or Last-Modified headers) anymore, as suggested in RFC 7234#3. The TX_FLAG_IGNORE flag is used instead of the TX_FLAG_CACHEABLE so as not to change the behavior of the checkcache option.	2020-11-12 11:22:05 +01:00
Thierry Fournier	91dc0c0d8f	BUG/MINOR: lua: set buffer size during map lookups This size is used by some pattern matching to determine if there is sufficient room in the buffer to add final \0 if necessary. If the size is not set, the conditions use uninitialized value. Note: it seems this bug can't cause a crash. Should be backported until 2.2 (at least)	2020-11-11 10:43:21 +01:00
Thierry Fournier	a68affeaa9	BUG/MINOR: pattern: a sample marked as const could be written The functions add final 0 to string if the final 0 is not set, but don't check the flag CONST. This patch duplicates the strings if the final zero is not set and the string is CONST. Should be backported until 2.2 (at least)	2020-11-11 10:43:15 +01:00
William Lallemand	50c03aac04	BUG/MEDIUM: ssl/crt-list: correctly insert crt-list line if crt already loaded In issue #940, it was reported that the crt-list does not work correctly anymore. Indeed when inserting a crt-list line which use a certificate previously seen in the crt-list, this one won't be inserted in the SNI list and will be silently ignored. This bug was introduced by commit `47da821` "MEDIUM: ssl: emulates the multi-cert bundles in the crtlist". This patch also includes a reg-test which tests this issue. This bugfix must be backported in 2.3.	2020-11-06 16:39:39 +01:00
Willy Tarreau	431a12cafe	BUILD: http-htx: fix build warning regarding long type in printf Commit `a66adf41e` ("MINOR: http-htx: Add understandable errors for the errorfiles parsing") added a warning when loading malformed error files, but this warning may trigger another build warning due to the %lu format used. Let's simply cast it for output since it's just used for end user output. This must be backported to 2.0 like the commit above.	2020-11-06 14:24:02 +01:00
Willy Tarreau	4299528390	BUILD: ssl: silence build warning on uninitialised counters Since commit `d0447a7c3` ("MINOR: ssl: add counters for ssl sessions"), gcc 9+ complains about this: CC src/ssl_sock.o src/ssl_sock.c: In function 'ssl_sock_io_cb': src/ssl_sock.c:5416:3: warning: 'counters_px' may be used uninitialized in this function [-Wmaybe-uninitialized] 5416 \| ++counters_px->reused_sess; \| ^~~~~~~~~~~~~~~~~~~~~~~~~~ src/ssl_sock.c:5133:23: note: 'counters_px' was declared here 5133 \| struct ssl_counters counters, counters_px; \| ^~~~~~~~~~~ Either a listener or a server are expected there, so ther counters are always initialized and the compiler cannot know this. Let's preset them and test before updating the counter, we're not in a hot path here. No backport is needed.	2020-11-06 13:22:44 +01:00
Willy Tarreau	f5fe70620c	MINOR: server: remove idle lock in srv_cleanup_connections This function used to grab the idle lock when scanning the threads for idle connections, but it doesn't need it since the lock only protects the tree. Let's remove it.	2020-11-06 13:22:44 +01:00
Amaury Denoyelle	d0447a7c3e	MINOR: ssl: add counters for ssl sessions Add counters for newly established and resumed sessions.	2020-11-06 12:05:17 +01:00
Amaury Denoyelle	fbc3377cd4	MINOR: ssl: count client hello for stats Add a counter for ssl client_hello received on frontends.	2020-11-06 12:05:17 +01:00
Amaury Denoyelle	9963fa74d2	MINOR: ssl: instantiate stats module This module is responsible for providing statistics for ssl. It allocates counters for frontend/backend/listener/server objects.	2020-11-06 12:05:17 +01:00
Christopher Faulet	a66adf41ea	MINOR: http-htx: Add understandable errors for the errorfiles parsing No details are provided when an error occurs during the parsing of an errorfile, Thus it is a bit hard to diagnose where the problem is. Now, when it happens, an understandable error message is reported. This patch is not a bug fix in itself. But it will be required to change an fatal error into a warning in last stable releases. Thus it must be backported as far as 2.0.	2020-11-06 09:13:58 +01:00
Willy Tarreau	6d27a92b83	BUG/MINOR: ssl: don't report 1024 bits DH param load error when it's higher The default dh_param value is 2048 and it's preset to zero unless explicitly set, so we must not report a warning about DH param not being loadble in 1024 bits when we're going to use 2048. Thanks to Dinko for reporting this. This should be backported to 2.2.	2020-11-05 19:40:14 +01:00
Jerome Magnin	eff2e0a958	CLEANUP: cfgparse: remove duplicate registration for transparent build options Since commit `37bafdcbb` ("MINOR: sock_inet: move the IPv4/v6 transparent mode code to sock_inet"), build options for transparent proxying are registered twice. This patch removes the older one.	2020-11-05 19:27:16 +01:00
Willy Tarreau	38d41996c1	MEDIUM: pattern: turn the pattern chaining to single-linked list It does not require heavy deletion from the expr anymore, so we can now turn this to a single-linked list since most of the time we want to delete all instances of a given pattern from the head. By doing so we save 32 bytes of memory per pattern. The pat_unlink_from_head() function was adjusted accordingly.	2020-11-05 19:27:09 +01:00
Willy Tarreau	867a8a5a10	MINOR: pattern: prepare removal of a pattern from the list head Instead of using LIST_DEL() on the pattern itself inside an expression, we look it up from its head. The goal is to get rid of the double-linked list while this usage remains exclusively for freeing on startup error!	2020-11-05 19:27:09 +01:00
Willy Tarreau	2817472bb0	MINOR: pattern: during reload, delete elements frem the ref, not the expression Instead of scanning all elements from the expression and using the slow delete path there, let's use the faster way which involves pat_delete_gen() while the elements are detached from ther reference.	2020-11-05 19:27:09 +01:00
Willy Tarreau	ae83e63b48	MEDIUM: pattern: make pat_ref_prune() rely on pat_ref_purge_older() When purging all of a reference, it's much more efficient to scan the reference patterns from the reference head and delete all derivative patterns than to scan the expressions. The only thing is that we need to proceed both for the current and next generations, in case there is a huge gap between the two. With this, purging 20M IP addresses in small batches of 100 takes roughly 3 seconds.	2020-11-05 19:27:09 +01:00
Willy Tarreau	94b9abe200	MINOR: pattern: add pat_ref_purge_older() to purge old entries This function will be usable to purge at most a specified number of old entries from a reference. Entries are declared old if their generation number is in the past compared to the one passed in argument. This will ease removal of early entries when new ones have been appended. We also call malloc_trim() when available, at the end of the series, because this is one place where there is a lot of memory to save. Reloads of 1M IP addresses used in an ACL made the process grow up to 1.7 GB RSS after 10 reloads and roughly stabilize there without this call, versus only 260 MB when the call is present. Sadly there is no direct equivalent for jemalloc, which stabilizes around 800MB-1GB.	2020-11-05 19:27:09 +01:00
Willy Tarreau	1a6857b9c1	MINOR: pattern: implement pat_ref_load() to load a pattern at a given generation pat_ref_load() basically combines pat_ref_append() and pat_ref_commit(). It's very similar to pat_ref_add() except that it also allows to set the generation ID and the line number. pat_ref_add() was modified to directly rely on it to avoid code duplication. Note that a previous declaration of pat_ref_load() was removed as it was just a leftover of an earlier incarnation of something possibly similar, so no existing functionality was changed here.	2020-11-05 19:27:09 +01:00
Willy Tarreau	0439e5eeb4	MINOR: pattern: add pat_ref_commit() to commit a previously inserted element This function will be used after a successful pat_ref_append() to propagate the pattern to all use places (including parsing and indexing). On failure, it will entirely roll back all insertions and free the pattern itself. It also preserves the generation number so that it is convenient for use in association with pat_ref_append(). pat_ref_add() was modified to rely on it instead of open-coding the insertion and roll-back.	2020-11-05 19:27:09 +01:00
Willy Tarreau	c93da6950e	MEDIUM: pattern: only match patterns that match the current generation Instead of matching any pattern found in the tree, only match those matching the current generation of entries. This will make sure that reloads are atomic, regardless of the time they take to complete, and that newly added data are not matched until the whole reference is committed. For consistency we proceed the same way on "show map" and "show acl". This will have no impact for now since generations are not used.	2020-11-05 19:27:09 +01:00
Willy Tarreau	29947745b5	MINOR: pattern: store a generation number in the reference patterns Right now it's not possible to perform a safe reload because we don't know what patterns were recently added or were already present. This patch adds a generation counter to the reference patterns so that it is possible to know what generation of the reference they were loaded with. A reference now has two generations, the current one, used for all additions, and the next one, allocated to those wishing to update the contents. The generation wraps at 2^32 so comparisons must be made relative to the current position. The idea will be that upon full reload, the caller will first get a new generation ID, will insert all new patterns using it, will then switch the current ID to the new one, and will delete all entries older than the current ID. This has the benefit of supporting chunked updates that remain consistent and that won't block the whole process for ages like pat_ref_reload() currently does.	2020-11-05 19:27:09 +01:00
Willy Tarreau	1fd52f70e5	MINOR: pattern: introduce pat_ref_delete_by_ptr() to delete a valid reference Till now the only way to remove a known reference was via pat_ref_delete_by_id() which scans the whole list to find a matching pointer. Let's add pat_ref_delete_by_ptr() which takes a valid pointer. It can be called by the function above after the pointer is found, and can also be used to roll back a failed insertion much more efficiently.	2020-11-05 19:27:09 +01:00
Willy Tarreau	a98b2882ac	CLEANUP: pattern: remove pat_delete_fcts[] and pattern_head->delete() These ones are not used anymore, so let's remove them to remove a bit of the complexity. The ACL keyword's delete() function could be removed as well, though most keyword declarations are positional and we have a high risk of introducing a mistake here, so let's not touch the ACL part.	2020-11-05 19:27:09 +01:00
Willy Tarreau	b35aa9b256	CLEANUP: acl: don't reference the generic pattern deletion function anymore A few ACL keyword used to reference pat_delete_gen() as the deletion function but this is not needed since it's the default one now. Let's just remove this reference.	2020-11-05 19:27:09 +01:00
Willy Tarreau	e828d8f0e8	MINOR: pattern: perform a single call to pat_delete_gen() under the expression When we're removing an element under the expression lock, we don't need anymore to run over all ->delete() functions via the expressions, since we know that the single function does it fine now. Note that at this point, pattern->delete() is not used at all through out the code anymore.	2020-11-05 19:27:09 +01:00
Willy Tarreau	f1c0892aa6	MINOR: pattern: remerge the list and tree deletion functions pat_del_tree_gen() was already chained onto pat_del_list_gen() to deal with remaining cases, so let's complete the merge and have a generic pattern deletion function acting on the reference and taking care of reliably removing all elements.	2020-11-05 19:27:09 +01:00
Willy Tarreau	78777ead32	MEDIUM: pattern: change the pat_del_* functions to delete from the references This is the next step in speeding up entry removal. Now we don't scan the whole lists or trees for elements pointing to the target reference, instead we start from the reference and delete all linked patterns. This simplifies some delete functions since we don't need anymore to delete multiple times from an expression since all nodes appear after the reference element. We can now have one generic list and one generic tree deletion function. This required the replacement of pattern_delete() with an open-coded version since we now need to lock all expressions first before proceeding. This means there is a high risk of lock inversion here but given that the expressions are always scanned in the same order from the same head, this must not happen. Now deleting first entries is instantaneous, and it's still slow to delete the last ones when looking up their ID since it still requires to look them up by a full scan, but it's already way faster than previously. Typically removing the last 10 IP from a 20M entries ACL with a full-scan each took less than 2 seconds. It would be technically possible to make use of indexed entries to speed up most lookups for removal by value (e.g. IP addresses) but that's for later.	2020-11-05 19:27:09 +01:00
Willy Tarreau	4bdd0a13d6	MEDIUM: pattern: link all final elements from the reference There is a data model issue in the current pattern design that makes pattern deletion extremely expensive: there's no direct way from a reference to access all indexed occurrences. As such, the only way to remove all indexed entries corresponding to a reference update is to scan all expressions's lists and trees to find a link to the reference. While this was possibly OK when map removal was not common and most maps were small, this is not conceivable anymore with GeoIP maps containing 10M+ entries and del-map operations that are triggered from http-request rulesets. This patch introduces two list heads from the pattern reference, one for the objects linked by lists and one for those linked by tree node. Ideally a single list would be enough but the linked elements are too much unrelated to be distinguished at the moment, so we'll need two lists. However for the long term a single-linked list will suffice but for now it's not possible due to the way elements are removed from expressions. As such this patch adds 32 bytes of memory usage per reference plus 16 per indexed entry, but both will be cut in half later. The links are not yet used for deletion, this patch only ensures the list is always consistent.	2020-11-05 19:27:09 +01:00
Willy Tarreau	6d8a68914e	MINOR: pattern: make the delete and prune functions more generic Now we have a single prune() function to act on an expression, and one delete function for the lists and one for the trees. The presence of a pointer in the lists is enough to warrant a free, and we rely on the PAT_SF_REGFREE flag to decide whether to free using free() or regfree().	2020-11-05 19:27:09 +01:00
Willy Tarreau	9b5c8bbc89	MINOR: pattern: new sflag PAT_SF_REGFREE indicates regex_free() is needed Currently we have no way to know how to delete/prune a pattern in a generic way. A pattern doesn't contain its own type so we don't know what function to call. Tree nodes are roughly OK but not lists where regex are possible. Let's add one new bit for sflags at index time to indicate that regex_free() will be needed upon deletion. It's not used for now.	2020-11-05 19:27:08 +01:00
Willy Tarreau	d4164dcd4a	CLEANUP: pattern: delete the back refs at once during pat_ref_reload() It's pointless to delete a backref and relink it to the next entry since the next entry is going to do the exact same and so on until all of them are deleted. Let's simply delete backrefs on reload.	2020-11-05 19:27:08 +01:00
Willy Tarreau	3ee0de1b41	MINOR: pattern: move the update revision to the pat_ref, not the expression It's not possible to uniquely update a single expression without updating the pattern reference, I don't know why we've put the revision in the expression back then, given that it in fact provides an update for a full pattern. Let's move the revision into the reference's head instead.	2020-11-05 19:27:08 +01:00
Willy Tarreau	114d698fde	MEDIUM: pattern: call malloc_trim() on pat_ref_reload() This is one case where we may release large amounts of data at once. Tests show that without this, after 10 full reloads of an ACL containing 1M IP addresses, the memory usage grew and stabilized around 1.7 GB of RSS. With this change, it stays around 260 MB and is stable across reloads.	2020-11-05 19:27:08 +01:00
Willy Tarreau	88366c2926	MEDIUM: pools: call malloc_trim() from pool_gc() If available it definitely makes sense to call it since it's also called when stopping to reclaim the maximum possible memory.	2020-11-05 19:27:08 +01:00
Baptiste Assmann	e279ca6bbe	MINOR: sample: Add converts to parses MQTT messages This patch implements a couple of converters to validate and extract data from a MQTT (Message Queuing Telemetry Transport) message. The validation consists of a few checks as well as "packet size" validation. The extraction can get any field from the variable header and the payload. This is limited to CONNECT and CONNACK packet types only. All other messages are considered as invalid. It is not a problem for now because only the first packet on each side can be parsed (CONNECT for the client and CONNACK for the server). MQTT 3.1.1 and 5.0 are supported. Reviewed and Fixed by Christopher Faulet <cfaulet@haproxy.com>	2020-11-05 19:27:03 +01:00
Baptiste Assmann	e138dda1e0	MINOR: sample: Add converters to parse FIX messages This patch implements a couple of converters to validate and extract tag value from a FIX (Financial Information eXchange) message. The validation consists in a few checks such as mandatory fields and checksum computation. The extraction can get any tag value based on a tag string or tag id. This patch requires the istend() function. Thus it depends on "MINOR: ist: Add istend() function to return a pointer to the end of the string". Reviewed and Fixed by Christopher Faulet <cfaulet@haproxy.com>	2020-11-05 19:26:30 +01:00
Ilya Shipitsin	0aa8c29460	BUILD: ssl: use feature macros for detecting ec curves manipulation support Let us use SSL_CTX_set1_curves_list, defined by OpenSSL, as well as in openssl-compat when SSL_CTRL_SET_CURVES_LIST is present (BoringSSL), for feature detection instead of versions.	2020-11-05 15:08:41 +01:00
William Lallemand	99e0bb997f	MINOR: mworker/cli: the master CLI use its own applet Following the patch b4daee ("MINOR: sock: add a check against cross worker<->master socket activities"), this patch adds a dedicated applet for the master CLI. It ensures that the CLI connection can't be used with the master rights in the case of bugs.	2020-11-05 10:28:53 +01:00
Willy Tarreau	21b9ff59b2	BUG/MEDIUM: server: make it possible to kill last idle connections In issue #933, @jaroslawr provided a report indicating that when using many threads and many servers, it's very difficult to terminate the last idle connections on each server. The issue has two causes in fact. The first one is that during the calculation of the estimate of needed connections, we round the computation up while in previous round it was already rounded up, so we end up adding 1 to 1 which once divided by 2 remains 1. The second issue is that servers are not woken up anymore for purging their connections if they don't have activity. The only reason that was there to wake them up again was in case insufficient connections were purged. And even then the purge task itself was not woken up. But that is not enough for getting rid of the long tail of old connections nor updating est_need_conns. This patch makes sure to properly wake up as long as at least one idle connection remains, and not to round up the needed connections anymore. Prior to this patch, a test involving many connections which suddenly stopped would keep many idle connections, now they're effectively halved every pool-purge-delay. This needs to be backported to 2.2.	2020-11-05 09:12:20 +01:00
Willy Tarreau	b4daeeb094	MINOR: sock: add a check against cross worker<->master socket activities Given that the previous issues caused spurious worker socket wakeups in the master for inherited FDs that couldn't be closed, let's add a strict test in the I/O callback to make sure that an accept() event is always caught by the appropriate type of process (master for master listeners, worker for worker listeners).	2020-11-04 15:05:50 +01:00
Christopher Faulet	fafd1b0a5b	CLEANUP: mux-h2: Remove the h1 parser state from the h2 stream Since the h2 multiplexer no longer relies on the legacy HTTP representation, and uses exclusively the HTX, the H1 parser state (h1m) is no longer used by the h2 streams. Thus it can be removed. This patch may be backported as far as 2.1.	2020-11-04 15:02:24 +01:00
Willy Tarreau	a4380b211f	MEDIUM: listeners: make use of fd_want_recv_safe() to enable early receivers We used to refrain from calling fd_want_recv() if fd_updt was not allocated but it's not the right solution as this does not allow the FD to be set. Instead, let's use the new fd_want_recv_safe() which will update the FD and create an update entry only if possible. In addition, the equivalent test before calling fd_stop_recv() was removed as totally useless since there's not fd_updt creation in this case.	2020-11-04 14:22:42 +01:00
Willy Tarreau	22ccd5ebaf	BUG/MEDIUM: listener: make the master also keep workers' inherited FDs In commit `374e9af35` ("MEDIUM: listener: let do_unbind_listener() decide whether to close or not") it didn't appear necessary to have the master process keep open the workers' inherited FDs. But this is actually necessary to handle the reload on "bind fd@foo" situations, otherwise the FD may be reassigned and the new socket cannot be set up, sometimes causing "socket operation on non-socket" or other types of errors. William found that this was the cause for the consistent failures of the abns regtest, which already used to fail very often before this and was as such marked as broken. Interestingly I didn't have this issue with my test configs because the FD number I used was higher and within the range of other listening sockets. But this means that one of these wouldn't work as expected. No backport is needed, this was introduced as part of the listeners rework in 2.3.	2020-11-04 14:22:42 +01:00
Willy Tarreau	59b5da4873	BUG/MEDIUM: listener: never suspend inherited sockets It is not acceptable to suspend an inherited socket because we'd kill its listening state, making it possibly unrecoverable for future processes. The situation which can trigger this is when there is an abns socket in a config and an inherited FD on another listener. Upon soft reload, the abns fails to bind, a SIGTTOU is sent to the old process which suspends everything, including the inherited FD, then the new process can bind and tell the old one to quit. Except that the new FD was not set back to the listen state, which is detected by listener_accept() which can pause it. It's only upon second reload that the FD works again. The solution is to refrain from suspending such FDs since we don't own them. And the next process will get them right anyway from its config. For now only TCP and UDP face this issue so it's better to address this on a protocol basis No backport is needed, this is related to the new listeners in 2.3.	2020-11-04 14:22:42 +01:00
Willy Tarreau	38dba27d4d	BUG/MEDIUM: listener: only enable a listening listener if needed The test on listener->state == LI_LISTEN is not sufficient to decide if we need to enable a listener. Indeed, there is a very special case which is the inherited FD shared, which has to reflect the real socket state even after the previous test, and as such needs to remain in LI_LISTEN state. In this case we don't want a worker to start the master's listener nor conversely. Let's add a specific test for this.	2020-11-04 14:22:42 +01:00
Willy Tarreau	dfe79251da	BUG/MEDIUM: stick-table: limit the time spent purging old entries An interesting case was reported with threads and moderately sized stick-tables. Sometimes the watchdog would trigger during the purge. It turns out that the stick tables were sized in the 10s of K entries which is the order of magnitude of the possible number of connections, and that threads were used over distinct NUMA nodes. While at first glance nothing looks problematic there, actually there is a risk that a thread trying to purge the table faces 100% of entries still in use by a connection with (ts->ref_cnt > 0), and ends up scanning the whole table, while other threads on the other NUMA node are causing the cache lines to bounce back and forth and considerably slow down its progress to the point of possibly spending hundreds of milliseconds there, multiplied by the number of queued threads all failing on the same point. Interestingly, smaller tables would not trigger it because the scan would be faster, and larger ones would not trigger it because plenty of entries would be idle! The most efficient solution is to increase the table size to be large enough for this never to happen, but this is not reliable. We could have a parallel list of idle entries but that would significantly increase the storage and processing cost only to improve a few rare corner cases. This patch takes a more pragmatic approach, it considers that it will not visit more than twice the number of nodes to be deleted, which means that it accepts to fail up to 50% of the time. Given that very small batches are programmed each time (1/256 of the table size), this means the operation will finish quickly (128 times faster than now), and will reduce the inter-thread contention. If this needs to be reconsidered, it will probably mean that the batch size needs to be fixed differently. This needs to be backported to stable releases which extensively use threads, typically 2.0. Kudos to Nenad Merdanovic for figuring the root cause triggering this!	2020-11-03 18:02:42 +01:00
Amaury Denoyelle	e6ee820c07	MINOR: stats: do not display empty stat module title on html If a stat module is not available on the current proxy scope, do not display its title on the related html box. This is clearer for the user.	2020-11-03 17:04:22 +01:00
Amaury Denoyelle	e7b891f7d3	MINOR: mux_h2: add stat for total count of connections/streams Add counters for total number of http2 connections/stream since haproxy startup. Contrary to open_conn/stream, they are never reset to zero.	2020-11-03 17:04:22 +01:00
Amaury Denoyelle	2ac34d97a6	MINOR: mux_h2: capitalize frame type in stats http/2 frame type names are capitalized in the rfc, use the same notation on the stats labels.	2020-11-03 17:04:22 +01:00
Christopher Faulet	743bd6adc8	BUG/MINOR: filters: Skip disabled proxies during startup only This partially reverts the patch `400829cd2` ("BUG/MEDIUM: filters: Don't try to init filters for disabled proxies"). Disabled proxies must not be skipped in flt_deinit() and flt_deinit_all_per_thread() when HAProxy is stopped because, obvioulsy, at this step, all proxies appear as disabled (or stopped, it is the same state). It is safe to do so because, during startup, filters declared on disabled proxies are removed. Thus they don't exist anymore during shutdown. This patch must be backported in all versions where the patch above is.	2020-11-03 16:51:48 +01:00
Ilya Shipitsin	04a5a440b8	BUILD: ssl: use HAVE_OPENSSL_KEYLOG instead of OpenSSL versions let us use HAVE_OPENSSL_KEYLOG for feature detection instead of versions	2020-11-03 14:54:15 +01:00
Christopher Faulet	5a7ca29061	BUG/MEDIUM: mux-pt: Release the tasklet during an HTTP upgrade When a TCP connection is upgraded to HTTP, the passthrough multiplexer owning the client connection is detroyed and replaced by an HTTP multiplexer. When it happens, the connection context is changed (it is in fact the mux itself). Thus, when the mux-pt is destroyed, the connection is not released. But, only the connection must be kept. Everything else concerning the mux must be released. Especially, the tasklet used for I/O subscriptions. In this part, there was a bug and the tasklet was never released. This patch should fix the issue #935. It must be backported as far as 2.0.	2020-11-03 10:50:00 +01:00
Christopher Faulet	75bef00538	MINOR: server: Copy configuration file and line for server templates When servers based on server templates are initialized, the configuration file and line are now copied. This helps to emit understandable warning and alert messages. This patch may be backported if needed, as far as 1.8.	2020-11-03 10:44:38 +01:00
Christopher Faulet	ac1c60fd9c	BUG/MINOR: server: Set server without addr but with dns in RMAINT on startup On startup, if a server has no address but the dns resolutions are configured, "none" method is added to the default init-addr methods, in addition to "last" and "libc". Thus on startup, this server is set to RMAINT mode if no address is found. It is only performed if no other init-addr method is configured. Setting the RMAINT mode on startup is important to inhibit the health checks. For instance, following servers will now be set to RMAINT mode on startup : server srv nofound.tld:80 check resolvers mydns server srv _http._tcp.service.local check resolvers mydns server-template srv 1-3 _http._tcp.service.local check resolvers mydns while followings ones will trigger an error : server srv nofound.tld:80 check server srv nofound.tld:80 check resolvers mydns init-addr libc server srv _http._tcp.service.local check server srv _http._tcp.service.local check resolvers mydns init-addr libc server-template srv 1-3 _http._tcp.service.local check resolvers mydns init-addr libc This patch must be backported as far as 1.8.	2020-11-03 10:44:26 +01:00
Christopher Faulet	5e29376efb	BUG/MINOR: checks: Report a socket error before any connection attempt When a health-check fails, if no connection attempt was performed, a socket error must be reported. But this was only done if the connection was not allocated. It must also be done if there is no control layer. Otherwise, a L7TOUT will be reported instead. It is possible to not having a control layer for a connection if the connection address family is invalid or not defined. This patch must be backported to 2.2.	2020-11-03 10:23:00 +01:00
Christopher Faulet	d5bd824b81	BUG/MINOR: proxy/server: Skip per-proxy/server post-check for disabled proxies per-proxy and per-server post-check callback functions must be skipped for disabled proxies because most of the configuration validity check is skipped for these proxies. This patch must be backported as far as 2.1.	2020-11-03 10:23:00 +01:00
Christopher Faulet	400829cd2c	BUG/MEDIUM: filters: Don't try to init filters for disabled proxies Configuration is parsed for such proxies but not validated. Concretely, it means check_config_validity() function does almost nothing for such proxies. Thus, we must be careful to not initialize filters for disabled proxies because the check callback function is not called. In fact, to be sure to avoid any trouble, filters for disabled proxies are released. This patch fixes a segfault at startup if the SPOE is configured for a disabled proxy. It must be backported as far as 1.7 (maybe with some adaptations).	2020-11-03 10:23:00 +01:00
Ilya Shipitsin	c9dfee43f3	BUILD: ssl: use SSL_CTRL_GET_RAW_CIPHERLIST instead of OpenSSL versions let us use SSL_CTRL_GET_RAW_CIPHERLIST for feature detection instead of versions [wla: SSL_CTRL_GET_RAW_CIPHERLIST was introduced by OpenSSL commit 94a209 along with SSL_CIPHER_find. It was removed in boringSSL.] Signed-off-by: William Lallemand <wlallemand@haproxy.org>	2020-11-03 09:24:43 +01:00
Willy Tarreau	a5bbaaf9f4	CLEANUP: pattern: fix spelling/grammatical/copy-paste in comments The code is horrible to work with because most functions are documented with misleading comments resulting from many spelling and grammatical mistakes, and plenty of remains of copy-paste mentioning arguments that do not exist and return values that are never set. Too many hours wasted writing non-working code because of assumptions resulting from this, let's fix this once for all now!	2020-10-31 13:14:10 +01:00
Willy Tarreau	8135d9bc0c	CLEANUP: pattern: use calloc() rather than malloc for structures It's particularly difficult to make sure that the various pattern structures are properly initialized given that they can be allocated at multiple places and systematically via malloc() instead of calloc(), thus not even leaving the possibility of default values. Let's adjust a few of them.	2020-10-31 13:14:10 +01:00
Willy Tarreau	6bedf151e1	MINOR: pattern: export pat_ref_push() Strangely this one was marked static inline within the file itself. Let's export it.	2020-10-31 13:13:48 +01:00
Willy Tarreau	6a1740767c	MINOR: pattern: make pat_ref_add() rely on pat_ref_append() Let's remove unneeded code duplication, both are exactly the same.	2020-10-31 13:13:48 +01:00
Willy Tarreau	f4edb72e0a	MINOR: pattern: make pat_ref_append() return the newly added element It's more convenient to return the element than to return just 0 or 1, as the next thing we'll want to do is to act on this element! In addition it was using variable arguments instead of consts, causing some reuse constraints which were also addressed. This doesn't change its use as a boolean, hence why call places were not modified.	2020-10-31 13:13:48 +01:00
Remi Tricot-Le Breton	8c2db71326	BUG/MINOR: cache: Inverted variables in http_calc_maxage function The maxage and smaxage variables were inadvertently assigned the Cache-Control s-maxage and max-age values respectively when it should have been the other way around. This can be backported on all branches after 1.8 (included).	2020-10-30 14:29:29 +01:00
Remi Tricot-Le Breton	40ed97b04b	BUG/MINOR: cache: Manage multiple values in cache-control header value If an HTTP request or response had a "Cache-Control" header that had multiple comma-separated subparts in its value (like "max-age=1, no-store" for instance), we did not process the values correctly and only parsed the first one. That made us store some HTTP responses in the cache when they were explicitely uncacheable. This patch replaces the way the values are parsed by an http_find_header loop that manages every sub part of the value independently. This patch should be backported to 2.2 and 2.1. The bug also exists on previous versions but since the sources changed, a new commit will have to be created. [wla: This patch requires `bb4582c` ("MINOR: ist: Add a case insensitive istmatch function"). Backporting for < 2.1 is not a requirement since it works well enough for most cases, it was a known limitation of the implementation of non-htx version too]	2020-10-30 13:28:34 +01:00
Remi Tricot-Le Breton	a6476114ec	MINOR: cache: Add Expires header value parsing When no Cache-Control max-age or s-maxage information is present in a cached response, we need to parse the Expires header value (RFC 7234#5.3). An invalid Expires date value or a date earlier than the reception date will make the cache_entry stale upon creation. For now, the Cache-Control and Expires headers are parsed after the insertion of the response in the cache so even if the parsing of the Expires results in an already stale entry, the entry will exist in the cache.	2020-10-30 11:08:38 +01:00
Amaury Denoyelle	bc0af6a199	BUG/MINOR: lua: initialize sample before using it Memset the sample before using it through hlua_lua2smp. This function is ORing the smp.flags, so this field need to be cleared before its use. This was reported by a coverity warning. Fixes the github issue #929. This bug can be backported up to 1.8.	2020-10-29 18:52:44 +01:00
Amaury Denoyelle	e6ba7915eb	BUG/MINOR: server: fix down_time report for stats Adjust condition used to report down_time for statistics. There was a tiny probabilty to have a negative downtime if last_change was superior to now. If this is the case, return only down_time. This bug can backported up to 1.8.	2020-10-29 18:52:39 +01:00
Amaury Denoyelle	fe2bf091f6	BUG/MINOR: server: fix srv downtime calcul on starting When a server is up after a failure, its downtime was reset to 0 on the statistics. This is due to a wrong condition that causes srv.down_time to never be set. Fix this by updating down_time each time the server is in STARTING state. Fixes the github issue #920. This bug can be backported up to 1.8.	2020-10-29 18:52:18 +01:00
Amaury Denoyelle	66942c1d4d	MINOR: mux-h2: count open connections/streams on stats Implement as a gauge h2 counters for currently open connections and streams. The counters are decremented when closing the stream or the connection.	2020-10-28 08:55:23 +01:00
Amaury Denoyelle	a8879238ce	MINOR: mux-h2: report detected error on stats Implement counters for h2 protocol error on connection or stream level. Also count the total number of rst_stream and goaway frames sent by the mux in response to a detected error.	2020-10-28 08:55:19 +01:00
Amaury Denoyelle	2dec1ebec2	MINOR: mux-h2: add stats for received frame types Implement counters for h2 frame received based on their type for HEADERS, DATA, SETTINGS, RST_STREAM and GOAWAY.	2020-10-28 08:55:16 +01:00
Amaury Denoyelle	c92697d977	MINOR: mux-h2: add counters instance to h2c Add pointer to counters as a member for h2c structure. This pointer is initialized on h2_init function. This is useful to quickly access and manipulate the counters inside every h2 functions.	2020-10-28 08:55:11 +01:00
Amaury Denoyelle	3238b3f906	MINOR: mux-h2: register a stats module Use statistics API to register a new stats module generating counters on h2 module. The counters are attached to frontend/backend instances.	2020-10-28 08:55:07 +01:00
Remi Tricot-Le Breton	bf97121f1c	MINOR: cache: Create res.cache_hit and res.cache_name sample fetches Res.cache_hit sample fetch returns a boolean which is true when the HTTP response was built out of a cache. The cache's name is returned by the res.cache_name sample_fetch. This resolves GitHub issue #900.	2020-10-27 18:25:43 +01:00
Remi Tricot-Le Breton	53161d81b8	MINOR: cache: Process the If-Modified-Since header in conditional requests If a client sends a conditional request containing an If-Modified-Since header (and no If-None-Match header), we try to compare the date with the one stored in the cache entry (coming either from a Last-Modified head, or a Date header, or corresponding to the first response's reception time). If the request's date is earlier than the stored one, we send a "304 Not Modified" response back. Otherwise, the stored is sent (through a 200 OK response). This resolves GitHub issue #821.	2020-10-27 18:10:25 +01:00
Remi Tricot Le Breton	27091b4dd0	MINOR: cache: Store the "Last-Modified" date in the cache_entry In order to manage "If-Modified-Since" requests, we need to keep a reference time for our cache entries (to which the conditional request's date will be compared). This reference is either extracted from the "Last-Modified" header, or the "Date" header, or the reception time of the response (in decreasing order of priority). The date values are converted into seconds since epoch in order to ease comparisons and to limit storage space.	2020-10-27 18:10:25 +01:00
Tim Duesterhus	e0142340b2	BUG/MINOR: cache: Check the return value of http_replace_res_status Send the full body if the status `304` cannot be applied. This should be the most graceful failure. Specific for 2.3, no backport needed.	2020-10-27 17:01:49 +01:00
Ilya Shipitsin	b9b84a4b25	BUILD: ssl: more elegant OpenSSL early data support check BorinSSL pretends to be 1.1.1 version of OpenSSL. It messes some version based feature presense checks. For example, OpenSSL specific early data support. Let us change that feature detction to SSL_READ_EARLY_DATA_SUCCESS macro check instead of version comparision.	2020-10-27 13:08:32 +01:00
Willy Tarreau	a0133fcf35	BUG/MINOR: log: fix risk of null deref on error path Previous commit `ae32ac74db` ("BUG/MINOR: log: fix memory leak on logsrv parse error") addressed one issue and introduced another one, the logsrv pointer may also be null at the end of the function so we must test it before deciding to dereference it. This should be backported along with the patch above to 2.2.	2020-10-27 10:35:32 +01:00
Willy Tarreau	ae32ac74db	BUG/MINOR: log: fix memory leak on logsrv parse error In case of parsing error on logsrv, we can leave parse_logsrv() without releasing logsrv->ring_name or smp_rgs. Let's free them on the error path. This should fix issue #926 detected by Coverity. The impact is only a tiny leak just before reporting a fatal error, so it will essentially annoy valgrind. This can be backported to 2.0 (just drop the ring part).	2020-10-27 09:55:00 +01:00
Emmanuel Hocdet	a73a222a98	BUG/MEDIUM: ssl: OCSP must work with BoringSSL It's a regression from `b3201a3e` "BUG/MINOR: disable dynamic OCSP load with BoringSSL". The origin bug is link to `76b4a12` "BUG/MEDIUM: ssl: memory leak of ocsp data at SSL_CTX_free()": ssl_sock_free_ocsp() shoud be in #ifndef OPENSSL_IS_BORINGSSL. To avoid long #ifdef for small code, the BoringSSL part for ocsp load is isolated in a simple #ifdef. This must be backported in 2.2 and 2.1	2020-10-27 09:38:51 +01:00
William Dauchy	5e10e44bce	CLEANUP: http_ana: remove unused assignation of `att_beg` `att_beg` is assigned to `next` at the end of the `for` loop, but is assigned to `prev` at the beginning of the loop, which is itself assigned to `next` after each loop. So it represents a double assignation for the same value. Also `att_beg` is not used after the end of the loop. this is a partial fix for github issue #923, all the others could probably be marked as intentional to protect future changes. no backport needed. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2020-10-26 15:00:09 +01:00
Willy Tarreau	b3250a268b	BUG/MINOR: extcheck: add missing checks on extchk_setenv() Issue #910 reports that we fail to check a few extchk_setenv() in the child process. These are mostly harmless, but instead of counting on the external check script to fail the dirty way, better fail cleanly when detecting the failure. This could probably be backported to all stable branches.	2020-10-24 13:07:39 +02:00
Willy Tarreau	5472aa50f1	BUG/MEDIUM: queue: fix unsafe proxy pointer when counting nbpend As reported by Coverity in issue #917, commit `96bca33` ("OPTIM: queue: decrement the nbpend and totpend counters outside of the lock") introduced a bug when moving the increments outside of the loop, because we can't always rely on the pendconn "p" here as it may be null. We can retrieve the proxy pointer directly from s->proxy instead. The same is true for pendconn_redistribute(), though the last "p" pointer there was still valid. This patch fixes both. No backport is needed, this was introduced just before 2.3-dev8.	2020-10-24 12:57:41 +02:00
Willy Tarreau	bd71510024	MINOR: stats: report server's user-configured weight next to effective weight The "weight" column on the stats page is somewhat confusing when using slowstart becaue it reports the effective weight, without being really explicit about it. In some situations the user-configured weight is more relevant (especially with long slowstarts where it's important to know if the configured weight is correct). This adds a new uweight stat which reports a server's user-configured weight, and in a backend it receives the sum of all servers' uweights. In addition it adds the mention of "effective" in a few descriptions for the "weight" column (help and doc). As a result, the list of servers in a backend is now always scanned when dumping the stats. But this is not a problem given that these servers are already scanned anyway and for way heavier processing.	2020-10-23 22:47:30 +02:00
William Lallemand	089c13850f	MEDIUM: ssl: ssl-load-extra-del-ext work only with .crt In order to be compatible with the "set ssl cert" command of the CLI, this patch restrict the ssl-load-extra-del-ext to files with a ".crt" extension in the configuration. Related to issue #785. Should be backported where `8e8581e` ("MINOR: ssl: 'ssl-load-extra-del-ext' removes the certificate extension") was backported.	2020-10-23 18:41:08 +02:00
Willy Tarreau	2fbe6940f4	MINOR: stats: indicate the number of servers in a backend's status When dumping the stats page (or the CSV output), when many states are mixed, it's hard to figure the number of up servers. But when showing only the "up" servers or hiding the "maint" servers, there's no way to know how many servers are configured, which is problematic when trying to update server-templates. What this patch does, for dumps in "up" or "no-maint" modes, is to add after the backend's "UP" or "DOWN" state "(%d/%d)" indicating the number of servers seen as UP to the total number of servers in the backend. As such, seeing "UP (33/39)" immediately tells that there are 6 servers that are not listed when using "up", or will let the client figure how many servers are left once deducted the number of non-maintenance ones. It's not done on default dumps so as not to disturb existing tools, which already have all the information they need in the dump.	2020-10-23 18:11:30 +02:00
Willy Tarreau	3e32036701	MINOR: stats: also support a "no-maint" show stat modifier "no-maint" is a bit similar to "up" except that it will only hide servers that are in maintenance (or disabled in the configuration), and not those that are enabled but failed a check. One benefit here is to significantly reduce the output of the "show stat" command when using large server-templates containing entries that are not yet provisioned. Note that the prometheus exporter also has such an option which does the exact same.	2020-10-23 18:11:24 +02:00
Willy Tarreau	65141ffc4f	MINOR: stats: support the "up" output modifier for "show stat" We already had it on the HTTP interface but it was not accessible on the CLI. It can be very convenient to hide servers which are down, do not resolve, or are in maintenance.	2020-10-23 18:11:24 +02:00

... 6 7 8 9 10 ...

10974 Commits