Following the patch b4daee ("MINOR: sock: add a check against cross
worker<->master socket activities"), this patch adds a dedicated applet
for the master CLI. It ensures that the CLI connection can't be
used with master rights in case of bugs.
In issue #933, @jaroslawr provided a report indicating that when using
many threads and many servers, it's very difficult to terminate the last
idle connections on each server. The issue has two causes in fact. The
first one is that during the calculation of the estimate of needed
connections, we round the computation up while the previous round had
already rounded it up, so we end up adding 1 to 1, which once divided by 2
remains 1. The second issue is that servers are not woken up anymore for
purging their connections if they don't have activity. The only reason
that was there to wake them up again was in case insufficient connections
were purged. And even then the purge task itself was not woken up. But
that is not enough for getting rid of the long tail of old connections
nor updating est_need_conns.
This patch makes sure to properly wake up as long as at least one idle
connection remains, and not to round up the needed connections anymore.
Prior to this patch, a test involving many connections which suddenly
stopped would keep many idle connections, now they're effectively halved
every pool-purge-delay.
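A minimal sketch of the arithmetic, with hypothetical names (not the
exact HAProxy code):

    /* Rounding up on every pass prevents the estimate from decaying:
     * with est=1 and used=0, (1 + 0 + 1) / 2 stays at 1 forever, while
     * rounding down lets the long tail of idle connections reach zero. */
    static unsigned int update_est_need_conns(unsigned int est, unsigned int used)
    {
        /* before: return (est + used + 1) / 2;  -- rounded up again */
        return (est + used) / 2;   /* round down so the estimate decays */
    }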
This needs to be backported to 2.2.
Given that the previous issues caused spurious worker socket wakeups in
the master for inherited FDs that couldn't be closed, let's add a strict
test in the I/O callback to make sure that an accept() event is always
caught by the appropriate type of process (master for master listeners,
worker for worker listeners).
Since the h2 multiplexer no longer relies on the legacy HTTP representation, and
uses exclusively the HTX, the H1 parser state (h1m) is no longer used by the h2
streams. Thus it can be removed.
This patch may be backported as far as 2.1.
We used to refrain from calling fd_want_recv() if fd_updt was not allocated
but it's not the right solution as this does not allow the FD to be set.
Instead, let's use the new fd_want_recv_safe() which will update the FD and
create an update entry only if possible. In addition, the equivalent test
before calling fd_stop_recv() was removed as totally useless since there's
no fd_updt creation in this case.
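For illustration, a behavioral sketch of fd_want_recv_safe() as described
above (the helper name is hypothetical; the real code manipulates fdtab
state bits directly):

    static inline void fd_want_recv_safe(int fd)
    {
        fd_mark_recv_active(fd);        /* hypothetical: set the event */
        if (fd_updt)                    /* update list allocated yet? */
            fd_updt[fd_nbupdt++] = fd;  /* then queue an update entry */
    }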
In commit 374e9af35 ("MEDIUM: listener: let do_unbind_listener() decide
whether to close or not") it didn't appear necessary to have the master
process keep open the workers' inherited FDs. But this is actually
necessary to handle the reload on "bind fd@foo" situations, otherwise
the FD may be reassigned and the new socket cannot be set up, sometimes
causing "socket operation on non-socket" or other types of errors.
William found that this was the cause for the consistent failures of the
abns regtest, which already used to fail very often before this and was
as such marked as broken.
Interestingly I didn't have this issue with my test configs because
the FD number I used was higher and within the range of other listening
sockets. But this means that one of these wouldn't work as expected.
No backport is needed, this was introduced as part of the listeners
rework in 2.3.
It is not acceptable to suspend an inherited socket because we'd kill
its listening state, making it possibly unrecoverable for future
processes. The situation which can trigger this is when there is an
abns socket in a config and an inherited FD on another listener. Upon
soft reload, the abns fails to bind, a SIGTTOU is sent to the old
process which suspends everything, including the inherited FD, then
the new process can bind and tell the old one to quit. Except that the
new FD was not set back to the listen state, which is detected by
listener_accept() which can pause it. It's only upon second reload
that the FD works again.
The solution is to refrain from suspending such FDs since we don't own
them. And the next process will get them right anyway from its config.
For now only TCP and UDP face this issue so it's better to address it
on a per-protocol basis.
No backport is needed, this is related to the new listeners in 2.3.
The test on listener->state == LI_LISTEN is not sufficient to decide
if we need to enable a listener. Indeed, there is a very special case,
the shared inherited FD, which has to reflect the real socket state
even after the previous test, and as such needs to remain in the
LI_LISTEN state. In this case we don't want a worker to start the
master's listener nor the other way around. Let's add a specific test for this.
An interesting case was reported with threads and moderately sized
stick-tables. Sometimes the watchdog would trigger during the purge.
It turns out that the stick tables were sized in the 10s of K entries
which is the order of magnitude of the possible number of connections,
and that threads were used over distinct NUMA nodes. While at first
glance nothing looks problematic there, actually there is a risk that
a thread trying to purge the table faces 100% of entries still in use
by a connection with (ts->ref_cnt > 0), and ends up scanning the whole
table, while other threads on the other NUMA node are causing the
cache lines to bounce back and forth and considerably slow down its
progress to the point of possibly spending hundreds of milliseconds
there, multiplied by the number of queued threads all failing on the
same point.
Interestingly, smaller tables would not trigger it because the scan
would be faster, and larger ones would not trigger it because plenty
of entries would be idle!
The most efficient solution is to increase the table size to be large
enough for this never to happen, but this is not reliable. We could
have a parallel list of idle entries but that would significantly
increase the storage and processing cost only to improve a few rare
corner cases.
This patch takes a more pragmatic approach, it considers that it will
not visit more than twice the number of nodes to be deleted, which
means that it accepts to fail up to 50% of the time. Given that very
small batches are programmed each time (1/256 of the table size), this
means the operation will finish quickly (128 times faster than now),
and will reduce the inter-thread contention. If this needs to be
reconsidered, it will probably mean that the batch size needs to be
fixed differently.
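A rough sketch of the bounded scan, with illustrative names and a
simplified list walk (the real code scans the table's tree under a lock):

    struct entry *e, *n;
    unsigned int to_kill  = table_size / 256;  /* small batch each run */
    unsigned int max_scan = to_kill * 2;       /* accept 50% misses */

    for (e = first_entry; e && to_kill && max_scan; e = n) {
        n = next_entry(e);                     /* hypothetical iterator */
        max_scan--;
        if (e->ref_cnt > 0)
            continue;                          /* pinned by a connection */
        delete_entry(e);                       /* hypothetical */
        to_kill--;
    }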
This needs to be backported to stable releases which extensively use
threads, typically 2.0.
Kudos to Nenad Merdanovic for figuring the root cause triggering this!
This partially reverts the patch 400829cd2 ("BUG/MEDIUM: filters: Don't try to
init filters for disabled proxies"). Disabled proxies must not be skipped in
flt_deinit() and flt_deinit_all_per_thread() when HAProxy is stopped because,
obviously, at this step, all proxies appear as disabled (or stopped, it is the
same state). It is safe to do so because, during startup, filters declared on
disabled proxies are removed. Thus they don't exist anymore during shutdown.
This patch must be backported in all versions where the patch above is.
When a TCP connection is upgraded to HTTP, the passthrough multiplexer owning
the client connection is destroyed and replaced by an HTTP multiplexer. When it
happens, the connection context is changed (it is in fact the mux itself). Thus,
when the mux-pt is destroyed, the connection is not released. But only the
connection itself must be kept; everything else concerning the mux must be
released, especially the tasklet used for I/O subscriptions. In this part,
there was a bug and the tasklet was never released.
This patch should fix the issue #935. It must be backported as far as 2.0.
When servers based on server templates are initialized, the configuration file
and line are now copied. This helps to emit understandable warning and alert
messages.
This patch may be backported if needed, as far as 1.8.
On startup, if a server has no address but DNS resolution is configured, the
"none" method is added to the default init-addr methods, in addition to "last"
and "libc". Thus on startup, this server is set to RMAINT mode if no address is
found. It is only performed if no other init-addr method is configured.
Setting the RMAINT mode on startup is important to inhibit the health checks.
For instance, the following servers will now be set to RMAINT mode on
startup:

    server srv nofound.tld:80 check resolvers mydns
    server srv _http._tcp.service.local check resolvers mydns
    server-template srv 1-3 _http._tcp.service.local check resolvers mydns

while the following ones will trigger an error:

    server srv nofound.tld:80 check
    server srv nofound.tld:80 check resolvers mydns init-addr libc
    server srv _http._tcp.service.local check
    server srv _http._tcp.service.local check resolvers mydns init-addr libc
    server-template srv 1-3 _http._tcp.service.local check resolvers mydns init-addr libc
This patch must be backported as far as 1.8.
When a health-check fails, if no connection attempt was performed, a socket
error must be reported. But this was only done if the connection was not
allocated. It must also be done if there is no control layer. Otherwise, an
L7TOUT will be reported instead.
A connection may have no control layer if its address family is invalid or
not defined.
This patch must be backported to 2.2.
Per-proxy and per-server post-check callback functions must be skipped for
disabled proxies because most of the configuration validity check is skipped for
these proxies.
This patch must be backported as far as 2.1.
Configuration is parsed for such proxies but not validated. Concretely, it means
the check_config_validity() function does almost nothing for such proxies. Thus,
we must be careful not to initialize filters for disabled proxies because the check
callback function is not called. In fact, to be sure to avoid any trouble,
filters for disabled proxies are released.
This patch fixes a segfault at startup if the SPOE is configured for a disabled
proxy. It must be backported as far as 1.7 (maybe with some adaptations).
Let us use SSL_CTRL_GET_RAW_CIPHERLIST for feature detection instead
of version comparisons.
[wla: SSL_CTRL_GET_RAW_CIPHERLIST was introduced by OpenSSL commit
94a209 along with SSL_CIPHER_find. It was removed in BoringSSL.]
Signed-off-by: William Lallemand <wlallemand@haproxy.org>
The code is horrible to work with because most functions are documented
with misleading comments resulting from many spelling and grammatical
mistakes, and plenty of remains of copy-paste mentioning arguments that
do not exist and return values that are never set. Too many hours wasted
writing non-working code because of assumptions resulting from this,
let's fix this once for all now!
It's particularly difficult to make sure that the various pattern
structures are properly initialized given that they can be allocated
at multiple places and systematically via malloc() instead of calloc(),
thus not even leaving the possibility of default values. Let's adjust
a few of them.
It's more convenient to return the element than to return just 0 or 1,
as the next thing we'll want to do is to act on this element! In addition
it was taking variable pointers instead of const ones, causing some reuse
constraints, which were also addressed. This doesn't change its use as
a boolean, hence why call places were not modified.
The maxage and smaxage variables were inadvertently assigned the
Cache-Control s-maxage and max-age values respectively when it should
have been the other way around.
This can be backported on all branches after 1.8 (included).
If an HTTP request or response had a "Cache-Control" header that had
multiple comma-separated subparts in its value (like "max-age=1,
no-store" for instance), we did not process the values correctly and
only parsed the first one. That made us store some HTTP responses in the
cache when they were explicitly uncacheable.
This patch replaces the way the values are parsed by an http_find_header
loop that manages every subpart of the value independently.
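For illustration, such a loop looks like this (simplified; value parsing
for max-age and friends is omitted):

    /* With full=0, http_find_header() returns each comma-separated
     * element of the header's value in turn, so "max-age=1, no-store"
     * yields "max-age=1" then "no-store". */
    struct http_hdr_ctx ctx = { .blk = NULL };

    while (http_find_header(htx, ist("cache-control"), &ctx, 0)) {
        if (isteqi(ctx.value, ist("no-store")))
            return 0;   /* uncacheable, no need to go further */
        /* handle max-age=..., s-maxage=..., etc. on ctx.value */
    }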
This patch should be backported to 2.2 and 2.1. The bug also exists on
previous versions but since the sources changed, a new commit will have
to be created.
[wla: This patch requires bb4582c ("MINOR: ist: Add a case insensitive
istmatch function"). Backporting for < 2.1 is not a requirement since it
works well enough for most cases; it was a known limitation of the
non-HTX implementation too]
When no Cache-Control max-age or s-maxage information is present in a
cached response, we need to parse the Expires header value (RFC 7234#5.3).
An invalid Expires date value or a date earlier than the reception date
will make the cache_entry stale upon creation.
For now, the Cache-Control and Expires headers are parsed after the
insertion of the response in the cache so even if the parsing of the
Expires results in an already stale entry, the entry will exist in the
cache.
Memset the sample before using it through hlua_lua2smp(). This function
ORs smp.flags, so this field needs to be cleared before use.
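The enforced pattern, simplified:

    /* inside the Lua-to-sample conversion path */
    struct sample smp;

    memset(&smp, 0, sizeof(smp));  /* hlua_lua2smp() ORs into smp.flags */
    hlua_lua2smp(L, -1, &smp);     /* convert the value on top of stack */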
This was reported by a coverity warning.
Fixes the github issue #929.
This bug can be backported up to 1.8.
Adjust the condition used to report down_time for statistics. There was a
tiny probability of having a negative downtime if last_change was greater
than now. If this is the case, return only down_time.
This bug can be backported up to 1.8.
When a server is back up after a failure, its downtime was reset to 0 in
the statistics. This is due to a wrong condition that caused srv.down_time
to never be set. Fix this by updating down_time each time the server is in
STARTING state.
Fixes the github issue #920.
This bug can be backported up to 1.8.
Implement counters for H2 protocol errors at connection and stream level.
Also count the total number of RST_STREAM and GOAWAY frames sent by the
mux in response to a detected error.
Add a pointer to the counters as a member of the h2c structure. This
pointer is initialized in the h2_init() function. This is useful to
quickly access and manipulate the counters inside the various h2 functions.
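For illustration, the counter group may be sketched like this (field
names here are illustrative, not the exact ones):

    struct h2_counters {
        long long conn_proto_err;   /* protocol errors at connection level */
        long long strm_proto_err;   /* protocol errors at stream level */
        long long rst_stream_resp;  /* RST_STREAM frames sent on error */
        long long goaway_resp;      /* GOAWAY frames sent on error */
    };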
The res.cache_hit sample fetch returns a boolean which is true when the
HTTP response was built out of a cache. The cache's name is returned by
the res.cache_name sample fetch.
This resolves GitHub issue #900.
If a client sends a conditional request containing an If-Modified-Since
header (and no If-None-Match header), we try to compare the date with
the one stored in the cache entry (coming either from a Last-Modified
header, or a Date header, or corresponding to the first response's
reception time). If the stored date is not more recent than the request's
date, we send a "304 Not Modified" response back. Otherwise, the stored
response is sent (through a 200 OK response).
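In short, with both dates converted to seconds since the epoch (as the
next patch describes), the decision boils down to this sketch with
hypothetical helper names:

    if (stored_date <= ims_date)
        reply_not_modified();   /* 304: unchanged since that date */
    else
        reply_from_cache();     /* 200: full cached response */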
This resolves GitHub issue #821.
In order to manage "If-Modified-Since" requests, we need to keep a
reference time for our cache entries (to which the conditional request's
date will be compared).
This reference is either extracted from the "Last-Modified" header, or
the "Date" header, or the reception time of the response (in decreasing
order of priority).
The date values are converted into seconds since epoch in order to ease
comparisons and to limit storage space.
BoringSSL pretends to be the 1.1.1 version of OpenSSL. It messes up some
version-based feature presence checks, for example the OpenSSL-specific
early data support.
Let us change that feature detection to an SSL_READ_EARLY_DATA_SUCCESS
macro check instead of a version comparison.
Previous commit ae32ac74db ("BUG/MINOR: log: fix memory leak on logsrv
parse error") addressed one issue and introduced another one, the logsrv
pointer may also be null at the end of the function so we must test it
before deciding to dereference it.
This should be backported along with the patch above to 2.2.
In case of parsing error on logsrv, we can leave parse_logsrv() without
releasing logsrv->ring_name or smp_rgs. Let's free them on the error path.
This should fix issue #926 detected by Coverity.
The impact is only a tiny leak just before reporting a fatal error, so it
will essentially annoy valgrind.
This can be backported to 2.0 (just drop the ring part).
It's a regression from b3201a3e ("BUG/MINOR: disable dynamic OCSP load
with BoringSSL"). The original bug is linked to 76b4a12 ("BUG/MEDIUM: ssl:
memory leak of ocsp data at SSL_CTX_free()"): ssl_sock_free_ocsp()
should be in #ifndef OPENSSL_IS_BORINGSSL.
To avoid a long #ifdef around small code, the BoringSSL part for the ocsp
load is isolated in a simple #ifdef.
This must be backported to 2.2 and 2.1.
`att_beg` is assigned to `next` at the end of the `for` loop, but is
assigned to `prev` at the beginning of the loop, which is itself
assigned to `next` after each iteration. So the same value gets assigned
twice. Also `att_beg` is not used after the end of the loop.
This is a partial fix for github issue #923; all the others could
probably be marked as intentional to protect future changes.
No backport needed.
Signed-off-by: William Dauchy <wdauchy@gmail.com>
Issue #910 reports that we fail to check a few extchk_setenv() in the
child process. These are mostly harmless, but instead of counting on
the external check script to fail the dirty way, better fail cleanly
when detecting the failure.
This could probably be backported to all stable branches.
As reported by Coverity in issue #917, commit 96bca33 ("OPTIM: queue:
decrement the nbpend and totpend counters outside of the lock")
introduced a bug when moving the increments outside of the loop,
because we can't always rely on the pendconn "p" here as it may
be null. We can retrieve the proxy pointer directly from s->proxy
instead. The same is true for pendconn_redistribute(), though the
last "p" pointer there was still valid. This patch fixes both.
No backport is needed, this was introduced just before 2.3-dev8.
The "weight" column on the stats page is somewhat confusing when using
slowstart because it reports the effective weight, without being really
explicit about it. In some situations the user-configured weight is more
relevant (especially with long slowstarts where it's important to know
if the configured weight is correct).
This adds a new uweight stat which reports a server's user-configured
weight, and in a backend it receives the sum of all servers' uweights.
In addition it adds the mention of "effective" in a few descriptions
for the "weight" column (help and doc).
As a result, the list of servers in a backend is now always scanned
when dumping the stats. But this is not a problem given that these
servers are already scanned anyway and for way heavier processing.
In order to be compatible with the "set ssl cert" command of the CLI,
this patch restricts ssl-load-extra-del-ext to files with a ".crt"
extension in the configuration.
Related to issue #785.
Should be backported where 8e8581e ("MINOR: ssl: 'ssl-load-extra-del-ext'
removes the certificate extension") was backported.
When dumping the stats page (or the CSV output), when many states are
mixed, it's hard to figure out the number of up servers. But when showing
only the "up" servers or hiding the "maint" servers, there's no way to
know how many servers are configured, which is problematic when trying
to update server-templates.
What this patch does, for dumps in "up" or "no-maint" modes, is to add
after the backend's "UP" or "DOWN" state a "(%d/%d)" indicating the number
of servers seen as UP out of the total number of servers in the backend. As
such, seeing "UP (33/39)" immediately tells that there are 6 servers that
are not listed when using "up", or will let the client figure how many
servers are left once deducted the number of non-maintenance ones. It's
not done on default dumps so as not to disturb existing tools, which
already have all the information they need in the dump.
"no-maint" is a bit similar to "up" except that it will only hide
servers that are in maintenance (or disabled in the configuration), and
not those that are enabled but failed a check. One benefit here is to
significantly reduce the output of the "show stat" command when using
large server-templates containing entries that are not yet provisioned.
Note that the prometheus exporter also has such an option which does
exactly the same.
We already had it on the HTTP interface but it was not accessible on the
CLI. It can be very convenient to hide servers which are down, do not
resolve, or are in maintenance.
Leastconn has the nice property of being able to sort servers by their
current usage. It's really a shame to force all requests into the backend
queue when the algo would be able to also consider their current queue.
In order not to change existing behavior but extend it, this patch allows
leastconn to elect servers which are already full if they have an explicitly
configured maxqueue setting above zero and their queue hasn't reached that
threshold. This will significantly reduce the pressure in the backend queue
when queuing a lot with lots of servers.
A test on 8 threads with 100 servers configured with maxconn 1 jumped
from 165krps to 330krps with maxqueue 15 with this patch.
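A minimal sketch of the extended eligibility test (structure and field
names assumed for illustration):

    struct srv_view {
        unsigned int cur_conns, maxconn;   /* current/max connections */
        unsigned int queued, maxqueue;     /* server queue state */
    };

    /* A full server remains electable by leastconn as long as it has
     * an explicit maxqueue and its own queue still has room. */
    static int srv_is_electable(const struct srv_view *s)
    {
        if (s->cur_conns < s->maxconn)
            return 1;                      /* not full: always usable */
        return s->maxqueue > 0 && s->queued < s->maxqueue;
    }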
This partially undoes commit 82cd5c13a ("OPTIM: backend: skip LB when we
know the backend is full") but allows to scale much better even by setting
a single-digit maxqueue value. Some better heuristics could be used to
maintain the behavior of the bypass in the patch above, consisting in
keeping it if it's known that there is no server with a configured
maxqueue in the farm (or in the backend).
When servers are queued into the leastconn tree, it's important to also
consider their queue length. There could be some servers with lots of
queued requests that we don't want to hammer with extra connections. In
order not to add extra stress to the LB algorithm, we don't update the
value when adding to the queue, only when updating the connection count
(i.e. picking from the queue or releasing a connection). This will be
sufficient to significantly improve the fairness in such situations.
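A hedged sketch of the key computation (names illustrative; the real
code scales by the server's effective weight):

    /* Fold the queue length into the sort key so heavily queued servers
     * sink in the leastconn tree; recomputed only when the connection
     * count changes, not on every enqueue. */
    static unsigned int fwlc_queue_key(unsigned int served,
                                       unsigned int queued,
                                       unsigned int eweight)
    {
        return (served + queued) * SRV_EWGHT_MAX / (eweight ? eweight : 1);
    }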
We don't need to do that inside the lock. However since the operation
used to be done in deep functions, we have to make it resurface closer
to visible parts. It remains reasonably self-contained in queue.c so
that's not that big of a deal. Some places (redistribute) could benefit
from a single operation for all counts at once. Others like
pendconn_process_next_strm() are still called with both locks held but
now it will be possible to change this.
Instead of incrementing, decrementing them and updating their max under
the lock, make them atomic and keep them out of the lock as much as
possible. For __pendconn_unlink_* it would be wise to move these
counters outside of the function, into the callers, so that a
single atomic op can be done per counter even for groups of operations.
Similarly to previous changes, we know if we're dealing with a server
or proxy lock so let's directly lock at the finest possible places
there. It's worth noting that a part of the operation consisting in
an increment and update of a max could be done outside of the lock
using atomic ops and a CAS.
The function is called with the lock held and does too many tests for
things that are already known from its callers. Let's split it in two
so that its callers call either the per-server or per-proxy function
depending on where the element is (since they had to determine it
prior to taking the lock).
No need to use an exclusive lock on the proxy anymore when reading its
setting, a read lock is enough. A few other places continue to use a
write-lock when modifying simple flags only in order to let this
function see a consistent value all along. This might be changed in
the future using barriers and local copies.
This is an anticipation of finer grained locking for the queues. For now
all lock places take a write lock so that there is no difference at all
with previous code.
In h2_send(), if we are in a state where we know it is no longer possible to
send data, we must exit the sending loop to avoid any possibility of looping
forever. It may happen if the mbuf ring is released while the H2_CF_MUX_MFULL
flag is still set. Here is a possible scenario to trigger the bug:
1) The mbuf ring is full because we are unable to send data. The
H2_CF_MUX_MFULL flag is set on the H2 connection.
2) At this stage, the task timeout expires because the H2 connection is
blocked. We enter in h2_timeout_task() function. Because the mbuf ring is
full, we cannot send the GOAWAY frame. Thus the H2_CF_GOAWAY_FAILED flag is
set. The H2 connection is not released yet because there is still a stream
attached. Here we leave h2_timeout_task() function.
3) A bit later, the H2 connection is woken up. In h2_process(), nothing is
performed by the first attempt to send data, in h2_send(). Then, because
the H2_CF_GOAWAY_FAILED flag is set, the mbuf ring is released. But the
H2_CF_MUX_MFULL flag is still there. At this step a second attempt to send
data is performed.
4) In h2_send(), we try to send data in a loop. To exit this loop, the done
variable must be set to 1. Because the H2_CF_MUX_MFULL flag is set, we
don't call h2_process_mux() and done is not updated. Because the mbuf ring
is now empty, nothing is sent and the H2_CF_MUX_MFULL flag is never
removed. Now, we loop forever... waiting for the watchdog.
To fix the bug, we now exit the loop if one of these conditions is true:
- The H2_CF_GOAWAY_FAILED flag is set on the H2 connection
- The CO_FL_SOCK_WR_SH flag is set on the underlying connection
- The H2 connection is in the H2_CS_ERROR2 state
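The added guard may be sketched like this (flag and state names are the
ones quoted above; the surrounding code is simplified):

    /* at the top of the h2_send() loop, before attempting to send */
    if ((h2c->flags & H2_CF_GOAWAY_FAILED) ||
        (h2c->conn->flags & CO_FL_SOCK_WR_SH) ||
        (h2c->st0 == H2_CS_ERROR2))
        break;   /* no progress is possible anymore, leave the loop */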
This patch should fix the issue #912 and most probably #875. It must be
backported as far as 1.8.
When an internal response is returned to a client, the message payload must be
skipped if it is a reply to a HEAD request. The payload is removed from the HTX
message just before the message forwarding.
This bug has been around for a long time. It was already there in the pre-HTX
versions. In legacy HTTP mode, internal errors are not parsed, so this bug
cannot be easily fixed there. Thus, this patch should only be backported to
all HTX versions, as far as 2.0. However, the code has significantly changed
in 2.2; in 2.1 and 2.0, the patch must be entirely reworked.
Partial support of conditional HTTP requests. This commit adds the
support of the 'If-None-Match' header (see RFC 7232#3.2).
When a client specifies a list of ETags through one or more
'If-None-Match' headers, they are all compared to the one that might have
been stored in the corresponding http cache entry until one of them
matches.
If a match happens, a specific "304 Not Modified" response is
sent instead of the cached data. This response has all the stored
headers but no other data (see RFC 7232#4.1). Otherwise, the whole cached data
is sent.
Although unlikely in a GET/HEAD request, the "If-None-Match: *" syntax is
valid and also receives a "304 Not Modified" response (RFC 7232#3.2).
This resolves a part of GitHub issue #821.
When sent by a server for a given resource, the ETag header is
stored in the corresponding cache entry (as any other header). So in
order to perform future ETag comparisons (for subsequent conditional
HTTP requests), we keep the length of the ETag and its offset
relative to the start of the cache_entry.
If no ETag header exists, the length and offset are zero.
Add a function that compares two ETags that might be of different types.
If either of them is weak, the 'W/' prefix is discarded and a strict string
comparison is performed.
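A self-contained sketch of such a comparison on NUL-terminated strings
(the real function works on length-delimited blocks):

    #include <string.h>

    /* Return non-zero when the two ETags match: if either side is weak,
     * its "W/" prefix is discarded before the strict comparison. */
    static int etag_match(const char *a, const char *b)
    {
        if (!strncmp(a, "W/", 2))
            a += 2;
        if (!strncmp(b, "W/", 2))
            b += 2;
        return !strcmp(a, b);
    }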
Co-authored-by: Tim Duesterhus <tim@bastelstu.be>
If the slowstart value in a state file implies the latest state change
is within the slowstart period, we end up calling srv_update_status()
to reschedule the server's state change but its task is not yet
allocated and remains null, causing a crash on startup.
Make sure srv_update_status() supports being called with partially
initialized servers which do not yet have a task. If the task has to
be scheduled, it will necessarily happen after initialization since
it will result from a state change.
This should be backported wherever server-state is present.
In commit 5cd4bbd7a ("BUG/MAJOR: threads/queue: Fix thread-safety issues
on the queues management") the counter of transferred connections was
accidentally lost, so that when a server goes down with connections in its
queue, it will always be reported that 0 connections were transferred.
This should be backported as far as 1.8 since the patch above was
backported there.
In issue #785, users are reporting that it's not convenient to load a
".crt.key" when the configuration contains a ".crt".
This option allows to remove the extension of the certificate before
trying to load any extra SSL file (.key, .ocsp, .sctl, .issuer etc.)
The patch slightly changes the way ssl_sock_load_files_into_ckch()
looks for the file.
safer to close the handle before the object is put back in the global pool.
this was introduced by commit 9378bbe0be ("MEDIUM: listener:
use protocol->accept_conn() to accept a connection")
this should fix github issue #902
no backport needed.
Signed-off-by: William Dauchy <wdauchy@gmail.com>
As previously discussed, nbproc usage is bad, deprecated, and scheduled
for removal in 2.5.
If "nbproc" is found with more than one process while nbthread is not
set, a warning will be emitted encouraging the user to remove it or to migrate
to nbthread instead. This makes sure the user has an opportunity to
both see the message and silence it.
This counter is only updated and never used, and in addition it's done
without any atomicity so it's very unlikely to be correct on multi-CPU
systems! Let's just remove it since it's not used.
It's a bit overkill to register an initcall to call a function to set
a lock to zero when not debugging, let's just declare the lock as
pre-initialized to zero.
When using a low hash-balance-factor value, it's possible to loop
many times trying to find the best server. Figures in the order of
100-300 times were observed for 1000 servers with a factor of 101
(which seems a bit excessive for such a large farm). Given that
there's nothing in that function that prevents multiple threads
from working in parallel, let's switch to a read lock. Tests on
8 threads show roughly a 2% performance increase with this.
The "first" algorithm creates a lot of contention because all threads
focus on the same server by definition (the first available one). By
turning the exclusive lock to a read lock in fas_get_next_server(),
the request rate increases by 16% for 8 threads when many servers are
getting close to their maxconn.
This function doesn't change the tree, it only looks for the first
usable server, so let's do that under a read lock to limit the
situations like the ones described in issue #881 where finding a
usable server when dealing with lots of saturated ones can be
expensive. At least threads will now be able to look up in
parallel.
It's interesting to note that s->served is not incremented during the
server choice, nor is the server repositioned. So right now already,
nothing prevents multiple threads from picking the same server. This
will not cause a significant imbalance anyway given that the server
will automatically be repositioned at the right place, but this might
be something to improve in the future if it doesn't come with too high
a cost.
It also looks like the way a server's weight is updated could be
revisited so that the write lock gets tighter at the expense of a
short part of inconsistency between weights and servers still present
in the tree.
- map_get_server_hash() doesn't need a write lock since it only
reads the array, let's only use a read lock here.
- map_get_server_rr() only needs exclusivity to adjust the rr_idx
while looking for its entry. Since this one is not used by
map_get_server_hash(), let's turn this lock to a seek lock that
doesn't block reads.
With 8 threads, no significant performance difference was noticed
given that lookups are usually instant with this LB algo so the
lock contention is rare.
It was previously a spinlock, and it happens that a number of LB algos
only lock it for lookups, without performing any modification. Let's
first turn it to an rwlock and w-lock it everywhere. This is strictly
identical.
It was carefully checked that every HA_SPIN_LOCK() was turned to
HA_RWLOCK_WRLOCK() and that HA_SPIN_UNLOCK() was turned to
HA_RWLOCK_WRUNLOCK() on this lock. _INIT and _DESTROY were updated too.
The server lock must be held when server_take_conn() and server_drop_conn()
lbprm callback functions are called. It is a documented prerequisite but it is
not always performed. It only affects leastconn and fas lb algorithm. Others
don't use these callback functions.
A race condition on the next pending effective weight (next_eweight) may be
encountered with the leastconn lb algorithm. An agent check may set it to 0
while fwlc_srv_reposition() is called. The server is locked during the
next_eweight update, but because the server lock is not acquired when
fwlc_srv_reposition() is called, we may use it to recompute the server key,
leading to a division by 0.
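The fix can be sketched as follows (call site simplified):

    /* hold the server lock around the lbprm callbacks so next_eweight
     * cannot be zeroed by an agent check while the key is recomputed */
    HA_SPIN_LOCK(SERVER_LOCK, &srv->lock);
    if (srv->proxy->lbprm.server_drop_conn)
        srv->proxy->lbprm.server_drop_conn(srv);
    HA_SPIN_UNLOCK(SERVER_LOCK, &srv->lock);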
This patch must be backported as far as 1.8.
It is not guaranteed that the backend connection has an owner. It is set when
the connection is created. But when the connection is moved in a server idle
list, the connection owner is set to NULL and may never be set again. On the
other hand, when a mux is created or when a CS is attached, the session is
always defined. The H1 stream always keeps a reference to it when it is
created. Thus, when a bad message is captured, we should not rely on the
connection owner to retrieve the session. Instead we should get it from the H1
stream.
If an agent tries to set a variable with the NULL data type, an unset is
performed instead to avoid undefined behaviors. Once decoded, such data are
translated to
a sample with the type SMP_T_ANY. It is unexpected in HAProxy. When a variable
is set with such sample, no data are attached to the variable. Thus, when the
variable is retrieved later in the transaction, the sample data are
uninitialized, leading to undefined behaviors depending on how it is used. For
instance, it leads to a crash if the debug converter is used on such variable.
This patch should fix the issue #855. It must be backported as far as 1.8.
Detect if the SNI used a constant value and, if so, allow the connection
to be reused for later sessions. Use a combination of SMP_USE_INTRN and
!SMP_F_VOLATILE to consider a sample as a constant value.
This feature has been requested in github issue #371.
During a peers session collision (two peer sessions opened on both sides), we
must mark the peer whose session will be shut down as alive, otherwise the
->reconnect timer will be set with a wrong value if the synchro task expires
after the peer has been reconnected. This possibly leads to unexpected
disconnections during handshakes.
Furthermore, this patch cancels any heartbeat transmission when a
reconnection is prepared.
Right now when running a configuration with many global timers (e.g. many
health checks), there is a lot of contention on the global wait queue
lock because all threads queue up in front of it to scan it.
With 2000 servers checked every 10 milliseconds (200k checks per second),
after 23 seconds running on 8 threads, the lock stats were this high:
Stats about Lock TASK_WQ:
  write lock  : 9872564
  write unlock: 9872564 (0)
  wait time for write     : 9208.409 msec
  wait time for write/lock:  932.727 nsec
  read lock   :  240367
  read unlock :  240367 (0)
  wait time for read      :  149.025 msec
  wait time for read/lock :  619.991 nsec
i.e. ~5% of the total runtime spent waiting on this specific lock.
With upgradable locks we don't need to work like this anymore. We
can just try to upgrade the read lock to a seek lock before scanning
the queue, then upgrade the seek lock to a write lock for each element
we want to delete there and immediately downgrade it to a seek lock.
The benefit is double:
- all other threads which need to call next_expired_task() before
polling won't wait anymore since the seek lock is compatible with
the read lock ;
- all other threads competing on trying to grab this lock will fail
on the upgrade attempt from read to seek, and will let the current
lock owner finish collecting expired entries.
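The resulting scan may be sketched like this (a condensed sketch; the
seek-lock transition macros are the ones this mechanism relies on, and
lookup_next_expired() is a hypothetical stand-in for the ebtree lookup):

    HA_RWLOCK_RDLOCK(TASK_WQ_LOCK, &wq_lock);
    if (HA_RWLOCK_TRYRDTOSK(TASK_WQ_LOCK, &wq_lock) != 0) {
        /* upgrade failed: someone else is collecting, don't pile up */
        HA_RWLOCK_RDUNLOCK(TASK_WQ_LOCK, &wq_lock);
        return;
    }
    while ((task = lookup_next_expired()) != NULL) {   /* hypothetical */
        HA_RWLOCK_SKTOWR(TASK_WQ_LOCK, &wq_lock);      /* seek -> write */
        __task_unlink_wq(task);
        HA_RWLOCK_WRTOSK(TASK_WQ_LOCK, &wq_lock);      /* write -> seek */
    }
    HA_RWLOCK_SKUNLOCK(TASK_WQ_LOCK, &wq_lock);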
Doing only this has reduced the wake_expired_tasks() CPU usage in a
very large servers test from 2.15% to 1.04% as reported by perf top,
and increased by 3% the health check rate (all threads being saturated).
This is expected to help against (and possibly solve) the problem
described in issue #875.
There is a theoretical problem in the wait queue, which is that with many
threads, one could spend a lot of time looping on the newly expired tasks,
causing a lot of contention on the global wq_lock and on the global
rq_lock. This initially sounds benign, but if another thread does just
a task_schedule() or task_queue(), it might end up waiting for a long
time on this lock, and this wait time will count on its execution budget,
degrading the end user's experience and possibly risking to trigger the
watchdog if that lasts too long.
The simplest (and backportable) solution here consists in bounding the
number of expired tasks that may be picked from the global wait queue at
once by a thread, given that all other ones will do it as well anyway.
We don't need to pick more than global.tune.runqueue_depth tasks at once
as we won't process more, so this counter is updated for both the local
and the global queues: threads with more local expired tasks will pick
less global tasks and conversely, keeping the load balanced between all
threads. This will guarantee a much lower latency if/when wakeup storms
happen (e.g. hundreds of thousands of synchronized health checks).
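The bounding itself is straightforward; a sketch with hypothetical
helper names (the tunable is the existing global.tune.runqueue_depth):

    /* each thread picks at most runqueue_depth expired tasks per call,
     * local and global queues combined, leaving the rest to others */
    int budget = global.tune.runqueue_depth;

    budget -= wake_local_expired(budget);     /* hypothetical helpers */
    if (budget > 0)
        wake_global_expired(budget);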
Note that some crashes have been witnessed with 1/4 of the threads in
wake_expired_tasks() and, while the issue might or might not be related,
not having reasonable bounds here definitely justifies why we can spend
so much time there.
This patch should be backported, probably as far as 2.0 (maybe with
some adaptations).
The proxy stopping mechanism was changed with commit 322b9b94e ("MEDIUM:
proxy: make stop_proxy() now use stop_listener()") so that it's now
entirely driven by the listeners. One thing was forgotten though, which
is that pure backends will not stop anymore since they don't have any
listener, and that it's necessary to stop them in order to stop the
health checks.
No backport is needed.
We don't need to specify the handler anymore since it's set in the
receiver. Let's remove this argument from the function and clean up
the remains of code that were still setting it.
Now we define a new sock_accept_iocb() for socket-based stream protocols
and use it as a wrapper for listener_accept() which now takes a listener
and not an FD anymore. This will allow the receiver's I/O cb to be
redefined during registration, and more specifically to get rid of the
hard-coded hacks in protocol_bind_all() made for syslog.
The previous ->accept() callback in the protocol was removed since it
doesn't have anything to do with accept() anymore but is more generic.
A few places where listener_accept() was compared against the FD's IO
callback for debugging purposes on the CLI were updated.
For now we're still using the protocol's default accept() function as
the I/O callback registered by the receiver into the poller. While
this is usable for most TCP connections where a listener is needed,
this is not suitable for UDP where a different handler is needed.
Let's make this configurable in the receiver just like the upper layer
is configurable for listeners. In order to ease stream protocols
handling, the protocols will now provide a default I/O callback
which will be preset into the receivers upon allocation so that
almost none of them has to deal with it.
The receiver FDs must not be manipulated by the listener_accept()
function anymore, it must exclusively rely on the job performed by
its listeners, as it is also the only way to keep the receivers
working for established connections regardless of the listener's
state (typically for multiplexed protocols like QUIC). This used
to be necessary when the FDs were adjusted at once only but now
that fd_done() is gone and the need for polling enabled by the
accept_conn() function which detects the EAGAIN, we have nothing
to do there to fixup any possible previous bad decision anymore.
Interestingly, as a side effect of making the code not depend on
the FD anymore, it also removes the need for a second lock, which
increases the accept rate by about 1% on 8 threads.
Now listener_accept() doesn't have to deal with the incoming FD anymore
(except for a little bit of side band stuff). It directly retrieves a
valid connection from the protocol layer, or receives a well-defined
error code that helps it decide how to proceed. This removes a lot of
hardly maintainable low-level code and opens the function to receive
new protocol stacks.
This is the same as previous commit, but this time for the sockpair-
specific stuff, relying on recv_fd_uxst() instead of accept(), so the
code is simpler. The various errno cases are handled like for regular
sockets, though some of them will probably never happen, but this does
not hurt.
The socket-specific accept() code in listener_accept() has nothing to
do there. Let's move it to sock.c where it can be significantly cleaned
up. It will now directly return an accepted connection and provide a
status code instead of letting listener_accept() deal with various errno
values. Note that this doesn't support the sockpair specific code.
The function is now responsible for dealing with its own receiver's
polling state and calling fd_cant_recv() when facing EAGAIN.
One tiny change from the previous implementation is that the connection's
sockaddr is now allocated before trying accept(), which saves a memcpy()
of the resulting address for each accept at the expense of a cheap
pool_alloc/pool_free on the final accept returning EAGAIN. This still
apparently slightly improves accept performance in microbenchmarks.
This call was introduced by commit 5ced3e887 ("MINOR: sock: add
sock_accept_conn() to test a listening socket") but is actually quite
confusing because it makes one think the socket will accept a connection
(which is what we want to have in a new function) while it only tells
whether it's configured to accept connections. Let's call it
sock_accepting_conn() instead.
The same change was applied to sockpair which had the same issue.
Now that this function is always called with an initialized connection
and that the control layer is always initialized, we don't need to play
games with fdtab[] to decide how to close, we can simply rely on the
regular close path using conn_ctrl_close(), which can be fused with
conn_xprt_close() into conn_full_close().
The code is cleaner because the FD is now used only for some
protocol-specific setup (that will eventually have to move) and to
try to send a hard-coded HTTP 500 error message on raw sockets.
Till now we would keep a per-thread queue of pending incoming connections
for which we would store:
- the listener
- the accepted FD
- the source address
- the source address' length
And these elements were first used in session_accept_fd() running on the
target thread to allocate a connection and duplicate them again. Doing
this induces various problems. The first one is that session_accept_fd()
may only run on file descriptors and cannot be reused for QUIC. The second
issue is that it induces lots of memory copies and that the listener
queue thrashes a lot of cache, consuming 64 bytes per entry.
This patch changes this by allocating the connection before queueing it,
and by only placing the connection's pointer into the queue. Indeed, the
first two calls used to initialize the connection already store all the
information above, which can be retrieved from the connection pointer
alone. So we just have to pop one pointer from the target thread, and
pass it to session_accept_fd() which only needs the FD for the final
settings.
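The effect on the per-thread ring can be sketched as follows (structure
names are illustrative):

    /* before: one ~64-byte record per pending accept:
     *         { listener, fd, source address, address length }
     * after:  a single 8-byte connection pointer; everything above is
     *         already stored in the connection during initialization */
    struct accept_queue_ring {
        struct connection *entry[1024];   /* grew from 256 entries */
        unsigned int head, tail;
    };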
This starts to make the accept path a bit more transport-agnostic, and
saves memory and CPU cycles at the same time (1% connection rate increase
was noticed with 4 threads). Thanks to dividing the accept-queue entry
size from 64 to 8 bytes, its size could be increased from 256 to 1024
connections while still dividing the overall size by two. No single
queue full condition was met.
One minor drawback is that a connection may be allocated from one thread's
pool to be used in another one. But this already happens a lot with
connection reuse so there is really nothing new here.
Roughly half of the calls to sockaddr_alloc() are made to copy an already
known address. Let's optionally pass it in argument so that the function
can handle the copy at the same time; this slightly simplifies its usage.
fd_done_recv() used to be useful with the FD cache because it used to
allow to keep a file descriptor active in the poller without being
marked as ready in the cache, saving it from ringing immediately,
without incurring any system call. It was a way to make it yield
to wait for new events leaving a bit of time for others. The only
user left was the connection accepter (listener_accept()). We used
to suspect that with the FD cache removal it had become totally
useless since changing its readiness or not wouldn't change its
status regarding the poller itself, which would be the only one
deciding to report it again.
Careful tests showed that it indeed has exactly zero effect nowadays,
the syscall numbers are exactly the same with and without, including
when enabling edge-triggered polling.
Given that there's no more API available to manipulate it and that it
was directly called as an optimization from listener_accept(), it's
about time to remove it.
No protocol defines it anymore. The last user used to be the monitor-net
stuff that got partially broken already when the tcp_drain() function
moved to conn_sock_drain() with commit e215bba95 ("MINOR: connection:
make conn_sock_drain() work for all socket families") in 1.9-dev2.
A part of this will surely move back later when non-socket connections
arrive with QUIC but better keep the API clean and implement what's
needed in time instead.
As discussed here during 2.1-dev, "monitor-net" is totally obsolete:
https://www.mail-archive.com/haproxy@formilux.org/msg35204.html
It's fundamentally incompatible with usage of SSL, and imposes the
presence of file descriptors with hard-coded syscalls directly in the
generic accept path.
It's very unlikely that anyone has used it in the last 10 years for
anything beyond testing. In the worst case if anyone would depend
on it, replacing it with "http-request return status 200 if ..." and
"mode http" would certainly do the trick.
The keyword is still detected as special by the config parser to help
users update their configurations appropriately.
As discussed here during 2.1-dev, "mode health" is totally obsolete:
https://www.mail-archive.com/haproxy@formilux.org/msg35204.html
It's fundamentally incompatible with usage of SSL, doesn't support
source filtering, and imposes the presence of file descriptors with
hard-coded syscalls directly in the generic accept path.
It's very unlikely that anyone has used it in the last 10 years for
anything beyond testing. In the worst case if anyone would depend
on it, replacing it with "http-request return status 200" and "mode
http" would certainly do the trick.
The keyword is still detected as special by the config parser to help
users update their configurations appropriately.
The FCGI mux is marked with HOL blocking. In safe reuse mode, connections
using it are placed on the sessions instead of the available lists, to
avoid sharing them with several clients. On detach, if there are no more
streams, remove the connection from the session before adding it to the
idle list. If there are still streams in use, do not add it to the available
list as it should already be on the session list.
The H2 mux is marked with HOL blocking. In safe reuse mode, connections
using it are placed on the sessions instead of the available lists, to
avoid sharing them with several clients. On detach, if there are no more
streams, remove the connection from the session before adding it to the
idle list. If there are still streams in use, do not add it to the available
list as it should already be on the session list.
If a connection is using a mux protocol subject to HOL blocking, add it
to the session instead of the available list to avoid sharing it with
other clients on connection reuse.
When allocating a new session in connect_server(), if the mux protocol is
marked as subject to HOL blocking, add the connection to the session instead
of the available list to avoid sharing it with other clients.
On server connection migration from one thread to another, the wrong
idle thread-specific counter is decremented. This bug was introduced
by commit 3d52f0f1f8 due to the factorization with srv_use_idle_conn.
However, this statement is only executed from conn_backend_get. Extract
the decrement from srv_use_idle_conn into conn_backend_get and use the
correct thread-specific counter.
Rename the function to srv_use_conn to better reflect its purpose as it
is also used with a newly initialized connection not in the idle list.
As a side change, the connection insertion to available list has also
been extracted to conn_backend_get. This will be useful to be able to
specify an alternative list for protocols subject to HOL risk that should
not be shared between several clients.
This bug is only present in this release and thus does not need a backport.
The loop always missed one iteration due to the incrementation done in
the for-loop condition. Move the incrementation to the loop's last
statement to fix this behaviour.
This bug has a very limited impact, not at all visible to the user, but
could be backported to 2.2.
When running a pure config check (haproxy -c) we go through the deinit
phase without having allocated fdtab, so we can't blindly dereference
it. The issue was added by recent commit ae7bc4a23 ("MEDIUM: deinit:
close all receivers/listeners before scanning proxies"), no backport is
needed.
In issue #894, Coverity suspects uninitialized values for a socket's
address whose family is AF_UNSPEC but it doesn't know that the address
is not used in this case. It's not on a critical path and working around
it is trivial, let's fully declare the address. We're doing it for both
TCP and UDP, because the same principle appears at two places.
It may happen that during a temporary listener pause resulting from a
SIGTTOU, one process gets one of its sockets disabled by another process
and will not be able to recover from this situation by itself. For the
protocols supporting this (TCPv4 and TCPv6 at the moment) this situation
is detectable, so when this happens, let's put the listener into the
PAUSED state so that it remains consistent with the real socket state.
One nice effect is that just sending the SIGTTIN signal to the process
is enough to recover the socket in this case.
There is no need to backport this, this behavior has been there forever
and the fix requires to reimplement the getsockopt() call there.
For socket pairs we don't rely on a real listening socket but we need to
have a properly connected UNIX stream socket. This is what the new
sockpair_accept_conn() tries to report. Some corner cases like half
shutdown will still not be detected but that should be sufficient for
most cases we really care about.
Now we introduce a new .rx_listening() function to report if a receiver is
actually a listening socket. The reason for this is to help detect shared
sockets that might have been broken by sibling processes.
At several places we need to check if a socket is still valid and still
willing to accept connections. Instead of open-coding this each time,
let's add a new function for this.
Currently the suspend/resume mechanism for listeners only works on Linux
and we resort to a number of tricks involving shutdown+listen+shutdown
to try to detect failures on other operating systems that do not support
it. But on Linux connect(AF_UNSPEC) also works pretty well and is much
cleaner. It still doesn't work on other operating systems but the error
is easier to detect and appears safer. So let's switch to this.
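The underlying operation is a plain connect() with an AF_UNSPEC address,
sketched below (the function name is illustrative):

    #include <string.h>
    #include <sys/socket.h>

    /* Dissolve the socket's association by connecting it to AF_UNSPEC;
     * per connect(2) this is defined for datagram sockets and, as
     * described above, also works on Linux for this use case. */
    static int sock_drop_association(int fd)
    {
        struct sockaddr sa;

        memset(&sa, 0, sizeof(sa));
        sa.sa_family = AF_UNSPEC;
        return connect(fd, &sa, sizeof(sa));  /* < 0 with errno on failure */
    }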
When starting with a huge maxconn (say 1 billion), the only error seen
is "No polling mechanism available". This doesn't help at all to resolve
the problem. Let's add specific alerts for the failed mallocs. Now we can
get this instead:
[ALERT] 286/154439 (23408) : Not enough memory to allocate 2000000033 entries for fdtab!
This may be backported as far as 2.0 as it helps debugging bad configurations.
There are reports of a few "SC" in logs during reloads when H2 is used
on the backend side. Christopher analysed this as being caused by the
proxy disabled test in h2_process(). As the comment says, this was done
for frontends only, and must absolutely not send a GOAWAY to the backend,
as all it will result in is to make newly queued streams fail.
The fix consists in simply testing the connection side before deciding
to send the GOAWAY.
This may be backported as far as 2.0, though for whatever reason it seems
to manifest itself only since 2.2 (probably due to changes in the outgoing
connection setup sequence).
On some operating systems, RLIM_INFINITY is set to -1 so that when the
hard limit on the number of FDs is set to unlimited, taking the MAX
of both values keeps rlim_fd_cur and everything works. But on other
systems this value is defined as the highest positive integer. This
is what was observed on a 32-bit AIX 5.1. The effect is that maxsock
becomes 2^31-1 and that fdtab allocation fails.
Note that a simple workaround consists in manually setting maxconn in
the global section.
Let's ignore unlimited as soon as we retrieve rlim_fd_max so that all
systems behave consistently.
This may be backported as far as 2.0, though it doesn't seem like it
has annoyed anyone.
This patch adds a new "coll" counter and the heartbeat timer values to the
"show peers" command. It also adds the elapsed time since the last handshake
to a new "last_hdshk" peer dump field.
When factoring out the pause/resume error messages in commit 775e00158
("MAJOR: signals: use protocol_pause_all() and protocol_resume_all()")
I forgot that ha_warning() and send_log() take a format string and not
just a const string. No backport is needed, this is 2.3-dev.
This one was scheduled for removal in 2.3 since 2.2-dev3 by commit
1b85785bc ("MINOR: config: mark global.debug as deprecated"). Let's
remove it now. It remains totally possible to use -d on the command
line though.
This was introduced 15 years ago or so to delay the stopping of some
services so that a monitoring device could detect its port being down
before services were stopped. Since then, clean reloads were implemented
and this doesn't cope well with reload at all, preventing the new process
from seamlessly binding, and forcing processes to coexist with half-baked
configurations.
Now it has become a real problem because there's a significant code
portion in the proxies that is solely dedicated to this obsolete feature,
and dealing with its special cases eases the introduction of bugs in
other places so it's about time that it goes.
We could tentatively schedule its removal for 2.4 with a hard deadline
for 2.5 in any case.
Now we have ->suspend() and ->resume() for listeners at the protocol
level. This means that it now becomes possible for a protocol to redefine
its own way to suspend and resume. The default functions are provided for
TCP, UDP and unix, and they are pass-through to the receiver equivalent
as it used to be till now. Nothing was defined for sockpair since it does
not need to suspend/resume during reloads, hence it will succeed.
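For illustration, the new hooks may look like this (a simplified sketch;
field and helper names are assumed from the description):

    struct protocol {
        /* ... */
        int (*suspend)(struct listener *l);   /* pass-through to the */
        int (*resume)(struct listener *l);    /* receiver by default */
    };

    /* default implementation for TCP/UDP/unix: delegate to the receiver */
    static int default_suspend(struct listener *l)
    {
        return rx_suspend(&l->rx);            /* hypothetical helper */
    }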
The inner part now goes into the protocol and is used to decide how to
unbind a given protocol's listener. The existing code which is able to
also unbind the receiver was provided as a default function that we
currently use everywhere. Some complex listeners like QUIC will use this
to decide how to unbind without impacting existing connections, possibly
by setting up other incoming paths for the traffic.
This is used as a generic way to unbind a receiver at the end of
do_unbind_listener(). This allows to considerably simplify that function
since we can now let the protocol perform the cleanup. The generic code
was moved to sock.c, along with the conditional rx_disable() call. Now
the code also supports that the ->disable() function of the protocol
which acts on the listener performs the close itself and adjusts the
RX_F_BOUND flag accordingly.
This listener flag indicates whether the receiver part of the listener
is specific to the master or to the workers. In practice it's only used
by the master's CLI right now. It's used to know whether or not the FD
must be closed before forking the workers. For this reason it's way more
of a receiver's property than a listener's property, so let's move it
there under the name RX_F_MWORKER. The rest of the code remains
unchanged.
And also remove it from its callers. This subtle distinction was added as
sort of a hack for the seamless reload feature but is not needed anymore
since the do_close argument became unused with the previous commit ("MEDIUM:
listener: let do_unbind_listener() decide whether to close or not").
This also removes the unbind_listener_no_close() function.
The listener contains all the information needed to decide to close on
unbind or not. The rule is the following (when we're not stopping):
- worker process unbinding from a worker's FD with socket transfer enabled => keep
- master process unbinding from a master's inherited FD => keep
- master process unbinding from a master's FD => close
- master process unbinding from a worker's FD => close
- worker process unbinding from a master's FD => close
- worker process unbinding from a worker's FD => close
Let's translate that into the function and stop using the do_close
argument that is a bit obscure for callers. It was not yet removed
to ease code testing.
BROKEN: the failure rate on reg-tests/seamless-reload/abns_socket.vtc has
significantly increased for no obvious reason. It fails 99% of the time vs
10% before.
do_unbind_listener() is not logical and is not even idempotent. It must
not touch the fd if already -1, which also means not touch the receiver.
In addition, when performing a partial stop on a socket (not closing),
we know the socket remains in the listening state yet it's marked as
LI_ASSIGNED, which is confusing as it doesn't translate its real state.
With this change, we make sure that FDs marked for close end up in
ASSIGNED state and that those which are really bound and on which a
listen() was made (i.e. not pause) remain in LISTEN state. This is what
is closest to reality.
Ideally this function should become a default proto->unbind() one but
it may still keep a bit too much state logic to become generalized to
other protocols (e.g. QUIC).
Till now, in enable_listener(), we used to start all enabled listeners
and then kill from the workers those that were meant for the master. But
this is incomplete. We must also close from the master the listeners
that are solely for workers, and do it before we even start them.
Otherwise we end up with the master responding to worker CLI connections
if the listener remains in listen mode, now that this state reflects the
socket's real state.
It doesn't seem like it could have caused bugs in the past because we
used to aggressively mark disabled listeners as LI_ASSIGNED despite
the fact that they were still bound and listening. If this patch were
ever seen as a candidate solution for any obscure bug, be careful in
that it subtly relies on the fact that fd_delete() doesn't close
inherited FDs anymore, otherwise that could break the master's ability
to pass inherited FDs on reloads.
In Linux kernel's net/ipv4/udp.c there's a udp_disconnect() function
which is called when connecting to AF_UNSPEC, and which unhashes a
"connection". This property, which is also documented in connect(2)
both in Linux and Open Group's man pages for datagrams, is interesting
because it allows to reverse a connect() which is in fact a filter on
the source. As such we can suspend a receiver by making it connect to
itself, which will cause it not to receive any traffic anymore, letting
a new one receive it all, then resume it by breaking this connection.
This was tested to work well on Linux, other operating systems should
also be tested. Before this, sending a SIGTTOU to a process having a
UDP syslog forwarder would cause this error:
[WARNING] 280/194249 (3268) : Paused frontend GLOBAL.
[WARNING] 280/194249 (3268) : Some proxies refused to pause, performing soft stop now.
[WARNING] 280/194249 (3268) : Proxy GLOBAL stopped (cumulated conns: FE: 0, BE: 0).
[WARNING] 280/194249 (3268) : Proxy sylog-loadb stopped (cumulated conns: FE: 0, BE: 0).
With this change, it now proceeds just like with TCP listeners:
[WARNING] 280/195503 (3885) : Paused frontend GLOBAL.
[WARNING] 280/195503 (3885) : Paused frontend sylog-loadb.
And SIGTTIN also works:
[WARNING] 280/195507 (3885) : Resumed frontend GLOBAL.
[WARNING] 280/195507 (3885) : Resumed frontend sylog-loadb.
On Linux this also works with TCP listeners (which can then be resumed
using listen()) and established TCP sockets (which we currently kill
using setsockopt(so_linger)), both not being portable on other OSes.
UNIX sockets and ABNS sockets do not support it however (connect
always fails). This needs to be further explored to see if other OSes
might benefit from this to perform portable and reliable resets
particularly on the backend side.
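To make the trick concrete, here is a minimal standalone sketch of the
two operations on an already bound UDP socket, following the Linux
semantics described above (error handling reduced to the bare minimum):

    #include <string.h>
    #include <sys/socket.h>

    /* suspend: connect the socket to its own address so that it stops
     * matching generic incoming traffic, letting a new socket take over */
    static int udp_suspend(int fd)
    {
        struct sockaddr_storage ss;
        socklen_t len = sizeof(ss);

        if (getsockname(fd, (struct sockaddr *)&ss, &len) < 0)
            return -1;
        return connect(fd, (struct sockaddr *)&ss, len);
    }

    /* resume: connecting to AF_UNSPEC makes the kernel unhash the
     * "connection" (udp_disconnect), restoring normal reception */
    static int udp_resume(int fd)
    {
        struct sockaddr sa;

        memset(&sa, 0, sizeof(sa));
        sa.sa_family = AF_UNSPEC;
        return connect(fd, &sa, sizeof(sa));
    }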
One difficulty in soft-stopping is to make sure not to forget unlisted
listeners. By first doing a pass using protocol_stop_now() we catch the
vast majority of them. The few remaining ones are the ones belonging to
a proxy having a grace period. For these ones, the proxy will arm its
stop_time timer and emit a log message.
Since neither UDP listeners nor peers use the grace period, we can already
get rid of the special cases there since we know they will have been stopped
by the protocols.
This will instantly stop all listeners except those which belong to
a proxy configured with a grace time. This means that UDP listeners
and peers will also be stopped when called this way.
There are multiple ways a proxy may switch to the disabled state,
but now it's essentially once it loses its last listener. Instead
of keeping duplicate code around and reporting the state change
before actually seeing it, we now report it at the moment it's
performed (from the last listener leaving) which allows to remove
the message from all other places.
For now we cannot easily distinguish a peers frontend from another one,
which is problematic when trying to avoid reporting them while stopping
their listeners. Let's add PR_MODE_PEERS for this. It's not supposed to cause
any issue since all non-HTTP proxies are handled similarly now.
This function will be used to definitely stop a listener (e.g. during a
soft_stop). This is actually tricky because it may be called for a proxy
or for a protocol, both of which require locks and already hold some. The
function takes booleans indicating which ones are already held, hoping
this will be enough. It's not well defined whether proto->disable() and
proto->rx_disable() are supposed to be called with any lock held, and
they are used from do_unbind_listener() with all these locks. Some back
annotations ought to be added on this point.
The proxy's listeners count is updated, and the proxy is marked as
disabled and woken up after the last one is gone. Note that a
listener in listen state is already not attached anymore since it
was disabled.
We have to count unstoppable jobs which correspond to worker sockpairs, in
order to know when to stop. However the way it's currently done is quite
awkward because these are counted when stopping making the stop mechanism
non-idempotent. This is definitely something we want to fix before stopping
by protocol or our listeners count will quickly go wrong. Now they are
counted when the listeners are created.
We'll need an already locked variant of this function so let's make
__delete_listener() which will be called with the protocol lock held
and the listener's lock held.
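The resulting split would plausibly look like this (a sketch; the lock
class names are assumptions based on how they are referenced in this
series):

    void delete_listener(struct listener *listener)
    {
        HA_SPIN_LOCK(PROTO_LOCK, &proto_lock);
        HA_SPIN_LOCK(LISTENER_LOCK, &listener->lock);
        __delete_listener(listener);  /* expects both locks to be held */
        HA_SPIN_UNLOCK(LISTENER_LOCK, &listener->lock);
        HA_SPIN_UNLOCK(PROTO_LOCK, &proto_lock);
    }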
At each place we used to manipulate the FDs directly we can now call
the listener protocol's enable/disable/rx_enable/rx_disable depending
on whether the state changes on the listener or the receiver. One
exception currently remains in listener_accept() which is a bit special
and which should be split into 2 or 3 parts in the various protocol
layers.
The test of fd_updt in do_unbind_listener() that was added by commit
a51885621 ("BUG/MEDIUM: listeners: Don't call fd_stop_recv() if fd_updt
is NULL.") could finally be removed since that part is correctly handled
in the low-level disable() function.
One disable() was added in resume_listener() before switching to LI_FULL
because rx_resume() enables polling on the FD for the receiver while
we want to disable it if the listener is full. There are different
ways to clean this up in the future. One of them could be to consider
that TCP receivers only act at the listener level. But in fact that does
not reflect reality. The reality is that only the receiver is paused
and that the listener's state ought not be affected here. Ultimately
the resume_listener() function should be split so that the part
controlled by the protocols only acts on the receiver, and that the
receiver itself notifies the upper listener about the change so that
the listener protocol may decide to disable or enable polling. Conversely
the listener should automatically update its receiver when they share the
same state. Since there is no harm proceeding like this, let's keep this
for now.
These methods will be used to enable/disable accepting new connections
so that listeners do not play with FD directly anymore. Since all the
currently supported protocols work on socket for now, these are identical
to the rx_enable/rx_disable functions. However they were not defined in
sock.c since it's likely that some will quickly start to differ. At the
moment they're not used.
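Assuming they indeed mirror the rx_ functions, the socket-based defaults
would be no more than polling control on the FD; a hedged sketch with
hypothetical function names (the early-boot fd_updt precaution described
below still applies):

    static void sock_accept_enable(struct listener *l)
    {
        fd_want_recv(l->rx.fd);   /* be notified of incoming connections */
    }

    static void sock_accept_disable(struct listener *l)
    {
        fd_stop_recv(l->rx.fd);   /* stop accept() notifications */
    }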
We have to take care of fd_updt before calling fd_{want,stop}_recv()
because it's allocated fairly late in the boot process and some such
functions may be called very early (e.g. to stop a disabled frontend's
listeners).
These methods will be used to enable/disable rx at the receiver level so
that callers don't play with FDs directly anymore. All our protocols use
the generic ones from sock.c at the moment. For now they're not used.
These will be used on receivers, to enable or disable receiving on a
listener, which most of the time just consists in enabling/disabling
the file descriptor.
We have to take care of the existence of fd_updt to know if we may
or not call fd_{want,stop}_recv() since it's not permitted in very
early boot.
Instead of calling listen() for IPPROTO_TCP in resume_listener(), let's
call the protocol's ->rx_resume() method when defined, which does the same.
This removes another hard-dependency on the fd and underlying protocol
from the generic functions.
This one undoes ->rx_suspend(), it tries to restore an operational socket.
It was only implemented for TCP since it's the only one we support right
now.
The ->pause method is inappropriate since it doesn't exactly "pause" a
listener but rather temporarily disables it so that it's not visible at
all to let another process take its place. The term "suspend" is more
suitable, since the "pause" is actually what we'll need to apply to the
FULL and LIMITED states which really need to make a pause in the accept
process. And it goes well with the use of the "resume" function that
will also need to be made per-protocol.
Let's rename the function and make it act on the receiver since it's
already what it essentially does, hence the "rx_" prefix to make it
more explicit.
The protocol struct was a bit reordered because it was becoming a real
mess between the parts related to the listeners and those for the
receivers.
Since the listeners were split into receiver+listener, this field ought
to have been renamed because it's confusing. It really links receivers
and not listeners, as most of the time it's used via rx.proto_list!
The nb_listeners field was updated accordingly.
protocol_enable_all() calls proto->enable_all() for all protocols,
which is always equal to enable_all_listeners() which in turn simply is
a generic loop calling enable_listener() always returning ERR_NONE. Let's
clean this madness by first calling enable_listener() directly from
protocol_enable_all().
These ones have never been called, they were referenced by the protocol's
disable_all for some protocols but there are no traces of their use, so
in addition to not being sure the code works, it has never been tested.
Let's remove a bit of complexity starting from there.
fd_stop_recv() has nothing to do in the generic listener code, it's per
protocol as some don't need it. For instance with abns@ it could even
lead to fd_stop_recv(-1). And later with QUIC we don't want to touch
the fd at all! Since commit f2cb169487 delegated fd manipulation to
their respective threads, it used not to be possible to call it down
there, but that's not the case anymore, so let's perform the action in
the protocol-specific code.
By using the same "ret" variable in the "if" block to test the return
value of pause(), the second one shadows the first one and when forcing
the result to zero in case of an error, it doesn't do anything. The
problem is that some listeners used to fail to pause in multi-process
mode and this was not reported, but their failure was automatically
resolved by the last process to pause. By properly checking for errors
we might now possibly report a race once in a while so we may have to
roll this back later if some users meet it.
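The shadowing pattern in question looks like this (illustrative names,
not the exact code):

    int ret = 1;                    /* overall result of the operation */

    if (px->li_ready) {
        int ret = pause(l);         /* BUG: shadows the outer "ret" */
        if (ret <= 0)
            ret = 0;                /* only resets the inner copy */
    }
    return ret;                     /* errors above are never seen here */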
The test on ==0 is wrong too since technically speaking a total stop
validates the need for a pause, but stops the listener so it's just
the resume that won't work anymore. We could switch to stopped but
it's an involuntary switch and the user will not know. Better then
mark it as paused and let the resume continue to fail so that only
the resume will eventually report an error (e.g. abns@).
This must not be backported as there is a risk of side effect by fixing
this bug, given that it hides other bugs itself.
In multi-process, the TCP pause is very brittle and we never noticed
it because the error was lost in the upper layers. The problem is that
shutdown() may fail if another process already did it, and will cause
a process to fail to pause.
What we do here in case of error is that we double-check the socket's
state to verify if it's still accepting connections, and if not, we
can conclude that another process already did the job in parallel.
The difficulty here is that we're trying to eliminate false positives
where some OSes will silently report a success on shutdown() while they
don't shut the socket down, hence this dance of shutw/listen/shutr that
only keeps the compatible ones. Probably that a new approach relying on
connect(AF_UNSPEC) would provide better results.
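One plausible reading of this recovery, reduced to a sketch (the
fd_accepting() probe is hypothetical, presumably built from the
shutw/listen/shutr sequence mentioned above):

    ret = shutdown(fd, SHUT_WR);        /* the pause itself */
    if (ret != 0 && !fd_accepting(fd))  /* no longer accepting anyway? */
        ret = 0;                        /* another process already paused it */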
When temporarily pausing the listeners with SIG_TTOU, we now pause
all listeners via the protocols instead of the proxies. This has the
benefits that listeners are paused regardless of whether or not they
belong to a visible proxy. And for resuming via SIG_TTIN we do the
same, which allows to report binding conflicts and address them,
since the operation can be repeated on a per-listener basis instead
of a per-proxy basis.
While in appearance all cases were properly handled, it's impossible
to completely rule out the possibility that something broken used to
work by luck due to the scan ordering which is naturally different,
hence the major tag.
These two functions are used to pause and resume all listeners of
all protocols. They use the standard listener functions for this
so they're supposed to handle the situation gracefully regardless
of the upper proxies' states, and they will report completion on
proxies once the switch is performed.
It might be nice to define a particular "failed" state for listeners
that cannot resume and to count them on proxies in order to mention
that they're definitely stuck. On the other hand, the current
situation is retryable which is quite appreciable as well.
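The shape of the walk is roughly the following (a sketch; the real
functions also deal with locking and with the per-proxy completion
reports mentioned above):

    int protocol_pause_all(void)
    {
        struct protocol *proto;
        struct listener *l;
        int ok = 1;

        list_for_each_entry(proto, &protocols, list)
            list_for_each_entry(l, &proto->receivers, rx.proto_list)
                if (!pause_listener(l))
                    ok = 0;  /* remember the failure, keep pausing the rest */
        return ok;
    }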
Till now, we used to call pause_proxy()/resume_proxy() to enable/disable
processing on a proxy, which is used during soft reloads. But since we want
to drive this process from the listeners themselves, we have to instead
proceed the other way around so that when we enable/disable a listener,
it checks if it changed anything for the proxy and notifies about updates
at this level.
The detection is made using li_ready=0 for pause(), and li_paused=0
for resume(). Note that we must not include any test for li_bound because
this state is seen by processes which share the listener with another one
and which must not act on it since the other process will do it. As such
the socket behind the FD will automatically be paused and resumed without
its local state changing, but this is the limit of a multi-process system
with shared listeners.
It's quite confusing to have the test on LI_READY very low in the function
as it should be made much earlier. Just like with the previous commit, let's
do it when entering. The additional states, however (limited, full) continue
to go through the whole function.
It's better not to try to perform pause() actions on wrong states, so
let's check this and make sure that all callers are now safe. This
means that we must not try to pause a listener which is already paused
(e.g. it could possibly fail if the pause operation isn't idempotent at
the socket level), nor should we try it on earlier states.
The two functions don't need to be distinguished anymore since they have
all the necessary info to act as needed on their listeners. Let's just
pass via stop_proxy() and make it check for each listener which one to
close or not.
Its sole remaining purpose was to display "proxy foo started", which
has little benefit and pollutes output for those with plenty of proxies.
Let's remove it now.
The VTCs were updated to reflect this, because many of them had explicit
counts of dropped lines to match this message.
This is tagged as MEDIUM because some users may be surprised by the
loss of this quite old message.
The remaining proxy states were only used to distinguish an enabled
proxy from a disabled one. Due to the initialization order, both
PR_STNEW and PR_STREADY were equivalent after startup, and they
would only differ from PR_STSTOPPED when the proxy is disabled or
shutdown (which is effectively another way to disable it).
Now we just have a "disabled" field which allows to distinguish them.
It's becoming obvious that start_proxies() is only used to print a
greeting message now, which we'd rather get rid of. Probably
zombify_proxy() and stop_proxy() should be merged once their
differences move to the right place.
The enabled/disabled config options were stored into a "state" field
that is an integer but contained only PR_STNEW or PR_STSTOPPED, which
is a bit confusing, and causes a dependency with proxies. This was
renamed to "disabled" and is used as a boolean. The field was also
moved to the end of the struct to stop creating a hole and fill another
one.
Instead of looking at listeners in proxies in PR_STNEW state, we'd
rather check for listeners in those not in PR_STSTOPPED as it's only
this state which indicates the proxy was disabled. And let's check
the listeners count instead of testing the list's head.
This state was used to mention that a proxy was in PAUSED state, as opposed
to the READY state. This was causing some trouble because if a listener
failed to resume (e.g. because its port was temporarily in use during the
resume), it was not possible to retry the operation later. Now by checking
the number of READY or PAUSED listeners instead, we can accurately know if
something went bad and try to fix it again later. The case of the temporary
port conflict during resume now works well:
$ socat readline /tmp/sock1
prompt
> disable frontend testme3
> disable frontend testme3
All sockets are already disabled.
> enable frontend testme3
Failed to resume frontend, check logs for precise cause (port conflict?).
> enable frontend testme3
> enable frontend testme3
All sockets are already enabled.
This state is only set when a pause() fails but isn't even set when a
resume() fails. And we cannot recover from this state. Instead, let's
just count remaining ready listeners to decide to emit an error or not.
It's more accurate and will better support new attempts if needed.
Since v1.4 or so, it's almost not possible anymore to set this state. The
only exception is by using the CLI to change a frontend's maxconn setting
below its current usage. This case makes no sense, and for other cases it
doesn't make sense either because "full" is a vague concept when only
certain listeners are full and not all. Let's just remove this unused
state and make it clear that it's not reported. The "ready" or "open"
states will continue to be reported without being misleading as they
will be opposed to "stop".
The proxy state tries to be synthetic but that doesn't work well with
many listeners, especially for transition phases or after a failed
pause/resume.
In order to address this, we'll instead rely on counters of listeners in
a given state for the 3 major states (ready, paused, listen) and a total
counter. We'll now be able to determine a proxy's state by comparing these
counters only.
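With such counters, deriving a displayable state becomes a simple
comparison; a hedged sketch (counter names follow this series, "li_all"
standing for the total counter):

    const char *proxy_state_str(const struct proxy *px)
    {
        if (px->disabled)
            return "STOP";
        if (px->li_ready == px->li_all)
            return "OPEN";    /* all listeners accept traffic */
        if (px->li_ready)
            return "PART";    /* some ready, some not */
        if (px->li_paused)
            return "PAUSED";
        return "LISTEN";      /* bound but not accepting yet */
    }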
This function is used as a wrapper to set a listener's state everywhere.
We'll use it later to maintain some counters in a consistent state when
switching state, so it's essential that all state changes go through it.
No functional change was made beyond calling the wrapper.
This thing was needed for an optimization used in soft_stop() which
doesn't exist anymore, so let's remove it as it's cryptic and hinders
the listeners cleanup.
The loop doesn't match anymore since the non-started listeners are in
LI_INIT and even if it had ever worked the benefit of closing zombies
at this point looks void at best.
The zombie state is not used anymore by the listeners, because in the
last two cases where it was tested it couldn't match as it was covered
by the test on the process mask. Instead now the FD is either in the
LISTEN state or the INIT state. This also avoids forcing the listener
to be single-dimensional because actually belonging to another process
isn't totally exclusive with the other states, which explains some of
the difficulties requiring to check the proc_mask and the fd sometimes.
So let's get rid of it now not to be tempted to reuse it.
The doc on the listeners state was updated.
Because of the zombie state, proxies have a skewed vision of the state
of listeners, which explains why there are hacks switching the state
from ZOMBIE to INIT in the proxy cleaning loop. This is particularly
complicated and not needed, as all the information is now available
in the protocol list and the fdtab.
What we do here instead is to first close all active listeners or
receivers by protocol and clean their protocol parts. Then we scan the
fdtab to get rid of remaining ones that were necessarily in INIT state
after a previous invocation of delete_listener(). From this point, we
know the listeners are cleaned, they can safely be freed by scanning the
proxies.
The ZOMBIE state on listener is a real mess. Listeners passing through
this state have lost their consistency with the proxy AND with the fdtab.
Plus this state is not used for all foreign listeners, only for those
belonging to a proxy that entirely runs on another process, otherwise it
stays in INIT state, which makes the usefulness extremely questionable.
But the real issue is that it's impossible to untangle the receivers
from the proxy state as long as we have this because of deinit()...
So what we do here is to start by making unbind_listener() support being
called more than once. This will permit to call it again to really close
the FD and finish the operations if it's called with an FD that's in a
fake state (such as INIT but with a valid fd).
During the startup process we don't have any fdtab nor fd_updt for quite
a long time, and as such some operations on the listeners are not
permitted, such as fd_want_*/fd_stop_* or fd_delete(). The latter is of
particular concern because it's used when stopping a disabled frontend,
and it's performed very early during check_config_validity() while there
is no fdtab yet. The trick till now relies on the listener's state which
is a bit brittle.
There is absolutely no valid reason for stopping a proxy's listeners this
early, we can postpone it after init_pollers() which will at least have
allocated fdtab.
During 2.1 development, commit f2cb16948 ("BUG/MAJOR: listener: fix
thread safety in resume_listener()") was introduced to bounce the
enabling/disabling of a listener's FD to one of its threads because
the remains of fd_update_cache() were fundamentally incompatible with
the need to call fd_want_recv() or fd_stop_recv() for another thread.
However since then we've totally dropped such code and it's totally
safe to use these functions on an FD that is solely used by another
thread (this is even used by the FD migration code). The only remaining
limitation concerning the wake up delay was addressed by previous commit
"MEDIUM: fd: always wake up one thread when enabling a foreing FD".
The current situation forces the FD management to remain in the
pause_listener() and resume_listener() functions just so that it can
bounce between threads, without having the ability to delegate it to
the suitable protocol layer.
So let's first remove this now unneeded workaround.
Since 2.2 it's safe to enable/disable another thread's FD but the fd_wake
calls will not immediately be considered because nothing wakes the other
threads up. This will have an impact on listeners when deciding to resume
them after they were paused, so at minima we want to wake up one of their
threads, just like the scheduler does on task_kill(). This is what this
patch does.
204 and 304 HTTP responses must not contain a message body. These status codes
are correctly handled when the responses are received from a server. But there
is no specific processing for internal HTTP responses (errorfiles and http
replies). Now, when an errorfile or an http reply is parsed during
configuration parsing, an error is triggered if a 204/304 message contains a
body. An extra check is also performed to ensure the body length matches the
announced content-length.
This patch should fix the issue #891. It must be backported as far as 2.0. For
2.1 and 2.0, only the http_str_to_htx() function must be fixed. The
http_parse_http_reply() function does not exist there.
96 bytes are announced in the C-L header for a message body of 97 bytes. This
bug was introduced by the patch 46a030cdd ("CLEANUP: assorted typo fixes in the
code and comments").
This patch must be backported to all versions containing the patch above
(2.2 for now).
This patch is similar to the previous one on the fcgi; the same issue exists
for the H2. But there the bug is far harder to trigger because of the
protocol's flow, though it may explain strange aborts in some edge cases.
A read0 received on the connection must not be handled too early by H2 streams.
If the demux buffer is not empty, the pending read0 must not be considered. The
H2 streams must not be switched to the half-closed remote state in
h2s_wake_one_stream() and the CS_FL_EOS flag must not be set on the associated
conn-stream in h2_rcv_buf(). To sum up, it means, if there are still data
pending in the demux buffer, no abort must be reported to the streams.
To fix the issue, a dedicated function has been added, responsible for detecting
pending read0 for a H2 connection. A read0 is reported only if the demux buffer
is empty. This function is used instead of conn_xprt_read0_pending() at some
places.
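The helper is likely no more than this (a sketch based on the
description above, dbuf being the demux buffer):

    /* a read0 is only worth reporting once all demuxed data are consumed */
    static inline int h2c_read0_pending(struct h2c *h2c)
    {
        return conn_xprt_read0_pending(h2c->conn) && !b_data(&h2c->dbuf);
    }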
Note that the HREM stream state should not be used to report aborts. This is
done in the h2s_wake_one_stream() function and is a legacy of the very first
versions of the mux-h2.
This patch should be backported as far as 2.0. In 1.8, the code is too
different to apply it as-is. But it is probably useless there anyway because
the mux-h2 can only be installed on the client side.
A read0 received on the connection must not be handled too early by FCGI
streams. If the demux buffer is not empty, the pending read0 must not be
considered. The FCGI streams must not be switched to the half-closed remote
state in fcgi_strm_wake_one_stream() and the CS_FL_EOS flag must not be set on the
associated conn-stream in fcgi_rcv_buf(). To sum up, it means, if there are
still data pending in the demux buffer, no abort must be reported to the
streams.
To fix the issue, a dedicated function has been added, responsible for detecting
pending read0 for a FCGI connection. A read0 is reported only if the demux
buffer is empty. This function is used instead of conn_xprt_read0_pending() at
some places.
This patch should fix the issue #886. It must be backported as far as 2.1.
This patch re-introduce the "bind" statement on log forward
sections to handle syslog TCP listeners as defined in
rfc-6587.
As complement it introduce "maxconn", "backlog" and "timeout
client" statements to parameter those listeners.
Old processes didn't die if a log forward section was declared and
a soft stop was requested.
This patch fixes this issue and should be backported to branches
including the log forward feature.
Coverity reported dead code in the sock_unix_bind_receiver() function: a goto
clause is unreachable because of the preceding if/else block.
This patch should fix the issue #865. No backport needed.
There is no reason to wake up the H1 connection when a new output buffer is
retrieved after an allocation failure because only the H1 stream will fill it.
The session is always defined for a frontend connection. When a new client
connection is established, the session is set for the first H1 stream. But on
keep-alive connections, it is not set for the following H1 streams even though
it could be.
This patch is tagged as a bug because it fixes an inconsistency in the H1
streams creation. But it does not fix a known bug.
This patch must be backported as far as 2.0.
The condition to set the CO_RFL_READ_ONCE flag is not really accurate. We must
check the request state on the frontend connection only and, conversely, the
response state on the backend connection only. Only the parsed side must be
considered, not the opposite one.
This patch must be backported to 2.2.
On deinit, when the server SSL ctx is released, we must take care to release the
cached SSL sessions stored in the array <ssl_ctx.reused_sess>. There are
global.nbthread entries in this array, each of which may hold a pointer to a
cached session.
This patch should fix the issue #802. No backport needed.
During the config check, the post parsing is not performed. Thus, cache filters
are not fully initialized and their cache names are never released. To be able to
release them, a flag is now set when a cache filter is fully initialized. On
deinit, if the flag is not set, it means the cache name must be freed.
The patch should fix #849. No backport needed.
[Cf: Tim is the patch author, but I added the commit message]
When a TCP listener is bound, in the tcp_bind_listener() function, a warning
message may be reported and should be displayed in verbose mode. But the warning
message is actually lost if the socket is successfully bound because we don't
fill the <errmsg> variable in this case.
This patch should fix the issue #863. No backport is needed.
A peer connection status must be considered as valid only if there is an applet
which has been instantiated for the connection to the peer. So, ->statuscode
should be considered as the last known peer connection status from the last
connection to this peer if any. To reflect this, "statuscode" field of peer dump
is renamed to "last_statuscode".
This patch also adds an "active"/"inactive" field after the peer location type
("remote" or "local") depending on whether an applet has been instantiated for
this peer connection or not.
Thank you to Emeric for having noticed this issue.
Must be backported to all versions >= 1.9.
Remove a variable declaration from inside a for-loop. This was introduced
by my patch series implementing dynamic stats. It is not supported by
older gcc versions, notably in the freebsd environment of the CI.
Use the new stats module API to integrate the dns counters in the
standard stats. This is done in order to avoid code duplication, keep
the code related to cli out of dns and use the full possibility of the
stats function, allowing to print dns stats in csv or json format.
Integrate the additional proxy stats on the html stats page. For each
module, a new column is displayed with the individual stats available as
a tooltip.
Add a boolean 'clearable' to the stats module structure. If set, it forces
all the counters to be reset on 'clear counters' cli command. If not,
the counters are reset only when 'clear counters all' is used.
This is executed on startup with the registered statistics modules. The
existing statistics have been merged in a list containing all
statistics for each domain. This is useful to print all available
statistics in a generic way.
Allocate extra counters for all proxies/servers/listeners instances.
These counters are allocated with the counters from the stats modules
registered on startup.
A stat module can be registered to quickly add new statistics to
haproxy. It must be attached to one of the available stats domains. The
registration must be done using an INITCALL at the STG_REGISTER stage.
The stat module has a name which should be unique for each new module in
a domain. It also contains a statistics list with their name/desc and a
pointer to a function used to fill the stats from the module counters.
The module also provides the initial counter values used for the
automatically allocated counters. The offsets of these counters
are stored in the module structure.
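Putting the description together, a registration would plausibly look
like the following (field and helper names are approximations of the
structure described above, not verified signatures):

    enum { MY_ST_FOO, MY_ST_MAX };

    static struct name_desc my_stats[MY_ST_MAX] = {
        [MY_ST_FOO] = { .name = "foo", .desc = "total foo events" },
    };

    static struct my_counters { long long foo; } my_counters;

    /* fill one line of fields from a counters instance */
    static void my_fill_stats(void *counters, struct field *stats)
    {
        struct my_counters *c = counters;
        stats[MY_ST_FOO] = mkf_u64(FN_COUNTER, c->foo);
    }

    static struct stats_module my_stats_module = {
        .name          = "mymod",
        .domain_flags  = STATS_DOMAIN_PROXY, /* attach to the proxy domain */
        .stats         = my_stats,
        .stats_count   = MY_ST_MAX,
        .counters      = &my_counters,
        .counters_size = sizeof(my_counters),
        .fill_stats    = my_fill_stats,
        .clearable     = 1,
    };

    INITCALL1(STG_REGISTER, stats_register_module, &my_stats_module);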
Use the character '-' to mark the end of static statistics on the proxy
domain. After this marker, the order of the fields is not guaranteed and they
should be parsed with care.
This flag can be used to determine on what type of proxy object the
statistics should be relevant. It will be useful when adding dynamic
statistics. Currently, this flag is not used.
The domain option will be used to have statistics attached to other
objects than proxies/listeners/servers. At the moment, only the PROXY
domain is available.
Add an argument 'domain' on the 'show stats' cli command to specify the
domain. Only 'domain proxy' is available now. If not specified, proxy
will be considered the default domain.
For HTML output, only proxy statistics will be displayed.
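For example (the stats socket path is illustrative):

    $ echo "show stats domain proxy" | socat stdio /var/run/haproxy.sock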
Debug messages emitted in lua using core.Debug() or core.log() are now only
displayed on stderr if HAProxy is started in debug mode (-d parameter on the
command line). There is no change for other message levels.
This patch should fix the issue #879. It may be backported to all stable
versions.
Create a dedicated function to loop on proxies and dump them. This will
be clearer when other objects are dumped as well.
This patch is needed to extend stat support to components other than
proxy objects.
Create a dedicated function to dump a proxy as a json content. This
patch will be needed when other types of objects will be available for
json dump.
This patch is needed to extend stat support to components other than
proxy objects.
Use an opaque pointer to store proxy instance. Regroup server/listener
as a single opaque pointer. This has the benefit of making the structure
more flexible, to support statistics on other types of objects in the
future.
This patch is needed to extend stat support for components other than
proxy objects.
The prometheus module has been adapted for these changes.
Make the stats size parametric in the csv/json dump functions. This is
needed for the future patch which provides dynamic stats. For now the
static value ST_F_TOTAL_FIELDS is provided.
Remove unused parameter px on stats_dump_one_line.
This patch is needed to extend stat support to components other than
proxy objects.
Un-mark stats_dump_one_line and stats_putchk as static and export them
in the header file. These functions will be reusable by other components to
print their statistics.
This patch is needed to extend stat support to components other than
proxy objects.
There is a confusion between the HAProxy bundle and OpenSSL. OpenSSL
does not have "bundles" but multiple certificates in the same store.
Fix a comment in the crt-list code.
Since the health-check refactoring in 2.2, checks through a socks4 proxy are
broken. To fix this bug, the CO_FL_SOCKS4 flag must be set on the connection
before calling the connect() callback function, because this flag is checked
to select the right destination address. The same is done for the
CO_FL_SEND_PROXY flag for consistency.
A reg-test has been added to test the "check-via-socks4" directive.
This patch must be backported to 2.2.
The warning is only emitted for HTTP frontends. The idea is to encourage the
use of "tcp-request session" rules to track counters that do not depend on the
request content. The documentation has been updated accordingly.
The warning is important because since the multiplexers were added in the
processing chain, the HTTP parsing is performed at a lower level. Thus parsing
errors are detected in the multiplexers, before the stream creation. In HTTP/2,
the error is reported by the multiplexer itself and the stream is never
created. This difference has a certain number of consequences, one of which is
that HTTP request counting in stick tables only works for valid H2 requests, and
HTTP error tracking in stick tables never considers invalid H2 requests but only
invalid H1 ones. And the aim is to do the same with the mux-h1. This change will
not be done for the 2.3, but the 2.4. At the end, H1 and H2 parsing errors will
be caught by the multiplexers, at the session level. Thus, tracking counters at
the content level should be reserved for rules using a key based on the request
content or those using ACLs based on the request content.
To be clear, a warning will be emitted for the following rules:
tcp-request content track-sc0 src
tcp-request content track-sc0 src if ! { src 10.0.0.0/24 }
tcp-request content track-sc0 src if { ssl_fc }
But not for the following ones :
tcp-request content track-sc0 req.hdr(host)
tcp-request content track-sc0 src if { req.hdr(host) -m found }
We use chunk_initstr() to store the program name as the default log-tag.
If we use the log-tag directive in the config file, this chunk will be
destroyed and replaced. chunk_initstr() sets the chunk size to 0 so we
will free the chunk itself, but not its content.
This happens for a global section and also for a proxy.
We fix this by using chunk_initlen() instead of chunk_initstr().
We also check that the memory allocation was successful, otherwise we quit.
This fixes github issue #850.
It can be backported as far as 1.9, with minor adjustments to includes.
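A sketch of the fix under these assumptions (chunk_initlen() taking the
buffer, area, size and length, which should be double-checked against
the tree):

    char *tag;
    size_t len = strlen(progname);

    tag = strdup(progname);
    if (!tag) {
        ha_alert("Out of memory setting the default log-tag.\n");
        exit(EXIT_FAILURE);
    }
    /* a non-zero size marks the area as owned, so a later "log-tag"
     * directive can free and replace it */
    chunk_initlen(&global.log_tag, tag, len, len);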
This condition is never true as we either break or goto error, so those
two lines can be removed in the current state of the code.
This fixes github issue #862.
Signed-off-by: William Dauchy <w.dauchy@criteo.com>
Similar to the warning during the parsing of the regular configuration file
that was added in 2fd5bdb439, this patch adds
a warning to the parsing of a crt-list if the file does not end in a
newline (and thus might have been truncated).
The logic essentially just was copied over. It might be good to refactor
this in the future, allowing easy re-use within all line-based config
parsers.
see https://github.com/haproxy/haproxy/issues/860#issuecomment-693422936
see 0354b658f0
This should be backported as a warning to 2.2.
Previous commit fa41cb679 ("MINOR: tools: support for word expansion
of environment in parse_line") introduced two new isspace() on a char
and broke the build on systems using an array disguised in a macro
instead of a function (like cygwin). Just use the usual cast.
Allow the syntax "${...[*]}" to expand an environment variable
containing several values separated by spaces as individual arguments. A
new flag PARSE_OPT_WORD_EXPAND has been added to toggle this feature on
parse_line invocation. In case of an invalid syntax, a new error
PARSE_ERR_WRONG_EXPAND will be triggered.
This feature has been asked on the github issue #165.
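For example (an illustrative snippet, using the master-worker "program"
section whose "command" statement takes multiple arguments):

    # with OPTIONS="-q -v --enable-foo" in the environment:
    program my-sidecar
        command /usr/local/bin/sidecar "${OPTIONS[*]}"
    # behaves like: command /usr/local/bin/sidecar -q -v --enable-foo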
For some algos (roundrobin, static-rr, leastconn, first) we know that
if there is any request queued in the backend, it's because a previous
attempt failed at finding a suitable server after trying all of them.
This alone is sufficient to decide that the next request will skip the
LB algo and directly reach the backend's queue. Doing this alone avoids
an O(N) lookup when load-balancing on a saturated farm of N servers,
which starts to be very expensive for hundreds of servers, especially
under the lbprm lock. This change alone has increased the request rate
from 110k to 148k RPS for 200 saturated servers on 8 threads, and
fwlc_reposition_srv() doesn't show up anymore in perf top. See github
issue #880 for more context.
It could have been the same for random, except that random is performed
using a consistent hash and it only considers a small set of servers (2
by default), so it may result in queueing at the backend despite having
some free slots on unknown servers. It's no big deal though since random()
only performs two attempts by default.
For hashing algorithms this is pointless since we don't queue at the
backend, except when there's no hash key found, which is the least of
our concerns here.
If random() returns a server whose maxconn is reached or the queue is
used, instead of adding the request to the server's queue, better add
it to the backend queue so that it can be served by any server (hence
the fastest one).
We should not exit on error out of the crtlist_parse_line() function.
The cfgerr error must be checked against the ERR_CODE mask.
Must be backported in 2.2.
If the TRACE option is used when compiling the haproxy source,
the following error occurs on debian 9.13:
src/calltrace.o: In function `make_line':
.../src/calltrace.c:204: undefined reference to `rdtsc'
src/calltrace.o: In function `calltrace':
.../src/calltrace.c:277: undefined reference to `rdtsc'
collect2: error: ld returned 1 exit status
Makefile:866: recipe for target 'haproxy' failed
The code dealing with zombie proxies in soft_stop() is bogus, it uses
close() instead of fd_delete(), leaving a live entry in the fdtab with
a dangling pointer to a free memory location. The FD might be reassigned
for an outgoing connection for the time it takes the proxy to completely
stop, or could be dumped on the CLI's "show fd" command. In addition,
the listener's FD was not even reset, leaving doubts about whether or
not it will happen again in deinit().
And in deinit(), the loop in charge of closing zombie FDs is particularly
unsafe because it closes the fd then calls unbind_listener() then
delete_listener() hoping none of them will touch it again. Since it
requires some mental efforts to figure what's done there, let's correctly
reset the fd here as well and close it using fd_delete() to eliminate any
remaining doubts.
It's uncertain whether this should be backported. Zombie proxies are rare
and the situations capable of triggering such issues are not trivial to
setup. However it's easy to imagine how things could go wrong if backported
too far. Better wait for any matching report if at all (this code has been
there since 1.8 without anybody noticing).
There's a nasty case with listeners that belong to foreign processes.
If a proxy is defined this way:
global
nbproc 2
frontend f
bind :1111 process 1
bind :2222 process 2
and if stats expose-fd listeners is set, the listeners' FDs will not
be closed on the processes that don't use them. At this point it's not
a big deal, except that they're shared between processes and that a
"disable frontend f" issued on one process will pause all of them and
cause the other process to see accept() fail, turning its own listener
to state LI_LIMITED to try to leave it some time to recover. But it
will never recover, even after an enable.
The root cause of the issue is that the ZOMBIE state doesn't cover
this situation since it's only for a proxy being entirely bound to a
process.
What we do here to address this is that we refrain from pausing a
file descriptor that belongs to a foreign process in pause_listener().
This definitely solves the problem. A similar test is present in
resume_listener() and is the reason why the FD doesn't recover upon the
"enable" action by the way.
This ought to be backported to 1.8 where seamless reload was integrated.
The config above should be sufficient to validate that the fix works;
after a pair of "disable/enable frontend" no process will handle the
traffic to one of the ports anymore.
Since we've fixed the way URIs are handled in 2.1, some users have started
to experience inconsistencies in "balance uri" between requests received
over H1 and the same ones received over H2. This is caused by the fact
that H1 rarely uses absolute URIs while H2 always uses them. Similar
issues were reported already around replace-uri etc, leading to "pathq"
recently being introduced, so this isn't new.
Here what this patch does is add a new option to "balance uri" to indicate
that the hashing should only start at the path and not cover the authority.
This makes H1 relative URIs and H2 absolute URIs hash equally again.
Some extra options could be added to normalize URIs by always hashing the
authority (or host) in front of them, which would make sure that both
absolute and relative requests provide the same hash. This is left for
later if needed.
This memory leak happens if there are two or more defaults sections. When
the default proxy is reinitialized, the structure member containing the
config filename must be freed.
Fix github issue #851.
Should be backported as far as 1.6.
When memory allocation fails in cfg_parse_peers or when an error occurs
while parsing a stick-table, the temporary table and its id must be freed.
This fixes github issue #854. It should be backported as far as 2.0.
A subtle bug was introduced by the commit a6d9879e6 ("BUG/MEDIUM: htx:
smp_prefetch_htx() must always validate the direction"), for the "method"
sample fetch only. The sample data type and the method id are always
overwritten because the smp_prefetch_htx() function is called later in the
sample fetch evaluation. The bug is in the smp_prefetch_htx() function but
it is only visible for the "method" sample fetch, for an unknown method.
In fact, when smp_prefetch_htx() is called, the sample object is
altered. The data type is set to SMP_T_BOOL and, on success, the data value
is set to 1. Thus, if the caller has already set some infos into the sample
object, they may be lost. AFAIK, there is no reason to do so. It is
inherited from the legacy HTTP code and I honestly don't know why it was
done this way. So, instead of fixing the "method" sample fetch to set useful
info after the call to smp_prefetch_htx() function, I prefer to not alter
the sample object in smp_prefetch_htx().
This patch must be backported as far as 2.0. On the 2.0, only the HTX part
must be fixed.
When sending a frame ACK, the parser state is not equal to H2_CS_FRAME_H
and we used to report it as an error, which is not true. In fact we should
only indicate when we skip remaining data.
This may be backported as far as 2.1.
I was careful to have it for sock_unix.c but missed it for sock_inet
which broke with commit 36722d227 ("MINOR: sock_inet: report the errno
string in binding errors") depending on the build options. No backport
is needed.
Just like with previous patch, let's report UNIX socket binding errors
in plain text. We can now see, for example:
[ALERT] 260/083531 (13365) : Starting frontend f: cannot switch final and temporary UNIX sockets (Operation not permitted) [/tmp/root.sock]
[ALERT] 260/083640 (13375) : Starting frontend f: cannot change UNIX socket ownership (Operation not permitted) [/tmp/root.sock]
With the socket binding code cleanup it becomes easy to add more info to
error messages. One missing thing used to be the error string, which is
now added after the generic one, for example:
[ALERT] 260/082852 (12974) : Starting frontend f: cannot bind socket (Permission denied) [0.0.0.0:4]
[ALERT] 260/083053 (13292) : Starting frontend f: cannot bind socket (Address already in use) [0.0.0.0:4444]
[ALERT] 260/083104 (13298) : Starting frontend f: cannot bind socket (Cannot assign requested address) [1.1.1.1:4444]
We used to resort to a trick to detect whether the caller was a listener
or an outgoing socket in order never to present an AF_CUST_UDP* socket
to a log server nor a nameserver. This is no longer necessary, the socket
type alone will be enough.
We don't need to cheat with the sock_domain anymore, we now always have
the SOCK_DGRAM sock_type as a complementary selector. This patch restores
the sock_domain to AF_INET* in the udp* protocols and removes all traces
of the now unused AF_CUST_*.
By doing so we can remove the hard-coded mapping from AF_INET to AF_CUST_UDP
but we still need to keep the test on the listeners as long as these dummy
families remain present in the code.
The protocol array used to be only indexed by socket family, which is very
problematic with UDP (requiring an extra family) and with the forthcoming
QUIC (also requiring an extra family), especially since that binds them to
certain families, prevents them from supporting dgram UNIX sockets etc.
In order to address this, we now start to register the protocols with more
info, namely the socket type and the control type (either stream or dgram).
This is sufficient for the protocols we have to deal with, but could also
be extended further if multiple protocol variants were needed. But as is,
it still fits nicely in an array, which is convenient for lookups that are
instant.
This one will be needed to more accurately select a protocol. It may
differ from the socket type for QUIC, which uses dgram at the socket
layer and provides stream at the control layer. The upper level requests
a control layer only so we need this field.
Most callers of str2sa_range() need the protocol only to check that it
provides a ->connect() method. It used to be used to verify that it's a
stream protocol, but it might be a bit early to get rid of it. Let's keep
the test for now but move it to str2sa_range() when the new flag PA_O_CONNECT
is present. This way almost all call places could be cleaned from this.
There's a strange test in the server address parsing code that rechecks
the family from the socket which seems to be a duplicate of the previously
removed tests. It will have to be rechecked.
We'll need this so that it can return pointers to stacked protocol in
the future (for QUIC). In addition this removes a lot of tests for
protocol validity in the callers.
Some of them were checked further apart, or after a call to
str2listener() and they were simplified as well.
There's still a trick, we can fail to return a protocol in case the caller
accepts an fqdn for use later. This is what servers do and in this case it
is valid to return no protocol. A typical example is:
server foo localhost:1111
The function will need to use more than just a family, let's pass it
the selected protocol. The caller will then be able to do all the fancy
stuff required to pick the best protocol.
str2listener() was temporarily hacked to support datagram sockets for
the log-forward listeners. This had an undesirable side effect that
"bind udp@1.2.3.4:5555" was silently accepted as TCP for a bind line.
We don't need this hack anymore since the only user (log-forward) now
relies on str2receiver(). Now such an address will properly be rejected.
Thanks to this we don't need to specify "udp@" as it's implicitly a
datagram type listener that is expected, so any AF_INET/AF_INET6 address
will work.
This is at least temporary, as migrating everything at once is way too difficult.
For now it still creates listeners but only allows DGRAM sockets. This
aims at easing the split between listeners and receivers.
Now we only rely on dgram type associated with AF_INET/AF_INET6 to infer
UDP4/UDP6. We still keep the hint based on PA_O_SOCKET_FD to detect that
the caller is a listener though. It's still far from optimal but UDP
remains rooted into the protocols and needs to be taken out first.
For now only listeners can make use of AF_CUST_UDP and it requires hacks
in the DNS and logsrv code to remap it to AF_INET. Make str2sa_range()
smarter by detecting that it's called for a listener and only set these
protocol families for listeners. This way we can get rid of the hacks.
The parser now supports a socket type for the control layer and a possible
other one for the transport layer. Usually they are the same except for
protocols like QUIC which will provide a stream transport layer based on
a datagram control layer. The default types are preset based on the caller's
expectations, and may be refined using "stream+" and "dgram+" prefixes.
For now they were not added to the documentation because other changes
will probably happen around UDP as well. It is conceivable that "tcpv4@"
or "udpv6@" will appear later as aliases for "stream+ipv4" or "dgram+ipv6".
Just like for inherited sockets, we want to make sure that FDs that are
mentioned in "sockpair@" are actually usable. Right now this test is
performed by the callers, but not everywhere. Typically, the following
config will fail if fd #5 is not bound:
frontend
bind sockpair@5
But this one will pass if fd #6 is not bound:
backend
server s1 sockpair@6
Now both will return an error in such a case:
- 'bind' : cannot use file descriptor '5' : Bad file descriptor.
- 'server s1' : cannot use file descriptor '6' : Bad file descriptor.
As such the test in str2listener() is not needed anymore (and it was
wrong by the way, as it used to test for the socket by overwriting the
local address with a new address that's made of the FD encoded on 16
bits and happens to still be at the same place, but that strictly
depends on whatever the kernel wants to put there).
Since previous patch we know that a successfully bound fd@XXX socket
is returned as its own protocol family from str2sa_range() and not as
AF_CUST_EXISTING_FD anymore, so we don't need to check for that case
in str2listener().
When str2sa_range() is invoked for a bind or log line, and it gets a file
descriptor number, it will immediately resolve the socket's address (when
it's a socket) so that the address family, address and port are correctly
set. This will later allow to resolve some transport protocols that are
attached to existing FDs. For raw FDs (e.g. logs) and for socket pairs,
the FD number is still returned in the address, because we need the
underlying address management to complete the bind/listen/connect/whatever
needed. One immediate benefit is that passing a bad FD will now result in
one of these errors:
'bind' : cannot use file descriptor '3' : Socket operation on non-socket.
'bind' : socket on file descriptor '3' is of the wrong type.
Note that as of now, we never return a listening socket with a family of
AF_CUST_EXISTING_FD. The only case where this family is seen is for a raw
FD (e.g. logs).
If a file descriptor was passed, we can optionally return it. This will
be useful for listening sockets which are both a pre-bound FD and a ready
socket.
These flags indicate whether the call is made to fill a bind or a server
line, or even just send/recv calls (like logs or dns). Some special cases
are made for outgoing FDs (e.g. pipes for logs) or socket FDs (e.g. external
listeners), and there's a distinction between stream or dgram usage that's
expected to significantly help str2sa_range() proceed appropriately with
the input information. For now they are not used yet.
Now that str2sa_range() checks for appropriate port specification, we
don't need to implement ad-hoc test cases in every call place: if the
result is valid, the conditions are met; otherwise the error message is
appropriately filled.
Now str2sa_range() will enforce the caller's port specification passed
using the PA_O_PORT_* flags, and will return an error on failure. For
optional ports, values 0-65535 will be enforced. For mandatory ports,
values 1-65535 are enforced. In case of ranges, it is also verified that
the upper bound is not lower than the lower bound, as this used to result
in empty listeners.
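A hedged sketch of what the centralized enforcement amounts to (flag
spellings follow the PA_O_PORT_* description; exact names and the
surrounding variables are assumptions):

    if (!port_low && (opts & PA_O_PORT_MAND)) {
        memprintf(err, "missing port specification in '%s'", str);
        goto fail;
    }
    if (port_low != port_high && !(opts & PA_O_PORT_RANGE)) {
        memprintf(err, "port ranges are not permitted in '%s'", str);
        goto fail;
    }
    if (port_low > port_high) {
        memprintf(err, "invalid port range %d-%d in '%s'",
                  port_low, port_high, str);
        goto fail;
    }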
I couldn't find an easy way to test this using VTC since the purpose is
to trigger parse errors, so instead a test file is provided as
tests/ports.cfg with comments about what errors are expected for each
line.
These flags indicate what is expected regarding port specifications. Some
callers accept none, some need fixed ports, some have it mandatory, some
support ranges, and some take an offset. Each possibility is reflected by
an option. For now they are not exploited, but the goal is to instrument
str2sa_range() to properly parse that.
We currently have an argument to require that the address is resolved
but we'll soon add more, so let's turn it into a bit field. The old
"resolve" boolean is now PA_O_RESOLVE.
The code is built to match prefixes at one place and to parse the address
as a second step, except for fd@ and sockpair@ where the test first passes
via AF_UNSPEC that is changed again. This is ugly and confusing, so let's
proceed like for the other ones.
At some places (log fd@XXX, bind fd@XXX) we support using an explicit
file descriptor number, that is placed into the sockaddr for later use.
The problem is that till now it was done with an AF_UNSPEC family, which
is also used for other situations like missing info or rings (for logs).
Let's create an "official" family AF_CUST_EXISTING_FD for this case so
that we are certain the FD can be found in the address when it is set.
This removes the following fields from struct protocol that are now
retrieved from the protocol family instead: .sock_family, .sock_addrlen,
.l3_addrlen, .addrcmp, .bind, .get_src, .get_dst.
This also removes the UDP-specific udp{,6}_get_{src,dst}() functions
which were referenced but not used yet. Their goal was only to remap
the original AF_INET* addresses to AF_CUST_UDP*.
Note that .sock_domain is still there as it's used as a selector for
the protocol struct to be used.
We now take care of retrieving sock_family, l3_addrlen, bind(),
addrcmp(), get_src() and get_dst() from the protocol family and
not just the protocol itself. There are very few places; this was
only seldom used. Interestingly, sock_inet.c used to rely on
->sock_family instead of ->sock_domain, and sock_unix.c used to
hard-code PF_UNIX instead of using ->sock_domain.
Also it appears obvious we have something wrong in the protocol
selection algorithm because sock_domain is the one set to the custom
protocols while it ought to be sock_family instead, which would avoid
having to hard-code some conversions for UDP namely.
We need to specially handle protocol families which regroup common
functions used for a given address family. These functions include
bind(), addrcmp(), get_src() and get_dst() for now. Some fields are
also added about the address family, socket domain (protocol family
passed to the socket() syscall), and address length.
These protocol families are referenced from the protocols but not yet
used.
All protocol's listeners now only take care of themselves and not of
the receiver anymore since that's already being done in proto_bind_all().
Now it finally becomes obvious that UDP doesn't need a listener, as the
only thing it does is to set the listener's state to LI_LISTEN!
Now protocol_bind_all() starts the receivers before their respective
listeners so that ultimately we won't need the listeners for non-
connected protocols.
We still have to resort to an ugly trick to set the I/O handler in
case of syslog over UDP because for now it's still not set in the
receiver, so we hard-code it.
Note that for now we don't have a sockpair.c file to host that unusual
family, so the new function was placed directly into proto_sockpair.c.
It's no big deal given that this family is currently not shared with
multiple protocols.
The function does almost nothing besides setting up the receiver. This is
normal, as the sockets the FDs are passed onto are supposed to have already
been created somewhere else, and the only usable identifier for such
a socket pair is the receiving FD itself.
The function was assigned to sockpair's ->bind() and is not used yet.
This removes all the AF_UNIX-specific code from uxst_bind_listener()
and now simply relies on sock_unix_bind_listener() to do the same
job. As mentioned in the previous commit, the only difference is that
now an unlikely failure on listen() will not result in a roll back
of the temporary socket names since they will have been renamed
during the bind() operation (as expected). But such failures do not
correspond to any normal case and mostly denote operating system
issues so there's no functionality loss here.
This function performs all the bind-related stuff for UNIX sockets that
was previously done in uxst_bind_listener(). There is a very tiny
difference however, which is that previously, in the unlikely event
where listen() would fail, it was still possible to roll back the binding
and rename the backup to the original socket. Now we have to rename it
before returning, hence it will be done before calling listen().
However, this doesn't cover any particular use case since listen() has no
reason to fail there (and the rollback is not done for inherited sockets),
that was just done that way as a generic error processing path.
The code is not used yet and is referenced in the uxst proto's ->bind().
This removes all the AF_INET-specific code from udp_bind_listener()
and now simply relies on sock_inet_bind_listener() to do the same
job. The function is now basically just a wrapper around
sock_inet_bind_receiver().
This removes all the AF_INET-specific code from tcp_bind_listener()
and now simply relies on sock_inet_bind_listener() to do the same
job. The function is now roughly cut in half and its error path
significantly simplified.
This function collects all the receiver-specific code from both
tcp_bind_listener() and udp_bind_listener() in order to provide a more
generic AF_INET/AF_INET6 socket binding function. For now the API is
not very elegant because some info is still missing from the receiver
and there's no ideal place to fill it except when calling ->listen()
at the protocol level. It looks like some polishing code is needed in
check_config_validity() or somewhere around this in order to finalize
the receivers' setup. The main issue is that listeners and receivers
are created *before* bind_conf options are parsed and that there's no
finishing step to resolve some of them.
The function currently sets up a receiver and subscribes it to the
poller. In an ideal world we wouldn't subscribe it but let the caller
do it after having finished configuring the L4 stuff. The problem is
that the caller would then need to perform an fd_insert() call and to
possibly set the exported flag on the FD while it's not its job. Maybe
an improvement could be to have a separate sock_start_receiver() call
in sock.c.
For now the function is not used but it will soon be. It's already
referenced as tcp and udp's ->bind().
The new RX_O_FOREIGN, RX_O_V6ONLY and RX_O_V4V6 options are now set into
the rx_settings part during the parsing, so that we don't need to adjust
them in each and every listener anymore. We have to keep both v4v6 and
v6only due to the precedence from v6only over v4v6.
It's the receiver's FD that's inherited from the parent process, not
the listener's, so the flag must move to the receiver so that appropriate
actions can be taken.
In order to split the receiver from the listener, we'll need to know that
a socket is already bound and ready to receive. We used to do that via
the LI_O_ASSIGNED state but that's not sufficient anymore since the
receiver might not belong to a listener anymore. The new RX_F_BOUND flag
is used for this.
Some socket settings used to be retrieved via the listener and the
bind_conf. Now instead we use the receiver and its settings whenever
appropriate. This will simplify the removal of the dependency on the
listener.
A receiver will have to pass a context to be installed into the fdtab
for use by the handler. We need to set this into the receiver struct
as the bind will happen long after the configuration.
Just like listeners keep a pointer to their bind_conf, receivers now also
have a pointer to their rx_settings. All those belonging to a listener are
automatically initialized with a pointer to the bind_conf's settings.
sock_find_compatible_fd() can now access the protocol via the receiver
hence it can access its socket type and know whether the receiver has
dgram or stream sockets, so we don't need to hack around AF_CUST_UDP*
anymore there.
The receiver is the one which depends on the protocol while the listener
relies on the receiver. Let's move the protocol there. Since there's also
a list element to get back to the listener from the proto list, this list
element (proto_list) was moved as well. For now when scanning protos, we
still see listeners which are linked by their rx.proto_list part.
The listening socket is represented by its file descriptor, which is
generic to all receivers and not just listeners, so it must move to
the rx struct.
It's worth noting that in order to extend receivers and listeners to
other protocols such as QUIC, we'll need other handles than file
descriptors here, and that either a union or a cast to uintptr_t
will have to be used. This was not done yet and the field was
preserved under the name "fd" to avoid adding confusion.
The netns is common to all listeners/receivers and is used to bind the
listening socket so it must be in the receiver settings and not in the
listener. This removes yet another set of unnecessary loops.
The interface is common to all listeners/receivers and is used to bind
the listening socket so it must be in the receiver settings and not in
the listener. This removes some unnecessary loops.
There currently is a large inconsistency in how binding parameters are
split between bind_conf and listeners. It happens that for historical
reasons some parameters are available at the listener level but cannot
be configured per-listener but only for a bind_conf, and thus, need to
be replicated. In addition, some of the bind_conf parameters are in fact
for the listening socket itself while others are for the instantiated
sockets.
A previous attempt at splitting listeners into receivers failed because
the boundary between all these settings is not well defined.
This patch introduces a level of listening socket settings in the
bind_conf, that will be detachable later. Such settings that are solely
for the listening socket are:
- unix socket permissions (used only during binding)
- interface (used for binding)
- network namespace (used for binding)
- process mask and thread mask (used during startup)
The rest seems to be used only to initialize the resulting sockets, or
to control the accept rate. For now, only the unix params (bind_conf->ux)
were moved there.
Just like with previous commit, DNS nameservers are affected as well with
addresses starting in "udp@", but here it's different, because due to
another bug in the DNS parser, the address is rejected, indicating that
it doesn't have a ->connect() method. Similarly, the DNS code believes
it's working on top of TCP at this point, and this used to work only
because of that. The same fix is applied to remap the protocol, and the
->connect() test was dropped.
No backport is needed, as the ->connect() test will never strike in 2.2
or below.
Commit 3835c0dcb ("MEDIUM: udp: adds minimal proto udp support for
message listeners.") introduced a problematic side effect in log server
address parser: if "udp@", "udp4@" or "udp6@" prefixes a log server's
address, the address is passed as-is to the log server with a non-existing
family and fails like this when trying to send:
[ALERT] 259/195708 (3474) : socket() failed in logger #1: Address family not supported by protocol (errno=97)
The problem is that till now there was no UDP family, so logs expect an
AF_INET family to be passed for UDP there.
This patch manually remaps AF_CUST_UDP4 and AF_CUST_UDP6 to their "tcp"
equivalent that the log server parser expects. No backport is needed.
Remove the last utility functions for handling the multi-cert bundles
and remove the multi-variable from the ckch structure.
With this patch, the bundles are completely removed.
The multi variable is not useful anymore since the removal of the
multi-certificates bundle support. It can be removed safely from the CLI
functions, and we can suppose that every ckch contains a single certificate.
Since the removal of the multi-certificates bundle support, this
variable is not useful anymore. We can remove all tests for this
variable and suppose that every ckch contains a single certificate.
Like the previous commit, this one emulates the bundling by loading each
certificate separately and storing it in a separate SSL_CTX.
This patch does it for the standard certificate loading, which means
outside directories or crt-list.
The multi-certificates bundle was the common way of offering multiple
certificates of different types (ecdsa and rsa) for a same SSL_CTX.
This was implemented with OpenSSL 1.0.2 before the client_hello callback
was available.
Now that all versions which do not support this callback are
deprecated (< 1.1.0), we can safely remove the support for the bundles,
which were inconvenient and complexified the code too much.
The multi-certificates bundle was the common way of offering multiple
certificates of different types (ecdsa and rsa) for a same SSL_CTX.
This was implemented with OpenSSL 1.0.2 before the client_hello callback
was available.
Now that all versions which do not support this callback are
deprecated (< 1.1.0), we can safely remove the support for the bundles,
which were inconvenient and complexified the code too much.
This patch emulates the bundle loading by looking for the bundle files
when the specified file in the configuration does not exist. It then
creates new entries in the crtlist, so they will appear as new lines if
they are dumped from the CLI.
Remove the support for multi-certificates bundle in the CLI. There is
nothing to replace here, it will use the standard codepath with the
"bundle emulation" in the future.
The multi-cert certificates bundle is the former way, implemented with
openssl 1.0.2, of doing multi-certificate (RSA, ECDSA and DSA) for the
same SNI host. Remove this support temporarily so that it can be replaced
by the loading of each certificate in a separate SSL_CTX.
The use of "bind" wasn't that wise but was temporary. The problem is that
it will not allow to coexist with tcp. Let's explicitly call it "dgram-bind"
so that datagram listeners are expected here, leaving some room for stream
listeners later. This is the only change.
Since the refactoring of the crt-list, the same function is used to
parse a crt-list file and a crt-list line on the CLI.
The assumption was made that a line, whether on the CLI or in a file, is
finished by a \n. However that is potentially not the case with a file
which does not end with a \n.
This patch fixes issue #860 and must be backported in 2.2.
In the SSL code, when we were waiting for the availability of the crypto
engine, once it is ready and its fd's I/O handler is called, don't call
ssl_sock_io_cb() directly, instead, call tasklet_wakeup() on the
ssl_sock_ctx's tasklet. We were calling ssl_sock_io_cb() with NULL as
a tasklet, which used to be fine, but it is no longer true since the
fd takeover changes. We could just provide the tasklet, but let's just
wake the tasklet, as is done for other FDs, for fairness.
This should fix github issue #856.
This should be backported into 2.2.
The socks4 keyword parser was a bit too much copy-pasted, it only checks
for a null port and reports "invalid range". Let's properly check for the
1-65535 range and report the correct error.
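A minimal sketch of such a range check (illustrative names, not the actual
parser code):

#include <stdlib.h>

/* validate a port argument: reject empty input, trailing junk and
 * anything outside the 1-65535 range.
 */
static int parse_port_1_65535(const char *arg, long *port)
{
    char *end;
    long p = strtol(arg, &end, 10);

    if (*arg == '\0' || *end != '\0' || p < 1 || p > 65535)
        return 0;
    *port = p;
    return 1;
}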
It may be backported everywhere "socks4" is present (2.0).
In bug #835, @arjenzorgdoc reported that the verifyhost option on the
server line is case-sensitive, that shouldn't be the case.
This patch fixes the issue by replacing memcmp by strncasecmp and strcmp
by strcasecmp. The patch was suggested by @arjenzorgdoc.
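The substitution can be sketched like this (illustrative helpers, not the
exact ssl_sock code):

#include <string.h>
#include <strings.h>

/* host names compare case-insensitively, so use the caseless variants */
static int prefix_matches(const char *name, const char *pattern, size_t len)
{
    return strncasecmp(name, pattern, len) == 0; /* was: memcmp() */
}

static int name_matches(const char *name, const char *pattern)
{
    return strcasecmp(name, pattern) == 0;       /* was: strcmp() */
}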
This must be backported in all versions supporting the verifyhost
option.
Changes performed using the following coccinelle patch:
@@
type T;
expression E;
expression t;
@@
(
t = calloc(E, sizeof(*t))
|
- t = calloc(E, sizeof(T))
+ t = calloc(E, sizeof(*t))
)
Looking through the commit history, grepping for coccinelle shows that the same
replacement with a different patch was already performed in the past in commit
02779b6263.
newsrv->curr_idle_thr is of type `unsigned int`, not `int`. Fix this issue
by simply passing the dereferenced pointer to sizeof, which is the preferred
style anyway.
This bug was introduced in commit dc2f2753e9.
It first appeared in 2.2-dev5. The patch must be backported to 2.2+.
It is notable that the `calloc` call was not introduced within the commit in
question. The allocation was already happening before that commit and it
already looked like it does after applying the patch. Apparently the
argument for the `sizeof` managed to get broken during the rearrangement
that happened in that commit:
for (i = 0; i < global.nbthread; i++)
- MT_LIST_INIT(&newsrv->idle_orphan_conns[i]);
- newsrv->curr_idle_thr = calloc(global.nbthread, sizeof(*newsrv->curr_idle_thr));
+ MT_LIST_INIT(&newsrv->safe_conns[i]);
+
+ newsrv->curr_idle_thr = calloc(global.nbthread, sizeof(int));
Even more notable is that I previously fixed that *exact same* allocation in
commit 017484c80f.
So apparently this single line managed to get broken twice in the same
way, for whatever reason there might be.
iif() takes a boolean as input and returns one of the two argument
strings depending on whether the boolean is true.
This converter is most likely useful to return the proper scheme
depending on the value returned by the `ssl_fc` fetch, e.g. for use within
the `x-forwarded-proto` request header.
However it can also be useful for use within a template that is sent to
the client using `http-request return` with a `lf-file`. It allows the
administrator to implement a simple condition, without needing to prefill
variables within the regular configuration using `http-request
set-var(req.foo)`.
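For example, the x-forwarded-proto case mentioned above could be written
like this (a configuration sketch based on the description):

http-request set-header x-forwarded-proto %[ssl_fc,iif(https,http)]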
It must be done to expire patterns cached in the LRU cache. Otherwise it is
possible to retrieve an already freed pattern, attached to a released pattern
expression.
When a specific pattern is deleted (->delete() callback), the pattern expression
revision is already renewed. Thus it is not affected by this bug. Only prune
action on the pattern expression is concerned.
In addition, for a pattern expression, in ->prune() callbacks when the pattern
list is released, a missing LIST_DEL() has been added. It is not a real issue
because the list is reinitialized at the end and all elements are released and
should never be reused. But it is less confusing this way.
This bug may be triggered when a map is cleared from the cli socket. A
workaround is to set the pattern cache size (tune.pattern.cache-size) to 0 to
disable it.
This patch should fix the issue #844. It must be backported to all supported
versions.
This allocation technically is always reachable and cannot leak, however other
global variables such as `oldpids` are already being freed. This is in an
attempt to get HAProxy to a state where there are zero live allocations after a
clean exit.
Given the following example configuration:
listen http
bind *:80
mode http
stats scope .
Running a configuration check with valgrind reports:
==16341== 26 (24 direct, 2 indirect) bytes in 1 blocks are definitely lost in loss record 3 of 13
==16341== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==16341== by 0x571C2E: stats_add_scope (uri_auth.c:296)
==16341== by 0x46CE29: cfg_parse_listen (cfgparse-listen.c:1901)
==16341== by 0x45A112: readcfgfile (cfgparse.c:2078)
==16341== by 0x50A0F5: init (haproxy.c:1828)
==16341== by 0x418248: main (haproxy.c:3012)
After this patch is applied the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
It initially looked appealing to be able to call traces with ",,," for
unused arguments, but tcc doesn't like empty macro arguments, and quite
frankly, adding a zero between the few remaining ones is no big deal.
Let's do so now.
Commit 77b98220e ("BUG/MINOR: threads: work around a libgcc_s issue with
chrooting") tried to address an issue with libgcc_s being loaded too late.
But it turns out that the symbol used there isn't present on armhf, thus
it breaks the build.
Given that the issue manifests itself during pthread_exit(), the safest
and most portable way to test this is to call pthread_exit(). For this
we create a dummy thread which exits, during the early boot. This results
in the relevant library to be loaded if needed, making sure that a later
call to pthread_exit() will still work. It was tested to work fine under
linux on the following platforms:
glibc:
- armhf
- aarch64
- x86_64
- sparc64
- ppc64le
musl:
- mipsel
Just running the code under strace easily shows the call in the dummy
thread, for example here on armhf:
$ strace -fe trace=file ./haproxy -v 2>&1 | grep gcc_s
[pid 23055] open("/lib/libgcc_s.so.1", O_RDONLY|O_CLOEXEC) = 3
The code was isolated so that it's easy to #ifdef it out if needed.
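The approach can be sketched as follows (a simplified illustration, not
the exact committed code):

#include <pthread.h>

static void *dummy_thread(void *arg)
{
    /* the first pthread_exit() in the process pulls libgcc_s in */
    pthread_exit(NULL);
    return NULL;
}

/* called once during early boot, before any chroot() */
static void preload_libgcc_s(void)
{
    pthread_t t;

    if (pthread_create(&t, NULL, dummy_thread, NULL) == 0)
        pthread_join(t, NULL);
}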
This should be backported where the patch above is backported (likely
2.0).
The condition in h1_refresh_timeout() seems insufficient to properly
take care of the half-closed timeout, because depending on the ordering
of operations when performing the last send() to a client, the stream
may or may not still be there and we may fail to shrink the client
timeout on our last opportunity to do so.
Here we want to make sure that the timeout is always reduced when the
last chunk was sent and the shutdown completed, regardless of the
presence of a stream or not. This is what this patch does.
This should be backported as far as 2.0, and should fix the issue
reported in #541.
Since 1.8 with commit e8692b41e ("CLEANUP: auth: use the build options list
to report its support"), crypt(3) is always reported as being supported in
"haproxy -vv" because no test on USE_LIBCRYPT is made anymore when
producing the output.
This reintroduces the distinction between with and without USE_LIBCRYPT
in the output by indicating "yes" or "no". It may be backported as far
as 1.8, though the code differs due to a number of include files cleanups.
When the server address is set for the first time, the log message is a bit ugly
because there is no old IP address to report. Thus in the log, we can see :
PX/SRV changed its IP from to A.B.C.D by DNS additional record.
Now, when this happens, "(none)" is reported :
PX/SRV changed its IP from (none) to A.B.C.D by DNS additional record.
This patch may be backported to 2.2.
When a SRV record for an already known server is processed, only the weight is
updated, if not configured to be ignored. It is a problem if the IP address
carried by the associated additional record changes, because the server IP
address is never renewed.
To fix this bug, if there is an additional record attached to a SRV record,
we always try to set the IP address. If it is the same, no change is
performed. This way, IP changes are always handled.
This patch should fix the issue #841. It must be backported to 2.2.
A SRV record keeps a reference on the corresponding additional record, if
any. But this additional record is also inserted in a separate linked-list into
the dns response. The problems arise when obsolete additional records are
released. The additional records list is purged but the SRV records always
reference these objects, leading to undefined behavior. Worse, this happens
very quickly because additional records are never renewed. Thus, once received,
an additional record will always expire.
Now, the additional records are only associated with a SRV record or simply
ignored. And the last version is always used.
This patch helps to fix the issue #841. It must be backported to 2.2.
The pathq sample fetch extracts the relative URI of a request, i.e. the path
with the query-string, excluding the scheme and the authority, if any. It is
pretty handy to always get a relative URI independently of the HTTP version.
Indeed, while relative URIs are common in HTTP/1.1, in HTTP/2, most of the
time clients use absolute URIs.
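A typical use is an HTTPS redirect which must preserve the query-string
regardless of the HTTP version (configuration sketch):

http-request redirect location https://%[req.hdr(host)]%[pathq] if !{ ssl_fc }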
This patch may be backported to 2.2.
These actions do the same as corresponding "-path" versions except the
query-string is included in the manipulated request path. This means the
set-pathq action replaces the path and the query-string, while the
replace-pathq action matches and replaces the path including the query-string.
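For example (configuration sketch based on the description above):

http-request set-pathq /new/path?redirected=1
http-request replace-pathq ^/api/(.*)$ /\1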
This patch may be backported to 2.2.
This reverts commit 4b9c0d1fc0.
Actually, the "replace-path" action is ambiguous. "set-path" action preserves
the query-string. The "path" sample fetch does not contain the query-string. But
"replace-path" action is documented to handle the query-string. It is probably
not the expected behavior. So instead of fixing the code, we will fix the
documentation to make "replace-path" action consistent with other parts of the
code. In addition actions and sample fetches to handle the path with the
query-string will be added.
If the commit above is ever backported, this one must be as well.
It was reported in bug #837 that haproxy -s causes a 100% CPU.
However this option does not exist and haproxy must exit with the
usage message.
The parser was not handling the case where -s is not followed by 't' or
'f' which are the only two valid cases.
This bug was introduced by df6c5a ("BUG/MEDIUM: mworker: fix the copy of
options in copy_argv()") which was backported as far as 1.8.
This fix must be backported as far as 1.8.
Ever since the protocols were added in 1.3.13, listeners used to be
started twice:
- once by start_proxies(), which iterates over all proxies then all
listeners ;
- once by protocol_bind_all() which iterates over all protocols then
all listeners ;
It's a real mess because error reporting is not even consistent, and
more importantly now that some protocols do not appear in regular
proxies (peers, logs), there is no way to retry their binding should
it fail on the last step.
What this patch does is to make sure that listeners are exclusively
started by protocols. The failure to start a listener now causes the
emission of an error indicating the proxy's name (as it used to be
the case per proxy), and retryable failures are silently ignored
during all but last attempts.
The start_proxies() function was kept solely for setting the proxy's
state to READY and emitting the "Proxy started" message and log that
some have likely grown used to looking for in their logs.
Similarly to previous commit about ->bind_all(), we have the same
construct for ->unbind_all() which ought not to be used either. Let's
make protocol_unbind_all() iterate over all listeners and directly
call unbind_listener() instead.
It's worth noting that for uxst there was originally a specific
->unbind_all() function but the simplifications that came over the
years have resulted in a locally reimplemented version of the same
function: the test on (state > LI_ASSIGNED) there is equivalent to
the one on (state >= LI_PAUSED) that is used in do_unbind_listener(),
and it seems these have been equivalent since at least commit dabf2e264
("[MAJOR] added a new state to listeners") (1.3.14).
All protocols only iterate over their own listeners list and start
the listeners using a direct call to their ->bind() function. This
code duplication doesn't make sense and prevents us from centralizing
the startup error handling. Worse, it's not even symmetric because
there's an unbind_all_listeners() function common to all protocols
without any equivalent for binding. Let's start by directly calling
each protocol's bind() function from protocol_bind_all().
Previous commit 77b98220e ("BUG/MINOR: threads: work around a libgcc_s
issue with chrooting") broke the build on cygwin. I didn't even know we
supported threads on cygwin. But the point is that it's actually the
glibc-based libpthread which requires libgcc_s, so in absence of other
reports we should not apply the workaround on other libraries.
This should be backported along with the aforementioned patch.
Sander Hoentjen reported another issue related to libgcc_s in issue #671.
What happens is that when the old process quits, pthread_exit() calls
something from libgcc_s.so after the process was chrooted, and this is
the first call to that library, causing an attempt to load it. In a
chroot, this fails, thus libpthread aborts. The behavior widely differs
between operating systems because some decided to use a static build for
this library.
In 2.2 this was resolved as a side effect of a workaround for the same issue
with the backtrace() call, which is also in libgcc_s. This was in commit
0214b45 ("MINOR: debug: call backtrace() once upon startup"). But backtraces
are not necessarily enabled, and we need something for older versions.
By inspecting a significant number of libgcc_s on various gcc versions and
platforms, it appears that a few functions have been present since gcc 3.0,
one of which, _Unwind_Find_FDE() has no side effect (it only returns a
pointer). What this patch does is that in the thread initialization code,
if built with gcc >= 3.0, a call to this function is made in order to make
sure that libgcc_s is loaded at start up time and that there will be no
need to load it upon exit.
An easy way to check which libs are loaded under Linux is :
$ strace -e trace=openat ./haproxy -v
With this patch applied, libgcc_s now appears during init.
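The mechanism can be sketched as follows (hypothetical helper, assuming,
as stated above, that a mere lookup call is enough to force the library in):

#if defined(__GNUC__) && (__GNUC__ >= 3)
/* provided by libgcc_s since gcc 3.0; it only performs a lookup */
extern void *_Unwind_Find_FDE(void *pc, void *bases);

static void preload_libgcc_s(void)
{
    char bases[64]; /* opaque room for the struct dwarf_eh_bases output */

    /* result ignored: the point is just to get libgcc_s mapped early */
    _Unwind_Find_FDE((void *)preload_libgcc_s, bases);
}
#endif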
Sander confirmed that this patch was enough to put an end to the core
dumps on exit in 2.0, so this patch should be backported there, and maybe
as far as 1.8.
In issue #777, cppcheck wrongly assumes a useless null pointer check
in the expression below while it's obvious that in a 3G/1G split on
32-bit, len can become positive if p is NULL:
p = memchr(ctx.value.ptr, ' ', ctx.value.len);
len = p - ctx.value.ptr;
if (!p || len <= 0)
return 0;
In addition, on 64 bits you never know given that len is a 32-bit signed
int thus the sign of the result in case of a null p will always be the
opposite of the 32nd bit of ctx.value.ptr. Admittedly the test is ugly.
Tim proposed this fix consisting in checking for p == ctx.value.ptr
instead when checking for first character only, which Ilya confirmed is
enough to shut cppcheck up. No backport is needed.
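The resulting test, reconstructed from the description above, thus looks
like:

p = memchr(ctx.value.ptr, ' ', ctx.value.len);
if (!p || p == ctx.value.ptr)
        return 0;
len = p - ctx.value.ptr;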
When calling the http_replace_res_status() function, an optional reason may now
be set. If it points to NULL, it is ignored and the original reason is
preserved; only the response status is replaced. Otherwise both the status
and the reason are replaced.
It simplifies the API and, most of the time, avoids an extra call to
http_replace_res_reason().
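A usage sketch (assuming the ist-based signature used by the HTX API):

/* replace the status only: the original reason is preserved */
http_replace_res_status(htx, ist("404"), ist2(NULL, 0));

/* replace both the status and the reason */
http_replace_res_status(htx, ist("404"), ist("Not Found"));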
The documentation stated the "replace-path" action replaces the path, including
the query-string if any is present. But in the code, only the path is
replaced. The query-string is preserved. So, now, instead of relying on the
same action code as the "set-uri" action (1), a new action code (4) is used for
"replace-path" action. In http_req_replace_stline() function, when the action
code is 4, we call http_replace_req_path() setting the last argument (with_qs)
to 1. This way, the query-string is not skipped but included to the path to be
replaced.
This patch relies on the commit b8ce505c6 ("MINOR: http-htx: Add an option to
eval query-string when the path is replaced"). Both must be backported as far as
2.0. It should fix the issue #829.
The http_replace_req_path() function now takes a third argument to evaluate the
query-string as part of the path or to preserve it. If <with_qs> is set, the
query-string is replaced with the path. Otherwise, only the path is replaced.
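A usage sketch (based on the description above):

/* with_qs=1: the query-string is part of the replaced area */
http_replace_req_path(htx, ist("/new/path?a=1"), 1);

/* with_qs=0: only the path is replaced, the query-string is preserved */
http_replace_req_path(htx, ist("/new/path"), 0);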
This patch is mandatory to fix issue #829. The next commit depends on it. So be
careful during backports.
When an informational response (1xx) is received, we must be sure to send it
ASAP. To do so, CF_SEND_DONTWAIT flag must be set on the response channel to
instruct the stream-interface to not set the CO_SFL_MSG_MORE flag on the
transport layer. Otherwise the response delivery may be delayed, because of the
commit 8945bb6c0 ("BUG/MEDIUM: stream-int: fix loss of CO_SFL_MSG_MORE flag in
forwarding").
Note that a previous patch (cf6898cd ["BUG/MINOR: http-ana: Don't wait to send
1xx responses generated by HAProxy"]) added this flag on 1xx responses generated
by HAProxy but not on responses coming from servers.
This patch must be backported to 2.2 and may be backported as far as 1.9, for
HTX part only. But this part has changed in the 2.2, so it may be a bit
tricky. Note it does not fix any known bug on 2.1 and below because the
CO_SFL_MSG_MORE flag is ignored by the h1 mux.
Commit 0d06df6 ("MINOR: sock: introduce sock_inet and sock_unix")
made use of isdigit() on the UNIX socket path without casting the
value to unsigned char, breaking the build on cygwin and possibly
other platforms. No backport is needed.
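The portable idiom is to cast the value to unsigned char first, e.g.:

#include <ctype.h>

/* ctype functions require an int representable as unsigned char (or EOF) */
static int starts_with_digit(const char *s)
{
    return isdigit((unsigned char)*s);
}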
For now we still don't retrieve dgram sockets, but the code must be able
to distinguish them before we switch to receivers. This adds a new flag
to the xfer_sock_list indicating that a socket is of type SOCK_DGRAM. The
way to set the flag for now is by looking at the dummy address family which
equals AF_CUST_UDP{4,6} in this case (given that other dgram sockets are not
yet supported).
We'll want to store more info there and some info that are not represented
in listener options at the moment (such as dgram vs stream) so let's get
rid of these and instead use a new set of options (SOCK_XFER_OPT_*).
The new function was called sock_get_old_sockets() and was left as-is
except for a minimal amount of style lifting to make it more readable. It
will never be awesome anyway since it's used very early in the boot
sequence and needs to perform socket I/O without any external help.
This code was highly redundant, existing for TCP clients, TCP servers
and UDP servers. Let's move it to sock_inet where it belongs. The new
functions are sock_inet4_make_foreign() and sock_inet6_make_foreign().
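On Linux, making a socket "foreign" essentially means allowing it to bind
to a non-local address; a minimal sketch (the real functions may also deal
with platform-specific fallbacks such as IP_FREEBIND):

#include <netinet/in.h>
#include <sys/socket.h>

static int make_foreign_sketch(int fd)
{
    int one = 1;

    return setsockopt(fd, IPPROTO_IP, IP_TRANSPARENT, &one, sizeof(one)) == 0;
}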
This is essentially a merge from tcp_find_compatible_fd() and
uxst_find_compatible_fd() that relies on a listener's address and
compare function and still checks for other variations. For AF_INET6
it compares a few of the listener's bind options. A minor change for
UNIX sockets is that transparent mode, interface and namespace used
to be ignored when trying to pick a previous socket while now if they
are changed, the socket will not be reused. This could be refined but
it's still better this way as there is no more risk of using a
differently bound socket by accident.
Eventually we should not pass a listener there but a set of binding
parameters (address, interface, namespace etc...) which ultimately will
be grouped into a receiver. For now this still doesn't exist so let's
stick to the listener to break dependencies in the rest of the code.
Let's determine it at boot time instead of doing it on first use. It
also saves us from having to keep it thread local. It's been moved to
the new sock_inet_prepare() function, and the variables were renamed
to sock_inet_tcp_maxseg_default and sock_inet6_tcp_maxseg_default.
The v6only_default variable is not specific to TCP but to AF_INET6, so
let's move it to the right file. It's now immediately filled on startup
during the PREPARE stage so that it doesn't have to be tested each time.
The variable's name was changed to sock_inet6_v6only_default.
The function now makes it clear that it's independent on the socket
type and solely relies on the address family. Note that it supports
both IPv4 and IPv6 as we don't seem to need it per-family.
This one is common to the TCPv4 and UDPv4 code, it retrieves the
destination address of a socket, taking care of the possibility that for
an incoming connection the traffic was possibly redirected. The TCP and
UDP definitions were updated to rely on it and remove duplicated code.
The new addrcmp() protocol member points to the function to be used to
compare two addresses of the same family.
When picking an FD from a previous process, we can now use the
family-specific address comparison functions instead of having to rely on a
local implementation. This will help move that code to a more central
place.
These files will regroup everything specific to AF_INET, AF_INET6 and
AF_UNIX socket definitions and address management. Some code there might
be agnostic to the socket type and could later move to af_xxxx.c but for
now we only support regular sockets so no need to go too far.
The files are quite poor at this step, they only contain the address
comparison function for each address family.
The new file sock.c will contain generic code for standard sockets
relying on file descriptors. We currently have way too much duplication
between proto_uxst, proto_tcp, proto_sockpair and proto_udp.
For now only get_src, get_dst and sock_create_server_socket were moved,
and are used where appropriate.
Let's finish the cleanup and get rid of all bind and server keywords
parsers from proto_uxst.c. They're now moved to cfgparse-unix.c. Now
proto_uxst.c is clean and only contains code related to binding and
connecting.
Let's continue the cleanup and get rid of all bind and server keywords
parsers from proto_tcp.c. They're now moved to cfgparse-tcp.c, just as
was done for ssl before 2.2 release. Nothing has changed beyond this.
Now proto_tcp.c is clean and only contains code related to binding and
connecting.
Let's continue the cleanup and get rid of all sample fetch functions
from proto_tcp.c. They're now moved to tcp_sample.c, just as was done
for ssl before 2.2 release. Nothing has changed beyond this.
This is totally ugly, smp_fetch_src() is exported only so that stick_table.c
can (ab)use it in the {sc,src}_* sample fetch functions. It could be argued
that the sample could have been reconstructed there in place, but we don't
even need to duplicate the code. We'd rather simply retrieve the "src"
fetch's function from where it's used at init time and be done with it.
The file proto_tcp.c has become a real mess because it still contains
tons of definitions that have nothing to do with the TCP protocol setup.
This commit moves the ruleset actions "set-src-port", "set-dst-port",
"set-src", "set-dst", and "silent-drop" to a new file "tcp_act.c".
Nothing has changed beyond this.
get_old_sockets() mistakenly sets ret=0 instead of ret2=0 before leaving
when the old process announces zero FDs. So it will return an error
instead of success. It must be particularly rare not to have a
single socket to offer, though!
A few comments were added to make it more obvious what to expect in
return.
This must be backported to 1.8 since the bug has always been there.
Now we don't limit ourselves to listeners found in proxies or peers
anymore, we're instead scanning all known FDs for those marked with
".exported=1". Just doing so has significantly simplified the code,
and will later allow to yield while sending FDs if desired.
When it comes to retrieving a possible namespace name or interface
name, for now this is only performed on listeners since these are the
only ones carrying such info. Once this moves somewhere else, we'll
be able to also pass this info for UDP receivers for example, with
only tiny changes.
This new flag will be used to mark FDs that must be passed to any future
process across the CLI's "_getsocks" command.
The scheme here is quite complex and full of special cases:
- FDs inherited from parent processes are *not* exported this way, as
they are supposed to instead be passed by the master process itself
across reloads. However such FDs ought never to be paused otherwise
this would disrupt the socket in the parent process as well;
- FDs resulting from a "bind" performed over a socket pair, which are
in fact one side of a socket pair passed inside another control socket
pair must not be passed either. Since all of them are used the same
way, for now it's enough never to put this "exported" flag to FDs
bound by the socketpair code.
- FDs belonging to temporary listeners (e.g. a passive FTP data port)
must not be passed either. Fortunately we don't have such FDs yet.
- the rest of the listeners for now are made of TCP, UNIX stream, ABNS
sockets and are exportable, so they get the flag.
- UDP listeners were wrongly created as listeners and are not suitable
here. Their FDs should be passed but for now they are not since the
client doesn't even distinguish the SO_TYPE of the retrieved sockets.
In addition, it's important to keep in mind that:
- inherited FDs may never be closed in master process but may be closed
in worker processes if the service is shut down (useless since still
bound, but technically possible) ;
- inherited FDs may not be disabled ;
- exported FDs may be disabled because the caller will perform the
subsequent listen() on them. However that might not work for all OSes
- exported FDs may be closed, it just means the service was shut down
from the worker, and will be rebound in the new process. This implies
that we have to disable exported on close().
=> as such, contrary to an apparently obvious equivalence, the "exported"
status doesn't imply anything regarding the ability to close a
listener's FD or not.
This essentially undoes what we did in fd.c in 1.8 to support seamless
reload. Since we don't need to remove an fd anymore we can turn
fd_delete() to the simple function it used to be.
We used to require fd_remove() to remove an FD from a poller when we
still had the FD cache and it was not possible to directly act on the
pollers. Nowadays we don't need this anymore as the pollers will
automatically unregister disabled FDs. The fd_remove() hack is
particularly problematic because it additionally hides the FD from
the known FD list and could make one think it's closed.
It's used at two places:
- with the async SSL engine
- with the listeners (when unbinding from an fd for another process)
Let's just use fd_stop_both() instead, which will propagate down the
stack to do the right thing, without removing the FD from the array
of known ones.
Now when dumping FDs using "show fd" on a process which still knows some
of the other workers' FDs, the FD will properly be listed with a listener
state equal to "ZOM" for "zombie". This guarantees that the FD is still
known and will properly be passed using _getsocks().
The fix 7df5c2d ("BUG/MEDIUM: ssl: fix ssl_bind_conf double free") was
not complete. The problem still occurs when using wildcards in
certificates, during the deinit.
This patch removes the free of the ssl_conf structure in
ssl_sock_free_all_ctx() since it's already done in the crtlist deinit.
It must be backported in 2.2.
During a reload operation, we used to send listener options associated
with each passed file descriptor. These were passed as binary contents
for the size of the "options" field in the struct listener. This means
that any flag value change or field size change would be problematic,
the former failing to properly grab certain options, the latter possibly
causing permanent failures during this operation.
Since these two previous commits:
MINOR: reload: determine the foreing binding status from the socket
BUG/MINOR: reload: detect the OS's v6only status before choosing an old socket
we don't need this anymore as the values are determined from the file
descriptor itself.
Let's just turn the previous 32 bits into vestigial space, send them as
zeroes and ignore them on receipt. The only possible side effect is if
someone would want to roll back from a 2.3 to 2.2 or earlier, such options
might be ignored during this reload. But other forthcoming changes might
make this fail as well anyway so that's not a reason for keeping this
behavior.
Let's not look at the listener options passed by the original process
and determine from the socket itself whether it is configured for
transparent mode or not. This is cleaner and safer, and doesn't rely
on flag values that could possibly change between versions.
The v4v6 and v6only options are passed as data during the socket transfer
between processes so that the new process can decide whether it wants to
reuse a socket or not. But this actually misses one point: if no such option
is set and the OS defaults are changed between the reloads, then the socket
will still be inherited and will never be rebound using the new options.
This can be seen by starting the following config:
global
stats socket /tmp/haproxy.sock level admin expose-fd listeners
frontend testme
bind :::1234
timeout client 2000ms
Having a look at the OS settings, v6only is disabled:
$ cat /proc/sys/net/ipv6/bindv6only
0
A first check shows it's indeed bound to v4 and v6:
$ ss -an -6|grep 1234
tcp LISTEN 0 2035 *:1234 *:*
Reloading the process doesn't change anything (which is expected). Now let's set
bindv6only:
$ echo 1 | sudo tee /proc/sys/net/ipv6/bindv6only
1
$ cat /proc/sys/net/ipv6/bindv6only
1
Reloading gives the same state:
$ ss -an -6|grep 1234
tcp LISTEN 0 2035 *:1234 *:*
However a restart properly shows a correct bind:
$ ss -an -6|grep 1234
tcp LISTEN 0 2035 [::]:1234 [::]:*
This one doesn't change once bindv6only is reset, for the same reason.
This patch attacks this problem differently. Instead of passing the two
options at once for each listening fd, it ignores the options and reads
the socket's current state for the IPV6_V6ONLY flag and sets it only.
Then before looking for a compatible FD, it checks the OS's defaults
before deciding which of the v4v6 and v6only needs to be kept on the
listener. And the selection is only made on this.
First, it addresses this issue. Second, it also ensures that if such
options are changed between reloads to identical states, the socket
can still be inherited. For example adding v4v6 when bindv6only is not
set will allow the socket to still be usable. Third, it avoids an
undesired dependency on the LI_O_* bit values between processes across
a reload (for these ones at least).
It might make sense to backport this to some recent stable versions, but
quite frankly the likelihood that anyone will ever notice it is extremely
faint.
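The socket's effective state can be read back along these lines
(hypothetical helper name):

#include <netinet/in.h>
#include <sys/socket.h>

/* returns the socket's effective IPV6_V6ONLY state, or -1 on error */
static int sock_get_v6only(int fd)
{
    int val = 0;
    socklen_t len = sizeof(val);

    if (getsockopt(fd, IPPROTO_IPV6, IPV6_V6ONLY, &val, &len) < 0)
        return -1;
    return !!val;
}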
If a socket was already bound (inherited from a parent or retrieved from
a previous process), there's no point trying to change its IPV6_V6ONLY
state since it will fail. This is visible in strace as an EINVAL during
a reload when passing FDs.
The use of Common Name is fading out in favor of the RFC recommended
way of using SAN extensions. For example, Chrome from version 58
will only match server name against SAN.
The following patch adds a SAN extension by default to all generated certificates.
The SAN extension will be of type DNS and based on the server name.
haproxy supports generating SSL certificates based on SNI using a provided
CA signing certificate. Because CA certificates may be signed by multiple
CAs, in some scenarios, it is necessary for the server to attach the trust chain
in addition to the generated certificate.
The following patch adds the ability to serve the entire trust chain with
the generated certificate. The chain is loaded from the provided
`ca-sign-file` PEM file.
As reported in issue #816, when building task.o at -O1 with gcc 4.7 or
4.8, we get the following warning:
CC src/task.o
In file included from include/haproxy/proxy.h:31:0,
from include/haproxy/cfgparse.h:27,
from src/task.c:19:
src/task.c: In function 'next_timer_expiry':
include/haproxy/ticks.h:121:10: warning: 'key' may be used uninitialized in this function [-Wmaybe-uninitialized]
src/task.c:349:2: note: 'key' was declared here
It is wrong since the condition to use 'key' is exactly the same as
the one used to set it. This warning disappears at -O2 and disappeared
from gcc 5 and above. Let's just initialize 'key' there, it only adds
16 bytes of code and remains cheap enough for this function.
This should be backported to 2.2.
As reported in https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=24745,
haproxy fails to build with TARGET=generic and without extra options due
to auxv.h not being included, since the __GLIBC__ macro is not yet defined.
Let's include it after other libc headers so that the __GLIBC__ definition
is known. Thanks to David and Tim for the diag.
This should be backported to 2.2.
Using a duplicate cache name most likely is the result of a misgenerated
configuration. There is no good reason to allow this, as the duplicate
caches can't be referred to.
This commit resolves GitHub issue #820.
It can be argued whether this is a fix for a bug or not. I'm erring on the
side of caution and marking this as a "new feature". It can be considered for
backporting to 2.2, but for other branches the risk of accidentally breaking
some working (but non-ideal) configuration might be too large.
When the cache name is left out in 'filter cache' the error message refers
to a missing '<id>'. The name of the cache is called 'name' within the docs.
Adjust the error message for consistency.
The error message was introduced in 99a17a2d91.
This commit first appeared in 1.9, thus the patch must be backported to 1.9+.
The negative filters, which are supposed to exclude a SNI from a
wildcard, never worked. Indeed, the negative filters were skipped in the
code.
To fix the issue, this patch looks for negative filters that are on the
same line as the wildcard that just matched.
This patch should fix issue #818. It must be backported in 2.2. The
problem also exists in versions > 1.8 but the infrastructure required to
fix this was only introduced in 2.1. In older versions we should
probably change the documentation to state that negative filters are
useless.
In bug #810, SNIs are not matched correctly: when trying to match a
certificate type in ssl_sock_switchctx_cbk(), not all SNIs were looked
up correctly.
In the case you have in a crt-list:
wildcard.subdomain.domain.tld.pem.rsa *.subdomain.domain.tld record.subdomain.domain.tld
record.subdomain.domain.tld.pem.ecdsa record.subdomain.domain.tld another-record.subdomain.domain.tld
If the client only supports RSA and requests
"another-record.subdomain.domain.tld", HAProxy will find the single
ECDSA certificate and won't try to look up for a wildcard RSA
certificate.
This patch fixes the code so that we look for all single and wildcard
certificates before choosing the certificate type.
This bug was introduced by commit 3777e3a ("BUG/MINOR: ssl: certificate
choice can be unexpected with openssl >= 1.1.1").
It must be backported as far as 1.8 once it is heavily tested.
When a regex has been successfully compiled by the JIT pass, it is better
to use the related match function, which thankfully has the same signature,
for better performance.
Signed-off-by: David Carlier <devnexen@gmail.com>
In bug #781 it was reported that HAProxy completes the certificate chain
using the verify store in the case there is no chain.
Indeed, according to OpenSSL documentation, when generating the chain,
OpenSSL uses the chain store OR the verify store in the case there is no
chain store.
As a workaround, this patch always puts a NULL chain in the SSL_CTX so
OpenSSL does not try to complete it.
This must be backported in all branches, the code could be different,
the important part is to ALWAYS set a chain, and to use sk_X509_new_null()
if the chain is NULL.
It is possible to process a channel based on desynchronized info if a
request fetch is called from a response and conversely. However, the
code in smp_prefetch_htx() already makes sure the analysis has
started before trying to fetch from a buffer, so the problem effectively
lies in response rules making use of request expressions only.
Usually it's not a problem as extracted data are checked against the
current HTTP state, except when it comes to the start line, which is
usually accessed directly from sample fetch functions such as status,
path, url, url32, query and so on. In this case, trying to access the
request buffer from the response path will lead to unpredictable
results. When building with DEBUG_STRICT, a process violating these
rules will simply die after emitting:
FATAL: bug condition "htx->first == -1" matched at src/http_htx.c:67
But when this is not enabled, it may or may not crash depending on what
the pending request buffer data look like when trying to spot a start
line there. This is typically what happens in issue #806.
This patch adds a test in smp_prefetch_htx() so that it does not try
to parse an HTX buffer in a channel belonging to the wrong direction.
There's one special case for the "method" sample fetch since it can
retrieve info even without a buffer, from the other direction, as
long as the method is one of the well-known ones. Thus, we call
smp_prefetch_htx() only if needed.
This was reported in 2.0 and must be backported there (oldest stable
version with HTX).
smp_fetch_ssl_x_chain_der() uses the SSL_get_peer_cert_chain() which
does not increment the refcount of the chain, so it should not be freed.
The bug was introduced by a598b50 ("MINOR: ssl: add ssl_{c,s}_chain_der
fetch methods"). No backport needed.
The reports for health states are checked using memcmp() in order to
only focus on the first word and possibly ignore trailing %d/%d etc.
This makes gcc unhappy about a potential use of "" as the string, which
never happens since the string is always set. This resulted in commit
c4e6460f6 ("MINOR: build: Disable -Wstringop-overflow.") to silence
these messages. However some lengths are incorrect (though cannot cause
trouble), and in the end strncmp() is just safer and cleaner.
This can be backported to all stable branches as it will shut a warning
with gcc 8 and above.
In commit f187ce6, the ssl-skip-self-issued-ca option was accidentally
made useless by reverting the SSL_CTX reworking.
The previous attempt at implementing this feature was to put each certificate
of the chain in the SSL_CTX with SSL_CTX_add_extra_chain_cert() and to skip
the Root CA.
The problem here is that doing it this way instead of doing a
SSL_CTX_set1_chain() breaks the support of the multi-certificate bundles.
The SSL_CTX_build_cert_chain() function allows one to remove the Root CA
with the SSL_BUILD_CHAIN_FLAG_NO_ROOT flag. Use it instead of doing
tricks with the CA.
Should fix issue #804.
Must be backported in 2.2.
Following work from Arjen and Mathilde, it adds ssl_{c,s}_chain_der
methods; it returns DER encoded certs from SSL_get_peer_cert_chain
Also update existing vtc tests to add random intermediate certificates
When getting the result through this header:
http-response add-header x-ssl-chain-der %[ssl_c_chain_der,hex]
One can parse it with any lib accepting ASN.1 DER data, such as in go:
bin, err := hex.DecodeString(cert)               // import "encoding/hex"
certs_parsed, err := x509.ParseCertificates(bin) // import "crypto/x509"
Cc: Arjen Nienhuis <arjen@zorgdoc.nl>
Signed-off-by: Mathilde Gilles <m.gilles@criteo.com>
Signed-off-by: William Dauchy <w.dauchy@criteo.com>
Free the snapshots on deinit() when they were initialized in a proxy
upon an error.
This was introduced by c55015e ("MEDIUM: snapshots: dynamically allocate
the snapshots").
Should be backported as far as 1.9.
This way, all fields of the buffer structure are reset when a string argument
(ARGT_STR) is released. It is also a good way to explicitly specify this kind
of argument is a chunk, so the .data and .size fields must be set.
This patch may be backported to ease future backports.
It means the regsub() converter is now exported to the lua. Map converters based
on regex are not available because the map arguments are not supported.
Thanks to previous commits, it is now safe to use from lua the sample fetches
and sample converters that convert arguments, especially the strings
(ARGT_STR). So now, they are all exported to the lua. They were filtered on
the validation functions: only fetches without validation functions or with
the val_hdr or val_payload_lv functions were exported, and converters without
validation functions.
This patch depends on following commits :
* aec27ef44 "BUG/MINOR: lua: Duplicate lua strings in sample fetches/converters arg array"
* fdea1b631 "MINOR: hlua: Don't needlessly copy lua strings in trash during args validation"
It must be backported as far as 2.1 because, since commit ae6f125c7 ("MINOR:
sample: add us/ms support to date/http_date"), the date() and http_date()
converters are no longer exported, due to the filter on the validation
function.
Strings in the argument array used by sample fetches and converters must be
duplicated. This is mandatory because, during the arguments validations, these
strings may be converted and released. It works this way during the
configuration parsing and there is no reason to adapt this behavior during the
runtime when a sample fetch or a sample converter is called from the lua. In
fact, there is a reason to not change the behavior: it must remain simple for
everyone to add new fetches or converters.
Thus, lua strings are duplicated. It is only performed at the end of the
hlua_lua2arg_check() function, if the argument is still a ARGT_STR. Of course,
it requires a cleanup loop after the call or when an error is triggered.
This patch depends on following commits:
* 959171376 "BUG/MINOR: arg: Fix leaks during arguments validation for fetches/converters"
* fdea1b631 "MINOR: hlua: Don't needlessly copy lua strings in trash during args validation"
It may be backported to all supported versions, most probably as far as 2.1
only.
Lua strings are NULL terminated. So in the hlua_lua2arg_check() function, used
to check arguments against the sample fetches specification, there is no reason
to copy these strings in a trash to add a terminating null byte.
In addition, when the array of arguments is built from lua values, we must take
care to count this terminating null byte in the size of the buffer where a
string is stored. The same must be done when a sample is built from a lua value.
This patch may be backported to ease future backports.
In hlua_lua2arg_check() function, before converting an argument to an IPv4 or
IPv6 mask, we must be sure to have an integer or a string argument (ARGT_SINT or
ARGT_STR).
This patch must be backported to all supported versions.
In hlua_lua2arg_check() function, before converting a string to an IP address,
we must be sure to have a string argument (ARGT_STR).
This patch must be backported to all supported versions.
Some sample fetches or sample converters use validation functions for their
arguments. In these functions, string arguments (ARGT_STR) may be converted to
another type (for instance a regex, a variable or an integer). Because these
strings are allocated when the argument list is built, they must be freed after
a conversion. Most of the time, it is done. But not always. This patch fixes
these minor memory leaks (only on a few strings, during the configuration
parsing).
This patch may be backported to all supported versions, most probably as far as
2.1 only. If this commit is backported, the previous one 73292e9e6 ("BUG/MINOR:
lua: Duplicate map name to load it when a new Map object is created") must also
be backported. Note that some validation functions do not exist on older
versions. It should be easy to resolve conflicts.
When a new map is created, the sample_load_map() function is called. To do so,
an argument array is created with the name as first argument. Because it is a
lua string, owned by the lua, it must be duplicated. The sample_load_map()
function will convert this argument to a map. In theory, after the conversion,
it must release the original string. It is not performed for now and it is a bug
that will be fixed in the next commit.
This patch may be backported to all supported versions, most probably as far as
2.1 only. But it must be backported with the next commit "BUG/MINOR: arg: Fix
leaks during arguments validation for fetches/converters".
The debug() converter uses a string to reference the sink where to send debug
events. During the configuration parsing, this string is converted to a sink
object but it is still stored as a string argument. It is a problem on deinit
because string arguments are released. So the sink pointer will be released
twice.
To fix the bug, we keep a reference on the sink using an ARGT_PTR argument. This
way, it will not be freed on the deinit.
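A sketch of the idea, with illustrative field names (ARGT_PTR comes from the
commit mentioned below):
    /* swap the released string for an opaque pointer on the sink,
     * which deinit will not try to free as a string
     */
    free(arg->data.str.area);
    arg->type = ARGT_PTR;
    arg->data.ptr = sink;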
This patch depends on the commit e02fc4d0d ("MINOR: arg: Add an argument type to
keep a reference on opaque data"). Both must be backported as far as 2.1.
In sample_load_map() function, the global mode is now tested to be sure to be in
the starting mode. If not, an error is returned.
At first glance, this patch may seem useless because maps are loaded during the
configuration parsing. But in fact, it is possible to load a map from the lua,
using the Map:new() method. And there is nothing to forbid calling this method
at runtime, during a script execution. It must never be done because it may
perform a filesystem access for unknown maps or an allocation for known
ones. So at runtime, it means a blocking call or a memory leak. Note it is
still possible to load a map from the lua, but in the global part of a script
only. This part is executed during the configuration parsing.
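The guard amounts to something like this sketch (global.mode and
MODE_STARTING are the existing HAProxy globals):
    if (!(global.mode & MODE_STARTING)) {
        memprintf(err, "map may only be loaded during startup");
        return 0;
    }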
This patch must be backported in all stable versions.
This bug affects all versions of HAProxy since the OCSP data is not freed
in the deinit(), but leaking on exit() is not really an issue. However,
when doing dynamic updates of certificates over the CLI, those data are
not freed upon the free of the SSL_CTX.
3 leaks are happening. The first leak is the one of the ocsp_arg
structure which serves the purpose of containing the pointers in the
case of a multi-certificate bundle. The second leak is the one of the
ocsp struct. And the third leak is the one of the struct buffer in the
ocsp_struct.
The problem lies with SSL_CTX_set_tlsext_status_arg() which does not
provide a way to free the argument upon an SSL_CTX_free().
This fix uses ex index functions instead of registering a
tlsext_status_arg(). This is really convenient because it allows
registering a free callback which will free the ex index content upon
SSL_CTX_free().
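A sketch of the OpenSSL side of this, using the standard ex-data API
(the callback body is illustrative):
    static void ssl_sock_free_ocsp_arg(void *parent, void *ptr,
                                       CRYPTO_EX_DATA *ad, int idx,
                                       long argl, void *argp)
    {
        /* drop the reference(s) this SSL_CTX holds on the ocsp data */
    }
    int idx = SSL_CTX_get_ex_new_index(0, NULL, NULL, NULL,
                                       ssl_sock_free_ocsp_arg);
    SSL_CTX_set_ex_data(ctx, idx, ocsp_arg);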
A refcount was also added to the ocsp_response structure since it is
stored in a tree and can be reused in another SSL_CTX.
Should fix part of the issue #746.
This must be backported in 2.2 and 2.1.
Fix a memory leak when loading an OCSP file when the file was already
loaded elsewhere in the configuration.
Indeed, if the OCSP file already exists, a useless chunk_dup() will be
done during the load.
To fix it we revert "ocsp" to "iocsp" like it was done previously.
This was introduced by commit 246c024 ("MINOR: ssl: load the ocsp
in/from the ckch").
Should fix part of the issue #746.
It must be backported in 2.1 and 2.2.
A regression was introduced by 13a9232ebc
when I added support for the Additional section of the SRV responses.
Basically, when a server is managed through SRV records additional
section and it's disabled (because its associated Additional record has
disappeared), it never leaves its MAINT state and so never comes back to
production.
This patch updates the "snr_update_srv_status()" function to clear the
MAINT status when the server now has an IP address and also ensure this
function is called when parsing Additional records (and associating them
to new servers).
This can cause severe outages for people using HAProxy + consul (or any
other service registry) through DNS service discovery.
This should fix issue #793.
This should be backported to 2.2.
The H1 multiplexer is able to perform synchronous sends. When a large body is
transferred, if nothing is received and if no error or shutdown occurs, it is
possible to not go down to the H1 connection level to do I/O for a long
time. When this happens, we must still take care to refresh the H1 connection
timeout. Otherwise it is possible to hit the connection timeout during the
transfer while it should not expire.
This bug exists because only h1_process() refreshes the H1 connection timeout.
To fix the bug, h1_snd_buf() must also refresh this timeout. To make things
more readable, a dedicated function has been introduced and called to refresh
the timeout.
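The helper boils down to something like this sketch (using HAProxy's usual
tick helpers; field names are illustrative):
    static void h1_refresh_timeout(struct h1c *h1c)
    {
        if (h1c->task) {
            h1c->task->expire = tick_add(now_ms, h1c->timeout);
            task_queue(h1c->task);
        }
    }
Both h1_process() and h1_snd_buf() then call it, so a long synchronous send
keeps refreshing the connection timeout.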
This bug exists on all HTX versions. But it is harder to hit it on 2.1 and below
because when a H1 mux is initialized, we actively try to read data instead of
subscribing for receiving. So there is at least one call to h1_process().
This patch should fix the issue #790. It must be backported as far as 2.0.
Check the return of the calloc in ssl_sock_load_ocsp() which could lead
to a NULL dereference.
This was introduced by commit be2774d ("MEDIUM: ssl: Added support for
Multi-Cert OCSP Stapling").
Could be backported as far as 1.7.
There's no point trying to perform a recv() on a back connection if we
have a stream before having sent a request, as it's expected to fail.
It's likely that this may avoid some spurious subscribe() calls in some
keep-alive cases (the close case was already addressed at the connection
level by "MINOR: connection: avoid a useless recvfrom() on outgoing
connections").
When a connect() doesn't immediately succeed (i.e. most of the times),
fd_cant_send() is called to enable polling. But given that we don't
mark that we cannot receive either, we end up performing a failed
recvfrom() immediately when the connect() is finally confirmed, as
indicated in issue #253.
This patch simply adds fd_cant_recv() as well so that we're only
notified once the recv path is ready. The reason it was not there
is purely historic, as in the past when there was the fd cache,
doing it would have caused a pending recv request to be placed into
the fd cache, hence a useless recvfrom() upon success (i.e. what
happens now).
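The change is essentially this (a sketch of the pending-connect path;
variable names are illustrative):
    if (connect(fd, (struct sockaddr *)addr, addr_len) == -1 &&
        errno == EINPROGRESS) {
        fd_cant_send(fd);
        fd_cant_recv(fd); /* new: wait for readiness before any recv */
    }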
Without this patch, forwarding 100k connections does this:
% time seconds usecs/call calls errors syscall
------ ----------- ----------- --------- --------- ----------------
17.51 0.704229 7 100000 100000 connect
16.75 0.673875 3 200000 sendto
16.24 0.653222 3 200036 close
10.82 0.435082 1 300000 100000 recvfrom
10.37 0.417266 1 300012 setsockopt
7.12 0.286511 1 199954 epoll_ctl
6.80 0.273447 2 100000 shutdown
5.34 0.214942 2 100005 socket
4.65 0.187137 1 105002 5002 accept4
3.35 0.134757 1 100004 fcntl
0.61 0.024585 4 5858 epoll_wait
With the patch:
% time seconds usecs/call calls errors syscall
------ ----------- ----------- --------- --------- ----------------
18.04 0.697365 6 100000 100000 connect
17.40 0.672471 3 200000 sendto
17.03 0.658134 3 200036 close
10.57 0.408459 1 300012 setsockopt
7.69 0.297270 1 200000 recvfrom
7.32 0.282934 1 199922 epoll_ctl
7.09 0.274027 2 100000 shutdown
5.59 0.216041 2 100005 socket
4.87 0.188352 1 104697 4697 accept4
3.35 0.129641 1 100004 fcntl
0.65 0.024959 4 5337 1 epoll_wait
Note the total disappearance of 1/3 of failed recvfrom() *without*
adding any extra syscall anywhere else.
The trace of an HTTP health check is now totally clean, with no useless
syscall at all anymore:
09:14:21.959255 connect(9, {sa_family=AF_INET, sin_port=htons(8000), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress)
09:14:21.959292 epoll_ctl(4, EPOLL_CTL_ADD, 9, {EPOLLIN|EPOLLOUT|EPOLLRDHUP, {u32=9, u64=9}}) = 0
09:14:21.959315 epoll_wait(4, [{EPOLLOUT, {u32=9, u64=9}}], 200, 1000) = 1
09:14:21.959376 sendto(9, "OPTIONS / HTTP/1.0\r\ncontent-leng"..., 41, MSG_DONTWAIT|MSG_NOSIGNAL, NULL, 0) = 41
09:14:21.959436 epoll_wait(4, [{EPOLLOUT, {u32=9, u64=9}}], 200, 1000) = 1
09:14:21.959456 epoll_ctl(4, EPOLL_CTL_MOD, 9, {EPOLLIN|EPOLLRDHUP, {u32=9, u64=9}}) = 0
09:14:21.959512 epoll_wait(4, [{EPOLLIN|EPOLLRDHUP, {u32=9, u64=9}}], 200, 1000) = 1
09:14:21.959548 recvfrom(9, "HTTP/1.0 200\r\nContent-length: 0\r"..., 16320, 0, NULL, NULL) = 126
09:14:21.959570 close(9) = 0
With the edge-triggered poller, it gets even better:
09:29:15.776201 connect(9, {sa_family=AF_INET, sin_port=htons(8000), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress)
09:29:15.776256 epoll_ctl(4, EPOLL_CTL_ADD, 9, {EPOLLIN|EPOLLOUT|EPOLLRDHUP|EPOLLET, {u32=9, u64=9}}) = 0
09:29:15.776287 epoll_wait(4, [{EPOLLOUT, {u32=9, u64=9}}], 200, 1000) = 1
09:29:15.776320 sendto(9, "OPTIONS / HTTP/1.0\r\ncontent-leng"..., 41, MSG_DONTWAIT|MSG_NOSIGNAL, NULL, 0) = 41
09:29:15.776374 epoll_wait(4, [{EPOLLIN|EPOLLOUT|EPOLLRDHUP, {u32=9, u64=9}}], 200, 1000) = 1
09:29:15.776406 recvfrom(9, "HTTP/1.0 200\r\nContent-length: 0\r"..., 16320, 0, NULL, NULL) = 126
09:29:15.776434 close(9) = 0
It could make sense to backport this patch to 2.2 and maybe 2.1 after
it has been sufficiently checked for absence of side effects in 2.3-dev,
as some people had reported an extra overhead like in issue #168.
Similarly to the issue described in commit "BUG/MEDIUM: backend: always
attach the transport before installing the mux", in tcpcheck_eval_connect()
we can install a handshake transport layer underneath the mux and replace
its subscriptions, causing a crash if the mux had already subscribed for
whatever reason.
A simple reproducer consists in adding fd_cant_recv() after fd_cant_send()
in tcp_connect_server() and running it on this config, as discussed in issue
listen foo
bind :8181
mode http
option httpchk
server srv1 127.0.0.1:8888 send-proxy-v2 check inter 1000
The mux may only be installed *after* xprt_handshake is set up, so that
it registers against it and not against raw_sock or ssl_sock. This needs
to be backported to 2.2 which is the first version using muxes for checks.
In connect_server(), we can enter a stupid situation:
- conn_install_mux_be() is called to install the mux. This one
subscribes for receiving and quits ;
- then we discover that a handshake is required on the connection
(e.g. send-proxy), so xprt_add_hs() is called and subscribes as
well.
- we crash in conn_subscribe() which gets a different subscriber.
And if BUG_ON is disabled, we'd likely lose one event.
Note that it doesn't seem to happen by default, but definitely does
if connect() rightfully performs fd_cant_recv(), so it's a matter of
who does what and in what order.
A simple reproducer consists in adding fd_cant_recv() after fd_cant_send()
in tcp_connect_server() and running it on this config, as discussed in issue
listen foo
bind :8181
mode http
server srv1 127.0.0.1:8888 send-proxy-v2
The root cause is that xprt_add_hs() installs an xprt layer underneath
the mux without taking over its subscriptions. Ideally if we want to
support this, we'd need to steal the connection's wait_events and
replace them by new ones. But there doesn't seem to be any case where
we're interested in doing this so better simply always install the
transport layer before installing the mux, that's safer and simpler.
This needs to be backported to 2.1 which is constructed the same way
and thus suffers from the same issue, though the code is slightly
different there.
This bug was introduced by the commit 8f587ea3 ("MEDIUM: lua: Set the analyse
expiration date with smaller wake_time only"). At the end of hlua_action(), the
lua context may be null if the alloc failed.
No backport needed, this is 2.3-dev.
In si_cs_send() and si_cs_recv(), we explicitly test that the connection's mux
is defined before proceeding. For si_cs_recv(), it is probably a bit
overkill. But opportunistic sends are possible from the moment the server
connection is created. So it is safer to do this test.
This patch may be backported as far as 1.9 if necessary.
In the connect_server() function, there is an optim to install the mux as soon
as possible. It is possible if we can determine the mux to use from the
configuration only. For instance if the mux is explicitly specified or if no ALPN
is set. This patch adds a new condition to preinstall the mux for non-SSL
connections. In this case, by default, we always use the mux_pt for raw
connections and the mux-h1 for HTTP ones.
This patch is related to the issue #762. It may be backported to 2.2 (and
possibly as far as 1.9 if necessary).
Sometimes, a server connection may be established synchronously. Most of the
time on TCP sockets, it does not happen; it is easier to get a synchronous
connect with a unix socket. When it happens, if we are not waiting for any
handshake completion, we must be sure to have a mux installed before leaving
the connect_server() function, because an attempt to send may be made before
the I/O connection handler has a chance to be executed to install the mux, if
not already done. For now, it is not expected to perform a send with no mux
installed, leading to a crash if it happens.
This patch should fix the issue #762 and probably #779 too. It must be
backported as far as 1.9.
If a lua action yields for any reason and if the wake timeout is set, it only
overrides the analyse expiration date if it is smaller. This way, a lower
inspect-delay will be respected, if any.
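With HAProxy's tick helpers, the logic reduces to something like this
sketch (field names are illustrative):
    /* only lower the expiration date, never extend it */
    if (tick_isset(hlua->wake_time))
        s->req.analyse_exp = tick_first(s->req.analyse_exp,
                                        hlua->wake_time);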
A dedicated expiration date is now used to apply the inspect-delay of the
tcp-request or tcp-response rulesets. Before, the analyse expiration date was
used, but it may also be updated by the lua (at least). So a lua script could
extend or reduce the inspect-delay by side effect. This is not expected. If it
becomes necessary, a specific function will be added to do this. Because, for
now, it is a bit confusing.
On a tcp-response content ruleset evaluation, the inspect-delay is engaged when
the rule's conditions are not validated, but not when the rule's action yields.
This patch must be backported to all supported versions.
When a tcp-request or a tcp-response content ruleset evaluation is aborted, the
corresponding FLT_END analyser must be preserved, if any. But because of a
typo, on the tcp-response content ruleset evaluation, we try to preserve the
request analyser instead of the response one.
This patch must be backported to 2.2.
On a final evaluation of a tcp-request or tcp-response content ruleset, it is
forbidden for an action to yield. To quickly identify bugs, an internal error
is now returned if it happens and a warning log message is emitted.
A Lua action may yield. It may happen because the action returns explicitly
act.YIELD or because the script itself yields. In the first case, we must
abort the script execution if it is the final rule evaluation, i.e. if the
ACT_OPT_FINAL flag is set. The second case is already covered.
This patch must be backported to 2.2.
When an action is evaluated, flags are passed to know if it is the first call
(ACT_OPT_FIRST) and if it must be the last one (ACT_OPT_FINAL). For the
do-resolve DNS action, the ACT_OPT_FINAL flag must be handled because the
action may yield. It must never yield when this flag is set. Otherwise, it may
lead to a wakeup loop of the stream because the inspect-delay of a tcp-request
content ruleset was reached without stopping the rules evaluation.
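In the action callback this amounts to a guard like the following sketch
(the condition is illustrative):
    if (!resolution_is_complete()) {
        if (flags & ACT_OPT_FINAL)
            return ACT_RET_CONT; /* final evaluation: never yield here */
        return ACT_RET_YIELD;    /* wait for the resolver to answer */
    }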
This patch is related to the issue #222. It must be backported as far as 2.0.
On Lua 5.4, some API changes make HAProxy compilation fail. Among other
things, the lua_resume() function has changed and now takes an extra argument
in Lua 5.4, and the error LUA_ERRGCMM was removed. Thus the LUA_VERSION_NUM
macro is now tested to know which Lua version is used and adapt the code
accordingly.
Here are listed the incompatibilities with the previous Lua versions :
http://www.lua.org/manual/5.4/manual.html#8
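For instance, the lua_resume() change is handled with a guard like this
sketch (Lua 5.4 added an output parameter for the number of results):
    #if LUA_VERSION_NUM >= 504
        int nres;
        ret = lua_resume(L, from, nargs, &nres);
    #else
        ret = lua_resume(L, from, nargs);
    #endif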
This patch comes from HAProxy's Fedora RPM, committed by Tom Callaway:
https://src.fedoraproject.org/rpms/haproxy/blob/db970613/f/haproxy-2.2.0-lua-5.4.patch
This patch should fix the issue #730. It must be backported to 2.2 and probably
as far as 2.0.
Support for DNS Service Discovery by means of SRV records was enhanced with
commit 13a9232eb ("MEDIUM: dns: use Additional records from SRV responses")
to use the content of the answers Additional records when present.
If there are Authority records before the Additional records, we mistakenly
treat that as an invalid response. To fix this, just ignore the Authority
section if it exists and skip to the Additional records.
As 13a9232eb was introduced during 2.2-dev, it must be backported to 2.2.
This is a fix for issue #778
Since commit 13a9232eb ("MEDIUM: dns: use Additional records from SRV
responses"), a struct server can have a NULL dns_requester->resolution,
when SRV records are used and DNS answers contain an Additional section.
This is a problem when we call snr_update_srv_status() because it does
not check that resolution is NULL, and dereferences it. This patch
simply adds a test for resolution being NULL. When that happens, it means
we are using SRV records with Additional records, and an entry was removed.
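The added test is essentially the following sketch (names are illustrative):
    if (!s->dns_requester || !s->dns_requester->resolution)
        return 1; /* SRV + Additional records: the entry was removed */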
This should fix issue #775.
This should be backported to 2.2.
When the watchdog is fired because of the lua, the stack of the corresponding
lua context is dumped. But we must be sure the lua context is fully initialized
to do so. If we are blocked on the global lua lock during the lua context
initialization, the lua stack may be NULL.
This patch should fix the issue #776. It must be backported as far as 2.0.
uClibc toolchains built with no dynamic library support don't provide
the dlfcn.h header. That leads to build failure:
CC src/tools.o
src/tools.c:15:10: fatal error: dlfcn.h: No such file or directory
#include <dlfcn.h>
^~~~~~~~~
Enable dladdr on Linux platforms only when USE_DL is defined.
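The include then becomes conditional, along these lines:
    #if defined(__ELF__) && defined(USE_DL)
    #include <dlfcn.h>
    #endif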
This should be backported wherever 109201fc5 ("BUILD: tools: rely on
__ELF__ not USE_DL to enable use of dladdr()") is backported (currently
only 2.2 and 2.1).
In the CGI/1.1 specification, it is specified that the QUERY_STRING must not
be url-decoded. However, this parameter was sent decoded because it was
extracted after the URI's path decoding. Now, the query-string is first
extracted, then the script part of the path is url-decoded. This way, the
QUERY_STRING parameter is no longer decoded.
This patch should fix the issue #769. It must be backported as far as 2.1.
A workaround for some difficulties encountered to anticipate end of
messages was addressed by commit 810df0614 ("MEDIUM: htx: Add a flag on
a HTX message when no more data are expected"), but there were 3 issues
in it (with minor impact):
- the flag was mistakenly set before an EOH in Lua, which would only
cause incomplete packets to be emitted for now but could cause
truncated responses in the future. It's not needed to add it on
the next EOM block as http_forward_proxy_resp() already does it.
- one was still missing in hlua_applet_http_fct(), possibly causing
delays on Lua services
- one was missing in the Prometheus exporter.
All this simply shows that this mechanism is still quite fragile and
not trivial to use, especially in order to deal with the impossibility
to append the EOM, so we'll need to improve the solution in the future
and future backports should not be completely ruled out.
This fix must be backported where the patch above is backported,
typically 2.1 and later as it was required for a set of fixes.
The previous leak on do-resolve was particularly tricky to check due
to the important code repetition in dns_validate_dns_response() which
required careful examination of all return statements to check whether
they needed a pool_free() or not. Let's clean all this up using a common
leave point which releases the element itself. This also encourages
to properly set the current response to null right after freeing or
adding it so that it doesn't get added. 45 return and 22 pool_free()
were replaced by one of each.
This flag is set by HTTP analyzers to notify that more data are expected. It is
used to know if the CO_SFL_MSG_MORE flag must be set on the connection when data
are sent. Historically, it was set on chunked messages and on compressed
responses. But in HTX, the chunked messages are parsed by the H1 multiplexer. So
for this case, the infinite forwarding is enabled and the flag must no longer be
set. For the compression, the test must be extended and be applied on all data
filters. Thus it is also true for the request channel.
So, now, the CF_EXPECT_MORE flag is set on a request or a response channel if
there is at least one data filter attached to the stream. In addition, the flag
is removed when the HTTP message analysis is finished.
This patch should partially fix the issue #756. It must be backported to 2.1.
In HTX, if the HTX_FL_EOI flag is set on the message, we don't set the
CO_SFL_MSG_MORE flag on the connection. This way, the send is not delayed if
only the EOM is missing in the HTX message.
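In the mux's send path this is roughly a one-line test (sketch, variable
names illustrative):
    /* delay the send only while more data are still expected */
    if (!(htx->flags & HTX_FL_EOI))
        send_flags |= CO_SFL_MSG_MORE;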
This patch depends on the commit "MEDIUM: htx: Add a flag on a HTX message when
no more data are expected".
This patch should partially fix the issue #756. It must be backported to
2.1. For earlier versions, CO_SFL_MSG_MORE is ignored by HTX muxes.
The HTX_FL_EOI flag must now be set on an HTX message when no more data are
expected. Most of the time, it must be set before adding the EOM block. Thus, if
there is no space for the EOM, there is still the information that all data
were received and pushed into the HTX message. There is only an exception for the
HTTP replies (deny, return...). For these messages, the flag is set after all
blocks are pushed in the message, including the EOM block, because, on error,
we remove all inserted data.
When a DNS resolution is freed, the remaining items in .ar_list and .answer_list
are also released. It must be done to avoid a memory leak. And it is the last
chance to release these objects. I've honestly no idea if there is a better
place to release them earlier. But at least, there is no more leak.
This patch should solve the issue #222. It must be backported, at least, as far
as 2.0, and probably, with caution, as far as 1.8 or 1.7.
The do-resolve HTTP action, performing a DNS resolution of a sample expression
output, is not thread-safe at all. The resolver object used to do the resolution
must be locked when the action is executed or when the stream is released
because its curr or wait resolution lists and the requester list inside a
resolution are updated. It is also important to not wake up a released stream
(with a destroyed task).
Of course, because of this bug, various kind of crashes may be observed.
This patch should fix the issue #236. It must be backported as far as 2.0.
This aims at catching calls to task_unlink_wq() performed by the wrong
thread based on the shared status for the task, as well as calls to
__task_queue() with the wrong timer queue being used based on the task's
capabilities. This will at least help eliminate some hypothesis during
debugging sessions when suspecting that a wrong thread has attempted to
queue a task at the wrong place.
A bug was introduced by commit 77015abe0 ("MEDIUM: tasks: clean up the
front side of the wait queue in wake_expired_tasks()"): front tasks
that are not yet expired were incorrectly requeued into the local
wait queue instead of the global one. Because of this, the same task
could be found by the same thread on next invocation and be unlinked
without locking, allowing another thread to requeue it in parallel,
and conversely another thread could unlink it while the task was being
walked over, causing all sorts of crashes and endless loops in
wake_expired_tasks() and affiliates.
This bug can easily be triggered by stressing the do_resolve action
in multi-thread (after applying the fixes required to get do_resolve
to work with threads). It certainly is the cause of issue #758.
This must be backported to 2.2 only.
Reported github issue #759 shows there is no name resolving
on server lines for ring and peers sections.
This patch introduces the resolving for those lines.
This patch adds a boolean parameter to the parse_server function to specify
whether we want the function to perform an initial name resolving using the
libc. This boolean is forced to true in case of peers or ring sections.
The boolean is kept false in case of classic servers (from backend/listen).
This patch should be backported in branches where peers sections
support 'server' lines.
Before commit 80b53ffb1 ("MEDIUM: arg: make make_arg_list() stop after
its own arguments"), consumers of arguments would measure the length of
the string between the first opening and closing parenthesis before
calling make_arg_list(), and this latter one would detect an empty string
early by len==0 and would not allocate an argument list.
Since that commit, this has changed a bit because the argument parser
is now the one in charge of delimiting the argument string, so the early
test cannot be used anymore. But the argument list is still allocated,
and despite the number of arguments being returned, consumers do not
necessarily rely on it but instead they rely on the non-null arg_p
pointer that used to be allocated only if at least one argument was
present. But as it's now always allocated, the first argument always
carries the first argument's type with an empty value, which confuses
all functions that take a unique optional argument (such as uuid()).
The proper long term solution would be to always use the returned argument
count, but at least we can make sure the function always returns an empty
argument list when fed with an empty set of parenthesis, as it always used
to do. This is what this patch does.
This fix must be backported to 2.2 and fixes github issue #763. Thanks to
Luke Seelenbinder for reporting the problem.
Since commit ad37c7ab ("BUILD: config: address build warning on
raspbian+rpi4") gcc 7.3.0 complains again on x86_64 (while 8.2.0
does not) :
src/cfgparse.c: In function 'check_config_validity':
src/cfgparse.c:3593:26: warning: argument 1 range [18446744071562067968, 18446744073709551615] exceeds maximum object size 9223372036854775807 [-Walloc-size-larger-than=]
newsrv->idle_conns = calloc(global.nbthread, sizeof(*newsrv->idle_conns));
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This thing is completely bogus (actually the RPi one was the most wrong).
Let's try to shut them both by using an unsigned short for the cast which
is expected to satisfy everyone. It's worth noting that the exact same call
a few lines above and below do not trigger this stupid warning.
This should be backported to 2.2 since the fix above was put there already.
In run_tasks_from_task_list() we may free some tasks that have been
killed. Before doing so we unlink them from the wait queue. But if such
a task is in the global wait queue, the queue isn't locked so this can
result in corrupting the global task list and causing loops or crashes.
It's very likely one cause of issue #758.
This must be backported to 2.2. For 2.1 there doesn't seem to be any
case where a task could be freed this way while in the global queue,
but it doesn't cost much to apply the same change (the code is in
process_runnable_task there).
Issue #747 reports that building on raspbian for rpi4 triggers this
warning:
src/cfgparse.c: In function 'check_config_validity':
src/cfgparse.c:3584:26: warning: argument 1 range [2147483648, 4294967295] exceeds maximum object size 2147483647 [-Walloc-size-larger-than=]
newsrv->idle_conns = calloc((unsigned)global.nbthread, sizeof(*newsrv->idle_conns));
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
It's surprising because the declared type is size_t and the argument is
unsigned (i.e. the same type on 32-bit) precisely to avoid cast issues,
but gcc seems to be too smart at this one and to issue a warning over
the valid range, implying that passing the originally required type would
also warn. Given that these are the only casts in calloc and other ones
don't complain, let's drop them.
All 3 were added by commit dc2f2753e ("MEDIUM: servers: Split the
connections into idle, safe, and available.") that went into 2.2, so
this should be backported.
The CF_SHUTW_NOW flag must be handled the same way as the CF_SHUTW flag in
the co_getblk_nc() and co_getline_nc() functions. It is especially important
when we try to peek a line from outgoing data. In this case, an unfinished
line is blocked and nothing is peeked if the CF_SHUTW_NOW flag is set. But the
blocked data prevent the transition to CF_SHUTW.
The above functions are only used by LUA cosockets. Because of this bug, we
may experience wakeups in a loop of the cosocket's io handler if we try to
read a line on a closed socket with a pending unfinished line (no LF found at
the end).
This patch should fix issue #744. It must be backported to all supported
versions.
Previous fix dc6e8a9a7 ("BUG/MEDIUM: server: resolve state file handle
leak on reload") traded a bug for another one, now we get this warning
when building server.c, which is valid since f is not necessarily
initialized (e.g. if no global state file is passed):
src/server.c: In function 'apply_server_state':
src/server.c:3272:3: warning: 'f' may be used uninitialized in this function [-Wmaybe-uninitialized]
fclose(f);
^~~~~~~~~
Let's initialize it first. This whole code block should really be
split, cleaned up and reorganized as it's possible that other
similar bugs are hidden in it.
This must be backported to the same branches the commit above is
backported to (likely 2.2 and 2.1).
During reload, the server state file is read and its file handle is not
released. This was independently reported in #738 and #660.
Partially resolves #660. This should be backported to 2.2 and 2.1.
When the loop is continued early, the memory for param_rule is not freed. This
can leak memory per request, which will eventually consume all available memory
on the server.
This patch should fix the issue #750. It must be backported as far as 2.1.
This patch adds a global counter of received syslog messages
and this one is exported on CLI "show info" as "CumRecvLogs".
This patch also updates internal conn counter and freq
of the listener and the proxy for each received log message to
prepare a further export on the "show stats".
Log forwarding:
It is possible to declare one or multiple log forwarding sections;
haproxy will forward all received log messages to a list of log servers.
log-forward <name>
Creates a new log forwarder proxy identified as <name>.
bind <addr> [param*]
Used to configure a log udp listener to receive messages to forward.
Only udp listeners are allowed, the address must be prefixed using
'udp@', 'udp4@' or 'udp6@'. This supports all "bind" parameters found
in paragraph 5.1, but most of them are irrelevant for the udp/syslog case.
log global
log <address> [len <length>] [format <format>] [sample <ranges>:<smp_size>]
<facility> [<level> [<minlevel>]]
Used to configure target log servers. See more details on proxies
documentation.
If no format is specified, haproxy tries to keep the incoming log format.
The configured facility is ignored, except if the incoming message does
not present a facility but one is mandatory in the outgoing format.
If there is no timestamp available in the input format, but the field
exists in output format, haproxy will use the local date.
Example:
global
log stderr format iso local7
ring myring
description "My local buffer"
format rfc5424
maxlen 1200
size 32764
timeout connect 5s
timeout server 10s
# syslog tcp server
server mysyslogsrv 127.0.0.1:514 log-proto octet-count
log-forward sylog-loadb
bind udp4@127.0.0.1:1514
# all messages on stderr
log global
# all messages on local tcp syslog server
log ring@myring local0
# load balance messages on 4 udp syslog servers
log 127.0.0.1:10001 sample 1:4 local0
log 127.0.0.1:10002 sample 2:4 local0
log 127.0.0.1:10003 sample 3:4 local0
log 127.0.0.1:10004 sample 4:4 local0
This patch introduces a new fd handler used to parse syslog
messages on udp.
The parsing function returns the level, facility and metadata that
can be immediately reused to forward the message to a log server.
This handler is enabled on udp listeners if the proxy is internally set
to mode PR_MODE_SYSLOG.
This patch merges the message-building code between sink and log
and introduces a new API based on a struct ist array to
prepare message headers with zero copy, targeting the
log forwarding feature.
Log formats 'iso' and 'timed' are now available on log lines.
A new log format 'priority' is also added.
This patch introduces proto_udp.c targeting further support of
the log forwarding feature.
This code was originally produced by Frederic Lecaille working on
QUIC support and only minimal requirements for syslog support
have been merged.
A boolean was mistakenly declared 'static THREAD_LOCAL', causing
the decision to send a log to a 'not sampled' log server to be
conditioned by the last 'sampled log' server test evaluated on the
same thread.
This results in unpredictable drops of logs on 'not sampled'
log servers as soon as a 'sampled' log server is declared.
This patch removes the static THREAD_LOCAL attribute from this
boolean, fixing the issue and allowing to mix 'sampled' and
'not sampled' servers.
This fix should be backported in any branches which includes
the log sampling feature.
Commit 08016ab82 ("MEDIUM: connection: Add private connections
synchronously in session server list") introduced a build warning about
a potential null dereference which is actually true: in case a reuse
fails and we fail to allocate a new connection, we could crash. The
issue was already present earlier but the compiler couldn't detect
it since it was guarded by an independent condition.
This should be carefully backported to older versions (at least 2.2
and maybe 2.1), the change consists in only adding a test on srv_conn.
The whole sequence of "if" blocks is ugly there and would deserve being
cleaned up so that the !srv_conn condition is matched ASAP and the
assignment is done later. This would remove complicated conditions.
In fcgi_strm_handle_empty_stdout(), the FCGI_SF_ES_RCVD flag is set on "->state"
stream field instead of "->flags". It is obviously wrong. This bug is not
noticeable because the right state is set in the fcgi_process_demux() function a
bit later.
This patch must be backported as far as 2.1.
When the padding of a "stream" record (STDOUT or STDERR) is skipped, we must set
the connection state to RECORD_P. It is especially important if the padding is
not fully received.
This patch must be backported as far as 2.1.
As mentioned in the FastCGI specification, FCGI "streams" are series of
non-empty stream records (length != 0), followed by an empty one. It is properly
handled for FCGI_STDOUT records, but not for FCGI_STDERR ones. If an empty
FCGI_STDERR record is received, the connection is blocked waiting for data which
will never come.
To fix the bug, when an empty FCGI_STDERR record is received, we drop it, eating
the padding if any.
This patch should fix the issue #743. It must be backported as far as 2.1.
The following sample fetches have been added :
* srv_iweight : returns the initial server's weight
* srv_uweight : returns the user-visible server's weight
* srv_weight : returns the current (or effective) server's weight
The requested server must be passed as argument, optionally preceded by the
backend name. For instance :
srv_weight(back-http/www1)
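As a hypothetical usage example, mirroring the header trick shown earlier
for ssl_c_chain_der:
    http-response set-header x-srv-weight %[srv_weight(back-http/www1)]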
In the continuity of the commit 7cf0e4517 ("MINOR: raw_sock: report global
traffic statistics"), we are now able to report the global number of bytes
emitted using the splicing. It can be retrieved in "show info" output on the
CLI.
Note this counter is always declared, regardless of splicing support. This
eases the integration with monitoring tools plugged on the CLI.
When input data are processed, if the request is switched to tunnel mode on a
protocol upgrade, we must continue the processing. Otherwise, pending input
data will only be processed on the next wakeup, i.e. when new input data are
received, on a timeout expiration or on a shutdown. Worse, if the input buffer
is full when it happens, only a timeout or a shutdown will unblock the
situation.
This patch should fix the issue #737. It must be backported as far as 1.9. The
bug does not seem to affect the 2.0 and 1.9 because, on a protocol upgrade, the
request is switched in tunnel mode when the response is sent to the client. But
the bug is present, so the backport remains necessary.
The srv_use_idle_conn() function is now responsible for updating the server
counters and the connection flags when an idle connection is reused. The same
function is called when a new connection is created. This simplifies a bit the
connect_server() function.
When a new connection is created, its target is always set just after. So the
connection target may be set when it is created instead, during its
initialisation to be precise. It is the purpose of this patch. Now, the
conn_new() function is called with the connection target as parameter. The
target is then passed to conn_init(). It means the target must be passed when
cs_new() is called. In this case, the target is only used when the conn-stream
is created with no connection. This only happens for tcpchecks for now.
The session_get_conn() function must now be used to look for an available
connection matching a specific target for a given session. This simplifies a
bit the connect_server() function.
When a connection is marked as private, it is now added to the session server
list. We no longer wait for a stream to be detached from the mux to do
so. When the connection is created, this happens after the mux
creation. Otherwise, it is performed when the connection is marked as private.
To allow that, when a connection is created, the session is systematically set
as the connection owner. Thus, a backend connection always has an owner during
its creation. And a private connection always has an owner until its death.
Note that outside the detach() callback, if the call to session_add_conn()
failed, the error is ignored. In this situation, we retry to add the connection
into the session server list in the detach() callback. If this fails at this
step, the multiplexer is destroyed and the connection is closed.
To set a connection as private, the conn_set_private() function must now be
called. It sets the CO_FL_PRIVATE flag, but it also removes the connection
from the available connection list, if necessary. For now, it never happens
because only HTTP/1 connections may be set as private after their
creation. And these connections are never inserted in the available connection
list.
When a new connection is created, it may immediately be set as private if
http-reuse never is configured for the backend. There is no reason to wait for
the call to mux->detach() to do so.
If an expression is configured to set the SNI on a server connection, the
connection is marked as private. To not needlessly add it in the available
connection list when the mux is installed, the SNI is now set on the connection
before installing the mux, just after the call to si_connect().
When a stream is detached from a backend private connection, we must not insert
it in the available connection list. In addition, we must be sure to remove it
from this list. To ensure it is properly performed, this part has been slightly
refactored to clearly split processing of private connections from the others.
This patch should probably be backported to 2.2.
A bug in task_kill() was fixed by commit 54d31170a ("BUG/MAJOR: sched:
make sure task_kill() always queues the task") which added a list
initialization before adding an element. But in fact an unconditional
addition would have done the same and been simpler than first
initializing then checking the element was initialized. Let's use
MT_LIST_ADDQ() there to add the task to kill into the shared queue
and kill the dirty LIST_INIT().
When a connection is added to an idle list, it's already detached and
cannot be seen by two threads at once, so there's no point using
TRY_ADDQ, there will never be any conflict. Let's just use the cheaper
ADDQ.
The TRY_ADDQ there was not needed since the wait list is exclusively
owned by the caller. There's a preliminary test on MT_LIST_ADDED()
that might have been eliminated by keeping MT_LIST_TRY_ADDQ() but
it would have required two more expensive writes before testing so
better keep the test the way it is.
Initially when mt_lists were added, their purpose was to be used with
the scheduler, where anyone may concurrently add the same tasklet, so
it sounded natural to implement a check in MT_LIST_ADD{,Q}. Later their
usage was extended and MT_LIST_ADD{,Q} started to be used on situations
where the element to be added was exclusively owned by the one performing
the operation so a conflict was impossible. This became more obvious with
the idle connections and the new macro was called MT_LIST_ADDQ_NOCHECK.
But this remains confusing and at many places it's not expected that
an MT_LIST_ADD could possibly fail, and worse, at some places we start
by initializing it before adding (and the test is superfluous), so let's
rename them to something more conventional to denote the presence of the
check or not:
MT_LIST_ADD{,Q} : unconditional operation, the caller owns the
element, and doesn't care about the element's
current state (exactly like LIST_ADD)
MT_LIST_TRY_ADD{,Q}: only perform the operation if the element is not
already added or in the process of being added.
This means that the previously "safe" MT_LIST_ADD{,Q} are not "safe"
anymore. This also means that in case of backport mistakes in the
future causing this to be overlooked, the slower and safer functions
will still be used by default.
Note that the missing unchecked MT_LIST_ADD macro was added.
The rest of the code will have to be reviewed so that a number of
callers of MT_LIST_TRY_ADDQ are changed to MT_LIST_ADDQ to remove
the unneeded test.
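The intended usage split looks like this sketch:
    /* the caller exclusively owns elt: unconditional, like LIST_ADDQ */
    MT_LIST_ADDQ(&queue, &elt->list);

    /* concurrent adders possible: only add if not already queued */
    if (!MT_LIST_TRY_ADDQ(&queue, &elt->list))
        ; /* already in a list, nothing to do */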
Previous commit b24bc0d ("MINOR: tcp: Support TCP keepalive parameters
customization") broke non-Linux builds as TCP_KEEP{CNT,IDLE,INTVL} are
not necessarily defined elsewhere.
This patch adds the required #ifdefs to condition the visibility of the
keywords, and adds a mention in the doc about their dependency on Linux.
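On Linux the three parameters map to the classic socket options, roughly
as in this sketch:
    #ifdef TCP_KEEPCNT
        setsockopt(fd, IPPROTO_TCP, TCP_KEEPCNT, &cnt, sizeof(cnt));
    #endif
    #ifdef TCP_KEEPIDLE
        setsockopt(fd, IPPROTO_TCP, TCP_KEEPIDLE, &idle, sizeof(idle));
    #endif
    #ifdef TCP_KEEPINTVL
        setsockopt(fd, IPPROTO_TCP, TCP_KEEPINTVL, &intvl, sizeof(intvl));
    #endif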
It is now possible to customize TCP keepalive parameters.
These correspond to the socket options TCP_KEEPCNT, TCP_KEEPIDLE, TCP_KEEPINTVL
and are valid for the defaults, listen, frontend and backend sections.
This patch fixes GitHub issue #670.
Compiling HAProxy with USE_LUA=1 and running a configuration check within
valgrind with a very simple configuration such as:
listen foo
bind *:8080
Will report quite a few possible leaks afterwards:
==24048== LEAK SUMMARY:
==24048== definitely lost: 0 bytes in 0 blocks
==24048== indirectly lost: 0 bytes in 0 blocks
==24048== possibly lost: 95,513 bytes in 1,209 blocks
==24048== still reachable: 329,960 bytes in 71 blocks
==24048== suppressed: 0 bytes in 0 blocks
Printing these possible leaks shows that all of them are caused by Lua.
Luckily Lua makes it *very* easy to free all used memory, so let's do
this on shutdown.
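The cleanup itself is a single call on the Lua state (sketch, L standing
for HAProxy's global Lua state):
    lua_close(L); /* frees every object the Lua VM still holds */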
After this patch is applied the output looks much better:
==24199== LEAK SUMMARY:
==24199== definitely lost: 0 bytes in 0 blocks
==24199== indirectly lost: 0 bytes in 0 blocks
==24199== possibly lost: 0 bytes in 0 blocks
==24199== still reachable: 329,960 bytes in 71 blocks
==24199== suppressed: 0 bytes in 0 blocks
Given the following example configuration:
listen foo
mode http
bind *:8080
http-request set-var(txn.leak) meth(GET)
server x example.com:80
Running a configuration check with valgrind reports:
==25992== 4 bytes in 1 blocks are definitely lost in loss record 1 of 344
==25992== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==25992== by 0x4E239D: my_strndup (tools.c:2261)
==25992== by 0x581E20: make_arg_list (arg.c:253)
==25992== by 0x4DE91D: sample_parse_expr (sample.c:890)
==25992== by 0x58E304: parse_store (vars.c:772)
==25992== by 0x566A3F: parse_http_req_cond (http_rules.c:95)
==25992== by 0x4A4CE6: cfg_parse_listen (cfgparse-listen.c:1339)
==25992== by 0x494C59: readcfgfile (cfgparse.c:2049)
==25992== by 0x545145: init (haproxy.c:2029)
==25992== by 0x421E42: main (haproxy.c:3175)
After this patch is applied the leak is gone as expected.
This is a fairly minor leak, but it can add up for many uses of the `meth()`
sample fetch. The bug most likely exists since the `meth()` sample fetch was
introduced in commit cc103299c7. The fix may
be backported to HAProxy 1.6+.
Given the following example configuration:
listen foo
mode http
bind *:8080
http-request set-var(txn.leak) bool(1)
server x example.com:80
Running a configuration check with valgrind reports:
==24233== 2 bytes in 1 blocks are definitely lost in loss record 1 of 345
==24233== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==24233== by 0x4E238D: my_strndup (tools.c:2261)
==24233== by 0x581E10: make_arg_list (arg.c:253)
==24233== by 0x4DE90D: sample_parse_expr (sample.c:890)
==24233== by 0x58E2F4: parse_store (vars.c:772)
==24233== by 0x566A2F: parse_http_req_cond (http_rules.c:95)
==24233== by 0x4A4CE6: cfg_parse_listen (cfgparse-listen.c:1339)
==24233== by 0x494C59: readcfgfile (cfgparse.c:2049)
==24233== by 0x545135: init (haproxy.c:2029)
==24233== by 0x421E42: main (haproxy.c:3175)
After this patch is applied the leak is gone as expected.
This is a fairly minor leak, but it can add up for many uses of the `bool()`
sample fetch. The bug most likely exists since the `bool()` sample fetch was
introduced in commit cc103299c7. The fix may
be backported to HAProxy 1.6+.
Given the following example configuration:
backend foo
mode http
use-server %[str(x)] if { always_true }
server x example.com:80
Running a configuration check with valgrind reports:
==19376== 170 (40 direct, 130 indirect) bytes in 1 blocks are definitely lost in loss record 281 of 347
==19376== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==19376== by 0x5091AC: add_sample_to_logformat_list (log.c:511)
==19376== by 0x50A5A6: parse_logformat_string (log.c:671)
==19376== by 0x4957F2: check_config_validity (cfgparse.c:2588)
==19376== by 0x54442D: init (haproxy.c:2129)
==19376== by 0x421E42: main (haproxy.c:3169)
After this patch is applied the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
Given the following example configuration:
backend foo
mode http
use-server x if { always_true }
server x example.com:80
Running a configuration check with valgrind reports:
==18650== 14 bytes in 1 blocks are definitely lost in loss record 3 of 345
==18650== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==18650== by 0x649E489: strdup (strdup.c:42)
==18650== by 0x4A5438: cfg_parse_listen (cfgparse-listen.c:1548)
==18650== by 0x494C59: readcfgfile (cfgparse.c:2049)
==18650== by 0x5450B5: init (haproxy.c:2029)
==18650== by 0x421E42: main (haproxy.c:3168)
After this patch is applied the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
Given the following example configuration:
frontend foo
mode http
bind *:8080
unique-id-header x
Running a configuration check with valgrind reports:
==17621== 2 bytes in 1 blocks are definitely lost in loss record 1 of 341
==17621== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==17621== by 0x649E489: strdup (strdup.c:42)
==17621== by 0x4A87F1: cfg_parse_listen (cfgparse-listen.c:2747)
==17621== by 0x494C59: readcfgfile (cfgparse.c:2049)
==17621== by 0x545095: init (haproxy.c:2029)
==17621== by 0x421E42: main (haproxy.c:3167)
After this patch is applied the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
Given the following example configuration:
resolvers test
nameserver test 127.0.0.1:53
listen foo
bind *:8080
server foo example.com resolvers test
Running a configuration check within valgrind reports:
==21995== 5 bytes in 1 blocks are definitely lost in loss record 1 of 30
==21995== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==21995== by 0x5726489: strdup (strdup.c:42)
==21995== by 0x4B2CFB: parse_server (server.c:2163)
==21995== by 0x4680C1: cfg_parse_listen (cfgparse-listen.c:534)
==21995== by 0x459E33: readcfgfile (cfgparse.c:2167)
==21995== by 0x50778D: init (haproxy.c:2021)
==21995== by 0x418262: main (haproxy.c:3133)
==21995==
==21995== 12 bytes in 1 blocks are definitely lost in loss record 3 of 30
==21995== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==21995== by 0x5726489: strdup (strdup.c:42)
==21995== by 0x4AC666: srv_prepare_for_resolution (server.c:1606)
==21995== by 0x4B2EBD: parse_server (server.c:2081)
==21995== by 0x4680C1: cfg_parse_listen (cfgparse-listen.c:534)
==21995== by 0x459E33: readcfgfile (cfgparse.c:2167)
==21995== by 0x50778D: init (haproxy.c:2021)
==21995== by 0x418262: main (haproxy.c:3133)
with one more leak unrelated to `struct server`. After applying this
patch the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
Given the following example configuration:
frontend foo
mode http
bind *:8080
unique-id-format x
Running a configuration check with valgrind reports:
==30712== 42 (40 direct, 2 indirect) bytes in 1 blocks are definitely lost in loss record 18 of 39
==30712== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==30712== by 0x4ED7E9: add_to_logformat_list (log.c:462)
==30712== by 0x4EEE28: parse_logformat_string (log.c:720)
==30712== by 0x47B09A: check_config_validity (cfgparse.c:3046)
==30712== by 0x52881D: init (haproxy.c:2121)
==30712== by 0x41F382: main (haproxy.c:3126)
After this patch is applied the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
Instead of just calling release_sample_arg(conv_expr->arg_p) we also must
free() the conv_expr itself (after removing it from the list).
Given the following example configuration:
frontend foo
bind *:8080
mode http
http-request set-var(txn.foo) str(bar)
acl is_match str(foo),strcmp(txn.hash) -m bool
Running a configuration check within valgrind reports:
==1431== 32 bytes in 1 blocks are definitely lost in loss record 20 of 43
==1431== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==1431== by 0x4C39B5: sample_parse_expr (sample.c:982)
==1431== by 0x56B410: parse_acl_expr (acl.c:319)
==1431== by 0x56BA7F: parse_acl (acl.c:697)
==1431== by 0x48D225: cfg_parse_listen (cfgparse-listen.c:816)
==1431== by 0x4797C3: readcfgfile (cfgparse.c:2167)
==1431== by 0x52943D: init (haproxy.c:2021)
==1431== by 0x41F382: main (haproxy.c:3133)
After this patch is applied the leak is gone as expected.
This is a fairly minor leak that can only be observed if samples need to be
freed, which is not something that should occur during normal processing and
most likely only during shut down. Thus no backport should be needed.
Instead of simply calling free() on expr->smp->arg_p in certain cases,
properly free the sample using release_sample_expr().
Given the following example configuration:
frontend foo
bind *:8080
mode http
http-request set-var(txn.foo) str(bar)
acl is_match str(foo),strcmp(txn.hash) -m bool
Running a configuration check within valgrind reports:
==31371== 160 (48 direct, 112 indirect) bytes in 1 blocks are definitely lost in loss record 35 of 45
==31371== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==31371== by 0x4C3832: sample_parse_expr (sample.c:876)
==31371== by 0x56B3E0: parse_acl_expr (acl.c:319)
==31371== by 0x56BA4F: parse_acl (acl.c:697)
==31371== by 0x48D225: cfg_parse_listen (cfgparse-listen.c:816)
==31371== by 0x4797C3: readcfgfile (cfgparse.c:2167)
==31371== by 0x5293ED: init (haproxy.c:2021)
==31371== by 0x41F382: main (haproxy.c:3126)
After this patch this leak is reduced. It will be fully removed in a
follow up patch:
==32503== 32 bytes in 1 blocks are definitely lost in loss record 20 of 43
==32503== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==32503== by 0x4C39B5: sample_parse_expr (sample.c:982)
==32503== by 0x56B410: parse_acl_expr (acl.c:319)
==32503== by 0x56BA7F: parse_acl (acl.c:697)
==32503== by 0x48D225: cfg_parse_listen (cfgparse-listen.c:816)
==32503== by 0x4797C3: readcfgfile (cfgparse.c:2167)
==32503== by 0x52943D: init (haproxy.c:2021)
==32503== by 0x41F382: main (haproxy.c:3133)
This is a fairly minor leak that can only be observed if ACLs need to be
freed, which is not something that should occur during normal processing
and most likely only during shut down. Thus no backport should be needed.
When the multiplexer creation is delayed until after the handshakes phase, the
connection is added to the available connection list if http-reuse is not set
to never for the backend. But this is a wrong statement. At this step, the
connection is not safe because it is a new connection. So it must be added to
the available connection list only if http-reuse always is used.
No backport needed, this is 2.2-dev.
When a connection is created and the multiplexer is installed, if the connection
is marked as private, don't consider it as available, regardless of the number
of available streams. This test is performed when the mux is installed when the
connection is created, in connect_server(), and when the mux is installed after
the handshakes stage.
No backport needed, this is 2.2-dev.
When a connection is picked from the session server list because the proxy or
the session are marked to use the last requested server, if it is idle, we
must mark it as used, removing the CO_FL_SESS_IDLE flag and decrementing the
session's idle_conns counter.
This patch must be backported as far as 1.9.
Trace messages have been added when the CS_FL_MAY_SPLICE flag is set or unset
and when the splicing is really enabled for the H1 connection.
This patch may be backpored to 2.1 to ease debugging.
The CS_FL_MAY_SPLICE flag must be unset for the conn-stream if a read0 is
received while reading on the kernel pipe. It is mandatory when some data were
also received. Otherwise, this flag prevents the call to the h1 rcv_buf()
callback. Thus the read0 will never be handled by the h1 multiplexer, leading
to a freeze of the session until a timeout is reached.
This patch must be backported to 2.1 and 2.0.
In h1_rcv_buf(), the splicing is systematically disabled if it was previously
enabled. But when h1_rcv_buf() is called while splicing is still enabled, it
means the channel's buffer was empty before the call. Thus, the only reason to
disable the splicing at this step is when some input data have just been
processed.
This patch may be backported to 2.1 and 2.0.
In h1_rcv_pipe(), if the mux is unable to receive data, for instance because
the multiplexer is blocked on input waiting for the other side (BUSY mode), no
receive must be performed.
This patch must be backported to 2.1 and 2.0.
In the commit 17ccd1a35 ("BUG/MEDIUM: connection: add a mux flag to indicate
splice usability"), the CS_FL_MAY_SPLICE flag was added to notify the upper
layer that the mux is able to use splicing. But this was only done for the
payload in a message, in HTTP_MSG_DATA state, while splicing is also possible
in TUNNEL mode, in HTTP_MSG_TUNNEL state. In addition, the splicing ability is
always disabled for chunked messages.
This patch must be backported to 2.1 and 2.0.
Since the commit cd0d2ed6e ("MEDIUM: log-format: make the LF parser aware of
sample expressions' end"), the LF_STEXPR label in the last switch-case statement
at the end of the for loop in the parse_logformat_string() function cannot be
reached anymore.
This patch should fix issue #723.
Add a check on the conn pointer to avoid a NULL dereference in
smp_fetch_ssl_x_keylog().
The problem is not supposed to happen because the function is only used
for the frontend at the moment.
Introduced by 7d42ef5, 2.2 only.
Fix issue #733.
OpenSSL 1.1.1 provides a callback registering function,
SSL_CTX_set_keylog_callback, which allows one to receive a string
containing the keys to decipher TLSv1.3.
Unfortunately it is not possible to store this data in binary form and
we can only get this information using the callback. Which means that we
need to store it until the connection is closed.
This patch adds 2 pools: the first one, pool_head_ssl_keylog, is used to
store a struct ssl_keylog which will be inserted as ex_data in an SSL *.
The second one is pool_head_ssl_keylog_str, which will be used to store
the hexadecimal strings.
To enable the capture of the keys, you need to set "tune.ssl.keylog on"
in your configuration.
The following fetches were implemented:
ssl_fc_client_early_traffic_secret,
ssl_fc_client_handshake_traffic_secret,
ssl_fc_server_handshake_traffic_secret,
ssl_fc_client_traffic_secret_0,
ssl_fc_server_traffic_secret_0,
ssl_fc_exporter_secret,
ssl_fc_early_exporter_secret
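As a rough illustration of the OpenSSL 1.1.1 API mentioned above (not
HAProxy's actual implementation; the callback body is a placeholder):

    #include <openssl/ssl.h>

    /* Called by OpenSSL once per secret, with a line formatted as
     * "<LABEL> <client_random_hex> <secret_hex>". */
    static void keylog_cb(const SSL *ssl, const char *line)
    {
        (void)ssl;
        (void)line;
        /* A real implementation would copy <line> into per-connection
         * storage (e.g. via SSL_set_ex_data()) for later retrieval by
         * the sample fetches listed above. */
    }

    static void setup_keylog(SSL_CTX *ctx)
    {
        SSL_CTX_set_keylog_callback(ctx, keylog_cb);
    }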
NetBSD apparently uses macros for tolower/toupper and complains about
the use of char for array subscripts. Let's properly cast all of them
to unsigned char where they are used.
This is needed to fix issue #729.
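The usual form of this fix, shown for illustration (a standard C
requirement, not something specific to NetBSD):

    #include <ctype.h>

    /* Passing a plain (possibly signed) char to tolower() is undefined
     * for negative values; casting to unsigned char is the portable
     * idiom, and also silences NetBSD's warning since its macros index
     * a table with the argument. */
    int lower = tolower((unsigned char)*p);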
Originally it was made to return a void* because some comparisons in the
code where it was used required a lot of casts. But now we don't need
that anymore. And having it non-const breaks the build on NetBSD 9 as
reported in issue #728.
So let's switch to const and adjust debug.c to accommodate this.
A typo was accidentally introduced in commit 48ce6a3 ("BUG/MEDIUM: muxes:
Make sure nobody stole the connection before using it."): a "&" was
placed in front of "OTHER_LOCK", which breaks DEBUG_LOCK. No backport
is needed.
Building on OpenBSD 6.7 with gcc-4.2.1 yields the following warnings
which suggest that the initialization is not taken as expected but
that the container member is reset with each initialization:
src/peers.c: In function 'peer_send_updatemsg':
src/peers.c:1000: warning: initialized field overwritten
src/peers.c:1000: warning: (near initialization for 'p.updt')
src/peers.c:1001: warning: initialized field overwritten
src/peers.c:1001: warning: (near initialization for 'p.updt')
src/peers.c:1002: warning: initialized field overwritten
src/peers.c:1002: warning: (near initialization for 'p.updt')
src/peers.c:1003: warning: initialized field overwritten
src/peers.c:1003: warning: (near initialization for 'p.updt')
src/peers.c:1004: warning: initialized field overwritten
src/peers.c:1004: warning: (near initialization for 'p.updt')
Fixing this is trivial, we just have to initialize one level at
a time.
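For illustration, this is the kind of construct gcc-4.2.1 warns about,
and the fix pattern, using simplified, hypothetical field names:

    struct msg { struct { int a, b; } updt; };

    /* Each designator re-initializes the whole .updt member, so older
     * gcc warns "initialized field overwritten": */
    struct msg p = { .updt.a = 1, .updt.b = 2 };

    /* Initializing one level at a time avoids the warning: */
    struct msg q = { .updt = { .a = 1, .b = 2 } };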
Please refer to commit 19a69b3740 for all the
details. This follow up commit fixes the `http-response capture` case, the
previous one only fixed the `http-request capture` one. The documentation was
already updated and the change to `check_http_res_capture` is identical to
the `check_http_req_capture` change.
This patch must be backported together with 19a69b3740.
Most likely this is 1.6+.
When we takeover a connection, let the xprt layer know. If it has its own
tasklet, and it is already scheduled, then it has to be destroyed, otherwise
it may run the new mux tasklet on the old thread.
Note that we only do this for the ssl xprt for now, because the only other
one that might wake the mux up is the handshake one, which is supposed to
disappear before idle connections exist.
No backport is needed, this is for 2.2.
In the various takeover() methods, make sure we schedule the old tasklet
on the old thread, as we don't want it to run on our own thread! This
was causing a very rare crash when building with DEBUG_STRICT, seeing
that either an FD's thread mask didn't match the thread ID in h1_io_cb(),
or that stream_int_notify() would try to queue a task with the wrong
tid_bit.
In order to reproduce this, it is necessary to maintain many connections
(typically 30k) at a high request rate flowing over H1+SSL between two
proxies, the second of which would randomly reject ~1% of the incoming
connections and randomly kill some idle ones using a very short client
timeout. The request rate must be adjusted so that the CPUs are nearly
saturated, but never reach 100%. It's easier to reproduce this by skipping
local connections and always picking from other threads. The issue
should happen in less than 20s otherwise it's necessary to restart to
reset the idle connections lists.
No backport is needed, takeover() is 2.2 only.
In srv_cleanup_idle_connections(), we compute how many idle connections
are in excess compared to the average need. But we may actually be missing
some, for example if a certain number were recently closed and the average
of used connections didn't change much since previous period. In this
case exceed_conn can become negative. There was no special case for this
in the code, and calculating the per-thread share of connections to kill
based on this value resulted in special value -1 to be passed to
srv_migrate_conns_to_remove(), which for this function means "kill all of
them", as used in srv_cleanup_connections() for example.
This causes large variations of idle connections counts on servers and
CPU spikes at the moment the cleanup task passes. These were quite more
visible with SSL as it costs CPU to close and re-establish these
connections, and it also takes time, reducing the reuse ratio, hence
increasing the amount of connections during reconnection.
In this patch we simply skip the killing loop when this condition is met.
No backport is needed, this is purely 2.2.
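A sketch of the skip described above (simplified; exceed_conn is the
computed excess of idle connections over the estimated need):

    /* A null or negative excess means there is nothing to kill; skip
     * the killing loop instead of passing -1 ("kill all") downstream. */
    if (exceed_conn <= 0)
        goto done;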
The function timeofday_as_iso_us() now adds the trailing local timezone
offset. Doing this, the function can be used directly to generate rfc5424
logs.
It affects content of a ring if the ring's format is set to 'iso' and 'timed'.
Note: the default ring 'buf0' is of type 'timed'.
It is preferable NOT to backport this to stable releases unless bugs are
reported, because while the previous format is not correct and the new
one is correct, there is a small risk to cause inconsistencies in log
format to some users who would not expect such a change in a stable
cycle.
Sadly, the fix from commit 54d31170a ("BUG/MAJOR: sched: make sure
task_kill() always queues the task") broke the builds without DEBUG_STRICT
as, in order to be careful, it placed a BUG_ON() around the previously
failing condition to check for any new possible failure, but this BUG_ON
strips the condition when DEBUG_STRICT is not set. We don't want BUG_ON
to evaluate any condition either as some debugging code calls possibly
expensive ones (e.g. in htx_get_stline). Let's just drop the useless
BUG_ON().
No backport is needed, this is 2.2-dev.
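For illustration, the pitfall is the classic one with assert-like macros
(hypothetical definition, not HAProxy's exact one):

    #include <stdlib.h>

    #ifdef DEBUG_STRICT
    #  define BUG_ON(cond) do { if (cond) abort(); } while (0)
    #else
    #  define BUG_ON(cond) do { } while (0)  /* <cond> never evaluated */
    #endif

    /* Wrapping a required operation in BUG_ON() therefore silently
     * drops it in non-DEBUG_STRICT builds: */
    BUG_ON(do_required_work() != 0);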
As reported in issue #724, openbsd fails to build in haproxy.c
due to a faulty comma in the middle of a warning message. This code
is only compiled when RLIMIT_AS is not defined, which seems to be
rare these days.
This may be backported to older versions as the problem was likely
introduced when strict limits were added.
Commit 69f591e3b ("MINOR: cli/proxy: add a new "show servers conn" command")
added the ability to dump the idle connections state for a server, but we
must not do this if idle connections were not allocated, which happens if
the server is configured with pool-max-conn 0.
This is 2.2, no backport is needed.
In the various timeout functions, make sure nobody stole the connection from
us before attempting to do anything with it: there's a very small race
condition between the time we access the task context and the time we
actually check it again with the lock, where it could have been free'd.
task_kill() may fail to queue a task if this task has never ever run,
because its equivalent (tasklet->list) member has never been "emptied"
since it didn't pass through the LIST_DEL_INIT() that's performed by
run_tasks_from_lists(). This results in these tasks never being freed.
It happens during the mux takeover since the target task usually is
the timeout task which, by definition, has never run yet.
This fixes commit eb8c2c69f ("MEDIUM: sched: implement task_kill() to
kill a task") which was introduced after 2.2-dev11 and doesn't need to
be backported.
Now it's possible to preserve spacing everywhere except in "log-format",
"log-format-sd" and "unique-id-format" directives, where spaces are
delimiters and are merged. That may be useful when the response payload
is specified as a log format string by "lf-file" or "lf-string", or even
for headers or anything else.
In order to merge spaces, a new option LOG_OPT_MERGE_SPACES is applied
exclusively on options passed to function parse_logformat_string().
This patch fixes issue #701 ("http-request return log-format file
evaluation altering spacing of ASCII output/art").
Now when building with -DDEBUG_MEM_STATS, some malloc/calloc/strdup/realloc
stats are kept per file+line number and may be displayed and even reset on
the CLI using "debug dev memstats". This allows to easily track potential
leakers or abnormal usages.
Enables ('on') or disables ('off') sharing of idle connection pools between
threads for the same server. The default is to share them between threads in
order to minimize the number of persistent connections to a server, and to
optimize the connection reuse rate. But to help with debugging or when
suspecting a bug in HAProxy around connection reuse, it can be convenient to
forcefully disable this idle pool sharing between multiple threads by setting
this option to "off". The default is "on".
This could have been nice to have during the idle connections debugging,
but it's not too late to add it!
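The excerpt above does not name the keyword; assuming it documents the
global "tune.idle-pool.shared" setting, usage would look like:

    global
        tune.idle-pool.shared off   # debugging: keep idle connections per-thread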
task_wakeup() passes the task through the global run queue under the
global RQ lock, which is expensive when dealing with large amounts of
fcgi_takeover() calls. Let's use the new task_kill() instead to kill the
task.
task_wakeup() passes the task through the global run queue under the
global RQ lock, which is expensive when dealing with large amounts of
h2_takeover() calls. Let's use the new task_kill() instead to kill the
task.
task_wakeup() passes the task through the global run queue under the
global RQ lock, which is expensive when dealing with large amounts of
h1_takeover() calls. Let's use the new task_kill() instead to kill the
task.
By doing so, a scenario involving approximately 130k takeover/s running on
16 threads gained almost 3% performance from 319k req/s to 328k.
task_kill() may be used by any thread to kill any task with less overhead
than a regular wakeup. In order to achieve this, it bypasses the priority
tree and inserts the task directly into the shared tasklets list, cast as
a tasklet. The task_list_size is updated to make sure it is properly
decremented after execution of this task. The task will thus be picked by
process_runnable_tasks() after checking the tree and sent to the TL_URGENT
list, where it will be processed and killed.
If the task is bound to more than one thread, its first thread will be the
one notified.
If the task was already queued or running, nothing is done, only the flag
is added so that it gets killed before or after execution. Of course it's
the caller's responsibility to make sure any resources allocated by this
task were already cleaned up or taken over.
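A sketch of the intended use in a takeover path (simplified, the task
pointer name is illustrative):

    /* The old task must not survive the takeover; instead of a full
     * task_wakeup() through the global run queue, mark it for
     * destruction and let the scheduler reap it cheaply. */
    task_kill(old_task);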
This flag, when set, will be used to indicate that the task must die.
At the moment this may only be placed by the task itself or by the
scheduler when placing it into the TL_NORMAL queue.
The next thread walking algorithm in commit 566df309c ("MEDIUM:
connections: Attempt to get idle connections from other threads.")
proved to be sufficient for most cases, but it still has some rough
edges when threads are unevenly loaded. If one thread wakes up with
10 streams to process in a burst, it will mainly take over connections
from the next one until that one doesn't have any more.
This patch implements a rotating index that is stored into the server
list and that any thread taking over a connection is responsible for
updating. This way it starts mostly random and avoids always picking
from the same place. This results in a smoother distribution overall
and a slightly lower takeover rate.
There's a tricky behavior that was lost when the idle connections were
made shareable between threads in commit 566df309c ("MEDIUM: connections:
Attempt to get idle connections from other threads."), it is the ability
to retry from the safe list when looking for any type of idle connection
and not finding one in the idle list.
It is already important when dealing with long-lived connections since
they ultimately all become safe, but that case is already covered by
the fact that safe conns not being used end up closing and are not
looked up anymore since connect_server() sees there are none.
But it's even more important when using server-side connections which
periodically close, because the new connections may spend half of their
time in safe state and the other half in the idle state, and failing
to grab one such connection from the right list results in establishing
a new connection.
This patch makes sure that a failure to find an idle connection results
in a new attempt at finding one from the safe list if available. In order
to avoid locking twice, connections are attempted alternatively from the
idle then safe list when picking from siblings. Tests have shown a ~2%
performance increase by avoiding to lock twice.
A typical test with 10000 connections over 16 threads with 210 servers
having a 1 millisecond response time and closing every 5 requests shows
a degrading performance starting at 120k req/s down to 60-90k and an
average reuse rate of 44%. After the fix, the reuse rate raises to 79%
and the performance becomes stable at 254k req/s. Similarly the previous
test with full keep-alive has now increased from 96% reuse rate to 99%
and from 352k to 375k req/s.
No backport is needed as this is 2.2-only.
The problem with the way idle connections currently work is that it's
easy for a thread to steal all of its siblings' connections, then release
them, then another thread does the same, etc. This happens even more easily
due to scheduling latencies, or merged events inside the same poll loop,
which, when dealing with a fast server responding in sub-millisecond
delays, can really result in one thread being fully at work at a time.
In such a case, we perform a huge amount of takeover() which consumes
CPU and requires quite some locking, sometimes resulting in lower
performance than expected.
In order to fight against this problem, this patch introduces a new server
setting "pool-low-conn", whose purpose is to dictate when it is allowed to
steal connections from a sibling. As long as the number of idle connections
remains at least as high as this value, it is permitted to take over another
connection. When the idle connection count becomes lower, a thread may only
use its own connections or create a new one. By proceeding like this even
with a low number (typically 2*nbthreads), we quickly end up in a situation
where all active threads have a few connections. It then becomes possible
to connect to a server without bothering other threads the vast majority
of the time, while still being able to use these connections when the
number of available FDs becomes low.
We also use this threshold instead of global.nbthread in the connection
release logic, allowing to keep more extra connections if needed.
A test performed with 10000 concurrent HTTP/1 connections, 16 threads
and 210 servers with 1 millisecond of server response time showed the
following numbers:
haproxy 2.1.7: 185000 requests per second
haproxy 2.2: 314000 requests per second
haproxy 2.2 lowconn 32: 352000 requests per second
The takeover rate goes down from 300k/s to 13k/s. The difference is
further amplified as the response time shrinks.
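For reference, the setting described above goes on a "server" line, e.g.
(illustrative address and value):

    backend app
        server srv1 192.0.2.10:8080 pool-low-conn 32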
In conn_backend_get() we can avoid locking other servers when trying
to steal their connections when we know for sure they will not have
one, so let's do it to lower the contention on the lock.
This command reuses the existing "show servers state" to also dump the
state of active and idle connections. The main use is to serve as a
debugging tool to troubleshoot connection reuse issues.
Actually the cleanup in commit 6ff8143f7 ("BUG/MINOR: proxy: fix
dump_server_state()'s misuse of the trash") allowed to spot that the
trash is never reset when dumping a servers state. I couldn't manage
to make it dump garbage even with large setups, nor did I find where it's
cleared between successive calls while other handlers do explicitly
invoke chunk_reset(), so it seems to happen a bit by luck.
Let's use chunk_printf() here for each turn, it makes things clearer.
This could be backported along with previous patch, especially if any
user reports occasional garbage appearing in the show servers output.
dump_server_state() claims to dump into a buffer but instead it writes
into a buffer then dumps the trash into the channel, so it only supports
being called with buf=&trash and doesn't need this buffer. There doesn't
seem to be any current impact of this mistake since the function is called
from one location only.
A backport may be performed if it helps fixing other bugs but it will not
fix an existing bug by itself.
This patch adds a missing break to end the loop in case when '%[' is not
properly closed with ']'.
The issue has been introduced with commit cd0d2ed ("MEDIUM: log-format:
make the LF parser aware of sample expressions' end").
In the pat_match_str() and pat_match_beg() functions, a trailing zero is
systematically added at the end of the string, even if the buffer is not large
enough to accommodate it. This is a possible buffer overflow. For instance, when
the alpn is matched against a list of strings, the sample fetch is filled with a
non-null terminated string returned by the SSL library. No trailing zero must be
added at the end of this string, because it is outside the buffer.
So, to fix the bug, a trailing zero is added only if the buffer is large enough
to accommodate it. Otherwise, the sample fetch is duplicated. smp_dup() function
adds a trailing zero to the duplicated string, truncating it if it is too long.
This patch should fix the issue #718. It must be backported to all supported
versions.
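A sketch of the fix under these assumptions (buffer layout borrowed from
recent HAProxy versions, context simplified):

    /* Only write the trailing zero when it fits in the sample's own
     * buffer; otherwise duplicate the sample, which guarantees a
     * terminated (possibly truncated) private copy. */
    if (smp->data.u.str.data < smp->data.u.str.size)
        smp->data.u.str.area[smp->data.u.str.data] = '\0';
    else
        smp_dup(smp);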
If the owning task is already dying (context was destroyed by fcgi_takeover)
there's no point taking the lock then removing it later since all the code
in between is conditioned by a non-null context. Let's simplify this.
If the owning task is already dying (context was destroyed by h2_takeover)
there's no point taking the lock then removing it later since all the code
in between is conditioned by a non-null context. Let's simplify this.
If the owning task is already dying (context was destroyed by h1_takeover)
there's no point taking the lock then removing it later since all the code
in between is conditioned by a non-null context. Let's simplify this.
In commit 3ef7a190b ("MEDIUM: tasks: apply a fair CPU distribution
between tasklet classes") we compute a total weight to be used to
split the CPU time between queues. There is a mention that the
total cannot be null, which is based on the fact that we only get
there if thread_has_task() returns non-zero. But there is a very
small race which can break this assumption: if two threads conflict
on MT_LIST_ADDQ() on an empty shared list and both roll back before
trying again, there is the possibility that a first call to
MT_LIST_ISEMPTY() sees the first thread install itself, then the
second call will see the list empty when both roll back. Thus we
could proceed with the queue while it's temporarily empty and
compute max lengths using a divide by zero. This case is very
hard to trigger, it seldom happens on 16 threads at 400k req/s.
Let's simply test for max_total and leave the loop when we've not
found any work.
No backport is needed, that's 2.2-only.
The parsing of http deny rules with no argument or only the deny_status argument
is buggy if followed by an ACL expression (starting with the "if" or "unless"
keyword). Instead of using the proxy errorfiles, a dummy error is used. To fix
the bug, the parsing function must also check for "if" or "unless" keyword in
such cases.
This patch should fix the issue #720. No backport is needed.
Commit d645574 ("MINOR: soft-stop: let the first stopper only signal
other threads") introduced a minor mistake which is that when a stopping
thread signals all other threads, it also signals itself. When
single-threaded, the process constantly wakes up while waiting for
last connections to exit. Let's reintroduce the lost mask to avoid
this.
No backport is needed, this is 2.2-dev only.
The max_used_conns value is used as an estimate of the needed number of
connections on a server to know how many to keep open. But this one is
not reported, making it hard to troubleshoot reuse issues. Let's export
it in the sessions/current column.
Starting with commit 079cb9a ("MEDIUM: connections: Revamp the way idle
connections are killed") we started to improve the way to compute the
need for idle connections. But the condition to keep a connection idle
or drop it when releasing it was not updated. This often results in
storms of close when certain thresholds are met, and long series of
takeover() when there aren't enough connections left for a thread on
a server.
This patch tries to improve the situation this way:
- it keeps an estimate of the number of connections needed for a server.
This estimate is a copy of the max over the previous purge period, or is a
max of what is seen over the current period; it differs from max_used_conns
in that this one is a counter that's reset on each purge period ;
- when releasing, if the number of current idle+used connections is
lower than this last estimate, then we'll keep the connection;
- when releasing, if the current thread's idle conns head is empty,
and we don't exceed the estimate by the number of threads, then
we'll keep the connection.
- when cleaning up connections, we consider the max of the last two
periods to avoid killing too many idle conns when facing bursty
traffic.
Thanks to this we can better converge towards a situation where, provided
there are enough FDs, each active server keeps at least one idle connection
per thread all the time, with a total number close to what was needed over
the previous measurement period (as defined by pool-purge-delay).
On tests with large numbers of concurrent connections (30k) and many
servers (200), this has quite smoothed the CPU usage pattern, increased
the reuse rate and roughly halved the takeover rate.
There's a minor glitch with the way idle connections start to be evicted.
The lookup always goes from thread 0 to thread N-1. This causes depletion
of connections on the first threads and abundance on the last ones. This
is visible with the takeover() stats below:
$ socat - /tmp/sock1 <<< "show activity"|grep ^fd ; \
sleep 10 ; \
socat - /tmp/sock1 <<< "show activity"|grep ^fd
fd_takeover: 300144 [ 91887 84029 66254 57974 ]
fd_takeover: 359631 [ 111369 99699 79145 69418 ]
There are respectively 19k, 15k, 13k and 11k takeovers for only 4 threads,
indicating that the first thread needs a foreign FD twice more often than
the 4th one.
This patch changes this so that all threads are scanned in round robin
starting with the current one. The takeovers now happen in a much more
distributed way (about 4 times 9k) :
fd_takeover: 1420081 [ 359562 359453 346586 354480 ]
fd_takeover: 1457044 [ 368779 368429 355990 363846 ]
There is no need to backport this, as this happened along a few patches
that were merged during 2.2 development.
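A self-contained illustration of the new scan order (not the actual
HAProxy code):

    /* Visit all thread IDs once, starting after <tid> and wrapping
     * around, instead of always scanning from thread 0. */
    static void scan_threads(int tid, int nbthread)
    {
        int i = tid;

        do {
            i = (i + 1 == nbthread) ? 0 : i + 1;
            /* ... try to take over an idle connection from thread <i> ... */
        } while (i != tid);
    }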
The FD takeover operation might have certain impacts explaining
unexpected activities, so it's important to report such a counter
there. We thus count the number of times a thread has stolen an
FD from another thread.
The servers have internal states describing the status of idle connections,
unfortunately these were not exported in the stats. This patch adds the 3
following gauges:
- idle_conn_cur : Current number of unsafe idle connections
- safe_conn_cur : Current number of safe idle connections
- used_conn_cur : Current number of connections in use
DEBUG_FD was added by commit 38e8a1c in 2.2-dev, and "show fd" was
slightly modified to still allow to print orphaned/closed FDs if their
count is non-null. But bypassing the existing test made it possible
to dereference fdt.owner which can be null. Let's adjust the condition
to avoid this.
No backport is needed.
The LRU cache head was an array of list, which causes false sharing
between 4 to 8 threads in the same cache line. Let's move it to the
thread_info structure instead. There's no need to do the same for the
pool_cache[] array since it's already quite large (32 pointers each).
By doing this the request rate increased by 1% on a 16-thread machine.
In tcpcheck_eval_connect(), if we're targetting a server, increase its
curr_used_conns when creating a new connection, as the counter will be
decreased later when the connection is destroyed and conn_free() is called.
In connect_server(), we want to increase curr_used_conns only if the
connection is new, or if it comes from an idle_pool, otherwise it means
the connection is already used by at least one other stream, and it is
already accounted for.
We used to have 3 thread-based arrays for toremove_lock, idle_cleanup,
and toremove_connections. The problem is that these items are small,
and that this creates false sharing between threads since it's possible
to pack up to 8-16 of these values into a single cache line. This can
cause real damage where there is contention on the lock.
This patch creates a new array of struct "idle_conns" that is aligned
on a cache line and which contains all three members above. This way
each thread has access to its variables without hindering the other
ones. Just doing this increased the HTTP/1 request rate by 5% on a
16-thread machine.
The definition was moved to connection.{c,h} since it appeared a more
natural evolution of the ongoing changes given that there was already
one of them declared in connection.h previously.
"show sess" and particularly "show sess all" can be very slow when dumping
lots of information, and while dumping, new sessions might appear, making
the output really endless. When threads are used, this causes a double
problem:
- all threads are paused during the dump, so an overly long dump degrades
the quality of service ;
- since all threads are paused, more events get postponed, possibly
resulting in more streams to be dumped on next invocation of the dump
function.
This patch addresses this long-lasting issue by doing something simple:
the CLI's stream is moved to the end of the streams list, serving as an
identifiable marker to end the dump, because all entries past it were
added after the command was entered. As a result, the CLI's stream always
appears as the last one.
It may make sense to backport this to stable branches where dumping live
streams is difficult as well.
Commit cd4159f ("MEDIUM: mux_h2: Implement the takeover() method.")
added a return in the middle of the function, and as usual with such
stray return statements, some unrolling was lost. Here it's only the
TRACE_LEAVE() call, so it's mostly harmless. That's 2.2 only, no
backport is needed.
The IPv4 code did not take into account that the header value might not
contain the trailing NUL byte, possibly reading stray data after the header
value, failing the parse and testing the IPv6 branch. That one adds the
missing NUL, but fails to parse IPv4 addresses.
Fix this issue by always adding the trailing NUL.
The bug was reported on GitHub as issue #715.
It's not entirely clear when this bug started appearing, possibly earlier
versions of smp_fetch_hdr guaranteed the NUL termination. However the
addition of the NUL in the IPv6 case was added together with IPv6 support,
hinting that at that point in time the NUL was not guaranteed.
The commit that added IPv6 support was 69fa99292e
which first appeared in HAProxy 1.5. This patch should be backported to
1.5+, taking into account the various buffer / chunk changes and the movement
across different files.
Issue 23653 in oss-fuzz reports a heap overflow bug which is in fact a
bug introduced by commit 9e1758efb ("BUG/MEDIUM: cfgparse: use
parse_line() to expand/unquote/unescape config lines") to address
oss-fuzz issue 22689, which was only partially fixed by commit 70f58997f
("BUG/MINOR: cfgparse: Support configurations without newline at EOF").
Actually on an empty line, end == line so we cannot dereference end-1
to check for a trailing LF without first being sure that end is greater
than line.
No backport is needed, this is 2.2 only.
When an event must be processed, we decide to create a new SPOE applet if there
is no idle applet at all or if the processing rate is lower than the number of
waiting events. But when the processing rate is very low (< 1 event/second), a
new applet is created independently of the number of idle applets.
Now, when there is at least one idle applet and only one event to process,
no new applet is created.
This patch is related to the issue #690.
When an informational response (1xx) is returned by HAProxy, we must be sure to
send it ASAP. To do so, CF_SEND_DONTWAIT flag must be set on the response
channel to instruct the stream-interface to not set the CO_SFL_MSG_MORE flag on
the transport layer. Otherwise the response delivery may be delayed, because of
the commit 8945bb6c0 ("BUG/MEDIUM: stream-int: fix loss of CO_SFL_MSG_MORE flag
in forwarding").
This patch may be backported as far as 1.9, for HTX part only. But this part has
changed in the 2.2, so it may be a bit tricky. Note it does not fix any known
bug on previous versions because the CO_SFL_MSG_MORE flag is ignored by the h1
mux.
To be consistent with other processing on the channels, when HAProxy generates
a final response, the CF_EOI flag must be set on the response channel. This flag
is used to know that a full message was pushed into the channel (HTX messages
with an EOM block). It is used in conjunction with other channel's flags in
stream-interface functions. Especially when si_cs_send() is called, to know if
we must set or not the CO_SFL_MSG_MORE flag. Without CF_EOI, the CO_SFL_MSG_MORE
flag is always set and the message forwarding is delayed.
This patch may be backported as far as 1.9, for HTX part only. But this part has
changed in the 2.2, so it may be a bit tricky. Note it does not fix any known
bug on previous versions because the CO_SFL_MSG_MORE flag is ignored by the h1
mux.
In HTX, since the commit 8945bb6c0 ("BUG/MEDIUM: stream-int: fix loss of
CO_SFL_MSG_MORE flag in forwarding"), the CO_SFL_MSG_MORE flag is set on the
transport layer if the end of the HTTP message is not reached, to delay the data
forwarding. To do so, the CF_EOI flag is tested and must not be set on the
output channel.
But the CO_SFL_MSG_MORE flag is also added if the message was truncated. Only
CF_SHUTR is set in this case. So the forwarding may be delayed, waiting for
more data that will never come. So, in HTX, the CO_SFL_MSG_MORE flag must not
be set if the message is finished (full or truncated).
No backport is needed.
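A sketch of the corrected condition using the flags named above (context
simplified, variable names assumed):

    /* Only ask the transport layer to wait for more data when the
     * message is neither complete (CF_EOI) nor truncated (CF_SHUTR). */
    if (!(oc->flags & (CF_EOI | CF_SHUTR)))
        send_flags |= CO_SFL_MSG_MORE;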
When HAProxy generates a 500 response, if the formatting failed, for instance
because the message is larger than a buffer, it retries to format it in a loop.
To fix the bug, we must stop trying to send a response if it is a non-rewritable
response (the TX_CONST_REPLY flag is set on the HTTP transaction).
Because this part is not trivial, some comments have been added.
No backport is needed.
This commit adds some sample fetches that were lacking on the server
side:
ssl_s_key_alg, ssl_s_notafter, ssl_s_notbefore, ssl_s_sig_alg,
ssl_s_i_dn, ssl_s_s_dn, ssl_s_serial, ssl_s_sha1, ssl_s_der,
ssl_s_version
Trailing slashes were not handled in the crt-list commands on the CLI, though
accepting them is useful when using the commands with a directory.
Strip the slashes before looking up the crtlist in the tree.
With the rework of the config line parser, we've started to emit a dump
of the initial line underlined by a caret character indicating the error
location. But with extremely large lines it starts to take time and can
even cause trouble to slow terminals (e.g. over ssh), and this becomes
useless. In addition, control characters could be dumped as-is which is
bad, especially when the input file is accidentally wrong (an executable).
This patch adds a string sanitization function which isolates an area
around the error position in order to report only that area if the string
is too large. The limit was set to 80 characters, which will result in
roughly 40 chars around the error being reported only, prefixed and suffixed
with "..." as needed. In addition, non-printable characters in the line are
now replaced with '?' so as not to corrupt the terminal. This way invalid
variable names, unmatched quotes etc will be easier to spot.
A typical output is now:
[ALERT] 176/092336 (23852) : parsing [bad.cfg:8]: forbidden first char in environment variable name at position 811957:
...c$PATH$PATH$d(xlc`%?$PATH$PATH$dgc?T$%$P?AH?$PATH$PATH$d(?$PATH$PATH$dgc?%...
^
The config parser change in commit 9e1758efb ("BUG/MEDIUM: cfgparse: use
parse_line() to expand/unquote/unescape config lines") is wrong when
displaying the last parsed word, because it doesn't verify that the output
string was properly allocated. This may fail in two cases:
- very first line (outline is NULL, as in oss-fuzz issue 23657)
- much longer line than previous ones, requiring a realloc(), in which
case the final 0 is out of the allocated space.
This patch moves the reporting after the allocation check to fix this.
No backport is needed, this is 2.2 only.
parse_line() as added in commit c8d167bcf ("MINOR: tools: add a new
configurable line parser, parse_line()") is difficult to use
because it's up to the caller to determine the last written argument
based on what was passed to it. In practice the only way to safely
use it is for the caller to always pass nbarg-1 and make that last
entry point to the last arg + its strlen. This is annoying because
it makes it as painful to use as the infamous strncpy() while it has
all the information the caller needs.
This patch changes its behavior so that it guarantees that at least
one argument will point to the trailing zero at the end of the output
string, as long as there is at least one argument. The caller just
has to pass +1 to the arg count to make sure at least a last one is
empty.
When fgets() returns an incomplete line we must not increment linenum
otherwise line numbers become incorrect. This may happen when parsing
files with extremely long lines which require a realloc().
The bug has been present since unbounded line length was supported, so
the fix should be backported to older branches.
A crash was reported in issue #707 because the private key was not
uploaded correctly with "set ssl cert".
The bug is provoked by X509_check_private_key() being called when there
is no private key, which can lead to a segfault.
This patch adds a check and returns an error if the private key is not
present.
This must be backported in 2.1.
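A sketch of the added guard (simplified; variable names and error handling
are assumptions):

    /* X509_check_private_key() must not be called with a NULL key, so
     * fail explicitly when no private key was uploaded. */
    if (!key) {
        memprintf(err, "the private key is missing");
        goto error;
    }
    if (!X509_check_private_key(cert, key))
        goto error;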
Now that all tasklet queues are scanned at once by run_tasks_from_lists(),
it becomes possible to always check for lower priority classes and jump
back to them when they exist.
This patch adds tune.sched.low-latency global setting to enable this
behavior. What it does is stick to the lowest ranked priority list in
which tasks are still present with an available budget, and leave the
loop to refill the tasklet lists if the trees got new tasks or if new
work arrived into the shared urgent queue.
Doing so allows to cut the latency in half when running with extremely
deep run queues (10k-100k), thus allowing forwarding of small and large
objects to coexist better. It remains off by default since it does have
a small impact on large traffic (shorter batches).
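For reference, the setting described above is enabled in the global
section:

    global
        tune.sched.low-latency on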
Now process_runnable_tasks is responsible for calculating the budgets
for each queue, dequeuing from the tree, and calling run_tasks_from_lists().
This latter one scans the queues, picking tasks there and respecting budgets.
Note that its name was updated with a plural "s" for this reason.
It is neither convenient nor scalable to check each and every tasklet
queue to figure whether it's empty or not while we often need to check
them all at once. This patch introduces a tasklet class mask which gets
a bit 1 set for each queue representing one class of service. A single
test on the mask allows to figure whether there's still some work to be
done. It will later be usable to better factor the runqueue code.
Bits are set when tasklets are queued. They're cleared when queues are
emptied. It is possible that a queue is empty but has a bit if a tasklet
was added then removed, but this is not a problem as this is properly
checked for in run_tasks_from_lists().
It will be convenient to have the tasklet queue number soon, better make
current_queue an index rather than a pointer to the queue. When not currently
running (e.g. from I/O), the index is -1.
Till now in process_runnable_tasks() we used to reserve a fixed portion
of max_processed to urgent tasks, then a portion of what remains for
normal tasks, then what remains for bulk tasks. This causes two issues:
- the current budget for processed tasks could be drained once for
all by higher level tasks so that they couldn't have enough left
for the next run. For example, if bulk tasklets cause task wakeups,
the required share to run them could be eaten by other bulk tasklets.
- it forces the urgent tasks to be run before scanning the tree so that
we know how many tasks to pick from the tree, and this isn't very
efficient cache-wise.
This patch changes this so that we compute upfront how max_processed will
be shared between classes that require so. We can then decide in advance
to pick a certain number of tasks from the tree, then execute all tasklets
in turn. When reaching the end, if there's still some budget, we can go
back and do the same thing again, improving chances to pick new work
before the global budget is depleted.
The default weights have been set to 50% for urgent tasklets, 37% for
normal ones and 13% for the bulk ones. In practice, there are not that
many urgent tasklets but when they appear they are cheap and must be
processed in as large batches as possible. Every time there is nothing
to pick there, the unused budget is shared between normal and bulk and
this allows bulk tasklets to still have quite some CPU to run on.
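As a worked example of the weights above (illustrative figures, not the
actual code):

    /* Splitting a budget of 100 tasks: 50% urgent, 37% normal, and the
     * remainder (13%) for bulk; unused urgent budget is later shared
     * between the normal and bulk classes. */
    int max_processed = 100;
    int urgent = max_processed * 50 / 100;           /* 50 */
    int normal = max_processed * 37 / 100;           /* 37 */
    int bulk   = max_processed - urgent - normal;    /* 13 */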
Move the ckch_deinit() and crtlist_deinit() call to ssl_sock.c,
also unlink the SNI from the ckch_inst because they are free'd before in
ssl_sock_free_all_ctx().
In ticket #706 it was reported that a certificate which was added from
the CLI can't be removed with 'del ssl cert' and is marked as 'Used'.
The problem is that the certificate instances are not added to the
created crtlist_entry, so they can't be deleted upon a 'del ssl
crt-list', and the store can never be marked 'Unused' because of this.
This patch fixes the issue by adding the instances to the crtlist_entry,
which is enough to fix the issue.
Add some functions to deinit the whole crtlist and ckch architecture.
It will free all crtlist, crtlist_entry, ckch_store, ckch_inst and their
associated SNI, ssl_conf and SSL_CTX.
The SSL_CTX in the default_ctx and initial_ctx still needs to be free'd
separately.
Since commit 2954c47 ("MEDIUM: ssl: allow crt-list caching"), the
ssl_bind_conf is allocated directly in the crt-list, and the crt-list
can be shared between several bind_conf. The deinit() code wasn't
changed to handle that.
This patch fixes the issue by removing the free of the ssl_conf in
ssl_sock_free_all_ctx().
It should be completed with a patch that frees the ssl_conf and the
crt-list.
Fix issue #700.
The arguments are relative to the outline, not relative to the input line.
This patch fixes up commit 9e1758efbd which
is 2.2 only. No backport needed.
The returned `arg` value is the number of arguments found, but in case
of the error message it's not a valid argument index.
Because we know how many arguments we allowed (MAX_LINE_ARGS) we know
what to print in the error message, so do just that.
Consider a configuration like this:
listen foo
1 2 3 [...] 64 65
Then running a configuration check within valgrind reports the following:
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E8B83: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Use of uninitialised value of size 8
==18265== at 0x56E576B: _itoa_word (_itoa.c:179)
==18265== by 0x56E912C: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E5775: _itoa_word (_itoa.c:179)
==18265== by 0x56E912C: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E91AF: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E8C59: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E941A: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E8CAB: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56E8CE2: vfprintf (vfprintf.c:1631)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
==18265== Conditional jump or move depends on uninitialised value(s)
==18265== at 0x56EA2DB: vfprintf (vfprintf.c:1632)
==18265== by 0x57B1895: __vsnprintf_chk (vsnprintf_chk.c:63)
==18265== by 0x4A8642: vsnprintf (stdio2.h:77)
==18265== by 0x4A8642: memvprintf (tools.c:3647)
==18265== by 0x4CB8A4: print_message (log.c:1085)
==18265== by 0x4CE0AC: ha_alert (log.c:1128)
==18265== by 0x459E41: readcfgfile (cfgparse.c:1978)
==18265== by 0x507CB5: init (haproxy.c:2029)
==18265== by 0x4182A2: main (haproxy.c:3137)
==18265==
[ALERT] 174/165720 (18265) : parsing [./config.cfg:2]: too many words, truncating at word 65, position -95900735: <(null)>.
[ALERT] 174/165720 (18265) : Error(s) found in configuration file : ./config.cfg
[ALERT] 174/165720 (18265) : Fatal errors found in configuration.
Valgrind reports conditional jumps relying on an undefined value and the
error message clearly shows incorrect values.
After this patch is applied, the reliance on undefined values is gone and
the <(null)> will actually show the argument. However the position value
is still incorrect. This will be fixed in a follow-up patch.
This patch fixes up commit 9e1758efbd which
is 2.2 only. No backport needed.
In task_per_thread[] we now have current_queue which is a pointer to
the current tasklet_list entry being evaluated. This will be used to
know the class under which the current task/tasklet is currently
running.
We want to be sure not to exceed max_processed. It can actually go
slightly negative due to the rounding applied to ratios, but we must
refrain from processing too many tasks if it's already low.
This became particularly relevant since recent commit 5c8be272c ("MEDIUM:
tasks: also process late wakeups in process_runnable_tasks()") which was
merged into 2.2-dev10. No backport is needed.
When DEBUG_FD is set at build time, we'll keep a counter of per-FD events
in the fdtab. This counter is reported in "show fd" even for closed FDs if
not zero. The purpose is to help spot situations where an apparently closed
FD continues to be reported in loops, or where some events are dismissed.
Coverity reports a possible null deref in issue #703. It seems this
cannot happen as in order to have a CF_READ_ERROR we'd need to have
attempted a recv() which implies a conn_stream, thus conn cannot be
NULL anymore. But at least one line tests for conn and the other one
not, which is confusing. So let's add a check for conn before
dereferencing it.
This needs to be backported to 2.1 and 2.0. Note that in 2.0 it's
in proto_htx.c.
As discussed on the list: https://www.mail-archive.com/haproxy@formilux.org/msg37698.html
This patch adds warnings to the configuration parser that detect the
following situations:
- A line being truncated by a null byte in the middle.
- A file not ending in a new line (and possibly being truncated).
Fix parsing of configurations if the configuration file does not end with
an LF.
This patch fixes GitHub issue #704. It's a regression in
9e1758efbd which is 2.2 specific. No backport
needed.
When a SPOE filter starts the response analyze, the wrong flag is tested on the
pre_analyzers bit field. AN_RES_INSPECT must be tested instead of
SPOE_EV_ON_TCP_RSP.
This patch must be backported to all versions with the SPOE support, i.e as far
as 1.7.
If a fcgi application is configured to send its logs to a ring buffer, the
corresponding sink must be resolved during the configuration post
parsing. Otherwise, the sink is undefined when a log message is emitted,
crashing HAProxy.
No need to backport.
In h1_snd_buf(), also set H1_F_CO_MSG_MORE if we know we still have more to
send, not just if the stream-interface told us to do so. This may happen if
the last block of a transfer doesn't fit in the buffer, it remains useful
for the transport layer to know that more data follows what's already in
the buffer.
In 2.2-dev1, a change was made by commit 46230363a ("MINOR: mux-h1: Inherit
send flags from the upper layer"). The purpose was to accurately set the
CO_SFL_MSG_MORE flag on the transport layer because previously it was only
set based on the buffer full condition, which does not accurately indicate
that there are more data to follow.
The problem is that the stream-interface never sets this flag anymore in
HTX mode due to the channel's to_forward always being set to infinity.
Because of this, HTX transfers are always performed without the MSG_MORE
flag and experience a severe performance degradation on large transfers.
This patch addresses this by making the stream-interface aware of HTX and
having it check for CF_EOI to check if more contents are expected or not.
With this change, the single-threaded forwarding performance on 10 MB
objects jumped from 29 to 40 Gbps.
No backport is needed.
As reported in issue #419, a "clear map" operation on a very large map
can take a lot of time and freeze the entire process for several seconds.
This patch makes sure that pat_ref_prune() can regularly yield after
clearing some entries so that the rest of the process continues to work.
The first part, the removal of the patterns, can take quite some time
by itself in one run but it's still relatively fast. It may block for
up to 100ms for 16M IP addresses in a tree typically. This change needed
to declare an I/O handler for the clear operation so that we can get
back to it after yielding.
The second part can be much slower because it deconstructs the elements
and its users, but it iterates progressively so we can yield less often
here.
The patch was tested with traffic in parallel soliciting the map being
released and showed no problem. Some traffic will definitely notice an
incomplete map but the filling is already not atomic anyway thus this is
not different.
It may be backported to stable versions once sufficiently tested for side
effects, at least as far as 2.0 in order to avoid the watchdog triggering
when the process is frozen there. For a better behaviour, all these
prune_* functions should support yielding so that the callers also get a
chance to yield in turn.
Initial default settings for maxconn/maxsock/maxpipes were rearranged
in commit a409f30d0 ("MINOR: init: move the maxsock calculation code
to compute_ideal_maxsock()") but as a side effect, the calculated
maxpipes value was not stored anymore into global.maxpipes. This
resulted in splicing being disabled unless there is an explicit
maxpipes setting in the global section.
This patch just stores the calculated ideal value as planned in the
computation and as was done before the patch above.
This is strictly 2.2, no backport is needed.
Fix the semicolon escaping which must be handled in the master CLI:
the commands were wrongly split and could be forwarded partially to
the target CLI.
The master CLI must not do the escaping since it forwards the commands
to another CLI. It should be able to split into words by taking care of
the escaping, but must not remove the forwarded backslashes.
This fix does the same thing as the previous patch applied to the
cli_parse_request() function, by taking care of the escaping during the
word split, but it also removes the part which stripped the
backslashes from the forwarded command.
It was not possible to escape spaces over the CLI, making it impossible
to insert new ACL entries containing spaces from the CLI.
This patch fixes the escaping of spaces over the CLI.
It is now possible to launch "add acl agents.acl My\ User\ Agent" over
the CLI.
Could be backported in all stable branches.
Should fix issue #400.
Since version 1.8, we've started to use tasks and tasklets more
extensively to defer I/O processing. Originally with the simple
scheduler, a task waking another one up using task_wakeup() would
have caused it to be processed right after the list of runnable ones.
With the introduction of tasklets, we've started to spill running
tasks from the run queues to the tasklet queues, so if a task wakes
another one up, it will only be executed on the next call to
process_runnable_tasks(), which means after yet another round of the
polling loop.
This is particularly visible with I/Os hitting muxes: poll() reports
a read event, the connection layer performs a tasklet_wakeup() on the
mux subscribed to this I/O, and this mux in turn signals the upper
layer stream using task_wakeup(). The process goes back to poll() with
a null timeout since there's one active task, then back to checking all
possibly expired events, and finally back to process_runnable_tasks()
again. Worse, when there is high I/O activity, doing so will make the
task's execution further apart from the tasklet and will both increase
the total processing latency and reduce the cache hit ratio.
This patch brings back to the original spirit of process_runnable_tasks()
which is to execute runnable tasks as long as the execution budget is not
exhausted. By doing so, we're immediately cutting in half the number of
calls to all functions called by run_poll_loop(), and halving the number
of calls to poll(). Furthermore, calling poll() less often also means
purging FD updates less often and offering more chances to merge them.
This also has the nice effect of making tune.runqueue-depth effective
again, as in the past it used to be quickly bounded by this artificial
event horizon which was preventing from executing remaining tasks. On
certain workloads we can see a 2-3% performance increase.
Due to the way the wait queue works, some tasks might be postponed but not
requeued. However when we exit wake_expired_tasks() on a not-yet-expired
task and leave it in this situation, the next call to next_timer_expiry()
will use this first task's key in the tree as an expiration date, but this
date might be totally off and cause needless wakeups just to reposition it.
This patch makes sure that we leave wake_expired_tasks with a clean state
of frontside tasks and that their tree's key matches their expiration date.
Doing so we can already observe a ~15% reduction of the number of wakeups
when dealing with large numbers of health checks.
The patch looks large because the code was rearranged but the real change
is to take the wakeup/requeue decision on the task's expiration date instead
of the tree node's key, the rest is unchanged.
Nowadays signals cause tasks to be woken up. The historic code still
processes signals after tasks, which forces a second round in the loop
before they can effectively be processed. Let's move the signal queue
handling between wake_expired_tasks() and process_runnable_tasks() where
it makes much more sense.
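A simplified sketch of the reordered loop (the real run_poll_loop()
contains much more):

    for (;;) {
        wake_expired_tasks();      /* wakes tasks whose timers fired */
        signal_process_queue();    /* signals wake tasks too: do it now */
        process_runnable_tasks();  /* run everything woken up above */
        /* ... then compute the poll timeout and call the poller ... */
    }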
Some of the recent optimizations around the polling to save a few
epoll_ctl() calls have shown that they could also cause some trouble.
However, over time our code base has become totally asynchronous with
I/Os always attempted from the upper layers and only retried at the
bottom, making it look like we're getting closer to EPOLLET support.
There are showstoppers there such as the listeners which cannot support
this. But given that most of the epoll_ctl() dance comes from the
connections, we can try to enable edge-triggered polling on connections.
What this patch does is to add a new global tunable "tune.fd.edge-triggered",
that makes fd_insert() automatically set an et_possible bit on the fd if
the I/O callback is conn_fd_handler. When the epoll code sees an update
for such an FD, it immediately registers it in both directions the first
time and doesn't update it anymore.
On a few tests it proved quite useful with a 14% request rate increase in
a H2->H1 scenario, reducing the epoll_ctl() calls from 2 per request to
2 per connection.
The option is obviously disabled by default as bugs are still expected,
particularly around the subscribe() code where it is possible that some
layers do not always re-attempt reading data after being woken up.
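As an illustration, enabling it would presumably look like this in the
global section (the keyword comes from this patch; the exact value syntax
is an assumption):

    global
        tune.fd.edge-triggered on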
localpeer <name>
Sets the local instance's peer name. It will be ignored if the "-L"
command line argument is specified or if used after "peers" section
definitions. In such cases, a warning message will be emitted during
the configuration parsing.
This option will also set the HAPROXY_LOCALPEER environment variable.
See also "-L" in the management guide and "peers" section in the
configuration manual.
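A short example, assuming a typical global section (the peer name is
made up):

    global
        localpeer mypeer1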
Since there was a risk of leaving fd_takeover() without properly
stopping the fd, let's take this opportunity to factor the code
around a common exit point shared by both the double-CAS and locked
modes. This means using the "ret" variable inside the double-CAS code,
and inverting the loop to first test the old values. Doing so also
produces cleaner code because the compiler cannot factorize common
exit paths using asm statements that are present in some atomic ops.
The loop in fd_takeover() around the double-CAS is conditioned on
a previous value of old_masks[0] that always matches tid_bit on the
first iteration because it does not result from the atomic op but
from a pre-loaded value. Let's set the result of the atomic op there
instead so that the conflict between threads can be detected earlier
and before performing the double-word CAS.
When haproxy is compiled without double-word CAS, we use a migration lock
in fd_takeover(). This lock was covering the atomic OR on the running_mask
before checking its value, while it is not needed since this atomic op
already returns the result. Let's just refine the code to avoid grabbing
the lock in the event another thread has already stolen the FD, this may
reduce contention in high reuse rate scenarios.
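A minimal sketch of the refinement, with illustrative names (HA_ATOMIC_OR
is assumed here to return the resulting mask, as the description says):

    unsigned long old = HA_ATOMIC_OR(&fdtab[fd].running_mask, tid_bit);

    if (old != tid_bit) {
        /* another thread is already present: fail without ever
         * touching the migration lock */
        return -1;
    }
    HA_SPIN_LOCK(FD_LOCK, &fd_mig_lock);  /* illustrative lock name */
    /* ... perform the takeover ... */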
This one was confusingly named; I thought it was the cumulated number
of streams but it's the number of calls to process_stream(). Let's make
this clearer.
empty_rq and long_rq are per-loop so it makes sense to group them
together with the loop count. In addition since ctxsw and tasksw
apply in the context of these counters, let's move them as well.
More precisely the difference between wake_tasks and long_rq should
roughly correspond to the number of inter-task messages. Visually
it's much easier to spot ratios of wakeup causes now.
In fd_takeover(), when a double-width compare-and-swap is implemented,
make sure, if we managed to get the fd, to call fd_stop_recv() on it, so
that the thread that used to own it will know it has to stop polling it.
In fd_takeover(), if we failed to grab the fd, when a double-width
compare-and-swap is not implemented, do not call fd_stop_recv() on the
fd, it is not ours and may be used by another thread.
We have poll_drop, poll_dead and poll_skip which are confusingly named
like their poll_io and poll_exp counterparts except that they are not
per poll() call but per-fd. This patch renames them to poll_drop_fd(),
poll_dead_fd() and poll_skip_fd() for this reason.
The "show activity" output mentions a number of indicators to explain
wake up reasons but doesn't have the number of times poll() sees some
I/O. And given that multiple events can happen simultaneously, it's
not always possible to deduce this metric by subtracting.
This patch adds a new "poll_io" counter that allows one to see how
often poll() returns with at least one active FD. This should help
detect stuck events and measure various ratios of poll sub-metrics.
Since 2.1-dev2, with commit 305d5ab46 ("MAJOR: fd: Get rid of the fd cache.")
we don't have the fd_lock anymore and as such its activity counter is always
zero. Let's remove it from the struct and from "show activity" output, as
there are already plenty of indicators to look at.
The cache line comment in the struct activity was updated to reflect
reality as it looks like another one already got removed in the past.
This effectively reverts the two following commits:
6f95f6e11 ("OPTIM: connection: disable receiving on disabled events when the run queue is too high")
065a02561 ("MEDIUM: connection: don't stop receiving events in the FD handler")
The problem as reported in issue #662 is that when the events signals
the readiness of input data that has to be forwarded over a congested
stream, the mux will read data and wake the stream up to forward them,
but the buffer full condition makes this impossible immediately, then
nobody in the chain will be able to disable the event after it was
first reported. And given we don't know at the connection level whether
an event was already reported or not, we can't decide anymore to
forcefully stop it if for any reason its processing gets delayed.
The problem is magnified in issue #662 by the fact that a shutdown is
reported with pending data occupying the buffer. The shutdown will
strike in loops and cause the upper layer stream to be notified until
it's handled, but with a buffer full it's not possible to call cs_recv()
hence to purge the event.
All this can only be handled optimally by implementing a lower layer,
direct mux-to-mux forwarding that will not require any scheduling. This
way no wake-up will be needed and the event will be instantly handled
or paused for a long time.
For now let's simply revert these optimizations. Running a 1 MB transfer
test over H2 using 8 connections having each 32 streams with a limited
link of 320 Mbps shows the following profile before this fix:
calls syscall (100% CPU)
------ -------
259878 epoll_wait
519759 clock_gettime
17277 sendto
17129 recvfrom
672 epoll_ctl
And the following one after the fix:
calls syscall (2-3% CPU)
------ -------
17201 sendto
17357 recvfrom
2304 epoll_wait
4609 clock_gettime
1200 epoll_ctl
Thus the behavior is much better.
No backport is needed as these patches were only in 2.2-dev.
Many thanks to William Dauchy for reporting a lot of details around this
difficult issue.
Since we've seen clang emit bad code when the address sanitizer is enabled
at -O2, better clearly report it in the version output. It is detected both
for clang and gcc (both tested with and without).
For an unknown reason in commit bb1b63c079 I placed the compiler version
output in haproxy.c instead of version.c. Better have it in version.c which
is more suitable to this sort of thing.
The spoe parser fails to check that the decoded key length is large
enough to match a given key but it uses the returned length in memcmp().
So returning "ver" could match "version" for example. In addition this
makes clang 10's ASAN complain because the second argument to memcmp()
is the static key which is shorter than the decoded buffer size, which
in practice has no impact.
I'm still not 100% sure the parser is entirely correct because even
with this fix it cannot parse a key whose name matches the beginning
of another one, but in practice this does not happen. Ideally a
preliminary length check before the comparison would be safer.
This needs to be backported as far as 1.7.
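A generic sketch of the preliminary length check suggested above (variable
names are made up):

    /* compare the full expected key only when the decoded length allows
     * it, so that a decoded "ver" can no longer match "version" */
    size_t klen = strlen(key);

    if (len >= klen && memcmp(str, key, klen) == 0) {
        /* matched */
    }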
Since we dropped support for legacy mode, it's not the stream which
deals with the connection but the mux, and there's no point in closing
the client connection after most internal status codes. For example if
the client gets a 401 or a 503 because a server doesn't respond, it
makes no sense forcing the connection to close after reporting this
status, because it's already done by the mux if the client asks for it
or is not compatible with keep-alive. This current state was inherited
from the early days but is still limiting the amount of client-side
connection reuse in a number of circumstances (typically server-side
errors). This change was planned for 2.1 but forgotten.
The status codes for which the connection is not closed anymore are those
that do not depend on the client side connection itself, which are all
except 400 and 408. This could be backported to 2.1 but not further, in
order to make sure legacy and HTX behave strictly similarly.
One issue with the config parser is that while it tries to report as many
errors as possible at once, it's actually unbounded. Thus, when calling
haproxy on a wrong file, it can take ages to process, such as here with
half a gigabyte of map file passed instead of a config file:
$ time ./haproxy -c -f large.map 2>&1 |wc -l
16777220
real 0m31.324s
user 0m22.595s
sys 0m28.909s
This patch modifies readcfgfile() to stop reading the config file after a
reasonable amount of fatal errors. This threshold is set to 50, which seems
more than enough to spot a recurrent issue with a bit of context in a terminal
to address several issues at once, without filling logs nor taking time to
parse the file. The difference is clear now:
$ time ./haproxy -c -f large.map 2>&1 |wc -l
55
real 0m0.005s
user 0m0.004s
sys 0m0.003s
This may be backported to older versions without causing too many
difficulties. However the patch will not apply as-is; it will require
incrementing the "fatal" count for each place where ERR_FATAL is set
in the parsing loop.
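An illustrative sketch of the bound inside readcfgfile()'s line-reading
loop (the message wording is made up):

    if (err_code & ERR_FATAL)
        fatal++;
    if (fatal >= 50) {
        ha_alert("parsing [%s:%d]: too many fatal errors (%d), "
                 "stopping now.\n", file, linenum, fatal);
        break;
    }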
Issue 22689 in oss-fuzz shows that specially crafted config files can take
a long time to process. This happens when variable expansion, backslash
escaping or unquoting causes calls to memmove() and possibly to realloc()
resulting in O(N^2) complexity with N following the line size.
By using parse_line() we now have a safe parser that remains in O(N)
regardless of the type of operation. Error reporting changed a little bit
since the errors are not reported anymore from the deepest parsing level.
As such we now report the beginning of the error. One benefit is that for
many invalid character sequences, the original line is shown and the first
bad char or sequence is designated with a caret ('^'), which tends to be
visually easier to spot, for example:
[ALERT] 167/170507 (14633) : parsing [mini5.cfg:19]: unmatched brace in environment variable name below:
"${VAR"}
^
or:
[ALERT] 167/170645 (14640) : parsing [mini5.cfg:18]: unmatched quote below:
timeout client 10s'
^
In case the target buffer is too short for the new line, the output buffer
is grown in 1kB chunks and kept till the end, so that it should not happen
too often.
Before this patch a test like below involving a 4 MB long line would take
138s to process, 98% of which were spent in __memmove_avx_unaligned_erms(),
and now it takes only 65 milliseconds:
$ perl -e 'print "\"\$A\""x1000000,"\n"' | ./haproxy -c -f /dev/stdin 2>/dev/null
This may be backported to stable versions after a long period of
observation to be sure nothing broke. It relies on patch "MINOR: tools:
add a new configurable line parse, parse_line()".
This function takes on input a string to tokenize, an output storage
(which may be the same) and a number of options indicating how to handle
certain characters (single & double quote support, backslash support,
end of line on '#', environment variables etc). On output it will provide
a list of pointers to individual words after having possibly unescaped
some character sequences, handled quotes and resolved environment
variables, and it will also indicate a status made of:
- a list of failures (overlap between src/dst, wrong quote etc)
- the pointer to the first sequence in error
- the required output length (a-la snprintf()).
This allows a caller to freely unescape/unquote a string by using a
pre-allocated temporary buffer and expand it as necessary. It takes
extreme care at avoiding expensive operations and intentionally does
not use memmove() when removing escapes, hence the reason for the
different input and output buffers. The goal is to use it as the basis
for the config parser.
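A hedged usage sketch based on the description above (the exact prototype,
option flags and error bits are assumptions to be checked against the
actual header):

    /* hypothetical caller; buffer and argument counts are arbitrary */
    static int tokenize(char *line)
    {
        char out[1024];
        char *args[64];
        size_t outlen = sizeof(out);
        int nbargs = 64;
        const char *errptr = NULL;
        uint32_t err;

        err = parse_line(line, out, &outlen, args, &nbargs,
                         PARSE_OPT_ENV | PARSE_OPT_DQUOTE | PARSE_OPT_SQUOTE |
                         PARSE_OPT_BKSLASH | PARSE_OPT_SHARP, &errptr);
        if (outlen > sizeof(out))
            return -1; /* output too small: grow to <outlen> and retry */
        if (err)
            return -1; /* errptr points to the first offending sequence */
        return nbargs; /* args[] now points to the individual words */
    }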
As reported in issue #689, there is a subtle bug in the ebtree code used
to compared memory blocks. It stems from the platform-dependent memcmp()
implementation. Original implementations used to perform a byte-per-byte
comparison and to stop at the first non-matching byte, as in this old
example:
https://www.retro11.de/ouxr/211bsd/usr/src/lib/libc/compat-sys5/memcmp.c.html
The ebtree code has been relying on this to detect the first non-matching
byte when comparing keys. This is made so that a zero-terminated string
can fail to match against a longer string.
Over time, especially with large buses and SIMD instruction sets,
multi-byte comparisons have appeared, making the processor fetch bytes
past the first different byte, which could possibly be a trailing zero.
This means that it's possible to read past the allocated area for a
string if it was allocated by strdup().
This is not correct and definitely confuses address sanitizers. In real
life the problem doesn't have visible consequences. Indeed, multi-byte
comparisons are implemented so that aligned words are loaded (e.g. 512
bits at once to process a cache line at a time). So there is no way such
a multi-byte access will cross a page boundary and end up reading from
an unallocated zone. This is why it was never noticed before.
This patch addresses this by implementing a one-byte-at-a-time memcmp()
variant for ebtree, called eb_memcmp(). It's optimized for both small and
long strings and guarantees to stop after the first non-matching byte. It
only needs 5 instructions in the loop and was measured to be 3.2 times
faster than the glibc's AVX2-optimized memcmp() on short strings (1 to
257 bytes), since that latter one comes with a significant setup cost.
The break-even seems to be at 512 bytes where both versions perform
equally, which is way longer than what's used in general here.
This fix should be backported to stable versions and reintegrated into
the ebtree code.
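A one-byte-at-a-time comparison guaranteed to stop at the first
non-matching byte looks like this (simplified; the real eb_memcmp() is
further optimized):

    int eb_memcmp(const void *m1, const void *m2, size_t len)
    {
        const unsigned char *p1 = m1;
        const unsigned char *p2 = m2;

        while (len--) {
            if (*p1 != *p2)
                return *p1 < *p2 ? -1 : 1;
            p1++;
            p2++;
        }
        return 0;
    }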
We cannot simply `release_sample_expr(rule->arg.vars.expr)` for a
`struct act_rule`, because `rule->arg` is a union that might not
contain valid `vars`. This leads to a crash on a configuration using
`http-request redirect` and possibly others:
frontend http
mode http
bind 127.0.0.1:80
http-request redirect scheme https
Instead a `struct act_rule` has a `release_ptr` that must be used
to properly free any additional storage allocated.
This patch fixes a regression in commit ff78fcdd7f.
It must be backported to wherever that patch is backported.
It has been verified that the configuration above no longer crashes.
It has also been verified that the configuration in ff78fcdd7f
does not leak.
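A minimal sketch of the safer cleanup, assuming release_ptr takes the rule
as described:

    /* let the rule free its own storage instead of guessing which
     * union member is valid */
    if (rule->release_ptr)
        rule->release_ptr(rule);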
Commit 0a3b43d9c ("MINOR: haproxy: Make use of deinit_and_exit() for
clean exits") introduced this build warning:
src/haproxy.c: In function 'main':
src/haproxy.c:3775:1: warning: control reaches end of non-void function [-Wreturn-type]
}
^
This is because the new deinit_and_exit() is not marked as "noreturn"
so depending on the optimizations, the noreturn attribute of exit() will
either leak through it and silence the warning or not and confuse the
compiler. Let's just add the attribute to fix this.
No backport is needed, this is purely 2.2.
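A minimal sketch of the annotation (haproxy might wrap it in its own
macro):

    __attribute__((noreturn)) void deinit_and_exit(int status);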
It's unclear why the buffer length wasn't considered when tcp-response
rules were added in 1.5-dev3 with commit 97679e790 ("[MEDIUM] Implement
tcp inspect response rules"). But it's impossible to write working
tcp-response content rules as they're always waiting for the expiration
and do not consider the fact that the buffer is full. It's likely that
tcp-response content rules were only used with HTTP traffic.
This may be backported to stable versions, though it's not very
important considering that nobody has reported this in 10 years.
The req_body and res_body sample fetch functions forgot to set the
SMP_F_MAY_CHANGE flag, making them unusable in tcp content rules. Now we
set the flag as long as the channel is not full and nothing indicates
the end was reached.
This is marked as a bug because it's unusual for a sample fetch function
to return a final verdict while data may change, but this results from a
limitation that was affecting the legacy mode where it was not possible
to know whether the end was reached without de-chunking the message. In
HTX there is no more reason to limit this. This fix could be backported
to 2.1, and to 2.0 if really needed, though it will only be doable for
HTX, and legacy cannot be fixed.
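A hedged sketch of the idea (the condition names are illustrative; the
real tests live in the req.body/res.body fetch functions):

    if (!channel_is_full && !end_of_input)
        smp->flags |= SMP_F_MAY_CHANGE;  /* more data may still arrive */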
In issue #685 gcc 10 seems to find a null pointer deref in free_zlib().
There is no such case because all possible pools are tested and there's
no other one in zlib, except if there's a bug or memory corruption
somewhere else. The code used to be written like this to make sure that
any such bug couldn't remain unnoticed.
Now gcc 10 sees this theoretical code path and complains, so let's just
change the code to place an explicit crash in case of no match (which
must never happen).
This might be backported if other versions are affected.
Given the following example configuration:
frontend foo
bind *:8080
mode http
http-request set-var(txn.foo) str(bar)
Running a configuration check within valgrind reports:
==23665== Memcheck, a memory error detector
==23665== Copyright (C) 2002-2015, and GNU GPL'd, by Julian Seward et al.
==23665== Using Valgrind-3.11.0 and LibVEX; rerun with -h for copyright info
==23665== Command: ./haproxy -c -f ./crasher.cfg
==23665==
[WARNING] 165/002941 (23665) : config : missing timeouts for frontend 'foo'.
| While not properly invalid, you will certainly encounter various problems
| with such a configuration. To fix this, please ensure that all following
| timeouts are set to a non-zero value: 'client', 'connect', 'server'.
Warnings were found.
Configuration file is valid
==23665==
==23665== HEAP SUMMARY:
==23665== in use at exit: 314,008 bytes in 87 blocks
==23665== total heap usage: 160 allocs, 73 frees, 1,448,074 bytes allocated
==23665==
==23665== 132 (48 direct, 84 indirect) bytes in 1 blocks are definitely lost in loss record 15 of 28
==23665== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==23665== by 0x4A2612: sample_parse_expr (sample.c:876)
==23665== by 0x54DF84: parse_store (vars.c:766)
==23665== by 0x528BDF: parse_http_req_cond (http_rules.c:95)
==23665== by 0x469F36: cfg_parse_listen (cfgparse-listen.c:1339)
==23665== by 0x459E33: readcfgfile (cfgparse.c:2167)
==23665== by 0x5074FD: init (haproxy.c:2021)
==23665== by 0x418262: main (haproxy.c:3126)
==23665==
==23665== LEAK SUMMARY:
==23665== definitely lost: 48 bytes in 1 blocks
==23665== indirectly lost: 84 bytes in 2 blocks
==23665== possibly lost: 0 bytes in 0 blocks
==23665== still reachable: 313,876 bytes in 84 blocks
==23665== suppressed: 0 bytes in 0 blocks
==23665== Reachable blocks (those to which a pointer was found) are not shown.
==23665== To see them, rerun with: --leak-check=full --show-leak-kinds=all
==23665==
==23665== For counts of detected and suppressed errors, rerun with: -v
==23665== ERROR SUMMARY: 1 errors from 1 contexts (suppressed: 0 from 0)
After this patch is applied the leak is gone as expected.
This is a very minor leak that can only be observed if deinit() is called,
shortly before the OS will free all memory of the process anyway. No
backport needed.
Cleanly deinit() after a configuration check in particular, to clean up
the output of valgrind, which reports "possible losses" without a deinit()
and does not with one; this converts actual losses into proper hard losses,
which makes the whole output easier to analyze.
As an example, given an example configuration of the following:
frontend foo
bind *:8080
mode http
Running `haproxy -c -f cfg` within valgrind will report 4 possible losses:
$ valgrind --leak-check=full ./haproxy -c -f ./example.cfg
==21219== Memcheck, a memory error detector
==21219== Copyright (C) 2002-2015, and GNU GPL'd, by Julian Seward et al.
==21219== Using Valgrind-3.11.0 and LibVEX; rerun with -h for copyright info
==21219== Command: ./haproxy -c -f ./example.cfg
==21219==
[WARNING] 165/001100 (21219) : config : missing timeouts for frontend 'foo'.
| While not properly invalid, you will certainly encounter various problems
| with such a configuration. To fix this, please ensure that all following
| timeouts are set to a non-zero value: 'client', 'connect', 'server'.
Warnings were found.
Configuration file is valid
==21219==
==21219== HEAP SUMMARY:
==21219== in use at exit: 1,436,631 bytes in 130 blocks
==21219== total heap usage: 153 allocs, 23 frees, 1,447,758 bytes allocated
==21219==
==21219== 7 bytes in 1 blocks are possibly lost in loss record 5 of 54
==21219== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==21219== by 0x5726489: strdup (strdup.c:42)
==21219== by 0x468FD9: bind_conf_alloc (listener.h:158)
==21219== by 0x468FD9: cfg_parse_listen (cfgparse-listen.c:557)
==21219== by 0x459DF3: readcfgfile (cfgparse.c:2167)
==21219== by 0x5056CD: init (haproxy.c:2021)
==21219== by 0x418232: main (haproxy.c:3121)
==21219==
==21219== 14 bytes in 1 blocks are possibly lost in loss record 9 of 54
==21219== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==21219== by 0x5726489: strdup (strdup.c:42)
==21219== by 0x468F9B: bind_conf_alloc (listener.h:154)
==21219== by 0x468F9B: cfg_parse_listen (cfgparse-listen.c:557)
==21219== by 0x459DF3: readcfgfile (cfgparse.c:2167)
==21219== by 0x5056CD: init (haproxy.c:2021)
==21219== by 0x418232: main (haproxy.c:3121)
==21219==
==21219== 128 bytes in 1 blocks are possibly lost in loss record 35 of 54
==21219== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==21219== by 0x468F90: bind_conf_alloc (listener.h:152)
==21219== by 0x468F90: cfg_parse_listen (cfgparse-listen.c:557)
==21219== by 0x459DF3: readcfgfile (cfgparse.c:2167)
==21219== by 0x5056CD: init (haproxy.c:2021)
==21219== by 0x418232: main (haproxy.c:3121)
==21219==
==21219== 608 bytes in 1 blocks are possibly lost in loss record 46 of 54
==21219== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==21219== by 0x4B953A: create_listeners (listener.c:576)
==21219== by 0x4578F6: str2listener (cfgparse.c:192)
==21219== by 0x469039: cfg_parse_listen (cfgparse-listen.c:568)
==21219== by 0x459DF3: readcfgfile (cfgparse.c:2167)
==21219== by 0x5056CD: init (haproxy.c:2021)
==21219== by 0x418232: main (haproxy.c:3121)
==21219==
==21219== LEAK SUMMARY:
==21219== definitely lost: 0 bytes in 0 blocks
==21219== indirectly lost: 0 bytes in 0 blocks
==21219== possibly lost: 757 bytes in 4 blocks
==21219== still reachable: 1,435,874 bytes in 126 blocks
==21219== suppressed: 0 bytes in 0 blocks
==21219== Reachable blocks (those to which a pointer was found) are not shown.
==21219== To see them, rerun with: --leak-check=full --show-leak-kinds=all
==21219==
==21219== For counts of detected and suppressed errors, rerun with: -v
==21219== ERROR SUMMARY: 4 errors from 4 contexts (suppressed: 0 from 0)
Re-running the same command with the patch applied will not report any
losses any more:
$ valgrind --leak-check=full ./haproxy -c -f ./example.cfg
==22124== Memcheck, a memory error detector
==22124== Copyright (C) 2002-2015, and GNU GPL'd, by Julian Seward et al.
==22124== Using Valgrind-3.11.0 and LibVEX; rerun with -h for copyright info
==22124== Command: ./haproxy -c -f ./example.cfg
==22124==
[WARNING] 165/001503 (22124) : config : missing timeouts for frontend 'foo'.
| While not properly invalid, you will certainly encounter various problems
| with such a configuration. To fix this, please ensure that all following
| timeouts are set to a non-zero value: 'client', 'connect', 'server'.
Warnings were found.
Configuration file is valid
==22124==
==22124== HEAP SUMMARY:
==22124== in use at exit: 313,864 bytes in 82 blocks
==22124== total heap usage: 153 allocs, 71 frees, 1,447,758 bytes allocated
==22124==
==22124== LEAK SUMMARY:
==22124== definitely lost: 0 bytes in 0 blocks
==22124== indirectly lost: 0 bytes in 0 blocks
==22124== possibly lost: 0 bytes in 0 blocks
==22124== still reachable: 313,864 bytes in 82 blocks
==22124== suppressed: 0 bytes in 0 blocks
==22124== Reachable blocks (those to which a pointer was found) are not shown.
==22124== To see them, rerun with: --leak-check=full --show-leak-kinds=all
==22124==
==22124== For counts of detected and suppressed errors, rerun with: -v
==22124== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 0 from 0)
It might be worth investigating what exactly HAProxy does to lose pointers
to the start of those 4 memory areas and then to be able to still free them
during deinit(). If HAProxy is able to free them, they ideally should be
"still reachable" and not "possibly lost".
The allocation did not account for either the trailing null byte or the
space, leading to a buffer overwrite.
This bug was detected by an assertion failure in the allocator. But can
also be easily detected using valgrind:
==25827== Invalid write of size 1
==25827== at 0x6529759: __vsprintf_chk (vsprintf_chk.c:84)
==25827== by 0x65296AC: __sprintf_chk (sprintf_chk.c:31)
==25827== by 0x4D6AB7: sprintf (stdio2.h:33)
==25827== by 0x4D6AB7: proxy_parse_smtpchk_opt (check.c:1799)
==25827== by 0x4A7DDD: cfg_parse_listen (cfgparse-listen.c:2269)
==25827== by 0x494AD3: readcfgfile (cfgparse.c:2167)
==25827== by 0x542995: init (haproxy.c:2021)
==25827== by 0x421DD2: main (haproxy.c:3121)
==25827== Address 0x78712a8 is 0 bytes after a block of size 24 alloc'd
==25827== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==25827== by 0x4D6A8C: proxy_parse_smtpchk_opt (check.c:1797)
==25827== by 0x4A7DDD: cfg_parse_listen (cfgparse-listen.c:2269)
==25827== by 0x494AD3: readcfgfile (cfgparse.c:2167)
==25827== by 0x542995: init (haproxy.c:2021)
==25827== by 0x421DD2: main (haproxy.c:3121)
This patch fixes issue #681.
This bug was introduced in commit fbcc77c6ba,
which first appeared in 2.2-dev7. No backport needed.
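As a generic illustration of the sizing rule that was violated (the
variable names are not from the actual code):

    /* reserve room for the separating space AND the trailing NUL */
    char *cmd = malloc(strlen(base) + 1 + strlen(arg) + 1);

    if (cmd)
        sprintf(cmd, "%s %s", base, arg);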
When building with gcc-8 -fsanitize=address, we get this warning once on
an strncpy() call in proto_uxst.c:
src/proto_uxst.c:262:3: warning: 'strncpy' output may be truncated copying 107 bytes from a string of length 4095 [-Wstringop-truncation]
strncpy(addr.sun_path, tempname, sizeof(addr.sun_path) - 1);
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
It happens despite the test on snprintf() at the top (since gcc's string
handling is totally empiric), and requires the strlen() test to be placed
"very close" to the strncpy() call (with "very close" yet to be determined).
There's no other way to shut this one except disabling it. Given there's
only one instance of this warning and the cost of dealing with it in the
code is not huge, let's decorate the code to make gcc happily believe it
is smart since it seems to have a mind of itself.
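A sketch of how such a warning can be silenced locally, assuming the
diagnostic pragma approach (the actual decoration used in the code may
differ):

    #pragma GCC diagnostic push
    #pragma GCC diagnostic ignored "-Wstringop-truncation"
        strncpy(addr.sun_path, tempname, sizeof(addr.sun_path) - 1);
    #pragma GCC diagnostic pop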
In bug #676, it was reported that ssl-min-ver SSLv3 does not work in
Amazon environments with OpenSSL 1.0.2.
The reason for this is a patch of Amazon OpenSSL which sets
SSL_OP_NO_SSLv3 in SSL_CTX_new(). Which is kind of a problem with our
implementation of ssl-{min,max}-ver in old openSSL versions, because it
does not try to clear existing version flags.
This patch fixes the bug by cleaning versions flags known by HAProxy in
the SSL_CTX before applying the right ones.
Should be backported as far as 1.8.
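A hedged sketch of the cleaning step (ctx is the SSL_CTX being configured;
the exact set of flags cleared by the real patch may differ):

    long ver_options = SSL_OP_NO_SSLv3 | SSL_OP_NO_TLSv1 |
                       SSL_OP_NO_TLSv1_1 | SSL_OP_NO_TLSv1_2;

    SSL_CTX_clear_options(ctx, ver_options);
    SSL_CTX_set_options(ctx, options_from_min_max_ver); /* computed flags */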
Getting rid of this warning is more cleanly solved using a 'fall through'
comment, because it clarifies the intent to a human reader.
This patch adjusts a few places that cause -Wimplicit-fallthrough to trigger:
- Fix typos in the comments.
- Remove redundant 'no break' comments that trip up gcc.
- Move the comment out of the block when the 'case' is completely surrounded
by braces.
- Add comments where I could determine that the fall through was intentional.
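For illustration, the style of comment gcc recognizes (function and state
names are made up):

    switch (state) {
    case STATE_INIT:
        prepare();
        /* fall through */
    case STATE_RUN:
        run();
        break;
    default:
        break;
    }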
Changes tested on
gcc (Debian 9.3.0-13) 9.3.0
Copyright (C) 2019 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
using
make -j4 all TARGET=linux-glibc USE_OPENSSL=1 USE_LUA=1 USE_ZLIB=1 USE_PCRE2=1 USE_PCRE2_JIT=1 USE_GETADDRINFO=1
Commit b5997f740 ("MAJOR: threads/map: Make acls/maps thread safe")
introduced a subtle bug in the pattern matching code. In order to cope
with the possibility that another thread might be modifying the pattern's
sample_data while it's being used, we return a thread-local static
sample_data which is a copy of the one found in the matched pattern. The
copy is performed depending on the sample_data's type. But the switch
statement misses some breaks and doesn't set the new sample_data pointer
at the right place, resulting in the original sample_data being restored
at the end before returning.
The net effect overall is that the correct sample_data is returned (hence
functionally speaking the matching works fine) but it's not thread-safe
so any del_map() or set_map() action could modify the pattern on one
thread while it's being used on another one. It doesn't seem likely to
cause a crash but could result in corrupted data appearing where the
value is consumed (e.g. when appended in a header or when logged) or an
ACL occasionally not matching after a map lookup.
This fix should be backported as far as 1.8.
Thanks to Tim for reporting it and to Emeric for the analysis.
In issue #648 a second problem was reported, indicating that some users
mistakenly send the log to an FD mapped on a file. This situation doesn't
even enable O_NONBLOCK and results in huge access times in the order of
milliseconds with the lock held and other threads waiting till the
watchdog fires to unblock the situation.
The problem with files is that O_NONBLOCK is ignored, and we still need
to lock otherwise we can end up with interleaved log messages.
What this patch does is different. Instead of locking all writers, it
uses a trylock so that there's always at most one logger and that other
candidates can simply give up and report a failure, just as would happen
if writev() returned -1 due to a pipe full condition. This solution is
elegant because it gives back the control to haproxy to decide to give
up when it takes too much time, while previously it was the kernel that
used to block the syscall.
However at high log rates (500000 req/s) there were up to 50% dropped logs
due to the contention on the lock. In order to address this, we try to
grab the lock up to 200 times and call ha_thread_relax() on failure. This
results in almost no failure (no more than previously with O_NONBLOCK). A
typical test with 6 competing threads writing to stdout chained to a pipe
to a single process shows around 1000 drops for 10 million logs at 480000
lines per second.
Please note that this doesn't mean that writing to a blocking FD is a good
idea, and it might only be temporarily done on testing environments for
debugging. A file or a terminal will continue to block the writing thread
while others spin a little bit and lose their logs, but the writing thread
will still experience performance-killing latencies.
This patch should be backported to 2.1 and 2.0. The code is in log.c in
2.0, but the principle is the same.
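A hedged sketch of the bounded trylock (lock and counter names are
illustrative; HA_SPIN_TRYLOCK is assumed to return non-zero when the lock
could not be acquired):

    int retries = 200;

    while (HA_SPIN_TRYLOCK(LOGSRV_LOCK, &log_lock) != 0) {
        if (!--retries) {
            dropped++;          /* give up, as if the pipe were full */
            return;
        }
        ha_thread_relax();      /* let the current holder make progress */
    }
    sent = writev(fd, iovec, nvec);
    HA_SPIN_UNLOCK(LOGSRV_LOCK, &log_lock);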
Apparently Cygwin requires sys/types.h before netinet/tcp.h but doesn't
include it by itself, as shown here:
https://github.com/haproxy/haproxy/actions/runs/131943890
This patch makes sure it's always present, which is in server.c and
the SPOA example.
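Concretely, the include order simply becomes:

    #include <sys/types.h>     /* required before netinet/tcp.h on Cygwin */
    #include <netinet/tcp.h>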
The set of files proto_udp.{c,h} were misleadingly named, as they do not
provide anything related to the UDP protocol but to datagram handling
instead, since currently all UDP processing is hard-coded where it's used
(dns, logs). They are to UDP what connection.{c,h} are to proto_tcp. This
was causing confusion about how to insert UDP socket management code,
so let's rename them right now to dgram.{c,h} which more accurately
matches what's inside since every function and type is already prefixed
with "dgram_".
This patch fixes all the leftovers from the include cleanup campaign. There
were not that many (~400 entries in ~150 files) but it was definitely worth
doing it as it revealed a few duplicates.
Since these are used as type attributes or conditional clauses, they
are used about everywhere and should not require a dependency on
thread.h. Moving them to compiler.h along with other similar statements
like ALIGN() etc looks more logical; this way they become part of the
base API. This allowed thread-t.h to be removed from ~12 files; one was
found to only require thread-t.h and not thread.h, and dict.c was found to
require thread.h.
That's already where MAX_PROCS is set, and we already handle the case of
the default value so there is no reason for placing it in thread.h given
that most call places don't need the rest of the threads definitions. The
include was removed from global-t.h and activity.c.
Most of the files dealing with error reports have to include log.h in order
to access ha_alert(), ha_warning() etc. But while these functions don't
depend on anything, log.h depends on a lot of stuff because it deals with
log-formats and samples. As a result it's impossible not to embark long
dependencies when using ha_warning() or qfprintf().
This patch moves these low-level functions to errors.h, which already
defines the error codes used at the same places. About half of the users
of log.h could be adjusted, sometimes revealing other issues such as
missing tools.h. Interestingly the total preprocessed size shrank by
4%.
Checks.c remains one of the largest files of the project and it contains
too many things. The tcpchecks code represents half of this file, and
both parts are relatively isolated, so let's move it away into its own
file. We now have tcpcheck.c, tcpcheck{,-t}.h.
Doing so required to export quite a number of functions because check.c
has almost everything made static, which really doesn't help to split!
check.c is one of the largest files and contains too many things. The
e-mail alerting code is stored there while nothing is in mailers.c.
Let's move this code out. That's only 4% of the code but a good start.
In order to do so, a few tcp-check functions had to be exported.
When building contrib/hpack there is a warning about an unused static
function. Actually it makes no sense to make it static, instead it must
be regularly exported. Similarly there is hpack_dht_get_tail() which is
inlined in the C file and which would make more sense with all other ones
in the H file.
There's no point splitting the file in two since only cfgparse uses the
types defined there. A few call places were updated and cleaned up. All
of them were in C files which register keywords.
There is nothing left in common/ now so this directory must not be used
anymore.
This one was not easy because it was embarking many includes with it,
which other files would automatically find. At least global.h, arg.h
and tools.h were identified. 93 total locations were identified, 8
additional includes had to be added.
In the rare files where it was possible to finalize the sorting of
includes by adjusting only one or two extra lines, it was done. But
all files would need to be rechecked and cleaned up now.
It was the last set of files in types/ and proto/ and these directories
must not be reused anymore.
extern struct dict server_name_dict was moved from the type file to the
main file. A handful of inlined functions were moved at the bottom of
the file. Call places were updated to use server-t.h when relevant, or
to simply drop the entry when not needed.
The files remained mostly unchanged since they were OK. However, half of
the users didn't need to include them, and about as many actually needed
to have it and used to find functions like srv_currently_usable() through
a long chain that broke when moving the file.
This one is particularly difficult to split because it provides all the
functions used to manipulate a proxy state and to retrieve names or IDs
for error reporting, and as such, it was included in 73 files (down to
68 after cleanup). It would deserve a small cleanup though the cut points
are not obvious at the moment given the number of structs involved in
the struct proxy itself.
The current state of the logging is a real mess. The main problem is
that almost all files include log.h just in order to have access to
the alert/warning functions like ha_alert() etc, and don't care about
logs. But log.h also deals with real logging as well as log-format and
depends on stream.h and various other things. As such it forces a few
heavy files like stream.h to be loaded early and to hide missing
dependencies depending where it's loaded. Among the missing ones is
syslog.h which was often automatically included resulting in no less
than 3 users missing it.
Among 76 users, only 5 could be removed, and probably 70 don't need the
full set of dependencies.
A good approach would consist in splitting that file in 3 parts:
- one for error output ("errors" ?).
- one for log_format processing
- and one for actual logging.
It was moved without any change; however, many callers didn't need it at
all. This was a consequence of the split of proto_http.c into several
parts that resulted in many locations to still reference it.
Almost no change except moving the cli_kw struct definition after the
defines. Almost all users had both types&proto included, which is not
surprising since this code is old and it used to be the norm a decade
ago. These places were cleaned.
Just some minor reordering, and the usual cleanup of call places for
those which didn't need it. We don't include the whole tools.h into
stats-t anymore but just tools-t.h.
The type file was slightly tidied. The cli-specific APPCTX_CLI_ST1_* flag
definitions were moved to cli.h. The type file was adjusted to include
buf-t.h and not the huge buf.h. A few call places were fixed because they
did not need this include.
Initially it looked like this could have been placed into auth.h or
stats.h but it's not the case as it's what makes the link between them
and the HTTP layer. However the file needed to be split in two. Quite
a number of call places were dropped because these were mostly leftovers
from the early days where the stats and cli were packed together.
The files were moved almost as-is, just dropping arg-t and auth-t from
acl-t but keeping arg-t in acl.h. It was useful to revisit the call places
since a handful of files used to continue to include acl.h while they did
not need it at all. Struct stream was only made a forward declaration
since not otherwise needed.
The stktable_types[] array declaration was moved to the main file as
it had nothing to do in the types. A few declarations were reordered
in the types file so that defines were before the structs. Thread-t
was added since there are a few __decl_thread(). The loss of peers.h
revealed that cfgparse-listen needed it.
The cfg_peers external declaration was moved to the main file instead
of the type one. A few types were still missing from the proto, causing
warnings in the function prototypes (proxy, stick_table).
All includes that were not absolutely necessary were removed because
checks.h happens to very often be part of dependency loops. A warning
was added about this in check-t.h. The fields, enums and structs were
a bit tidied because it's particularly tedious to find anything there.
It would make sense to split this in two or more files (at least
extract tcp-checks).
The file was renamed to the singular because it was one of the rare
exceptions to have an "s" appended to its name compared to the struct
name.
The type file is becoming a mess, half of it is for the proxy protocol,
another good part describes conn_streams and mux ops, it would deserve
being split again. At least it was reordered so that elements are easier
to find, with the PP-stuff left at the end. The MAX_SEND_FD macro was moved
to compat.h as it's said to be the value for Linux.
The TASK_IS_TASKLET() macro was moved to the proto file instead of the
type one. The proto part was a bit reordered to remove a number of ugly
forward declarations of static inline functions. About ten C and H
files had their dependency dropped since they were not using anything
from task.h.
global.h was one of the messiest files, it has accumulated tons of
implicit dependencies and declares many globals that make almost all
other file include it. It managed to silence a dependency loop between
server.h and proxy.h by being well placed to pre-define the required
structs, forcing struct proxy and struct server to be forward-declared
in a significant number of files.
It was split in two: one part holding the global struct definition and the
few macros and flags, and the rest containing the function prototypes.
The UNIX_MAX_PATH definition was moved to compat.h.
There is no C file for this one, the code was placed into sample.c which
thus has a dependency on this file which itself includes sample.h. Probably
that it would be wise to split that later.
This one is particularly tricky to move because everyone uses it
and it depends on a lot of other types. For example it cannot include
arg-t.h and must absolutely only rely on forward declarations to avoid
dependency loops between vars -> sample_data -> arg. In order to address
this one, it would be nice to split the sample_data part out of sample.h.
There's no type file, it only contains fetch_rdp_cookie_name() and
val_payload_lv() which probably ought to move somewhere else instead
of staying there.
It was moved as-is, except for extern declaration of pattern_reference.
A few C files used to include it but didn't need it anymore after having
been split apart so this was cleaned.
One function prototype makes reference to struct mworker_proc which was
not defined there but in global.h instead. This definition, along with
the PROC_O_* fields were moved to mworker-t.h instead.
The file mostly contained struct definitions but there was also a
variable export. Most of the stuff currently lies in checks.h and
should definitely move here!
The STATS_DEFAULT_REALM and STATS_DEFAULT_URI were moved to defaults.h.
It was required to include types/pattern.h and types/sample.h since they
are mentioned in function prototypes.
It would be wise to merge this with uri_auth.h later.
List.h was missing for LIST_ADDQ(). A few unneeded includes of action.h
were removed from certain files.
This one still relies on applet.h and stick-table.h.
A few includes had to be added, namely list-t.h in the type file and
types/proxy.h in the proto file. actions.h was including http-htx.h
but didn't need it so it was dropped.
The sink files could be moved with almost no change at all since they
didn't rely on anything fancy. ssize_t required sys/types.h and
thread.h was needed for the locks.
A few includes were missing in each file. A definition of
struct polled_mask was moved to fd-t.h. The MAX_POLLERS macro was
moved to defaults.h
Stdio used to be silently inherited from whatever path but it's needed
for list_pollers() which takes a FILE* and which can thus not be
forward-declared.