haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-20 06:01:23 +02:00

Author	SHA1	Message	Date
Willy Tarreau	5db847ab65	CLEANUP: ssl: remove 57 occurrences of useless tests on LIBRESSL_VERSION_NUMBER They were all check to comply with the advertised openssl version. Now that libressl doesn't pretend to be a more recent openssl anymore, we can simply rely on the regular openssl version tests without having to deal with exceptions for libressl.	2019-05-09 14:26:39 +02:00
Willy Tarreau	9a1ab08160	CLEANUP: ssl-sock: use HA_OPENSSL_VERSION_NUMBER instead of OPENSSL_VERSION_NUMBER Most tests on OPENSSL_VERSION_NUMBER have become complex and break all the time because this number is fake for some derivatives like LibreSSL. This patch creates a new macro, HA_OPENSSL_VERSION_NUMBER, which will carry the real openssl version defining the compatibility level, and this version will be adjusted depending on the variants.	2019-05-09 14:25:43 +02:00
Willy Tarreau	034c88cf03	MEDIUM: tcp: add the "tfo" option to support TCP fastopen on the server This implements support for the new API which relies on a call to setsockopt(). On systems that support it (currently, only Linux >= 4.11), this enables using TCP fast open when connecting to server. Please note that you should use the retry-on "conn-failure", "empty-response" and "response-timeout" keywords, or the request won't be able to be retried on failure. Co-authored-by: Olivier Houchard <ohouchard@haproxy.com>	2019-05-06 22:29:39 +02:00
Ilya Shipitsin	0c50b1ecbb	BUG/MEDIUM: servers: fix typo "src" instead of "srv" When copying the settings for all servers when using server templates, fix a typo, or we would never copy the length of the ALPN to be used for checks. This should be backported to 1.9.	2019-04-30 23:04:47 +02:00
Olivier Houchard	88698d966d	MEDIUM: connections: Add a way to control the number of idling connections. As by default we add all keepalive connections to the idle pool, if we run into a pathological case, where all client don't do keepalive, but the server does, and haproxy is configured to only reuse "safe" connections, we will soon find ourself having lots of idling, unusable for new sessions, connections, while we won't have any file descriptors available to create new connections. To fix this, add 2 new global settings, "pool_low_ratio" and "pool_high_ratio". pool-low-fd-ratio is the % of fds we're allowed to use (against the maximum number of fds available to haproxy) before we stop adding connections to the idle pool, and destroy them instead. The default is 20. pool-high-fd-ratio is the % of fds we're allowed to use (against the maximum number of fds available to haproxy) before we start killing idling connection in the event we have to create a new outgoing connection, and no reuse is possible. The default is 25.	2019-04-18 19:52:03 +02:00
Willy Tarreau	0e492e2ad0	BUILD: address a few cases of "static <type> inline foo()" Older compilers don't like to see "inline" placed after the type in a function declaration, it must be "static inline <type>" only. This patch touches various areas. The warnings were seen with gcc-3.4.	2019-04-15 21:55:48 +02:00
Christopher Faulet	73c1207c71	MINOR: muxes: Pass the context of the mux to destroy() instead of the connection It is mandatory to handle mux upgrades, because during a mux upgrade, the connection will be reassigned to another multiplexer. So when the old one is destroyed, it does not own the connection anymore. Or in other words, conn->ctx does not point to the old mux's context when its destroy() callback is called. So we now rely on the multiplexer context do destroy it instead of the connection. In addition, h1_release() and h2_release() have also been updated in the same way.	2019-04-12 22:06:53 +02:00
Willy Tarreau	c912f94b57	MINOR: server: remove a few unneeded LIST_INIT calls after LIST_DEL_LOCKED Since LIST_DEL_LOCKED() and LIST_POP_LOCKED() now automatically reinitialize the removed element, there's no need for keeping this LIST_INIT() call in the idle connection code.	2019-02-28 16:08:54 +01:00
Olivier Houchard	9ea5d361ae	MEDIUM: servers: Reorganize the way idle connections are cleaned. Instead of having one task per thread and per server that does clean the idling connections, have only one global task for every servers. That tasks parses all the servers that currently have idling connections, and remove half of them, to put them in a per-thread list of connections to kill. For each thread that does have connections to kill, wake a task to do so, so that the cleaning will be done in the context of said thread.	2019-02-26 18:17:32 +01:00
Olivier Houchard	f131481a0a	BUG/MEDIUM: servers: Add a per-thread counter of idle connections. Add a per-thread counter of idling connections, and use it to determine how many connections we should kill after the timeout, instead of using the global counter, or we're likely to just kill most of the connections. This should be backported to 1.9.	2019-02-21 19:07:45 +01:00
Willy Tarreau	980855bd95	BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end This also depends on the nbthread count, so it must only be performed after parsing the whole config file. As a side effect, this removes some code duplication between servers and server-templates. This must be backported to 1.9.	2019-02-07 15:08:13 +01:00
Willy Tarreau	835daa119e	BUG/MEDIUM: server: initialize the idle conns list after parsing the config The idle conns lists are sized according to the number of threads. As such they cannot be initialized during the parsing since nbthread can be set later, as revealed by this simple config which randomly crashes when used. Let's do this at the end instead. listen proxy bind :4445 mode http timeout client 10s timeout server 10s timeout connect 10s http-reuse always server s1 127.0.0.1:8000 global nbthread 8 This fix must be backported to 1.9 and 1.8.	2019-02-07 15:08:13 +01:00
Willy Tarreau	9c538e01c2	MINOR: server: add a max-reuse parameter Some servers may wish to limit the total number of requests they execute over a connection because some of their components might leak resources. In HTTP/1 it was easy, they just had to emit a "connection: close" header field with the last response. In HTTP/2, it's less easy because the info is not always shared with the component dealing with the H2 protocol and it could be harder to advertise a GOAWAY with a stream limit. This patch provides a solution to this by adding a new "max-reuse" parameter to the server keyword. This parameter indicates how many times an idle connection may be reused for new requests. The information is made available and the underlying muxes will be able to use it at will. This patch should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	15c120d251	CLEANUP: server: fix indentation mess on idle connections Apparently some code was moved around leaving the inner block incorrectly indented and with the closing brace in the middle of nowhere.	2019-01-24 19:06:43 +01:00
Willy Tarreau	cb923d5001	MINOR: server: make sure pool-max-conn is >= -1 The keyword parser doesn't check the value range, but supported values are -1 and positive values, thus we should check it. This can be backported to 1.9.	2019-01-24 16:31:56 +01:00
J�r�me Magnin	f57afa453a	BUG/MINOR: server: don't always trust srv_check_health when loading a server state When we load health values from a server state file, make sure what we assign to srv->check.health actually matches the state we restore. This should be backported as far as 1.6.	2019-01-21 11:09:03 +01:00
Willy Tarreau	1ba32032ef	BUG/MEDIUM: checks: fix recent regression on agent-check making it crash In order to address the mailers issues, we needed to store the proxy into the checks struct, which was done by commit c98aa1f18 ("MINOR: checks: Store the proxy in checks."). However this one did it only for the health checks and not for the agent checks, resulting in an immediate crash when the agent is enabled on a random config like this one : listen agent bind :8000 server s1 255.255.255.255:1 agent-check agent-port 1 Thanks to Seri Kim for reporting it and providing a reproducer in issue #20. This fix must be backported to 1.9.	2019-01-21 07:48:26 +01:00
Fr�d�ric L�caille	355b2033ec	MINOR: cfgparse: SSL/TLS binding in "peers" sections. Make "bind" keywork be supported in "peers" sections. All "bind" settings are supported on this line. Add "default-bind" option to parse the binding options excepted the bind address. Do not parse anymore the bind address for local peers on "server" lines. Do not use anymore list_for_each_entry() to set the "peers" section listener parameters because there is only one listener by "peers" section. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	c06b5d4f74	MINOR: cfgparse: Make "peer" lines be parsed as "server" lines. With this patch "default-server" lines are supported in "peers" sections to setup the default settings of peers which are from now setup when parsing both "peer" and "server" lines. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Olivier Houchard	c98aa1f182	MINOR: checks: Store the proxy in checks. Instead of assuming we have a server, store the proxy directly in struct check, and use it instead of s->server. This should be a no-op for now, but will be useful later when we change mail checks to avoid having a server. This should be backported to 1.9.	2019-01-14 11:15:11 +01:00
Daniel Corbett	43bb842a08	BUG/MEDIUM: init: Initialize idle_orphan_conns for first server in server-template When initializing server-template all of the servers after the first have srv->idle_orphan_conns initialized within server_template_init() The first server does not have this initialized and when http-reuse is active this causes a segmentation fault when accessed from srv_add_to_idle_list(). This patch removes the check for srv->tmpl_info.prefix within server_finalize_init() and allows the first server within a server-template to have srv->idle_orphan_conns properly initialized. This should be backported to 1.9.	2019-01-09 14:45:21 +01:00
Olivier Houchard	921501443b	MEDIUM: checks: Add check-alpn. Add a way to configure the ALPN used by check, with a new "check-alpn" keyword. By default, the checks will use the server ALPN, but it may not be convenient, for instance because the server may use HTTP/2, while checks are unable to do HTTP/2 yet.	2018-12-21 19:54:16 +01:00
Olivier Houchard	21944019ca	BUG/MEDIUM: server: Also copy "check-sni" for server templates. When using server templates, if "check-sni" is used, make sure it shows up in all the created servers. This should be backported to 1.8 and 1.9.	2018-12-21 19:53:28 +01:00
Olivier Houchard	b7b3faa79c	MEDIUM: servers: Replace idle-timeout with pool-purge-delay. Instead of the old "idle-timeout" mechanism, add a new option, "pool-purge-delay", that sets the delay before purging idle connections. Each time the delay happens, we destroy half of the idle connections.	2018-12-15 23:50:09 +01:00
Olivier Houchard	006e3101f9	MEDIUM: servers: Add a command to limit the number of idling connections. Add a new command, "pool-max-conn" that sets the maximum number of connections waiting in the orphan idling connections list (as activated with idle-timeout). Using "-1" means unlimited. Using pools is now dependant on this.	2018-12-15 23:50:08 +01:00
Olivier Houchard	0c18a6fe34	MEDIUM: servers: Add a way to keep idle connections alive. Add a new keyword for servers, "idle-timeout". If set, unused connections are kept alive until the timeout happens, and will be picked for reuse if no other connection is available.	2018-12-02 18:16:53 +01:00
Willy Tarreau	76a551de2e	MINOR: config: make sure to associate the proper mux to bind and servers Currently a mux may be forced on a bind or server line by specifying the "proto" keyword. The problem is that the mux may depend on the proxy's mode, which is not known when parsing this keyword, so a wrong mux could be picked. Let's simply update the mux entry while checking its validity. We do have the name and the side, we only need to see if a better mux fits based on the proxy's mode. It also requires to remove the side check while parsing the "proto" keyword since a wrong mux could be picked. This way it becomes possible to declare multiple muxes with the same protocol names and different sides or modes.	2018-12-02 13:29:35 +01:00
Willy Tarreau	0108d90c6c	MEDIUM: init: convert all trivial registration calls to initcalls This switches explicit calls to various trivial registration methods for keywords, muxes or protocols from constructors to INITCALL1 at stage STG_REGISTER. All these calls have in common to consume a single pointer and return void. Doing this removes 26 constructors. The following calls were addressed : - acl_register_keywords - bind_register_keywords - cfg_register_keywords - cli_register_kw - flt_register_keywords - http_req_keywords_register - http_res_keywords_register - protocol_register - register_mux_proto - sample_register_convs - sample_register_fetches - srv_register_keywords - tcp_req_conn_keywords_register - tcp_req_cont_keywords_register - tcp_req_sess_keywords_register - tcp_res_cont_keywords_register - flt_register_keywords	2018-11-26 19:50:32 +01:00
Olivier Houchard	c756600103	MINOR: server: Add "alpn" and "npn" keywords. Add new keywords to "server" lines, alpn and npn. If set, when connecting through SSL, those alpn/npn will be negociated during the SSL handshake.	2018-11-22 19:50:08 +01:00
Joseph Herlant	44466826b1	CLEANUP: fix a few typos in the comments of the server subsystem A few misspells where detected in the server subsystem. This commit fixes them.	2018-11-18 22:23:15 +01:00
Willy Tarreau	db398435aa	MINOR: stream-int: replace si_cant_put() with si_rx_room_{blk,rdy}() Remaining calls to si_cant_put() were all for lack of room and were turned to si_rx_room_blk(). A few places where SI_FL_RXBLK_ROOM was cleared by hand were converted to si_rx_room_rdy(). The now unused si_cant_put() function was removed.	2018-11-18 21:41:50 +01:00
Willy Tarreau	0cd3bd628a	MINOR: stream-int: rename si_applet_{want\|stop\|cant}_{get\|put} It doesn't make sense to limit this code to applets, as any stream interface can use it. Let's rename it by simply dropping the "applet_" part of the name. No other change was made except updating the comments.	2018-11-11 10:18:37 +01:00
William Lallemand	313bfd18c1	MINOR: server: export new_server() function The new_server() function will be useful to create a proxy for the master-worker.	2018-10-28 13:51:38 +01:00
Willy Tarreau	5dfb6c4cc9	CLEANUP: state-file: make the path concatenation code a bit more consistent There are as many ways to build the globalfilepathlen variable as branches in the if/then/else, creating lots of confusion. Address the most obvious parts, but some polishing definitely is still needed.	2018-10-16 19:26:12 +02:00
Olivier Houchard	17f8b90736	MINOR: server: Use memcpy() instead of strncpy(). Use memcpy instead of strncpy, strncpy buys us nothing, and gcc is being annoying.	2018-10-16 19:22:20 +02:00
Dirkjan Bussink	415150f764	MEDIUM: ssl: add support for ciphersuites option for TLSv1.3 OpenSSL released support for TLSv1.3. It also added a separate function SSL_CTX_set_ciphersuites that is used to set the ciphers used in the TLS 1.3 handshake. This change adds support for that new configuration option by adding a ciphersuites configuration variable that works essentially the same as the existing ciphers setting. Note that it should likely be backported to 1.8 in order to ease usage of the now released openssl-1.1.1.	2018-10-08 19:20:13 +02:00
Fr�d�ric L�caille	5afb3cfbcc	BUG/MINOR: server: Crash when setting FQDN via CLI. This patch ensures that a DNS resolution may be launched before setting a server FQDN via the CLI. Especially, it checks that resolvers was set. A LEVEL 4 reg testing file is provided. Thanks to Lukas Tribus for having reported this issue. Must be backported to 1.8.	2018-09-12 07:41:41 +02:00
Baptiste Assmann	6d0f38f00d	BUG/MEDIUM: dns/server: fix incomatibility between SRV resolution and server state file Server state file has no indication that a server is currently managed by a DNS SRV resolution. And thus, both feature (DNS SRV resolution and server state), when used together, does not provide the expected behavior: a smooth experience... This patch introduce the "SRV record name" in the server state file and loads and applies it if found and wherever required. This patch applies to haproxy-dev branch only. For backport, a specific patch is provided for 1.8.	2018-09-04 17:40:22 +02:00
Willy Tarreau	49725a0977	BUG/MEDIUM: check/threads: do not involve the rendez-vous point for status updates thread_isolate() is currently being called with the server lock held. This is not acceptable because it prevents other threads from reaching the rendez-vous point. Now that the LB algos are thread-safe, let's get rid of this call. No backport is nedeed.	2018-08-21 19:54:09 +02:00
Willy Tarreau	3bcc2699ba	BUG/MEDIUM: cli/threads: protect some server commands against concurrent operations The server-specific CLI commands "set weight", "set maxconn", "disable agent", "enable agent", "disable health", "enable health", "disable server" and "enable server" were not protected against concurrent accesses. Now they take the server lock around the sensitive part. This patch must be backported to 1.8.	2018-08-21 15:35:31 +02:00
Willy Tarreau	46b7f53ad9	DOC: server/threads: document which functions need to be called with/without locks At the moment it's totally unclear while reading the server's code which functions require to be called with the server lock held and which ones grab it and cannot be called this way. This commit simply inventories all of them to indicate what is detected depending on how these functions use the struct server. Only functions used at runtime were checked, those dedicated to config parsing were skipped. Doing so already has uncovered a few bugs on some CLI actions.	2018-08-21 14:58:25 +02:00
Willy Tarreau	eeba36b3af	BUG/MEDIUM: server: update our local state before propagating changes Commit 3ff577e ("MAJOR: server: make server state changes synchronous again") reintroduced synchronous server state changes. However, during the previous change from synchronous to asynchronous, the server state propagation was placed at the end of the function to ease the code changes, and the commit above didn't put it back at its place. This has resulted in propagated states to be incomplete. For example, making a server leave maintenance would make it up but would leave its tracking servers down because they see their tracked server is still down. Let's just move the status update right to its place. It also adds the benefit of reporting state changes in the order they appear and not in reverse. No backport is needed.	2018-08-21 08:29:25 +02:00
Patrick Hemmer	0355dabd7c	MINOR: queue: replace the linked list with a tree We'll need trees to manage the queues by priorities. This change replaces the list with a tree based on a single key. It's effectively a list but allows us to get rid of the list management right now.	2018-08-10 15:06:27 +02:00
Christopher Faulet	8ed0a3e32a	MINOR: mux/server: Add 'proto' keyword to force the multiplexer's protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the server's definition.	2018-08-08 10:42:08 +02:00
Willy Tarreau	91c2826e1d	CLEANUP: server: remove the update list and the update lock These ones are not more used, let's get rid of them.	2018-08-08 09:57:45 +02:00
Willy Tarreau	3ff577e165	MAJOR: server: make server state changes synchronous again Now we try to synchronously push updates as they come using the new rdv point, so that the call to the server update function from the main poll loop is not needed anymore. It further reduces the apparent latency in the health checks as the response time almost always appears as 0 ms, resulting in a slightly higher check rate of ~1960 conn/s. Despite this, the CPU consumption has slightly dropped again to ~32% for the same test. The only trick is that the checks code is built with a bit of recursivity because srv_update_status() calls server_recalc_eweight(), and the latter needs to signal srv_update_status() in case of updates. Thus we added an extra argument to this function to indicate whether or not it must propagate updates (no if it comes from srv_update_status).	2018-08-08 09:57:45 +02:00
Willy Tarreau	3d3700f216	MEDIUM: checks: use the new rendez-vous point to spread check result The current sync point causes some important stress when a high number of threads is in use on a config with lots of checks, because it wakes up all threads every time a server state changes. A config like the following can easily saturate a 4-core machine reaching only 750 checks per second out of the ~2000 configured : global nbthread 4 defaults mode http timeout connect 5s timeout client 5s timeout server 5s frontend srv bind :8001 process 1/1 redirect location / if { method OPTIONS } { rand(100) ge 50 } stats uri / backend chk option httpchk server-template srv 1-100 127.0.0.1:8001 check rise 1 fall 1 inter 50 The reason is that the random on the fake server causes the responses to randomly match an HTTP check, and results in a lot of up/down events that are broadcasted to all threads. It's worth noting that the CPU usage already dropped by about 60% between 1.8 and 1.9 just due to the scheduler updates, but the sync point remains expensive. In addition, it's visible on the stats page that a lot of requests end up with an L7TOUT status in ~60ms. With smaller timeouts, it's even L4TOUT around 20-25ms. By not using THREAD_WANT_SYNC() anymore and only calling the server updates under thread_isolate(), we can avoid all these wakeups. The CPU usage on the same config drops to around 44% on the same machine, with all checks being delivered at ~1900 checks per second, and the stats page shows no more timeouts, even at 10 ms check interval. The difference is mainly caused by the fact that there's no more need to wait for a thread to wake up from poll() before starting to process check results.	2018-08-08 09:56:32 +02:00
Willy Tarreau	6a78e61694	BUG/MEDIUM: servers: check the queues once enabling a server Commit 64cc49c ("MAJOR: servers: propagate server status changes asynchronously.") heavily changed the way the server states are updated since they became asynchronous. During this change, some code was lost, which is used to shut down some sessions from a backup server and to pick pending connections from a proxy once a server is turned back from maintenance to ready state. The effect is that when temporarily disabling a server, connections stay in the backend's queue, and when re-enabling it, they are not picked and they expire in the backend's queue. Now they're properly picked again. This fix must be backported to 1.8.	2018-08-07 10:14:53 +02:00
Olivier Houchard	306e653331	BUG/MINOR: servers: Don't make "server" in a frontend fatal. When parsing the configuration, if "server", "default-server" or "server-template" are found in a frontend, we first warn that it will be ignored, only to be considered a fatal error later. Be true to our word, and just ignore it. This should be backported to 1.8 and 1.7.	2018-07-24 17:13:54 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00

1 2 3 4 5 ...

283 Commits