haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
Willy Tarreau	a261e9b094	CLEANUP: connection: remove all direct references to raw_sock and ssl_sock Now we exclusively use xprt_get(XPRT_RAW) instead of &raw_sock or xprt_get(XPRT_SSL) for &ssl_sock. This removes a bunch of #ifdef and include spread over a number of location including backend, cfgparse, checks, cli, hlua, log, server and session.	2016-12-22 23:26:38 +01:00
Willy Tarreau	141ad85d10	MINOR: server: move the use_ssl field out of the ifdef USE_OPENSSL Having it in the ifdef complicates certain operations which require additional ifdefs just to access a member which could remain zero in non-ssl cases. Let's move it out, it will not even increase the struct size on 64-bit machines due to alignment.	2016-12-22 23:26:38 +01:00
Christopher Faulet	90b5abe46e	BUG/MINOR: cli: be sure to always warn the cli applet when input buffer is full [wt: may only strike if CLI commands are pipelined. Must be backported to 1.7 and 1.6, where it's a bit different and in dumpstats.c]	2016-12-12 17:58:11 +01:00
Willy Tarreau	3b6e547be8	CLEANUP: cli: rename STAT_CLI_* to CLI_ST_* These are in CLI states, not stats states anymore. STAT_CLI_O_CUSTOM was more appropriately renamed CLI_ST_CALLBACK.	2016-11-24 16:59:28 +01:00
Willy Tarreau	58d9cb7d22	REORG: cli: move "{enable\|disable} agent" to server.c Also mention that "set server" is preferred now. Note that these were the last enable/disable commands in cli.c. Also remove the now unused expect_server_admin() function.	2016-11-24 16:59:28 +01:00
Willy Tarreau	2c04eda8b5	REORG: cli: move "{enable\|disable} health" to server.c Also mention that "set server" is preferred now.	2016-11-24 16:59:28 +01:00
Willy Tarreau	ffb4d58e1b	REORG: cli: move "{enable\|disable} server" to server.c Also mention that "set server" is preferred now.	2016-11-24 16:59:28 +01:00
Willy Tarreau	b802627eb3	REORG: cli: move "set maxconn server" to server.c It's used to manipulate the server's maxconn setting.	2016-11-24 16:59:28 +01:00
William Lallemand	6b16094355	REORG: cli: move get/set weight to server.c Move get/set weight CLI functions to server.c and use the cli keyword API to register it on the CLI.	2016-11-24 16:59:27 +01:00
William Lallemand	222baf20da	REORG: cli: move 'set server' to server.c Move 'set server' CLI functions to server.c and use the cli keyword API to register it on the CLI.	2016-11-24 16:59:27 +01:00
Willy Tarreau	21b069dca8	MINOR: server: create new function cli_find_server() to find a server Several CLI commands require a server, so let's have a function to look this one up and prepare the appropriate error message and the appctx's state in case of failure.	2016-11-24 16:59:27 +01:00
David Carlier	327298c215	BUILD: fix build on Solaris 10/11 uint16_t instead of u_int16_t None ISO fields of struct tm are not present, but by zeroyfing it, on GNU and BSD systems tm_gmtoff field will be set. [wt: moved the memset into each of the date functions]	2016-11-22 12:04:19 +01:00
Willy Tarreau	e5a60688a4	MEDIUM: server: do not restrict anymore usage of IP address from the state file Now that it is possible to decide whether we prefer to use libc or the state file to resolve the server's IP address and it is possible to change a server's IP address at run time on the CLI, let's not restrict the reuse of the address from the state file anymore to the DNS only. The impact is that by default the state file will be considered first (which matches its purpose) and only then the libc. This way any address change performed at run time over the CLI will be preserved regardless of DNS usage or not.	2016-11-09 15:33:52 +01:00
Willy Tarreau	3eed10e54b	MINOR: init: add -dr to ignore server address resolution failures It is very common when validating a configuration out of production not to have access to the same resolvers and to fail on server address resolution, making it difficult to test a configuration. This option simply appends the "none" method to the list of address resolution methods for all servers, ensuring that even if the libc fails to resolve an address, the startup sequence is not interrupted.	2016-11-09 15:33:52 +01:00
Willy Tarreau	4310d36a7e	MINOR: server: add support for explicit numeric address in init-addr This will allow a server to automatically fall back to an explicit numeric IP address when all other methods fail. The address is simply specified in the address list.	2016-11-09 15:30:47 +01:00
Willy Tarreau	465b6e5463	MEDIUM: server: make libc resolution failure non-fatal Now that we have "init-addr none", it becomes possible to recover on libc resolver's failures. Thus it's preferable not to alert nor fail at the moment the libc is called, and instead process the failure at the end of the list. This allows "none" to be set after libc to provide a smooth fallback in case of resolver issues.	2016-11-09 15:30:47 +01:00
Willy Tarreau	37ebe1212b	MINOR: server: implement init-addr none The server is put into the "no address" maintenance state in this case.	2016-11-09 15:30:47 +01:00
Willy Tarreau	25e515235a	MEDIUM: server: make use of init-addr It is now supported. If not set, we default to the legacy methods list which is "last,libc".	2016-11-09 15:30:47 +01:00
Baptiste Assmann	25938278b7	MEDIUM: server: add a new init-addr server line setting This new setting supports a comma-delimited list of methods used to resolve the server's FQDN to an IP address. Currently supported methods are "libc" (use the regular libc's resolver) and "last" (use the last known valid address found in the state file). The list is implemented in a 32-bit integer, because each init-addr method only requires 3 bits. The last one must always be SRV_IADDR_END (0), allowing to store up to 10 methods in a single 32 bit integer. Note: the doc is provided at the end of this series.	2016-11-09 15:30:47 +01:00
Baptiste Assmann	3b9fe9f8f4	MAJOR: dns: runtime resolution can change server admin state WARNING: this is a MAJOR (and disruptive) change with previous HAProxy's behavior: before, HAProxy never ever used to change a server administrative status when the DNS resolution failed at run time. This patch gives HAProxy the ability to change the administrative status of a server to MAINT (RMAINT actually) when an error is encountered for a period longer than its own allowed by the corresponding 'hold' parameter. IE if the configuration sets "hold nx 10s" and a server's hostname points to a NX for more than 10s, then the server will be set to RMAINT, hence in MAINTENANCE mode.	2016-11-09 15:30:47 +01:00
Willy Tarreau	8b42848a44	MINOR: server: make srv_set_admin_state() capable of telling why this happens It will be important to help debugging some DNS resolution issues to know why a server was marked down, so let's make the function support a 3rd argument with an indication of the reason. Passing NULL will keep the message as-is.	2016-11-09 15:30:47 +01:00
Willy Tarreau	e659973bfe	MINOR: server: indicate in the logs when RMAINT is cleared It's important to report in the server state change logs that RMAINT was cleared, as it's not the regular maintenance mode, it's specific to name resolution, and it's important to report the new state (which can be DRAIN or READY).	2016-11-09 15:23:37 +01:00
Baptiste Assmann	83cbaa531f	MAJOR: server: postpone address resolution Server addresses are not resolved anymore upon the first pass so that we don't fail if an address cannot be resolved by the libc. Instead they are processed all at once after the configuration is fully loaded, by the new function srv_init_addr(). This function only acts on the server's address if this address uses an FQDN, which appears in server->hostname. For now the function does two things, to followup with HAProxy's historical default behavior: 1. apply server IP address found in server-state file if runtime DNS resolution is enabled for this server 2. use the DNS resolver provided by the libc If none of the 2 options above can find an IP address, then an error is returned. All of this will be needed to support the new server parameter "init-addr". For now, the biggest user-visible change is that all server resolution errors are dumped at once instead of causing a startup failure one by one.	2016-11-09 14:24:20 +01:00
Willy Tarreau	757478e900	BUG/MEDIUM: servers: properly propagate the maintenance states during startup Right now there is an issue with the way the maintenance flags are propagated upon startup. They are not propagate, just copied from the tracked server. This implies that depending on the server's order, some tracking servers may not be marked down. For example this configuration does not work as expected : server s1 1.1.1.1:8000 track s2 server s2 1.1.1.1:8000 track s3 server s3 1.1.1.1:8000 track s4 server s4 wtap:8000 check inter 1s disabled It results in s1/s2 being up, and s3/s4 being down, while all of them should be down. The only clean way to process this is to run through all "root" servers (those not tracking any other server), and to propagate their state down to all their trackers. This is the same algorithm used to propagate the state changes. It has to be done both to compute the IDRAIN flag and the IMAINT flag. However, doing so requires that tracking servers are not marked as inherited maintenance anymore while parsing the configuration (and given that it is wrong, better drop it). This fix also addresses another side effect of the bug above which is that the IDRAIN/IMAINT flags are stored in the state files, and if restored while the tracked server doesn't have the equivalent flag, the servers may end up in a situation where it's impossible to remove these flags. For example in the configuration above, after removing "disabled" on server s4, the other servers would have remained down, and not anymore with this fix. Similarly, the combination of IMAINT or IDRAIN with their respective forced modes was not accepted on reload, which is wrong as well. This bug has been present at least since 1.5, maybe even 1.4 (it came with tracking support). The fix needs to be backported there, though the srv-state parts are irrelevant. This commit relies on previous patch to silence warnings on startup.	2016-11-07 14:31:52 +01:00
Willy Tarreau	6fb8dc1a5a	MINOR: server: do not emit warnings/logs/alerts on server state changes at boot We'll have to use srv_set_admin_flag() to propagate some server flags during the startup, and we don't want the resulting actions to cause warnings, logs nor e-mail alerts to be generated since we're just applying the config or a state file. So let's condition these notifications to the fact that we're starting.	2016-11-07 14:31:45 +01:00
Willy Tarreau	e1bde1492a	BUG/MINOR: srv-state: allow to have both CMAINT and FDRAIN flags CMAINT indicates that the server was initially disabled in the configuration via the "disabled" keyword. FDRAIN indicates that the server was switched to the DRAIN state from the CLI or the agent. This it's perfectly valid to have both of them in the state file, so the parser must not reject this combination. This fix must be backported to 1.6.	2016-11-07 14:30:19 +01:00
Willy Tarreau	22cace2f4c	BUG/MEDIUM: srv-state: properly restore the DRAIN state There were seveal reports about the DRAIN state not being properly restored upon reload. It happens that the condition in the code does exactly the opposite of what the comment says, and the comment is right so the code is wrong. It's worth noting that the conditions are complex here due to the 2 available methods to set the drain state (CLI/agent, and config's weight). To paraphrase the updated comment in the code, there are two possible reasons for FDRAIN to have been present : - previous config weight was zero - "set server b/s drain" was sent to the CLI In the first case, we simply want to drop this drain state if the new weight is not zero anymore, meaning the administrator has intentionally turned the weight back to a positive value to enable the server again after an operation. In the second case, the drain state was forced on the CLI regardless of the config's weight so we don't want a change to the config weight to lose this status. What this means is : - if previous weight was 0 and new one is >0, drop the DRAIN state. - if the previous weight was >0, keep it. This fix must be backported to 1.6.	2016-11-07 14:30:19 +01:00
Willy Tarreau	dc3a9e830c	CLEANUP: tools: make ipcpy() preserve the original port ipcpy() is used to replace an IP address with another one, but it doesn't preserve the original port so all callers have to do it manually while it's trivial to do there. Better do it inside the function.	2016-11-05 13:56:04 +01:00
Baptiste Assmann	65ce3f5ee4	MINOR: dns: query type change when last record is a CNAME DNS servers don't return A or AAAA record if the query points to a CNAME not resolving to the right type. We know it because the last record of the response is a CNAME. We can trigger a new query, switching to a new query type, handled by the layer above.	2016-09-12 20:01:40 +02:00
Baptiste Assmann	c1ce5f358e	MEDIUM: dns: new DNS response parser New DNS response parser function which turn the DNS response from a network buffer into a DNS structure, much easier for later analysis by upper layer. Memory is pre-allocated at start-up in a chunk dedicated to DNS response store. New error code to report a wrong number of queries in a DNS response.	2016-09-12 19:54:23 +02:00
Baptiste Assmann	d458adcc52	MINOR: new update_server_addr_port() function to change both server's ADDR and service PORT This function can replace update_server_addr() where the need to change the server's port as well as the IP address is required. It performs some validation before performing each type of change.	2016-09-11 08:13:11 +02:00
Baptiste Assmann	6b453f166f	MINOR: server: introduction of 3 new server flags Introduction of 3 new server flags to remember if some parameters were set during configuration parsing. * SRV_F_CHECKADDR: this server has a check addr configured * SRV_F_CHECKPORT: this server has a check port configured * SRV_F_AGENTADDR: this server has a agent addr configured	2016-09-11 08:12:42 +02:00
Baptiste Assmann	95db2bcfee	MAJOR: check: find out which port to use for health check at run time HAProxy used to deduce port used for health checks when parsing configuration at startup time. Because of this way of working, it makes it complicated to change the port at run time. The current patch changes this behavior and makes HAProxy to choose the port used for health checking when preparing the check task itself. A new type of error is introduced and reported when no port can be found. There won't be any impact on performance, since the process to find out the port value is made of a few 'if' statements. This patch also introduces a new check state CHK_ST_PORT_MISS: this flag is used to report an error in the case when HAProxy needs to establish a TCP connection to a server, to perform a health check but no TCP ports can be found for it. And last, it also introduces a new stream termination condition: SF_ERR_CHK_PORT. Purpose of this flag is to report an error in the event when HAProxy has to run a health check but no port can be found to perform it.	2016-09-11 08:12:13 +02:00
Baptiste Assmann	d260e1dea6	MAJOR: listen section: don't use first bind port anymore when no server ports are provided Up to HAProxy 1.7-dev3, HAProxy used to use the first bind port from it's local 'listen' section when no port is configured on the server. IE, in the configuration below, the server port would be 25: listen smtp bind :25 server s1 1.0.0.1 check This way of working is now obsolete and can be removed, furthermore it is not documented! This will make the possibility to change the server's port much easier.	2016-08-14 12:18:14 +02:00
Willy Tarreau	eec1d3869d	BUG/MEDIUM: dns: fix alignment issues in the DNS response parser Alexander Lebedev reported that the DNS parser crashes in 1.6 with a bus error on Sparc when it receives a response. This is obviously caused by some alignment issues. The issue can also be reproduced on ARMv5 when setting /proc/cpu/alignment to 4 (which helps debugging). Two places cause this crash in turn, the first one is when the IP address from the packet is compared to the current one, and the second place is when the address is assigned because an unaligned address is passed to update_server_addr(). This patch modifies these places to properly use memcpy() and memcmp() to manipulate the unaligned data. Nenad Merdanovic found another set of places specific to 1.7 in functions in_net_ipv4() and in_net_ipv6(), which are used to compare networks. 1.6 has the functions but does not use them. There we perform a temporary copy to a local variable to fix the problem. The type of the function's argument is wrong since it's not necessarily aligned, so we change it for a const void * instead. This fix must be backported to 1.6. Note that in 1.6 the code is slightly different, there's no rec[] array, the pointer is used directly from the buffer.	2016-07-13 12:13:24 +02:00
Nenad Merdanovic	174dd37d88	MINOR: Add ability for agent-check to set server maxconn This is very useful in complex architecture systems where HAproxy is balancing DB connections for example. We want to keep the maxconn high in order to avoid issues with queueing on the LB level when there is slowness on another part of the system. Example is a case of an architecture where each thread opens multiple DB connections, which if get stuck in queue cause a snowball effect (old connections aren't closed, new ones cannot be established). These connections are mostly idle and the DB server has no problem handling thousands of them. Allowing us to dynamically set maxconn depending on the backend usage (LA, CPU, memory, etc.) enables us to have high maxconn for situations like above, but lowering it in case there are real issues where the backend servers become overloaded (cache issues, DB gets hit hard).	2016-04-25 17:23:50 +02:00
David Carlier	d10025c671	BUG/MINOR: server: risk of over reading the pref_net array. dns_option struct pref_net field is an array of 5. The issue here shows that pref_net_nb can go up to 5 as well which might lead to read outside of this array.	2016-04-12 11:00:39 +02:00
Vincent Bernat	02779b6263	CLEANUP: uniformize last argument of malloc/calloc Instead of repeating the type of the LHS argument (sizeof(struct ...)) in calls to malloc/calloc, we directly use the pointer name (sizeof(...)). The following Coccinelle patch was used: @@ type T; T x; @@ x = malloc( - sizeof(T) + sizeof(x) ) @@ type T; T x; @@ x = calloc(1, - sizeof(T) + sizeof(*x) ) When the LHS is not just a variable name, no change is made. Moreover, the following patch was used to ensure that "1" is consistently used as a first argument of calloc, not the last one: @@ @@ calloc( + 1, ... - ,1 )	2016-04-03 14:17:42 +02:00
Vincent Bernat	3c2f2f207f	CLEANUP: remove unneeded casts In C89, "void " is automatically promoted to any pointer type. Casting the result of malloc/calloc to the type of the LHS variable is therefore unneeded. Most of this patch was built using this Coccinelle patch: @@ type T; @@ - (T ) ($lua_touserdata\\|malloc\\|calloc\\|SSL_get_app_data\\|hlua_checkudata\\|lua_newuserdata$(...)) @@ type T; T x; void data; @@ x = - (T ) data @@ type T; T x; T data; @@ x = - (T ) data Unfortunately, either Coccinelle or I is too limited to detect situation where a complex RHS expression is of type "void *" and therefore casting is not needed. Those cases were manually examined and corrected.	2016-04-03 14:17:42 +02:00
Willy Tarreau	1a786d7f33	BUG/MINOR: tcpcheck: fix incorrect list usage resulting in failure to load certain configs Commit `baf9794` ("BUG/MINOR: tcpcheck: conf parsing error when no port configured on server and first rule(s) is (are) COMMENT") was wrong, it incorrectly implemented a list access by dereferencing a pointer of an incorrect type resulting in checking the next element in the list. The consequence is that it stops before the last comment instead of at the last one and skips the first rule. In the end, rules starting with comments are not affected, but if a sequence of checks directly starts with connect, it is then skipped and this is visible when no port is configured on the server line as the config refuses to load. There was another occurence of the same bug a few lines below, both of them were fixed. Tests were made on different configs and confirm the new fix is OK. This fix must be backported to 1.6.	2016-03-08 15:20:25 +01:00
Thierry Fournier	09a9178311	MINOR: server: generalize the "updater" source the function server_parse_addr_change_request() contain an hardcoded updater source "stats command". this function can be called from other sources than the "stats command", so this patch make this argument generic.	2016-02-24 23:37:39 +01:00
Thierry Fournier	d35b7a6d93	CLEANUP: server: add "const" to some message strings "updater" is used in "read only" mode, so I add a const qualifier to the variable declaration.	2016-02-24 23:37:39 +01:00
Thierry Fournier	c62df8463b	BUG/MINOR: server: fix the format of the warning on address change When the server address is changed, a message with unrequired '\n' or '.' is displayed, like this: [WARNING] 054/101137 (3229) : zzzz/s3 changed its IP from 127.0.0.1 to ::55 by stats command . This patch remove the '\n' which is sent before the '.'. This patch must be backported in 1.6	2016-02-24 23:37:39 +01:00
Thierry Fournier	ac88cfe452	MEDIUM: dns: add a "resolve-net" option which allow to prefer an ip in a network This options prioritize th choice of an ip address matching a network. This is useful with clouds to prefer a local ip. In some cases, a cloud high avalailibility service can be announced with many ip addresses on many differents datacenters. The latency between datacenter is not negligible, so this patch permitsto prefers a local datacenter. If none address matchs the configured network, another address is selected.	2016-02-19 14:37:49 +01:00
Thierry Fournier	ada348459f	MEDIUM: dns: extract options DNS selection preferences are actually declared inline in the struct server. There are copied from the server struct to the dns_resolution struct for each resolution. Next patchs adds new preferences options, and it is not a good way to copy all the configuration information before each dns resolution. This patch extract the configuration preference from the struct server and declares a new dedicated struct. Only a pointer to this new striuict will be copied before each dns resolution.	2016-02-19 14:37:46 +01:00
Willy Tarreau	f3c7a83acc	BUG/MEDIUM: servers state: server port is used uninitialized Nenad spotted that the last fix was unfortunately wrong. Needs to be backported to 1.6 as well.	2016-01-21 13:51:56 +01:00
Baptiste Assmann	a875b1f92e	BUG/MAJOR: servers state: server port is erased when dns resolution is enabled on a server Servers state function save and apply server IP when DNS resolution is enabled on a server. Purpose is to prevent switching traffic from one server to an other one when multiple IPs are returned by the DNS server for the A or AAAA record. That said, a bug in current code lead to erase the service port while copying the IP found in the file into the server structure in HAProxy's memory. This patch fix this bug. The bug was reported on the ML by Robert Samuel Newson and fix proposed by Nenad Merdanovic. Thank you both!!! backport: can be backported to 1.6	2016-01-21 10:47:12 +01:00
Baptiste Assmann	0821bb9ec0	MINOR: server state: missing LF (\n) on error message printed when parsing server state file There is no LF characters printed at the end of the error message returned by the function when applying server state found in a file.	2016-01-21 07:40:51 +01:00
Baptiste Assmann	baf9794b4d	BUG/MINOR: tcpcheck: conf parsing error when no port configured on server and first rule(s) is (are) COMMENT A small configuration parsing error exists when no port is setup on the server IP:port statement and the server's parameter 'port' is not set and if the first tcp-check rule is a comment, like in the example below: backend b option tcp-check tcp-check comment blah tcp-check connect 8444 server s 127.0.0.1 check In such case, an ALERT is improperly returned, despite this configuration is valid and works. The new code move the pointer to the first tcp-check rule which isn't a comment before checking the presence of the port. backport status: 1.6 and above	2015-12-04 07:48:44 +01:00
Baptiste Assmann	3dd73bea64	BUG/MINOR: tcpcheck: conf parsing error when no port configured on server and last rule is a CONNECT with no port Current configuration parsing is permissive in such situation: A server in a backend with no port conigured on the IP address statement, no 'port' parameter configured and last rule of a tcp-check is a CONNECT with no port. The current code currently parses all the rules to validate a port is well available, but it misses the last one, which means such configuration is valid: backend b option tcp-check tcp-check connect port 8444 tcp-check connect server s 127.0.0.1 check the second connect tentative is sent to port '0'... Current patch fixes this by parsing the list the right way, including the last rule. backport status: 1.6 and above	2015-12-04 07:48:35 +01:00
Dragan Dosen	cf4fb036a4	BUG/MINOR: server: check return value of fgets() in apply_server_state() fgets() can return NULL on error or when EOF occurs. This patch adds a check of fgets() return value and displays a warning if the first line of the server state file can not be read. Additionally, we make sure to close the previously opened file descriptor.	2015-11-05 10:39:09 +01:00
James Brown	55f9ff11b5	MINOR: check: add agent-send server parameter Causes HAProxy to emit a static string to the agent on every check, so that you can independently control multiple services running behind a single agent port.	2015-11-04 07:26:51 +01:00
Baptiste Assmann	fad0318c74	BUG/MAJOR: dns: first DNS response packet not matching queried hostname may lead to a loop The status DNS_UPD_NAME_ERROR returned by dns_get_ip_from_response and which means the queried name can't be found in the response was improperly processed (fell into the default case). This lead to a loop where HAProxy simply resend a new query as soon as it got a response for this status and in the only case where such type of response is the very first one received by the process. This should be backported into 1.6 branch	2015-10-30 12:38:14 +01:00
Andrew Hayworth	e6a4a329b8	MEDIUM: dns: Don't use the ANY query type Basically, it's ill-defined and shouldn't really be used going forward. We can't guarantee that resolvers will do the 'legwork' for us and actually resolve CNAMES when we request the ANY query-type. Case in point (obfuscated, clearly): PRODUCTION! ahayworth@secret-hostname.com:~$ dig @10.11.12.53 ANY api.somestartup.io ; <<>> DiG 9.8.4-rpz2+rl005.12-P1 <<>> @10.11.12.53 ANY api.somestartup.io ; (1 server found) ;; global options: +cmd ;; Got answer: ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 62454 ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 0 ;; QUESTION SECTION: ;api.somestartup.io. IN ANY ;; ANSWER SECTION: api.somestartup.io. 20 IN CNAME api-somestartup-production.ap-southeast-2.elb.amazonaws.com. ;; AUTHORITY SECTION: somestartup.io. 166687 IN NS ns-1254.awsdns-28.org. somestartup.io. 166687 IN NS ns-1884.awsdns-43.co.uk. somestartup.io. 166687 IN NS ns-440.awsdns-55.com. somestartup.io. 166687 IN NS ns-577.awsdns-08.net. ;; Query time: 1 msec ;; SERVER: 10.11.12.53#53(10.11.12.53) ;; WHEN: Mon Oct 19 22:02:29 2015 ;; MSG SIZE rcvd: 242 HAProxy can't handle that response correctly. Rather than try to build in support for resolving CNAMEs presented without an A record in an answer section (which may be a valid improvement further on), this change just skips ANY record types altogether. A and AAAA are much more well-defined and predictable. Notably, this commit preserves the implicit "Prefer IPV6 behavior." Furthermore, ANY query type by default is a bad idea: (from Robin on HAProxy's ML): Using ANY queries for this kind of stuff is considered by most people to be a bad practice since besides all the things you named it can lead to incomplete responses. Basically a resolver is allowed to just return whatever it has in cache when it receives an ANY query instead of actually doing an ANY query at the authoritative nameserver. Thus if it only received queries for an A record before you do an ANY query you will not get an AAAA record even if it is actually available since the resolver doesn't have it in its cache. Even worse if before it only got MX queries, you won't get either A or AAAA	2015-10-20 22:31:01 +02:00
Willy Tarreau	31138fae9f	BUG/MEDIUM: server: fix misuse of format string in load-server-state's warnings Commit `e11cfcd` ("MINOR: config: new backend directives: load-server-state-from-file and server-state-file-name") introduced a bug which can cause haproxy to crash upon startup by sending user-controlled data in a format string when emitting a warning. Fix the way the warning message is built to avoid this. No backport is needed, this was introduced in 1.6-dev6 only.	2015-09-29 18:51:40 +02:00
Willy Tarreau	e1aebb2994	BUILD: server: fix build warnings introduced by load-server-state Commit `e11cfcd` ("MINOR: config: new backend directives: load-server-state-from-file and server-state-file-name") caused these warnings when building with Clang : src/server.c:1972:21: warning: comparison of unsigned expression < 0 is always false [-Wtautological-compare] (srv_uweight < 0) \|\| (srv_uweight > SRV_UWGHT_MAX)) ~~~~~~~~~~~ ^ ~ src/server.c:1980:21: warning: comparison of unsigned expression < 0 is always false [-Wtautological-compare] (srv_iweight < 0) \|\| (srv_iweight > SRV_UWGHT_MAX)) ~~~~~~~~~~~ ^ ~ Indeed, srv_iweight and srv_uweight are unsigned. Just drop the offending test.	2015-09-29 18:32:57 +02:00
Thierry FOURNIER	7fe3be7281	MINOR: standard: avoid DNS resolution from the function str2sa_range() This patch blocks the DNS resolution in the function str2sa_range(), this is useful if the function is used during the HAProxy runtime.	2015-09-27 15:04:32 +02:00
Baptiste Assmann	6076d1c02d	MINOR: server: startup slowstart task when using seamless reload of HAProxy This patch uses the start up of the health check task to also start the warmup task when required. This is executed only once: when HAProxy has just started up and can be started only if the load-server-state-from-file feature is enabled and the server was in the warmup state before a reload occurs.	2015-09-19 17:05:28 +02:00
Baptiste Assmann	e11cfcd2c9	MINOR: config: new backend directives: load-server-state-from-file and server-state-file-name This directive gives HAProxy the ability to use the either the global server-state-file directive or a local one using server-state-file-name to load server states. The state can be saved right before the reload by the init script, using the "show servers state" command on the stats socket redirecting output into a file.	2015-09-19 17:05:28 +02:00
Baptiste Assmann	54a4730c65	BUG/MAJOR: can't enable a server through the stat socket When a server is disabled in the configuration using the "disabled" keyword, a single flag is positionned: SRV_ADMF_CMAINT (use to be SRV_ADMF_FMAINT).. That said, when providing the first version of this code, we also changed the SRV_ADMF_MAINT mask to match any of the possible MAINT cases: SRV_ADMF_FMAINT, SRV_ADMF_IMAINT, SRV_ADMF_CMAINT Since SRV_ADMF_CMAINT is never (and is not supposed to be) altered at run time, once a server has this flag set up, it can never ever be enabled again using the stats socket. In order to fix this, we should: - consider SRV_ADMF_CMAINT as a simple flag to report the state in the old configuration file (will be used after a reload to deduce the state of the server in a new running process) - enabling both SRV_ADMF_CMAINT and SRV_ADMF_FMAINT when the keyword "disabled" is in use in the configuration - update the mask SRV_ADMF_MAINT as it was before, to only match SRV_ADMF_FMAINT and SRV_ADMF_IMAINT. The following patch perform the changes above. It allows fixing the regression without breaking the way the up coming feature (seamless server state accross reloads) is going to work. Note: this is 1.6-only, no backport needed.	2015-09-18 12:38:23 +02:00
Baptiste Assmann	f778bb46d6	BUG/MINOR: DNS request retry counter used for retry only There are two types of retries when performing a DNS resolution: 1. retry because of a timeout 2. retry of the full sequence of requests (query types failover) Before this patch, the 'resolution->try' counter was incremented after each send of a DNS request, which does not cover the 2 cases above. This patch fix this behavior.	2015-09-10 15:46:03 +02:00
Baptiste Assmann	0453a1dd45	MINOR: dns: new flag to report that no IP can be found in a DNS response packet Some DNS response may be valid from a protocol point of view but may not contain any IP addresses. This patch gives a new flag to the function dns_get_ip_from_response to report such case. It's up to the upper layer to decide what to do with this information.	2015-09-10 15:42:55 +02:00
Baptiste Assmann	96972bcd36	MINOR: dns: no expected DNS record type found Some DNS responses may be valid from a protocol point of view, but may not contain any information considered as interested by the requester.. Purpose of the flag DNS_RESP_NO_EXPECTED_RECORD introduced by this patch is to allow reporting such situation. When this happens, a new DNS query is sent with a new query type. For now, the function only expect A and AAAA query types which is enough to cover current cases. In a next future, it will be up to the caller to tell the function which query types are expected.	2015-09-10 15:41:53 +02:00
Willy Tarreau	07101d5a16	BUG/MEDIUM: dns: use the correct server hostname when resolving The server's host name picked for resolution was incorrect, it did not skip the address family specifier, did not resolve environment variables, and messed up with the optional trailing colon. Instead, let's get the fqdn returned by str2sa_range() and use that exclusively.	2015-09-08 16:16:35 +02:00
Willy Tarreau	72b8c1f0aa	MEDIUM: tools: make str2sa_range() optionally return the FQDN The function does a bunch of things among which resolving environment variables, skipping address family specifiers and trimming port ranges. It is the only one which sees the complete host name before trying to resolve it. The DNS resolving code needs to know the original hostname, so we modify this function to optionally provide it to the caller. Note that the function itself doesn't know if the host part was a host or an address, but str2ip() knows that and can be asked not to try to resolve. So we first try to parse the address without resolving and try again with resolving enabled. This way we know if the address is explicit or needs some kind of resolution.	2015-09-08 15:50:19 +02:00
Baptiste Assmann	90447582d7	MINOR: DNS client query type failover management In the first version of the DNS resolver, HAProxy sends an ANY query type and in case of issue fails over to the type pointed by the directive in 'resolve-prefer'. This patch allows the following new failover management: 1. default query type is still ANY 2. if response is truncated or in error because ANY is not supported by the server, then a fail over to a new query type is performed. The new query type is the one pointed by the directive 'resolve-prefer'. 3. if no response or still some errors occurs, then a query type fail over is performed to the remaining IP address family.	2015-09-08 15:04:17 +02:00
Baptiste Assmann	0df5d9669a	MINOR: dns: New DNS response analysis code: DNS_RESP_TRUNCATED This patch introduces a new internal response state about the analysis of a DNS response received by a server. It is dedicated to report to above layer that the response is 'truncated'.	2015-09-08 14:58:07 +02:00
Baptiste Assmann	11c4e4eefb	BUG/MAJOR: dns: dns client resolution infinite loop Under certain circonstance (a configuration with many servers relying on DNS resolution and one of them triggering the replay of a request because of a timeout or invalid response to an ANY query), HAProxy could end up in an infinite loop over the currently supposed running DNS queries. This was caused because the FIFO list of running queries was improperly updated in snr_resolution_error_cb. The head of the list was removed instead of the resolution in error, when moving the resolution to the end of the list. In the mean time, a LIST_DEL statement is removed since useless. This action is already performed by the dns_reset_resolution function.	2015-09-08 10:51:50 +02:00
Baptiste Assmann	f046f11561	BUG/MEDIUM: dns: wrong first time DNS resolution First DNS resolution is supposed to be triggered by first health check, which is not the case with current code. This patch fixes this behavior by setting the resolution->last_resolution time to 0 instead of now_ms when parsing server's configuration at startup.	2015-08-28 17:23:04 +02:00
Willy Tarreau	29fbe51490	MAJOR: tproxy: remove support for cttproxy This was the first transparent proxy technology supported by haproxy circa 2005 but it was obsoleted in 2007 by Tproxy 4.0 which removed a lot of the earlier versions' shortcomings and was finally merged into the kernel. Since nobody has been using cttproxy for many years now and nobody has even just tried to compile the files, it's time to remove it. The doc was updated as well.	2015-08-20 19:35:14 +02:00
Baptiste Assmann	93c20623db	MINOR: server SRV_ADMF_CMAINT flag doesn't imply SRV_ADMF_FMAINT The newly created server flag SRV_ADMF_CMAINT means that the server is in 'disabled' mode because of configuration statement 'disabled'. The flag SRV_ADMF_FMAINT should not be set anymore in such case and is reserved only when the server is Forced in maintenance mode from the stats socket.	2015-08-17 15:42:07 +02:00
Baptiste Assmann	9f5ada32e4	MINOR: server: add new SRV_ADMF_CMAINT flag The purpose of SRV_ADMF_CMAINT flag is to keep in mind the server was forced to maintenance status because of the configuration file.	2015-08-08 18:18:17 +02:00
Willy Tarreau	7017cb040c	MINOR: server: add a list of safe, already reused idle connections These ones are considered safe as they have already been reused. They will be useful in "aggressive" and "always" http-reuse modes in order to place the first request of a connection with the least risk.	2015-08-06 16:29:01 +02:00
Willy Tarreau	173a1c6b43	MINOR: server: add a list of already used idle connections There's a difference with the other idle conns in that these new ones have already been used and may be reused by other streams.	2015-08-06 11:13:47 +02:00
Willy Tarreau	600802aef0	MINOR: server: add a list of private idle connections For now it's not populated but we have the list entry. It will carry all idle connections that sessions don't want to share. They may be used later to reclaim connections upon socket shortage for example.	2015-08-06 10:59:08 +02:00
Baptiste Assmann	19a106d24a	MINOR: server: server_find functions: id, name, best_match This patch introduces three new functions which can be used to find a server in a farm using different server information: - server unique id (srv->puid) - server name - find best match using either name or unique id When performing best matching, the following applies: - use the server name first (if provided) - use the server id if provided in any case, the function can update the caller about mismatches encountered.	2015-07-21 23:24:16 +02:00
Baptiste Assmann	7cc419ae1d	MINOR: server: new server flag: SRV_F_FORCED_ID This flag aims at reporting whether the server unique id (srv->puid) has been forced by the administrator in HAProxy's configuration. If not set, it means HAProxy has generated automatically the server's unique id.	2015-07-21 23:24:16 +02:00
Baptiste Assmann	a68ca96375	MAJOR: server: add DNS-based server name resolution Relies on the DNS protocol freshly implemented in HAProxy. It performs a server IP addr resolution based on a server hostname.	2015-06-13 22:07:35 +02:00
Baptiste Assmann	3d8f831f13	MEDIUM: server: change server ip address from stats socket New command available on the stats socket to change a server addr using the command "set server <backend>/<server> addr <ip4\|ip6>"	2015-06-13 22:07:35 +02:00
Baptiste Assmann	14e4014a48	MEDIUM: server: add support for changing a server's address Ability to change a server IP address during HAProxy run time. For now this is provided via function update_server_addr() which currently is not called. A log is emitted on each change. For now we do it inconditionally, but later we'll want to do it only on certain circumstances, which explains why the logging block is enclosed in if(1).	2015-06-13 22:07:35 +02:00
Simon Horman	4cd477f372	MEDIUM: Send email alerts when servers are marked as UP or enter the drain state This is similar to the way email alerts are sent when servers are marked as DOWN. Like the log messages corresponding to these state changes the messages have log level notice. Thus they are suppressed by the default email-alert level of 'alert'. To allow these messages the email-alert level should be set to 'notice', 'info' or 'debug'. e.g: email-alert level notice "email-alert mailers" and "email-alert to" settings are also required in order for any email alerts to be sent. A follow-up patch will document the above. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-04-30 07:30:50 +02:00
Willy Tarreau	e7dff02dd4	REORG/MEDIUM: stream: rename stream flags from SN_* to SF_* This is in order to keep things consistent.	2015-04-06 11:23:57 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved overthere. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection \| applet There will be at most one http_txn per stream, and a same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
Thierry FOURNIER	bb2ae64b82	MEDIUM: protocol: automatically pick the proto associated to the connection. When the destination IP is dynamically set, we can't use the "target" to define the proto. This patch ensures that we always use the protocol associated with the address family. The proto field was removed from the server and check structs.	2015-02-28 23:12:31 +01:00
Simon Horman	64e3416662	MEDIUM: Allow suppression of email alerts by log level This patch adds a new option which allows configuration of the maximum log level of messages for which email alerts will be sent. The default is alert which is more restrictive than the current code which sends email alerts for all priorities. That behaviour may be configured using the new configuration option to set the maximum level to notice or greater. email-alert level notice Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-06 07:59:58 +01:00
Simon Horman	00b69e08d5	MINOR: Remove trailing '.' from email alert messages This removes the trailing '.' from both the header and the body of email alerts. The main motivation for this change is to make the format of email alerts generated from srv_set_stopped() consistent with those generated from set_server_check_status(). Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-06 07:59:58 +01:00
Simon Horman	0ba0e4ac07	MEDIUM: Support sending email alerts Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:16 +01:00
Simon Horman	e16c1b3f3d	MEDIUM: Attach tcpcheck_rules to check This is to allow checks to be established whose tcpcheck_rules are not those of its proxy. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:16 +01:00
Simon Horman	41f5876750	MEDIUM: Move proto and addr fields struct check The motivation for this is to make checks more independent of each other to allow further reuse of their infrastructure. For nowserver->check and server->agent still always use the same values for the addr and proto fields so this patch should not introduce any behavioural changes. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:16 +01:00
Simon Horman	b1900d55df	MEDIUM: Refactor init_check and move to checks.c Refactor init_check so that an error string is returned rather than alerts being printed by it. Also init_check to checks.c and provide a prototype to allow it to be used from multiple C files. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:15 +01:00
Simon Horman	1a23cf0dfb	BUG/MEDIUM: Do not set agent health to zero if server is disabled in config disable starts a server in the disabled state, however setting the health of an agent implies that the agent is disabled as well as the server. This is a problem because the state of the agent is not restored if the state of the server is subsequently updated leading to an unexpected state. For example, if a server is started disabled and then the server state is set to ready then without this change show stat indicates that the server is "DOWN (agent)" when it is expected that the server would be UP if its (non-agent) health check passes. Reported-by: Mark Brooks <mark@loadbalancer.org> Signed-off-by: Simon Horman <horms@verge.net.au>	2015-01-23 16:47:41 +01:00
KOVACS Krisztian	b3e54fe387	MAJOR: namespace: add Linux network namespace support This patch makes it possible to create binds and servers in separate namespaces. This can be used to proxy between multiple completely independent virtual networks (with possibly overlapping IP addresses) and a non-namespace-aware proxy implementation that supports the proxy protocol (v2). The setup is something like this: net1 on VLAN 1 (namespace 1) -\ net2 on VLAN 2 (namespace 2) -- haproxy ==== proxy (namespace 0) net3 on VLAN 3 (namespace 3) -/ The proxy is configured to make server connections through haproxy and sending the expected source/target addresses to haproxy using the proxy protocol. The network namespace setup on the haproxy node is something like this: = 8< = $ cat setup.sh ip netns add 1 ip link add link eth1 type vlan id 1 ip link set eth1.1 netns 1 ip netns exec 1 ip addr add 192.168.91.2/24 dev eth1.1 ip netns exec 1 ip link set eth1.$id up ... = 8< = = 8< = $ cat haproxy.cfg frontend clients bind 127.0.0.1:50022 namespace 1 transparent default_backend scb backend server mode tcp server server1 192.168.122.4:2222 namespace 2 send-proxy-v2 = 8< = A bind line creates the listener in the specified namespace, and connections originating from that listener also have their network namespace set to that of the listener. A server line either forces the connection to be made in a specified namespace or may use the namespace from the client-side connection if that was set. For more documentation please read the documentation included in the patch itself. Signed-off-by: KOVACS Tamas <ktamas@balabit.com> Signed-off-by: Sarkozi Laszlo <laszlo.sarkozi@balabit.com> Signed-off-by: KOVACS Krisztian <hidden@balabit.com>	2014-11-21 07:51:57 +01:00
Cyril Bont�	9ce1311ebc	BUG/MEDIUM: checks: fix conflicts between agent checks and ssl healthchecks Lasse Birnbaum Jensen reported an issue when agent checks are used at the same time as standard healthchecks when SSL is enabled on the server side. The symptom is that agent checks try to communicate in SSL while it should manage raw data. This happens because the transport layer is shared between all kind of checks. To fix the issue, the transport layer is now stored in each check type, allowing to use SSL healthchecks when required, while an agent check should always use the raw_sock implementation. The fix must be backported to 1.5.	2014-11-16 00:53:12 +01:00
Willy Tarreau	bfc7b7acd8	MAJOR: checks: add support for a new "drain" administrative mode This patch adds support for a new "drain" mode. So now we have 3 admin modes for a server : - READY - DRAIN - MAINT The drain mode disables load balancing but leaves the server up. It can coexist with maint, except that maint has precedence. It is also inherited from tracked servers, so just like maint, it's represented with 2 bits. New functions were designed to set/clear each flag and to propagate the changes to tracking servers when relevant, and to log the changes. Existing functions srv_set_adm_maint() and srv_set_adm_ready() were replaced to make use of the new functions. Currently the drain mode is not yet used, however the whole logic was tested with all combinations of set/clear of both flags in various orders to catch all corner cases.	2014-05-23 14:29:11 +02:00
Willy Tarreau	9943d3117e	MINOR: server: make use of srv_is_usable() instead of checking eweight srv_is_usable() is broader than srv_is_usable() as it not only considers the weight but the server's state as well. Future changes will allow a server to be in drain mode with a non-zero weight, so we should migrate to use that function instead.	2014-05-23 14:29:11 +02:00
Willy Tarreau	8eb7784634	MINOR: server: implement srv_set_stopping() This function was taken from check_set_server_drain(). It does not consider health checks at all and only sets a server to stopping provided it's not in maintenance and is not currently stopped. The resulting state will be STOPPING. The state change is propagated to tracked servers. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	dbd5e78f5b	MINOR: server: implement srv_set_running() This function was taken from check_set_server_up(). It does not consider health checks at all and only sets a server up provided it's not in maintenance. The resulting state may be either RUNNING or STARTING depending on the presence of a slowstart or not. The state change is propagated to tracked servers. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	e7d1ef16bf	MINOR: server: implement srv_set_stopped() This function was extracted from check_set_server_down(). In only manipulates the server state and does not consider the health checks at all, nor does it modify their status. It takes a reason message to report in logs, however it passes NULL when recursing through the trackers chain. For now the function is not used, but the goal is to split health checks status from server status and to be able to change a server's state regardless of health checks statuses.	2014-05-23 14:29:11 +02:00
Willy Tarreau	bda92271e6	MINOR: server: make the status reporting function support a reason srv_adm_append_status() was renamed srv_append_status() since it's no more dedicated to maintenance mode. It now supports a reason which if not null is appended to the output string.	2014-05-23 14:29:11 +02:00
Willy Tarreau	3209123fe7	MEDIUM: server: allow multi-level server tracking Now that it is possible to know whether a server is in forced maintenance or inherits its maintenance status from another one, it is possible to allow server tracking at more than one level. We still provide a loop detection however. Note that for the stats it's a bit trickier since we have to report the check state which corresponds to the state of the server at the end of the chain.	2014-05-23 14:29:11 +02:00
Willy Tarreau	a0066ddbda	MEDIUM: server: properly support and propagate the maintenance status This change now involves a new flag SRV_ADMF_IMAINT to note that the maintenance status of a server is inherited from another server. Thus, we know at each server level in the chain if it's running, in forced maintenance or in a maintenance status because it tracks another server, or even in both states. Disabling a server propagates this flag down to other servers. Enabling a server flushes the flag down. A server becomes up again once both of its flags are cleared. Two new functions "srv_adm_set_maint()" and "srv_adm_set_ready()" are used to manipulate this maintenance status. They're used by the CLI and the stats page. Now the stats page always says "MAINT" instead of "MAINT(via)" and it's only the chk/down field which reports "via x/y" when the status is inherited from another server, but it doesn't say it when a server was forced into maintenance. The CSV output indicates "MAINT (via x/y)" instead of only "MAINT(via)". This is the most accurate representation. One important thing is that now entering/leaving maintenance for a tracking server correctly follows the state of the tracked server.	2014-05-22 11:27:00 +02:00
Willy Tarreau	4aac7db940	REORG: checks: put the functions in the appropriate files ! Checks.c has become a total mess. A number of proxy or server maintenance and queue management functions were put there probably because they were used there, but that makes the code untouchable. And that's without saying that their names does not always relate to what they really do! So let's do a first pass by moving these ones : - set_backend_down() => backend.c - redistribute_pending() => queue.c:pendconn_redistribute() - check_for_pending() => queue.c:pendconn_grab_from_px() - shutdown_sessions => server.c:srv_shutdown_sessions() - shutdown_backup_sessions => server.c:srv_shutdown_backup_sessions() All of them were moved at once.	2014-05-22 11:27:00 +02:00
Willy Tarreau	892337c8e1	MAJOR: server: use states instead of flags to store the server state Servers used to have 3 flags to store a state, now they have 4 states instead. This avoids lots of confusion for the 4 remaining undefined states. The encoding from the previous to the new states can be represented this way : SRV_STF_RUNNING \| SRV_STF_GOINGDOWN \| \| SRV_STF_WARMINGUP \| \| \| 0 x x SRV_ST_STOPPED 1 0 0 SRV_ST_RUNNING 1 0 1 SRV_ST_STARTING 1 1 x SRV_ST_STOPPING Note that the case where all bits were set used to exist and was randomly dealt with. For example, the task was not stopped, the throttle value was still updated and reported in the stats and in the http_server_state header. It was the same if the server was stopped by the agent or for maintenance. It's worth noting that the internal function names are still quite confusing.	2014-05-22 11:27:00 +02:00
Willy Tarreau	2012521d7b	REORG/MEDIUM: server: move the maintenance bits out of the server state Now we introduce srv->admin and srv->prev_admin which are bitfields containing one bit per source of administrative status (maintenance only for now). For the sake of backwards compatibility we implement a single source (ADMF_FMAINT) but the code already checks any source (ADMF_MAINT) where the STF_MAINTAIN bit was previously checked. This will later allow us to add ADMF_IMAINT for maintenance mode inherited from tracked servers. Along doing these changes, it appeared that some places will need to be revisited when implementing the inherited bit, this concerns all those modifying the ADMF_FMAINT bit (enable/disable actions on the CLI or stats page), and the checks to report "via" on the stats page. But currently the code is harmless.	2014-05-22 11:27:00 +02:00
Willy Tarreau	c93cd16b6c	REORG/MEDIUM: server: split server state and flags in two different variables Till now, the server's state and flags were all saved as a single bit field. It causes some difficulties because we'd like to have an enum for the state and separate flags. This commit starts by splitting them in two distinct fields. The first one is srv->state (with its counter-part srv->prev_state) which are now enums, but which still contain bits (SRV_STF_*). The flags now lie in their own field (srv->flags). The function srv_is_usable() was updated to use the enum as input, since it already used to deal only with the state. Note that currently, the maintenance mode is still in the state for simplicity, but it must move as well.	2014-05-22 11:27:00 +02:00
Willy Tarreau	c5150dafd8	MINOR: server: use functions to detect state changes and to update them Detecting that a server's status has changed is a bit messy, as well as it is to commit the status changes. We'll have to add new conditions soon and we'd better avoid to multiply the number of touched locations with the high risk of forgetting them. This commit introduces : - srv_lb_status_changed() to report if the status changed from the previously committed one ; - svr_lb_commit_status() to commit the current status The function is now used by all load-balancing algorithms.	2014-05-13 22:18:22 +02:00
Willy Tarreau	02615f9b16	MINOR: server: remove the SRV_DRAIN flag which can always be deduced This flag is only a copy of (srv->uweight == 0), so better get rid of it to reduce some of the confusion that remains in the code, and use a simple function to return this state based on this weight instead.	2014-05-13 22:18:13 +02:00
Willy Tarreau	5cf0b52d29	MEDIUM: checks: only complain about the missing port when the check uses TCP For UNIX socket addresses, we don't need any port, so let's disable the check under this condition.	2014-05-10 01:26:38 +02:00
Willy Tarreau	9cf8d3f46b	MINOR: protocols: use is_inet_addr() when only INET addresses are desired We used to have is_addr() in place to validate sometimes the existence of an address, sometimes a valid IPv4 or IPv6 address. Replace them carefully so that is_inet_addr() is used wherever we can only use an IPv4/IPv6 address.	2014-05-10 01:26:37 +02:00
Willy Tarreau	640556c692	BUG/MINOR: checks: correctly configure the address family and protocol Currently, mixing an IPv4 and an IPv6 address in checks happens to work by pure luck because the two protocols use the same functions at the socket level and both use IPPROTO_TCP. However, they're definitely wrong as the protocol for the check address is retrieved from the server's address. Now the protocol assigned to the connection is the same as the one the address in use belongs to (eg: the server's address or the explicit check address).	2014-05-10 01:26:37 +02:00
David S	afb768340c	MEDIUM: connection: Implement and extented PROXY Protocol V2 This commit modifies the PROXY protocol V2 specification to support headers longer than 255 bytes allowing for optional extensions. It implements the PROXY protocol V2 which is a binary representation of V1. This will make parsing more efficient for clients who will know in advance exactly how many bytes to read. Also, it defines and implements some optional PROXY protocol V2 extensions to send information about downstream SSL/TLS connections. Support for PROXY protocol V1 remains unchanged.	2014-05-09 08:25:38 +02:00
Willy Tarreau	272adea423	REORG: cfgparse: move server keyword parsing to server.c The cfgparse.c file becomes huge, and a large part of it comes from the server keyword parser. Since the configuration is a bit more modular now, move this parser to server.c. This patch also moves the check of the "server" keyword earlier in the supported keywords list, resulting in a slightly faster config parsing for configs with large numbers of servers (about 10%). No functional change was made, only the code was moved.	2014-03-31 10:42:03 +02:00
Bhaskar Maddala	a20cb85eba	MINOR: stats: Enhancement to stats page to provide information of last session time. Summary: Track and report last session time on the stats page for each server in every backend, as well as the backend. This attempts to address the requirement in the ROADMAP - add a last activity date for each server (req/resp) that will be displayed in the stats. It will be useful with soft stop. The stats page reports this as time elapsed since last session. This change does not adequately address the requirement for long running session (websocket, RDP... etc).	2014-02-08 01:19:58 +01:00
Willy Tarreau	ff5ae35b9f	MINOR: checks: use check->state instead of srv->state & SRV_CHECKED Having the check state partially stored in the server doesn't help. Some functions such as srv_getinter() rely on the server being checked to decide what check frequency to use, instead of relying on the check being configured. So let's get rid of SRV_CHECKED and SRV_AGENT_CHECKED and only use the check's states instead.	2013-12-14 16:02:19 +01:00
Simon Horman	58c32978b2	MEDIUM: Set rise and fall of agent checks to 1 This is achieved by moving rise and fall from struct server to struct check. After this move the behaviour of the primary check, server->check is unchanged. However, the secondary agent check, server->agent now has independent rise and fall values each of which are set to 1. The result is that receiving "fail", "stopped" or "down" just once from the agent will mark the server as down. And receiving a weight just once will allow the server to be marked up if its primary check is in good health. This opens up the scope to allow the rise and fall values of the agent check to be configurable, however this has not been implemented at this stage. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-25 07:31:16 +01:00
Willy Tarreau	004e045f31	BUG/MAJOR: server: weight calculation fails for map-based algorithms A crash was reported by Igor at owind when changing a server's weight on the CLI. Lukas Tribus could reproduce a related bug where setting a server's weight would result in the new weight being multiplied by the initial one. The two bugs are the same. The incorrect weight calculation results in the total farm weight being larger than what was initially allocated, causing the map index to be out of bounds on some hashes. It's easy to reproduce using "balance url_param" with a variable param, or with "balance static-rr". It appears that the calculation is made at many places and is not always right and not always wrong the same way. Thus, this patch introduces a new function "server_recalc_eweight()" which is dedicated to this task of computing ->eweight from many other elements including uweight and current time (for slowstart), and all users now switch to use this function. The patch is a bit large but the code was not trivially fixable in a way that could guarantee this situation would not occur anymore. The fix is much more readable and has been verified to work with all algorithms, with both consistent and map-based hashes, and even with static-rr. Slowstart was tested as well, just like enable/disable server. The same bug is very likely present in 1.4 as well, so the patch will probably need to be backported eventhough it will not apply as-is. Thanks to Lukas and Igor for the information they provided to reproduce it.	2013-11-21 15:09:02 +01:00
Simon Horman	125d099662	MEDIUM: Move health element to struct check This is in preparation for associating a agent check with a server which runs as well as the server's existing check. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:36:07 +01:00
Simon Horman	4a741432be	MEDIUM: Paramatise functions over the check of a server Paramatise the following functions over the check of a server * set_server_down * set_server_up * srv_getinter * server_status_printf * set_server_check_status * set_server_disabled * set_server_enabled Generally the server parameter of these functions has been removed. Where it is still needed it is obtained using check->server. This is in preparation for associating a agent check with a server which runs as well as the server's existing check. By paramatising these functions they may act on each of the checks without further significant modification. Explanation of the SSP_O_HCHK portion of this change: * Prior to this patch SSP_O_HCHK serves a single purpose which is to tell server_status_printf() weather it should print the details of the check of a server or not. With the paramatisation that this patch adds there are two cases. 1) Printing the details of the check in which case a valid check parameter is needed. 2) Not printing the details of the check in which case the contents check parameter are unused. In case 1) we could pass SSP_O_HCHK and a valid check and; In case 2) we could pass !SSP_O_HCHK and any value for check including NULL. If NULL is used for case 2) then SSP_O_HCHK becomes supurfulous and as NULL is used for case 2) SSP_O_HCHK has been removed. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:35:54 +01:00
Simon Horman	6618300e13	MEDIUM: Split up struct server's check element This is in preparation for associating a agent check with a server which runs as well as the server's existing check. The split has been made by: * Moving elements of struct server's check element that will be shared by both checks into a new check_common element of struct server. * Moving the remaining elements to a new struct check and making struct server's check element a struct check. * Adding a server element to struct check, a back-pointer to the server element it is a member of. - At this time the server could be obtained using container_of, however, this will not be so easy once a second struct check element is added to struct server to accommodate an agent health check. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 09:35:48 +01:00
Simon Horman	a360844735	CLEANUP: Make parameters of srv_downtime and srv_getinter const The parameters of srv_downtime and srv_getinter are not modified and thus may be const. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-11-19 08:04:58 +01:00
Simon Horman	b796afa60d	MEDIUM: server: Tighten up parsing of weight string Detect: * Empty weight string, including no digits before '%' in relative weight string * Trailing garbage, including between the last integer and '%' in relative weights The motivation for this is to allow the weight string to be safely logged if successfully parsed by this function Signed-off-by: Simon Horman <horms@verge.net.au>	2013-02-13 10:59:50 +01:00
Simon Horman	58b5d292b3	MEDIUM: server: Allow relative weights greater than 100% Allow relative weights greater than 100%, capping the absolute value to 256 which is the largest supported absolute weight. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-02-13 10:56:28 +01:00
Simon Horman	7d09b9a4df	MEDIUM: server: Break out set weight processing code Break out set weight processing code. This is in preparation for reusing the code. Also, remove duplicate check in nested if clauses. {px->lbprm.algo & BE_LB_PROP_DYN) is checked by the immediate outer if clause, so there is no need to check it a second time. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-02-13 10:53:40 +01:00
Willy Tarreau	dff5543618	MEDIUM: server: move parsing of keyword "id" to server.c This is the first keyword to be moved to server.c.	2012-10-10 17:51:05 +02:00
Willy Tarreau	21faa91be6	MINOR: server: add minimal infrastructure to parse keywords Just like with the "bind" lines, we'll switch the "server" line parsing to keyword registration. The code is essentially the same as for bind keywords, with minor changes such as support for the default-server keywords and support for variable argument count.	2012-10-10 17:42:39 +02:00
Willy Tarreau	ec6c5df018	[CLEANUP] remove many #include <types/xxx> from C files It should be stated as a rule that a C file should never include types/xxx.h when proto/xxx.h exists, as it gives less exposure to declaration conflicts (one of which was caught and fixed here) and it complicates the file headers for nothing. Only types/global.h, types/capture.h and types/polling.h have been found to be valid includes from C files.	2008-07-16 10:30:42 +02:00
Krzysztof Piotr Oledzki	5259dfedd1	[MEDIUM]: rework checks handling This patch adds two new variables: fastinter and downinter. When server state is: - non-transitionally UP -> inter (no change) - transitionally UP (going down), unchecked or transitionally DOWN (going up) -> fastinter - down -> downinter It allows to set something like: server sr6 127.0.51.61:80 cookie s6 check inter 10000 downinter 20000 fastinter 500 fall 3 weight 40 In the above example haproxy uses 10000ms between checks but as soon as one check fails fastinter (500ms) is used. If server is down downinter (20000) is used or fastinter (500ms) if one check pass. Fastinter is also used when haproxy starts. New "timeout.check" variable was added, if set haproxy uses it as an additional read timeout, but only after a connection has been already established. I was thinking about using "timeout.server" here but most people set this with an addition reserve but still want checks to kick out laggy servers. Please also note that in most cases check request is much simpler and faster to handle than normal requests so this timeout should be smaller. I also changed the timeout used for check connections establishing. Changes from the previous version: - use tv_isset() to check if the timeout is set, - use min("timeout connect", "inter") but only if "timeout check" is set as this min alone may be to short for full (connect + read) check, - debug code (fprintf) commented/removed - documentation Compile tested only (sorry!) as I'm currently traveling but changes are rather small and trivial.	2008-01-22 11:29:06 +01:00
Krzysztof Oledzki	85130941e7	[MEDIUM] stats: report server and backend cumulated downtime Hello, This patch implements new statistics for SLA calculation by adding new field 'Dwntime' with total down time since restart (both HTTP/CSV) and extending status field (HTTP) or inserting a new one (CSV) with time showing how long each server/backend is in a current state. Additionaly, down transations are also calculated and displayed for backends, so it is possible to know how many times selected backend was down, generating "No server is available to handle this request." error. New information are presentetd in two different ways: - for HTTP: a "human redable form", one of "100000d 23h", "23h 59m" or "59m 59s" - for CSV: seconds I believe that seconds resolution is enough. As there are more columns in the status page I decided to shrink some names to make more space: - Weight -> Wght - Check -> Chk - Down -> Dwn Making described changes I also made some improvements and fixed some small bugs: - don't increment s->health above 's->rise + s->fall - 1'. Previously it was incremented an then (re)set to 's->rise + s->fall - 1'. - do not set server down if it is down already - do not set server up if it is up already - fix colspan in multiple places (mostly introduced by my previous patch) - add missing "status" header to CSV - fix order of retries/redispatches in server (CSV) - s/Tthen/Then/ - s/server/backend/ in DATA_ST_PX_BE (dumpstats.c) Changes from previous version: - deal with negative time intervales - don't relay on s->state (SRV_RUNNING) - little reworked human_time + compacted format (no spaces). If needed it can be used in the future for other purposes by optionally making "cnt" as an argument - leave set_server_down mostly unchanged - only little reworked "process_chk: 9" - additional fields in CSV are appended to the rigth - fix "SEC" macro - named arguments (human_time, be_downtime, srv_downtime) Hope it is OK. If there are only cosmetic changes needed please fill free to correct it, however if there are some bigger changes required I would like to discuss it first or at last to know what exactly was changed especially since I already put this patch into my production server. :) Thank you, Best regards, Krzysztof Oledzki	2007-10-22 21:36:23 +02:00
Willy Tarreau	e3ba5f0aaa	[CLEANUP] included common/version.h everywhere	2006-06-29 18:54:54 +02:00
Willy Tarreau	baaee00406	[BIGMOVE] exploded the monolithic haproxy.c file into multiple files. The files are now stored under : - include/haproxy for the generic includes - include/types.h for the structures needed within prototypes - include/proto.h for function prototypes and inline functions - src/*.c for the C files Most include files are now covered by LGPL. A last move still needs to be done to put inline functions under GPL and not LGPL. Version has been set to 1.3.0 in the code but some control still needs to be done before releasing.	2006-06-26 02:48:02 +02:00

... 10 11 12 13 14

680 Commits