haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 00:57:02 +02:00

Author	SHA1	Message	Date
Willy Tarreau	3a925c155d	MEDIUM: stick-tables: flush old entries upon soft-stop When a process with large stick tables is replaced by a new one and remains present until the last connection finishes, it keeps these data in memory for nothing since they will never be used anymore by incoming connections, except during syncing with the new process. This is especially problematic when dealing with long session protocols such as WebSocket as it becomes possible to stack many processes and eat a lot of memory. So the idea here is to know if a table still needs to be synced or not, and to purge all unused entries once the sync is complete. This means that after a few hundred milliseconds when everything has been synchronized with the new process, only a few entries will remain allocated (only the ones held by sessions during the restart) and all the remaining memory will be freed. Note that we carefully do that only after the grace period is expired so as not to impact a possible proxy that needs to accept a few more connections before leaving. Doing this required to add a sync counter to the stick tables, to know how many peer sync sessions are still in progress in order not to flush the entries until all synchronizations are completed.	2013-09-04 17:54:01 +02:00
Evan Broder	be55431f9f	MINOR: ssl: Add statement 'verifyhost' to "server" statements verifyhost allows you to specify a hostname that the remote server's SSL certificate must match. Connections that don't match will be closed with an SSL error.	2013-09-01 07:55:49 +02:00
Willy Tarreau	9f09521f2d	BUG/MEDIUM: unique_id: HTTP request counter must be unique! The HTTP request counter is incremented non atomically, which means that many requests can log the same ID. Let's increment it when it is consumed so that we avoid this case. This bug was reported by Patrick Hemmer. It's 1.5-specific and does not need to be backported.	2013-08-13 17:52:20 +02:00
Willy Tarreau	47060b6ae0	MINOR: cli: make it possible to enter multiple values at once with "set table" The "set table" statement allows to create new entries with their respective values. Till now it was limited to a single data type per line, requiring as many "set table" statements as the desired data types to be set. Since this is only a parser limitation, this patch gets rid of it. It also allows the creation of a key with no data types (all reset to their default values).	2013-08-01 21:17:19 +02:00
Willy Tarreau	b4c8493a9f	MINOR: session: make the number of stick counter entries more configurable In preparation of more flexibility in the stick counters, make their number configurable. It still defaults to 3 which is the minimum accepted value. Changing the value alone is not sufficient to get more counters, some bitfields still need to be updated and the TCP actions need to be updated as well, but this update tries to be easier, which is nice for experimentation purposes.	2013-08-01 21:17:14 +02:00
Willy Tarreau	cadd8c9ec3	MINOR: payload: split smp_fetch_rdp_cookie() This function is also called directly from backend.c, so let's stop building fake args to call it as a sample fetch, and have a lower layer more generic function instead.	2013-08-01 21:17:13 +02:00
Willy Tarreau	ef38c39287	MEDIUM: sample: systematically pass the keyword pointer to the keyword We're having a lot of duplicate code just because of minor variants between fetch functions that could be dealt with if the functions had the pointer to the original keyword, so let's pass it as the last argument. An earlier version used to pass a pointer to the sample_fetch element, but this is not the best solution for two reasons : - fetch functions will solely rely on the keyword string - some other smp_fetch_* users do not have the pointer to the original keyword and were forced to pass NULL. So finally we're passing a pointer to the keyword as a const char *, which perfectly fits the original purpose.	2013-08-01 21:17:13 +02:00
Godbach	a34bdc0ea4	BUG/MEDIUM: server: set the macro for server's max weight SRV_UWGHT_MAX to SRV_UWGHT_RANGE The max weight of server is 256 now, but SRV_UWGHT_MAX is still 255. As a result, FWRR will not work well when server's weight is 256. The description is as below: There are some macros related to server's weight in include/types/server.h: #define SRV_UWGHT_RANGE 256 #define SRV_UWGHT_MAX (SRV_UWGHT_RANGE - 1) #define SRV_EWGHT_MAX (SRV_UWGHT_MAX * BE_WEIGHT_SCALE) Since weight of server can be reach to 256 and BE_WEIGHT_SCALE equals to 16, the max eweight of server should be 25616 = 4096, it will exceed SRV_EWGHT_MAX which equals to SRV_UWGHT_MAXBE_WEIGHT_SCALE = 255*16 = 4080. When a server with weight 256 is insterted into FWRR tree during initialization, the key value of this server should be SRV_EWGHT_MAX - s->eweight = 4080 - 4096 = -16 which is closed to UINT_MAX in unsigned type, so the server with highest weight will be not elected as the first server to process request. In addition, it is a better choice to compare with SRV_UWGHT_MAX than a magic number 256 while doing check for the weight. The max number of servers for round-robin algorithm is also updated. Signed-off-by: Godbach <nylzhaowei@gmail.com>	2013-07-22 09:29:34 +02:00
Lukas Tribus	67db8df12b	MEDIUM: http: add IPv6 support for "set-tos" As per RFC3260 #4 and BCP37 #4.2 and #5.2, the IPv6 counterpart of TOS is "traffic class". Add support for IPv6 traffic class in "set-tos" by moving the "set-tos" related code to the new inline function inet_set_tos(), handling IPv4 (IP_TOS), IPv6 (IPV6_TCLASS) and IPv4-mapped sockets (IP_TOS, like ::ffff:127.0.0.1). Also define - if missing - the IN6_IS_ADDR_V4MAPPED() macro in include/common/compat.h for compatibility.	2013-06-23 18:01:38 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Godbach	430f291a99	CLEANUP: session: remove event_accept() which was not used anymore Remove event_accept() in include/proto/proto_http.h and use correct function name in other two files instead of event_accept(). Signed-off-by: Godbach <nylzhaowei@gmail.com>	2013-06-20 08:07:35 +02:00
Willy Tarreau	be4a3eff34	MEDIUM: counters: use sc0/sc1/sc2 instead of sc1/sc2/sc3 It was a bit inconsistent to have gpc start at 0 and sc start at 1, so make sc start at zero like gpc. No previous release was issued with sc3 anyway, so no existing setup should be affected.	2013-06-17 15:04:07 +02:00
Thierry FOURNIER	7eeb435494	CLEANUP: protect checks.h from multiple inclusions The types/checks.h include file isn't protected against multiple inclusions, so let's surround it with "#ifndef _TYPES_CHECKS_H/#endif" to fix this.	2013-06-14 19:53:02 +02:00
Thierry FOURNIER	d3879e8b57	CLEANUP: fix missing include <string.h> in proto/listener.h The file proto/listener.h makes use of strdup() but doesn't include <string.h> so it's sensible to include file ordering.	2013-06-14 19:52:17 +02:00
Willy Tarreau	4f0d919bd4	MEDIUM: tcp: add "tcp-request connection expect-proxy layer4" This configures the client-facing connection to receive a PROXY protocol header before any byte is read from the socket. This is equivalent to having the "accept-proxy" keyword on the "bind" line, except that using the TCP rule allows the PROXY protocol to be accepted only for certain IP address ranges using an ACL. This is convenient when multiple layers of load balancers are passed through by traffic coming from public hosts.	2013-06-11 20:40:55 +02:00
Willy Tarreau	51347ed94c	MEDIUM: http: add the "set-mark" action on http-request/http-response rules "set-mark" is used to set the Netfilter MARK on all packets sent to the client to the value passed in <mark> on platforms which support it. This value is an unsigned 32 bit value which can be matched by netfilter and by the routing table. It can be expressed both in decimal or hexadecimal format (prefixed by "0x"). This can be useful to force certain packets to take a different route (for example a cheaper network path for bulk downloads). This works on Linux kernels 2.6.32 and above and requires admin privileges.	2013-06-11 19:34:13 +02:00
Willy Tarreau	42cf39e3b9	MEDIUM: http: add support for "set-tos" in http-request/http-response This manipulates the TOS field of the IP header of outgoing packets sent to the client. This can be used to set a specific DSCP traffic class based on some request or response information. See RFC2474, 2597, 3260 and 4594 for more information.	2013-06-11 19:04:37 +02:00
Willy Tarreau	9a355ec257	MEDIUM: http: add support for action "set-log-level" in http-request/http-response Some users want to disable logging for certain non-important requests such as stats requests or health-checks coming from another equipment. Other users want to log with a higher importance (eg: notice) some special traffic (POST requests, authenticated requests, requests coming from suspicious IPs) or some abnormally large responses. This patch responds to all these needs at once by adding a "set-log-level" action to http-request/http-response. The 8 syslog levels are supported, as well as "silent" to disable logging.	2013-06-11 17:50:26 +02:00
Willy Tarreau	abcd5145f8	MEDIUM: log: add a log level override value in struct session This log level will be used in a further patch to change the log level depending on the request or response.	2013-06-11 17:50:26 +02:00
Willy Tarreau	f4c43c13be	MEDIUM: http: add the "set-nice" action to http-request and http-response This new action changes the nice factor of the task processing the current request.	2013-06-11 17:50:26 +02:00
Willy Tarreau	e365c0b92b	MEDIUM: http: add a new "http-response" ruleset Some actions were clearly missing to process response headers. This patch adds a new "http-response" ruleset which provides the following actions : - allow : stop evaluating http-response rules - deny : stop and reject the response with a 502 - add-header : add a header in log-format mode - set-header : set a header in log-format mode	2013-06-11 16:06:12 +02:00
Willy Tarreau	2b57cb8f30	MEDIUM: protocol: implement a "drain" function in protocol layers Since commit `cfd97c6f` was merged into 1.5-dev14 (BUG/MEDIUM: checks: prevent TIME_WAITs from appearing also on timeouts), some valid health checks sometimes used to show some TCP resets. For example, this HTTP health check sent to a local server : 19:55:15.742818 IP 127.0.0.1.16568 > 127.0.0.1.8000: S 3355859679:3355859679(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742841 IP 127.0.0.1.8000 > 127.0.0.1.16568: S 1060952566:1060952566(0) ack 3355859680 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742863 IP 127.0.0.1.16568 > 127.0.0.1.8000: . ack 1 win 257 19:55:15.745402 IP 127.0.0.1.16568 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:15.745488 IP 127.0.0.1.8000 > 127.0.0.1.16568: FP 1:146(145) ack 23 win 257 19:55:15.747109 IP 127.0.0.1.16568 > 127.0.0.1.8000: R 23:23(0) ack 147 win 257 After some discussion with Chris Huang-Leaver, it appeared clear that what we want is to only send the RST when we have no other choice, which means when the server has not closed. So we still keep SYN/SYN-ACK/RST for pure TCP checks, but don't want to see an RST emitted as above when the server has already sent the FIN. The solution against this consists in implementing a "drain" function at the protocol layer, which, when defined, causes as much as possible of the input socket buffer to be flushed to make recv() return zero so that we know that the server's FIN was received and ACKed. On Linux, we can make use of MSG_TRUNC on TCP sockets, which has the benefit of draining everything at once without even copying data. On other platforms, we read up to one buffer of data before the close. If recv() manages to get the final zero, we don't disable lingering. Same for hard errors. Otherwise we do. In practice, on HTTP health checks we generally find that the close was pending and is returned upon first recv() call. The network trace becomes cleaner : 19:55:23.650621 IP 127.0.0.1.16561 > 127.0.0.1.8000: S 3982804816:3982804816(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650644 IP 127.0.0.1.8000 > 127.0.0.1.16561: S 4082139313:4082139313(0) ack 3982804817 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650666 IP 127.0.0.1.16561 > 127.0.0.1.8000: . ack 1 win 257 19:55:23.651615 IP 127.0.0.1.16561 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:23.651696 IP 127.0.0.1.8000 > 127.0.0.1.16561: FP 1:146(145) ack 23 win 257 19:55:23.652628 IP 127.0.0.1.16561 > 127.0.0.1.8000: F 23:23(0) ack 147 win 257 19:55:23.652655 IP 127.0.0.1.8000 > 127.0.0.1.16561: . ack 24 win 257 This change should be backported to 1.4 which is where Chris encountered this issue. The code is different, so probably the tcp_drain() function will have to be put in the checks only.	2013-06-10 20:33:23 +02:00
Willy Tarreau	04ff9f105f	MINOR: http: add full-length header fetch methods The req.hdr and res.hdr fetch methods do not work well on headers which are allowed to contain commas, such as User-Agent, Date or Expires. More specifically, full-length matching is impossible if a comma is present. This patch introduces 4 new fetch functions which are designed to work with these full-length headers : - req.fhdr, req.fhdr_cnt - res.fhdr, res.fhdr_cnt These ones do not stop at commas and permit to return full-length header values.	2013-06-10 18:39:42 +02:00
Willy Tarreau	570f221cbb	MINOR: log: add a new flag 'L' for locally processed requests People who use "option dontlog-normal" are bothered with redirects and stats being logged and reported as errors in the logs ("PR" = proxy blocked the request). This patch introduces a new flag 'L' for when a request is locally processed, that is not considered as an error by the log filters. That way we know a request was intercepted and processed by haproxy without logging the line when "option dontlog-normal" is in effect.	2013-06-10 16:42:09 +02:00
Willy Tarreau	bf43f6eb32	MINOR: defaults: allow REQURI_LEN and CAPTURE_LEN to be redefined Some users want to change these default settings but it was not convenient.	2013-06-10 10:30:09 +02:00
Willy Tarreau	379357af58	BUG/MAJOR: http: always ensure response buffer has some room for a response Since 1.5-dev12 and commit `3bf1b2b8` (MAJOR: channel: stop relying on BF_FULL to take action), the HTTP parser switched to channel_full() instead of BF_FULL to decide whether a buffer had enough room to start parsing a request or response. The problem is that channel_full() intentionally ignores outgoing data, so a corner case exists where a large response might still be left in a response buffer with just a few bytes left (much less than the reserve), enough to accept a second response past the last data, but not enough to permit the HTTP processor to add some headers. Since all the processing relies on this space being available, we can get some random crashes when clients pipeline requests. The analysis of a core from haproxy configured with 20480 bytes buffers shows this : with enough "luck", when sending back the response for the first request, the client is slow, the TCP window is congested, the socket buffers are full, and haproxy's buffer fills up. We still have 20230 bytes of response data in a 20480 response buffer. The second request is sent to the server which returns 214 bytes which fit in the small 250 bytes left in this buffer. And the buffer arrangement makes it possible to escape all the controls in http_wait_for_response() : \|<------ response buffer = 20480 bytes ------>\| [ 2/2 \| 3 \| 4 \| 1/2 ] ^ start of circular buffer 1/2 = beginning of previous response (18240) 2/2 = end of previous response (1990) 3 = current response (214) 4 = free space (36) - channel_full() returns false (20230 bytes are going to leave) - the response headers does not wrap at the end of the buffer - the remaining linear room after the headers is larger than the reserve, because it's the previous response which wraps : => response is processed Header rewriting causes it to reach 260 bytes, 10 bytes larger than what the buffer could hold. So all computations during header addition are wrong and lead to the corruption we've observed. All the conditions are very hard to meet (which explains why it took almost one year for this bug to show up) and are almost impossible to reproduce on purpose on a test platform. But the bug is clearly there. This issue was reported by Dinko Korunic who kindly devoted a lot of time to provide countless traces and cores, and to experiment with troubleshooting patches to knock the bug down. Thanks Dinko! No backport is needed, but all 1.5-dev versions between dev12 and dev18 included must be upgraded. A workaround consists in setting option forceclose to prevent pipelined requests from being processed.	2013-06-08 13:14:17 +02:00
Willy Tarreau	ba2ffd18b5	MEDIUM: counters: add a new "gpc0_rate" counter in stick-tables This counter is special in that instead of reporting the gpc0 cumulative count, it returns its increase rate over the configured period.	2013-05-29 15:54:14 +02:00
Willy Tarreau	e25c917af8	MEDIUM: counters: add support for tracking a third counter We're often missin a third counter to track base, src and base+src at the same time. Here we introduce track_sc3 to have this third counter. It would be wise not to add much more counters because that slightly increases the session size and processing time though the real issue is more the declaration of the keywords in the code and in the doc.	2013-05-29 00:37:16 +02:00
Willy Tarreau	d5ca9abb0d	MINOR: counters: make it easier to extend the amount of tracked counters By properly affecting the flags and values, it becomes easier to add more tracked counters, for example for experimentation. It also slightly reduces the code and the number of tests. No counters were added with this patch.	2013-05-28 17:43:40 +02:00
Pieter Baauw	1eb7592bba	MINOR: tproxy: add support for OpenBSD OpenBSD uses (SOL_SOCKET, SO_BINDANY) to enable transparent proxy on a socket. This patch adds support for the relevant setsockopt() calls.	2013-05-11 08:03:50 +02:00
Pieter Baauw	ff30b6667b	MINOR: tproxy: add support for FreeBSD FreeBSD uses (IPPROTO_IP, IP_BINDANY) and (IPPROTO_IPV6, IPV6_BINDANY) to enable transparent proxy on a socket. This patch adds support for the relevant setsockopt() calls.	2013-05-11 08:03:43 +02:00
Pieter Baauw	d551fb5a8d	REORG: tproxy: prepare the transparent proxy defines for accepting other OSes This patch does not change the logic of the code, it only changes the way OS-specific defines are tested. At the moment the transparent proxy code heavily depends on Linux-specific defines. This first patch introduces a new define "CONFIG_HAP_TRANSPARENT" which is set every time the defines used by transparent proxy are present. This also means that with an up-to-date libc, it should not be necessary anymore to force CONFIG_HAP_LINUX_TPROXY during the build, as the flags will automatically be detected. The CTTPROXY flags still remain separate because this older API doesn't work the same way. A new line has been added in the version output for haproxy -vv to indicate what transparent proxy support is available.	2013-05-11 08:03:37 +02:00
Willy Tarreau	dd11293e84	BUG/MINOR: acl: fix a double free during exit when using PCRE_JIT When freeing ACL regex, we don't want to perform the free() in regex_free() as it's already performed in free_pattern(). The double free only happens when using PCRE_JIT when freeing everything during exit so it's harmless but exhibits libc errors during a reload/restart. Bug reported by Seri.	2013-05-08 18:14:18 +02:00
Emmanuel Hocdet	7c41a1b59b	MEDIUM: ssl: improve crt-list format to support negation Improve the crt-list file format to allow a rule to negate a certain SNI : <crtfile> [[!]<snifilter> ...] This can be useful when a domain supports a wildcard but you don't want to deliver the wildcard cert for certain specific domains.	2013-05-07 22:11:54 +02:00
de Lafond Guillaume	88c278fadf	MEDIUM: stats: add proxy name filtering on the statistic page This patch adds a "scope" box in the statistics page in order to display only proxies with a name that contains the requested value. The scope filter is preserved across all clicks on the page.	2013-04-15 22:50:33 +02:00
Willy Tarreau	62a6123fed	BUG/MEDIUM: log: fix regression on log-format handling Commit `a4312fa2` merged into dev18 improved log-format management by processing "log-format" and "unique-id-format" where they were declared, so that the faulty args could be reported with their correct line numbers. Unfortunately, the log-format parser considers the proxy mode (TCP/HTTP) and now if the directive is set before the "mode" statement, it can be rejected and report warnings. So we really need to parse these directives at the end of a section at least. Right now we do not have an "end of section" event, so we need to store the file name and line number for each of these directives, and take care of them at the end. One of the benefits is that now the line numbers can be inherited from the line passing "option httplog" even if it's in a defaults section. Future improvements should be performed to report line numbers in every log-format processed by the parser.	2013-04-12 18:13:46 +02:00
Willy Tarreau	ab861d3856	MINOR: ssl: add support for the "alpn" bind keyword The ALPN extension is meant to replace the now deprecated NPN extension. This patch implements support for it. It requires a version of openssl with support for this extension. Patches are available here right now : http://html5labs.interopbridges.com/media/167447/alpn_patches.zip	2013-04-03 02:13:02 +02:00
Willy Tarreau	a4312fa28e	MAJOR: sample: maintain a per-proxy list of the fetch args to resolve While ACL args were resolved after all the config was parsed, it was not the case with sample fetch args because they're almost everywhere now. The issue is that ACLs now solely rely on sample fetches, so their args resolving doesn't work anymore. And many fetches involving a server, a proxy or a userlist don't work at all. The real issue is that at the bottom layers we have no information about proxies, line numbers, even ACLs in order to report understandable errors, and that at the top layers we have no visibility over the locations where fetches are referenced (think log node). After failing multiple unsatisfying solutions attempts, we now have a new concept of args list. The principle is that every proxy has a list head which contains a number of indications such as the config keyword, the context where it's used, the file and line number, etc... and a list of arguments. This list head is of the same type as the elements, so it serves as a template for adding new elements. This way, it is filled from top to bottom by the callers with the information they have (eg: line numbers, ACL name, ...) and the lower layers just have to duplicate it and add an element when they face an argument they cannot resolve yet. Then at the end of the configuration parsing, a loop passes over each proxy's list and resolves all the args in sequence. And this way there is all necessary information to report verbose errors. The first immediate benefit is that for the first time we got very precise location of issues (arg number in a keyword in its context, ...). Second, in order to do this we had to parse log-format and unique-id-format a bit earlier, so that was a great opportunity for doing so when the directives are encountered (unless it's a default section). This way, the recorded line numbers for these args are the ones of the place where the log format is declared, not the end of the file. Userlists report slightly more information now. They're the only remaining ones in the ACL resolving function.	2013-04-03 02:13:02 +02:00
Willy Tarreau	93fddf1dbc	MEDIUM: acl: have a pointer to the keyword name in acl_expr The acl_expr struct used to hold a pointer to the ACL keyword. But since we now have all the relevant pointers, we don't need that anymore, we just need the pointer to the keyword as a string in order to return warnings and error messages. So let's change this in order to remove the dependency on the acl_keyword struct from acl_expr. During this change, acl_cond_kw_conflicts() used to return a pointer to an ACL keyword but had to be changed to return a const char* for the same reason.	2013-04-03 02:13:01 +02:00
Willy Tarreau	acca90d8e5	MINOR: acl: remove the use_count in acl keywords use_cnt is not used at all anymore, let's get rid of it.	2013-04-03 02:13:01 +02:00
Willy Tarreau	5adeda1f63	MAJOR: acl: add option -m to change the pattern matching method ACL expressions now support "-m" in addition to "-i" and "-f". This new option is followed by the name of the pattern matching method to be used on the extracted pattern. This makes it possible to reuse existing sample fetch methods with other matching methods (eg: regex). A "found" matching method ignores any pattern and only verifies that the required sample was found (useful for cookies).	2013-04-03 02:13:01 +02:00
Willy Tarreau	d76a98a5fc	MEDIUM: acl: move the ->parse, ->match and ->smp fields to acl_expr We'll need each ACL expression to be able to support its own parse and match methods, so we're moving these fields to the ACL expression.	2013-04-03 02:13:01 +02:00
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	a91d0a583c	MAJOR: acl: convert all ACL requires to SMP use+val instead of ->requires The ACLs now use the fetch's ->use and ->val to decide upon compatibility between the place where they are used and where the information are fetched. The code is capable of reporting warnings about very fine incompatibilities between certain fetches and an exact usage location, so it is expected that some new warnings will be emitted on some existing configurations. Two degrees of detection are provided : - detecting ACLs that never match - detecting keywords that are ignored All tests show that this seems to work well, though bugs are still possible.	2013-04-03 02:13:00 +02:00
Willy Tarreau	bf8e251077	MINOR: sample: provide a function to report the name of a sample check point We need to put names on places where samples are used in order to emit warnings and errors. Let's do that now.	2013-04-03 02:13:00 +02:00
Willy Tarreau	25320b2906	MEDIUM: proxy: remove acl_requires and just keep a flag "http_needed" Proxy's acl_requires was a copy of all bits taken from ACLs, but we'll get rid of ACL flags and only rely on sample fetches soon. The proxy's acl_requires was only used to allocate an HTTP context when needed, and was even forced in HTTP mode. So better have a flag which exactly says what it's supposed to be used for.	2013-04-03 02:13:00 +02:00
Willy Tarreau	4a96bf5a5d	CLEANUP: acl: remove ACL hooks which were never used These hooks, which established the relation between ACL_USE_* and the location where the ACL were used, were never used because they were superseded with the sample capabilities. Remove them now.	2013-04-03 02:12:59 +02:00
Willy Tarreau	9baae63d8d	MAJOR: acl: remove fetch argument validation from the ACL struct ACL fetch being inherited from the sample fetch keyword, we don't need anymore to specify what function to use to validate the fetch arguments. Note that the job is still done in the ACL parsing code based on elements from the sample fetch structs.	2013-04-03 02:12:59 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	d4c33c8889	MEDIUM: samples: move payload-based fetches and ACLs to their own file The file acl.c is a real mess, it both contains functions to parse and process ACLs, and some sample extraction functions which act on buffers. Some other payload analysers were arbitrarily dispatched to proto_tcp.c. So now we're moving all payload-based fetches and ACLs to payload.c which is capable of extracting data from buffers and rely on everything that is protocol-independant. That way we can safely inflate this file and only use the other ones when some fetches are really specific (eg: HTTP, SSL, ...). As a result of this cleanup, the following new sample fetches became available even if they're not really useful : always_false, always_true, rep_ssl_hello_type, rdp_cookie_cnt, req_len, req_ssl_hello_type, req_ssl_sni, req_ssl_ver, wait_end The function 'acl_fetch_nothing' was wrong and never used anywhere so it was removed. The "rdp_cookie" sample fetch used to have a mandatory argument while it was optional in ACLs, which are supposed to iterate over RDP cookies. So we're making it optional as a fetch too, and it will return the first one.	2013-04-03 02:12:57 +02:00
Willy Tarreau	434c57c95c	MINOR: log: indicate it when some unreliable sample fetches are logged If a log-format involves some sample fetches that may not be present at the logging instant, we can now report a warning. Note that this is done both for log-format and for add-header and carefully respects the original fetch keyword's capabilities.	2013-04-03 02:12:56 +02:00
Willy Tarreau	80aca90ad2	MEDIUM: samples: use new flags to describe compatibility between fetches and their usages Samples fetches were relying on two flags SMP_CAP_REQ/SMP_CAP_RES to describe whether they were compatible with requests rules or with response rules. This was never reliable because we need a finer granularity (eg: an HTTP request method needs to parse an HTTP request, and is available past this point). Some fetches are also dependant on the context (eg: "hdr" uses request or response depending where it's involved, causing some abiguity). In order to solve this, we need to precisely indicate in fetches what they use, and their users will have to compare with what they have. So now we have a bunch of bits indicating where the sample is fetched in the processing chain, with a few variants indicating for some of them if it is permanent or volatile (eg: an HTTP status is stored into the transaction so it is permanent, despite being caught in the response contents). The fetches also have a second mask indicating their validity domain. This one is computed from a conversion table at registration time, so there is no need for doing it by hand. This validity domain consists in a bitmask with one bit set for each usage point in the processing chain. Some provisions were made for upcoming controls such as connection-based TCP rules which apply on top of the connection layer but before instantiating the session. Then everywhere a fetch is used, the bit for the control point is checked in the fetch's validity domain, and it becomes possible to finely ensure that a fetch will work or not. Note that we need these two separate bitfields because some fetches are usable both in request and response (eg: "hdr", "payload"). So the keyword will have a "use" field made of a combination of several SMP_USE_* values, which will be converted into a wider list of SMP_VAL_* flags. The knowledge of permanent vs dynamic information has disappeared for now, as it was never used. Later we'll probably reintroduce it differently when dealing with variables. Its only use at the moment could have been to avoid caching a dynamic rate measurement, but nothing is cached as of now.	2013-04-03 02:12:56 +02:00
Willy Tarreau	e0db1e8946	MEDIUM: acl: remove flag ACL_MAY_LOOKUP which is improperly used This flag is used on ACL matches that support being looking up patterns in trees. At the moment, only strings and IPs support tree-based lookups, but the flag is randomly set also on integers and binary data, and is not even always set on strings nor IPs. Better get rid of this mess by only relying on the matching function to decide whether or not it supports tree-based lookups, this is safer and easier to maintain.	2013-04-03 02:12:56 +02:00
Lukas Tribus	0999f7662c	BUILD: add explicit support for TFO with USE_TFO TCP Fast Open is supported in server mode since Linux 3.7, but current libc's don't define TCP_FASTOPEN=23. Introduce the new USE flag USE_TFO to define it manually in compat.h. Also note this in the TFO related documentation.	2013-04-02 17:40:43 +02:00
Willy Tarreau	0161d62d23	OPTIM: http: improve branching in chunk size parser By tweaking a bit some conditions in http_parse_chunk_size(), we could improve the overall performance in the worst case by 15%.	2013-04-02 02:00:57 +02:00
Willy Tarreau	bf43927cd7	OPTIM: buffer: remove one jump in buffer_count() We can help gcc build an expression not involving a jump. This function is used a lot when parsing chunks.	2013-04-02 01:25:57 +02:00
Hiroaki Nakamura	7035132349	MEDIUM: regex: Use PCRE JIT in acl This is a patch for using PCRE JIT in acl. I notice regex are used in other places, but they are more complicated to modify to use PCRE APIs. So I focused to acl in the first try. BTW, I made a simple benchmark program for PCRE JIT beforehand. https://github.com/hnakamur/pcre-jit-benchmark I read the manual for PCRE JIT http://www.manpagez.com/man/3/pcrejit/ and wrote my benchmark program. https://github.com/hnakamur/pcre-jit-benchmark/blob/master/test-pcre.c	2013-04-02 00:02:54 +02:00
Willy Tarreau	dad36a3ee3	MAJOR: tools: support environment variables in addresses Now that all addresses are parsed using str2sa_range(), it becomes easy to add support for environment variables and use them everywhere an address is needed. Environment variables are used as $VAR or ${VAR} as in shell. Any number of variables may compose an address, allowing various fantasies such as "fd@${FD_HTTP}" or "${LAN_DC1}.1:80". These ones are usable in logs, bind, servers, peers, stats socket, source, dispatch, and check address.	2013-03-11 01:30:02 +01:00
Willy Tarreau	24709286fe	MEDIUM: tools: support specifying explicit address families in str2sa_range() This change allows one to force the address family in any address parsed by str2sa_range() by specifying it as a prefix followed by '@' then the address. Currently supported address prefixes are 'ipv4@', 'ipv6@', 'unix@'. This also helps forcing resolving for host names (when getaddrinfo is used), and force the family of the empty address (eg: 'ipv4@' = 0.0.0.0 while 'ipv6@' = ::). The main benefits is that unix sockets can now get a local name without being forced to begin with a slash. This is useful during development as it is no longer necessary to have stats socket sent to /tmp.	2013-03-10 22:46:55 +01:00
Willy Tarreau	c120c8d347	CLEANUP: minor cleanup in str2sa_range() and str2ip() Don't use a statically allocated address both for str2ip and str2sa_range, use the same. The inet and unix code paths have been splitted a little better to improve readability.	2013-03-10 21:36:31 +01:00
Willy Tarreau	add0ab1975	CLEANUP: tools: remove str2sun() which is not used anymore.	2013-03-08 14:04:54 +01:00
Willy Tarreau	d393a628bb	MINOR: tools: prepare str2sa_range() to accept a prefix We'll need str2sa_range() to support a prefix for unix sockets. Since we don't always want to use it (eg: stats socket), let's not take it unconditionally from global but let the caller pass it.	2013-03-08 14:04:54 +01:00
Willy Tarreau	df350f1f48	MINOR: tools: prepare str2sa_range() to return an error message We'll need str2sa_range() to return address parsing errors if we want to extend its functionalities. Let's do that now eventhough it's not used yet.	2013-03-08 14:04:53 +01:00
Emeric Brun	6924ef8b12	BUG/MEDIUM: ssl: ECDHE ciphers not usable without named curve configured. Fix consists to use prime256v1 as default named curve to init ECDHE ciphers if none configured.	2013-03-06 19:08:26 +01:00
Willy Tarreau	4f4b18b2ec	BUILD/MINOR: syscall: add definition of NR_accept4 for ARM This platform was not covered and older libc do not provide accept4().	2013-03-04 07:38:08 +01:00
Willy Tarreau	b26cc86b1c	BUG/MINOR: syscall: fix NR_accept4 system call on sparc/linux An invalid copy-paste called it NR_splice instead of NR_accept4. This does not lead to real issues because if this define is used, then the code cannot compile since NR_accept4 is still missing.	2013-03-04 07:31:08 +01:00
Willy Tarreau	bfd5946aa1	MINOR: ssl: add a global tunable for the max SSL/TLS record size Add new tunable "tune.ssl.maxrecord". Over SSL/TLS, the client can decipher the data only once it has received a full record. With large records, it means that clients might have to download up to 16kB of data before starting to process them. Limiting the record size can improve page load times on browsers located over high latency or low bandwidth networks. It is suggested to find optimal values which fit into 1 or 2 TCP segments (generally 1448 bytes over Ethernet with TCP timestamps enabled, or 1460 when timestamps are disabled), keeping in mind that SSL/TLS add some overhead. Typical values of 1419 and 2859 gave good results during tests. Use "strace -e trace=write" to find the best value. This trick was first suggested by Mike Belshe : http://www.belshe.com/2010/12/17/performance-and-the-tls-record-size/ Then requested again by Ilya Grigorik who provides some hints here : http://ofps.oreilly.com/titles/9781449344764/_transport_layer_security_tls.html#ch04_00000101	2013-02-21 07:53:13 +01:00
Willy Tarreau	d4448bc836	MEDIUM: tools: make str2sa_range support all address syntaxes Right now we have multiple methods for parsing IP addresses in the configuration. This is quite painful. This patch aims at adapting str2sa_range() to make it support all formats, so that the callers perform the appropriate tests on the return values. str2sa() was changed to simply return str2sa_range(). The output values are now the following ones (taken from the comment on top of the function). Converts <str> to a locally allocated struct sockaddr_storage , and a port range or offset consisting in two integers that the caller will have to check to find the relevant input format. The following format are supported : String format \| address \| port \| low \| high addr \| <addr> \| 0 \| 0 \| 0 addr: \| <addr> \| 0 \| 0 \| 0 addr:port \| <addr> \| <port> \| <port> \| <port> addr:pl-ph \| <addr> \| <pl> \| <pl> \| <ph> addr:+port \| <addr> \| <port> \| 0 \| <port> addr:-port \| <addr> \|-<port> \| <port> \| 0 The detection of a port range or increment by the caller is made by comparing <low> and <high>. If both are equal, then port 0 means no port was specified. The caller may pass NULL for <low> and <high> if it is not interested in retrieving port ranges. Note that <addr> above may also be : - empty ("") => family will be AF_INET and address will be INADDR_ANY - "" => family will be AF_INET and address will be INADDR_ANY - "::" => family will be AF_INET6 and address will be IN6ADDR_ANY - a host name => family and address will depend on host name resolving.	2013-02-20 17:29:30 +01:00
Lukas Tribus	0defb90784	DOC: tfo: bump required kernel to linux-3.7 Support for server side TFO was actually introduced in linux-3.7, linux-3.6 just has client support. This patch fixes documentation and a code comment about the kernel requirement. It also fixes a wrong tfo related code comment in src/proto_tcp.c.	2013-02-14 00:03:04 +01:00
Simon Horman	a2b9dadedd	MEDIUM: checks: Add agent health check Support a agent health check performed by opening a TCP socket to a pre-defined port and reading an ASCII string. The string should have one of the following forms: * An ASCII representation of an positive integer percentage. e.g. "75%" Values in this format will set the weight proportional to the initial weight of a server as configured when haproxy starts. * The string "drain". This will cause the weight of a server to be set to 0, and thus it will not accept any new connections other than those that are accepted via persistence. * The string "down", optionally followed by a description string. Mark the server as down and log the description string as the reason. * The string "stopped", optionally followed by a description string. This currently has the same behaviour as down (iii). * The string "fail", optionally followed by a description string. This currently has the same behaviour as down (iii). A agent health check may be configured using "option lb-agent-chk". The use of an alternate check-port, used to obtain agent heath check information described above as opposed to the port of the service, may be useful in conjunction with this option. e.g. option lb-agent-chk server http1_1 10.0.0.10:80 check port 10000 weight 100 Signed-off-by: Simon Horman <horms@verge.net.au>	2013-02-13 11:03:28 +01:00
Simon Horman	7d09b9a4df	MEDIUM: server: Break out set weight processing code Break out set weight processing code. This is in preparation for reusing the code. Also, remove duplicate check in nested if clauses. {px->lbprm.algo & BE_LB_PROP_DYN) is checked by the immediate outer if clause, so there is no need to check it a second time. Signed-off-by: Simon Horman <horms@verge.net.au>	2013-02-13 10:53:40 +01:00
Simon Horman	5269cfb458	BUG/MINOR: Correct logic in cut_crlf() This corrects what appears to be logic errors in cut_crlf(). I assume that the intention of this function is to truncate a string at the first cr or lf. However, currently lf are ignored. Also use '\0' instead of 0 as the null character, a cosmetic change. Cc: Krzysztof Piotr Oledzki <ole@ans.pl> Signed-off-by: Simon Horman <horms@verge.net.au> [WT: this fix may be backported to 1.4 too]	2013-02-13 10:52:40 +01:00
Marc-Antoine Perennou	992709bad0	MEDIUM: New cli option -Ds for systemd compatibility This patch adds a new option "-Ds" which is exactly like "-D", but instead of forking n times to get n jobs running and then exiting, prefers to wait for all the children it just created. With this done, haproxy becomes more systemd-compliant, without changing anything for other systems. Signed-off-by: Marc-Antoine Perennou <Marc-Antoine@Perennou.com>	2013-02-13 10:47:49 +01:00
Willy Tarreau	6cbbdbf3f3	BUG/MEDIUM: log: emit '-' for empty fields again Commit `2b0108ad` accidently got rid of the ability to emit a "-" for empty log fields. This can happen for captured request and response cookies, as well as for fetches. Since we don't want to have this done for headers however, we set the default log method when parsing the format. It is still possible to force the desired mode using +M/-M.	2013-02-05 18:55:09 +01:00
Emmanuel Hocdet	656233715e	MEDIUM: ssl: add bind-option "strict-sni" This new option ensures that there is no possible fallback to a default certificate if the client does not provide an SNI which is explicitly handled by a certificate.	2013-01-24 17:23:33 +01:00
Willy Tarreau	8dc21faaf7	BUG/MINOR: unix: remove the 'level' field from the ux struct Commit `290e63aa` moved the unix parameters out of the global stats socket to the bind_conf struct. As such the stats admin level was also moved overthere, but it remained in the stats global section where it was not used, except by a nasty memcpy() used to initialize the ux struct in the bind_conf with too large data. Fortunately, the extra data copied were the previous level over the new level so it did not have any impact, but it could have been worse. This bug is 1.5 specific, no backport is needed. Reported-by: Dinko Korunic <dkorunic@reflected.net>	2013-01-24 16:19:19 +01:00
Willy Tarreau	eab777c32e	BUG/MINOR: time: frequency counters are not totally accurate When a frontend is rate-limited to 1000 connections per second, the effective rate measured from the client is 999/s, and connections experience an average response time of 99.5 ms with a standard deviation of 2 ms. The reason for this inaccuracy is that when computing frequency counters, we use one part of the previous value proportional to the number of milliseconds remaining in the current second. But even the last millisecond still uses a part of the past value, which is wrong : since we have a 1ms resolution, the last millisecond must be dedicated only to filling the current second. So we slightly adjust the algorithm to use 999/1000 of the past value during the first millisecond, and 0/1000 of the past value during the last millisecond. We also slightly improve the computation by computing the remaining time instead of the current time in tv_update_date(), so that we don't have to negate the value in each frequency counter. Now with the fix, the connection rate measured by both the client and haproxy is a steady 1000/s, the average response time measured is 99.2ms and more importantly, the standard deviation has been divided by 3 to 0.6 millisecond. This fix should also be backported to 1.4 which has the same issue.	2012-12-29 21:50:07 +01:00
Emeric Brun	22890a1225	MINOR: ssl: Setting global tune.ssl.cachesize value to 0 disables SSL session cache.	2012-12-28 14:48:13 +01:00
Willy Tarreau	ccbcc37a01	MEDIUM: http: add support for "http-request tarpit" rule The "reqtarpit" rule is not very handy to use. Now that we have more flexibility with "http-request", let's finally make the tarpit rules usable there. There are still semantical differences between apply_filters_to_request() and http_req_get_intercept_rule() because the former updates the counters while the latter does not. So we currently have almost similar code leafs for similar conditions, but this should be cleaned up later.	2012-12-28 14:47:19 +01:00
Willy Tarreau	81499eb67d	MEDIUM: http: add support for "http-request redirect" rules These are exactly the same as the classic redirect rules except that they can be interleaved with other http-request rules for more flexibility. The redirect parser should probably be changed to stop at the condition so that the caller puts its own condition pointer. At the moment, the redirect rule and condition are parsed at once by build_redirect_rule() and the condition is assigned to the http_req_rule.	2012-12-28 14:47:19 +01:00
Willy Tarreau	4baae248fc	REORG: config: move the http redirect rule parser to proto_http.c We'll have to use this elsewhere soon, let's move it to the proper place.	2012-12-28 14:47:19 +01:00
Willy Tarreau	71241abfd3	MINOR: http: move redirect rule processing to its own function We now have http_apply_redirect_rule() which does all the redirect-specific job instead of having this inside http_process_req_common(). Also one of the benefit gained from uniformizing this code is that both keep-alive and close response do emit the PR-- flags. The fix for the flags could probably be backported to 1.4 though it's very minor. The previous function http_perform_redirect() was becoming confusing so it was renamed http_perform_server_redirect() since it only applies to server-based redirection.	2012-12-28 14:47:19 +01:00
Willy Tarreau	d79a3b248e	BUG/MINOR: log: make log-format, unique-id-format and add-header more independant It happens that all of them call parse_logformat_line() which sets proxy->to_log with a number of flags affecting the line format for all three users. For example, having a unique-id specified disables the default log-format since fe->to_log is tested when the session is established. Similarly, having "option logasap" will cause "+" to be inserted in unique-id or headers referencing some of the fields depending on LW_BYTES. This patch first removes most of the dependency on fe->to_log whenever possible. The first possible cleanup is to stop checking fe->to_log for being null, considering that it always contains at least LW_INIT when any such usage is made of the log-format! Also, some checks are wrong. s->logs.logwait cannot be nulled by "logwait &= ~LW_" since LW_INIT is always there. This results in getting the wrong log at the end of a request or session when a unique-id or add-header is set, because logwait is still not null but the log-format is not checked. Further cleanups are required. Most LW_ flags should be removed or at least replaced with what they really mean (eg: depend on client-side connection, depend on server-side connection, etc...) and this should only affect logging, not other mechanisms. This patch fixes the default log-format and tries to limit interferences between the log formats, but does not pretend to do more for the moment, since it's the most visible breakage.	2012-12-28 09:51:00 +01:00
Willy Tarreau	20b0de56d4	MEDIUM: http: add http-request 'add-header' and 'set-header' to build headers These two new statements allow to pass information extracted from the request to the server. It's particularly useful for passing SSL information to the server, but may be used for various other purposes such as combining headers together to emulate internal variables.	2012-12-24 15:56:20 +01:00
Willy Tarreau	b83bc1e1c1	MINOR: log: make parse_logformat_string() take a const char * Sometimes we can't pass a char *, and there is no need for this since we strdup() it.	2012-12-24 12:36:33 +01:00
Willy Tarreau	5c2e198390	MINOR: http: prepare to support more http-request actions We'll need to support per-action arguments, so we need to have an "arg" union in http_req_rule.	2012-12-24 12:26:26 +01:00
Willy Tarreau	354898bba9	MINOR: stats: replace STAT_FMT_CSV with STAT_FMT_HTML We need to switch the default mode if we want to add new output formats later. Let CSV be the default and HTML be an option.	2012-12-23 21:46:30 +01:00
Willy Tarreau	56adcf2cc9	MINOR: tools: simplify the use of the int to ascii macros These macros (U2H, U2A, LIM2A, ...) have been used with an explicit index for the local storage variable, making it difficult to change log formats and causing a few issues from time to time. Let's have a single macro with a rotating index so that up to 10 conversions may be used in a single call.	2012-12-23 21:46:30 +01:00
Willy Tarreau	47ca54505c	MINOR: chunks: centralize the trash chunk allocation At the moment, we need trash chunks almost everywhere and the only correctly implemented one is in the sample code. Let's move this to the chunks so that all other places can use this allocator. Additionally, the get_trash_chunk() function now really returns two different chunks. Previously it used to always overwrite the same chunk and point it to a different buffer, which was a bit tricky because it's not obvious that two consecutive results do alias each other.	2012-12-23 21:46:07 +01:00
Willy Tarreau	d9bdcd5139	REORG: stats: massive code reorg and cleanup The dumpstats code looks like a spaghetti plate. Several functions are supposed to be able to do several things but rely on complex states to dispatch the work to independant functions. Most of the HTML output is performed within the switch/case statements of the whole state machine. Let's clean this up by adding new functions to emit the data and have a few more iterators to avoid relying on so complex states. The new stats dump sequence looks like this for CLI and for HTTP : cli_io_handler() -> stats_dump_sess_to_buffer() // "show sess" -> stats_dump_errors_to_buffer() // "show errors" -> stats_dump_raw_info_to_buffer() // "show info" -> stats_dump_raw_info() -> stats_dump_raw_stat_to_buffer() // "show stat" -> stats_dump_csv_header() -> stats_dump_proxy() -> stats_dump_px_hdr() -> stats_dump_fe_stats() -> stats_dump_li_stats() -> stats_dump_sv_stats() -> stats_dump_be_stats() -> stats_dump_px_end() http_stats_io_handler() -> stats_http_redir() -> stats_dump_http() // also emits the HTTP headers -> stats_dump_html_head() // emits the HTML headers -> stats_dump_csv_header() // emits the CSV headers (same as above) -> stats_dump_http_info() // note: ignores non-HTML output -> stats_dump_proxy() // same as above -> stats_dump_http_end() // emits HTML trailer	2012-12-22 20:45:02 +01:00
Willy Tarreau	c83684519b	MEDIUM: log: add the ability to include samples in logs Using %[expression] it becomes possible to make the log engine fetch some samples from the request or the response and provide them in the logs. Note that this feature is still limited, it does not yet allow to apply converters, to limit the output length, nor to specify the direction which should be fetched when a fetch function works in both directions. However it's quite convenient to log SSL information or to include some information that are used in stick tables. It is worth noting that this has been done in the generic log format handler, which means that the same information may be used to build the unique-id header and to pass the information to a backend server.	2012-12-21 19:24:49 +01:00
Willy Tarreau	2b0108adf6	MINOR: log: add lf_text_len This function allows to log a text of a specific length.	2012-12-21 19:24:48 +01:00
Willy Tarreau	e7ad4bb2f0	MINOR: samples: add a function to fetch and convert any sample to a string Any sample type can now easily be converted to a string that can be used anywhere. This will be used for logging and passing information in headers.	2012-12-21 17:57:24 +01:00
Willy Tarreau	8a3f52fc2e	MEDIUM: log-format: make the format parser more robust and more extensible The log-format parser reached a limit making it hard to add new features. It also suffers from a weak handling of certain incorrect corner cases, for example "%{foo}" is emitted as a litteral while syntactically it's an argument to no variable. Also the argument parser had to redo some of the job with some cases causing minor memory leaks (eg: ignored args). This work aims at improving the situation so that slightly better reporting is possible and that it becomes possible to extend the log format. The code has a few more states but looks significantly simpler. The parser is now capable of reporting ignored arguments and truncated lines.	2012-12-20 23:34:20 +01:00
Willy Tarreau	c5259fdc57	MINOR: log: add a tag for amount of bytes uploaded from client to server For POST, PUT, CONNECT or tunnelled connections, it's annoying not to have the amount of uploaded bytes in the logs. %U now reports this value.	2012-12-20 15:38:04 +01:00
Willy Tarreau	5fb3803f4b	CLEANUP: buffer: use buffer_empty() instead of buffer_len()==0 A few places still made use of buffer_len()==0 to detect an empty buffer. Use the cleaner and more efficient buffer_empty() instead.	2012-12-17 01:14:49 +01:00
Willy Tarreau	7d28149e92	BUG/MEDIUM: connection: always update connection flags prior to computing polling stream_int_chk_rcv_conn() did not clear connection flags before updating them. It is unsure whether this could have caused the stalled transfers that have been reported since dev15. In order to avoid such further issues, we now use a simple inline function to do all the job.	2012-12-17 01:14:25 +01:00
Willy Tarreau	4a29144591	OPTIM: poll: optimize fd management functions for low register count CPUs Looking at the assembly code that updt_fd() and alloc/release_spec_entry produce in the polling loops, it's clear that gcc has to recompute pointers several times in a row because of limited spare registers. By better grouping adjacent structure updates, we improve the code size by around 60 bytes in the fast path on x86.	2012-12-13 23:34:18 +01:00
Willy Tarreau	20d46a5a95	CLEANUP: session: use an array for the stick counters The stick counters were in two distinct sets of struct members, causing some code to be duplicated. Now we use an array, which enables some processing to be performed in loops. This allowed the code to be shrunk by 700 bytes.	2012-12-09 15:57:16 +01:00
Willy Tarreau	5d5b5d8eaf	MEDIUM: proto_tcp: add support for tracking L7 information Until now it was only possible to use track-sc1/sc2 with "src" which is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch as well as any transformation type. It works just like the "stick" directive. Samples are automatically converted to the correct types for the table. Only "tcp-request content" rules may use L7 information, and such information must already be present when the tracking is set up. For example it becomes possible to track the IP address passed in the X-Forwarded-For header. HTTP request processing now also considers tracking from backend rules because we want to be able to update the counters even when the request was already parsed and tracked. Some more controls need to be performed (eg: samples do not distinguish between L4 and L6).	2012-12-09 14:08:47 +01:00
Willy Tarreau	ef9a360555	MEDIUM: connection: introduce "struct conn_src" for servers and proxies Both servers and proxies share a common set of parameters for outgoing connections, and since they're not stored in a similar structure, a lot of code is duplicated in the connection setup, which is one sensible area. Let's first define a common struct for these settings and make use of it. Next patches will de-duplicate code. This change also fixes a build breakage that happens when USE_LINUX_TPROXY is not set but USE_CTTPROXY is set, which seem to be very unlikely considering that the issue was introduced almost 2 years ago an never reported.	2012-12-09 10:04:39 +01:00
Willy Tarreau	02777a1df5	CLEANUP: connection: remove unused server/proxy/task/si_applet declarations These ones are left-overs from the code before the introduction of obj_type.	2012-12-08 21:43:36 +01:00
Willy Tarreau	55e4ecd928	MINOR: stats: add a few more information on session dump We also report fd.spec_p, fd.updt and a few names instead of the values.	2012-12-08 17:48:47 +01:00
Emeric Brun	af9619da3e	MEDIUM: ssl: manage shared cache by blocks for huge sessions. Sessions using client certs are huge (more than 1 kB) and do not fit in session cache, or require a huge cache. In this new implementation sshcachesize set a number of available blocks instead a number of available sessions. Each block is large enough (128 bytes) to store a simple session (without client certs). Huge sessions will take multiple blocks depending on client certificate size. Note: some unused code for session sync with remote peers was temporarily removed.	2012-12-04 10:56:56 +01:00
Willy Tarreau	20879a0233	MEDIUM: connection: add error reporting for the SSL Get a bit more info in the logs when client-side SSL handshakes fail.	2012-12-03 17:21:52 +01:00
Willy Tarreau	8e3bf699db	MEDIUM: connection: add error reporting for the PROXY protocol header When the PROXY protocol header is expected and fails, leading to an abort of the incoming connection, we now emit a log message. If option dontlognull is set and it was just a port probe, then nothing is logged.	2012-12-03 17:21:51 +01:00
Willy Tarreau	0af2912fd1	MEDIUM: connection: add minimal error reporting in logs for incomplete connections Since the introduction of SSL, it became quite annoying not to get any useful info in logs about handshake failures. Let's improve reporting for embryonic sessions by checking a per-connection error code and reporting it into the logs if an error happens before the session is completely instanciated. The "dontlognull" option is supported in that if a connection does not talk before being aborted, nothing will be emitted. At the moment, only timeouts are considered for SSL and the PROXY protocol, but next patches will handle more errors.	2012-12-03 15:38:23 +01:00
Willy Tarreau	14cba4b0b1	MEDIUM: connection: add an error code in connections This will be needed to improve error reporting, especially for SSL.	2012-12-03 14:22:13 +01:00
Emeric Brun	786991e8b7	BUG/MEDIUM: ssl: Fix handshake failure on session resumption with client cert. Openssl session_id_context was not set on cached sessions so handshake returns an error.	2012-11-26 18:43:21 +01:00
Willy Tarreau	77e3af9e6f	MINOR: tcp: add support for the "v4v6" bind option Commit `9b6700f` added "v6only". As suggested by Vincent Bernat, it is sometimes useful to have the opposite option to force binding to the two protocols when the system is configured to bind to v6 only by default. This option does exactly this. v6only still has precedence.	2012-11-24 15:07:23 +01:00
Willy Tarreau	5e16cbc3bd	MINOR: stats: report the total number of compressed responses per front/back Depending on the content-types and accept-encoding fields, some responses might or might not be compressed. Let's have a counter of the number of compressed responses and report it in the stats to help improve compression usage. Some cosmetic issues were fixed in the CSV output too (missing commas at the end).	2012-11-24 14:54:13 +01:00
Willy Tarreau	9b6700f673	MINOR: tcp: add support for the "v6only" bind option This option forces a socket to bind to IPv6 only when it uses the default address (eg: ":::80").	2012-11-24 12:20:28 +01:00
Willy Tarreau	36fb02c526	BUG/MEDIUM: connection: always disable polling upon error Commit `0ffde2cc` in 1.5-dev13 tried to always disable polling on file descriptors when errors were encountered. Unfortunately it did not always succeed in doing so because it relied on detecting polling changes to disable it. Let's use a dedicated conn_stop_polling() function that is inconditionally called upon error instead. This managed to stop a busy loop observed when a health check makes use of the send-proxy protocol and fails before the connection can be established.	2012-11-24 11:09:07 +01:00
Willy Tarreau	f0837b259b	MEDIUM: tcp: add explicit support for delayed ACK in connect() Commit `24db47e0` tried to improve support for delayed ACK upon connect but it was incomplete, because checks with the proxy protocol would always enable polling for data receive and there was no way of distinguishing data polling and delayed ack. So we add a distinct delack flag to the connect() function so that the caller decides whether or not to use a delayed ack regardless of pending data (eg: when send-proxy is in use). Doing so covers all combinations of { (check with data), (sendproxy), (smart-connect) }.	2012-11-24 10:24:27 +01:00
Willy Tarreau	2b199c9ac3	MEDIUM: connection: provide a common conn_full_close() function Several places got the connection close sequence wrong because it was not obvious. In practice we always need the same sequence when aborting, so let's have a common function for this.	2012-11-23 17:32:21 +01:00
Willy Tarreau	5a78f36db3	MAJOR: checks: rework completely bogus state machine The porting of checks to using connections was totally bogus. Some checks were considered successful as soon as the connection was established, regardless of any response. Some errors would be triggered upon recv if polling was enabled for send or if the send channel was shut down. Now the behaviour is much better. It would be cleaner to perform the fd_delete() in wake_srv_chk() and to process failures and timeouts separately, but this is already a good start.	2012-11-23 12:47:05 +01:00
Willy Tarreau	d3aac7088e	CLEANUP: checks: rename some server check flags Some server check flag names were not properly choosen and cause analysis trouble, especially the CHK_RUNNING one which does not mean that a check is running but that the server is running... Here's the rename : CHK_RUNNING -> CHK_PASSED CHK_ERROR -> CHK_FAILED	2012-11-23 11:32:12 +01:00
Willy Tarreau	55058a7c1e	MINOR: stats: report HTTP compression stats per frontend and per backend It was a bit frustrating to have no idea about the bandwidth saved by HTTP compression. Now we have per-frontend and per-backend stats. The stats on the HTTP interface are shown in a hover title in the "bytes out" column if at least something was fed to the compressor. 3 new columns appeared in the CSV stats output.	2012-11-22 01:07:40 +01:00
Willy Tarreau	193b8c6168	MINOR: http: allow the cookie capture size to be changed Some users need more than 64 characters to log large cookies. The limit was set to 63 characters (and not 64 as previously documented). Now it is possible to change this using the global "tune.http.cookielen" setting if required.	2012-11-22 00:44:27 +01:00
Willy Tarreau	88c6d81386	MINOR: http: add some debugging functions to pretty-print msg state names The http_msg_state_str() function reports a string containing the name of the state passed in argument. This helps while debugging.	2012-11-21 21:50:04 +01:00
William Lallemand	072a2bf537	MINOR: compression: CPU usage limit New option 'maxcompcpuusage' in global section. Sets the maximum CPU usage HAProxy can reach before stopping the compression for new requests or decreasing the compression level of current requests. It works like 'maxcomprate' but with the Idle.	2012-11-21 02:15:16 +01:00
William Lallemand	e3a7d99062	MINOR: compression: report zlib memory usage Show the memory usage and the max memory available for zlib. The value stored is now the memory used instead of the remaining available memory.	2012-11-21 02:15:16 +01:00
William Lallemand	8b52bb3878	MEDIUM: compression: use pool for comp_ctx Use pool for comp_ctx, it is allocated during the comp_algo->init(). The allocation of comp_ctx is accounted for in the zlib_memory_available.	2012-11-21 01:56:47 +01:00
Willy Tarreau	bc174aa144	MINOR: cli: report connection status in "show sess xxx" Connection flags, targets and transport layers are now reported in "show sess $PTR", as it is an absolute requirement in debugging.	2012-11-19 16:22:22 +01:00
William Lallemand	bf3ae61789	MEDIUM: compression: don't compress when no data This patch makes changes in the http_response_forward_body state machine. It checks if the compress algorithm had consumed data before swapping the temporary and the input buffer. So it prevents null sized zlib chunks.	2012-11-19 14:57:29 +01:00
Willy Tarreau	16a2147dfe	MEDIUM: adjust the maxaccept per listener depending on the number of processes global.tune.maxaccept was used for all listeners. This becomes really not convenient when some listeners are bound to a single process and other ones are bound to many processes. Now we change the principle : we count the number of processes a listener is bound to, and apply the maxaccept either entirely if there is a single process, or divided by twice the number of processes in order to maintain fairness. The default limit has also been increased from 32 to 64 as it appeared that on small machines, 32 was too low to achieve high connection rates.	2012-11-19 12:39:59 +01:00
Willy Tarreau	37994f034c	MINOR: standard: add a simple popcount function This function returns the number of ones in a word.	2012-11-19 12:12:24 +01:00
Emeric Brun	4f65bff1a5	MINOR: ssl: Add tune.ssl.lifetime statement in global. Sets the ssl session <lifetime> in seconds. Openssl default is 300 seconds.	2012-11-16 16:47:20 +01:00
Willy Tarreau	fc6c032d8d	MEDIUM: global: add support for CPU binding on Linux ("cpu-map") The new "cpu-map" directive allows one to assign the CPU sets that a process is allowed to bind to. This is useful in combination with the "nbproc" and "bind-process" directives. The support is implicit on Linux 2.6.28 and above.	2012-11-16 16:16:53 +01:00
William Lallemand	ec3e3890f0	BUG/MINOR: compression: deinit zlib only when required The zlib stream was deinitialized even when the init failed.	2012-11-15 15:42:17 +01:00
Emeric Brun	4663577e24	MINOR: build: allow packagers to specify the ssl cache size This is done by passing the default value to SSLCACHESIZE in sessions. User can use tune.sslcachesize to change this value. By default, it is set to 20000 sessions as openssl internal cache size. Currently, a session entry size is between 592 and 616 bytes depending on the arch.	2012-11-15 10:52:19 +01:00
Willy Tarreau	3fdb366885	MAJOR: connection: replace struct target with a pointer to an enum Instead of storing a couple of (int, ptr) in the struct connection and the struct session, we use a different method : we only store a pointer to an integer which is stored inside the target object and which contains a unique type identifier. That way, the pointer allows us to retrieve the object type (by dereferencing it) and the object's address (by computing the displacement in the target structure). The NULL pointer always corresponds to OBJ_TYPE_NONE. This reduces the size of the connection and session structs. It also simplifies target assignment and compare. In order to improve the generated code, we try to put the obj_type element at the beginning of all the structs (listener, server, proxy, si_applet), so that the original and target pointers are always equal. A lot of code was touched by massive replaces, but the changes are not that important.	2012-11-12 00:42:33 +01:00
Willy Tarreau	128b03c9ab	CLEANUP: stream_interface: remove the external task type target Before connections were introduced, it was possible to connect an external task to a stream interface. However it was left as an exercise for the brave implementer to find how that ought to be done. The feature was broken since the introduction of connections and was never fixed since due to lack of users. Better remove this dead code now.	2012-11-11 23:14:16 +01:00
Willy Tarreau	b31c971bef	CLEANUP: channel: remove any reference of the hijackers Hijackers were functions designed to inject data into channels in the distant past. They became unused around 1.3.16, and since there has not been any user of this mechanism to date, it's uncertain whether the mechanism still works (and it's not really useful anymore). So better remove it as well as the pointer it uses in the channel struct.	2012-11-11 23:05:39 +01:00
Willy Tarreau	50fc7777c6	MEDIUM: http: refrain from sending "Connection: close" when Upgrade is present Some servers are not totally HTTP-compliant when it comes to parsing the Connection header. This is particularly true with WebSocket where it happens from time to time that a server doesn't support having a "close" token along with the "Upgrade" token in the Connection header. This broken behaviour has also been noticed on some clients though the problem is less frequent on the response path. Sometimes the workaround consists in enabling "option http-pretend-keepalive" to leave the request Connection header untouched, but this is not always the most convenient solution. This patch introduces a new solution : haproxy now also looks for the "Upgrade" token in the Connection header and if it finds it, then it refrains from adding any other token to the Connection header (though "keep-alive" and "close" may still be removed if found). The same is done for the response headers. This way, WebSocket much with less changes even when facing non-compliant clients or servers. At least it fixes the DISCONNECT issue that was seen on the websocket.org test. Note that haproxy does not change its internal mode, it just refrains from adding new tokens to the connection header.	2012-11-11 22:40:00 +01:00
Willy Tarreau	70c6fd82c3	MAJOR: polling: remove unused callbacks from the poller struct Since no poller uses poller->{set,clr,wai,is_set,rem} anymore, let's remove them and remove the associated pointer tests in proto/fd.h.	2012-11-11 21:02:34 +01:00
Willy Tarreau	e9f49e78fe	MAJOR: polling: replace epoll with sepoll and remove sepoll Now that all pollers make use of speculative I/O, there is no point having two epoll implementations, so replace epoll with the sepoll code and remove sepoll which has just become the standard epoll method.	2012-11-11 20:53:30 +01:00
Willy Tarreau	7f7ad91056	BUILD: stream_interface: remove si_fd() and its references si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which defines this name for its own purpose. It's easy enough to remove this one-liner function, so let's do it.	2012-11-11 20:53:29 +01:00
Willy Tarreau	09f24569d4	REORG: fd: centralize the processing of speculative events Speculative events are independant on the poller, so they can be centralized in fd.c.	2012-11-11 17:45:39 +01:00
Willy Tarreau	6ea20b1acb	REORG: fd: move the fd state management from ev_sepoll ev_sepoll already provides everything needed to manage FD events by only manipulating the speculative I/O list. Nothing there is sepoll-specific so move all this to fd.	2012-11-11 17:45:39 +01:00
Willy Tarreau	7be79a41e1	REORG: fd: move the speculative I/O management from ev_sepoll The speculative I/O will need to be ported to all pollers, so move this to fd.c.	2012-11-11 17:45:39 +01:00
William Lallemand	d85f917daf	MINOR: compression: maximum compression rate limit This patch adds input and output rate calcutation on the HTTP compresion feature. Compression can be limited with a maximum rate value in kilobytes per second. The rate is set with the global 'maxcomprate' option. You can change this value dynamicaly with 'set rate-limit http-compression global' on the UNIX socket.	2012-11-10 17:47:27 +01:00
William Lallemand	f3747837e5	MINOR: compression: tune.comp.maxlevel This option allows you to set the maximum compression level usable by the compression algorithm. It affects CPU usage.	2012-11-10 17:47:07 +01:00
Willy Tarreau	037d2c1f8f	MAJOR: sepoll: make the poller totally event-driven At the moment sepoll is not 100% event-driven, because a call to fd_set() on an event which is already being polled will not change its state. This causes issues with OpenSSL because if some I/O processing is interrupted after clearing the I/O event (eg: read all data from a socket, can't put it all into the buffer), then there is no way to call the SSL_read() again once the buffer releases some space. The only real solution is to go 100% event-driven. The principle is to use the spec list as an event cache and that each time an I/O event is reported by epoll_wait(), this event is automatically scheduled for addition to the spec list for future calls until the consumer explicitly asks for polling or stopping. Doing this is a bit tricky because sepoll used to provide a substantial number of optimizations such as event merging. These optimizations have been maintained : a dedicated update list is affected when events change, but not the event list, so that updates may cancel themselves without any side effect such as displacing events. A specific case was considered for handling newly created FDs as soon as they are detected from within the poll loop. This ensures that their read or write operation will always be attempted as soon as possible, thus reducing the number of poll loops and process_session wakeups. This is especially true for newly accepted fds which immediately perform their first recv() call. Two new flags were added to the fdtab[] struct to tag the fact that a file descriptor already exists in the update list. One flag indicates that a file descriptor is new and has just been created (fdtab[].new) and the other one indicates that a file descriptor is already referenced by the update list (fdtab[].updated). Even if the FD state changes during operations or if the fd is closed and replaced, it's not an issue because the update flag remains and is easily spotted during list walks. The flag must absolutely reflect the presence of the fd in the update list in order to avoid overflowing the update list with more events than there are distinct fds. Note that this change also recovers the small performance loss introduced by its connection counter-part and goes even beyond.	2012-11-10 00:17:27 +01:00
Willy Tarreau	c8dd77fddf	MAJOR: connection: remove the CO_FL_CURR_*_POL flag This is the first step of a series of changes aiming at making the polling totally event-driven. This first change consists in only remembering at the connection level whether an FD was enabled or not, regardless of the fact it was being polled or cached. From now on, an EAGAIN will always be considered as a change so that the pollers are able to manage a cache and to flush it based on such events. One of the noticeable effect is that conn_fd_handler() is called once more per session (6 instead of 5 min) but other update functions are less called. Note that the performance loss caused by this change at the moment is quite significant, around 2.5%, but the change is needed to have SSL working correctly in all situations, even when data were read from the socket and stored in the invisible cache, waiting for some room in the channel's buffer.	2012-11-09 22:09:33 +01:00
William Lallemand	9d5f5480fd	MEDIUM: compression: limit RAM usage With the global maxzlibmem option, you are able ton control the maximum amount of RAM usable for HTTP compression. A test is done before each zlib allocation, if the there isn't available memory, the test fail and so the zlib initialization, so data won't be compressed.	2012-11-08 15:23:30 +01:00
William Lallemand	2b50247695	MEDIUM: use pool for zlib Don't use the zlib allocator anymore, 5 pools are used for the zlib compression. Their sizes depends of the window size and the memLevel in deflateInit2.	2012-11-08 15:23:29 +01:00
William Lallemand	a509e4c332	MINOR: compression: memlevel and windowsize The window size and the memlevel of the zlib are now configurable using global options tune.zlib.memlevel and tune.zlib.windowsize. It affects the memory consumption of the zlib.	2012-11-08 15:23:29 +01:00
William Lallemand	08289f12f9	BUILD: remove dependency to zlib.h The build was dependent of the zlib.h header, regardless of the USE_ZLIB option. The fix consists of several #ifdef in the source code. It removes the overhead of the zstream structure in the session when you don't use the option.	2012-11-05 10:23:16 +01:00
William Lallemand	1c2d622d82	CLEANUP: use struct comp_ctx instead of union Replace union comp_ctx by struct comp_ctx. Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of compression.h functions.	2012-11-05 10:23:16 +01:00
Willy Tarreau	ed7f836f07	BUG/MINOR: stream_interface: don't loop over ->snd_buf() It is stupid to loop over ->snd_buf() because the snd_buf() itself already loops and stops when system buffers are full. But looping again onto it, we lose the information of the full buffers and perform one useless syscall. Furthermore, this causes issues when dealing with large uploads while waiting for a connection to establish, as it can report a server reject of some data as a connection abort, which is wrong. 1.4 does not have this issue as it loops maximum twice (once for each buffer half) and exists as soon as system buffers are full. So no backport is needed.	2012-10-29 23:30:33 +01:00
Willy Tarreau	07115412d3	MEDIUM: stick-table: allocate the table key of size buffer size Keys are copied from samples to stick_table_key. If a key is larger than the stick_table_key, we have an overflow. In pratice it does not happen because it requires : 1) a configuration with tune.bufsize larger than BUFSIZE (common) 2) a stick-table configured with keys strictly larger than buffers 3) extraction of data larger than BUFSIZE (eg: using payload()) Points 2 and 3 don't make any sense for a real world configuration. That said the issue needs be fixed. The solution consists in allocating it the same size as the global buffer size, just like the samples. This fixes the issue.	2012-10-29 21:56:59 +01:00
Willy Tarreau	7e2c647ee7	MEDIUM: remove remains of BUFSIZE in HTTP auth and sample conversions Sample conversions rely on two alternative buffers which were previously allocated as static bufs of size BUFSIZE. Now they're initialized to the global buffer size. It was the same for HTTP authentication. Note that it seems that none of them was prone to any mistake when dealing with the buffer size, but better stay on the safe side by maintaining the old assumption that a trash buffer is always "large enough".	2012-10-29 20:44:36 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	7780473c3b	CLEANUP: replace chunk_printf() with chunk_appendf() This function's naming was misleading as it is used to append data at the end of a string, causing some surprizes when used for the first time! Add a chunk_printf() function which does what its name suggests.	2012-10-29 16:14:26 +01:00
Willy Tarreau	c26ac9deea	MINOR: chunk: add a function to reset a chunk This is a first step in avoiding to constantly reinitialize chunks. It replaces the old chunk_reset() which was not properly named as it used to drop everything and was only used by chunk_destroy(). It has been renamed chunk_drop().	2012-10-29 13:33:42 +01:00
Yuxans Yao	4e25b015a7	MINOR: log: add '%Tl' to log-format The '%Tl' is similar to '%T', but using local timezone.	2012-10-29 11:55:26 +01:00
Willy Tarreau	70737d142f	MINOR: compression: add an offload option to remove the Accept-Encoding header This is used when it is desired that backend servers don't compress (eg: because of buggy implementations).	2012-10-27 01:13:24 +02:00
Willy Tarreau	f2943dccd0	MAJOR: session: detach the connections from the stream interfaces We will need to be able to switch server connections on a session and to keep idle connections. In order to achieve this, the preliminary requirement is that the connections can survive the session and be detached from them. Right now they're still allocated at exactly the same place, so when there is a session, there are always 2 connections. We could soon improve on this by allocating the outgoing connection only during a connect(). This current patch touches a lot of code and intentionally does not change any functionnality. Performance tests show no regression (even a very minor improvement). The doc has not yet been updated.	2012-10-26 20:15:20 +02:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
Willy Tarreau	422a0a5161	MINOR: tools: add a clear_addr() function to unset an address This will be used to unset a from address.	2012-10-26 20:04:26 +02:00
Emeric Brun	a7aa309c44	MINOR: ssl: add 'crt' statement on server. crt: client certificate to send	2012-10-26 15:10:10 +02:00
William Lallemand	82fe75c1a7	MEDIUM: HTTP compression (zlib library support) This commit introduces HTTP compression using the zlib library. http_response_forward_body has been modified to call the compression functions. This feature includes 3 algorithms: identity, gzip and deflate: * identity: this is mostly for debugging, and it was useful for developping the compression feature. With Content-Length in input, it is making each chunk with the data available in the current buffer. With chunks in input, it is rechunking, the output chunks will be bigger or smaller depending of the size of the input chunk and the size of the buffer. Identity does not apply any change on data. * gzip: same as identity, but applying a gzip compression. The data are deflated using the Z_NO_FLUSH flag in zlib. When there is no more data in the input buffer, it flushes the data in the output buffer (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in input, or when there is no more data to read, it writes the end of data with Z_FINISH and the ending chunk. * deflate: same as gzip, but with deflate algorithm and zlib format. Note that this algorithm has ambiguous support on many browsers and no support at all from recent ones. It is strongly recommended not to use it for anything else than experimentation. You can't choose the compression ratio at the moment, it will be set to Z_BEST_SPEED (1), as tests have shown very little benefit in terms of compression ration when going above for HTML contents, at the cost of a massive CPU impact. Compression will be activated depending of the Accept-Encoding request header. With identity, it does not take care of that header. To build HAProxy with zlib support, use USE_ZLIB=1 in the make parameters. This work was initially started by David Du Colombier at Exceliance.	2012-10-26 02:30:48 +02:00
Willy Tarreau	54d23dfc07	CLEANUP: http: rename HTTP_MSG_DATA_CRLF state This state's name is confusing as it is only used with chunked encoding and makes newcomers think it's also related to the content-length. Let's call it CHUNK_CRLF to clear any doubt on this.	2012-10-26 01:13:52 +02:00
Willy Tarreau	3dd0c4e20e	OPTIM: tools: inline hex2i() This tiny function was not inlined because initially not much used. However it's been used un the chunk parser for a while and it became one of the most CPU-cycle eater there. By inlining it, the chunk parser speed was increased by 74 %. We're almost 3 times faster than original with just the last 4 commits.	2012-10-26 01:13:24 +02:00
Willy Tarreau	55a6906125	OPTIM: channel: inline channel_forward's fast path Most calls to channel_forward() are performed with short byte counts and are already optimized in channel_forward() taking just a few instructions. Thus it's a waste of CPU cycles to call a function for this, let's just inline the short byte count case and fall back to the common one for remaining situations. Doing so has increased the chunked encoding parser's performance by 12% !	2012-10-26 01:08:01 +02:00
Emeric Brun	a068a2951d	MINOR: sample: export 'sample_get_trash_chunk(void)' This will be used on external fetch modules.	2012-10-22 18:54:24 +02:00
Emeric Brun	07ca496ea9	MINOR: acl: add parse and match primitives to use binary type on ACLs Binary ACL match patterns can now be entered as hex digit strings.	2012-10-22 18:54:24 +02:00
Willy Tarreau	2e845be249	MEDIUM: sample: pass an empty list instead of a null for fetch args ACL and sample fetches use args list and it is really not convenient to check for null args everywhere. Now for empty args we pass a constant list of end of lists. It will allow us to remove many useless checks.	2012-10-19 19:49:09 +02:00
Willy Tarreau	ad8f8e8ffb	MINOR: chunk: provide string compare functions It's sometimes needed to be able to compare a zero-terminated string with a chunk, so we now have two functions to do that, one strcmp() equivalent and one strcasecmp() equivalent.	2012-10-19 15:18:06 +02:00
Willy Tarreau	6c9a3d5585	MEDIUM: ssl: add support for the "npn" bind keyword The ssl_npn match could not work by itself because clients do not use the NPN extension unless the server advertises the protocols it supports. Thanks to Simone Bordet for the explanations on how to get it right.	2012-10-18 19:03:00 +02:00
Willy Tarreau	378e041797	OPTIM: connection: pack the struct target The struct target contains one int and one pointer, causing it to be 64-bit aligned on 64-bit platforms. By marking it "packed", we can save 8 bytes in struct connection and as many in struct session on such platforms.	2012-10-13 14:33:58 +02:00
Willy Tarreau	109e95a1b4	OPTIM: session: reorder struct session fields A reorering of the struct session fields has increased overall performance by almost 1% due to better cache usage.	2012-10-13 11:22:24 +02:00
Willy Tarreau	c93f7959e5	CLEANUP: session: remove term_trace which is not used anymore This field was used to trace precisely where a session was terminated but it did not survive code rearchitecture and was not used at all anymore. Let's get rid of it.	2012-10-13 11:10:30 +02:00
Willy Tarreau	0a8535fec8	OPTIM: channel: reorganize struct members to improve cache efficiency Now that the buffer is moved out of the channel, it is possible to move the pointer earlier in the struct and reorder some fields. This new ordering improves overall performance by 2%, mainly saved in the HTTP parsers and data transfers.	2012-10-13 10:55:22 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	974ced6305	CLEANUP: channel: use 'chn' instead of 'buf' as local variable names It's too confusing to see buf->buf everywhere where the first buf is a channel. Let's fix this now.	2012-10-12 23:11:02 +02:00
Willy Tarreau	394db379eb	REORG: http: rename msg->buf to msg->chn since it's a channel It's extremely confusing to have all those msg->buf->buf everywhere after the extraction of the buffer from the channel. Let's clean this up.	2012-10-12 22:40:39 +02:00
Willy Tarreau	ffc3fcd6da	MEDIUM: log: report SSL ciphers and version in logs using logformat %sslc/%sslv These two new log-format tags report the SSL protocol version (%sslv) and the SSL ciphers (%sslc) used for the connection with the client. For instance, to append these information just after the client's IP/port address information on an HTTP log line, use the following configuration : log-format %Ci:%Cp\ %sslv:%sslc\ [%t]\ %ft\ %b/%s\ %Tq/%Tw/%Tc/%Tr/%Tt\ %st\ %B\ %cc\ \ %cs\ %tsc\ %ac/%fc/%bc/%sc/%rc\ %sq/%bq\ %hr\ %hs\ %{+Q}r It will report a line such as the following one : Oct 12 20:47:30 haproxy[9643]: 127.0.0.1:43602 TLSv1:AES-SHA [12/Oct/2012:20:47:30.303] stick2~ stick2/s1 7/0/12/0/19 200 145 - - ---- 0/0/0/0/0 0/0 "GET /?t=0 HTTP/1.0"	2012-10-12 20:48:51 +02:00
Willy Tarreau	4f65356a22	MINOR: log: make lf_text use a const char * lf_text() should use a const char * otherwise it makes it more complex to use data coming from const strings.	2012-10-12 20:30:51 +02:00
Willy Tarreau	93dbc2bc0e	MEDIUM: log: add a new LW_XPRT flag to pin the transport layer This flag will have to be set on log tags which require transport layer information. They will prevent the conn_xprt_close() call from releasing the transport layer too early.	2012-10-12 20:30:51 +02:00
Willy Tarreau	1e954913de	MEDIUM: connection: add a flag to hold the transport layer When we start logging SSL information, we need the SSL struct to be present even past the conn_xprt_close() call. In order to achieve this, we should use refcounting on the connection and the transport layer. At the moment it's not worth using plain refcounting as only the logs require this, so instead of real refcounting we just use a flag which will be set by the log subsystem when SSL data need to be logged. What happens then is that the xprt->close() call is ignored and the transport layer is closed again during session_free(), after the log line is emitted.	2012-10-12 20:30:50 +02:00
Willy Tarreau	6c03a64978	MEDIUM: connection: always unset the transport layer upon close When calling conn_xprt_close(), we always clear the transport pointer so that all transport layers leave the connection in the same state after a close. This will also make it safer and cheaper to call conn_xprt_close() multiple times if needed.	2012-10-12 17:03:04 +02:00
Willy Tarreau	773d65f413	MEDIUM: log: suffix the frontend's name with '~' when using SSL Until now it was not possible to know from the logs whether the incoming connection was made over SSL or not. In order to address this in the existing log formats, a new log format %ft was introduced, to log the frontend's name suffixed with its transport layer. The only transport layer in use right now is '~' for SSL, so that existing log formats for non-SSL traffic are not affected at all, and SSL log formats have the frontend's name suffixed with '~'. The TCP, HTTP and CLF log format now use %ft instead of %f. This does not affect existing log formats which still make use of %f however.	2012-10-12 14:56:11 +02:00
Emeric Brun	ef42d9219d	MINOR: ssl: add statements 'verify', 'ca-file' and 'crl-file' on servers. It now becomes possible to verify the server's certificate using the "verify" directive. This one only supports "none" and "required", as it does not make much sense to also support "optional" here.	2012-10-12 12:05:15 +02:00
Emeric Brun	f9c5c4701c	MINOR: ssl: add statement 'no-tls-tickets' on server side.	2012-10-12 11:48:55 +02:00
Emeric Brun	94324a4c87	MINOR: ssl: move ssl context init for servers from cfgparse.c to ssl_sock.c	2012-10-12 11:37:36 +02:00
Emeric Brun	992adc9210	BUG/MINOR: ssl: Fix issue on server statements 'no-tls' and 'no-sslv3' bit field collision with 'force-tlsv'.	2012-10-12 11:27:57 +02:00
Willy Tarreau	21faa91be6	MINOR: server: add minimal infrastructure to parse keywords Just like with the "bind" lines, we'll switch the "server" line parsing to keyword registration. The code is essentially the same as for bind keywords, with minor changes such as support for the default-server keywords and support for variable argument count.	2012-10-10 17:42:39 +02:00
Willy Tarreau	7fca87fd9d	BUILD: accept4: move the socketcall declaration outside of accept4() Gcc 4.2.4 breaks on the syscall declared inside the function, move it outside and declare it static inline.	2012-10-10 17:42:39 +02:00
Willy Tarreau	1bc4aab290	MEDIUM: listener: add support for linux's accept4() syscall On Linux, accept4() does the same as accept() except that it allows the caller to specify some flags to set on the resulting socket. We use this to set the O_NONBLOCK flag and thus to save one fcntl() call in each connection. The effect is a small performance gain of around 1%. The option is automatically enabled when target linux2628 is set, or when the USE_ACCEPT4 Makefile variable is set. If the libc is too old to provide the equivalent function, this is automatically detected and our own function is used instead. In any case it is possible to force the use of our implementation with USE_MY_ACCEPT4.	2012-10-08 20:11:03 +02:00
Willy Tarreau	1b6c00cb99	BUG/MAJOR: ensure that hdr_idx is always reserved when L7 fetches are used Baptiste Assmann reported a bug causing a crash on recent versions when sticking rules were set on layer 7 in a TCP proxy. The bug is easier to reproduce with the "defer-accept" option on the "bind" line in order to have some contents to parse when the connection is accepted. The issue is that the acl_prefetch_http() function called from HTTP fetches relies on hdr_idx to be preinitialized, which is not the case if there is no L7 ACL. The solution consists in adding a new SMP_CAP_L7 flag to fetches to indicate that they are expected to work on L7 data, so that the proxy knows that the hdr_idx has to be initialized. This is already how ACL and HTTP mode are handled. The bug was present since 1.5-dev9.	2012-10-05 22:46:09 +02:00
Emeric Brun	76d8895c49	MINOR: ssl: add defines LISTEN_DEFAULT_CIPHERS and CONNECT_DEFAULT_CIPHERS. These ones are used to set the default ciphers suite on "bind" lines and "server" lines respectively, instead of using OpenSSL's defaults. These are probably mainly useful for distro packagers.	2012-10-05 22:11:15 +02:00
Emeric Brun	8694b9a682	MINOR: ssl: add 'force-sslv3' and 'force-tlsvXX' statements on server These options force the SSL lib to use the specified protocol when connecting to a server. They are complentary to no-tlsv*/no-sslv3.	2012-10-05 22:05:04 +02:00
Emeric Brun	2cb7ae5302	MINOR: ssl: add 'force-sslv3' and 'force-tlsvXX' statements on bind. These options force the SSL lib to use the specified protocol. They are complentary to no-tlsv*/no-sslv3.	2012-10-05 22:02:42 +02:00
Emeric Brun	8967549d52	MINOR: ssl: use bit fields to store ssl options instead of one int each Too many SSL options already and some still to come, use a bit field and get rid of all the integers. No functional change here.	2012-10-05 21:53:59 +02:00
Emeric Brun	fb510ea2b9	MEDIUM: conf: rename 'cafile' and 'crlfile' statements 'ca-file' and 'crl-file' These names were not really handy.	2012-10-05 21:50:43 +02:00
Emeric Brun	9b3009b440	MEDIUM: conf: rename 'nosslv3' and 'notlsvXX' statements 'no-sslv3' and 'no-tlsvXX'. These ones were really not easy to read nor write, and become confusing with the next ones to be added.	2012-10-05 21:47:42 +02:00
Emeric Brun	c8e8d12257	MINOR: ssl: add 'crt-base' and 'ca-base' global statements. 'crt-base' sets root directory used for relative certificates paths. 'ca-base' sets root directory used for relative CAs and CRLs paths.	2012-10-05 21:46:52 +02:00
Emeric Brun	3c4bc6e10a	MINOR: ssl: remove prefer-server-ciphers statement and set it as the default on ssl listeners.	2012-10-05 20:02:06 +02:00
Willy Tarreau	1c862c5920	MEDIUM: tcp: enable TCP Fast Open on systems which support it If TCP_FASTOPEN is defined, then the "tfo" option is supported on "bind" lines to enable TCP Fast Open (linux >= 3.6).	2012-10-05 16:22:35 +02:00
Willy Tarreau	6c16adc661	MEDIUM: checks: enable the PROXY protocol with health checks When health checks are configured on a server which has the send-proxy directive and no "port" nor "addr" settings, the health check connections will automatically use the PROXY protocol. If "port" or "addr" are set, the "check-send-proxy" directive may be used to force the protocol.	2012-10-05 00:33:14 +02:00
Willy Tarreau	f4288ee4ba	MEDIUM: check: add the ctrl and transport layers in the server check structure Since it's possible for the checks to use a different protocol or transport layer than the prod traffic, we need to have them referenced in the server. The SSL checks are not enabled yet, but the transport layers are completely used.	2012-10-05 00:33:14 +02:00
Willy Tarreau	1ae1b7b53c	MEDIUM: checks: use real buffers to store requests and responses Till now the request was made in the trash and sent to the network at once, and the response was read into a preallocated char[]. Now we allocate a full buffer for both the request and the response, and make use of it. Some of the operations will probably be replaced later with buffer macros but the point was to ensure we could migrate to use the data layers soon. One nice improvement caused by this change is that requests are now formed at the beginning of the check and may safely be sent in multiple chunks if needed.	2012-10-05 00:33:14 +02:00
Willy Tarreau	5b3a202f78	REORG: server: move the check-specific parts into a check subsection The health checks in the servers are becoming a real mess, move them into their own subsection. We'll soon need to have a struct buffer to replace the char * as well as check-specific protocol and transport layers.	2012-10-05 00:33:14 +02:00
Willy Tarreau	5f1504f524	MEDIUM: connection: add a new local send-proxy transport callback This callback sends a PROXY protocol line on the outgoing connection, with the local and remote endpoint information. This is used for local connections (eg: health checks) where the other end needs to have a valid address and no connection is relayed.	2012-10-05 00:32:35 +02:00
Willy Tarreau	e1e4a61e7a	REORG: connection: move the PROXY protocol management to connection.c It was previously in frontend.c but there is no reason for this anymore considering that all the information involved is in the connection itself only. Theorically this should be in the socket layer but we don't have this yet.	2012-10-05 00:32:33 +02:00
Willy Tarreau	0ffde2cc3f	MEDIUM: connection: automatically disable polling on error We absolutely want to disable FD polling after an error is detected, otherwise the data layer has to do it and it's far from being obvious at these layers. The way we did it was a bit tricky in conn_update__polling and conn__polling_changes. However it has almost no impact on performance and code size both for the fast and slow path. We'll now be able to remove some flag updates in the stream interface.	2012-10-04 22:26:11 +02:00
Willy Tarreau	2396c1c4a2	MEDIUM: connection: make it possible for data->wake to return an error Just like ->init(), ->wake() may now be used to return an error and abort the connection. Currently this is not used but will be with embryonic sessions.	2012-10-04 22:26:10 +02:00
Willy Tarreau	9e272bf95d	MEDIUM: connection: only call the data->wake callback on activity We now check the connection flags for changes in order not to call the data->wake callback when there is no activity. Activity means a change on any of the CO_FL__SH, CO_FL_ERROR, CO_FL_CONNECTED, CO_FL_WAIT_CONN flags, as well as a call to data->recv or data->send.	2012-10-04 22:26:10 +02:00
Willy Tarreau	f3a6d7e115	MEDIUM: connection: reorganize connection flags The connection flags have progressively been added one after the other and were not very well organized. Some of them are often used together and a number of operations are performed on the DATA/SOCK ENA/POL flags. Thus, they have been reorganized so that flags that work together are close to each other (allows immediate operands on ARM) and that polling changes can be detected with fewer operations using a simple shift and xor. The handshakes are now the last ones so that it will be easier to add new ones after without risking a collision. All activity-related flags are also grouped together.	2012-10-04 22:26:10 +02:00
Willy Tarreau	071e137ec2	MEDIUM: connection: use a generic data-layer init() callback The generic data-layer init callback is now used after the transport layer is complete and before calling the data layer recv/send callbacks. This allows the session to switch from the embryonic session data layer to the complete stream interface data layer, by making conn_session_complete() the data layer's init callback. It sill looks awkwards that the init() callback must be used opon error, but except by adding yet another one, it does not seem to be mergeable into another function (eg: it should probably not be merged with ->wake to avoid unneeded calls during the handshake, though semantically that would make sense).	2012-10-04 22:26:10 +02:00
Willy Tarreau	f4e114fe54	MINOR: connection: add an init callback to the data_cb struct This callback is used to initialize the data layer.	2012-10-04 22:26:10 +02:00
Willy Tarreau	bd99aab91f	MINOR: connection: split conn_prepare() in two functions We'll also need a function to takeover an existing connection without reinitializing it. The same will be needed at the stream interface level.	2012-10-04 22:26:10 +02:00
Willy Tarreau	4aa3683b2d	MINOR: connection: provide a generic data layer wakeup callback Instead of calling conn_notify_si() from the connection handler, we now call data->wake(), which will allow us to use a different callback with health checks. Note that we still rely on a flag in order to decide whether or not to call this function. The reason is that with embryonic sessions, the callback is already initialized to si_conn_cb without the flag, and we can't call the SI notify function in the leave path before the stream interface is initialized. This issue should be addressed by involving a different data_cb for embryonic sessions and for stream interfaces, that would be changed during session_complete() for the final data_cb.	2012-10-04 22:26:10 +02:00
Willy Tarreau	74beec32a5	REORG: connection: rename app_cb "data" Now conn->data will designate the data layer which is the client for the transport layer. In practice it's the stream interface and will soon also be the health checks.	2012-10-04 22:26:10 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	8c89c2059f	MINOR: buffers: add a few functions to write chars, strings and blocks bo_put{chr,blk,str,chk} are used to write data on the output of a buffer. Output is truncated if the buffer is not large enough.	2012-10-04 22:26:09 +02:00
Willy Tarreau	8113a5d78f	BUG/MINOR: config: use a copy of the file name in proxy configurations Each proxy contains a reference to the original config file and line number where it was declared. The pointer used is just a reference to the one passed to the function instead of being duplicated. The effect is that it is not valid anymore at the end of the parsing and that all proxies will be enumerated as coming from the same file on some late configuration errors. This may happen for exmaple when reporting SSL certificate issues. By copying using strdup(), we avoid this issue. 1.4 has the same issue, though no report of the proxy file name is done out of the config section. Anyway a backport is recommended to ease post-mortem analysis.	2012-10-04 08:13:32 +02:00
Willy Tarreau	d1a33e35fb	BUG/MEDIUM: proxy: must not try to stop disabled proxies upon reload Herv� Commowick reported an issue : haproxy dies in a segfault during a soft restart if it tries to pause a disabled proxy. This is because disabled proxies have no management task so we must not wake the task up. This could easily remain unnoticed since the old process was expected to go away, so having it go away faster was not really troubling. However, with sync peers, it is obvious that there is no peer sync during this reload. This issue has been introduced in 1.5-dev7 with the removal of the maintain_proxies() function. No backport is needed.	2012-10-04 00:20:55 +02:00
Emeric Brun	2d0c482682	MINOR: ssl: add statement 'no-tls-tickets' on bind to disable stateless session resumption Disables the stateless session resumption (RFC 5077 TLS Ticket extension) and force to use stateful session resumption. Stateless session resumption is more expensive in CPU usage.	2012-10-02 16:05:33 +02:00
Emeric Brun	c0ff4924c0	MINOR: ssl : add statements 'notlsv11' and 'notlsv12' and rename 'notlsv1' to 'notlsv10'. This is because "notlsv1" used to disable TLSv1.0 only and had no effect on v1.1/v1.2. so better have an option for each version. This applies both to "bind" and "server" statements.	2012-10-02 08:34:38 +02:00
Emeric Brun	9faf071acb	MINOR: ssl: add build param USE_PRIVATE_CACHE to build cache without shared memory It removes dependencies with futex or mutex but ssl performances decrease using nbproc > 1 because switching process force session renegotiation. This can be useful on small systems which never intend to run in multi-process mode.	2012-10-02 08:34:38 +02:00
Emeric Brun	4b3091e54e	MINOR: ssl: disable shared memory and locks on session cache if nbproc == 1 We don't needa to lock the memory when there is a single process. This can make a difference on small systems where locking is much more expensive than just a test.	2012-10-02 08:34:38 +02:00
Emeric Brun	81c00f0a7a	MINOR: ssl: add ignore verify errors options Allow to ignore some verify errors and to let them pass the handshake. Add option 'crt-ignore-err <list>' Ignore verify errors at depth == 0 (client certificate) <list> is string 'all' or a comma separated list of verify error IDs (see http://www.openssl.org/docs/apps/verify.html) Add option 'ca-ignore-err <list>' Same as 'crt-ignore-err' for all depths > 0 (CA chain certs) Ex ignore all errors on CA and expired or not-yet-valid errors on client certificate: bind 0.0.0.0:443 ssl crt crt.pem verify required cafile ca.pem ca-ignore-err all crt-ignore-err 10,9	2012-10-02 08:32:50 +02:00
Emeric Brun	d94b3fe98f	MEDIUM: ssl: add client certificate authentication support Add keyword 'verify' on bind: 'verify none': authentication disabled (default) 'verify optional': accept connection without certificate and process a verify if the client sent a certificate 'verify required': reject connection without certificate and process a verify if the client send a certificate Add keyword 'cafile' on bind: 'cafile <path>' path to a client CA file used to verify. 'crlfile <path>' path to a client CRL file used to verify.	2012-10-02 08:04:49 +02:00
Emeric Brun	2b58d040b6	MINOR: ssl: add elliptic curve Diffie-Hellman support for ssl key generation Add 'ecdhe' on 'bind' statement: to set named curve used to generate ECDHE keys (ex: ecdhe secp521r1)	2012-10-02 08:03:21 +02:00
Willy Tarreau	cd379950a7	MINOR: connection: add a pointer to the connection owner This will be needed to find the stream interface from the connection once they're detached, but in the more immediate term, we'll need this for health checks since they don't use a stream interface.	2012-09-28 00:01:22 +02:00
Willy Tarreau	dda5e7c986	CLEANUP: connection: offer conn_prepare() to set up a connection This will be used by checks as well as stream interfaces.	2012-09-24 22:49:06 +02:00
Willy Tarreau	c53d42256d	MEDIUM: stats: remove the stats_sock struct from the global struct Now the stats socket is allocated when the 'stats socket' line is parsed, and assigned using the standard str2listener(). This has two effects : - more than one stats socket can now be declared - stats socket now support protocols other than UNIX The next step is to remove the duplicate bind config parsing.	2012-09-24 10:53:16 +02:00
Willy Tarreau	4fbb2285e2	MINOR: config: make str2listener() use memprintf() to report errors. This will make it possible to use the function for other listening sockets.	2012-09-24 10:53:16 +02:00
Willy Tarreau	eb6cead1de	MINOR: standard: make memprintf() support a NULL destination Doing so removes many checks that were systematically made because the callees don't know if the caller passed a valid pointer.	2012-09-24 10:53:16 +02:00
Willy Tarreau	ce39bfb7c4	BUG: backend: balance hdr was broken since 1.5-dev11 Alex Markham reported and diagnosed a bug appearing on 1.5-dev11, causing a crash on x86_64 when header hashing is used. The cause is a missing (int) cast causing a negative offset to appear positive and the resulting pointer to go out of bounds. The crash is not possible anymore since 1.5-dev12 because a second bug caused the negative sign to disappear so the pointer is always within range but always wrong, so balance hdr() never works anymore. This fix restores the correct behaviour and ensures the sign is correct.	2012-09-22 18:36:29 +02:00
Willy Tarreau	290e63aa87	REORG: listener: move unix perms from the listener to the bind_conf Unix permissions are per-bind configuration line and not per listener, so let's concretize this in the way the config is stored. This avoids some unneeded loops to set permissions on all listeners. The access level is not part of the unix perms so it has been moved away. Once we can use str2listener() to set all listener addresses, we'll have a bind keyword parser for this one.	2012-09-20 18:07:14 +02:00
Willy Tarreau	4348fad1c1	MAJOR: listeners: use dual-linked lists to chain listeners with frontends Navigating through listeners was very inconvenient and error-prone. Not to mention that listeners were linked in reverse order and reverted afterwards. In order to definitely get rid of these issues, we now do the following : - frontends have a dual-linked list of bind_conf - frontends have a dual-linked list of listeners - bind_conf have a dual-linked list of listeners - listeners have a pointer to their bind_conf This way we can now navigate from anywhere to anywhere and always find the proper bind_conf for a given listener, as well as find the list of listeners for a current bind_conf.	2012-09-20 16:48:07 +02:00
Willy Tarreau	28a47d6408	MINOR: config: pass the file and line to config keyword parsers This will be needed when we need to create bind config settings.	2012-09-18 20:02:48 +02:00
Willy Tarreau	51fb7651c4	MINOR: listener: add a scope field in the bind keyword lists This scope is used to report what the keywords are used for (eg: TCP, UNIX, ...). It is now reported by bind_dump_kws().	2012-09-18 18:27:14 +02:00
Willy Tarreau	8638f4850f	MEDIUM: config: enumerate full list of registered "bind" keywords upon error When an unknown "bind" keyword is detected, dump the list of all registered keywords. Unsupported default alternatives are also reported as "not supported".	2012-09-18 18:27:14 +02:00
Willy Tarreau	79eeafacb4	MEDIUM: move bind SSL parsing to ssl_sock Registering new SSL bind keywords was not particularly handy as it required many #ifdef in cfgparse.c. Now the code has moved to ssl_sock.c which calls a register function for all the keywords. Error reporting was also improved by this move, because the called functions build an error message using memprintf(), which can span multiple lines if needed, and each of these errors will be displayed indented in the context of the bind line being processed. This is important when dealing with certificate directories which can report multiple errors.	2012-09-18 16:20:01 +02:00
Willy Tarreau	269826659d	MEDIUM: listener: add a minimal framework to register "bind" keyword options With the arrival of SSL, the "bind" keyword has received even more options, all of which are processed in cfgparse in a cumbersome way. So it's time to let modules register their own bind options. This is done very similarly to the ACLs with a small difference in that we make the difference between an unknown option and a known, unimplemented option.	2012-09-15 22:33:08 +02:00
Willy Tarreau	88500de69e	CLEANUP: listener: remove unused conf->file and conf->line These ones are already in bind_conf.	2012-09-15 22:29:33 +02:00
Willy Tarreau	2a65ff014e	MEDIUM: config: replace ssl_conf by bind_conf Some settings need to be merged per-bind config line and are not necessarily SSL-specific. It becomes quite inconvenient to have this ssl_conf SSL-specific, so let's replace it with something more generic.	2012-09-15 22:29:33 +02:00
Willy Tarreau	d1d5454180	REORG: split "protocols" files into protocol and listener It was becoming confusing to have protocols and listeners in the same files, split them.	2012-09-15 22:29:32 +02:00
Willy Tarreau	21c705b0f8	MINOR: config: add a function to indent error messages Bind parsers may return multiple errors, so let's make use of a new function to re-indent multi-line error messages so that they're all reported in their context.	2012-09-15 22:29:27 +02:00
Willy Tarreau	2e1dca8f52	MEDIUM: http: add "redirect scheme" to ease HTTP to HTTPS redirection For instance : redirect scheme https if !{ is_ssl }	2012-09-12 08:43:15 +02:00
Emeric Brun	fc0421fde9	MEDIUM: ssl: add support for SNI and wildcard certificates A side effect of this change is that the "ssl" keyword on "bind" lines is now just a boolean and that "crt" is needed to designate certificate files or directories. Note that much refcounting was needed to have the free() work correctly due to the number of cert aliases which can make a context be shared by multiple names.	2012-09-10 09:27:02 +02:00
Willy Tarreau	f5ae8f7637	MEDIUM: config: centralize handling of SSL config per bind line SSL config holds many parameters which are per bind line and not per listener. Let's use a per-bind line config instead of having it replicated for each listener. At the moment we only do this for the SSL part but this should probably evolved to handle more of the configuration and maybe even the state per bind line.	2012-09-08 08:31:50 +02:00
Willy Tarreau	403edff4b8	MEDIUM: config: implement maxsslconn in the global section SSL connections take a huge amount of memory, and unfortunately openssl does not check malloc() returns and easily segfaults when too many connections are used. The only solution against this is to provide a global maxsslconn setting to reject SSL connections above the limit in order to avoid reaching unsafe limits.	2012-09-06 12:10:43 +02:00
David BERARD	e566ecbea8	MEDIUM: ssl: add support for prefer-server-ciphers option I wrote a small path to add the SSL_OP_CIPHER_SERVER_PREFERENCE OpenSSL option to frontend, if the 'prefer-server-ciphers' keyword is set. Example : bind 10.11.12.13 ssl /etc/haproxy/ssl/cert.pem ciphers RC4:HIGH:!aNULL:!MD5 prefer-server-ciphers This option mitigate the effect of the BEAST Attack (as I understand), and it equivalent to : - Apache HTTPd SSLHonorCipherOrder option. - Nginx ssl_prefer_server_ciphers option. [WT: added a test for the support of the option]	2012-09-04 15:35:32 +02:00
Willy Tarreau	ff9f7698fc	BUILD: fix build error without SSL (ssl_cert) One last-minute optimization broke the build without SSL support. Move ssl_cert out of the #ifdef/#endif and it's OK.	2012-09-04 15:13:20 +02:00
Willy Tarreau	d50265aa0e	BUILD: include sys/socket.h to fix build failure on FreeBSD Joris Dedieu reported that include/common/standard.h needs this.	2012-09-04 14:18:33 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	c230b8bfb6	MEDIUM: config: add "nosslv3" and "notlsv1" on bind and server lines This is aimed at disabling SSLv3 and TLSv1 respectively. SSLv2 is always disabled. This can be used in some situations where one version looks more suitable than the other.	2012-09-03 23:55:16 +02:00
Willy Tarreau	d7aacbffcb	MEDIUM: config: add a "ciphers" keyword to set SSL cipher suites This is supported for both servers and listeners. The cipher suite simply follows the "ciphers" keyword.	2012-09-03 23:43:25 +02:00
Emeric Brun	fc32acafcd	MINOR: ssl add global setting tune.sslcachesize to set SSL session cache size. This new global setting allows the user to change the SSL cache size in number of sessions. It defaults to 20000.	2012-09-03 22:36:33 +02:00
Emeric Brun	3e541d1c03	MEDIUM: ssl: add shared memory session cache implementation. This SSL session cache was developped at Exceliance and is the same that was proposed for stunnel and stud. It makes use of a shared memory area between the processes so that sessions can be handled by any process. It is only useful when haproxy runs with nbproc > 1, but it does not hurt performance at all with nbproc = 1. The aim is to totally replace OpenSSL's internal cache. The cache is optimized for Linux >= 2.6 and specifically for x86 platforms. On Linux/x86, it makes use of futexes for inter-process locking, with some x86 assembly for the locked instructions. On other architectures, GCC builtins are used instead, which are available starting from gcc 4.1. On other operating systems, the locks fall back to pthread mutexes so libpthread is automatically linked. It is not recommended since pthreads are much slower than futexes. The lib is only linked if SSL is enabled.	2012-09-03 22:36:33 +02:00
Emeric Brun	e1f38dbb44	MEDIUM: ssl: protect against client-initiated renegociation CVE-2009-3555 suggests that client-initiated renegociation should be prevented in the middle of data. The workaround here consists in having the SSL layer notify our callback about a handshake occurring, which in turn causes the connection to be marked in the error state if it was already considered established (which means if a previous handshake was completed). The result is that the connection with the client is immediately aborted and any pending data are dropped.	2012-09-03 22:03:17 +02:00
Emeric Brun	01f8e2f61b	MEDIUM: config: add support for the 'ssl' option on 'server' lines This option currently takes no option and simply turns SSL on for all connections going to the server. It is likely that more options will be needed in the future.	2012-09-03 22:02:21 +02:00
Emeric Brun	6e159299f1	MEDIUM: config: add the 'ssl' keyword on 'bind' lines "bind" now supports "ssl" followed by a PEM cert+key file name.	2012-09-03 20:49:14 +02:00
Emeric Brun	4659195e31	MEDIUM: ssl: add new files ssl_sock.[ch] to provide the SSL data layer This data layer supports socket-to-buffer and buffer-to-socket operations. No sock-to-pipe nor pipe-to-sock functions are provided, since splicing does not provide any benefit with data transformation. At best it could save a memcpy() and avoid keeping a buffer allocated but that does not seem very useful. An init function and a close function are provided because the SSL context needs to be allocated/freed. A data-layer shutw() function is also provided because upon successful shutdown, we want to store the SSL context in the cache in order to reuse it for future connections and avoid a new key generation. The handshake function is directly called from the connection handler. At this point it is not certain whether this will remain this way or if a new ->handshake callback will be added to the data layer so that the connection handler doesn't care about SSL. The sock-to-buf and buf-to-sock functions are all capable of enabling the SSL handshake at any time. This also implies polling in the opposite direction to what was expected. The upper layers must take that into account (it is OK right now with the stream interface).	2012-09-03 20:49:14 +02:00
Emeric Brun	7dd0e505ca	MEDIUM: connection: add a new handshake flag for SSL (CO_FL_SSL_WAIT_HS). This flag is part of the CO_FL_HANDSHAKE family since the SSL handshake may appear at any time.	2012-09-03 20:49:14 +02:00
Emeric Brun	c6545acee0	MINOR: server: add SSL context to servers if USE_OPENSSL is defined This will be needed to accept outgoing SSL connections.	2012-09-03 20:49:14 +02:00
Emeric Brun	0b8d4d9372	MINOR: protocol: add SSL context to listeners if USE_OPENSSL is defined This will be needed to accept incoming SSL connections.	2012-09-03 20:49:14 +02:00
Willy Tarreau	dd2f85eb3b	CLEANUP: includes: fix includes for a number of users of fd.h It appears that fd.h includes a number of unneeded files and was included from standard.h, and as such served as an intermediary to provide almost everything to everyone. By removing its useless includes, a long dependency chain broke but could easily be fixed.	2012-09-03 20:49:14 +02:00
Willy Tarreau	45dab73788	CLEANUP: fdtab: flatten the struct and merge the spec struct with the rest The "spec" sub-struct was using 8 bytes for only 5 needed. There is no reason to keep it as a struct, it doesn't bring any value. By flattening it, we can merge the single byte with the next single byte, resulting in an immediate saving of 4 bytes (20%). Interestingly, tests have shown a steady performance gain of 0.6% after this change, which can possibly be attributed to a more cache-line friendly struct.	2012-09-03 20:49:14 +02:00
Willy Tarreau	40ff59d820	CLEANUP: fd: remove fdtab->flags These flags were added for TCP_CORK. They were only set at various places but never checked by any user since TCP_CORK was replaced with MSG_MORE. Simply get rid of this now.	2012-09-03 20:49:14 +02:00
Willy Tarreau	56a77e5933	MEDIUM: connection: complete the polling cleanups I/O handlers now all use __conn_{sock,data}_{stop,poll,want}_* instead of returning dummy flags. The code has become slightly simpler because some tricks such as the MIN_RET_FOR_READ_LOOP are not needed anymore, and the data handlers which switch to a handshake handler do not need to disable themselves anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	e9dfa79a75	MAJOR: connection: rearrange the polling flags. Polling flags were set for data and sock layer, but while this does make sense for the ENA flag, it does not for the POL flag which translates the detection of an EAGAIN condition. So now we remove the {DATA,SOCK}_POL* flags and instead introduce two new layer-independant flags (WANT_RD and WANT_WR). These flags are only set when an EAGAIN is encountered so that polling can be enabled. In order for these flags to have any meaning they are not persistent and have to be cleared by the connection handler before calling the I/O and data callbacks. For this reason, changes detection has been slightly improved. Instead of comparing the WANT_* flags with CURR_*_POL, we only check if the ENA status changes, or if the polling appears, since we don't want to detect the useless poll to ena transition. Tests show that this has eliminated one useless call to __fd_clr(). Finally the conn_set_polling() function which was becoming complex and required complex operations from the caller was split in two and replaced its two only callers (conn_update_data_polling and conn_update_sock_polling). The two functions are now much smaller due to the less complex conditions. Note that it would be possible to re-merge them and only pass a mask but this does not appear much interesting.	2012-09-03 20:47:35 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	22cda21ad5	MAJOR: connection: make the PROXY decoder a handshake handler The PROXY protocol is now decoded in the connection before other handshakes. This means that it may be extracted from a TCP stream before SSL is decoded from this stream.	2012-09-03 20:47:35 +02:00
Willy Tarreau	2542b53b19	MAJOR: session: introduce embryonic sessions When an incoming connection request is accepted, a connection structure is needed to store its state. However we don't want to fully initialize a session until the data layer is about to be ready. As long as the connection is physically stored into the session, it's not easy to split both allocations. As such, we only initialize the minimum requirements of a session, which results in what we call an embryonic session. Then once the data layer is ready, we can complete the function's initialization. Doing so avoids buffers allocation and ensures that a session only sees ready connections. The frontend's client timeout is used as the handshake timeout. It is likely that another timeout will be used in the future.	2012-09-03 20:47:35 +02:00
Willy Tarreau	15678efc45	MEDIUM: connection: add an ->init function to data layer SSL need to initialize the data layer before proceeding with data. At the moment, this data layer is automatically initialized from itself, which will not be possible once we extract connection from sessions since we'll only create the data layer once the handshake is finished. So let's have the application layer initialize the data layer before using it.	2012-09-03 20:47:34 +02:00
Willy Tarreau	64ee491309	MINOR: tcp: replace tcp_src_to_stktable_key with addr_to_stktable_key Make it more obvious that this function does not depend on any knowledge of the session. This is important to plan for TCP rules that can run on connection without any initialized session yet.	2012-09-03 20:47:34 +02:00
Willy Tarreau	14f8e86da5	MEDIUM: proto_tcp: remove any dependence on stream_interface The last uses of the stream interfaces were in tcp_connect_server() and could easily and more appropriately be moved to its callers, si_connect() and connect_server(), making a lot more sense. Now the function should theorically be usable for health checks. It also appears more obvious that the file is split into two distinct parts : - the protocol layer used at the connection level - the tcp analysers executing tcp-* rules and their samples/acls.	2012-09-03 20:47:34 +02:00
Willy Tarreau	93b0f4f6c6	MEDIUM: stream_interface: remove CAP_SPLTCP/CAP_SPLICE flags These ones are implicitly handled by the connection's data layer, no need to rely on them anymore and reaching them maintains undesired dependences on stream-interface.	2012-09-03 20:47:34 +02:00
Willy Tarreau	986a9d2d12	MAJOR: connection: move the addr field from the stream_interface We need to have the source and destination addresses in the connection. They were lying in the stream interface so let's move them. The flags SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well. It's worth noting that tcp_connect_server() almost does not use the stream interface anymore except for a few flags. It has been identified that once we detach the connection from the SI, it will probably be needed to keep a copy of the server-side addresses in the SI just for logging purposes. This has not been implemented right now though.	2012-09-03 20:47:34 +02:00
Willy Tarreau	3cefd521fa	REORG: connection: move the target pointer from si to connection The target is per connection and is directly used by the connection, so we need it there. It's not needed anymore in the SI however.	2012-09-03 20:47:34 +02:00
Willy Tarreau	8263d2b259	CLEANUP: channel: use "channel" instead of "buffer" in function names This is a massive rename of most functions which should make use of the word "channel" instead of the word "buffer" in their names. In concerns the following ones (new names) : unsigned long long channel_forward(struct channel buf, unsigned long long bytes); static inline void channel_init(struct channel buf) static inline int channel_input_closed(struct channel buf) static inline int channel_output_closed(struct channel buf) static inline void channel_check_timeouts(struct channel b) static inline void channel_erase(struct channel buf) static inline void channel_shutr_now(struct channel buf) static inline void channel_shutw_now(struct channel buf) static inline void channel_abort(struct channel buf) static inline void channel_stop_hijacker(struct channel buf) static inline void channel_auto_connect(struct channel buf) static inline void channel_dont_connect(struct channel buf) static inline void channel_auto_close(struct channel buf) static inline void channel_dont_close(struct channel buf) static inline void channel_auto_read(struct channel buf) static inline void channel_dont_read(struct channel buf) unsigned long long channel_forward(struct channel *buf, unsigned long long bytes) Some functions provided by channel.[ch] have kept their "buffer" name because they are really designed to act on the buffer according to some information gathered from the channel. They have been moved together to the same place in the file for better readability but they were not changed at all. The "buffer" memory pool was also renamed "channel".	2012-09-03 20:47:33 +02:00
Willy Tarreau	03cdb7c678	CLEANUP: channel: usr CF_/CHN_ prefixes instead of BF_/BUF_ Get rid of these confusing BF_* flags. Now channel naming should clearly be used everywhere appropriate. No code was changed, only a renaming was performed. The comments about channel operations was updated.	2012-09-03 20:47:33 +02:00
Willy Tarreau	af81935b82	REORG: channel: move buffer_{replace,insert_line}* to buffer.{c,h} These functions do not depend on the channel flags anymore thus they're much better suited to be used on plain buffers. Move them from channel to buffer.	2012-09-03 20:47:33 +02:00
Willy Tarreau	f941cf2ef2	MAJOR: channel: remove the BF_FULL flag This is similar to the recent removal of BF_OUT_EMPTY. This flag was very problematic because it relies on permanently changing information such as the to_forward value, so it had to be updated upon every change to the buffers. Previous patch already got rid of its users. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's input and output is more reliable than checking the flag's replacement.	2012-09-03 20:47:33 +02:00
Willy Tarreau	42d06661a2	MINOR: buffer: provide a new buffer_full() function This one only focuses on the input part of the buffer and is dedicated to analysers.	2012-09-03 20:47:33 +02:00
Willy Tarreau	ad1cc3df9c	MINOR: channel: rename bi_full to channel_full as it checks the whole channel Since the function takes care of the forward count and involves more than buffer knowledge, rename it.	2012-09-03 20:47:32 +02:00
Willy Tarreau	a75bcef867	REORG: buffer: move buffer_flush, b_adv and b_rew to buffer.h These one now operate over real buffers, not channels anymore.	2012-09-03 20:47:32 +02:00
Willy Tarreau	8e21bb9e52	MAJOR: channel: remove the BF_OUT_EMPTY flag This flag was very problematic because it was composite in that both changes to the pipe or to the buffer had to cause this flag to be updated, which is not always simple (eg: there may not even be a channel attached to a buffer at all). There were not that many users of this flags, mostly setters. So the flag got replaced with a macro which reports whether the channel is empty or not, by checking both the pipe and the buffer. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's output size is more useful than checking the flag's replacement.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c578891112	CLEANUP: connection: split sock_ops into data_ops, app_cp and si_ops Some parts of the sock_ops structure were only used by the stream interface and have been moved into si_ops. Some of them were callbacks to the stream interface from the connection and have been moved into app_cp as they're the application seen from the connection (later, health-checks will need to use them). The rest has moved to data_ops. Normally at this point the connection could live without knowing about stream interfaces at all.	2012-09-03 20:47:31 +02:00
Willy Tarreau	96199b1016	MAJOR: stream-interface: restore splicing mechanism The splicing is now provided by the data-layer rcv_pipe/snd_pipe functions which in turn are called by the stream interface's recv and send callbacks. The presence of the rcv_pipe/snd_pipe functions is used to attest support for splicing at the data layer. It looks like the stream-interface's SI_FL_CAP_SPLICE flag does not make sense anymore as it's used as a proxy for the pointers above. It also appears that we call chk_snd() from the recv callback and then try to call it again in update_conn(). It is very likely that this last function will progressively slip into the recv/send callbacks in order to avoid duplicate check code. The code works right now with and without splicing. Only raw_sock provides support for it and it is automatically selected when the various splice options are set. However it looks like splice-auto doesn't enable it, which possibly means that the streamer detection code does not work anymore, or that it's only called at a time where it's too late to enable splicing (in process_session).	2012-09-03 20:47:31 +02:00
Willy Tarreau	5368d80ede	MAJOR: connection: split the send call into connection and stream interface Similar to what was done on the receive path, the data layer now provides only an snd_buf() callback that is iterated over by the stream interface's si_conn_send_loop() function. The data layer now has no knowledge about channels nor stream interfaces. The splice() code still need to be ported as it currently is disabled.	2012-09-03 20:47:31 +02:00
Willy Tarreau	ce323dea14	REORG: stream-interface: move sock_raw_read() to si_conn_recv_cb() The recv function is now generic and is usable to iterate any connection-to-buf reading function from a stream interface. So let's move it to stream-interface.	2012-09-03 20:47:30 +02:00
Willy Tarreau	1fe6bc335a	MINOR: stream-interface: add an rcv_buf callback to sock_ops This one is to be used by the read I/O handlers.	2012-09-03 20:47:30 +02:00
Willy Tarreau	2ba4465086	MAJOR: raw_sock: extract raw_sock_to_buf() from raw_sock_read() This is the start of the stream connection iterator which calls the data-layer reader. This still looks a bit tricky but is OK. Splicing is not handled at all at the moment.	2012-09-03 20:47:30 +02:00
Willy Tarreau	75bf2c925f	REORG: sock_raw: rename the files raw_sock* The "raw_sock" prefix will be more convenient for naming functions as it will be prefixed with the data layer and suffixed with the data direction. So let's rename the files now to avoid any further confusion. The #include directive was also removed from a number of files which do not need it anymore.	2012-09-02 21:54:56 +02:00
Willy Tarreau	3af56a9359	MINOR: connection: provide conn_{data\|sock}_{read0\|shutw} functions These functions are used to report unidirectional shutdown and to disable polling in the related direction.	2012-09-02 21:54:56 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	9bf9c14c12	MEDIUM: stream-interface: provide a generic stream_sock_read0() function This function is used by the data layer when a zero has been read over a connection. At the moment it only handles sockets and nothing else. Once the complete split is done between buffers and stream interfaces, it should become possible to work regardless on the connection type.	2012-09-02 21:54:55 +02:00
Willy Tarreau	eecf6ca68a	MEDIUM: stream-interface: provide a generic si_conn_send_cb callback The connection send() callback is supposed to be generic for a stream-interface, and consists in calling the lower layer snd_buf function. Move this function to the stream interface and remove the sock-raw and sock-ssl clones.	2012-09-02 21:54:55 +02:00
Willy Tarreau	de5722c302	MEDIUM: stream-interface: provide a generic stream_int_chk_snd_conn() function This one can be used by both sock_raw and sock_ssl instead of each having their own.	2012-09-02 21:54:55 +02:00
Willy Tarreau	fae4499e36	MEDIUM: stream-interface: add a snd_buf() callback to sock_ops This callback is used to send data from the buffer to the socket. It is the old write_loop() call of the data layer which is used both by the ->write() callback and the ->chk_snd() function. The reason for having it as a pointer is that it's the only remaining part which causes the write and chk_snd() functions to be different between raw and ssl.	2012-09-02 21:54:18 +02:00
Willy Tarreau	46a8d925c2	MEDIUM: stream-interface: offer a generic chk_rcv function for connections sock_raw and sock_ssl use a pretty generic chk_rcv function, so let's move this function to the stream_interface and remove specific functions. Later we might have a single chk_rcv function.	2012-09-02 21:54:18 +02:00
Willy Tarreau	100c467120	MEDIUM: stream_interface: offer a generic function for connection updates We need to have a generic function to be called by upper layers when buffer flags have been updated (the si->update function). At the moment, both sock_raw and sock_ssl had their own which basically was a copy-paste. Since these functions are only used to update stream interface flags, it is logical to have them handled by the stream interface code. This allowed us to remove the stream_interface-specific update function from sock_raw and sock_ssl which now use the generic code. The stream_sock_update_conn callback has also been more appropriately renamed conn_notify_si() since it's meant to be called by lower layers to notify the SI and possibly upper layers about incoming changes.	2012-09-02 21:54:18 +02:00
Willy Tarreau	26f44d1e91	MINOR: fd: get rid of FD_WAIT_* These flags were used to ease a transition which has been completed, so they're not needed anymore. Get rid of them.	2012-09-02 21:53:12 +02:00
Willy Tarreau	afad0e0f80	MAJOR: make use of conn_{data\|sock}_{poll\|stop\|want}* in connection handlers This is a second attempt at getting rid of FD_WAIT_. Now the situation is much better since native I/O handlers can directly manipulate the FD using fd_{poll\|want\|stop}_ and the connection handlers manipulate connection-level flags using the conn_{data\|sock}_* equivalent. Proceeding this way ensures that the connection flags always reflect the reality even after data<->handshake switches.	2012-09-02 21:53:12 +02:00
Willy Tarreau	f9dabecd03	MEDIUM: connection: make use of the new polling functions Now the connection handler, the handshake callbacks and the I/O callbacks make use of the connection-layer polling functions to enable or disable polling on a file descriptor. Some changes still need to be done to avoid using the FD_WAIT_* constants.	2012-09-02 21:53:11 +02:00
Willy Tarreau	b5e2cbdcc8	MEDIUM: connection: add definitions for dual polling mechanisms The conflicts we're facing with polling is that handshake handlers have precedence over data handlers and may change the polling requirements regardless of what is expected by the data layer. This causes issues such as missed events. The real need is to have three polling levels : - the "current" one, which is effective at any moment - the data one, which reflects what the data layer asks for - the sock one, which reflects what the socket layer asks for Depending on whether a handshake is in progress or not, either one of the last two will replace the current one, and the change will be propagated to the lower layers. At the moment, the shutdown status is not considered, and only handshakes are used to decide which layer to chose. This will probably change.	2012-09-02 21:53:11 +02:00
Willy Tarreau	babd05a6c6	MEDIUM: fd: add fd_poll_{recv,send} for use when explicit polling is required The old EV_FD_SET() macro was confusing, as it would enable receipt but there was no way to indicate that EAGAIN was received, hence the recently added FD_WAIT_* flags. They're not enough as we're still facing a conflict between EV_FD_* and FD_WAIT_*. So let's offer I/O functions what they need to explicitly request polling.	2012-09-02 21:53:11 +02:00
Willy Tarreau	49b046dddf	MAJOR: fd: replace all EV_FD_* macros with new fd__ inline calls These functions have a more explicity meaning and will offer provisions for explicit polling. EV_FD_ISSET() has been left for now as it is still in use in checks.	2012-09-02 21:53:11 +02:00
Willy Tarreau	4a36b56909	MAJOR: stream_int: use a common stream_int_shut() functions regardless of the data layer Up to now, we had to use a shutr/shutw interface per data layer, which basically means 3 distinct functions when we include SSL : - generic stream_interface - sock_raw - sock_ssl With this change, the code located in the stream_interface manages all the stream_interface and buffer updates, and calls the data layer hooks when needed. At the moment, the socket layer hook had been implicitly considered as being a regular socket, so the si_shut() functions call the normal shutdown() and EV_FD_CLR() functions on the fd if a socket layer is defined. This may change in the future. The stream_int_shut*() functions don't call EV_FD_CLR() so that they can later be embedded in lower layers. Thus, the si->data->shutr() is not called anymore and si->data->shutw() is called to close the data layer only (eg: only for SSL). Proceeding like this is very important because it's the only way to be able not to rely on these functions when called from the connection handlers, and call the data layers' instead.	2012-09-02 21:53:10 +02:00
Willy Tarreau	8b117082bc	REORG: connection: replace si_data_close() with conn_data_close() This close function only applies to connection-specific parts and the stream-interface entry may soon disappear. Move this to the connection instead.	2012-09-02 21:53:10 +02:00
Willy Tarreau	3788e4c874	MEDIUM: fd: remove the EV_FD_COND_* primitives These primitives were initially introduced so that callers were able to conditionally set/disable polling on a file descriptor and check in return what the state was. It's been long since we last had an "if" on this, and all pollers' functions were the same for cond_* and their systematic counter parts, except that this required a check and a specific return value that are not always necessary. So let's simplify the FD API by removing this now unused distinction and by making all specific functions return void.	2012-09-02 21:53:10 +02:00
Willy Tarreau	c76ae33bfc	MAJOR: connection: call data layer handshakes from the handler Handshakes is not called anymore from the data handlers, they're only called from the connection handler when their flag is set. Also, this move has uncovered an issue with the stream interface notifier : it doesn't consider the FD_WAIT_* flags possibly set by the handshake handlers. This will result in a stuck handshake when no data is in the output buffer. In order to cover this, for now we'll perform the EV_FD_SET in the SSL handshake function, but this needs to be addressed separately from the stream interface operations.	2012-09-02 21:53:09 +02:00
Willy Tarreau	8f8c92fe93	MAJOR: connection: add a new CO_FL_CONNECTED flag This new flag is used to indicate that the connection was already connected. It can be used by I/O handlers to know that a connection has just completed. It is used by stream_sock_update_conn(), allowing the sock_opt handlers not to manipulate the SI timeout nor the BF_WRITE_NULL flag anymore.	2012-09-02 21:53:09 +02:00
Willy Tarreau	239d7189fc	MEDIUM: stream_interface: pass connection instead of fd in sock_ops The sock_ops I/O callbacks made use of an FD till now. This has become inappropriate and the struct connection is much more useful. It also fixes the race condition introduced by previous change.	2012-09-02 21:53:08 +02:00
Willy Tarreau	fd31e53139	MAJOR: remove the stream interface and task management code from sock_* The socket data layer code must only focus on moving data between a socket and a buffer. We need a special stream interface handler to update the stream interface and the file descriptor status. At the moment the code works but suffers from a race condition caused by its API : the read/write callbacks still make use of the fd instead of using the connection. And when a double shutdown is performed, a call to ->write() after ->read() processed an error results in dereferencing a NULL fdtab[]->owner. This is only a temporary issue which doesn't need to be fixed now since this will automatically go away when the functions change to use the connection instead.	2012-09-02 21:53:08 +02:00
Willy Tarreau	076be25ab8	CLEANUP: remove the now unused fdtab direct I/O callbacks They were all left to NULL since last commit so we can safely remove them all now and remove the temporary dual polling logic in pollers.	2012-09-02 21:51:29 +02:00
Willy Tarreau	2da156fe5e	MAJOR: tcp: remove the specific I/O callbacks for TCP connection probes Use a single tcp_connect_probe() instead of tcp_connect_write() and tcp_connect_read(). We call this one only when no data layer function have been processed, so this is a fallback to test for completion of a connection attempt. With this done, we don't have the need for any direct I/O callback anymore. The function still relies on ->write() to wake the stream interface up, so it's not finished.	2012-09-02 21:51:29 +02:00
Willy Tarreau	2c6be84b3a	MEDIUM: connection: extract the send_proxy callback from proto_tcp This handshake handler must be independant, so move it away from proto_tcp. It has a dedicated connection flag. It is tested before I/O handlers and automatically removes the CO_FL_WAIT_L4_CONN flag upon success. It also sets the BF_WRITE_NULL flag on the stream interface and stops the SI timeout. However it does not perform the task_wakeup(), and relies on the data handler to do so for now. The SI wakeup will have to be moved elsewhere anyway.	2012-09-02 21:51:28 +02:00
Willy Tarreau	8018471f44	MINOR: fd: make fdtab->owner a connection and not a stream_interface anymore It is more convenient with a connection here and will abstract stream_interface more easily.	2012-09-02 21:51:28 +02:00
Willy Tarreau	59f98393bb	MINOR: connection: add a handler for fd-based connections This connection handler will be used as an I/O handler for events detected on a file descriptor. It is not used yet.	2012-09-02 21:51:28 +02:00
Willy Tarreau	4e6049e553	MINOR: fd: add a new I/O handler to fdtab This one will eventually replace both cb[] handlers. At the moment it is not used yet.	2012-09-02 21:51:27 +02:00
Willy Tarreau	505e34a36d	MAJOR: get rid of fdtab[].state and use connection->flags instead fdtab[].state was only used to know whether a connection was in progress or an error was encountered. Instead we now use connection->flags to store a flag for both. This way, connection management will be able to update the connection status on I/O.	2012-09-02 21:51:26 +02:00
Willy Tarreau	900bc93e24	MINOR: connection: add flags to the connection struct We're doing this to take over fdtab[].state.	2012-09-02 21:51:26 +02:00
Willy Tarreau	da92e2fb61	REORG/MINOR: checks: put a struct connection into the server This will be used to handle the connection state once it goes away from fdtab. There is no functional change at the moment.	2012-09-02 21:51:26 +02:00
Willy Tarreau	56e9c5e963	REORG/MINOR: connection: move declaration to its own include file This way we don't depend on stream_interface anymore.	2012-09-02 21:51:26 +02:00
Willy Tarreau	ed8f614078	REORG/MEDIUM: fd: get rid of FD_STLISTEN This state was only used so that ev_sepoll did not match FD_STERROR, which changed in previous patch. We can now safely remove this state.	2012-09-02 21:51:25 +02:00
Willy Tarreau	db3b32610f	REORG/MEDIUM: fd: remove FD_STCLOSE from struct fdtab In an attempt to get rid of fdtab[].state, and to move the relevant parts to the connection struct, we remove the FD_STCLOSE state which can easily be deduced from the <owner> pointer as there is a 1:1 match.	2012-09-02 21:51:25 +02:00
Jamie Gloudon	801a0a353a	DOC: fix name for "option independant-streams" The correct spelling is "independent", not "independant". This patch fixes the doc and the configuration parser to accept the correct form. The config parser still allows the old naming for backwards compatibility.	2012-09-02 21:51:07 +02:00
Willy Tarreau	654694e189	MEDIUM: stats/cli: add support for "set table key" to enter values This is used to enter values for stick tables. The most likely usage is to set gpc0 for a specific IP address in order to block traffic for abusers without having to reload. Since all data types are supported, other usages are possible (eg: replace a users's assigned server).	2012-09-02 21:51:07 +02:00
Willy Tarreau	c3a08a136b	BUG: stktable: tcp_src_to_stktable_key() must return NULL on invalid families Source addresses of non-TCP families were not correctly handled by tcp_src_to_stktable_key() as it forgot to return NULL and instead left the previous value in the stick-table buffer. This bug is 1.5-specific and was introduced by commit `4f92d320` in 1.5-dev6 so it does not need any backport.	2012-08-31 11:03:30 +02:00
David du Colombier	65c1796c4a	MINOR: IPv6 support for transparent proxy Set socket option IPV6_TRANSPARENT on binding to enable transparent proxy on IPv6. This option is available from Linux 2.6.37.	2012-07-31 07:53:42 +02:00
Willy Tarreau	96596aeead	MEDIUM: fd/si: move peeraddr from struct fdinfo to struct connection The destination address is purely a connection thing and not an fd thing. It's also likely that later the address will be stored into the connection and linked to by the SI. struct fdinfo only keeps the pointer to the port range and the local port for now. All of this also needs to move to the connection but before this the release of the port range must move from fd_delete() to a new function dedicated to the connection.	2012-06-08 22:59:52 +02:00
Willy Tarreau	4f8a83cb6e	MEDIUM: stats: add the ability to kill sessions from the admin interface It was not possible to kill remaining sessions from the admin interface, which is annoying especially when switching to maintenance mode. Now it's possible.	2012-06-04 00:26:23 +02:00
Willy Tarreau	d72822442d	MEDIUM: stats: add support for soft stop/soft start in the admin interface One important missing feature on the web interface is the ability to perform a soft stop/soft start. This is now possible.	2012-06-04 00:22:44 +02:00
Justin Karneges	eb2c24ae2a	MINOR: checks: add on-marked-up option This implements the feature discussed in the earlier thread of killing connections on backup servers when a non-backup server comes back up. For example, you can use this to route to a mysql master & slave and ensure clients don't stay on the slave after the master goes from down->up. I've done some minimal testing and it seems to work. [WT: added session flag & doc, moved the killing after logging the server UP, and ensured that the new server is really usable]	2012-06-03 23:48:42 +02:00
Willy Tarreau	496aa0111e	BUG/MEDIUM: ensure that unresolved arguments are freed exactly once When passing arguments to ACLs and samples, some types are stored as strings then resolved later after config parsing is done. Upon exit, the arguments need to be freed only if the string was not resolved yet. At the moment we can encounter double free during deinit() because some arguments (eg: userlists) are freed once as their own type and once as a string. The solution consists in adding an "unresolved" flag to the args to say whether the value is still held in the <str> part or is final. This could be debugged thanks to a useful bug report from Sander Klein.	2012-06-01 10:40:52 +02:00
Willy Tarreau	4992dd2d30	MINOR: http: add support for "httponly" and "secure" cookie attributes httponly This option tells haproxy to add an "HttpOnly" cookie attribute when a cookie is inserted. This attribute is used so that a user agent doesn't share the cookie with non-HTTP components. Please check RFC6265 for more information on this attribute. secure This option tells haproxy to add a "Secure" cookie attribute when a cookie is inserted. This attribute is used so that a user agent never emits this cookie over non-secure channels, which means that a cookie learned with this flag will be presented only over SSL/TLS connections. Please check RFC6265 for more information on this attribute.	2012-05-31 21:02:17 +02:00
Willy Tarreau	b5ba17e3a9	BUG/MINOR: config: do not report twice the incompatibility between cookie and non-http This one was already taken care of in proxy_cfg_ensure_no_http(), so if a cookie is presented in a TCP backend, we got two warnings. This can be backported to 1.4 since it's been this way for 2 years (although not dramatic).	2012-05-31 20:47:00 +02:00
Willy Tarreau	674021329c	REORG/MINOR: use dedicated proxy flags for the cookie handling Cookies were mixed with many other options while they're not used as options. Move them to a dedicated bitmask (ck_opts). This has released 7 flags in the proxy options and leaves some room for new proxy flags.	2012-05-31 20:40:20 +02:00
Willy Tarreau	196729eff8	BUG/MINOR: fix option httplog validation with TCP frontends Option httplog needs to be checked only once the proxy has been validated, so that its final mode (tcp/http) can be used. Also we need to check for httplog before checking the log format, so that we can report a warning about this specific option and not about the format it implies.	2012-05-31 19:30:26 +02:00
Willy Tarreau	ab152a7eda	BUG/MAJOR: b_rew() must pass a signed offset to b_ptr() Commit `13e66da` introduced b_rew() but passes -adv which is an unsigned quantity on 64-bit platforms, causing the buffer to advance in the wrong direction. No backport is needed.	2012-05-31 11:33:42 +02:00
Oskar Stolc	8dc4184c57	MINOR: balance uri: added 'whole' parameter to include query string in hash calculation This patch brings a new "whole" parameter to "balance uri" which makes the hash work over the whole uri, not just the part before the query string. Len and depth parameter are still honnored. The reason for this new feature is explained below. I have 3 backend servers, each accepting different form of HTTP queries: http://backend1.server.tld/service1.php?q=... http://backend1.server.tld/service2.php?q=... http://backend2.server.tld/index.php?query=...&subquery=... http://backend3.server.tld/image/49b8c0d9ff Each backend server returns a different response based on either: - the URI path (the left part of the URI before the question mark) - the query string (the right part of the URI after the question mark) - or the combination of both I wanted to set up a common caching cluster (using 6 Squid servers, each configured as reverse proxy for those 3 backends) and have HAProxy balance the queries among the Squid servers based on URL. I also wanted to achieve hight cache hit ration on each Squid server and send the same queries to the same Squid servers. Initially I was considering using the 'balance uri' algorithm, but that would not work as in case of backend2 all queries would go to only one Squid server. The 'balance url_param' would not work either as it would send the backend3 queries to only one Squid server. So I thought the simplest solution would be to use 'balance uri', but to calculate the hash based on the whole URI (URI path + query string), instead of just the URI path.	2012-05-22 07:56:54 +02:00
Emeric Brun	d88fd824b7	MEDIUM: protocol: add a pointer to struct sock_ops to the listener struct The listener struct is now aware of the socket layer to use upon accept(). At the moment, only sock_raw is supported so this patch should not change anything.	2012-05-21 22:22:39 +02:00
Emeric Brun	21adb02d19	MINOR: stream_interface: add a pointer to the listener for TARG_TYPE_CLIENT When the target is a client, it will be convenient to have a pointer to the original listener so that we can retrieve some configuration information at the stream interface level.	2012-05-21 22:22:39 +02:00
Willy Tarreau	24208275d5	MINOR: stream_interface: add a data channel close function This function will be called later when splitting the shutdown in two steps. It will be needed by SSL and for remote socket operations to release unused contexts.	2012-05-21 17:59:53 +02:00
Willy Tarreau	949811319b	REORG/MEDIUM: stream_interface: move applet->state and private to connection The state and the private pointer are not specific to the applets, since SSL will require exactly both of them. Move them to the connection layer now and rename them. We also now ensure that both are NULL on first call.	2012-05-21 17:09:48 +02:00
Willy Tarreau	fb7508aefb	REORG/MINOR: stream_interface: move si->fd to struct connection The socket fd is used only when in socket mode and with a connection.	2012-05-21 16:47:54 +02:00
Willy Tarreau	73b013b070	MINOR: stream_interface: introduce a new "struct connection" type We start to move everything needed to manage a connection to a special entity "struct connection". We have the data layer operations and the control operations there. We'll also have more info in the future such as file descriptors and applet contexts, so that in the end it becomes detachable from the stream interface, which will allow connections to be reused between sessions. For now on, we start with minimal changes.	2012-05-21 16:31:45 +02:00
Willy Tarreau	9580d16e40	BUG/MAJOR: checks: don't call set_server_status_* when no LB algo is set David Touzeau reported that haproxy dies when a server is checked and is used in a farm with only "option transparent" and no LB algo. This is because the LB params are NULL, the functions should be checked before being called. The same bug is present in 1.4 so this patch must be backported.	2012-05-19 19:09:46 +02:00
Willy Tarreau	2692736aa3	MEDIUM: http: get rid of msg->som which is not used anymore msg->som was zero before the body and was used to carry the beginning of a chunk size for chunked-encoded messages, at a moment when msg->sol is always zero. Remove msg->som and replace it with msg->sol where needed.	2012-05-18 23:50:43 +02:00
Willy Tarreau	09d1e254c9	MAJOR: http: stop using msg->sol outside the parsers This is a left-over from the buffer changes. Msg->sol is always null at the end of the parsing, so we must not use it anymore to read headers or find the beginning of a message. As a side effect, the dump of the request in debug mode is working again because it was relying on msg->sol not being null. Maybe it will even be mergeable with another of the message pointers.	2012-05-18 22:43:55 +02:00
Willy Tarreau	13e66dad26	MINOR: buffers: add a rewind function b_rew() will be used to rewind a buffer for certain specific operations such as header inspection on data already in the output queue.	2012-05-18 22:11:27 +02:00
Willy Tarreau	be0688c64d	MEDIUM: stream_interface: remove the si->init Calling the init() function in sess_establish was a bad idea, it is too late to allow it to fail on lack of resource and does not help at all. Remove it for now before it's used.	2012-05-18 15:15:26 +02:00
David du Colombier	7af4605ef7	BUG/MAJOR: trash must always be the size of a buffer Before it was possible to resize the buffers using global.tune.bufsize, the trash has always been the size of a buffer by design. Unfortunately, the recent buffer sizing at runtime forgot to adjust the trash, resulting in it being too short for content rewriting if buffers were enlarged from the default value. The bug was encountered in 1.4 so the fix must be backported there.	2012-05-16 14:21:55 +02:00
Willy Tarreau	7bb68abb9f	OPTIM/MEDIUM: stream_interface: add a new SI_FL_NOHALF flag This flag indicates that we're not interested in keeping half-open connections on a stream interface. It has the benefit of allowing the socket layer to cause an immediate write close when detecting an incoming read close. This releases resources much faster and saves one syscall (either a shutdown or setsockopt). This flag is only set by HTTP on the interface going to the server since we don't want to continue pushing data there when it has closed. Another benefit is that it responds with a FIN to a server's FIN instead of responding with an RST as it used to, which is much cleaner. Performance gains of 7.5% have been measured on HTTP connection rate on empty objects.	2012-05-13 14:52:22 +02:00
Willy Tarreau	b147a8382a	CLEANUP: fd: remove unused cb->b pointers in the struct fdtab These pointers were used to hold pointers to buffers in the past, but since we introduced the stream interface, they're no longer used but they were still sometimes set. Removing them shrink the struct fdtab from 32 to 24 bytes on 32-bit machines, and from 52 to 36 bytes on 64-bit machines, which is a significant saving. A quick tests shows a steady 0.5% performance gain, probably due to the better cache efficiency.	2012-05-13 00:35:44 +02:00
Willy Tarreau	3d2f16f3c3	MINOR: standard: add a new debug macro : fddebug() This macro is usable like printf but sends messages to fd #-1, which has no visible effect but is easy to spot in strace. This is very useful to put tracers at many points during debugging sessions.	2012-05-13 00:21:17 +02:00
Willy Tarreau	ce887fd3b2	MEDIUM: session: add support for tunnel timeouts Tunnel timeouts are used when TCP connections are forwarded, or when forwarding upgraded HTTP connections (WebSocket) as well as CONNECT requests to proxies. This timeout allows long-lived sessions to be supported without having to set large timeouts to normal requests.	2012-05-12 12:50:00 +02:00
Willy Tarreau	d02394b5a1	MEDIUM: stream_interface: derive the socket operations from the target Instead of hard-coding sock_raw in connect_server(), we set this socket operation at config parsing time. Right now, only servers and peers have it. Proxies are still hard-coded as sock_raw. This will be needed for future work on SSL which requires a different socket layer.	2012-05-11 18:52:14 +02:00
Willy Tarreau	64798bd720	MINOR: stream_interface: add an init callback to sock_ops This will be needed for some socket layers such as SSL. It's not used at the moment.	2012-05-11 18:39:26 +02:00
Willy Tarreau	f873d754f8	CLEANUP: stream_interface: stop exporting socket layer functions Similarly to the previous patch, we don't need the socket-layer functions outside of stream_interface. They could even move to a file dedicated to applets, though that does not seem particularly useful at the moment.	2012-05-11 17:47:17 +02:00
Willy Tarreau	b277d6e568	CLEANUP: sock_raw: remove last references to stream_sock We also stop exporting all functions since they're not needed anymore outside of sock_raw.c.	2012-05-11 17:03:42 +02:00
Willy Tarreau	59b9479667	BUG/MEDIUM: stream_interface: restore get_src/get_dst Commit e164e7a removed get_src/get_dst setting in the stream interfaces but forgot to set it in proto_tcp. Get the feature back because we need it for logging, transparent mode, ACLs etc... We now rely on the stream interface direction to know what syscall to use. One benefit of doing it this way is that we don't use getsockopt() anymore on outgoing stream interfaces nor on UNIX sockets.	2012-05-11 16:48:10 +02:00
Willy Tarreau	1539a01645	MINOR: stream_interface: add a client target : TARG_TYPE_CLIENT This one will be used to identify the direction the SI is being used. All incoming connections have a target of type TARG_TYPE_CLIENT.	2012-05-11 14:47:34 +02:00
Willy Tarreau	c63190d429	REORG: use the name sock_raw instead of stream_sock We'll soon have an SSL socket layer, and in order to ease the difference between the two, we use the name "sock_raw" to designate the one which directly talks to the sockets without any conversion.	2012-05-11 14:23:52 +02:00
Willy Tarreau	a7fe8e527c	MINOR: http: replace http_message_realign() with buffer_slow_realign() There is no more reason for the realign function being HTTP specific, it only operates on a buffer now. Let's move it to buffers.c instead. It's likely that buffer_bounce_realign is broken (not used), this will have to be inspected. The function is worth rewriting as it can be cheaper than buffer_slow_realign() to realign large wrapping buffers.	2012-05-08 21:28:17 +02:00
Willy Tarreau	0a3dd74c9c	MEDIUM: cfgparse: use the new error reporting framework for remaining cfg_keywords All keywords registered using a cfg_kw_list now make use of the new error reporting framework. This allows easier and more precise error reporting without having to deal with complex buffer allocation issues.	2012-05-08 21:28:17 +02:00
Willy Tarreau	a93c74be5c	MEDIUM: cfgparse: make backend_parse_balance() use memprintf to report errors Using the new error reporting framework makes it easier to report complex errors.	2012-05-08 21:28:17 +02:00
Willy Tarreau	6e0644339f	MEDIUM: memory: add the ability to poison memory at run time From time to time, some bugs are discovered that are caused by non-initialized memory areas. It happens that most platforms return a zero-filled area upon first malloc() thus hiding potential bugs. This patch also replaces malloc() in pools with calloc() to ensure that all platforms exhibit the same behaviour upon startup. In order to catch these bugs more easily, add a -dM command line flag to enable memory poisonning. Optionally, passing -dM<byte> forces the poisonning byte to <byte>.	2012-05-08 21:28:16 +02:00
Willy Tarreau	d04b1bce69	MEDIUM: http: improve error capture reports A number of important information were missing from the error captures, so let's improve them. Now we also log source port, session flags, transaction flags, message flags, pending output bytes, expected buffer wrapping position, total bytes transferred, message chunk length, and message body length. As such, the output format has slightly evolved and the source address moved to the third line : [08/May/2012:11:14:36.341] frontend echo (#1): invalid request backend echo (#1), server <NONE> (#-1), event #1 src 127.0.0.1:40616, session #4, session flags 0x00000000 HTTP msg state 26, msg flags 0x00000000, tx flags 0x00000000 HTTP chunk len 0 bytes, HTTP body len 0 bytes buffer flags 0x00909002, out 0 bytes, total 28 bytes pending 28 bytes, wrapping at 8030, error at position 7: 00000 GET / /?t=20000 HTTP/1.1\r\n 00026 \r\n [08/May/2012:11:13:13.426] backend echo (#1) : invalid response frontend echo (#1), server local (#1), event #0 src 127.0.0.1:40615, session #1, session flags 0x0000044e HTTP msg state 32, msg flags 0x0000000e, tx flags 0x08200000 HTTP chunk len 0 bytes, HTTP body len 20 bytes buffer flags 0x00008002, out 81 bytes, total 92 bytes pending 11 bytes, wrapping at 7949, error at position 9: 00000 Foo: bar\r\r\n	2012-05-08 21:28:16 +02:00
Willy Tarreau	bbebbbff83	REORG/MEDIUM: move the default accept function from sockstream to protocols.c The previous sockstream_accept() function uses nothing from sockstream, and is totally irrelevant to stream interfaces. Move this to the protocols.c file which handles listeners and protocols, and call it listener_accept(). It now makes much more sense that the code dealing with listen() also handles accept() and passes it to upper layers.	2012-05-08 21:28:15 +02:00
Willy Tarreau	26d8c59f0b	REORG/MEDIUM: replace stream interface protocol functions by a proto pointer The stream interface now makes use of the socket protocol pointer instead of the direct functions.	2012-05-08 21:28:15 +02:00
Willy Tarreau	5c979a9c71	REORG/MEDIUM: stream_interface: initialize socket ops from descriptors	2012-05-08 21:28:14 +02:00
Willy Tarreau	1b79bdee26	REORG/MEDIUM: move protocol->{read,write} to sock_ops The protocol must not set the read and write callbacks, they're specific to the socket layer. Move them to sock_ops instead.	2012-05-08 21:28:14 +02:00
Willy Tarreau	060781fb4a	REORG: stream_interface: create a struct sock_ops to hold socket operations These operators are used regardless of the socket protocol family. Move them to a "sock_ops" struct. ->read and ->write have been moved there too as they have no reason to remain at the protocol level.	2012-05-08 21:28:14 +02:00
Willy Tarreau	ceb4ac9c34	MEDIUM: acl: support IPv6 address matching Make use of the new IPv6 pattern type so that acl_match_ip() knows how to compare pattern and sample. IPv6 may be entered in their usual form, with or without a netmask appended. Only bit counts are accepted for IPv6 netmasks. In order to avoid any risk of trouble with randomly resolved IP addresses, host names are never allowed in IPv6 patterns. HAProxy is also able to match IPv4 addresses with IPv6 addresses in the following situations : - tested address is IPv4, pattern address is IPv4, the match applies in IPv4 using the supplied mask if any. - tested address is IPv6, pattern address is IPv6, the match applies in IPv6 using the supplied mask if any. - tested address is IPv6, pattern address is IPv4, the match applies in IPv4 using the pattern's mask if the IPv6 address matches with 2002:IPV4::, ::IPV4 or ::ffff:IPV4, otherwise it fails. - tested address is IPv4, pattern address is IPv6, the IPv4 address is first converted to IPv6 by prefixing ::ffff: in front of it, then the match is applied in IPv6 using the supplied IPv6 mask.	2012-05-08 21:28:14 +02:00
Willy Tarreau	6d20e28556	MINOR: standard: add an IPv6 parsing function (str62net) str62net returns an address and a netmask in number of bits.	2012-05-08 20:57:21 +02:00
Willy Tarreau	c92ddbc37d	MINOR: acl: add types to ACL patterns We cannot currently match IPv6 addresses in ACL simply because we don't support types on the patterns. Let's introduce this notion. For now, we rely on the SMP_TYPES though it doesn't seem like it will last forever given that some types are not present there (eg: regex, meth). Still it should be enough to support mixed matchings for most types. We use the special impossible value SMP_TYPES for types that don't exist in the SMP_T_* space.	2012-05-08 20:57:21 +02:00
Willy Tarreau	cd3b094618	REORG: rename "pattern" files They're now called "sample" everywhere to match their description.	2012-05-08 20:57:21 +02:00
Willy Tarreau	1278578487	REORG: use the name "sample" instead of "pattern" to designate extracted data This is mainly a massive renaming in the code to get it in line with the calling convention. Next patch will rename a few files to complete this operation.	2012-05-08 20:57:20 +02:00
Willy Tarreau	7dcb6480db	MEDIUM: acl: extend the pattern parsers to report meaningful errors By passing the error pointer to all ACL parsers, we can make them report useful errors and not simply fail.	2012-05-08 20:57:20 +02:00
Willy Tarreau	b7451bb660	MEDIUM: acl: report parsing errors to the caller All parsing errors were known but impossible to return. Now by making use of memprintf(), we're able to build meaningful error messages that the caller can display.	2012-05-08 20:57:20 +02:00
Willy Tarreau	185b5c4a7b	MEDIUM: http: merge acl and pattern header fetch functions HTTP header fetch is now done using smp_fetch_hdr() for both ACLs and patterns. This one also supports an occurrence number, making it possible to specify explicit occurrences for ACLs and patterns.	2012-05-08 20:57:19 +02:00
Willy Tarreau	ae52f06da3	MINOR: acl: add a val_args field to keywords This will make it possible to delegate argument validating to functions shared with smp_fetch_*.	2012-05-08 20:57:19 +02:00
Willy Tarreau	7a777edbdf	MINOR: acl: set SMP_OPT_ITERATE on fetch functions This way, fetch functions will be able to tell if they're called for a single request or as part of a loop. This is important for instance when we use hdr(foo), because in an ACL this means that all hdr(foo) occurrences must be checked while in a pattern it means only one of them (eg: last one).	2012-05-08 20:57:18 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	32389b7d04	MEDIUM: acl/pattern: switch rdp_cookie functions stack up-down Previously, both pattern, backend and persist_rdp_cookie would build fake ACL expressions to fetch an RDP cookie by calling acl_fetch_rdp_cookie(). Now we switch roles. The RDP cookie fetch function is provided as a sample fetch function that all others rely on, including ACL. The code is exactly the same, only the args handling moved from expr->args to args. The code was moved to proto_tcp.c, but probably that a dedicated file would be more suited to content handling.	2012-05-08 20:57:16 +02:00
Willy Tarreau	342acb4775	MEDIUM: pattern: integrate pattern_data into sample and use sample everywhere Now there is no more reference to union pattern_data. All pattern fetch and conversion functions now make use of the common sample type. Note: none of them adjust the type right now so it's important to do it next otherwise we would risk sharing such functions with ACLs and seeing them fail.	2012-05-08 20:57:15 +02:00
Willy Tarreau	b4a88f0672	MINOR: pattern: replace struct pattern with struct sample This change is pretty minor. Struct pattern is only used for pattern_process() now so changing it to use the common type is quite obvious. It's worth noting that the last argument of pattern_process() is never used so the function is self-sufficient. Note that pattern_process() does not initialize the pattern at all before calling fetch->process(), and that minimal initialization will be required when we later change the argument for the sample.	2012-05-08 20:57:15 +02:00
Willy Tarreau	21e5b0e3cb	MEDIUM: get rid of SMP_F_READ_ONLY and SMP_F_MUST_FREE These ones were either unused or improperly used. Some integers were marked read-only, which does not make much sense. Buffers are not read-only, they're "constant" in that they must be kept intact after any possible change.	2012-05-08 20:57:15 +02:00
Willy Tarreau	197e10aaae	MEDIUM: acl: get rid of the SET_RES flags We now simply rely on a boolean result from a fetch to declare a match. Booleans are not compared against patterns, they fix the result.	2012-05-08 20:57:15 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	16c31b00dc	MINOR: pattern: add a new 'sample' type to store fetched data The pattern type is ambiguous because a pattern is only a type and a data part, and is normally used to match against samples. Currently, patterns cannot hold information related to the life of the data which was extracted. We don't want to overload patterns either, so let's add a new "sample" type which will progressively supersede the acl_test and maybe the pattern at most places. The sample shares similar information with patterns and also has flags describing the data volatility and protection.	2012-05-08 20:57:13 +02:00
Willy Tarreau	8f7406e9b4	MEDIUM: acl: remove the ACL_TEST_F_NULL_MATCH flag This flag was used to force a boolean match even if there was no pattern to match. It was used only by http_auth() and designed only for this one. It's easier and cleaner to make the fetch function perform the test and report the boolean result as a few other functions already do. It simplifies the acl_exec_cond() logic and will help merging ACLs and patterns.	2012-05-08 20:57:13 +02:00
Willy Tarreau	21d68a6895	MEDIUM: pattern: add an argument validation callback to pattern descriptors This is used to validate that arguments are coherent. For instance, payload_lv expects that the last arg (if any) is not more negative than the sum of the first two. The error is reported if any.	2012-05-08 20:57:13 +02:00
Willy Tarreau	9fcb984b17	MEDIUM: pattern: use the standard arg parser We don't need the pattern-specific args parsers anymore, make use of the common parser instead. We still need to improve this by adding a validation function to report abnormal argument values or combinations. We don't report precise parsing errors yet but this was not previously done either.	2012-05-08 20:57:13 +02:00
Willy Tarreau	f995410355	MEDIUM: pattern: get rid of arg_i in all functions making use of arguments arg_i was almost unused, and since we migrated to use struct arg everywhere, the rare cases where arg_i was needed could be replaced by switching to arg->type = ARGT_STOP.	2012-05-08 20:57:12 +02:00
Willy Tarreau	ecfb8e8ff9	MEDIUM: pattern: replace type pattern_arg with type arg arg is more complete than pattern_arg since it also covers ACL args, so let's use this one instead.	2012-05-08 20:57:12 +02:00
Willy Tarreau	61612d49a7	MAJOR: acl: store the ACL argument types in the ACL keyword declaration The types and minimal number of ACL keyword arguments are now stored in their declaration. This will allow many more fantasies if some ACL use several arguments or types. Doing so required to rework all ACL keyword declarations to add two parameters. So this was a good opportunity for a general cleanup and to sort all entries in alphabetical order. We still have two pending issues : - parse_acl_expr() checks for errors but has no way to report them to the user ; - the types of some arguments are still not resolved and kept as strings (eg: ARGT_FE/BE/TAB) for compatibility reasons, which must be resolved in acl_find_targets()	2012-05-08 20:57:11 +02:00
Willy Tarreau	34db108423	MAJOR: acl: make use of the new argument parsing framework The ACL parser now uses the argument parser to build a typed argument list. Right now arguments are all strings and only one argument is supported since this is what ACLs currently support.	2012-05-08 20:57:11 +02:00
Willy Tarreau	2ac5718dbd	MEDIUM: add a new typed argument list parsing framework make_arg_list() builds an array of typed arguments with their values, that the caller describes how to parse. This will be used to support multiple arguments for ACLs and patterns, which is currently problematic and prevents ACLs and patterns from being merged. Up to 7 arguments types may be enumerated in a single 32-bit word, including their number of mandatory parts. At the moment, these files are not used yet, they're only built. Note that the 4-bit encoding for the type has left only one unused type!	2012-05-08 20:57:10 +02:00
Willy Tarreau	9dab5fc4d4	MEDIUM: buffers: rename a number of buffer management functions The following renaming took place : 1) buffer input functions buffer_put_block => bi_putblk buffer_put_char => bi_putchr buffer_put_string => bi_putstr buffer_put_chunk => bi_putchk buffer_feed => bi_putstr buffer_feed_chunk => bi_putchk buffer_cut_tail => bi_erase buffer_ignore => bi_fast_delete 2) buffer output functions buffer_get_char => bo_getchr buffer_get_line => bo_getline buffer_get_block => bo_getblk buffer_skip => bo_skip buffer_write => bo_inject 3) buffer input avail/full functions were introduced : bi_avail bi_full	2012-05-08 20:56:56 +02:00
Willy Tarreau	328582c3f9	MEDIUM: buffers: implement b_adv() to advance a buffer's pointer This is more convenient and efficient than buf->p = b_ptr(buf, n); It simply advances the buffer's pointer by <n> and trasfers that amount of bytes from <in> to <out>. The BF_OUT_EMPTY flag is updated accordingly. A few occurrences of such computations in buffers.c and stream_sock.c were updated to use b_adv(), which resulted in a small code shrink.	2012-05-08 12:28:14 +02:00
Willy Tarreau	cc5cfcbcce	MEDIUM: buffers: add new pointer wrappers and get rid of almost all buffer_wrap_add calls buffer_wrap_add was convenient for the migration but is not handy at all. Let's have new wrappers that report input begin/end and output begin/end instead. It looks like we'll also need a b_adv(ofs) to advance a buffer's pointer.	2012-05-08 12:28:14 +02:00
Willy Tarreau	ec1bc82a1d	MEDIUM: buffers: fix unsafe use of buffer_ignore at some places buffer_ignore may only be used when the output of a buffer is empty, but it's not granted it is always the case when sending HTTP error responses. Better use buffer_cut_tail() instead, and use buffer_ignore only on non-wrapping data.	2012-05-08 12:28:14 +02:00
Willy Tarreau	8a0cef2dad	MEDIUM: http: remove buffer arg in http_capture_bad_message The buffer pointer is now taken from the http_msg.	2012-05-08 12:28:13 +02:00
Willy Tarreau	45c0d98769	MEDIUM: http: http_send_name_header: remove references to msg and buffer They can be deduced from txn.	2012-05-08 12:28:12 +02:00
Willy Tarreau	3a215bedba	MAJOR: http: make http_msg->sol relative to buffer's origin msg->sol is now a relative pointer just like all other ones. There is no more absolute references to the buffer outside the struct buffer itself. Next two cleanups should include removing buffer references to functions which already have an msg, and removal of wrapping detection in request and response parsing which cannot wrap by definition.	2012-05-08 12:28:12 +02:00
Willy Tarreau	62f791ea6f	MEDIUM: http: add a pointer to the buffer in http_msg ACLs and patterns only rely on a struct http_msg and don't know the pointer to the actual data. struct http_msg will soon only hold relative references so that's not possible. We need http_msg to hold a reference to the struct buffer before having relative pointers everywhere. It is likely that doing so will also result in opportunities to simplify a number of functions arguments. The following functions are already candidate : http_buffer_heavy_realign http_capture_bad_message http_change_connection_header http_forward_trailers http_header_add_tail http_header_add_tail2 http_msg_analyzer http_parse_chunk_size http_parse_connection_header http_remove_header2 http_send_name_header http_skip_chunk_crlf http_upgrade_v09_to_v10	2012-05-08 12:28:12 +02:00
Willy Tarreau	12e48b36dd	MAJOR: http: turn http_msg->eol to a buffer-relative offset It was an absolute pointer to the buffer's data, now it's a pointer relative to the buffer's origin.	2012-05-08 12:28:12 +02:00
Willy Tarreau	fa4a03ca08	CLEANUP: http: remove unused http_msg->col The <col> element of the struct http_msg has not been used for a long time now, remove it.	2012-05-08 12:28:11 +02:00
Willy Tarreau	ea1175a687	MAJOR: http: change msg->{som,col,sov,eoh} to be relative to buffer origin These offsets were relative to the buffer itself. Now they're relative to the buffer's origin (buf->p) which normally corresponds to the start of current message. This saves a big dependency between the HTTP message struct and the buffers. It appeared during this change that ->col is not used anymore (it will have to be removed). Next step is to turn ->eol and ->sol from absolute to relative.	2012-05-08 12:28:11 +02:00
Willy Tarreau	a458b67965	MAJOR: http: move buffer->lr to http_msg->next The buffer's pointer <lr> was only used by HTTP parsers which also use a struct http_msg to keep track of the parser's state. We've reached a point where it makes no sense to keep ->lr in the buffer, as the split between buffer and msg is only arbitrary for historical reasons. This change ensures that touching buffers will not impact HTTP messages anymore, making the buffers more content-agnostic. However, it becomes very important not to forget to update msg->next when some data get forwarded or moved (and in general each time buf->p is updated). The new pointer in http_msg becomes relative to buffer->p so that parsing multiple messages becomes easier. It is possible that at one point ->som and ->next will be merged. Note: http_parse_reqline() and http_parse_stsline() have been temporarily modified to know the message starting point in the buffer (->p).	2012-05-08 12:28:11 +02:00
Willy Tarreau	363a5bb152	MAJOR: buffers: replace buf->r with buf->p + buf->i This change gets rid of buf->r which is always equal to buf->p + buf->i. It removed some wrapping detection at a number of places, but required addition of new relative offset computations at other locations. A large number of places can be simplified now with extreme care, since most of the time, either the pointer has to be computed once or we need a difference between the old ->w and old ->r to compute free space. The cleanup will probably happen with the rewrite of the buffer_input_* and buffer_output_* functions anyway. buf->lr still has to move to the struct http_msg and be relative to buf->p for the rework to be complete.	2012-05-08 12:28:11 +02:00
Willy Tarreau	89fa706d39	MAJOR: buffers: replace buf->w with buf->p - buf->o This change introduces the buffer's base pointer, which is the limit between incoming and outgoing data. It's the point where the parsing should start from. A number of computations have already been greatly simplified, but more simplifications are expected to come from the removal of buf->r. The changes appear good and have revealed occasional improper use of some pointers. It is possible that this patch has introduced bugs or revealed some, although preliminary testings tend to indicate that everything still works as it should.	2012-05-08 12:28:10 +02:00
Willy Tarreau	3f7ff1406c	MINOR: buffers: remove unused function buffer_contig_data() This one was never used and is buggy. It will be easier to rewrite it when the buffer rework is complete.	2012-05-08 12:28:10 +02:00
Willy Tarreau	7fd758bbcf	MINOR: buffers: provide simple pointer normalization functions Add buffer_wrap_sub() and buffer_wrap_add() to normalize buffer pointers after an addition or subtract.	2012-05-08 12:28:10 +02:00
Willy Tarreau	02d6cfc1d7	MAJOR: buffer: replace buf->l with buf->{o+i} We don't have buf->l anymore. We have buf->i for pending data and the total length is retrieved by adding buf->o. Some computation already become simpler. Despite extreme care, bugs are not excluded. It's worth noting that msg->err_pos as set by HTTP request/response analysers becomes relative to pending data and not to the beginning of the buffer. This has not been completed yet so differences might occur when outgoing data are left in the buffer.	2012-05-08 12:28:10 +02:00
Willy Tarreau	2e046c6017	MAJOR: buffer rework: replace ->send_max with ->o This is the first minor step of the buffer rework. It's only renaming, it should have no impact.	2012-04-30 11:57:00 +02:00
Willy Tarreau	a36fc4d7ed	MEDIUM: move message-related flags from transaction to message Too many flags are stored in the transaction structure. Some flags are clearly message-specific and exist in two versions (request and response). Move them to a new "flags" field in the http_message struct instead.	2012-04-30 11:57:00 +02:00
Willy Tarreau	9a7bea52b1	MINOR: standard: add a memprintf() function to build formatted error messages memprintf() is just like snprintf() except that it always returns a properly sized allocated string that the caller is responsible for freeing. NULL is returned on serious errors. It also supports stackable calls over the same pointer since it offers support for automatically freeing a previous one : memprintf(&err, "invalid argument: '%s'", arg); ... memprintf(&err, "keyword parser said: <%s>", err); ... memprintf(&err, "line parser said: %s\n", err); ... free(*err);	2012-04-30 11:55:35 +02:00
Willy Tarreau	3fb818c014	BUILD: http: make extract_cookie_value() return an int not size_t It's very annoying that we have to deal with the crappy size_t and with ints at some places because these ones don't mix well. Patch 6f61b2 changed the chunk len to int but its size remains size_t and some functions are having trouble being used by several callers depending on the type of their arguments. Let's turn extract_cookie_value() to int for now on, and plan a massive cleanup later to remove all size_t.	2012-04-30 00:19:28 +02:00
Willy Tarreau	9b061e3320	MEDIUM: stream_sock: add a get_src and get_dst callback and remove SN_FRT_ADDR_SET These callbacks are used to retrieve the source and destination address of a socket. The address flags are not hold on the stream interface and not on the session anymore. The addresses are collected when needed. This still needs to be improved to store the IP and port separately so that it is not needed to perform a getsockname() when only the IP address is desired for outgoing traffic.	2012-04-07 18:03:52 +02:00
William Lallemand	5e19a2866f	MINOR: log: log-format: usable without httplog and tcplog Options httplog and tcplog aren't mandatory anymore for the log-format. The LW_ flags are now set during the log-format string parsing.	2012-04-07 16:25:26 +02:00
William Lallemand	a73203e3dc	MEDIUM: log: Unique ID The Unique ID, is an ID generated with several informations. You can use a log-format string to customize it, with the "unique-id-format" keyword, and insert it in the request header, with the "unique-id-header" keyword.	2012-04-07 16:25:26 +02:00
William Lallemand	5f2324019d	MEDIUM: log: New format-log flags: %Fi %Fp %Si %Sp %Ts %rt %H %pid %Fi: Frontend IP %Fp: Frontend Port %Si: Server IP %Sp: Server Port %Ts: Timestamp %rt: HTTP request counter %H: hostname %pid: PID +X: Hexadecimal represenation The +X mode in logformat displays hexadecimal for the following flags %Ci %Cp %Fi %Fp %Bi %Bp %Si %Sp %Ts %ct %pid rename logformat_write_string() to lf_text() Optimize size computation	2012-04-07 16:05:39 +02:00
William Lallemand	1d7055675e	MEDIUM: log: split of log_format generation * logformat functions now take a format linked list as argument * build_logline() build a logline using a format linked list * rename LOG_* by LOG_FMT_* in enum * improve error management in build_logline()	2012-04-07 16:05:02 +02:00
Aman Gupta	0bc0c2426c	MINOR: Add TO/FROM_SET flags to struct stream_interface [WT: it will make sense to remove SN_FRT_ADDR_SET and to use these flags everywhere instead ]	2012-04-07 09:17:26 +02:00
Willy Tarreau	c89ccb6221	MEDIUM: log: add a new cookie flag 'U' to report situations where cookie is not used This happens when a "use-server" rule sets the server instead.	2012-04-05 21:18:22 +02:00
Willy Tarreau	4a5cadea40	MEDIUM: session: implement the "use-server" directive Sometimes it is desirable to forward a particular request to a specific server without having to declare a dedicated backend for this server. This can be achieved using the "use-server" rules. These rules are evaluated after the "redirect" rules and before evaluating cookies, and they have precedence on them. There may be as many "use-server" rules as desired. All of these rules are evaluated in their declaration order, and the first one which matches will assign the server.	2012-04-05 21:14:10 +02:00
Aman Gupta	9a13e84cc2	MINOR: Add release callback to si_applet	2012-04-05 10:39:20 +02:00
Cyril Bonté	19979e176e	MINOR: stats admin: reduce memcmp()/strcmp() calls on status codes memcmp()/strcmp() calls were needed in different parts of code to determine the status code. Each new status code introduces new calls, which can become inefficient and source of bugs. This patch reorganizes the code to rely on a numeric status code internally and to be hopefully more generic.	2012-04-05 09:58:27 +02:00
Cyril Bonté	cf8d9ae3cd	MINOR: stats admin: allow unordered parameters in POST requests Previously, the stats admin page required POST parameters to be provided exactly in the same order as the HTML form. This patch allows to handle those parameters in any orders. Also, note that haproxy won't alter server states anymore if backend or server names are ambiguous (duplicated names in the configuration) to prevent unexpected results (the same should probably be applied to the stats socket).	2012-04-05 09:58:25 +02:00
Simon Horman	b7cd8f9a3a	CLEANUP: Fix HCHK spelling errors	2012-03-24 21:54:25 +01:00
Simon Horman	63a4a822c1	CLEANUP: Make check_statuses, analyze_statuses and process_chk static These symbols are only used inside src/checks.c	2012-03-24 21:54:19 +01:00
Willy Tarreau	b1a2faf7c9	BUG/CRITICAL: log: fix risk of crash in development snapshot Commit a1cc38 introduced a regression which was easy to trigger till `ad4cd58` (snapshots 20120222 to 20120311 included). The bug was still present after that but harder to trigger. The bug is caused by the use of two distinct log buffers due to intermediary changes. The issue happens when an HTTP request is logged just after a TCP request during the same second and the HTTP request is too large for the buffer. In this case, it happens that the HTTP request is logged into the TCP buffer instead and that length controls can't detect anything. Starting with bddd4f, the issue is still possible when logging too large an HTTP request just after a send_log() call (typically a server status change). We owe a big thanks to Sander Klein for testing several snapshots and more specifically for taking significant risks in production by letting the buggy version crash several times in order to provide an exploitable core ! The bug could not have been found without this precious help. Thank you Sander ! This fix does not need to be backported, it did not affect any released version.	2012-03-19 17:09:30 +01:00
Willy Tarreau	6580c06ba3	MINOR: log: use "%ts" to log term status only and "%tsc" to log with cookie The difference could be seen when logging a request in HTTP mode with option tcplog, as it would keep emitting 4 chars. Better use two distinct flags to clear the confusion.	2012-03-12 15:50:53 +01:00
William Lallemand	81f5117a24	BUG/MINOR: log-format: fix %o flag The %o flag was not working at all.	2012-03-12 15:50:53 +01:00
William Lallemand	b7ff6a3a36	MEDIUM: log-format: backend source address %Bi %Bp %Bi return the backend source IP %Bp return the backend source port Add a function pointer in logformat_type to do additional configuration during the log-format variable parsing.	2012-03-12 15:50:52 +01:00
William Lallemand	bddd4fd93b	MEDIUM: log: use log_format for mode tcplog Merge http_sess_log() and tcp_sess_log() to sess_log() and move it to log.c A new field in logformat_type define if you can use a logformat variable in TCP or HTTP mode. doc: log-format in tcp mode Note that due to the way log buffer allocation currently works, trying to log an HTTP request without "option httplog" is still not possible. This will change in the near future.	2012-03-12 15:47:13 +01:00
Willy Tarreau	18dd41dc46	MINOR: buffer: switch a number of buffer args to const A number of offset computation functions use struct buffer* arguments and return integers without modifying the input. Using consts helps simplifying some operations in callers.	2012-03-10 08:55:07 +01:00
Willy Tarreau	f09c6603d3	MEDIUM: backend: add the 'first' balancing algorithm The principle behind this load balancing algorithm was first imagined and modeled by Steen Larsen then iteratively refined through several work sessions until it would totally address its original goal. The purpose of this algorithm is to always use the smallest number of servers so that extra servers can be powered off during non-intensive hours. Additional tools may be used to do that work, possibly by locally monitoring the servers' activity. The first server with available connection slots receives the connection. The servers are choosen from the lowest numeric identifier to the highest (see server parameter "id"), which defaults to the server's position in the farm. Once a server reaches its maxconn value, the next server is used. It does not make sense to use this algorithm without setting maxconn. Note that it can however make sense to use minconn so that servers are not used at full load before starting new servers, and so that introduction of new servers requires a progressively increasing load (the number of servers would more or less follow the square root of the load until maxconn is reached). This algorithm ignores the server weight, and is more beneficial to long sessions such as RDP or IMAP than HTTP, though it can be useful there too.	2012-02-21 22:27:27 +01:00
Willy Tarreau	3ebb1163ba	MINOR: backend: rework the LC definition to support other connection-based algos The leastconn algorithm should be of kind "connection-based", not "leastconn" if we want to later support other connection-based LB algos.	2012-02-13 17:02:31 +01:00
William Lallemand	a1cc381151	MEDIUM: log: make http_sess_log use log_format http_sess_log now use the logformat linked list to make the log string, snprintf is not used for speed issue. CLF mode also uses logformat. NOTE: as of now, empty fields in CLF now are "" not "-" anymore.	2012-02-09 17:03:28 +01:00
William Lallemand	421f5b5882	MINOR: Date and time fonctions that don't use snprintf Also move human_time() to standard.c since it's not related to timeval calculations.	2012-02-09 17:03:28 +01:00
William Lallemand	e7340ec111	MINOR: add ultoa, ulltoa, ltoa, lltoa implementations Implementations that write result from left to right	2012-02-09 17:03:28 +01:00
William Lallemand	723b73ad75	MINOR: config: Parse the string of the log-format config keyword parse_logformat_string: parse the string, detect the type: text, separator or variable parse_logformat_var: dectect variable name parse_logformat_var_args: parse arguments and flags add_to_logformat_list: add to the logformat linked list	2012-02-09 17:03:24 +01:00
William Lallemand	2a4a44f0f9	REORG: log: split send_log function send_log function is now splited in 3 functions * hdr_log: generate the syslog header * send_log: send a syslog message with a printf format string * __send_log: send a syslog message	2012-02-09 15:54:43 +01:00
Willy Tarreau	f8e8b76ed3	BUG/MEDIUM: zero-weight servers must not dequeue requests from the backend It was reported that a server configured with a zero weight would sometimes still take connections from the backend queue. This issue is real, it happens this way : 1) the disabled server accepts a request with a cookie 2) many cookie-less requests accumulate in the backend queue 3) when the disabled server completes its request, it checks its own queue and the backend's queue 4) the server takes a pending request from the backend queue and processes it. In response, the server's cookie is assigned to the client, which ensures that some requests will continue to be served by this server, leading back to point 1 above. The fix consists in preventing a zero-weight server from dequeuing pending requests from the backend. Making use of srv_is_usable() in such tests makes the tests more robust against future changes. This fix must be backported to 1.4 and 1.3.	2012-01-20 16:18:53 +01:00
Willy Tarreau	62c3be28ed	BUG/MEDIUM: correctly disable servers tracking another disabled servers. In a config where server "s1" is marked disabled and "s2" tracks "s1", s2 appears disabled on the stats but is still inserted into the LB farm because the tracking is resolved too late in the configuration process. We now resolve tracked servers before building LB maps and we also mark the tracking server in maintenance mode, which previously was not done, causing half of the issue. Last point is that we also protect srv_is_usable() against electing a server marked for maintenance. This is not absolutely needed but is a safe choice and makes a lot of sense. This fix must be backported to 1.4.	2012-01-20 16:18:30 +01:00
Mark Lamourine	c2247f0b8d	MEDIUM: http: add support for sending the server's name in the outgoing request New option "http-send-name-header" specifies the name of a header which will hold the server name in outgoing requests. This is the name of the server the connection is really sent to, which means that upon redispatches, the header's value is updated so that it always matches the server's name.	2012-01-05 15:17:31 +01:00
Willy Tarreau	294c473756	MEDIUM: http: replace get_ip_from_hdr2() with http_get_hdr() The new function does not return IP addresses but header values instead, so that the caller is free to make what it want of them. The conversion is not quite clean yet, as the previous test which considered that address 0.0.0.0 meant "no address" is still used. A different IP parsing function should be used to take this into account.	2011-12-30 17:33:26 +01:00
Willy Tarreau	aff79da207	CLEANUP: acl: remove last data fields from the acl_test struct acl_test doesn't hold ptr nor len anymore, since everything has moved to the temporary pattern.	2011-12-30 17:33:26 +01:00
Willy Tarreau	31d1b01211	CLEANUP: acl: integer part of acl_test is not used anymore	2011-12-30 17:33:26 +01:00
Willy Tarreau	b666bc7d51	MEDIUM: pattern: change the pattern data integer from unsigned to signed Till now the pattern data integer type was unsigned without any particular reason. In order to make ACLs use it, we must switch it to signed int instead.	2011-12-30 17:33:25 +01:00
Willy Tarreau	918458439e	MINOR: acl: include pattern.h to make pattern migration more transparent	2011-12-30 17:33:25 +01:00
Willy Tarreau	5e6cc4aad8	MINOR: pattern: export the global temporary pattern The global pattern is used for pattern conversions. Export it under the name "temp_pattern" so that it can later be used by ACLs.	2011-12-30 17:33:25 +01:00
Willy Tarreau	f3bfede52e	CLEANUP: ebtree: clarify licence and update to 6.0.6 (from ebtree 6.0.6) This version is mainly aimed at clarifying the fact that the ebtree license is LGPL. Some files used to indicate LGPL and other ones GPL, while the goal clearly is to have it LGPL. A LICENSE file has also been added. No code is affected, but it's better to have the local tree in sync anyway. (cherry picked from commit 24dc7cca051f081600fe8232f33e55ed30e88425)	2011-12-02 17:09:49 +01:00
Willy Tarreau	19ae56b2b6	CLEANUP: kill buffer_replace() and use an inline instead This function is never used, only its buffer_replace2() alternative is used. Replace the former with an inline which calls the later.	2011-11-28 21:01:28 +01:00
Willy Tarreau	71730256a3	MINOR: buffers: make buffer_pointer() support negative pointers too It's more handy if the buffer_pointer() function also handles negative pointers.	2011-11-28 21:00:46 +01:00
Willy Tarreau	fe4b1f9dc0	BUG: buffers: don't return a negative value on buffer_total_space_res() In commit `4b517ca93a` (MEDIUM: buffers: add some new primitives and rework existing ones), we forgot to check if buffer_max_len() < l. No backport is needed.	2011-11-28 21:00:46 +01:00
Willy Tarreau	4b517ca93a	MEDIUM: buffers: add some new primitives and rework existing ones A number of primitives were missing for buffer management, and some of them were particularly awkward to use. Specifically, the functions used to compute free space could not always be used depending what was wrapping in the buffers. Some documentation has been added about how the buffers work and their properties. Some functions are still missing such as a buffer replacement which would support wrapping buffers.	2011-11-25 21:57:29 +01:00
William Lallemand	0f99e34978	MEDIUM: log: Use linked lists for loggers This patch settles the 2 loggers limitation. Loggers are now stored in linked lists. Using "global log", the global loggers list content is added at the end of the current proxy list. Each "log" entries are added at the end of the proxy list. "no log" flush a logger list.	2011-10-31 14:09:19 +01:00
Willy Tarreau	2e99390faf	BUG/MEDIUM: checks: fix slowstart behaviour when server tracking is in use Ludovic Levesque reported and diagnosed an annoying bug. When a server is configured to track another one and has a slowstart interval set, it's assigned a minimal weight when the tracked server goes back up but keeps this weight forever. This is because the throttling during the warmup phase is only computed in the health checking function. After several attempts to resolve the issue, the only real solution is to split the check processing task in two tasks, one for the checks and one for the warmup. Each server with a slowstart setting has a warmum task which is responsible for updating the server's weight after a down to up transition. The task does not run in othe situations. In the end, the fix is neither complex nor long and should be backported to 1.4 since the issue was detected there first.	2011-10-31 11:53:20 +01:00
Willy Tarreau	4426770013	CLEANUP: rename possibly confusing struct field "tracked" When reading the code, the "tracked" member of a server makes one think the server is tracked while it's the opposite, it's a pointer to the server being tracked. This is particularly true in constructs such as : if (srv->tracked) { Since it's the second time I get caught misunderstanding it, let's rename it to "track" to avoid the confusion.	2011-10-28 15:35:33 +02:00
Willy Tarreau	ac1932da3e	MEDIUM: tune.http.maxhdr makes it possible to configure the maximum number of HTTP headers For a long time, the max number of headers was taken as a part of the buffer size. Since the header size can be configured at runtime, it does not make much sense anymore. Nothing was making it necessary to have a static value, so let's turn this into a tunable with a default value of 101 which equals what was previously used.	2011-10-24 19:14:41 +02:00
Willy Tarreau	34eb671f24	OPTIM/MINOR: move the hdr_idx pools out of the proxy struct It makes no sense to have one pointer to the hdr_idx pool in each proxy struct since these pools do not depend on the proxy. Let's have a common pool instead as it is already the case for other types.	2011-10-24 18:15:04 +02:00
Willy Tarreau	bd9a0a7781	OPTIM/MINOR: make it possible to change pipe size (tune.pipesize) By default, pipes are the default size for the system. But sometimes when using TCP splicing, it can improve performance to increase pipe sizes, especially if it is suspected that pipes are not filled and that many calls to splice() are performed. This has an impact on the kernel's memory footprint, so this must not be changed if impacts are not understood.	2011-10-23 21:15:38 +02:00
Willy Tarreau	7b81563443	OPTIM/MINOR: move struct sockaddr_storage to the tail of structs Struct sockaddr_storage is huge (128 bytes) and severely impacts the cache. It also displaces other struct members, causing them to have larger relative offsets. By moving these few occurrences to the end of the structs which host them, we can reduce the code size by no less than 2 kB !	2011-10-23 19:08:46 +02:00
Willy Tarreau	6471afb43d	MINOR: remove the client/server side distinction in SI addresses Stream interfaces used to distinguish between client and server addresses because they were previously of different types (sockaddr_storage for the client, sockaddr_in for the server). This is not the case anymore, and this distinction is confusing at best and has caused a number of regressions to be introduced in the process of converting everything to full-ipv6. We can now remove this and have a much cleaner code.	2011-09-23 10:54:59 +02:00
Willy Tarreau	0e69854ed4	MINOR: acl: add new matches for header/path/url length This patch introduces hdr_len, path_len and url_len for matching these respective parts lengths against integers. This can be used to detect abuse or empty headers.	2011-09-16 08:32:32 +02:00
Willy Tarreau	eabea0763b	[MINOR] stats: report the number of requests intercepted by the frontend These requests are mainly monitor requests, as well as stats requests when the stats are processed by the frontend. Having this counter helps explain the difference in number of sessions that is sometimes observed between a frontend and a backend.	2011-09-10 23:32:41 +02:00
Willy Tarreau	45a1251515	[MEDIUM] poll: add a measurement of idle vs work time We now measure the work and idle times in order to report the idle time in the stats. It's expected that we'll be able to use it at other places later.	2011-09-10 18:01:41 +02:00
Willy Tarreau	a2a64e9689	[MEDIUM] session: make session_shutdown() an independant function We already had the ability to kill a connection, but it was only for the checks. Now we can do this for any session, and for this we add a specific flag "K" to the logs.	2011-09-07 23:01:56 +02:00
Willy Tarreau	532a450ebc	[MEDIUM] stats: add the ability to enable/disable/shutdown a frontend at runtime The stats socket now allows the admin to disable, enable or shutdown a frontend. This can be used when a bug is discovered in a configuration and it's desirable to fix it but the rules in place don't allow to change a running config. Thus it becomes possible to kill the frontend to release the port and start a new one in a separate process. This can also be used to temporarily make haproxy return TCP resets to incoming requests to pretend the service is not bound. For instance, this may be useful to quickly flush a very deep SYN backlog. The frontend check and lookup code was factored with the "set maxconn" usage.	2011-09-07 22:50:52 +02:00
Willy Tarreau	ce8fe259b5	[CLEANUP] proxy: make pause_proxy() perform the required controls and emit the logs It avoids duplicated code in the caller.	2011-09-07 22:47:43 +02:00
Willy Tarreau	3c63fd828a	[MEDIUM] don't limit peers nor stats socket to maxconn nor maxconnrate The peers and the stats socket are control sockets, they must not be limited by traffic rules.	2011-09-07 22:47:42 +02:00
Willy Tarreau	81c25d0ee6	[MEDIUM] add support for global.maxconnrate to limit the per-process conn rate. This one enforces a per-process connection rate limit, regardless of what may be set per frontend. It can be a way to limit the CPU usage of a process being severely attacked. The side effect is that the global process connection rate is now measured for each incoming connection, so it will be possible to report it.	2011-09-07 22:47:42 +02:00
Willy Tarreau	91886b692a	[MEDIUM] stats: add the "set maxconn" setting to the command line interface This option permits to change the global maxconn setting within the limit that was set by the initial value, which is now reported as the hard maxconn value. This allows to immediately accept more concurrent connections or to stop accepting new ones until the value passes below the indicated setting. The main use of this option is on systems where many haproxy instances are loaded and admins need to re-adjust resource sharing at run time to regain a bit of fairness between processes.	2011-09-07 22:47:41 +02:00
Willy Tarreau	588bd4f813	[BUG] http: trailing white spaces must also be trimmed after headers Trailing spaces after headers were not trimmed, only the leading ones were. An issue was detected today with a content-length value which was padded with spaces and which was rejected. Recent updates to the http-bis draft made it a lot more clear that such spaces must be ignored, so this is what this patch does. It should be backported to 1.4.	2011-09-05 00:54:56 +02:00
Willy Tarreau	631f01c2f1	[MINOR] make use of addr_to_str() and get_host_port() to replace many inet_ntop() Many inet_ntop calls were partially right, which was hard to detect given the complex combinations. Some of them were relying on the listener's proto instead of the address itself, which could have been different when dealing with an accept-proxy connection. The new addr_to_str() function does the dirty job and returns the family, which makes it particularly suited to calls from switch/case statements. A large number of if/else statements were removed and the stats output could even be cleaned up in the case of session dump. As a side effect of doing this, the resulting code is smaller by almost 1kB. All changed parts have been tested and provided expected output.	2011-09-05 00:54:36 +02:00
Willy Tarreau	43d8fb2d3a	[REORG] build: move syscall redefinition to specific places Some older libc don't define splice() and and don't define _syscall() either, which causes build errors if splicing is enabled. To solve this, we now split the syscall redefinition into two layers : - one file per syscall (epoll, splice) - one common file to declare the _syscall() macros The code is cleaner because files using the syscalls just have to include their respective file. It's not adviced to merge multiple syscall families into a same file if all are not intended to be used simultaneously, because defining unused static functions causes warnings to be emitted during build. As a result, the new USE_MY_SPLICE parameter was added in order to be able to define the splice() syscall separately.	2011-08-23 00:11:25 +02:00
Willy Tarreau	87cf51406c	[MEDIUM] http: make x-forwarded-for addition conditional If "option forwardfor" has the "if-none" argument, then the header is only added when the request did not already have one. This option has security implications, and should not be set blindly.	2011-08-19 22:57:24 +02:00
Willy Tarreau	1ee51a6581	[BUG] check: http-check expect + regex would crash in defaults section Manoj Kumar reported a case where haproxy would crash upon start-up. The cause was an "http-check expect" statement declared in the defaults section, which caused a NULL regex to be used during the check. This statement is not allowed in defaults sections precisely because this requires saving a copy of the regex in the default proxy. But the check was not made to prevent it from being declared there, hence the issue. Instead of adding code to detect its abnormal use, we decided to implement it. It was not that much complex because the expect_str part was not used with regexes, so it could hold the string form of the regex in order to compile it again for every backend (there's no way to clone regexes). This patch has been tested and works. So it's both a bugfix and a minor feature enhancement. It should be backported to 1.4 though it's not critical since the config was not supposed to be supported.	2011-08-19 20:14:01 +02:00
Willy Tarreau	1620ec39a7	[MEDIUM] checks: group health checks methods by values and save option bits Adding health checks has become a real pain, with cross-references to all checks everywhere because they're all a single bit. Since they're all exclusive, let's change this to have a check number only. We reserve 4 bits allowing up to 16 checks (15+tcp), only 7 of which are currently used. The code has shrunk by almost 1kB and we saved a few option bits. The "dispatch" option has been moved to px->options, making a few tests a bit cleaner.	2011-08-06 17:08:40 +02:00
Herv� COMMOWICK	ec032d63a6	[MINOR] check: add redis check support This patch provides a new "option redis-check" statement to enable server health checks based on redis PING request (http://www.redis.io/commands/ping).	2011-08-06 15:52:47 +02:00
Herv� COMMOWICK	daa824e513	[MINOR] acl: add srv_conn acl to count connections on a specific backend server These ACLs are used to check the number of active connections on the specified server in the specified backend.	2011-08-06 15:52:27 +02:00
Willy Tarreau	e9b2602ac5	[MEDIUM] listeners: add a global listener management task This global task is used to periodically check for end of resource shortage and to try to enable queued listeners again. This is important in case some temporary system-wide shortage is encountered, so that we don't have to wait for an existing connection to be released before checking the queue again. For situations where listeners are queued due to the global maxconn being reached, the task is woken up at least every second. For situations where a system resource shortage is detected (memory, sockets, ...) the task is woken up at least every 100 ms. That way, recovery from severe events can still be achieved under acceptable conditions.	2011-08-01 20:57:55 +02:00
Willy Tarreau	237250cc0d	[BUG] proxy: stats frontend and peers were missing many initializers This was revealed with one of the very latest patches which caused the listener_queue not to be initialized on the stats socket frontend. And in fact a number of other ones were missing too. This is getting so boring that now we'll always make use of the same function to initialize any proxy. Doing so has even saved about 500 bytes on the binary due to the avoided code redundancy. No backport is needed.	2011-07-29 02:00:19 +02:00
Willy Tarreau	918ff608f8	[MAJOR] proxy: finally get rid of maintain_proxies() This function is finally not needed anymore, as it has been replaced with a per-proxy task that is scheduled when some limits are encountered on incoming connections or when the process is stopping. The savings should be noticeable on configs with a large number of proxies. The most important point is that the rate limiting is now enforced in a clean and solid way.	2011-07-25 16:33:49 +02:00
Willy Tarreau	26e4881a2d	[MINOR] task: new function task_schedule() to schedule a wake up This function is used when a task should be woken up at most at a given date. This will be used with rate shapers.	2011-07-25 15:30:39 +02:00
Willy Tarreau	562515cac1	[CLEANUP] proxy: rename a few proxy states (PR_STIDLE and PR_STRUN) Those states have been replaced with PR_STFULL and PR_STREADY respectively, as it is what matches them the best now. Also, two occurrences of PR_STIDLE in peers.c have been removed as this did not provide any form of error recovery anyway.	2011-07-25 08:11:52 +02:00
Willy Tarreau	07687c171e	[MEDIUM] listeners: queue proxy-bound listeners at the proxy's All listeners that are limited by a proxy-specific resource are now queued at the proxy's and not globally. This allows finer-grained wakeups when releasing resource.	2011-07-24 23:55:06 +02:00
Willy Tarreau	08ceb1012b	[MEDIUM] listeners: put listeners in queue upon resource shortage When an accept() fails because of a connection limit or a memory shortage, we now disable it and queue it so that it's dequeued only when a connection is released. This has improved the behaviour of the process near the fd limit as now a listener with a no connection (eg: stats) will not loop forever trying to get its connection accepted. The solution is still not 100% perfect, as we'd like to have this used when proxy limits are reached (use a per-proxy list) and for safety, we'd need to have dedicated tasks to periodically re-enable them (eg: to overcome temporary system-wide resource limitations when no connection is released).	2011-07-24 22:58:00 +02:00
Willy Tarreau	e6ca1fcd84	[MINOR] listeners: add support for queueing resource limited listeners When a listeners encounters a resource shortage, it currently stops until one re-enables it. This is far from being perfect as it does not yet handle the case where the single connection from the listener is rejected (eg: the stats page). Now we'll have a special status for resource limited listeners and we'll queue them into one or multiple lists. That way, each time we have to stop a listener because of a resource shortage, we can enqueue it and change its state, so that it is dequeued once more resources are available. This patch currently does not change any existing behaviour, it only adds the basic building blocks for doing that.	2011-07-24 22:03:52 +02:00
Willy Tarreau	627937158f	[MINOR] listeners: add listen_full() to mark a listener full This is just a cleanup which removes calls to EV_FD_CLR() and state setting everywhere in the code.	2011-07-24 19:25:28 +02:00
Willy Tarreau	be58c38264	[MEDIUM] proxy: add a PAUSED state to listeners and move socket tricks out of proxy.c Managing listeners state is difficult because they have their own state and can at the same time have theirs dictated by their proxy. The pause is not done properly, as the proxy code is fiddling with sockets. By introducing new functions such as pause_listener()/resume_listener(), we make it a bit more obvious how/when they're supposed to be used. The listen_proxies() function was also renamed to resume_proxies() since it's only used for pause/resume. This patch is the first in a series aiming at getting rid of the maintain_proxies mess. In the end, proxies should not call enable_listener()/disable_listener() anymore.	2011-07-24 19:09:37 +02:00
Willy Tarreau	9bd0d744ef	[BUG] session: risk of crash on out of memory (1.5-dev regression) Patch af5149 introduced an issue which can be detected only on out of memory conditions : a LIST_DEL() may be performed on an uninitialized struct member instead of a LIST_INIT() during the accept() phase, causing crashes and memory corruption to occur. This issue was detected and diagnosed by the Exceliance R&D team. This is 1.5-specific and very recent, so no existing deployment should be impacted.	2011-07-20 00:22:54 +02:00
Simon Horman	fa46168c8f	[MINOR] Add non-stick server option Never add connections allocated to this sever to a stick-table. This may be used in conjunction with backup to ensure that stick-table persistence is disabled for backup servers.	2011-06-25 21:14:17 +02:00
Simon Horman	e869176486	[MINOR] Make appsess{,ion}_refresh static apsession_refresh() and apsess_refressh are only used inside apsession.c and thus can be made static. The only use of apsession_refresh() is appsession_task_init(). These functions have been re-ordered to avoid the need for a forward-declaration of apsession_refresh().	2011-06-25 21:07:01 +02:00
Simon Horman	752dc4ab2d	[MINOR] Add down termination condition If a connection is closed by because the backend became unavailable then log 'D' as the termination condition. Signed-off-by: Simon Horman <horms@verge.net.au>	2011-06-21 22:10:56 +02:00
Simon Horman	e0d1bfb4c1	[MINOR] Allow shutdown of sessions when a server becomes unavailable This adds the "on-marked-down shutdown-sessions" statement on "server" lines, which causes all sessions established on a server to be killed at once when the server goes down. The task's priority is reniced to the highest value (1024) so that servers holding many tasks don't cause a massive slowdown due to the wakeup storm.	2011-06-21 22:00:21 +02:00
Simon Horman	af51495397	[MINOR] Add active connection list to server The motivation for this is to allow iteration of all the connections of a server without the expense of iterating over the global list of connections. The first use of this will be to implement an option to close connections associated with a server when is is marked as being down or in maintenance mode.	2011-06-21 22:00:12 +02:00
Simon Horman	dec5be4ed4	[CLEANUP] session.c: Make functions static where possible	2011-06-18 20:27:19 +02:00
Simon Horman	96553775a0	[CLEANUP] peers.h: fix declarations * The declaration of peer_session_create() does not match its definition. As it is only used inside of peers.c make it static. * Make the declaration of peers_register_table() match its definition. * Also, make all functions in peers.c that are not also in peers.h static	2011-06-18 20:27:19 +02:00
Simon Horman	c88b887d8d	[MINOR] More flexible clearing of stick table * Allow clearing of all entries of a table * Allow clearing of all entries of a table that match a data filter	2011-06-17 11:39:29 +02:00
Simon Horman	9bd2c73916	[CLEANUP] dumpstats: make symbols static where possible	2011-06-17 11:39:28 +02:00
Willy Tarreau	bf9c2fcd93	[BUG] stats: support url-encoded forms Bashkim Kasa reported that the stats admin page did not work when colons were used in server or backend names. This was caused by url-encoding resulting in ':' being sent as '%3A'. Now we systematically decode the field names and values to fix this issue.	2011-05-31 22:44:28 +02:00
Willy Tarreau	14acc7072e	[OPTIM] stream_sock: don't use splice on too small payloads It's more expensive to call splice() on short payloads than to use recv()+send(). One of the reasons is that doing a splice() involves allocating a pipe. One other reason is that the kernel will have to copy itself if we try to splice less than a page. So let's fix a short offset of 4kB below which we don't splice. A quick test shows that on chunked encoded data, with splice we had 6826 syscalls (1715 splice, 3461 recv, 1650 send) while with this patch, the same transfer resulted in 5793 syscalls (3896 recv, 1897 send).	2011-05-30 18:42:41 +02:00
Willy Tarreau	96e312139a	[MEDIUM] http: add support for "http-no-delay" There are some very rare server-to-server applications that abuse the HTTP protocol and expect the payload phase to be highly interactive, with many interleaved data chunks in both directions within a single request. This is absolutely not supported by the HTTP specification and will not work across most proxies or servers. When such applications attempt to do this through haproxy, it works but they will experience high delays due to the network optimizations which favor performance by instructing the system to wait for enough data to be available in order to only send full packets. Typical delays are around 200 ms per round trip. Note that this only happens with abnormal uses. Normal uses such as CONNECT requests nor WebSockets are not affected. When "option http-no-delay" is present in either the frontend or the backend used by a connection, all such optimizations will be disabled in order to make the exchanges as fast as possible. Of course this offers no guarantee on the functionality, as it may break at any other place. But if it works via HAProxy, it will work as fast as possible. This option should never be used by default, and should never be used at all unless such a buggy application is discovered. The impact of using this option is an increase of bandwidth usage and CPU usage, which may significantly lower performance in high latency environments. This change should be backported to 1.4 since the first report of such a misuse was in 1.4. Next patch will also be needed.	2011-05-30 18:42:41 +02:00
Willy Tarreau	ae94d4df8f	[MINOR] http: make the "HTTP 200" status code configurable. This status code is used in response to requests matching "monitor-uri". Some users need to adjust it to fit their needs (eg: make some strings appear there). As it's already defined as a chunked string and used exactly like other status codes, it makes sense to make it configurable with the usual "errorfile", "errorloc", ...	2011-05-11 16:31:43 +02:00
Willy Tarreau	436d9ed808	[REORG] http: move HTTP error codes back to proto_http.h This one was left isolated in its own file. It probably is a leftover from the 1.2->1.3 split.	2011-05-11 16:31:43 +02:00
Willy Tarreau	c9f6011760	[BUG] TCP source tracking was broken with IPv6 changes John Helliwell reported a bug when using TCP source address tracking on Solaris. The bug was introduced in haproxy 1.5-dev5.	2011-04-07 10:53:25 +02:00
Willy Tarreau	1b4b7ce6dd	[BUG] stream_sock: use get_addr_len() instead of sizeof() on sockaddr_storage John Helliwell reported a runtime issue on Solaris since 1.5-dev5. Traces show that connect() returns EINVAL, which means the socket length is not appropriate for the family. Solaris does not like being called with sizeof and needs the address family's size on sockaddr_storage. The fix consists in adding a get_addr_len() function which returns the socket's address length based on its family. Tests show that this works for both IPv4 and IPv6 addresses.	2011-04-05 16:56:50 +02:00
David du Colombier	4f92d32004	[MEDIUM] IPv6 support for stick-tables Since IPv6 is a different type than IPv4, the pattern fetch functions src6 and dst6 were added. IPv6 stick-tables can also fetch IPv4 addresses with src and dst. In this case, the IPv4 addresses are mapped to their IPv6 counterpart, according to RFC 4291.	2011-03-29 01:09:14 +02:00
David du Colombier	11bcb6c4f5	[MEDIUM] IPv6 support for syslog	2011-03-28 18:45:15 +02:00
Willy Tarreau	0bc3493d2c	[OPTIM] buffers: uninline buffer_forward() Since the latest additions to buffer_forward(), it became too large for inlining, so let's uninline it. The code size drops by 3kB. Should be backported to 1.4 too.	2011-03-28 16:25:58 +02:00
Willy Tarreau	d8ee85a0a3	[BUG] http: fix content-length handling on 32-bit platforms Despite much care around handling the content-length as a 64-bit integer, forwarding was broken on 32-bit platforms due to the 32-bit nature of the ->to_forward member of the "buffer" struct. The issue is that this member is declared as a long, so while it works OK on 64-bit platforms, 32-bit truncate the content-length to the lower 32-bits. One solution could consist in turning to_forward to a long long, but it is used a lot in the critical path, so it's not acceptable to perform all buffer size computations on 64-bit there. The fix consists in changing the to_forward member to a strict 32-bit integer and ensure in buffer_forward() that only the amount of bytes that can fit into it is considered. Callers of buffer_forward() are responsible for checking that their data were taken into account. We arbitrarily ensure we never consider more than 2G at once. That's the way it was intended to work on 32-bit platforms except that it did not. This issue was tracked down hard at Exosec with Bertrand Jacquin, Thierry Fournier and Julien Thomas. It remained undetected for a long time because files larger than 4G are almost always transferred in chunked-encoded format, and most platforms dealing with huge contents these days run on 64-bit. The bug affects all 1.5 and 1.4 versions, and must be backported.	2011-03-28 16:25:16 +02:00
Willy Tarreau	d3db94399f	[MINOR] tools: add two macros MID_RANGE and MAX_RANGE Those will be used later, they return the largest and middle integer possible for a given variable or type.	2011-03-28 15:55:43 +02:00
Willy Tarreau	fab5a43726	[MEDIUM] config: rework the IPv4/IPv6 address parser to support host-only addresses The parser now distinguishes between pure addresses and address:port. This is useful for some config items where only an address is required. Raw IPv6 addresses are now parsed, but IPv6 host name resolution is still not handled (gethostbyname does not resolve IPv6 names to addresses).	2011-03-23 19:01:18 +01:00
David du Colombier	64e9c90e69	[BUG] standard: is_addr return value for IPv4 was inverted	2011-03-22 14:39:16 +01:00
Willy Tarreau	5ab04ec47c	[MEDIUM] server: add support for the "send-proxy" option This option enables use of the PROXY protocol with the server, which allows haproxy to transport original client's address across multiple architecture layers.	2011-03-20 11:53:50 +01:00
Willy Tarreau	b22e55bc8f	[MEDIUM] stream_sock: add support for sending the proxy protocol header line Upon connection establishment, stream_sock is now able to send a PROXY line before sending any data. Since it's possible that the buffer is already full, and we don't want to allocate a block for that line, we compute it on-the-fly when we need it. We just store the offset from which to (re-)send from the end of the line, since it's assumed that multiple outputs of the same proxy line will be strictly equivalent. In practice, one call is enough. We just make sure to handle the case where the first send() would indicate an incomplete output, eventhough it's very unlikely to ever happen.	2011-03-20 10:16:46 +01:00
Willy Tarreau	a73fcaf424	[MINOR] frontend: add a make_proxy_line function This function will build a PROXY protocol line header from two addresses (IPv4 or IPv6). AF_UNIX family will be reported as UNKNOWN.	2011-03-20 10:15:22 +01:00
Willy Tarreau	ff011f26e9	[REORG] http: move the http-request rules to proto_http And also rename "req_acl_rule" "http_req_rule". At the beginning that was a bit confusing to me, especially the "req_acl" list which in fact holds what we call rules. After some digging, it appeared that some part of the code is 100% HTTP and not just related to authentication anymore, so let's move that part to HTTP and keep the auth-only code in auth.c.	2011-03-13 22:00:24 +01:00
Willy Tarreau	7d0aaf39d1	[MEDIUM] stats: split frontend and backend stats It's very annoying that frontend and backend stats are merged because we don't know what we're observing. For instance, if a "listen" instance makes use of a distinct backend, it's impossible to know what the bytes_out means. Some points take care of not updating counters twice if the backend points to the frontend, indicating a "listen" instance. The thing becomes more complex when we try to add support for server side keep-alive, because we have to maintain a pointer to the backend used for last request, and to update its stats. But we can't perform such comparisons anymore because the counters will not match anymore. So in order to get rid of this situation, let's have both frontend AND backend stats in the "struct proxy". We simply update the relevant ones during activity. Some of them are only accounted for in the backend, while others are just for frontend. Maybe we can improve a bit on that later, but the essential part is that those counters now reflect what they really mean.	2011-03-13 22:00:23 +01:00
David du Colombier	6f5ccb1589	[MEDIUM] add internal support for IPv6 server addresses This patch turns internal server addresses to sockaddr_storage to store IPv6 addresses, and makes the connect() function use it. This code already works but some caveats with getaddrinfo/gethostbyname still need to be sorted out while the changes had to be merged at this stage of internal architecture changes. So for now the config parser will not emit an IPv6 address yet so that user experience remains unchanged. This change should have absolutely zero user-visible effect, otherwise it's a bug introduced during the merge, that should be reported ASAP.	2011-03-13 22:00:12 +01:00
Willy Tarreau	827aee913f	[MAJOR] session: remove the ->srv pointer from struct session This one has been removed and is now totally superseded by ->target. To get the server, one must use target_srv(&s->target) instead of s->srv now. The function ensures that non-server targets still return NULL.	2011-03-10 23:32:17 +01:00
Willy Tarreau	9e000c6ec8	[CLEANUP] stream_interface: use inline functions to manipulate targets The connection target involves a type and a union of pointers, let's make the code cleaner using simple wrappers.	2011-03-10 23:32:17 +01:00
Willy Tarreau	3d80d911aa	[MEDIUM] session: remove s->prev_srv which is not needed anymore s->prev_srv is used by assign_server() only, but all code paths leading to it now take s->prev_srv from the existing s->srv. So assign_server() can do that copy into its own stack. If at one point a different srv is needed, we still have a copy of the last server on which we failed a connection attempt in s->target.	2011-03-10 23:32:16 +01:00
Willy Tarreau	664beb8610	[MINOR] session: add a pointer to the new target into the session When dealing with HTTP keep-alive, we'll have to know if we can reuse an existing connection. For that, we'll have to check if the current connection was made on the exact same target (referenced in the stream interface). Thus, we need to first assign the next target to the session, then copy it to the stream interface upon connect(). Later we'll check for equivalence between those two operations.	2011-03-10 23:32:16 +01:00
Willy Tarreau	f5ab69aad9	[MINOR] proxy: add PR_O2_DISPATCH to detect dispatch mode Till now we used the fact that the dispatch address was not null to use the dispatch mode. This is very unconvenient, so let's have a dedicated option.	2011-03-10 23:32:16 +01:00
Willy Tarreau	295a837726	[REORG] session: move the data_ctx struct to the stream interface's applet This is in fact where those parts belong to. The old data_state was replaced by applet.state and is now initialized when the applet is registered. It's worth noting that the applet does not need to know the session nor the buffer anymore since everything is brought by the stream interface. It is possible that having a separate applet struct would simplify the code but that's not a big deal.	2011-03-10 23:32:16 +01:00
Willy Tarreau	5ec29ffa42	[CLEANUP] stats: make all dump functions only rely on the stream interface This will be needed to move the applet-specific data out of the session.	2011-03-10 23:32:16 +01:00
Willy Tarreau	75581aebb0	[CLEANUP] session: remove data_source from struct session This one was only used for logging purposes, it's not needed anymore.	2011-03-10 23:32:15 +01:00
Willy Tarreau	7c0a151a2e	[CLEANUP] stream_interface: remove the applet.handler pointer Now that we have the target pointer and type in the stream interface, we don't need the applet.handler pointer anymore. That makes the code somewhat cleaner because we know we're dealing with an applet by checking its type instead of checking the pointer is not null.	2011-03-10 23:32:15 +01:00
Willy Tarreau	ac82540c35	[MEDIUM] stream_interface: store the target pointer and type When doing a connect() on a stream interface, some information is needed from the server and from the backend. In some situations, we don't have a server and only a backend (eg: peers). In other cases, we know we have an applet and we don't want to connect to anything, but we'd still like to have the info about the applet being used. For this, we now store a pointer to the "target" into the stream interface. The target describes what's on the other side before trying to connect. It can be a server, a proxy or an applet for now. Later we'll probably have descriptors for multiple-stage chains so that the final information may still be found. This will help removing many specific cases in the code. It already made it possible to remove the "srv" and "be" parameters to tcpv4_connect_server().	2011-03-10 23:32:15 +01:00
Willy Tarreau	f153686a71	[REORG] tcp: make tcpv4_connect_server() take the target address from the SI The address is now available in the stream interface, no need to pass it by argument.	2011-03-10 23:32:15 +01:00
Willy Tarreau	957c0a5845	[REORG] session: move client and server address to the stream interface This will be needed very soon for the keep-alive.	2011-03-10 23:32:14 +01:00
Willy Tarreau	be5ea19188	[REORG] stream_interface: split the struct members in 3 parts Those 3 parts are the buffer side, the remote side and the communication functions. This change has no functional effect but is needed to proceed further.	2011-03-10 23:32:14 +01:00
Willy Tarreau	bc4af0573c	[REORG] stream_interface: move the st0, st1 and private members to the applet Those fields are only used by the applets, so let's move them to the struct.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b24281b0ff	[MINOR] stream_interface: make use of an applet descriptor for IO handlers I/O handlers are still delicate to manipulate. They have no type, they're just raw functions which have no knowledge of themselves. Let's have them declared as applets once for all. That way we can have multiple applets share the same handler functions and we can store their names there. When we later need to add more parameters (eg: usage stats), we'll be able to do so in the applets themselves. The CLI functions has been prefixed with "cli" instead of "stats" as it's clearly what is going on there. The applet descriptor in the stream interface should get all the applet specific data (st0, ...) but this will be done in the next patch so that we don't pollute this one too much.	2011-03-10 23:32:14 +01:00
Willy Tarreau	124d99181c	[BUG] http: fix computation of message body length after forwarding has started Till now, the forwarding code was making use of the hdr_content_len member to hold the size of the last chunk parsed. As such, it was reset after being scheduled for forwarding. The issue is that this entry was reset before the data could be viewed by backend.c in order to parse a POST body, so the "balance url_param check_post" did not work anymore. In order to fix this, we need two things : - the chunk size (reset upon every forward) - the total body size (not reset) hdr_content_len was thus replaced by the former (hence the size of the patch) as it makes more sense to have it stored that way than the way around. This patch should be backported to 1.4 with care, considering that it affects the forwarding code.	2011-03-01 20:30:48 +01:00
Willy Tarreau	930bd6b763	[MINOR] acl: add ability to check for internal response-only parameters Some parameters are known to the response only (eg: the server which processed it). Let's have a flag for that.	2011-02-23 15:32:20 +01:00
Rauf Kuliyev	38b4156a69	[MINOR] checks: add PostgreSQL health check I have written a small patch to enable a correct PostgreSQL health check It works similar to mysql-check with the very same parameters. E.g.: listen pgsql 127.0.0.1:5432 mode tcp option pgsql-check user pgsql server masterdb pgsql.server.com:5432 check inter 10000	2011-01-04 15:14:13 +01:00
Kevinm	48936af9a2	[MINOR] log: ability to override the syslog tag One of the requirements we have is to run multiple instances of haproxy on a single host; this is so that we can split the responsibilities (and change permissions) between product teams. An issue we ran up against is how we would distinguish between the logs generated by each instance. The solution we came up with (please let me know if there is a better way) is to override the application tag written to syslog. We can then configure syslog to write these to different files. I have attached a patch adding a global option 'log-tag' to override the default syslog tag 'haproxy' (actually defaults to argv[0]).	2010-12-30 11:43:36 +01:00
Joe Williams	df5b38fac1	[MINOR] log: add support for passing the forwarded hostname Haproxy does not include the hostname rather the IP of the machine in the syslog headers it sends. Unfortunately this means that for each log line rsyslog does a reverse dns on the client IP and in the case of non-routable IPs one gets the public hostname not the internal one. While this is valid according to RFC3164 as one might imagine this is troublsome if you have some machines with public IPs, internal IPs, no reverse DNS entries, etc and you want a standardized hostname based log directory structure. The rfc says the preferred value is the hostname. This patch adds a global "log-send-hostname" statement which accepts an optional string to force the host name. If unset, the local host name is used.	2010-12-29 17:05:48 +01:00
Willy Tarreau	0499e3575c	[BUG] http: analyser optimizations broke pipelining HTTP pipelining currently needs to monitor the response buffer to wait for some free space to be able to send a response. It was not possible for the HTTP analyser to be called based on response buffer activity. Now we introduce a new buffer flag BF_WAKE_ONCE which is set when the HTTP request analyser is set on the response buffer and some activity is detected. This is not clean at all but once of the only ways to fix the issue before we make it possible to register events for analysers. Also it appeared that one realign condition did not cover all cases.	2010-12-17 07:15:57 +01:00
Willy Tarreau	10479e4bac	[MINOR] stats: add global event ID and count This counter will help quickly spot whether there are new errors or not. It is also assigned to each capture so that a script can keep trace of which capture was taken when.	2010-12-12 14:00:34 +01:00
Willy Tarreau	078272e115	[MINOR] stats: report HTTP message state and buffer flags in error dumps Debugging parsing errors can be greatly improved if we know what the parser state was and what the buffer flags were (especially for closed inputs/outputs and full buffers). Let's add that to the error snapshots.	2010-12-12 12:46:33 +01:00
Willy Tarreau	798a39cdc9	[MEDIUM] hash: add support for an 'avalanche' hash-type When the number of servers is a multiple of the size of the input set, map-based hash can be inefficient. This typically happens with 64 servers when doing URI hashing. The "avalanche" hash-type applies an avalanche hash before performing a map lookup in order to smooth the distribution. The result is slightly less smooth than the map for small numbers of servers, but still better than the consistent hashing.	2010-11-29 07:28:16 +01:00
Willy Tarreau	4c14eaa0d4	[CLEANUP] hash: move the avalanche hash code globally available We'll use this hash at other places, let's make it globally available. The function has also been renamed because its "chash_hash" name was not appropriate.	2010-11-29 07:28:16 +01:00
Willy Tarreau	b695a6e5fa	[BUILD] pattern: use 'int' instead of 'int32_t' Ross West reported that int32_t breaks compilation on FreeBSD. Since an int is 32-bit on all supported platforms and we already rely on that, change the type.	2010-11-14 14:24:27 +01:00
Emeric Brun	32da3c40db	[MEDIUM] Manage peers section parsing and stick table registration on peers.	2010-11-11 09:29:08 +01:00
Emeric Brun	2b920a1af1	[MAJOR] Add new files src/peer.c, include/proto/peers.h and include/types/peers.h for sync stick table management Add cmdline option -L to configure local peer name	2010-11-11 09:29:08 +01:00
Emeric Brun	85e77c7f0d	[MEDIUM] Create updates tree on stick table to manage sync.	2010-11-11 09:29:08 +01:00
Emeric Brun	485479d8e9	[MEDIUM] Create new protected pattern types CONSTSTRING and CONSTDATA to force memcpy if data from protected areas need to be manipulated. Enhance pattern convs and fetch argument parsing, now fetchs and convs callbacks used typed args. Add more details on error messages on parsing pattern expression function. Update existing pattern convs and fetchs to new proto. Create stick table key type "binary". Manage Truncation and padding if pattern's fetch-converted result don't match table key size.	2010-11-11 09:29:07 +01:00
Emeric Brun	97679e7901	[MEDIUM] Implement tcp inspect response rules	2010-11-11 09:28:18 +01:00
Emeric Brun	c89a57284a	[BUG] stick table entries expire on counters updates/read or show table, even if there is no "expire" parameter	2010-11-11 09:28:18 +01:00
Willy Tarreau	17f449b214	[MINOR] move MAXPATHLEN definition to compat.h MAXPATHLEN may be used at other places, it's unconvenient to have it redefined in a few files. Also, since checking it requires including sys/param.h, some versions of it cause a macro declaration conflict with MIN/MAX which are defined in tools.h. The solution consists in including sys/param.h in both files so that we ensure it's loaded before the macros are defined and MAXPATHLEN is checked.	2010-11-11 09:21:53 +01:00
Emeric Brun	ed76092e10	[MEDIUM] Add supports of bind on unix sockets.	2010-11-09 15:59:42 +01:00
Emeric Brun	cf20bf1c1c	[MEDIUM] Enhance message errors management on binds	2010-11-05 10:34:07 +01:00
Willy Tarreau	8b0cbf9969	[MINOR] frontend: add a new analyser to parse a proxied connection The introduction of a new PROXY protocol for proxied connections requires an early analyser to decode the incoming connection and set the session flags accordingly. Some more work is needed, among which setting a flag on the session to indicate it's proxied, and copying the original parameters for later comparisons with new ACLs (eg: real_src, ...).	2010-10-30 19:04:38 +02:00
Willy Tarreau	74172757c7	[MINOR] standard: change arg type from const char* to char* inetaddr_host_lim_ret() used to make use of const char for some args, but that make it impossible ot use char due to the way controls are made by gcc. So let's change that.	2010-10-30 19:04:37 +02:00
Willy Tarreau	4ec83cd939	[MINOR] standard: add read_uint() to parse a delimited unsigned integer This function parses an integer and returns it along with the pointer to the next char not part of the number.	2010-10-30 19:04:37 +02:00
Willy Tarreau	8a95691ae8	[MINOR] listener: add the "accept-proxy" option to the "bind" keyword This option will enable the AN_REQ_DECODE_PROXY analyser on the requests that come from those listeners.	2010-10-30 19:04:37 +02:00
Willy Tarreau	6e595772ad	[MINOR] buffers: add a new request analyser flag for PROXY mode Since it must be the first analyser, the other flags have been renumbered.	2010-10-30 19:04:37 +02:00
Willy Tarreau	ba4c5be880	[MINOR] cookie: add support for the "preserve" option This option makes haproxy preserve any persistence cookie emitted by the server, which allows the server to change it or to unset it, for instance, after a logout request. (cherry picked from commit 52e6d75374c7900c1fe691c5633b4ae029cae8d5)	2010-10-30 19:04:36 +02:00
Cyril Bont�	474be415af	[MEDIUM] stats: add an admin level The stats web interface must be read-only by default to prevent security holes. As it is now allowed to enable/disable servers, a new keyword "stats admin" is introduced to activate this admin level, conditioned by ACLs. (cherry picked from commit 5334bab92ca7debe36df69983c19c21b6dc63f78)	2010-10-30 19:04:34 +02:00
Cyril Bont�	70be45dbdf	[MEDIUM] enable/disable servers from the stats web interface Based on a patch provided by Judd Montgomery, it is now possible to enable/disable servers from the stats web interface. This allows to select several servers in a backend and apply the action to them at the same time. Currently, there are 2 known limitations : - The POST data are limited to one packet (don't alter too many servers at a time). - Expect: 100-continue is not supported. (cherry picked from commit 7693948766cb5647ac03b48e782cfee2b1f14491)	2010-10-30 19:04:34 +02:00
Willy Tarreau	f64d1410fc	[MEDIUM] cookie: check for maxidle and maxlife for incoming dated cookies If a cookie comes in with a first or last date, and they are configured on the backend, they're checked. If a date is expired or too far in the future, then the cookie is ignored and the specific reason appears in the cookie field of the logs. (cherry picked from commit faa3019107eabe6b3ab76ffec9754f2f31aa24c6)	2010-10-30 19:04:33 +02:00
Willy Tarreau	c01062bead	[MINOR] add encode/decode function for 30-bit integers from/to base64 These functions only require 5 chars to encode 30 bits, and don't expect any padding. They will be used to encode dates in cookies. (cherry picked from commit a7e2b5fc4612994c7b13bcb103a4a2c3ecd6438a)	2010-10-30 19:04:33 +02:00
Willy Tarreau	f1348310e8	[MEDIUM] cookie: reassign set-cookie status flags to store more states The set-cookie status flags were not very handy and limited. Reorder them to save some room for additional values and add the "U" flags (for Updated expiration date) that will be used with expirable cookies in insert mode. (cherry picked from commit 5bab52f821bb0fa99fc48ad1b400769e66196ece)	2010-10-30 19:04:33 +02:00
Willy Tarreau	b761ec4c94	[MINOR] cookie: add the expired (E) and old (O) flags for request cookies These flags will indicate the cookie status when an expiration date is set. (cherry picked from commit 3f0f0e4583a432d34b75bc7b9dd2c756b4e181a7)	2010-10-30 19:04:33 +02:00
Willy Tarreau	92954fdf2e	[MINOR] http: make some room in the transaction flags to extend cookies We'll need one more bit to store and report the request cookie's status. Doing this required moving a few bits around. However, now in 1.4 all bits are used, there's no room left. Cookie flags will need (cherry picked from commit 09ebca0413c43620ddc375b5b4ab31a25d47b3f4)	2010-10-30 19:04:32 +02:00
Willy Tarreau	bca9969daf	[MEDIUM] cookie: support client cookies with some contents appended to their value In all cookie persistence modes but prefix, we now support cookies whose value is suffixed with some contents after a vertical bar ('\|'). This will be used to pass an optional expiration date. So as of now we only consider the part of the cookie value which is used before the vertical bar. (cherry picked from commit a4486bf4e5b03b5a980d03fef799f6407b2c992d)	2010-10-30 19:04:32 +02:00
Willy Tarreau	3193685865	[MINOR] cookie: add options "maxidle" and "maxlife" Add two new arguments to the "cookie" keyword, to be able to fix a max idle and max life on them. Right now only the parameter parsing is implemented. (cherry picked from commit 9ad5dec4c3bb8f29129f292cb22d3fc495fcc98a)	2010-10-30 19:04:32 +02:00
Willy Tarreau	43961d523f	[MINOR] global: add "tune.chksize" to change the default check buffer size HTTP content-based health checks will be involved in searching text in pages. Some pages may not fit in the default buffer (16kB) and sometimes it might be desired to have larger buffers in order to find patterns. Running checks on smaller URIs is always preferred of course. (cherry picked from commit 043f44aeb835f3d0b57626c4276581a73600b6b1)	2010-10-30 19:04:32 +02:00
Willy Tarreau	bd741540d2	[MEDIUM] checks: add support for HTTP contents lookup This patch adds the "http-check expect [r]{string,status}" statements which enable health checks based on whether the response status or body to an HTTP request contains a string or matches a regex. This probably is one of the oldest patches that remained unmerged. Over the time, several people have contributed to it, among which FinalBSD (first and second implementations), Nick Chalk (port to 1.4), Anze Skerlavaj (tests and fixes), Cyril Bont� (general fixes), and of course myself for the final fixes and doc during integration. Some people already use an old version of this patch which has several issues, among which the inability to search for a plain string that is not at the beginning of the data, and the inability to look for response contents that are provided in a second and subsequent recv() calls. But since some configs are already deployed, it was quite important to ensure a 100% compatible behaviour on the working cases. Thus, that patch fixes the issues while maintaining config compatibility with already deployed versions. (cherry picked from commit b507c43a3ce9a8e8e4b770e52e4edc20cba4c37f)	2010-10-30 19:04:31 +02:00
Gabor Lekeny	b4c81e4c81	[MINOR] checks: add support for LDAPv3 health checks This patch provides a new "option ldap-check" statement to enable server health checks based on LDAPv3 bind requests. (cherry picked from commit b76b44c6fed8a7ba6f0f565dd72a9cb77aaeca7c)	2010-10-30 19:04:31 +02:00
Willy Tarreau	74b08c9ab7	[MEDIUM] buffers: rework the functions to exchange between SI and buffers There was no consistency between all the functions used to exchange data between a buffer and a stream interface. Also, the functions used to send data to a buffer did not consider the possibility that the buffer was shutdown for read. Now the functions are called buffer_{put,get}_{char,block,chunk,string}. The old buffer_feed* functions have been left available for existing code but marked deprecated.	2010-09-08 17:04:31 +02:00
Willy Tarreau	af7ad00a99	[MINOR] support a global jobs counter This counter is incremented for each incoming connection and each active listener, and is used to prevent haproxy from stopping upon SIGUSR1. It will thus be possible for some tasks in increment this counter in order to prevent haproxy from dying until they have completed their job.	2010-08-31 15:39:26 +02:00
Willy Tarreau	d0807c3c60	[MEDIUM] signals: support redistribution of signal zero when stopping Signal zero is never delivered by the system. However having a signal to which functions and tasks can subscribe to be notified of a stopping event is useful. So this patch does two things : 1) allow signal zero to be delivered from any function of signal handler 2) make soft_stop() deliver this signal so that tasks can be notified of a stopping condition.	2010-08-27 18:26:11 +02:00
Willy Tarreau	24f4efa670	[MEDIUM] signals: add support for registering functions and tasks The two new functions below make it possible to register any number of functions or tasks to a system signal. They will be called in the registration order when the signal is received. struct sig_handler signal_register_fct(int sig, void (fct)(struct sig_handler ), int arg); struct sig_handler signal_register_task(int sig, struct task *task, int reason);	2010-08-27 18:00:40 +02:00
Willy Tarreau	08c4b79f9a	[CLEANUP] reference product branch 1.5 It still appeared as 1.4 on the stats links.	2010-08-27 11:09:17 +02:00
Willy Tarreau	bb545b4cfc	[MINOR] startup: don't wait for nothing when no old pid remains In case of binding failure during startup, we wait for some time sending signals to old pids so that they release the ports we need. But if there aren't any old pids anymore, it's useless to wait, we prefer to fail fast. Along with this change, we now have the number of old pids really found in the nb_oldpids variable.	2010-08-25 12:58:59 +02:00
Willy Tarreau	0a4838cd31	[MEDIUM] session-counters: correctly unbind the counters tracked by the backend In case of HTTP keepalive processing, we want to release the counters tracked by the backend. Till now only the second set of counters was released, while it could have been assigned by the frontend, or the backend could also have assigned the first set. Now we reuse to unused bits of the session flags to mark which stick counters were assigned by the backend and to release them as appropriate.	2010-08-10 18:04:16 +02:00
Willy Tarreau	56123282ef	[MINOR] session-counters: use "track-sc{1,2}" instead of "track-{fe,be}-counters" The assumption that there was a 1:1 relation between tracked counters and the frontend/backend role was wrong. It is perfectly possible to track the track-fe-counters from the backend and the track-be-counters from the frontend. Thus, in order to reduce confusion, let's remove this useless {fe,be} reference and simply use {1,2} instead. The keywords have also been renamed in order to limit confusion. The ACL rule action now becomes "track-sc{1,2}". The ACLs are now "sc{1,2}_" instead of "trk{fe,be}_". That means that we can reasonably document "sc1" and "sc2" (sticky counters 1 and 2) as sort of patterns that are available during the whole session's life and use them just like any other pattern.	2010-08-10 18:04:15 +02:00
Willy Tarreau	f6efda1189	[MEDIUM] session counters: automatically remove expired entries. When a ref_cnt goes down to zero and the entry is expired, remove it.	2010-08-10 18:04:15 +02:00
Willy Tarreau	f059a0f63a	[MAJOR] session-counters: split FE and BE track counters Having a single tracking pointer for both frontend and backend counters does not work. Instead let's have one for each. The keyword has changed to "track-be-counters" and "track-fe-counters", and the ACL "trk_" changed to "trkfe_" and "trkbe_*".	2010-08-10 18:04:15 +02:00
Willy Tarreau	4f3f01fa39	[MEDIUM] stats: add the ability to dump table entries matching criteria It is now possible to dump some select table entries based on criteria which apply to the stored data. This is enabled by appending the following options to the end of the "show table" statement : data.<data_type> {eq\|ne\|lt\|gt\|le\|ge} <value> For intance : show table http_proxy data.conn_rate gt 5 show table http_proxy data.gpc0 ne 0 The compare applies to the integer value as it would be displayed, and operates on signed long long integers.	2010-08-10 18:04:14 +02:00
Willy Tarreau	3b9c6e053e	[MEDIUM] stick-table: make use of generic types for stored data It's a bit cumbersome to have to know all possible storable types from the stats interface. Instead, let's have generic types for all data, which will facilitate their manipulation.	2010-08-10 18:04:14 +02:00
Willy Tarreau	795e4a93dd	[CLEANUP] stick-table: declare stktable_data_types as extern It's fortunate we did not get hit by this stupid declaration !	2010-08-10 18:04:14 +02:00
Willy Tarreau	69f58c8058	[MEDIUM] stats: add "show table [<name>]" to dump a stick-table It is now possible to dump a table's contents with keys, expire, use count, and various data using the command above on the stats socket. "show table" only shows main table stats, while "show table <name>" dumps table contents, only if the socket level is admin.	2010-08-10 18:04:14 +02:00
Willy Tarreau	da7ff64aa9	[MEDIUM] session-counters: add HTTP req/err tracking This patch adds support for the following session counters : - http_req_cnt : HTTP request count - http_req_rate: HTTP request rate - http_err_cnt : HTTP request error count - http_err_rate: HTTP request error rate The equivalent ACLs have been added to check the tracked counters for the current session or the counters of the current source.	2010-08-10 18:04:14 +02:00
Willy Tarreau	c3bd972cda	[MINOR] session-counters: add a general purpose counter (gpc0) This counter may be used to track anything. Two sets of ACLs are available to manage it, one gets its value, and the other one increments its value and returns it. In the second case, the entry is created if it did not exist. Thus it is possible for example to mark a source as being an abuser and to keep it marked as long as it does not wait for the entry to expire : # The rules below use gpc0 to track abusers, and reject them if # a source has been marked as such. The track-counters statement # automatically refreshes the entry which will not expire until a # 1-minute silence is respected from the source. The second rule # evaluates the second part if the first one is true, so GPC0 will # be increased once the conn_rate is above 100/5s. stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request track-counters src tcp-request reject if { trk_get_gpc0 gt 0 } tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} Alternatively, it is possible to let the entry expire even in presence of traffic by swapping the check for gpc0 and the track-counters statement : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} It is also possible not to track counters at all, but entry lookups will then be performed more often : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request reject if { src_conn_rate gt 100 } { src_inc_gpc0 gt 0} The '0' at the end of the counter name is there because if we find that more counters may be useful, other ones will be added.	2010-08-10 18:04:14 +02:00
Willy Tarreau	1f7e925d6a	[MINOR] stktable: add a stktable_update_key() function This function looks up a key, updates its expiration date, or creates it if it was not found. acl_fetch_src_updt_conn_cnt() was updated to make use of it.	2010-08-10 18:04:14 +02:00
Willy Tarreau	6c59e0a942	[MEDIUM] session counters: add bytes_in_rate and bytes_out_rate counters These counters maintain incoming and outgoing byte rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain bandwidth limits per source address for instance, and block if the rate is passed over. Since 32-bit counters are used to compute the rates, it is important not to use too long periods so that we don't have to deal with rates above 4 GB per period. Example : # block if more than 5 Megs retrieved in 30 seconds from a source. stick-table type ip size 200k expire 1m store bytes_out_rate(30s) tcp-request track-counters src tcp-request reject if { trk_bytes_out_rate gt 5000000 } # cause a 15 seconds pause to requests from sources in excess of 2 megs/30s tcp-request inspect-delay 15s tcp-request content accept if { trk_bytes_out_rate gt 2000000 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	91c43d7fe4	[MEDIUM] session counters: add conn_rate and sess_rate counters These counters maintain incoming connection rates and session rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain accept rate per source address for instance, and block if the rate is passed over. Example : # block if more than 50 requests per 5 seconds from a source. stick-table type ip size 200k expire 1m store conn_rate(5s),sess_rate(5s) tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 50 } # cause a 3 seconds pause to requests from sources in excess of 20 requests/5s tcp-request inspect-delay 3s tcp-request content accept if { trk_sess_rate gt 20 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	ac78288eaf	[MEDIUM] stick-tables: add stored data argument type checking We're now able to return errors based on the validity of an argument passed to a stick-table store data type. We also support ARG_T_DELAY to pass delays to stored data types (eg: for rate counters).	2010-08-10 18:04:13 +02:00
Willy Tarreau	888617dc3b	[MEDIUM] stick-tables: add support for arguments to data_types Some data types will require arguments (eg: period for a rate counter). This patch adds support for such arguments between parenthesis in the "store" directive of the stick-table statement. Right now only integers are supported.	2010-08-10 18:04:13 +02:00
Willy Tarreau	f4d17d9071	[MEDIUM] session: add a counter on the cumulated number of sessions Sessions are like connections but they have been accepted by L4 rules and really became sessions.	2010-08-10 18:04:13 +02:00
Willy Tarreau	e348793696	[MEDIUM] session-counters: automatically update tracked connection count When a session tracks a counter, automatically increase the cumulated connection count. This makes src_updt_conn_cnt() almost useless. In fact it might still be used to update different tables.	2010-08-10 18:04:12 +02:00
Willy Tarreau	855e4bbcc7	[MEDIUM] session: add data in and out volume counters The new "bytes_in_cnt" and "bytes_out_cnt" session counters have been added. They're automatically updated when session counters are updated. They can be matched with the "src_kbytes_in" and "src_kbytes_out" ACLs which apply to the volume per source address. This can be used to deny access to service abusers.	2010-08-10 18:04:12 +02:00
Willy Tarreau	38285c18f4	[MEDIUM] session: add concurrent connections counter The new "conn_cur" session counter has been added. It is automatically updated upon "track XXX" directives, and the entry is touched at the moment we increment the value so that we don't consider further counter updates as real updates, otherwise we would end up updating upon completion, which may not be desired. Probably that some other event counters (eg: HTTP requests) will have to be updated upon each event though. This counter can be matched against current session's source address using the "src_conn_cur" ACL.	2010-08-10 18:04:12 +02:00
Willy Tarreau	8fb12c4b61	[MINOR] stick-table: use suffix "_cnt" for cumulated counts The "_cnt" suffix is already used by ACLs to count various data, so it makes sense to use the same one in "conn_cnt" instead of "conn_cum" to count cumulated connections. This is not a problem because no version was emitted with those keywords. Thus we'll try to stick to the following rules : xxxx_cnt : cumulated event count for criterion xxxx xxxx_cur : current number of concurrent entries for criterion xxxx xxxx_rate: event rate for criterion xxxx	2010-08-10 18:04:12 +02:00
Willy Tarreau	4a0347add0	[MINOR] stick-table: provide a table lookup function We'll often need to lookup a table by its name. This will change in the future once we can resolve these names on startup.	2010-08-10 18:04:12 +02:00
Willy Tarreau	9ba2dcc86c	[MAJOR] session: add track-counters to track counters related to the session This patch adds the ability to set a pointer in the session to an entry in a stick table which holds various counters related to a specific pattern. Right now the syntax matches the target syntax and only the "src" pattern can be specified, to track counters related to the session's IPv4 source address. There is a special function to extract it and convert it to a key. But the goal is to be able to later support as many patterns as for the stick rules, and get rid of the specific function. The "track-counters" directive may only be set in a "tcp-request" statement right now. Only the first one applies. Probably that later we'll support multi-criteria tracking for a single session and that we'll have to name tracking pointers. No counter is updated right now, only the refcount is. Some subsequent patches will have to bring that feature.	2010-08-10 18:04:12 +02:00
Willy Tarreau	591fedc2c3	[MEDIUM] buffer: make buffer_feed* support writing non-contiguous chunks The buffer_feed* functions that are used to send data to buffers did only support sending contiguous chunks while they're relying on memcpy(). This patch improves on this by making them able to write in two chunks if needed. Thus, the buffer_almost_full() function has been improved to really consider the remaining space and not just what can be written at once.	2010-08-10 17:48:57 +02:00
Willy Tarreau	fb35620e87	[MEDIUM] session: support "tcp-request content" rules in backends Sometimes it's necessary to be able to perform some "layer 6" analysis in the backend. TCP request rules were not available till now, although documented in the diagram. Enable them in backend now.	2010-08-10 14:10:58 +02:00
Willy Tarreau	5b18020201	[MINOR] tools: add a get_std_op() function to parse operators We already have several places where we use operators to compare values. Each time the parsing is done again. Let's have a central function for this.	2010-08-10 14:03:25 +02:00
Willy Tarreau	f25fbad356	[MINOR] errors: provide new status codes for config parsing functions Some config parsing functions need to return composite status codes when they rely on other functions. Let's provide a few such codes for general use and extend them later.	2010-08-10 14:01:15 +02:00
Willy Tarreau	2970b0bedf	[MINOR] freq_ctr: add new types and functions for periods different from 1s Some freq counters will have to work on periods different from 1 second. The original freq counters rely on the period to be exactly one second. The new ones (freq_ctr_period) let the user define the period in ticks, and all computations are operated over that period. When reading a value, it indicates the amount of events over that period too.	2010-08-10 14:01:09 +02:00
Willy Tarreau	f0d9eecc52	[MINOR] tools: add a fast div64_32 function We'll need to divide 64 bits by 32 bits with new frequency counters. Gcc does not know when it can safely do that, but the way we build our operations let us be sure. So let's provide an optimised version for that purpose.	2010-08-10 13:59:56 +02:00
Willy Tarreau	258a14b7d7	[MINOR] proxy: add a "parent" member to the structure This member will be used later when frontends are created on the fly by some tasks. It will also be usable later if we need to support multiple config instances for example.	2010-07-13 16:24:48 +02:00
Willy Tarreau	0bd05eaf24	[MEDIUM] stream-interface: add a ->release callback When a connection is closed on a stream interface, some iohandlers will need to be informed in order to release some resources. This normally happens upon a shutr+shutw. It is the equivalent of the fd_delete() call which is done for real sockets, except that this time we release internal resources. It can also be used with real sockets because it does not cost anything else and might one day be useful.	2010-07-13 16:06:23 +02:00
Willy Tarreau	acf9577350	[MINOR] config: provide a function to quote args in a more friendly way The quote_arg() function can be used to quote an argument or indicate "end of line" if it's null or empty. It should be useful to more precisely report location of problems in the configuration.	2010-06-14 19:09:21 +02:00
Willy Tarreau	5214be1b22	[MINOR] session: add a pointer to the tracked counters for the source We'll have to keep counters of various criteria specific to the session's source. When we get one, keep a pointer to it in the session.	2010-06-14 15:32:18 +02:00
Willy Tarreau	e7f3d7ab9f	[MEDIUM] stick-tables: add a reference counter to each entry We'll soon have to maintain links from sessions to entries, so let's add a refcount in entries to avoid purging them if it's not null.	2010-06-14 15:10:26 +02:00
Willy Tarreau	cb18364ca7	[MEDIUM] stick_table: separate storage and update of session entries When an entry already exists, we just need to update its expiration timer. Let's have a dedicated function for that instead of spreading open code everywhere. This change also ensures that an update of an existing sticky session really leads to an update of its expiration timer, which was apparently not the case till now. This point needs to be checked in 1.4.	2010-06-14 15:10:26 +02:00
Willy Tarreau	41883e2041	[MINOR] stick_table: export the stick_table_key This one is huge and will be needed by other portions of code for various data lookups. Let's not have them allocate it in the stack.	2010-06-14 15:10:25 +02:00
Willy Tarreau	13c29dee21	[MEDIUM] stick_table: move the server ID to a generic data type The server ID is now stored just as any other data type. It is only allocated if needed and is manipulated just like the other ones.	2010-06-14 15:10:25 +02:00
Willy Tarreau	68129b90eb	[MINOR] stick_table: provide functions to return stksess data from a type This function does the indirection job in the table to find the pointer to the real data matching the requested type.	2010-06-14 15:10:25 +02:00
Willy Tarreau	f16d2b8c1b	[MEDIUM] stick_table: don't overwrite data when storing an entry Till now sticky sessions only held server IDs. Now there are other data types so it is not acceptable anymore to overwrite the server ID when writing something. The server ID must then only be written from the caller when appropriate. Doing this has also led to separate lookup and storage.	2010-06-14 15:10:24 +02:00
Willy Tarreau	69b870f862	[MINOR] stick_table: add support for "conn_cum" data type. This one can be parsed on the "stick-table" after with the "store" keyword. It will hold the number of connections matching the entry, for use with ACLs or anything else.	2010-06-14 15:10:24 +02:00
Willy Tarreau	08d5f98294	[MEDIUM] stick_table: add room for extra data types The stick_tables will now be able to store extra data for a same key. A limited set of extra data types will be defined and for each of them an offset in the sticky session will be assigned at startup time. All of this information will be stored in the stick table. The extra data types will have to be specified after the new "store" keyword of the "stick-table" directive, which will reserve some space for them.	2010-06-14 15:10:24 +02:00
Willy Tarreau	f0b38bfc33	[CLEANUP] stick_table: move pattern to key functions to stick_table.c pattern.c depended on stick_table while in fact it should be the opposite. So we move from pattern.c everything related to stick_tables and invert the dependency. That way the code becomes more logical and intuitive.	2010-06-14 15:10:24 +02:00
Willy Tarreau	86257dc116	[CLEANUP] stick_table: rename some stksess struct members to avoid confusion The name 'exps' and 'keys' in struct stksess was confusing because it was the same name as in the table which holds all of them, while they only hold one node each. Remove the trailing 's' to more clearly identify who's who.	2010-06-14 15:10:23 +02:00
Willy Tarreau	393379c3e0	[MINOR] stick_table: add support for variable-sized data Right now we're only able to store a server ID in a sticky session. The goal is to be able to store anything whose size is known at startup time. For this, we store the extra data before the stksess pointer, using a negative offset. It will then be easy to cumulate multiple data provided they each have their own offset.	2010-06-14 15:10:23 +02:00
Willy Tarreau	f8f33284bd	[BUILD] memory: add a few missing parenthesis to the pool management macros These missing ones caused a build error when a macro was called with operations as the argument.	2010-06-14 15:10:23 +02:00
Willy Tarreau	aea940eb23	[CLEANUP] stick_table: add/clarify some comments	2010-06-14 15:10:23 +02:00
Willy Tarreau	2799e98a36	[MINOR] frontend: count denied TCP requests separately It's very disturbing to see the "denied req" counter increase without any other session counter moving. In fact, we can't count a rejected TCP connection as "denied req" as we have not yet instanciated any session at all. Let's use a new counter for that.	2010-06-14 10:53:20 +02:00
Willy Tarreau	b36b4244a2	[MINOR] session: differenciate between accepted connections and received connections Now we're able to reject connections very early, so we need to use a different counter for the connections that are received and the ones that are accepted and converted into sessions, so that the rate limits can still apply to the accepted ones. The session rate must still be used to compute the rate limit, so that we can reject undesired traffic without affecting the rate.	2010-06-14 10:53:19 +02:00
Willy Tarreau	1d315eaa24	[MINOR] buffer: refine the flags that may wake an analyser up. Analysers don't care (and must not care) about a few flags such as BF_AUTO_CLOSE or BF_AUTO_CONNECT, so those flags should not be listed in the BF_MASK_STATIC bitmask. We should also recheck if some buffer flags should be ignored or not in process_session() when deciding if we must loop again or not.	2010-06-14 10:53:18 +02:00
Willy Tarreau	decd14d298	[MEDIUM] stats: rely on the standard session_accept() function The stats' accept() function is now ridiculously small. It could even be reduced by moving some parts to the common accept code.	2010-06-14 10:53:18 +02:00
Willy Tarreau	81f9aa3bf2	[MAJOR] frontend: split accept() into frontend_accept() and session_accept() A new function session_accept() is now called from the lower layer to instanciate a new session. Once the session is instanciated, the upper layer's frontent_accept() is called. This one can be service-dependant. That way, we have a 3-phase accept() sequence : 1) protocol-specific, session-less accept(), which is pointed to by the listener. It defaults to the generic stream_sock_accept(). 2) session_accept() which relies on a frontend but not necessarily for use in a proxy (eg: stats or any future service). 3) frontend_accept() which performs the accept for the service offerred by the frontend. It defaults to frontend_accept() which is really what is used by a proxy. The TCP/HTTP proxies have been moved to this mode so that we can now rely on frontend_accept() for any type of session initialization relying on a frontend. The next step will be to convert the stats to use the same system for the stats.	2010-06-14 10:53:17 +02:00
Willy Tarreau	f229eb8f8f	[MINOR] proxy: add an accept() callback for the application layer This will be used by the application layer for setting accept callbacks.	2010-06-14 10:53:17 +02:00
Willy Tarreau	ee28de0a12	[MEDIUM] session: move the conn_retries attribute to the stream interface The conn_retries still lies in the session and its initialization depends on the backend when it may not yet be known. Let's first move it to the stream interface.	2010-06-14 10:53:16 +02:00
Willy Tarreau	a8f55d5473	[MEDIUM] backend: initialize the server stream_interface upon connect() It's not normal to initialize the server-side stream interface from the accept() function, because it may change later. Thus, we introduce a new stream_sock_prepare_interface() function which is called just before the connect() and which sets all of the stream_interface's callbacks to the default ones used for real sockets. The ->connect function is also set at the same instant so that we can easily add new server-side protocols soon.	2010-06-14 10:53:15 +02:00
Willy Tarreau	ace495e468	[CLEANUP] buffer->cto is not used anymore The connection timeout stored in the buffer has not been used since the stream interface were introduced. Let's get rid of it as it's one of the things that complicate factoring of the accept() functions.	2010-06-14 10:53:14 +02:00
Willy Tarreau	de3041d443	[MINOR] frontend: only check for monitor-net rules if LI_O_CHK_MONNET is set We can disable the monitor-net rules on a listener if this flag is not set in the listener's options. This will be useful when we don't want to check that fe->addr is set or not for non-TCP frontends.	2010-06-14 10:53:13 +02:00
Willy Tarreau	a5c0ab200b	[MEDIUM] frontend: check for LI_O_TCP_RULES in the listener The new LI_O_TCP_RULES listener option indicates that some TCP rules must be checked upon accept on this listener. It is now checked by the frontend and the L4 rules are evaluated only in this case. The flag is only set when at least one tcp-req rule is present in the frontend. The L4 rules check function has now been moved to proto_tcp.c where it ought to be.	2010-06-14 10:53:13 +02:00
Willy Tarreau	ab786194f0	[MINOR] proxy: add a list to hold future layer 4 rules This list will be evaluated right after the accept() call.	2010-06-14 10:53:11 +02:00
Willy Tarreau	eb472685cb	[MEDIUM] separate protocol-level accept() from the frontend's For a long time we had two large accept() functions, one for TCP sockets instanciating proxies, and another one for UNIX sockets instanciating the stats interface. A lot of code was duplicated and both did not work exactly the same way. Now we have a stream_sock layer accept() called for either TCP or UNIX sockets, and this function calls the frontend-specific accept() function which does the rest of the frontend-specific initialisation. Some code is still duplicated (session & task allocation, stream interface initialization), and might benefit from having an intermediate session-level accept() callback to perform such initializations. Still there are some minor differences that need to be addressed first. For instance, the monitor nets should only be checked for proxies and not for other connection templates. Last, we renamed l->private as l->frontend. The "private" pointer in the listener is only used to store a frontend, so let's rename it to eliminate this ambiguity. When we later support detached listeners (eg: FTP), we'll add another field to avoid the confusion.	2010-06-14 10:53:11 +02:00
Willy Tarreau	03fa5df64a	[CLEANUP] rename client -> frontend The 'client.c' file now only contained frontend-specific functions, so it has naturally be renamed 'frontend.c'. Same for client.h. This has also been an opportunity to remove some cross references from files that should not have depended on it. In the end, this file should contain a protocol-agnostic accept() code, which would initialize a session, task, etc... based on an accept() from a lower layer. Right now there are still references to TCP.	2010-06-14 10:53:10 +02:00
Willy Tarreau	44b90cc4d8	[CLEANUP] tcp: move some non tcp-specific layer6 processing out of proto_tcp Some functions which act on generic buffer contents without being tcp-specific were historically in proto_tcp.c. This concerns ACLs and RDP cookies. Those have been moved away to more appropriate locations. Ideally we should create some new files for each layer6 protocol parser. Let's do that later.	2010-06-14 10:53:09 +02:00
Willy Tarreau	06457871a4	[CLEANUP] acl: use 'L6' instead of 'L4' in ACL flags relying on contents Just like we do on health checks, we should consider that ACLs that make use of buffer data are layer 6 and not layer 4, because we'll soon have to distinguish between pure layer 4 ACLs (without any buffer) and these ones.	2010-06-14 10:53:09 +02:00
Willy Tarreau	0b1cd94c8b	[MINOR] acl: add srv_is_up() to check that a specific server is up or not This ACL was missing in complex setups where the status of a remote site has to be considered in switching decisions. Until there, using a server's status in an ACL required to have a dedicated backend, which is a bit heavy when multiple servers have to be monitored.	2010-05-16 22:18:27 +02:00
Willy Tarreau	e56cda9a6a	[MEDIUM] acl: add ability to insert patterns in trees The code is now ready to support loading pattern from filesinto trees. For that, it will be required that the ACL keyword has a flag ACL_MAY_LOOKUP and that the expr is case sensitive. When that is true, the pattern will have a flag ACL_PAT_F_TREE_OK to indicate that it is possible to feed the tree instead of a usual pattern if the parsing function is able to do this. The tree's root is pre-initialized in the pattern's value so that the function can easily find it. At that point, if the parsing function decides to use the tree, it just sets ACL_PAT_F_TREE in the return flags so that the caller knows the tree has been used and the pattern can be recycled. That way it will be possible to load some patterns into the tree when it is compatible, and other ones as linear linked lists. A good example of this might be IPv4 network entries : right now we support holes in masks, but this very rare feature is not compatible with binary lookup in trees. So the parser will be able to decide itself whether the pattern can go to the tree or not.	2010-05-13 21:37:41 +02:00
Willy Tarreau	438b0f3f0f	[MINOR] acl trees: add flags and union members to store values in trees If we want to be able to match ACLs against a lot of possible values, we need to put those values in trees. That will only work for exact matches, which is normally just what is needed. Right now, only IPv4 and string matching are planned, but others might come later.	2010-05-12 16:59:59 +02:00
Cyril Bont�	47fdd8e993	[MINOR] add the "ignore-persist" option to conditionally ignore persistence This is used to disable persistence depending on some conditions (for example using an ACL matching static files or a specific User-Agent). You can see it as a complement to "force-persist". In the configuration file, the force-persist/ignore-persist declaration order define the rules priority. Used with the "appsesion" keyword, it can also help reducing memory usage, as the session won't be hashed the persistence is ignored.	2010-04-25 22:37:14 +02:00
Willy Tarreau	8a8e1d99cb	[MINOR] http: make it possible to pretend keep-alive when doing close Some servers do not completely conform with RFC2616 requirements for keep-alive when they receive a request with "Connection: close". More specifically, they don't bother using chunked encoding, so the client never knows whether the response is complete or not. One immediately visible effect is that haproxy cannot maintain client connections alive. The second issue is that truncated responses may be cached on clients in case of network error or timeout. �scar Fr�as Barranco reported this issue on Tomcat 6.0.20, and Patrik Nilsson with Jetty 6.1.21. Cyril Bont� proposed this smart idea of pretending we run keep-alive with the server and closing it at the last moment as is already done with option forceclose. The advantage is that we only change one emitted header but not the overall behaviour. Since some servers such as nginx are able to close the connection very quickly and save network packets when they're aware of the close negociation in advance, we don't enable this behaviour by default. "option http-pretend-keepalive" will have to be used for that, in conjunction with "option http-server-close".	2010-04-05 16:26:34 +02:00
Willy Tarreau	bce7088275	[MEDIUM] add ability to connect to a server from an IP found in a header Using get_ip_from_hdr2() we can look for occurrence #X or #-X and extract the IP it contains. This is typically designed for use with the X-Forwarded-For header. Using "usesrc hdr_ip(name,occ)", it becomes possible to use the IP address found in <name>, and possibly specify occurrence number <occ>, as the source to connect to a server. This is possible both in a server and in a backend's source statement. This is typically used to use the source IP previously set by a upstream proxy.	2010-03-30 10:39:43 +02:00
Willy Tarreau	090466c91a	[MINOR] add new tproxy flags for dynamic source address binding This patch adds a new TPROXY bind type, TPROXY_DYN, to indicate to the TCP connect function that we want to bind to the address passed in argument.	2010-03-30 09:59:44 +02:00
Willy Tarreau	d54bbdce87	[MINOR] add very fast IP parsing functions Those functions were previouly used in my firewall log parser, and are particularly suited for use with http headers.	2010-03-30 09:59:44 +02:00
Willy Tarreau	b1d67749db	[MEDIUM] backend: move the transparent proxy address selection to backend The transparent proxy address selection was set in the TCP connect function which is not the most appropriate place since this function has limited access to the amount of parameters which could produce a source address. Instead, now we determine the source address in backend.c:connect_server(), right after calling assign_server_address() and we assign this address in the session and pass it to the TCP connect function. This cannot be performed in assign_server_address() itself because in some cases (transparent mode, dispatch mode or http_proxy mode), we assign the address somewhere else. This change will open the ability to bind to addresses extracted from many other criteria (eg: from a header).	2010-03-30 09:59:43 +02:00
Willy Tarreau	07a5490881	[CLEANUP] proxy: move PR_O_SSL3_CHK to options2 to release one flag We'll need another flag in the 'options' member close to PR_O_TPXY_*, and all are used, so let's move this easy one to options2 (which are already used for SQL checks).	2010-03-30 09:59:43 +02:00
Willy Tarreau	e45997661b	[MEDIUM] session: better fix for connection to servers with closed input The following patch fixed an issue but brought another one : 296897 [MEDIUM] connect to servers even when the input has already been closed The new issue is that when a connection is inspected and aborted using TCP inspect rules, now it is sent to the server before being closed. So that test is not satisfying. A probably better way is not to prevent a connection from establishing if only BF_SHUTW_NOW is set but BF_SHUTW is not. That way, the BF_SHUTW flag is not set if the request has any data pending, which still fixes the stats issue, but does not let any empty connection pass through. Also, as a safety measure, we extend buffer_abort() to automatically disable the BF_AUTO_CONNECT flag. While it appears to always be OK, it is by pure luck, so better safe than sorry.	2010-03-21 23:31:42 +01:00
Nick Chalk	57b1bf7785	[MEDIUM] checks: support multi-packet health check responses We are seeing both real servers repeatedly going on- and off-line with a period of tens of seconds. Packet tracing, stracing, and adding debug code to HAProxy itself has revealed that the real servers are always responding correctly, but HAProxy is sometimes receiving only part of the response. It appears that the real servers are sending the test page as three separate packets. HAProxy receives the contents of one, two, or three packets, apparently randomly. Naturally, the health check only succeeds when all three packets' data are seen by HAProxy. If HAProxy and the real servers are modified to use a plain HTML page for the health check, the response is in the form of a single packet and the checks do not fail. (...) I've added buffer and length variables to struct server, and allocated space with the rest of the server initialisation. (...) It seems to be working fine in my tests, and handles check responses that are bigger than the buffer.	2010-03-16 22:57:26 +01:00
Cyril Bont�	8f27090b84	[CLEANUP] product branch update today I've noticed that the stats page still displays v1.3 in the "Updates" link, due to the PRODUCT_BRANCH value in version.h, then it's maybe time to send you the result (notice that the patch updates PRODUCT_BRANCH to "1.4"). -- Cyril Bont�	2010-03-12 06:45:26 +01:00
Willy Tarreau	66dc20a17b	[MINOR] stats socket: add show sess <id> to dump details about a session When trying to spot some complex bugs, it's often needed to access information on stuck sessions, which is quite difficult. This new command helps one get detailed information about a session, with flags, timers, states, etc... The buffer data are not dumped yet.	2010-03-05 17:58:04 +01:00
Willy Tarreau	ae52678444	[STATS] count transfer aborts caused by client and by server Often we need to understand why some transfers were aborted or what constitutes server response errors. With those two counters, it is now possible to detect an unexpected transfer abort during a data phase (eg: too short HTTP response), and to know what part of the server response errors may in fact be assigned to aborted transfers.	2010-03-04 20:34:23 +01:00
Willy Tarreau	0e996c681f	[BUILD] includes order breaks OpenBSD build Jeff Buchbinder reported that OpenBSD build broke on compat.h, and that this patch fixes the issue.	2010-02-26 22:00:19 +01:00
Willy Tarreau	8096de9a99	[MEDIUM] http: revert to use a swap buffer for realignment The bounce realign function was algorithmically good but as expected it was not cache-friendly. Using it with large requests caused so many cache thrashing that the function itself could drain 70% of the total CPU time for only 0.5% of the calls ! Revert back to a standard memcpy() using a specially allocated swap buffer. We're now back to 2M req/s on pipelined requests.	2010-02-26 11:12:27 +01:00
Willy Tarreau	2465779459	[STATS] separate frontend and backend HTTP stats It is wrong to merge FE and BE stats for a proxy because when we consult a BE's stats, it reflects the FE's stats eventhough the BE has received no traffic. The most common example happens with listen instances, where the backend gets credited for all the trafic even when a use_backend rule makes use of another backend.	2010-02-26 10:30:28 +01:00
Willy Tarreau	d9b587f260	[STATS] report HTTP requests (total and rate) in frontends Now that we support keep-alive, it's important to report a separate counter for requests. Right now it just appears in the CSV output.	2010-02-26 10:05:55 +01:00
Willy Tarreau	b97f199d4b	[MEDIUM] http: don't use trash to realign large buffers The trash buffer may now be smaller than a buffer because we can tune it at run time. This causes a risk when we're trying to use it as a temporary buffer to realign unaligned requests, because we may have to put up to a full buffer into it. Instead of doing a double copy, we're now relying on an open-coded bouncing copy algorithm. The principle is that we move one byte at a time to its final place, and if that place also holds a byte, then we move it too, and so on. We finish when we've moved all the buffer. It limits the number of memory accesses, but since it proceeds one byte at a time and with random walk, it's not cache friendly and should be slower than a double copy. However, it's only used in extreme situations and the difference will not be noticeable. It has been extensively tested and works reliably.	2010-02-25 23:54:31 +01:00
Krzysztof Piotr Oledzki	329f74d463	[BUG] uri_auth: do not attemp to convert uri_auth -> http-request more than once Bug reported by Laurent Dolosor.	2010-02-23 12:36:10 +01:00
Krzysztof Piotr Oledzki	15f0ac4829	[BUG] uri_auth: ST_SHLGNDS should be 0x00000008 not 0x0000008	2010-02-23 12:36:10 +01:00
Willy Tarreau	b4c06b7be6	[BUILD] auth: don't use unnamed unions unnamed unions are not compatible with older compilers (eg: gcc 2.95) so name it "u" instead.	2010-02-02 11:28:20 +01:00
Willy Tarreau	9cc670f7d9	[CLEANUP] config: use build_acl_cond() to simplify http-request ACL parsing Now that we have this new function to make your life better, use it.	2010-02-01 10:43:44 +01:00
Cyril Bont�	cd19e51b05	[MEDIUM] add a maintenance mode to servers This is a first attempt to add a maintenance mode on servers, using the stat socket (in admin level). It can be done with the following command : - disable server <backend>/<server> - enable server <backend>/<server> In this mode, no more checks will be performed on the server and it will be marked as a special DOWN state (MAINT). If some servers were tracking it, they'll go DOWN until the server leaves the maintenance mode. The stats page and the CSV export also display this special state. This can be used to disable the server in haproxy before doing some operations on this server itself. This is a good complement to the "http-check disable-on-404" keyword and works in TCP mode.	2010-01-31 23:33:18 +01:00
Krzysztof Piotr Oledzki	8c8bd4593c	[MAJOR] use the new auth framework for http stats Support the new syntax (http-request allow/deny/auth) in http stats. Now it is possible to use the same syntax is the same like in the frontend/backend http-request access control: acl src_nagios src 192.168.66.66 acl stats_auth_ok http_auth(L1) stats http-request allow if src_nagios stats http-request allow if stats_auth_ok stats http-request auth realm LB The old syntax is still supported, but now it is emulated via private acls and an aditional userlist.	2010-01-31 19:14:09 +01:00
Krzysztof Piotr Oledzki	f9423ae43a	[MINOR] acl: add http_auth and http_auth_group Add two acls to match http auth data: acl <name> http_auth(userlist) acl <name> http_auth_hroup(userlist) group1 group2 (...)	2010-01-31 19:14:09 +01:00
Krzysztof Piotr Oledzki	59bb218b86	[MINOR] http-request: allow/deny/auth support for frontend/backend/listen Use the generic auth framework to control access to frontends/backends/listens	2010-01-31 19:14:08 +01:00
Krzysztof Piotr Oledzki	d7528e5a60	[MINOR] add ACL_TEST_F_NULL_MATCH Add ACL_TEST_F_NULL_MATCH so expr->kw->match can be called even if expr->patterns is empty.	2010-01-31 19:14:07 +01:00
Krzysztof Piotr Oledzki	961050465e	[MINOR] generic auth support with groups and encrypted passwords Add generic authentication & authorization support. Groups are implemented as bitmaps so the count is limited to sizeof(int)*8 == 32. Encrypted passwords are supported with libcrypt and crypt(3), so it is possible to use any method supported by your system. For example modern Linux/glibc instalations support MD5/SHA-256/SHA-512 and of course classic, DES-based encryption.	2010-01-31 19:14:07 +01:00
Krzysztof Piotr Oledzki	fccbdc8421	[MINOR] Base64 decode Implement Base64 decoding with a reverse table. The function accepts and decodes classic base64 strings, which can be composed from many streams as long each one is properly padded, for example: SGVsbG8=IEhBUHJveHk=IQ==	2010-01-31 19:14:07 +01:00
Willy Tarreau	fdb563c06f	[MEDIUM] http: add support for conditional response header rewriting Just as for the req* rules, we can now condition rsp* rules with ACLs. ACLs match on response, so volatile request information cannot be used. A warning is emitted if a configuration contains such an anomaly.	2010-01-31 15:43:27 +01:00
Willy Tarreau	6c123b15cb	[MEDIUM] http: make the request filter loop check for optional conditions From now on, if request filters have ACLs defined, these ACLs will be evaluated to condition the filter. This will be used to conditionally remove/rewrite headers based on ACLs.	2010-01-28 20:22:06 +01:00
Willy Tarreau	f4f04125d4	[MINOR] prepare req_/rsp_ to receive a condition It will be very handy to be able to pass conditions to req_* and rsp_*. For now, we just add the pointer to the condition in the affected structs.	2010-01-28 18:10:50 +01:00
Willy Tarreau	f1e98b8628	[CLEANUP] config: use warnif_cond_requires_resp() to check for bad ACLs Factor out some repetitive copy-pasted code to check for request ACLs validity.	2010-01-28 17:59:39 +01:00
Willy Tarreau	2bbba415d7	[MINOR] acl: add build_acl_cond() to make it easier to add ACLs in config This function automatically builds a rule, considering the if/unless statements, and automatically updates the proxy's acl_requires, the condition's file and line.	2010-01-28 16:48:33 +01:00
Willy Tarreau	ef78104947	[MINOR] checks: add the server's status in the checks Now a server can check the contents of the header X-Haproxy-Server-State to know how haproxy sees it. The same values as those reported in the stats are provided : - up/down status + check counts - throttle - weight vs backend weight - active sessions vs backend sessions - queue length - haproxy node name	2010-01-27 20:16:12 +01:00
Willy Tarreau	e9d8788fdd	[MINOR] checks: make the HTTP check code add the CRLF itself Currently we cannot easily add headers nor anything to HTTP checks because the requests are pre-formatted with the last CRLF. Make the check code add the CRLF itself so that we can later add useful info.	2010-01-27 20:16:12 +01:00
Willy Tarreau	9e92d327f7	[MINOR] pattern: add support for argument parsers for converters Some converters will need one or several arguments. It's not possible to write a simple generic parser for that, so let's add the ability for each converter to support its own argument parser, and call it to get the arguments when it's specified. If unspecified, the arguments are passed unmodified as string+len.	2010-01-26 18:01:35 +01:00
Willy Tarreau	2937c0dd20	[MINOR] standard: str2mask: string to netmask converter This function converts a dotted or CIDR value to a netmask.	2010-01-26 17:36:17 +01:00
Willy Tarreau	1a51b6342e	[MINOR] pattern: make the converter more flexible by supporting void* and int args The pattern type converters currently support a string arg and a length. Sometimes we'll prefer to pass them a list or a structure. So let's convert the string and length into a generic void* and int that each converter may use as it likes.	2010-01-26 17:17:56 +01:00
Willy Tarreau	88d349d25d	[MEDIUM] http: add support for Proxy-Connection header Despite what is explicitly stated in HTTP specifications, browsers still use the undocumented Proxy-Connection header instead of the Connection header when they connect through a proxy. As such, proxies generally implement support for this stupid header name, breaking the standards and making it harder to support keep-alive between clients and proxies. Thus, we add a new "option http-use-proxy-header" to tell haproxy that if it sees requests which look like proxy requests, it should use the Proxy-Connection header instead of the Connection header.	2010-01-25 12:48:26 +01:00
Willy Tarreau	4de9149f87	[MINOR] add the "force-persist" statement to force persistence on down servers This is used to force access to down servers for some requests. This is useful when validating that a change on a server correctly works before enabling the server again.	2010-01-22 19:10:05 +01:00
Willy Tarreau	e803de2c6b	[MINOR] add the ability to force kernel socket buffer size. Sometimes we need to be able to change the default kernel socket buffer size (recv and send). Four new global settings have been added for this : - tune.rcvbuf.client - tune.rcvbuf.server - tune.sndbuf.client - tune.sndbuf.server Those can be used to reduce kernel memory footprint with large numbers of concurrent connections, and to reduce risks of write timeouts with very slow clients due to excessive kernel buffering.	2010-01-22 11:49:41 +01:00
Willy Tarreau	bbf0b37f6c	[MAJOR] http: rework request Connection header handling We need to improve Connection header handling in the request for it to support the upcoming keep-alive mode. Now we have two flags which keep in the session the information about the presence of a Connection: close and a Connection: keep-alive headers in the initial request, as well as two others which keep the current state of those headers so that we don't have to parse them again. Knowing the initial value is essential to know when the client asked for keep-alive while we're forcing a close (eg in server-close mode). Also the Connection request parser is now able to automatically remove single header values at the same time they are parsed. This provides greater flexibility and reliability. All combinations of listen/front/back in all modes and with both 1.0 and 1.1 have been tested.	2010-01-22 11:49:35 +01:00
Willy Tarreau	348238b3a9	[MINOR] tools: add a "word_match()" function to match words and ignore spaces Some header values might be delimited with spaces, so it's not enough to compare "close" or "keep-alive" with strncasecmp(). Use word_match() for that.	2010-01-18 19:51:39 +01:00
Willy Tarreau	68085d8cfb	[MINOR] http: add http_remove_header2() to remove a header value. Calling this function after http_find_header2() automatically deletes the current value of the header, and removes the header itself if the value is the only one. The context is automatically adjusted for a next call to http_find_header2() to return the next header. No other change nor test should be made on the transient context though.	2010-01-18 19:51:33 +01:00
Willy Tarreau	050737f798	[BUILD] remove a warning in standard.h on AIX	2010-01-14 11:40:12 +01:00
Emeric Brun	1d33b2965e	[MEDIUM] Add stick and store rules analysers.	2010-01-12 16:01:24 +01:00

... 12 13 14 15 16 ...

1923 Commits